Showing 1–49 of 49 results for author: Das, P

  1. arXiv:2506.14801  [pdf, other]

    stat.AP math.OC stat.CO

    GLASD: A Loss-Function-Agnostic Global Optimizer for Robust Correlation Estimation under Data Contamination and Heavy Tails

    Authors: Priyam Das

    Abstract: Robust correlation estimation is essential in high-dimensional settings, particularly when data are contaminated by outliers or exhibit heavy-tailed behavior. Many robust loss functions of practical interest, such as those involving truncation or redescending M-estimators, lead to objective functions that are inherently non-convex and non-differentiable. Traditional methods typically focus on a sing…

    Submitted 2 June, 2025; originally announced June 2025.

  2. arXiv:2505.24006  [pdf, ps, other]

    stat.ME cs.LG stat.ML

    A2 Copula-Driven Spatial Bayesian Neural Network For Modeling Non-Gaussian Dependence: A Simulation Study

    Authors: Agnideep Aich, Sameera Hewage, Md Monzur Murshed, Ashit Baran Aich, Amanda Mayeaux, Asim K. Dey, Kumer P. Das, Bruce Wade

    Abstract: In this paper, we introduce the A2 Copula Spatial Bayesian Neural Network (A2-SBNN), a predictive spatial model designed to map coordinates to continuous fields while capturing both typical spatial patterns and extreme dependencies. By embedding the dual-tail novel Archimedean copula viz. A2 directly into the network's weight initialization, A2-SBNN naturally models complex spatial relationships,…

    Submitted 29 May, 2025; originally announced May 2025.

    MSC Class: 62H12; 62P10; 65C20; 62F15; 68T07

  3. arXiv:2505.21730  [pdf, other]

    stat.ME stat.AP stat.CO stat.ML

    pared: Model selection using multi-objective optimization

    Authors: Priyam Das, Sarah Robinson, Christine B. Peterson

    Abstract: Motivation: Model selection is a ubiquitous challenge in statistics. For penalized models, model selection typically entails tuning hyperparameters to maximize a measure of fit or minimize out-of-sample prediction error. However, these criteria fail to reflect other desirable characteristics, such as model sparsity, interpretability, or smoothness. Results: We present the R package pared to enable…

    Submitted 27 May, 2025; originally announced May 2025.

  4. arXiv:2505.19612  [pdf, ps, other]

    cs.SI stat.ME

    Optimal Intervention for Self-triggering Spatial Networks with Application to Urban Crime Analytics

    Authors: Pramit Das, Moulinath Banerjee, Yuekai Sun

    Abstract: In many network systems, events at one node trigger further activity at other nodes, e.g., social media users reacting to each other's posts or the clustering of criminal activity in urban environments. These systems are typically referred to as self-exciting networks. In such systems, targeted intervention at critical nodes can be an effective strategy for mitigating undesirable consequences such…

    Submitted 26 May, 2025; originally announced May 2025.

  5. arXiv:2503.20807  [pdf, other]

    stat.ML cs.AI cs.CL cs.LG

    Fundamental Safety-Capability Trade-offs in Fine-tuning Large Language Models

    Authors: Pin-Yu Chen, Han Shen, Payel Das, Tianyi Chen

    Abstract: Fine-tuning Large Language Models (LLMs) on some task-specific datasets has been a primary use of LLMs. However, it has been empirically observed that this approach to enhancing capability inevitably compromises safety, a phenomenon also known as the safety-capability trade-off in LLM fine-tuning. This paper presents a theoretical framework for understanding the interplay between safety and capabi…

    Submitted 24 March, 2025; originally announced March 2025.

    Comments: The first two authors contribute equally to this work and are listed in alphabetical order

  6. arXiv:2503.14009  [pdf, other]

    q-bio.QM math.DS nlin.CD stat.ML

    Developing cholera outbreak forecasting through qualitative dynamics: Insights into Malawi case study

    Authors: Adrita Ghosh, Parthasakha Das, Tanujit Chakraborty, Pritha Das, Dibakar Ghosh

    Abstract: Cholera, an acute diarrheal disease, is a serious concern in developing and underdeveloped areas. A qualitative understanding of cholera epidemics aims to foresee transmission patterns based on reported data and mechanistic models. The mechanistic model is a crucial tool for capturing the dynamics of disease transmission and population spread. However, using real-time cholera cases is essential fo…

    Submitted 18 March, 2025; originally announced March 2025.

    Journal ref: Journal of Theoretical Biology, 2025

  7. arXiv:2502.07111  [pdf, other]

    cs.LG stat.AP stat.ME

    Likelihood-Free Estimation for Spatiotemporal Hawkes processes with missing data and application to predictive policing

    Authors: Pramit Das, Moulinath Banerjee, Yuekai Sun

    Abstract: With the growing use of AI technology, many police departments use forecasting software to predict probable crime hotspots and allocate patrolling resources effectively for crime prevention. The clustered nature of crime data makes self-exciting Hawkes processes a popular modeling choice. However, one significant challenge in fitting such models is the inherent missingness in crime data due to non…

    Submitted 10 February, 2025; originally announced February 2025.

  8. arXiv:2412.05998  [pdf, ps, other]

    stat.ME

    B-MASTER: Scalable Bayesian Multivariate Regression for Master Predictor Discovery in Colorectal Cancer Microbiome-Metabolite Profiles

    Authors: Priyam Das, Tanujit Dey, Christine Peterson, Sounak Chakraborty

    Abstract: The gut microbiome significantly influences responses to cancer therapies, including immunotherapies, primarily through its impact on the metabolome. Despite some studies on effects of specific microbial genera on individual metabolites, there is little prior work identifying key microbiome components at the genus level that shape the overall metabolome profile. To address this gap, we introduce B…

    Submitted 12 September, 2025; v1 submitted 8 December, 2024; originally announced December 2024.

  9. arXiv:2412.03596  [pdf, ps, other]

    stat.ME

    SMART-MC: Characterizing the Dynamics of Multiple Sclerosis Therapy Transitions Using a Covariate-Based Markov Model

    Authors: Beomchang Kim, Zongqi Xia, Priyam Das

    Abstract: Treatment switching is a common occurrence in the management of Multiple Sclerosis (MS), where patients transition across various disease-modifying therapies (DMTs) due to heterogeneous treatment responses, differences in disease progression, patient characteristics, and therapy-associated adverse effects. To investigate how patient-level covariates influence the likelihood of treatment transition…

    Submitted 26 August, 2025; v1 submitted 2 December, 2024; originally announced December 2024.

  10. arXiv:2409.04365  [pdf, other]

    stat.ML cs.LG stat.ME

    Leveraging Machine Learning for Official Statistics: A Statistical Manifesto

    Authors: Marco Puts, David Salgado, Piet Daas

    Abstract: It is important for official statistics production to apply ML with statistical rigor, as it presents both opportunities and challenges. Although machine learning has enjoyed rapid technological advances in recent years, its application does not possess the methodological robustness necessary to produce high quality statistical results. In order to account for all sources of error in machine learn…

    Submitted 6 September, 2024; originally announced September 2024.

    Comments: 29 pages, 4 figures, 1 table. To appear in the proceedings of the conference on Foundations and Advances of Machine Learning in Official Statistics, which was held in Wiesbaden, from 3rd to 5th April, 2024

    MSC Class: 62D05; 68T05

    ACM Class: I.2.6; G.3

  11. arXiv:2407.06212  [pdf]

    cs.LG stat.ML

    Bias Correction in Machine Learning-based Classification of Rare Events

    Authors: Luuk Gubbels, Marco Puts, Piet Daas

    Abstract: Online platform businesses can be identified by using web-scraped texts. This is a classification problem that combines elements of natural language processing and rare event detection. Because online platforms are rare, accurately identifying them with Machine Learning algorithms is challenging. Here, we describe the development of a Machine Learning-based text classification approach that reduce…

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: 2 pages, 1 figure, 1 table

  12. arXiv:2304.13794  [pdf, other]

    math.PR stat.AP

    Hölder regularity and roughness: construction and examples

    Authors: Erhan Bayraktar, Purba Das, Donghan Kim

    Abstract: We study how to construct a stochastic process on a finite interval with given `roughness' and finite joint moments of marginal distributions. We first extend Ciesielski's isomorphism along a general sequence of partitions, and provide a characterization of Hölder regularity of a function in terms of its Schauder coefficients. Using this characterization we provide a better (pathwise) estimator of…

    Submitted 6 May, 2024; v1 submitted 26 April, 2023; originally announced April 2023.

    Comments: Accepted for publication at Bernoulli

    MSC Class: 60H07; 60G22; 60G17; 62P05; 62M09; 42A16

    Journal ref: Bernoulli 31(2): 1084-1113 (May 2025)

  13. arXiv:2212.14777  [pdf, other]

    stat.ME

    Polynomial spline regression: Theory and Application

    Authors: Mithun Kumar Acharjee, Kumer Pial Das

    Abstract: To deal with non-linear relations between the predictors and the response, we can use transformations to make the data look linear or approximately linear. In practice, however, transformation methods may be ineffective, and it may be more efficient to use flexible regression techniques that can automatically handle nonlinear behavior. One such method is the Polynomial Spline (PS) regression. Beca…

    Submitted 30 December, 2022; originally announced December 2022.

    Comments: 18 pages, 5 figures

  14. arXiv:2207.07174  [pdf, other]

    cs.LG stat.ML

    Attribute Graphs Underlying Molecular Generative Models: Path to Learning with Limited Data

    Authors: Samuel C. Hoffman, Payel Das, Karthikeyan Shanmugam, Kahini Wadhawan, Prasanna Sattigeri

    Abstract: Training generative models that capture rich semantics of the data and interpreting the latent representations encoded by such models are very important problems in un-/self-supervised learning. In this work, we provide a simple algorithm that relies on perturbation experiments on latent codes of a pre-trained generative autoencoder to uncover an attribute graph that is implied by the generative m…

    Submitted 29 August, 2024; v1 submitted 14 July, 2022; originally announced July 2022.

    Comments: New experiments; reframed contributions

  15. arXiv:2204.09042  [pdf, other]

    q-bio.QM cs.LG q-bio.BM stat.ML

    Accelerating Inhibitor Discovery With A Deep Generative Foundation Model: Validation for SARS-CoV-2 Drug Targets

    Authors: Vijil Chenthamarakshan, Samuel C. Hoffman, C. David Owen, Petra Lukacik, Claire Strain-Damerell, Daren Fearon, Tika R. Malla, Anthony Tumber, Christopher J. Schofield, Helen M. E. Duyvesteyn, Wanwisa Dejnirattisai, Loic Carrique, Thomas S. Walter, Gavin R. Screaton, Tetiana Matviiuk, Aleksandra Mojsilovic, Jason Crain, Martin A. Walsh, David I. Stuart, Payel Das

    Abstract: The discovery of novel inhibitor molecules for emerging drug-target proteins is widely acknowledged as a challenging inverse design problem: Exhaustive exploration of the vast chemical search space is impractical, especially when the target structure or active molecules are unknown. Here we validate experimentally the broad utility of a deep generative framework trained at-scale on protein sequenc…

    Submitted 14 October, 2022; v1 submitted 19 April, 2022; originally announced April 2022.

    Comments: Revised title, abstract, and text; additional figures

  16. arXiv:2203.06848  [pdf]

    cs.LG q-fin.ST stat.AP

    A Comparative Study on Forecasting of Retail Sales

    Authors: Md Rashidul Hasan, Muntasir A Kabir, Rezoan A Shuvro, Pankaz Das

    Abstract: Predicting product sales of large retail companies is a challenging task considering volatile nature of trends, seasonalities, events as well as unknown factors such as market competitions, change in customer's preferences, or unforeseen events, e.g., COVID-19 outbreak. In this paper, we benchmark forecasting models on historical sales data from Walmart to predict their future sales. We provide a…

    Submitted 14 March, 2022; originally announced March 2022.

  17. arXiv:2111.07458  [pdf, ps, other]

    cs.LG stat.ML

    Mean-based Best Arm Identification in Stochastic Bandits under Reward Contamination

    Authors: Arpan Mukherjee, Ali Tajer, Pin-Yu Chen, Payel Das

    Abstract: This paper investigates the problem of best arm identification in $\textit{contaminated}$ stochastic multi-arm bandits. In this setting, the rewards obtained from any arm are replaced by samples from an adversarial model with probability $\varepsilon$. A fixed confidence (infinite-horizon) setting is considered, where the goal of the learner is to identify the arm with the largest mean. Owing to t…

    Submitted 14 November, 2021; originally announced November 2021.

  18. Non-Asymptotic Guarantees for Reliable Identification of Granger Causality via the LASSO

    Authors: Proloy Das, Behtash Babadi

    Abstract: Granger causality is among the widely used data-driven approaches for causal analysis of time series data with applications in various areas including economics, molecular biology, and neuroscience. Two of the main challenges of this methodology are: 1) over-fitting as a result of limited data duration, and 2) correlated process noise as a confounding factor, both leading to errors in identifying…

    Submitted 14 July, 2023; v1 submitted 3 March, 2021; originally announced March 2021.

  19. arXiv:2102.08659  [pdf, other]

    stat.ML cs.LG

    Unbiased Estimations based on Binary Classifiers: A Maximum Likelihood Approach

    Authors: Marco J. H. Puts, Piet J. H. Daas

    Abstract: Binary classifiers trained on a certain proportion of positive items introduce a bias when applied to data sets with different proportions of positive items. Most solutions for dealing with this issue assume that some information on the latter distribution is known. However, this is not always the case, certainly when this proportion is the target variable. In this paper a maximum likelihood estim…

    Submitted 17 February, 2021; originally announced February 2021.

    Comments: 2 pages, 2 figures, SDSS symposium 2021

    ACM Class: G.3

  20. arXiv:2101.00357  [pdf, other]

    stat.AP

    How do mobility restrictions and social distancing during COVID-19 affect the crude oil price?

    Authors: Asim K. Dey, Kumer P. Das

    Abstract: We develop an air mobility index and use the newly developed Apple's driving trend index to evaluate the impact of COVID-19 on the crude oil price. We use quantile regression and stationary and non-stationary extreme value models to study the impact. We find that both the \textit{air mobility index} and \textit{driving trend index} significantly influence lower and upper quantiles as well as the m…

    Submitted 1 January, 2021; originally announced January 2021.

  21. arXiv:2009.02439  [pdf, other]

    cs.LG math.OC stat.ML

    Optimizing Mode Connectivity via Neuron Alignment

    Authors: N. Joseph Tatro, Pin-Yu Chen, Payel Das, Igor Melnyk, Prasanna Sattigeri, Rongjie Lai

    Abstract: The loss landscapes of deep neural networks are not well understood due to their high nonconvexity. Empirically, the local minima of these loss functions can be connected by a learned curve in model space, along which the loss remains nearly constant; a feature known as mode connectivity. Yet, current curve finding algorithms do not consider the influence of symmetry in the loss surface created by…

    Submitted 2 November, 2020; v1 submitted 4 September, 2020; originally announced September 2020.

    Comments: Accepted to NeurIPS 2020, 24 pages, 9 figures, code available at https://github.com/IBM/NeuronAlignment

    Journal ref: Advances in Neural Information Processing Systems, Volume 33, 2020

  22. arXiv:2008.01674  [pdf]

    stat.ML cs.LG physics.soc-ph

    A Machine Learning Approach for Modelling Parking Duration in Urban Land-use

    Authors: Janak Parmar, Pritikana Das, Sanjaykumar Dave

    Abstract: Parking is an inevitable issue in the fast-growing developing countries. An increasing number of vehicles requires more and more urban land to be allocated for parking. However, little attention has been paid to parking issues in developing countries like India. This study proposes a model for analysing the influence of car users' socioeconomic and travel characteristics on parking duration…

    Submitted 10 October, 2023; v1 submitted 4 August, 2020; originally announced August 2020.

    Journal ref: Physica A: Statistical Mechanics and its Applications, 2021

  23. arXiv:2007.01615  [pdf, ps, other]

    math.ST stat.ME

    On Second order correctness of Bootstrap in Logistic Regression

    Authors: Debraj Das, Priyam Das

    Abstract: In the fields of clinical trials, biomedical surveys, marketing, banking, with dichotomous response variable, the logistic regression is considered as an alternative convenient approach to linear regression. In this paper, we develop a novel bootstrap technique based on perturbation resampling method for approximating the distribution of the maximum likelihood estimator (MLE) of the regression par…

    Submitted 18 September, 2020; v1 submitted 3 July, 2020; originally announced July 2020.

    Comments: 38 pages

    MSC Class: 62J12 (primary) 62F40; 62E20 (secondary)

  24. arXiv:2006.03963  [pdf, other]

    cs.LG stat.ML

    Combinatorial Black-Box Optimization with Expert Advice

    Authors: Hamid Dadkhahi, Karthikeyan Shanmugam, Jesus Rios, Payel Das, Samuel Hoffman, Troy David Loeffler, Subramanian Sankaranarayanan

    Abstract: We consider the problem of black-box function optimization over the boolean hypercube. Despite the vast literature on black-box function optimization over continuous domains, not much attention has been paid to learning models for optimization over combinatorial domains until recently. However, the computational complexity of the recently devised algorithms are prohibitive even for moderate number…

    Submitted 13 October, 2020; v1 submitted 6 June, 2020; originally announced June 2020.

    Journal ref: KDD 2020

  25. arXiv:2005.11248  [pdf, other]

    cs.LG q-bio.QM stat.ML

    Accelerating Antimicrobial Discovery with Controllable Deep Generative Models and Molecular Dynamics

    Authors: Payel Das, Tom Sercu, Kahini Wadhawan, Inkit Padhi, Sebastian Gehrmann, Flaviu Cipcigan, Vijil Chenthamarakshan, Hendrik Strobelt, Cicero dos Santos, Pin-Yu Chen, Yi Yan Yang, Jeremy Tan, James Hedrick, Jason Crain, Aleksandra Mojsilovic

    Abstract: De novo therapeutic design is challenged by a vast chemical repertoire and multiple constraints, e.g., high broad-spectrum potency and low toxicity. We propose CLaSS (Controlled Latent attribute Space Sampling) - an efficient computational method for attribute-controlled generation of molecules, which leverages guidance from classifiers trained on an informative latent space of molecules modeled u…

    Submitted 25 February, 2021; v1 submitted 22 May, 2020; originally announced May 2020.

    Journal ref: Nature Biomedical Engineering (2021)

  26. arXiv:2005.00060  [pdf, other]

    cs.LG cs.CV stat.ML

    Bridging Mode Connectivity in Loss Landscapes and Adversarial Robustness

    Authors: Pu Zhao, Pin-Yu Chen, Payel Das, Karthikeyan Natesan Ramamurthy, Xue Lin

    Abstract: Mode connectivity provides novel geometric insights on analyzing loss landscapes and enables building high-accuracy pathways between well-trained neural networks. In this work, we propose to employ mode connectivity in loss landscapes to study the adversarial robustness of deep neural networks, and provide novel methods for improving this robustness. Our experiments cover various types of adversar…

    Submitted 2 July, 2020; v1 submitted 30 April, 2020; originally announced May 2020.

    Comments: accepted by ICLR 2020

  27. arXiv:2004.01215  [pdf, other]

    cs.LG q-bio.QM stat.ML

    CogMol: Target-Specific and Selective Drug Design for COVID-19 Using Deep Generative Models

    Authors: Vijil Chenthamarakshan, Payel Das, Samuel C. Hoffman, Hendrik Strobelt, Inkit Padhi, Kar Wai Lim, Benjamin Hoover, Matteo Manica, Jannis Born, Teodoro Laino, Aleksandra Mojsilovic

    Abstract: The novel nature of SARS-CoV-2 calls for the development of efficient de novo drug design approaches. In this study, we propose an end-to-end framework, named CogMol (Controlled Generation of Molecules), for designing new drug-like small molecules targeting novel viral proteins with high affinity and off-target selectivity. CogMol combines adaptive pre-training of a molecular SMILES Variational Au…

    Submitted 23 June, 2020; v1 submitted 2 April, 2020; originally announced April 2020.

  28. arXiv:2003.05408  [pdf, other]

    physics.med-ph cs.CV eess.IV stat.AP

    Early Response Assessment in Lung Cancer Patients using Spatio-temporal CBCT Images

    Authors: Bijju Kranthi Veduruparthi, Jayanta Mukherjee, Partha Pratim Das, Mandira Saha, Sanjoy Chatterjee, Raj Kumar Shrimali, Soumendranath Ray, Sriram Prasath

    Abstract: We report a model to predict patient's radiological response to curative radiation therapy (RT) for non-small-cell lung cancer (NSCLC). Cone-Beam Computed Tomography images acquired weekly during the six-week course of RT were contoured with the Gross Tumor Volume (GTV) by senior radiation oncologists for 53 patients (7 images per patient). Deformable registration of the images yielded six def…

    Submitted 7 March, 2020; originally announced March 2020.

  29. arXiv:2003.03537  [pdf, other]

    physics.med-ph cs.CV stat.AP

    Novel Radiomic Feature for Survival Prediction of Lung Cancer Patients using Low-Dose CBCT Images

    Authors: Bijju Kranthi Veduruparthi, Jayanta Mukherjee, Partha Pratim Das, Moses Arunsingh, Raj Kumar Shrimali, Sriram Prasath, Soumendranath Ray, Sanjay Chatterjee

    Abstract: Prediction of survivability in a patient for tumor progression is useful to estimate the effectiveness of a treatment protocol. In our work, we present a model to take into account the heterogeneous nature of a tumor to predict survival. The tumor heterogeneity is measured in terms of its mass by combining information regarding the radiodensity obtained in images with the gross tumor volume (GTV).…

    Submitted 7 March, 2020; originally announced March 2020.

    Comments: Under review in SPIE Journal of Medical Imaging

  30. arXiv:2002.01119  [pdf, other]

    cs.LG cs.DC stat.ML

    Improving Efficiency in Large-Scale Decentralized Distributed Training

    Authors: Wei Zhang, Xiaodong Cui, Abdullah Kayi, Mingrui Liu, Ulrich Finkler, Brian Kingsbury, George Saon, Youssef Mroueh, Alper Buyuktosunoglu, Payel Das, David Kung, Michael Picheny

    Abstract: Decentralized Parallel SGD (D-PSGD) and its asynchronous variant Asynchronous Parallel SGD (AD-PSGD) is a family of distributed learning algorithms that have been demonstrated to perform well for large-scale deep learning tasks. One drawback of (A)D-PSGD is that the spectral gap of the mixing matrix decreases when the number of learners in the system increases, which hampers convergence. In this p…

    Submitted 3 February, 2020; originally announced February 2020.

    Journal ref: 45th International Conference on Acoustics, Speech, and Signal Processing (ICASSP'2020) Oral

  31. arXiv:1910.07899  [pdf, other]

    cs.LG stat.ML

    Design, Benchmarking and Explainability Analysis of a Game-Theoretic Framework towards Energy Efficiency in Smart Infrastructure

    Authors: Ioannis C. Konstantakopoulos, Hari Prasanna Das, Andrew R. Barkan, Shiying He, Tanya Veeravalli, Huihan Liu, Aummul Baneen Manasawala, Yu-Wen Lin, Costas J. Spanos

    Abstract: In this paper, we propose a gamification approach as a novel framework for smart building infrastructure with the goal of motivating human occupants to reconsider personal energy usage and to have positive effects on their environment. Human interaction in the context of cyber-physical systems is a core component and consideration in the implementation of any smart building technology. Research ha…

    Submitted 16 October, 2019; originally announced October 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1809.05142, arXiv:1810.10533

  32. A Novel Graphical Lasso based approach towards Segmentation Analysis in Energy Game-Theoretic Frameworks

    Authors: Hari Prasanna Das, Ioannis C. Konstantakopoulos, Aummul Baneen Manasawala, Tanya Veeravalli, Huihan Liu, Costas J. Spanos

    Abstract: Energy game-theoretic frameworks have emerged to be a successful strategy to encourage energy efficient behavior in large scale by leveraging human-in-the-loop strategy. A number of such frameworks have been introduced over the years which formulate the energy saving process as a competitive game with appropriate incentives for energy efficient players. However, prior works involve an incentive de…

    Submitted 5 October, 2019; originally announced October 2019.

    Comments: Proceedings of the Special Session on Machine Learning in Energy Application, International Conference on Machine Learning and Applications (ICMLA) 2019. arXiv admin note: text overlap with arXiv:1810.10533

    Journal ref: 2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)

  33. arXiv:1909.04024  [pdf, ps, other]

    stat.ME stat.CO

    Estimating the Optimal Linear Combination of Biomarkers using Spherically Constrained Optimization

    Authors: Priyam Das, Debsurya De, Raju Maiti, Mona Kamal, Katherine A. Hutcheson, Clifton D. Fuller, Bibhas Chakraborty, Christine B. Peterson

    Abstract: In the context of a binary classification problem, the optimal linear combination of continuous predictors can be estimated by maximizing an empirical estimate of the area under the receiver operating characteristic (ROC) curve (AUC). For multi-category responses, the optimal predictor combination can similarly be obtained by maximization of the empirical hypervolume under the manifold (HUM). This…

    Submitted 7 January, 2021; v1 submitted 6 September, 2019; originally announced September 2019.

  34. arXiv:1908.01686  [pdf, other]

    cs.LG stat.ML

    Likelihood Contribution based Multi-scale Architecture for Generative Flows

    Authors: Hari Prasanna Das, Pieter Abbeel, Costas J. Spanos

    Abstract: Deep generative modeling using flows has gained popularity owing to the tractable exact log-likelihood estimation with efficient training and synthesis process. However, flow models suffer from the challenge of having high dimensional latent space, the same in dimension as the input space. An effective solution to the above challenge as proposed by Dinh et al. (2016) is a multi-scale architecture,…

    Submitted 27 January, 2022; v1 submitted 5 August, 2019; originally announced August 2019.

  35. arXiv:1906.08451  [pdf, ps, other]

    eess.SP cs.IT q-bio.NC stat.ME

    Multitaper Spectral Analysis of Neuronal Spiking Activity Driven by Latent Stationary Processes

    Authors: Proloy Das, Behtash Babadi

    Abstract: Investigating the spectral properties of the neural covariates that underlie spiking activity is an important problem in systems neuroscience, as it allows to study the role of brain rhythms in cognitive functions. While the spectral estimation of continuous time-series is a well-established domain, computing the spectral representation of these neural covariates from spiking data sets forth vario…

    Submitted 20 June, 2019; originally announced June 2019.

  36. arXiv:1904.10046  [pdf, other]

    stat.ME stat.AP

    A distribution-free smoothed combination method of biomarkers to improve diagnostic accuracy in multi-category classification

    Authors: Raju Maiti, Jialiang Li, Priyam Das, Lei Feng, Derek Hausenloy, Bibhas Chakraborty

    Abstract: Results from multiple diagnostic tests are usually combined to improve the overall diagnostic accuracy. For binary classification, maximization of the empirical estimate of the area under the receiver operating characteristic (ROC) curve is widely adopted to produce the optimal linear combination of multiple biomarkers. In the presence of large number of biomarkers, this method proves to be comput…

    Submitted 22 April, 2019; originally announced April 2019.

  37. NExUS: Bayesian simultaneous network estimation across unequal sample sizes

    Authors: Priyam Das, Christine Peterson, Kim-Anh Do, Rehan Akbani, Veerabhadran Baladandayuthapani

    Abstract: Network-based analyses of high-throughput genomics data provide a holistic, systems-level understanding of various biological mechanisms for a common population. However, when estimating multiple networks across heterogeneous sub-populations, varying sample sizes pose a challenge in the estimation and inference, as network differences may be driven by differences in power. We are particularly inte…

    Submitted 6 November, 2018; originally announced November 2018.

    Comments: 8 pages, 8 figures

  38. arXiv:1810.10533  [pdf, other]

    cs.LG stat.AP stat.ML

    Segmentation Analysis in Human Centric Cyber-Physical Systems using Graphical Lasso

    Authors: Hari Prasanna Das, Ioannis C. Konstantakopoulos, Aummul Baneen Manasawala, Tanya Veeravalli, Huihan Liu, Costas J. Spanos

    Abstract: A generalized gamification framework is introduced as a form of smart infrastructure with potential to improve sustainability and energy efficiency by leveraging humans-in-the-loop strategy. The proposed framework enables a Human-Centric Cyber-Physical System using an interface to allow building managers to interact with occupants. The interface is designed for occupant engagement-integration supp…

    Submitted 16 January, 2019; v1 submitted 24 October, 2018; originally announced October 2018.

    Comments: arXiv admin note: substantial text overlap with arXiv:1809.05142

  39. arXiv:1810.07743  [pdf, other]

    q-bio.QM cs.LG stat.ML

    PepCVAE: Semi-Supervised Targeted Design of Antimicrobial Peptide Sequences

    Authors: Payel Das, Kahini Wadhawan, Oscar Chang, Tom Sercu, Cicero Dos Santos, Matthew Riemer, Vijil Chenthamarakshan, Inkit Padhi, Aleksandra Mojsilovic

    Abstract: Given the emerging global threat of antimicrobial resistance, new methods for next-generation antimicrobial design are urgently needed. We report a peptide generation framework PepCVAE, based on a semi-supervised variational autoencoder (VAE) model, for designing novel antimicrobial peptide (AMP) sequences. Our model learns a rich latent space of the biological peptide context by taking advantage…

    Submitted 13 November, 2018; v1 submitted 17 October, 2018; originally announced October 2018.

  40. arXiv:1712.08041  [pdf, other]

    q-bio.NC stat.ML

    Autism Classification Using Brain Functional Connectivity Dynamics and Machine Learning

    Authors: Ravi Tejwani, Adam Liska, Hongyuan You, Jenna Reinen, Payel Das

    Abstract: The goal of the present study is to identify autism using machine learning techniques and resting-state brain imaging data, leveraging the temporal variability of the functional connections (FC) as the only information. We estimated and compared the FC variability across brain regions between typical, healthy subjects and autistic population by analyzing brain imaging data from a world-wide multi-…

    Submitted 21 December, 2017; originally announced December 2017.

  41. arXiv:1711.06195  [pdf, other]

    stat.ML cs.LG

    Neurology-as-a-Service for the Developing World

    Authors: Tejas Dharamsi, Payel Das, Tejaswini Pedapati, Gregory Bramble, Vinod Muthusamy, Horst Samulowitz, Kush R. Varshney, Yuvaraj Rajamanickam, John Thomas, Justin Dauwels

    Abstract: Electroencephalography (EEG) is an extensively-used and well-studied technique in the field of medical diagnostics and treatment for brain disorders, including epilepsy, migraines, and tumors. The analysis and interpretation of EEGs require physicians to have specialized training, which is not common even among most doctors in the developed world, let alone the developing world where physician sho…

    Submitted 21 November, 2017; v1 submitted 16 November, 2017; originally announced November 2017.

    Comments: Presented at NIPS 2017 Workshop on Machine Learning for the Developing World

  42. arXiv:1707.08719  [pdf, other]

    stat.AP

    Analysis of Deformation Fields in Spatio-temporal CBCT images of lungs for radiotherapy patients

    Authors: Bijju Kranthi Veduruparthi, Jayanta Mukherjee, Partha Pratim Das, Mandira Saha, Raj Kumar Shrimali, Sanjoy Chatterjee, Soumendranath Ray, Sriram Prasath

    Abstract: Deformable registration of spatiotemporal Cone-Beam Computed Tomography (CBCT) images taken sequentially during the radiation treatment course yields a deformation field for a pair of images. The Jacobian of this field at any voxel provides a measure of the expansion or contraction of a unit volume. We analyze the Jacobian at different sections of the tumor volumes obtained from delineation done b…

    Submitted 27 July, 2017; originally announced July 2017.

  43. Bayesian Non-parametric Simultaneous Quantile Regression for Complete and Grid Data

    Authors: Priyam Das, Subhashis Ghosal

    Abstract: In this paper, we consider Bayesian methods for non-parametric quantile regressions with multiple continuous predictors ranging values in the unit interval. In the first method, the quantile function is assumed to be smooth over the explanatory variable and is expanded in tensor product of B-spline basis functions. While in the second method, the distribution function is assumed to be smooth over…

    Submitted 30 November, 2016; originally announced December 2016.

    Comments: 25 pages

  44. arXiv:1610.07024  [pdf, other]

    stat.AP

    Understanding Sea Ice Melting via Functional Data Analysis

    Authors: Purba Das, Ananya Lahiri, Sourish Das

    Abstract: In this article, we considered the problem of sea ice cover is melting. Considering the `satellite passive microwave remote sensing data' as functional data, we studied daily observation of sea ice cover of each year as a smooth continuous function of time. We investigated the mean function for the sea ice area for following decades and computed the corresponding $95\%$ bootstrap confidence interv…

    Submitted 22 October, 2016; originally announced October 2016.

    Comments: 9 pages, 22 figures

    MSC Class: 62-04; 90-08; 62G

  45. Analyzing Ozone Concentration by Bayesian Spatio-temporal Quantile Regression

    Authors: Priyam Das, Subhashis Ghosal

    Abstract: Ground level Ozone is one of the six common air-pollutants on which the EPA has set national air quality standards. In order to capture the spatio-temporal trend of 1-hour and 8-hour average ozone concentration in the US, we develop a method for spatio-temporal simultaneous quantile regression. Unlike existing procedures, in the proposed method, smoothing across the sites is incorporated within mo…

    Submitted 5 December, 2016; v1 submitted 15 September, 2016; originally announced September 2016.

  46. Bayesian Quantile Regression Using Random B-spline Series Prior

    Authors: Priyam Das, Subhashis Ghoshal

    Abstract: We consider a Bayesian method for simultaneous quantile regression on a real variable. By monotone transformation, we can make both the response variable and the predictor variable take values in the unit interval. A representation of quantile function is given by a convex combination of two monotone increasing functions $ξ_1$ and $ξ_2$ not depending on the prediction variables. In a Bayesian appr…

    Submitted 9 September, 2016; originally announced September 2016.

  47. arXiv:1604.08636  [pdf, other]

    math.OC cs.DS stat.ME

    Recursive Modified Pattern Search on High-dimensional Simplex: A Blackbox Optimization Technique

    Authors: Priyam Das

    Abstract: In this paper, a novel derivative-free pattern search based algorithm for Black-box optimization is proposed over a simplex constrained parameter space. At each iteration, starting from the current solution, new possible set of solutions are found by adding a set of derived step-size vectors to the initial starting point. While deriving these step-size vectors, precautions and adjustments are cons…

    Submitted 30 January, 2019; v1 submitted 28 April, 2016; originally announced April 2016.

  48. Black-box optimization on hyper-rectangle using Recursive Modified Pattern Search and application to ROC-based Classification Problem

    Authors: Priyam Das

    Abstract: In statistics, it is common to encounter multi-modal and non-smooth likelihood (or objective function) maximization problems, where the parameters have known upper and lower bounds. This paper proposes a novel derivative-free global optimization technique that can be used to solve those problems even when the objective function is not known explicitly or its derivatives are difficult or expensive…

    Submitted 12 September, 2023; v1 submitted 28 April, 2016; originally announced April 2016.

  49. Multi-task Sparse Structure Learning

    Authors: Andre R. Goncalves, Puja Das, Soumyadeep Chatterjee, Vidyashankar Sivakumar, Fernando J. Von Zuben, Arindam Banerjee

    Abstract: Multi-task learning (MTL) aims to improve generalization performance by learning multiple related tasks simultaneously. While sometimes the underlying task relationship structure is known, often the structure needs to be estimated from data at hand. In this paper, we present a novel family of models for MTL, applicable to regression and classification problems, capable of learning the structure of…

    Submitted 1 September, 2014; v1 submitted 31 August, 2014; originally announced September 2014.

    Comments: 23rd ACM International Conference on Information and Knowledge Management - CIKM 2014

    ACM Class: I.5.1, J.2