Skip to main content

Showing 1–19 of 19 results for author: Datta, J

Searching in archive stat. Search in all archives.
.
  1. arXiv:2410.20641  [pdf, other

    stat.ME stat.AP

    The Curious Problem of the Normal Inverse Mean

    Authors: Soham Ghosh, Uttaran Chatterjee, Jyotishka Datta

    Abstract: In astronomical observations, the estimation of distances from parallaxes is a challenging task due to the inherent measurement errors and the non-linear relationship between the parallax and the distance. This study leverages ideas from robust Bayesian inference to tackle these challenges, investigating a broad class of prior densities for estimating distances with a reduced bias and variance. Th… ▽ More

    Submitted 27 October, 2024; originally announced October 2024.

    Comments: 26 pages

    MSC Class: 62C10l; 62F35

  2. arXiv:2406.17058  [pdf, other

    stat.ME cs.LG

    Horseshoe-type Priors for Independent Component Estimation

    Authors: Jyotishka Datta, Nicholas G. Polson

    Abstract: Independent Component Estimation (ICE) has many applications in modern day machine learning as a feature engineering extraction method. Horseshoe-type priors are used to provide scalable algorithms that enables both point estimates via expectation-maximization (EM) and full posterior sampling via Markov Chain Monte Carlo (MCMC) algorithms. Our methodology also applies to flow-based methods for non… ▽ More

    Submitted 1 September, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

    Comments: 23 pages, 2 figures

    MSC Class: 62F15; 62H25; 68T07

  3. arXiv:2308.03355  [pdf, other

    stat.ME stat.AP

    Nonparametric Bayes multiresolution testing for high-dimensional rare events

    Authors: Jyotishka Datta, Sayantan Banerjee, David B. Dunson

    Abstract: In a variety of application areas, there is interest in assessing evidence of differences in the intensity of event realizations between groups. For example, in cancer genomic studies collecting data on rare variants, the focus is on assessing whether and how the variant profile changes with the disease subtype. Motivated by this application, we develop multiresolution nonparametric Bayes tests fo… ▽ More

    Submitted 19 January, 2024; v1 submitted 7 August, 2023; originally announced August 2023.

  4. arXiv:2305.03158  [pdf, other

    stat.CO stat.ME

    Quantile Importance Sampling

    Authors: Jyotishka Datta, Nicholas G. Polson

    Abstract: In Bayesian inference, the approximation of integrals of the form $ψ= \mathbb{E}_{F}{l(X)} = \int_χ l(\mathbf{x}) d F(\mathbf{x})$ is a fundamental challenge. Such integrals are crucial for evidence estimation, which is important for various purposes, including model selection and numerical analysis. The existing strategies for evidence estimation are classified into four categories: deterministic… ▽ More

    Submitted 25 May, 2023; v1 submitted 4 May, 2023; originally announced May 2023.

    MSC Class: 65C05; 62F15

  5. arXiv:2303.06914  [pdf, ps, other

    stat.ME stat.CO

    Maximum a Posteriori Estimation in Graphical Models Using Local Linear Approximation

    Authors: Ksheera Sagar, Jyotishka Datta, Sayantan Banerjee, Anindya Bhadra

    Abstract: Sparse structure learning in high-dimensional Gaussian graphical models is an important problem in multivariate statistical signal processing; since the sparsity pattern naturally encodes the conditional independence relationship among variables. However, maximum a posteriori (MAP) estimation is challenging under hierarchical prior models, and traditional numerical optimization routines or expecta… ▽ More

    Submitted 23 September, 2023; v1 submitted 13 March, 2023; originally announced March 2023.

  6. arXiv:2212.05486  [pdf, other

    stat.AP

    Quantifying the Effect of Socio-Economic Predictors and Built Environment on Mental Health Events in Little Rock, AR

    Authors: Alfieri Ek, Samantha Robinson, Grant Drawve, Jyotishka Datta

    Abstract: Proper allocation of law enforcement resources remains a critical issue in crime prediction and prevention that operates by characterizing spatially aggregated crime activities and a multitude of predictor variables of interest. Despite the critical nature of proper resource allocation for mental health incidents, there has been little progress in statistical modeling of the geo-spatial nature of… ▽ More

    Submitted 11 December, 2022; originally announced December 2022.

  7. arXiv:2205.01016  [pdf, other

    stat.ME

    Evidence Estimation in Gaussian Graphical Models Using a Telescoping Block Decomposition of the Precision Matrix

    Authors: Anindya Bhadra, Ksheera Sagar, David Rowe, Sayantan Banerjee, Jyotishka Datta

    Abstract: Marginal likelihood, also known as model evidence, is a fundamental quantity in Bayesian statistics. It is used for model selection using Bayes factors or for empirical Bayes tuning of prior hyper-parameters. Yet, the calculation of evidence has remained a longstanding open problem in Gaussian graphical models. Currently, the only feasible solutions that exist are for special cases such as the Wis… ▽ More

    Submitted 30 August, 2024; v1 submitted 2 May, 2022; originally announced May 2022.

  8. arXiv:2204.14121  [pdf, other

    stat.ME

    Inverse Probability Weighting: from Survey Sampling to Evidence Estimation

    Authors: Jyotishka Datta, Nicholas Polson

    Abstract: We consider the class of inverse probability weight (IPW) estimators, including the popular Horvitz-Thompson and Hajek estimators used routinely in survey sampling, causal inference and evidence estimation for Bayesian computation. We focus on the 'weak paradoxes' for these estimators due to two counterexamples by Basu [1988] and Wasserman [2004] and investigate the two natural Bayesian answers to… ▽ More

    Submitted 13 April, 2025; v1 submitted 29 April, 2022; originally announced April 2022.

    Comments: 23 pages, 4 figures. Reorganized the paper. Fixed a typo in one of the definitions

    MSC Class: 62F15; 62F12; 62D05; 65C05

  9. arXiv:2110.11561  [pdf, other

    stat.ME cs.LG stat.ML

    Merging Two Cultures: Deep and Statistical Learning

    Authors: Anindya Bhadra, Jyotishka Datta, Nick Polson, Vadim Sokolov, Jianeng Xu

    Abstract: Merging the two cultures of deep and statistical learning provides insights into structured high-dimensional data. Traditional statistical modeling is still a dominant strategy for structured tabular data. Deep learning can be viewed through the lens of generalized linear models (GLMs) with composite link functions. Sufficient dimensionality reduction (SDR) and sparsity performs nonlinear feature… ▽ More

    Submitted 21 October, 2021; originally announced October 2021.

    Comments: arXiv admin note: text overlap with arXiv:2106.14085

  10. arXiv:2104.10750  [pdf, other

    math.ST stat.ME

    Precision Matrix Estimation under the Horseshoe-like Prior-Penalty Dual

    Authors: Ksheera Sagar, Sayantan Banerjee, Jyotishka Datta, Anindya Bhadra

    Abstract: Precision matrix estimation in a multivariate Gaussian model is fundamental to network estimation. Although there exist both Bayesian and frequentist approaches to this, it is difficult to obtain good Bayesian and frequentist properties under the same prior--penalty dual. To bridge this gap, our contribution is a novel prior--penalty dual that closely approximates the graphical horseshoe prior and… ▽ More

    Submitted 18 January, 2022; v1 submitted 21 April, 2021; originally announced April 2021.

    Comments: 29 pages, 2 figures

  11. arXiv:2102.12938  [pdf, other

    stat.ME math.ST

    On Posterior consistency of Bayesian Changepoint models

    Authors: Nilabja Guha, Jyotishka Datta

    Abstract: While there have been a lot of recent developments in the context of Bayesian model selection and variable selection for high dimensional linear models, there is not much work in the presence of change point in literature, unlike the frequentist counterpart. We consider a hierarchical Bayesian linear model where the active set of covariates that affects the observations through a mean model can va… ▽ More

    Submitted 25 February, 2021; originally announced February 2021.

  12. Group Inverse-Gamma Gamma Shrinkage for Sparse Regression with Block-Correlated Predictors

    Authors: Jonathan Boss, Jyotishka Datta, Xin Wang, Sung Kyun Park, Jian Kang, Bhramar Mukherjee

    Abstract: Heavy-tailed continuous shrinkage priors, such as the horseshoe prior, are widely used for sparse estimation problems. However, there is limited work extending these priors to predictors with grouping structures. Of particular interest in this article, is regression coefficient estimation where pockets of high collinearity in the covariate space are contained within known covariate groupings. To a… ▽ More

    Submitted 21 February, 2021; originally announced February 2021.

    Comments: 44 pages, 4 figures

  13. arXiv:2010.03228  [pdf, other

    stat.ML cs.AI cs.LG

    FairMixRep : Self-supervised Robust Representation Learning for Heterogeneous Data with Fairness constraints

    Authors: Souradip Chakraborty, Ekansh Verma, Saswata Sahoo, Jyotishka Datta

    Abstract: Representation Learning in a heterogeneous space with mixed variables of numerical and categorical types has interesting challenges due to its complex feature manifold. Moreover, feature learning in an unsupervised setup, without class labels and a suitable learning loss function, adds to the problem complexity. Further, the learned representation and subsequent predictions should not reflect disc… ▽ More

    Submitted 14 October, 2020; v1 submitted 7 October, 2020; originally announced October 2020.

    Comments: This paper has been accepted at the ICDM'2020 DLC Workshop

  14. arXiv:1904.10939  [pdf, other

    stat.ME stat.ML

    Horseshoe Regularization for Machine Learning in Complex and Deep Models

    Authors: Anindya Bhadra, Jyotishka Datta, Yunfan Li, Nicholas G. Polson

    Abstract: Since the advent of the horseshoe priors for regularization, global-local shrinkage methods have proved to be a fertile ground for the development of Bayesian methodology in machine learning, specifically for high-dimensional regression and classification problems. They have achieved remarkable success in computation, and enjoy strong theoretical support. Most of the existing literature has focuse… ▽ More

    Submitted 22 November, 2019; v1 submitted 24 April, 2019; originally announced April 2019.

  15. arXiv:1903.06768  [pdf, other

    stat.ME

    Joint Mean-Covariance Estimation via the Horseshoe with an Application in Genomic Data Analysis

    Authors: Yunfan Li, Jyotishka Datta, Bruce A. Craig, Anindya Bhadra

    Abstract: Seemingly unrelated regression is a natural framework for regressing multiple correlated responses on multiple predictors. The model is very flexible, with multiple linear regression and covariance selection models being special cases. However, its practical deployment in genomic data analysis under a Bayesian framework is limited due to both statistical and computational challenges. The statistic… ▽ More

    Submitted 22 July, 2019; v1 submitted 15 March, 2019; originally announced March 2019.

  16. arXiv:1706.10179  [pdf, other

    stat.ME

    Lasso Meets Horseshoe : A Survey

    Authors: Anindya Bhadra, Jyotishka Datta, Nicholas G. Polson, Brandon T. Willard

    Abstract: The goal of this paper is to contrast and survey the major advances in two of the most commonly used high-dimensional techniques, namely, the Lasso and horseshoe regularization. Lasso is a gold standard for predictor selection while horseshoe is a state-of-the-art Bayesian estimator for sparse signals. Lasso is fast and scalable and uses convex optimization whilst the horseshoe is non-convex. Our… ▽ More

    Submitted 3 March, 2019; v1 submitted 30 June, 2017; originally announced June 2017.

    Comments: 32 pages, 4 figures

    MSC Class: Primary 62J07; 62J05; Secondary 62H15; 62F03

  17. arXiv:1702.07400  [pdf, other

    stat.ML stat.CO

    Horseshoe Regularization for Feature Subset Selection

    Authors: Anindya Bhadra, Jyotishka Datta, Nicholas G. Polson, Brandon Willard

    Abstract: Feature subset selection arises in many high-dimensional applications of statistics, such as compressed sensing and genomics. The $\ell_0$ penalty is ideal for this task, the caveat being it requires the NP-hard combinatorial evaluation of all models. A recent area of considerable interest is to develop efficient algorithms to fit models with a non-convex $\ell_γ$ penalty for $γ\in (0,1)$, which r… ▽ More

    Submitted 22 June, 2017; v1 submitted 23 February, 2017; originally announced February 2017.

  18. arXiv:1510.04320  [pdf, ps, other

    stat.ME

    Inference on High-Dimensional Sparse Count Data

    Authors: Jyotishka Datta, David B. Dunson

    Abstract: In a variety of application areas, there is a growing interest in analyzing high dimensional sparse count data, with sparsity exhibited by an over-abundance of zeros and small non-zero counts. Existing approaches for analyzing multivariate count data via Poisson or negative binomial log-linear hierarchical models with zero-inflation cannot flexibly adapt to the level and nature of sparsity in the… ▽ More

    Submitted 14 April, 2016; v1 submitted 14 October, 2015; originally announced October 2015.

    Comments: 20 pages, 7 figures, 2 tables. (This version has a new result regarding tighter control on false discoveries and another real data example. Additional proofs and examples are given in the supplementary file.)

    MSC Class: 62C10; 62F15

  19. arXiv:1510.03516  [pdf, ps, other

    stat.ME

    Default Bayesian analysis with global-local shrinkage priors

    Authors: Anindya Bhadra, Jyotishka Datta, Nicholas G. Polson, Brandon T. Willard

    Abstract: We provide a framework for assessing the default nature of a prior distribution using the property of regular variation, which we study for global-local shrinkage priors. In particular, we demonstrate the horseshoe priors, originally designed to handle sparsity, also possess regular variation and thus are appropriate for default Bayesian analysis. To illustrate our methodology, we solve a problem… ▽ More

    Submitted 14 May, 2016; v1 submitted 12 October, 2015; originally announced October 2015.

    Comments: 28 pages, 7 figures, 6 tables

    MSC Class: 62C10; 62F15