Skip to main content

Showing 1–50 of 66 results for author: Pati, D

.
  1. arXiv:2505.24066  [pdf, ps, other

    math.ST stat.ME stat.ML

    Adaptive finite element type decomposition of Gaussian processes

    Authors: Jaehoan Kim, Anirban Bhattacharya, Debdeep Pati

    Abstract: In this paper, we investigate a class of approximate Gaussian processes (GP) obtained by taking a linear combination of compactly supported basis functions with the basis coefficients endowed with a dependent Gaussian prior distribution. This general class includes a popular approach that uses a finite element approximation of the stochastic partial differential equation (SPDE) associated with Mat… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

    Comments: 50 pages, 7 figures

  2. arXiv:2504.11636  [pdf, other

    stat.ME

    Scalable Efficient Inference in Complex Surveys through Targeted Resampling of Weights

    Authors: Snigdha Das, Dipankar Bandyopadhyay, Debdeep Pati

    Abstract: Survey data often arises from complex sampling designs, such as stratified or multistage sampling, with unequal inclusion probabilities. When sampling is informative, traditional inference methods yield biased estimators and poor coverage. Classical pseudo-likelihood based methods provide accurate asymptotic inference but lack finite-sample uncertainty quantification and the ability to integrate p… ▽ More

    Submitted 15 April, 2025; originally announced April 2025.

    Comments: 43 pages, 5 figures

  3. arXiv:2504.05431  [pdf, other

    stat.ME math.ST

    A Generalized Tangent Approximation Framework for Strongly Super-Gaussian Likelihoods

    Authors: Somjit Roy, Pritam Dey, Debdeep Pati, Bani K. Mallick

    Abstract: Tangent approximation form a popular class of variational inference (VI) techniques for Bayesian analysis in intractable non-conjugate models. It is based on the principle of convex duality to construct a minorant of the marginal likelihood, making the problem tractable. Despite its extensive applications, a general methodology for tangent approximation encompassing a large class of likelihoods be… ▽ More

    Submitted 7 April, 2025; originally announced April 2025.

    Comments: TAVIE introduces a tangent approximation-based variational inference framework for strongly super-Gaussian likelihoods, offering broad model applicability and provable optimality guarantees

  4. arXiv:2310.18047  [pdf, other

    stat.ME

    Robust Bayesian Inference on Riemannian Submanifold

    Authors: Rong Tang, Anirban Bhattacharya, Debdeep Pati, Yun Yang

    Abstract: Non-Euclidean spaces routinely arise in modern statistical applications such as in medical imaging, robotics, and computer vision, to name a few. While traditional Bayesian approaches are applicable to such settings by considering an ambient Euclidean space as the parameter space, we demonstrate the benefits of integrating manifold structure into the Bayesian framework, both theoretically and comp… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

  5. arXiv:2310.12447  [pdf, other

    stat.ML cs.LG

    Constrained Reweighting of Distributions: an Optimal Transport Approach

    Authors: Abhisek Chakraborty, Anirban Bhattacharya, Debdeep Pati

    Abstract: We commonly encounter the problem of identifying an optimally weight adjusted version of the empirical distribution of observed data, adhering to predefined constraints on the weights. Such constraints often manifest as restrictions on the moments, tail behaviour, shapes, number of modes, etc., of the resulting weight adjusted empirical distribution. In this article, we substantially enhance the f… ▽ More

    Submitted 16 January, 2024; v1 submitted 18 October, 2023; originally announced October 2023.

    Comments: arXiv admin note: text overlap with arXiv:2303.10085

  6. arXiv:2309.06349  [pdf, other

    stat.ML cs.LG eess.SY math.OC math.ST

    Generalized Regret Analysis of Thompson Sampling using Fractional Posteriors

    Authors: Prateek Jaiswal, Debdeep Pati, Anirban Bhattacharya, Bani K. Mallick

    Abstract: Thompson sampling (TS) is one of the most popular and earliest algorithms to solve stochastic multi-armed bandit problems. We consider a variant of TS, named $α$-TS, where we use a fractional or $α$-posterior ($α\in(0,1)$) instead of the standard posterior distribution. To compute an $α$-posterior, the likelihood in the definition of the standard posterior is tempered with a factor $α$. For $α$-TS… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

  7. Covariate-Assisted Bayesian Graph Learning for Heterogeneous Data

    Authors: Yabo Niu, Yang Ni, Debdeep Pati, Bani K. Mallick

    Abstract: In a traditional Gaussian graphical model, data homogeneity is routinely assumed with no extra variables affecting the conditional independence. In modern genomic datasets, there is an abundance of auxiliary information, which often gets under-utilized in determining the joint dependency structure. In this article, we consider a Bayesian approach to model undirected graphs underlying heterogeneous… ▽ More

    Submitted 15 August, 2023; originally announced August 2023.

    Comments: 58 pages, 12 figures, accepted by Journal of the American Statistical Association

  8. arXiv:2307.10099  [pdf, other

    math.ST stat.CO stat.ML

    Memory Efficient And Minimax Distribution Estimation Under Wasserstein Distance Using Bayesian Histograms

    Authors: Peter Matthew Jacobs, Lekha Patel, Anirban Bhattacharya, Debdeep Pati

    Abstract: We study Bayesian histograms for distribution estimation on $[0,1]^d$ under the Wasserstein $W_v, 1 \leq v < \infty$ distance in the i.i.d sampling regime. We newly show that when $d < 2v$, histograms possess a special \textit{memory efficiency} property, whereby in reference to the sample size $n$, order $n^{d/2v}$ bins are needed to obtain minimax rate optimality. This result holds for the poste… ▽ More

    Submitted 19 July, 2023; originally announced July 2023.

  9. arXiv:2306.01122  [pdf, other

    stat.ML cs.LG math.ST

    On the Convergence of Coordinate Ascent Variational Inference

    Authors: Anirban Bhattacharya, Debdeep Pati, Yun Yang

    Abstract: As a computational alternative to Markov chain Monte Carlo approaches, variational inference (VI) is becoming more and more popular for approximating intractable posterior distributions in large-scale Bayesian models due to its comparable efficacy and superior efficiency. Several recent works provide theoretical justifications of VI by proving its statistical optimality for parameter estimation un… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

  10. arXiv:2305.17557  [pdf, other

    stat.ML cs.CY cs.LG

    Fair Clustering via Hierarchical Fair-Dirichlet Process

    Authors: Abhisek Chakraborty, Anirban Bhattacharya, Debdeep Pati

    Abstract: The advent of ML-driven decision-making and policy formation has led to an increasing focus on algorithmic fairness. As clustering is one of the most commonly used unsupervised machine learning approaches, there has naturally been a proliferation of literature on {\em fair clustering}. A popular notion of fairness in clustering mandates the clusters to be {\em balanced}, i.e., each level of a prot… ▽ More

    Submitted 27 May, 2023; originally announced May 2023.

  11. Blocked Gibbs sampler for hierarchical Dirichlet processes

    Authors: Snigdha Das, Yabo Niu, Yang Ni, Bani K. Mallick, Debdeep Pati

    Abstract: Posterior computation in hierarchical Dirichlet process (HDP) mixture models is an active area of research in nonparametric Bayes inference of grouped data. Existing literature almost exclusively focuses on the Chinese restaurant franchise (CRF) analogy of the marginal distribution of the parameters, which can mix poorly and has a quadratic complexity with the sample size. A recently developed sli… ▽ More

    Submitted 4 August, 2024; v1 submitted 19 April, 2023; originally announced April 2023.

  12. arXiv:2303.10085  [pdf, other

    stat.ME cs.LG stat.ML

    Robust probabilistic inference via a constrained transport metric

    Authors: Abhisek Chakraborty, Anirban Bhattacharya, Debdeep Pati

    Abstract: Flexible Bayesian models are typically constructed using limits of large parametric models with a multitude of parameters that are often uninterpretable. In this article, we offer a novel alternative by constructing an exponentially tilted empirical likelihood carefully designed to concentrate near a parametric family of distributions of choice with respect to a novel variant of the Wasserstein me… ▽ More

    Submitted 17 March, 2023; originally announced March 2023.

  13. arXiv:2303.08979  [pdf, other

    stat.ME stat.CO

    An Approximate Bayesian Approach to Covariate-dependent Graphical Modeling

    Authors: Sutanoy Dasgupta, Peng Zhao, Jacob Helwig, Prasenjit Ghosh, Debdeep Pati, Bani K. Mallick

    Abstract: Gaussian graphical models typically assume a homogeneous structure across all subjects, which is often restrictive in applications. In this article, we propose a weighted pseudo-likelihood approach for graphical modeling which allows different subjects to have different graphical structures depending on extraneous covariates. The pseudo-likelihood approach replaces the joint distribution by a prod… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

  14. arXiv:2210.00091  [pdf, other

    stat.ME stat.ML

    Factorized Fusion Shrinkage for Dynamic Relational Data

    Authors: Peng Zhao, Anirban Bhattacharya, Debdeep Pati, Bani K. Mallick

    Abstract: Modern data science applications often involve complex relational data with dynamic structures. An abrupt change in such dynamic relational data is typically observed in systems that undergo regime changes due to interventions. In such a case, we consider a factorized fusion shrinkage model in which all decomposed factors are dynamically shrunk towards group-wise fusion structures, where the shrin… ▽ More

    Submitted 12 July, 2024; v1 submitted 30 September, 2022; originally announced October 2022.

  15. arXiv:2209.15117  [pdf, other

    stat.ML math.ST stat.CO

    Structured Optimal Variational Inference for Dynamic Latent Space Models

    Authors: Peng Zhao, Anirban Bhattacharya, Debdeep Pati, Bani K. Mallick

    Abstract: We consider a latent space model for dynamic networks, where our objective is to estimate the pairwise inner products plus the intercept of the latent positions. To balance posterior inference and computational scalability, we consider a structured mean-field variational inference framework, where the time-dependent properties of the dynamic networks are exploited to facilitate computation and inf… ▽ More

    Submitted 15 October, 2024; v1 submitted 29 September, 2022; originally announced September 2022.

    Comments: Accepted by the Journal of Machine Learning Research http://jmlr.org/papers/v25/22-0514.html

  16. arXiv:2112.09865  [pdf, other

    stat.ML cs.LG

    Off-Policy Evaluation Using Information Borrowing and Context-Based Switching

    Authors: Sutanoy Dasgupta, Yabo Niu, Kishan Panaganti, Dileep Kalathil, Debdeep Pati, Bani Mallick

    Abstract: We consider the off-policy evaluation (OPE) problem in contextual bandits, where the goal is to estimate the value of a target policy using the data collected by a logging policy. Most popular approaches to the OPE are variants of the doubly robust (DR) estimator obtained by combining a direct method (DM) estimator and a correction term involving the inverse propensity score (IPS). Existing algori… ▽ More

    Submitted 18 August, 2024; v1 submitted 18 December, 2021; originally announced December 2021.

    Comments: 23 pages, 6 figures, manuscript under review

  17. arXiv:2103.08092  [pdf, other

    math.ST

    Adaptive posterior convergence in sparse high dimensional clipped generalized linear models

    Authors: Biraj Subhra Guha, Debdeep Pati

    Abstract: We develop a framework to study posterior contraction rates in sparse high dimensional generalized linear models (GLM). We introduce a new family of GLMs, denoted by clipped GLM, which subsumes many standard GLMs and makes minor modification of the rest. With a sparsity inducing prior on the regression coefficients, we delineate sufficient conditions on true data generating density that leads to m… ▽ More

    Submitted 14 March, 2021; originally announced March 2021.

  18. arXiv:2102.12976  [pdf, other

    stat.CO stat.ME

    A Hybrid Approximation to the Marginal Likelihood

    Authors: Eric Chuu, Debdeep Pati, Anirban Bhattacharya

    Abstract: Computing the marginal likelihood or evidence is one of the core challenges in Bayesian analysis. While there are many established methods for estimating this quantity, they predominantly rely on using a large number of posterior samples obtained from a Markov Chain Monte Carlo (MCMC) algorithm. As the dimension of the parameter space increases, however, many of these methods become prohibitively… ▽ More

    Submitted 24 February, 2021; originally announced February 2021.

  19. arXiv:2010.14056  [pdf, ps, other

    math.ST cs.LG stat.ML

    Statistical Guarantees for Transformation Based Models with Applications to Implicit Variational Inference

    Authors: Sean Plummer, Shuang Zhou, Anirban Bhattacharya, David Dunson, Debdeep Pati

    Abstract: Transformation-based methods have been an attractive approach in non-parametric inference for problems such as unconditional and conditional density estimation due to their unique hierarchical structure that models the data as flexible transformation of a set of common latent variables. More recently, transformation-based models have been used in variational inference (VI) to construct flexible im… ▽ More

    Submitted 4 November, 2020; v1 submitted 23 October, 2020; originally announced October 2020.

    Comments: First two authors contributed equally to this work. arXiv admin note: text overlap with arXiv:1701.07572

  20. arXiv:2010.13039  [pdf, other

    math.ST stat.CO stat.ML

    Statistical optimality and stability of tangent transform algorithms in logit models

    Authors: Indrajit Ghosh, Anirban Bhattacharya, Debdeep Pati

    Abstract: A systematic approach to finding variational approximation in an otherwise intractable non-conjugate model is to exploit the general principle of convex duality by minorizing the marginal likelihood that renders the problem tractable. While such approaches are popular in the context of variational inference in non-conjugate Bayesian models, theoretical guarantees on statistical optimality and algo… ▽ More

    Submitted 25 October, 2020; originally announced October 2020.

    Comments: 46 pages

  21. arXiv:2010.09540  [pdf, ps, other

    stat.ML cs.LG

    Statistical Guarantees and Algorithmic Convergence Issues of Variational Boosting

    Authors: Biraj Subhra Guha, Anirban Bhattacharya, Debdeep Pati

    Abstract: We provide statistical guarantees for Bayesian variational boosting by proposing a novel small bandwidth Gaussian mixture variational family. We employ a functional version of Frank-Wolfe optimization as our variational algorithm and study frequentist properties of the iterative boosting updates. Comparisons are drawn to the recent literature on boosting, describing how the choice of the variation… ▽ More

    Submitted 21 October, 2020; v1 submitted 19 October, 2020; originally announced October 2020.

  22. arXiv:2008.04537  [pdf, other

    math.ST

    Evidence bounds in singular models: probabilistic and variational perspectives

    Authors: Anirban Bhattacharya, Debdeep Pati, Sean Plummer

    Abstract: The marginal likelihood or evidence in Bayesian statistics contains an intrinsic penalty for larger model sizes and is a fundamental quantity in Bayesian model comparison. Over the past two decades, there has been steadily increasing activity to understand the nature of this penalty in singular statistical models, building on pioneering work by Sumio Watanabe. Unlike regular models where the Bayes… ▽ More

    Submitted 11 August, 2020; originally announced August 2020.

    Comments: 45 pages

  23. arXiv:2007.06715  [pdf, other

    math.DS math.ST

    Dynamics of coordinate ascent variational inference: A case study in 2D Ising models

    Authors: Sean Plummer, Debdeep Pati, Anirban Bhattacharya

    Abstract: Variational algorithms have gained prominence over the past two decades as a scalable computational environment for Bayesian inference. In this article, we explore tools from the dynamical systems literature to study convergence of coordinate ascent algorithms for mean field variational inference. Focusing on the Ising model defined on two nodes, we fully characterize the dynamics of the sequentia… ▽ More

    Submitted 13 July, 2020; originally announced July 2020.

  24. Radius and equation of state constraints from massive neutron stars and GW190814

    Authors: Yeunhwan Lim, Anirban Bhattacharya, Jeremy W. Holt, Debdeep Pati

    Abstract: Motivated by the unknown nature of the $2.50-2.67\,M_\odot$ compact object in the binary merger event GW190814, we study the maximum neutron star mass based on constraints from low-energy nuclear physics, neutron star tidal deformabilities from GW170817, and simultaneous mass-radius measurements of PSR J0030+045 from NICER. Our prior distribution is based on a combination of nuclear modeling valid… ▽ More

    Submitted 5 April, 2021; v1 submitted 13 July, 2020; originally announced July 2020.

    Comments: 6 pages, 4 figures, revised figures

    Journal ref: Phys. Rev. C 104, 032802 (2021)

  25. arXiv:2007.02192  [pdf, other

    math.ST stat.AP stat.CO stat.ME stat.ML

    Tail-adaptive Bayesian shrinkage

    Authors: Se Yoon Lee, Peng Zhao, Debdeep Pati, Bani K. Mallick

    Abstract: Robust Bayesian methods for high-dimensional regression problems under diverse sparse regimes are studied. Traditional shrinkage priors are primarily designed to detect a handful of signals from tens of thousands of predictors in the so-called ultra-sparsity domain. However, they may not perform desirably when the degree of sparsity is moderate. In this paper, we propose a robust sparse estimation… ▽ More

    Submitted 24 October, 2024; v1 submitted 4 July, 2020; originally announced July 2020.

    Comments: Accepted in Electronic Journal of Statistics

  26. arXiv:2005.07844  [pdf, other

    math.ST

    Nonasymptotic Laplace approximation under model misspecification

    Authors: Anirban Bhattacharya, Debdeep Pati

    Abstract: We present non-asymptotic two-sided bounds to the log-marginal likelihood in Bayesian inference. The classical Laplace approximation is recovered as the leading term. Our derivation permits model misspecification and allows the parameter dimension to grow with the sample size. We do not make any assumptions about the asymptotic shape of the posterior, and instead require certain regularity conditi… ▽ More

    Submitted 20 June, 2020; v1 submitted 15 May, 2020; originally announced May 2020.

    Comments: 23 pages. Fixed minor technical glitches in the proof of Theorem 2 in the updated version

  27. arXiv:2001.09391  [pdf, other

    math.ST

    Mass-shifting phenomenon of truncated multivariate normal priors

    Authors: Shuang Zhou, Pallavi Ray, Debdeep Pati, Anirban Bhattacharya

    Abstract: We show that lower-dimensional marginal densities of dependent zero-mean normal distributions truncated to the positive orthant exhibit a mass-shifting phenomenon. Despite the truncated multivariate normal density having a mode at the origin, the marginal density assigns increasingly small mass near the origin as the dimension increases. The phenomenon accentuates with stronger correlation between… ▽ More

    Submitted 18 May, 2020; v1 submitted 25 January, 2020; originally announced January 2020.

    Comments: 32 pages, 12 figures

  28. arXiv:1912.05084  [pdf, other

    stat.ME

    Bayesian Copula Density Deconvolution for Zero-Inflated Data in Nutritional Epidemiology

    Authors: Abhra Sarkar, Debdeep Pati, Bani K. Mallick, Raymond J. Carroll

    Abstract: Estimating the marginal and joint densities of the long-term average intakes of different dietary components is an important problem in nutritional epidemiology. Since these variables cannot be directly measured, data are usually collected in the form of 24-hour recalls of the intakes, which show marked patterns of conditional heteroscedasticity. Significantly compounding the challenges, the recal… ▽ More

    Submitted 10 December, 2019; originally announced December 2019.

  29. arXiv:1910.06235  [pdf, other

    math.ST

    Gaussian Processes with Errors in Variables: Theory and Computation

    Authors: Shuang Zhou, Debdeep Pati, Tianying Wang, Yun Yang, Raymond J. Carroll

    Abstract: Covariate measurement error in nonparametric regression is a common problem in nutritional epidemiology and geostatistics, and other fields. Over the last two decades, this problem has received substantial attention in the frequentist literature. Bayesian approaches for handling measurement error have only been explored recently and are surprisingly successful, although the lack of a proper theore… ▽ More

    Submitted 26 January, 2023; v1 submitted 14 October, 2019; originally announced October 2019.

  30. arXiv:1910.03060  [pdf, other

    cs.DC cs.LG

    Impact of Inference Accelerators on hardware selection

    Authors: Dibyajyoti Pati, Caroline Favart, Purujit Bahl, Vivek Soni, Yun-chan Tsai, Michael Potter, Jiahui Guan, Xiaomeng Dong, V. Ratna Saripalli

    Abstract: As opportunities for AI-assisted healthcare grow steadily, model deployment faces challenges due to the specific characteristics of the industry. The configuration choice for a production device can impact model performance while influencing operational costs. Moreover, in healthcare some situations might require fast, but not real time, inference. We study different configurations and conduct a c… ▽ More

    Submitted 7 October, 2019; originally announced October 2019.

  31. arXiv:1910.02052  [pdf, other

    eess.SP cs.AI cs.LG

    AI Assisted Annotator using Reinforcement Learning

    Authors: V. Ratna Saripalli, Gopal Avinash, Dibyajyoti Pati, Michael Potter, Charles W. Anderson

    Abstract: Healthcare data suffers from both noise and lack of ground truth. The cost of data increases as it is cleaned and annotated in healthcare. Unlike other data sets, medical data annotation, which is critical to accurate ground truth, requires medical domain expertise for a better patient outcome. In this work, we report on the use of reinforcement learning to mimic the decision making process of ann… ▽ More

    Submitted 11 June, 2020; v1 submitted 2 October, 2019; originally announced October 2019.

    Comments: 10 pages

  32. arXiv:1902.04701  [pdf, other

    stat.ME

    Efficient Bayesian shape-restricted function estimation with constrained Gaussian process priors

    Authors: Pallavi Ray, Debdeep Pati, Anirban Bhattacharya

    Abstract: This article revisits the problem of Bayesian shape-restricted inference in the light of a recently developed approximate Gaussian process that admits an equivalent formulation of the shape constraints in terms of the basis coefficients. We propose a strategy to efficiently sample from the resulting constrained posterior by absorbing a smooth relaxation of the constraint in the likelihood and usin… ▽ More

    Submitted 12 February, 2019; originally announced February 2019.

  33. arXiv:1901.04134  [pdf, ps, other

    math.ST

    Bayesian Graph Selection Consistency Under Model Misspecification

    Authors: Yabo Niu, Debdeep Pati, Bani Mallick

    Abstract: Gaussian graphical models are a popular tool to learn the dependence structure in the form of a graph among variables of interest. Bayesian methods have gained in popularity in the last two decades due to their ability to simultaneously learn the covariance and the graph and characterize uncertainty in the selection. For scalability of the Markov chain Monte Carlo algorithms, decomposability is co… ▽ More

    Submitted 31 March, 2019; v1 submitted 14 January, 2019; originally announced January 2019.

    Comments: 43 pages

    MSC Class: 62F15 (Primary); 60K35 (Secondary)

  34. arXiv:1811.00724  [pdf, other

    stat.AP

    Bayesian Hierarchical Modeling on Covariance Valued Data

    Authors: Satwik Acharyya, Zhengwu Zhang, Anirban Bhattacharya, Debdeep Pati

    Abstract: Analysis of structural and functional connectivity (FC) of human brains is of pivotal importance for diagnosis of cognitive ability. The Human Connectome Project (HCP) provides an excellent source of neural data across different regions of interest (ROIs) of the living human brain. Individual specific data were available from an existing analysis (Dai et al., 2017) in the form of time varying cova… ▽ More

    Submitted 9 July, 2020; v1 submitted 1 November, 2018; originally announced November 2018.

    Comments: Some key references are missing in the old version which are corrected in this version

  35. arXiv:1808.05977  [pdf, other

    nucl-th nucl-ex stat.AP

    Revisiting the proton-radius problem using constrained Gaussian processes

    Authors: Shuang Zhou, P. Giuliani, J. Piekarewicz, Anirban Bhattacharya, Debdeep Pati

    Abstract: Background: The "proton radius puzzle" refers to an eight-year old problem that highlights major inconsistencies in the extraction of the charge radius of the proton from muonic Lamb-shift experiments as compared against experiments using elastic electron scattering. For the latter, the determination of the charge radius involves an extrapolation of the experimental form factor to zero momentum tr… ▽ More

    Submitted 17 August, 2018; originally announced August 2018.

    Journal ref: Phys. Rev. C 99, 055202 (2019)

  36. arXiv:1807.09155  [pdf, other

    stat.CO

    The Soft Multivariate Truncated Normal Distribution with Applications to Bayesian Constrained Estimation

    Authors: Allyson Souris, Anirban Bhattacharya, Debdeep Pati

    Abstract: We propose a new distribution, called the soft tMVN distribution, which provides a smooth approximation to the truncated multivariate normal (tMVN) distribution with linear constraints. An efficient blocked Gibbs sampler is developed to sample from the soft tMVN distribution in high dimensions. We provide theoretical support to the approximation capability of the soft tMVN and provide further empi… ▽ More

    Submitted 2 September, 2019; v1 submitted 24 July, 2018; originally announced July 2018.

    Comments: 23 Pages, 9 Figures

  37. arXiv:1804.01458  [pdf, other

    stat.ME

    Shape-Constrained Univariate Density Estimation

    Authors: Sutanoy Dasgupta, Debdeep Pati, Ian H. Jermyn, Anuj Srivastava

    Abstract: While the problem of estimating a probability density function (pdf) from its observations is classical, the estimation under additional shape constraints is both important and challenging. We introduce an efficient, geometric approach for estimating pdfs given the number of its modes. This approach explores the space of constrained pdf's using an action of the diffeomorphism group that preserves… ▽ More

    Submitted 4 April, 2018; originally announced April 2018.

    Comments: 31 pages, Initial version. Presented at IISA 2017, Shape Constrained Methods Workshp, BIRS,2018

  38. arXiv:1712.08983  [pdf, ps, other

    math.ST stat.ML

    On Statistical Optimality of Variational Bayes

    Authors: Debdeep Pati, Anirban Bhattacharya, Yun Yang

    Abstract: The article addresses a long-standing open problem on the justification of using variational Bayes methods for parameter estimation. We provide general conditions for obtaining optimal risk bounds for point estimates acquired from mean-field variational Bayesian inference. The conditions pertain to the existence of certain test functions for the distance metric on the parameter space and minimal a… ▽ More

    Submitted 24 December, 2017; originally announced December 2017.

    Comments: Accepted at AISTATS 2018

  39. arXiv:1710.03266  [pdf, other

    math.ST stat.CO stat.ME stat.ML

    $α$-Variational Inference with Statistical Guarantees

    Authors: Yun Yang, Debdeep Pati, Anirban Bhattacharya

    Abstract: We propose a family of variational approximations to Bayesian posterior distributions, called $α$-VB, with provable statistical guarantees. The standard variational approximation is a special case of $α$-VB with $α=1$. When $α\in(0,1]$, a novel class of variational inequalities are developed for linking the Bayes risk under the variational approximation to the objective function in the variational… ▽ More

    Submitted 7 February, 2018; v1 submitted 9 October, 2017; originally announced October 2017.

  40. arXiv:1708.04753  [pdf, other

    math.ST stat.CO stat.ML

    Frequentist coverage and sup-norm convergence rate in Gaussian process regression

    Authors: Yun Yang, Anirban Bhattacharya, Debdeep Pati

    Abstract: Gaussian process (GP) regression is a powerful interpolation technique due to its flexibility in capturing non-linearity. In this paper, we provide a general framework for understanding the frequentist coverage of point-wise and simultaneous Bayesian credible sets in GP regression. As an intermediate result, we develop a Bernstein von-Mises type result under supremum norm in random design GP regre… ▽ More

    Submitted 15 August, 2017; originally announced August 2017.

  41. arXiv:1704.00247  [pdf, other

    stat.ME

    Compressed Covariance Estimation With Automated Dimension Learning

    Authors: Gautam Sabnis, Debdeep Pati, Anirban Bhattacharya

    Abstract: We propose a method for estimating a covariance matrix that can be represented as a sum of a low-rank matrix and a diagonal matrix. The proposed method compresses high-dimensional data, computes the sample covariance in the compressed space, and lifts it back to the ambient space via a decompression operation. A salient feature of our approach relative to existing literature on combining sparsity… ▽ More

    Submitted 1 April, 2017; originally announced April 2017.

  42. arXiv:1701.07572  [pdf, ps, other

    math.ST

    Adaptive posterior convergence rates in non-linear latent variable models

    Authors: Shuang Zhou, Debdeep Pati, Anirban Bhattacharya, David Dunson

    Abstract: Non-linear latent variable models have become increasingly popular in a variety of applications. However, there has been little study on theoretical properties of these models. In this article, we study rates of posterior contraction in univariate density estimation for a class of non-linear latent variable models where unobserved U(0,1) latent variables are related to the response variables via a… ▽ More

    Submitted 25 January, 2017; originally announced January 2017.

    Comments: arXiv admin note: substantial text overlap with arXiv:1109.5000

  43. arXiv:1701.05656  [pdf, other

    stat.ME

    A Two-Step Geometric Framework For Density Modeling

    Authors: Sutanoy Dasgupta, Debdeep Pati, Anuj Srivastava

    Abstract: We introduce a novel two-step approach for estimating a probability density function (pdf) given its samples, with the second and important step coming from a geometric formulation. The procedure involves obtaining an initial estimate of the pdf and then transforming it via a warping function to reach the final estimate. The initial estimate is intended to be computationally fast, albeit suboptima… ▽ More

    Submitted 12 December, 2017; v1 submitted 19 January, 2017; originally announced January 2017.

    Comments: Submitted to a journal currently. Sections rewritten and arranged differently from previous version. Some errors in proofs and notations corrected. Section on Bivariate density estimation and Irrelevant predictors removed. Performance of conditional density estimation compared to a different package with different simulation examples

  44. arXiv:1701.00311  [pdf, ps, other

    math.ST stat.ME stat.ML

    Bayesian model selection consistency and oracle inequality with intractable marginal likelihood

    Authors: Yun Yang, Debdeep Pati

    Abstract: In this article, we investigate large sample properties of model selection procedures in a general Bayesian framework when a closed form expression of the marginal likelihood function is not available or a local asymptotic quadratic approximation of the log-likelihood function does not exist. Under appropriate identifiability assumptions on the true model, we provide sufficient conditions for a Ba… ▽ More

    Submitted 9 January, 2017; v1 submitted 1 January, 2017; originally announced January 2017.

  45. arXiv:1612.06040  [pdf, other

    stat.ME math.ST

    Monte Carlo goodness-of-fit tests for degree corrected and related stochastic blockmodels

    Authors: Vishesh Karwa, Debdeep Pati, Sonja Petrović, Liam Solus, Nikita Alexeev, Mateja Raič, Dane Wilburne, Robert Williams, Bowei Yan

    Abstract: We construct Bayesian and frequentist finite-sample goodness-of-fit tests for three different variants of the stochastic blockmodel for network data. Since all of the stochastic blockmodel variants are log-linear in form when block assignments are known, the tests for the \emph{latent} block model versions combine a block membership estimator with the algebraic statistics machinery for testing goo… ▽ More

    Submitted 6 March, 2024; v1 submitted 18 December, 2016; originally announced December 2016.

    Comments: substantial revision from v3, updated simulations and theoretical discussions

    MSC Class: 62R01; 05C82

    Journal ref: Journal of the Royal Statistical Society Series B: Statistical Methodology, Volume 86, Issue 1, February 2024, Pages 90-121

  46. arXiv:1612.02875  [pdf, other

    stat.ME

    A Divide and Conquer Strategy for High Dimensional Bayesian Factor Models

    Authors: Gautam Sabnis, Debdeep Pati, Barbara Engelhardt, Natesh Pillai

    Abstract: We propose a distributed computing framework, based on a divide and conquer strategy and hierarchical modeling, to accelerate posterior inference for high-dimensional Bayesian factor models. Our approach distributes the task of high-dimensional covariance matrix estimation to multiple cores, solves each subproblem separately via a latent factor model, and then combines these estimates to produce a… ▽ More

    Submitted 28 December, 2016; v1 submitted 8 December, 2016; originally announced December 2016.

  47. arXiv:1611.01125  [pdf, other

    math.ST

    Bayesian fractional posteriors

    Authors: Anirban Bhattacharya, Debdeep Pati, Yun Yang

    Abstract: We consider the fractional posterior distribution that is obtained by updating a prior distribution via Bayes theorem with a fractional likelihood function, a usual likelihood function raised to a fractional power. First, we analyze the contraction property of the fractional posterior in a general misspecified framework. Our contraction results only require a prior mass condition on certain Kullba… ▽ More

    Submitted 7 November, 2016; v1 submitted 3 November, 2016; originally announced November 2016.

    Comments: 35 pages

  48. arXiv:1607.02670  [pdf, other

    stat.ML

    Sparse additive Gaussian process with soft interactions

    Authors: Garret Vo, Debdeep Pati

    Abstract: Additive nonparametric regression models provide an attractive tool for variable selection in high dimensions when the relationship between the response and predictors is complex. They offer greater flexibility compared to parametric non-linear regression models and better interpretability and scalability than the non-parametric regression models. However, achieving sparsity simultaneously in the… ▽ More

    Submitted 9 July, 2016; originally announced July 2016.

    Comments: Submitted to Technometrics Journal

  49. arXiv:1605.05671  [pdf, ps, other

    math.ST stat.CO

    Sub-optimality of some continuous shrinkage priors

    Authors: Anirban Bhattacharya, David B. Dunson, Debdeep Pati, Natesh S. Pillai

    Abstract: Two-component mixture priors provide a traditional way to induce sparsity in high-dimensional Bayes models. However, several aspects of such a prior, including computational complexities in high-dimensions, interpretation of exact zeros and non-sparse posterior summaries under standard loss functions, has motivated an amazing variety of continuous shrinkage priors, which can be expressed as global… ▽ More

    Submitted 18 May, 2016; originally announced May 2016.

    Comments: Some of the results were announced in this earlier paper arXiv:1212.6088. To appear in Stochastic Processes and Applications, special issue in memoriam Prof. Evarist Gine

  50. arXiv:1602.09100  [pdf, other

    stat.ME math.ST

    Bayesian Variable Selection for Skewed Heteroscedastic Response

    Authors: Libo Wang, Yuanyuan Tang, Debajyoti Sinha, Debdeep Pati, Stuart Lipsitz

    Abstract: In this article, we propose new Bayesian methods for selecting and estimating a sparse coefficient vector for skewed heteroscedastic response. Our novel Bayesian procedures effectively estimate the median and other quantile functions, accommodate non-local prior for regression effects without compromising ease of implementation via sampling based tools, and asymptotically select the true set of pr… ▽ More

    Submitted 3 July, 2017; v1 submitted 29 February, 2016; originally announced February 2016.