Skip to main content

Showing 1–18 of 18 results for author: Tokdar, S T

Searching in archive stat. Search in all archives.
.
  1. arXiv:2502.11332  [pdf, other

    stat.ME stat.AP

    Stochastic Block Covariance Matrix Estimation

    Authors: Yunran Chen, Surya T Tokdar, Jennifer M Groh

    Abstract: Motivated by a neuroscience application we study the problem of statistical estimation of a high-dimensional covariance matrix with a block structure. The block model embeds a structural assumption: the population of items (neurons) can be divided into latent sub-populations with shared associative covariation within blocks and shared associative or dis-associative covariation across blocks. Unlik… ▽ More

    Submitted 27 February, 2025; v1 submitted 16 February, 2025; originally announced February 2025.

  2. arXiv:2502.00126  [pdf, other

    stat.ME math.ST

    A Bayesian decision-theoretic approach to sparse estimation

    Authors: Aihua Li, Surya T. Tokdar, Jason Xu

    Abstract: We extend the work of Hahn and Carvalho (2015) and develop a doubly-regularized sparse regression estimator by synthesizing Bayesian regularization with penalized least squares within a decision-theoretic framework. In contrast to existing Bayesian decision-theoretic formulation chiefly reliant upon the symmetric 0-1 loss, the new method -- which we call Bayesian Decoupling -- employs a family of… ▽ More

    Submitted 31 January, 2025; originally announced February 2025.

    Comments: Submitted to Biometrika

  3. arXiv:2410.00781  [pdf, other

    stat.ME

    Modeling Neural Switching via Drift-Diffusion Models

    Authors: Nicholas Marco, Jennifer M. Groh, Surya T. Tokdar

    Abstract: Neural encoding is a field in neuroscience that focuses on characterizing how information from stimuli is encoded in the spiking activity of neurons. When more than one stimulus is present, a theory known as multiplexing posits that neurons temporally switch between encoding various stimuli, creating a fluctuating firing pattern. Here, we propose a new statistical framework to analyze rate fluctua… ▽ More

    Submitted 11 March, 2025; v1 submitted 1 October, 2024; originally announced October 2024.

  4. Heavy-Tailed Density Estimation

    Authors: Surya T Tokdar, Sheng Jiang, Erika L Cunningham

    Abstract: A novel statistical method is proposed and investigated for estimating a heavy tailed density under mild smoothness assumptions. Statistical analyses of heavy-tailed distributions are susceptible to the problem of sparse information in the tail of the distribution getting washed away by unrelated features of a hefty bulk. The proposed Bayesian method avoids this problem by incorporating smoothness… ▽ More

    Submitted 16 November, 2022; originally announced November 2022.

    Comments: Combined article with all technical details uploaded here to complement JASA publication

    MSC Class: 62G

  5. arXiv:1912.05738  [pdf, ps, other

    math.ST stat.ME

    Variable Selection Consistency of Gaussian Process Regression

    Authors: Sheng Jiang, Surya T. Tokdar

    Abstract: Bayesian nonparametric regression under a rescaled Gaussian process prior offers smoothness-adaptive function estimation with near minimax-optimal error rates. Hierarchical extensions of this approach, equipped with stochastic variable selection, are known to also adapt to the unknown intrinsic dimension of a sparse true regression function. But it remains unclear if such extensions offer variable… ▽ More

    Submitted 11 December, 2020; v1 submitted 11 December, 2019; originally announced December 2019.

    MSC Class: 62G08; 62G20

  6. arXiv:1911.04387  [pdf, other

    stat.AP

    Analyzing second order stochasticity of neural spiking under stimuli-bundle exposure

    Authors: Chris Glynn, Surya T Tokdar, Azeem Zaman, Valeria C Caruso, Jeffrey T Mohl, Shawn M Willett, Jennifer M Groh

    Abstract: Conventional analysis of neuroscience data involves computing average neural activity over a group of trials and/or a period of time. This approach may be particularly problematic when assessing the response patterns of neurons to more than one simultaneously presented stimulus. In such cases, the brain must represent each individual component of the stimuli bundle, but trial-and-time-pooled avera… ▽ More

    Submitted 11 November, 2019; originally announced November 2019.

    Comments: 26 pages, 7 figures

  7. arXiv:1910.13119  [pdf, other

    stat.ME

    Joint Quantile Regression for Spatial Data

    Authors: Xu Chen, Surya T. Tokdar

    Abstract: Linear quantile regression is a powerful tool to investigate how predictors may affect a response heterogeneously across different quantile levels. Unfortunately, existing approaches find it extremely difficult to adjust for any dependency between observation units, largely because such methods are not based upon a fully generative model of the data. For analyzing spatially indexed data, we addres… ▽ More

    Submitted 29 October, 2019; originally announced October 2019.

    Comments: 30 pages, 10 figures

  8. arXiv:1809.04347  [pdf, other

    stat.ME

    High-dimensional Bayesian Fourier Analysis For Detecting Circadian Gene Expressions

    Authors: Silvia Montagna, Irina Irincheeva, Surya T. Tokdar

    Abstract: In genomic applications, there is often interest in identifying genes whose time-course expression trajectories exhibit periodic oscillations with a period of approximately 24 hours. Such genes are usually referred to as circadian, and their identification is a crucial step toward discovering physiological processes that are clock-controlled. It is natural to expect that the expression of gene i a… ▽ More

    Submitted 27 February, 2024; v1 submitted 12 September, 2018; originally announced September 2018.

  9. arXiv:1611.09790  [pdf, other

    stat.ME stat.CO

    Paired-move multiple-try stochastic search for Bayesian variable selection

    Authors: Xu Chen, Shaan Qamar, Surya T. Tokdar

    Abstract: Variable selection is a key issue when analyzing high-dimensional data. The explosion of data with large sample sizes and dimensionality brings new challenges to this problem in both inference accuracy and computational complexity. To alleviate these problems, we propose a new scalable Markov chain Monte Carlo (MCMC) sampling algorithm for "large $p$ small $n$" scenarios by generalizing multiple-t… ▽ More

    Submitted 29 November, 2016; originally announced November 2016.

    Comments: 28 pages; 5 figures; 5 tables

  10. arXiv:1511.03947  [pdf, other

    stat.ML cs.LG stat.ME

    Bayesian Analysis of Dynamic Linear Topic Models

    Authors: Chris Glynn, Surya T. Tokdar, David L. Banks, Brian Howard

    Abstract: In dynamic topic modeling, the proportional contribution of a topic to a document depends on the temporal dynamics of that topic's overall prevalence in the corpus. We extend the Dynamic Topic Model of Blei and Lafferty (2006) by explicitly modeling document level topic proportions with covariates and dynamic structure that includes polynomial trends and periodicity. A Markov Chain Monte Carlo (MC… ▽ More

    Submitted 12 November, 2015; originally announced November 2015.

  11. arXiv:1411.7009  [pdf, other

    stat.ME stat.CO

    Additive Gaussian Process Regression

    Authors: Shaan Qamar, Surya T. Tokdar

    Abstract: Additive-interactive regression has recently been shown to offer attractive minimax error rates over traditional nonparametric multivariate regression in a wide variety of settings, including cases where the predictor count is much larger than the sample size and many of the predictors have important effects on the response, potentially through complex interactions. We present a Bayesian implement… ▽ More

    Submitted 25 November, 2014; originally announced November 2014.

    Comments: 28 pages; 9 figures; 5 tables

  12. arXiv:1308.4756  [pdf, ps, other

    stat.ME

    Computer emulation with non-stationary Gaussian processes

    Authors: Silvia Montagna, Surya T. Tokdar

    Abstract: Gaussian process (GP) models are widely used to emulate propagation uncertainty in computer experiments. GP emulation sits comfortably within an analytically tractable Bayesian framework. Apart from propagating uncertainty of the input variables, a GP emulator trained on finitely many runs of the experiment also offers error bars for response surface estimates at unseen input values. This helps se… ▽ More

    Submitted 29 January, 2015; v1 submitted 21 August, 2013; originally announced August 2013.

  13. arXiv:1112.0716  [pdf, ps, other

    math.ST stat.ME stat.ML

    Dimension adaptability of Gaussian process models with variable selection and projection

    Authors: Surya T. Tokdar

    Abstract: It is now known that an extended Gaussian process model equipped with rescaling can adapt to different smoothness levels of a function valued parameter in many nonparametric Bayesian analyses, offering a posterior convergence rate that is optimal (up to logarithmic factors) for the smoothness class the true function belongs to. This optimal rate also depends on the dimension of the function's doma… ▽ More

    Submitted 3 December, 2011; originally announced December 2011.

    Comments: 14 pages

    MSC Class: 62G07; 62G08; 62G20

  14. arXiv:1111.4148  [pdf, ps, other

    math.ST stat.ME

    Adaptive Convergence Rates of a Dirichlet Process Mixture of Multivariate Normals

    Authors: Surya T. Tokdar

    Abstract: It is shown that a simple Dirichlet process mixture of multivariate normals offers Bayesian density estimation with adaptive posterior convergence rates. Toward this, a novel sieve for non-parametric mixture densities is explored, and its rate adaptability to various smoothness classes of densities in arbitrary dimension is demonstrated. This sieve construction is expected to offer a substantial t… ▽ More

    Submitted 17 November, 2011; originally announced November 2011.

    Comments: 12 pages

  15. arXiv:1108.2883  [pdf, ps, other

    math.ST stat.CO stat.ME

    Bayesian test of normality versus a Dirichlet process mixture alternative

    Authors: Surya T. Tokdar, Ryan Martin

    Abstract: We propose a Bayesian test of normality for univariate or multivariate data against alternative nonparametric models characterized by Dirichlet process mixture distributions. The alternative models are based on the principles of embedding and predictive matching. They can be interpreted to offer random granulation of a normal distribution into a mixture of normals with mixture components occupying… ▽ More

    Submitted 14 November, 2019; v1 submitted 14 August, 2011; originally announced August 2011.

    Comments: 24 pages, 5 figures, 1 table

    Journal ref: Sankhya B, volume 83, pages 66--96, 2021

  16. arXiv:1108.0445  [pdf, other

    stat.CO stat.ME stat.ML

    Adaptive Gaussian Predictive Process Approximation

    Authors: Surya T Tokdar

    Abstract: We address the issue of knots selection for Gaussian predictive process methodology. Predictive process approximation provides an effective solution to the cubic order computational complexity of Gaussian process models. This approximation crucially depends on a set of points, called knots, at which the original process is retained, while the rest is approximated via a deterministic extrapolation.… ▽ More

    Submitted 1 August, 2011; originally announced August 2011.

    Comments: 20 pages, 5 figures

  17. A nonparametric empirical Bayes framework for large-scale multiple testing

    Authors: Ryan Martin, Surya T. Tokdar

    Abstract: We propose a flexible and identifiable version of the two-groups model, motivated by hierarchical Bayes considerations, that features an empirical null and a semiparametric mixture model for the non-null cases. We use a computationally efficient predictive recursion marginal likelihood procedure to estimate the model parameters, even the nonparametric mixing distribution. This leads to a nonparame… ▽ More

    Submitted 1 October, 2011; v1 submitted 20 June, 2011; originally announced June 2011.

    Comments: 18 pages, 4 figures, 3 tables

    Journal ref: Biostatistics 13(3):427-439, 2012

  18. arXiv:1106.3352  [pdf, ps, other

    stat.ME math.ST

    Semiparametric inference in mixture models with predictive recursion marginal likelihood

    Authors: Ryan Martin, Surya T. Tokdar

    Abstract: Predictive recursion is an accurate and computationally efficient algorithm for nonparametric estimation of mixing densities in mixture models. In semiparametric mixture models, however, the algorithm fails to account for any uncertainty in the additional unknown structural parameter. As an alternative to existing profile likelihood methods, we treat predictive recursion as a filter approximation… ▽ More

    Submitted 16 June, 2011; originally announced June 2011.

    Journal ref: Biometrika, 98(3), 567-582, 2011