Skip to main content

Showing 1–19 of 19 results for author: Wade, S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2506.16295  [pdf, ps, other

    stat.CO

    Understanding uncertainty in Bayesian cluster analysis

    Authors: Cecilia Balocchi, Sara Wade

    Abstract: The Bayesian approach to clustering is often appreciated for its ability to provide uncertainty in the partition structure. However, summarizing the posterior distribution over the clustering structure can be challenging, due the discrete, unordered nature and massive dimension of the space. While recent advancements provide a single clustering estimate to represent the posterior, this ignores unc… ▽ More

    Submitted 19 June, 2025; originally announced June 2025.

  2. arXiv:2503.11808  [pdf, ps, other

    cs.LG stat.ME stat.ML

    Understanding the Trade-offs in Accuracy and Uncertainty Quantification: Architecture and Inference Choices in Bayesian Neural Networks

    Authors: Alisa Sheinkman, Sara Wade

    Abstract: As modern neural networks get more complex, specifying a model with high predictive performance and sound uncertainty quantification becomes a more challenging task. Despite some promising theoretical results on the true posterior predictive distribution of Bayesian neural networks, the properties of even the most commonly used posterior approximations are often questioned. Computational burdens a… ▽ More

    Submitted 17 June, 2025; v1 submitted 14 March, 2025; originally announced March 2025.

    Comments: 24 pages

  3. arXiv:2411.11132  [pdf, ps, other

    stat.ML cs.LG math.ST stat.ME

    Variational Bayesian Bow tie Neural Networks with Shrinkage

    Authors: Alisa Sheinkman, Sara Wade

    Abstract: Despite the dominant role of deep models in machine learning, limitations persist, including overconfident predictions, susceptibility to adversarial attacks, and underestimation of variability in predictions. The Bayesian paradigm provides a natural framework to overcome such issues and has become the gold standard for uncertainty estimation with deep models, also providing improved accuracy and… ▽ More

    Submitted 17 June, 2025; v1 submitted 17 November, 2024; originally announced November 2024.

  4. arXiv:2407.02676  [pdf, other

    stat.ME

    Covariate-dependent hierarchical Dirichlet processes

    Authors: Huizi Zhang, Sara Wade, Natalia Bochkina

    Abstract: Bayesian hierarchical modelling is a natural framework to effectively integrate data and borrow information across groups. In this paper, we address problems related to density estimation and identifying clusters across related groups, by proposing a hierarchical Bayesian approach that incorporates additional covariate information. To achieve flexibility, our approach builds on ideas from Bayesian… ▽ More

    Submitted 17 April, 2025; v1 submitted 2 July, 2024; originally announced July 2024.

  5. arXiv:2307.16298  [pdf, other

    stat.ME

    Bayesian dependent mixture models: A predictive comparison and survey

    Authors: Sara Wade, Vanda Inacio, Sonia Petrone

    Abstract: For exchangeable data, mixture models are an extremely useful tool for density estimation due to their attractive balance between smoothness and flexibility. When additional covariate information is present, mixture models can be extended for flexible regression by modeling the mixture parameters, namely the weights and atoms, as functions of the covariates. These types of models are interpretable… ▽ More

    Submitted 30 July, 2023; originally announced July 2023.

  6. arXiv:2212.02505  [pdf, other

    q-bio.GN stat.ME

    Shared Differential Clustering across Single-cell RNA Sequencing Datasets with the Hierarchical Dirichlet Process

    Authors: Jinlu Liu, Sara Wade, Natalia Bochkina

    Abstract: Single-cell RNA sequencing (scRNA-seq) is powerful technology that allows researchers to understand gene expression patterns at the single-cell level. However, analysing scRNA-seq data is challenging due to issues and biases in data collection. In this work, we construct an integrated Bayesian model that simultaneously addresses normalization, imputation and batch effects and also nonparametricall… ▽ More

    Submitted 13 December, 2023; v1 submitted 5 December, 2022; originally announced December 2022.

  7. arXiv:2209.15321  [pdf, other

    stat.ML cs.LG

    Leveraging variational autoencoders for multiple data imputation

    Authors: Breeshey Roskams-Hieter, Jude Wells, Sara Wade

    Abstract: Missing data persists as a major barrier to data analysis across numerous applications. Recently, deep generative models have been used for imputation of missing data, motivated by their ability to capture highly non-linear and complex relationships in the data. In this work, we investigate the ability of deep models, namely variational autoencoders (VAEs), to account for uncertainty in missing da… ▽ More

    Submitted 30 September, 2022; originally announced September 2022.

    Comments: 17 pages, 3 main figures, 6 supplementary figures

  8. arXiv:2208.12830  [pdf, ps, other

    stat.ML cs.LG stat.CO

    Mixtures of Gaussian Process Experts with SMC$^2$

    Authors: Teemu Härkönen, Sara Wade, Kody Law, Lassi Roininen

    Abstract: Gaussian processes are a key component of many flexible statistical and machine learning models. However, they exhibit cubic computational complexity and high memory constraints due to the need of inverting and storing a full covariance matrix. To circumvent this, mixtures of Gaussian process experts have been considered where data points are assigned to independent experts, reducing the complexit… ▽ More

    Submitted 6 July, 2025; v1 submitted 26 August, 2022; originally announced August 2022.

  9. arXiv:2206.11051  [pdf, other

    stat.ME stat.AP

    Bayesian nonparametric scalar-on-image regression via Potts-Gibbs random partition models

    Authors: Mica Teo Shu Xian, Sara Wade

    Abstract: Scalar-on-image regression aims to investigate changes in a scalar response of interest based on high-dimensional imaging data. We propose a novel Bayesian nonparametric scalar-on-image regression model that utilises the spatial coordinates of the voxels to group voxels with similar effects on the response to have a common coefficient. We employ the Potts-Gibbs random partition model as the prior… ▽ More

    Submitted 22 June, 2022; originally announced June 2022.

  10. arXiv:2202.02923  [pdf, other

    stat.AP stat.CO

    Bayesian calibration of simulation models: A tutorial and an Australian smoking behaviour model

    Authors: Stephen Wade, Marianne F Weber, Peter Sarich, Pavla Vaneckova, Silvia Behar-Harpaz, Preston J Ngo, Sonya Cressman, Coral E Gartner, John M Murray, Tony A Blakely, Emily Banks, Martin C Tammemagi, Karen Canfell, Michael Caruana

    Abstract: Simulation models of epidemiological, biological, ecological, and environmental processes are increasingly being calibrated using Bayesian statistics. The Bayesian approach provides simple rules to synthesise multiple data sources and to calculate uncertainty in model output due to uncertainty in the calibration data. As the number of tutorials and studies published grow, the solutions to common d… ▽ More

    Submitted 7 March, 2022; v1 submitted 6 February, 2022; originally announced February 2022.

    Comments: 49 pages, 5 figures, 17 tables

    MSC Class: 62P20 (Primary) 62M09 (Secondary) ACM Class: G.3

  11. arXiv:2109.14171  [pdf, other

    stat.ML cs.LG stat.CO

    Non-stationary Gaussian process discriminant analysis with variable selection for high-dimensional functional data

    Authors: W Yu, S Wade, H D Bondell, L Azizi

    Abstract: High-dimensional classification and feature selection tasks are ubiquitous with the recent advancement in data acquisition technology. In several application areas such as biology, genomics and proteomics, the data are often functional in their nature and exhibit a degree of roughness and non-stationarity. These structures pose additional challenges to commonly used methods that rely mainly on a t… ▽ More

    Submitted 28 September, 2021; originally announced September 2021.

  12. arXiv:2103.03321  [pdf, other

    stat.CO stat.ML

    On MCMC for variationally sparse Gaussian processes: A pseudo-marginal approach

    Authors: Karla Monterrubio-Gómez, Sara Wade

    Abstract: Gaussian processes (GPs) are frequently used in machine learning and statistics to construct powerful models. However, when employing GPs in practice, important considerations must be made, regarding the high computational burden, approximation of the posterior, choice of the covariance function and inference of its hyperparmeters. To address these issues, Hensman et al. (2015) combine variational… ▽ More

    Submitted 4 March, 2021; originally announced March 2021.

  13. Fast Deep Mixtures of Gaussian Process Experts

    Authors: Clement Etienam, Kody Law, Sara Wade, Vitaly Zankin

    Abstract: Mixtures of experts have become an indispensable tool for flexible modelling in a supervised learning context, allowing not only the mean function but the entire density of the output to change with the inputs. Sparse Gaussian processes (GP) have shown promise as a leading candidate for the experts in such models, and in this article, we propose to design the gating network for selecting the exper… ▽ More

    Submitted 30 November, 2023; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: 22 pages, 28 figures, to be published in Machine Learning journal

    Journal ref: Machine Learning (2024)

  14. arXiv:1905.12969  [pdf, other

    stat.ML cs.LG

    Enriched Mixtures of Gaussian Process Experts

    Authors: Charles W. L. Gadd, Sara Wade, Alexis Boukouvalas

    Abstract: Mixtures of experts probabilistically divide the input space into regions, where the assumptions of each expert, or conditional model, need only hold locally. Combined with Gaussian process (GP) experts, this results in a powerful and highly flexible model. We focus on alternative mixtures of GP experts, which model the joint distribution of the inputs and targets explicitly. We highlight issues o… ▽ More

    Submitted 30 May, 2019; originally announced May 2019.

  15. arXiv:1905.07172  [pdf, other

    stat.AP

    Colombian Women's Life Patterns: A Multivariate Density Regression Approach

    Authors: Sara Wade, Raffaella Piccarreta, Andrea Cremaschi, Isadora Antoniano-Villalobos

    Abstract: Women in Colombia face difficulties related to the patriarchal traits of their societies and well-known conflict afflicting the country since 1948. In this critical context, our aim is to study the relationship between baseline socio-demographic factors and variables associated to fertility, partnership patterns, and work activity. To best exploit the explanatory structure, we propose a Bayesian m… ▽ More

    Submitted 20 January, 2021; v1 submitted 17 May, 2019; originally announced May 2019.

    Comments: to appear in Bayesian analysis

  16. arXiv:1804.01431  [pdf, other

    stat.CO

    Posterior Inference for Sparse Hierarchical Non-stationary Models

    Authors: Karla Monterrubio-Gómez, Lassi Roininen, Sara Wade, Theo Damoulas, Mark Girolami

    Abstract: Gaussian processes are valuable tools for non-parametric modelling, where typically an assumption of stationarity is employed. While removing this assumption can improve prediction, fitting such models is challenging. In this work, hierarchical models are constructed based on Gaussian Markov random fields with stochastic spatially varying parameters. Importantly, this allows for non-stationarity w… ▽ More

    Submitted 1 May, 2019; v1 submitted 4 April, 2018; originally announced April 2018.

  17. arXiv:1803.10746  [pdf, other

    stat.ML cs.LG

    Pseudo-marginal Bayesian inference for supervised Gaussian process latent variable models

    Authors: Charles Gadd, Sara Wade, Akeel Shah, Dimitris Grammatopoulos

    Abstract: We introduce a Bayesian framework for inference with a supervised version of the Gaussian process latent variable model. The framework overcomes the high correlations between latent variables and hyperparameters by using an unbiased pseudo estimate for the marginal likelihood that approximately integrates over the latent variables. This is used to construct a Markov Chain to explore the posterior… ▽ More

    Submitted 28 March, 2018; originally announced March 2018.

    Comments: 9 pages, 2 figures, working paper

  18. arXiv:1706.02480  [pdf, other

    stat.ML cs.LG

    Forward Thinking: Building and Training Neural Networks One Layer at a Time

    Authors: Chris Hettinger, Tanner Christensen, Ben Ehlert, Jeffrey Humpherys, Tyler Jarvis, Sean Wade

    Abstract: We present a general framework for training deep neural networks without backpropagation. This substantially decreases training time and also allows for construction of deep networks with many sorts of learners, including networks whose layers are defined by functions that are not easily differentiated, like decision trees. The main idea is that layers can be trained one at a time, and once they a… ▽ More

    Submitted 8 June, 2017; originally announced June 2017.

  19. Bayesian cluster analysis: Point estimation and credible balls

    Authors: Sara Wade, Zoubin Ghahramani

    Abstract: Clustering is widely studied in statistics and machine learning, with applications in a variety of fields. As opposed to classical algorithms which return a single clustering solution, Bayesian nonparametric models provide a posterior over the entire space of partitions, allowing one to assess statistical properties, such as uncertainty on the number of clusters. However, an important problem is h… ▽ More

    Submitted 8 February, 2019; v1 submitted 13 May, 2015; originally announced May 2015.

    Journal ref: Bayesian Anal., Volume 13, Number 2 (2018), 559-626