Skip to main content

Showing 1–11 of 11 results for author: Sudderth, E B

Searching in archive stat. Search in all archives.
.
  1. arXiv:2406.17131  [pdf, other

    stat.ME cs.LG stat.AP

    Bayesian temporal biclustering with applications to multi-subject neuroscience studies

    Authors: Federica Zoe Ricci, Erik B. Sudderth, Jaylen Lee, Megan A. K. Peters, Marina Vannucci, Michele Guindani

    Abstract: We consider the problem of analyzing multivariate time series collected on multiple subjects, with the goal of identifying groups of subjects exhibiting similar trends in their recorded measurements over time as well as time-varying groups of associated measurements. To this end, we propose a Bayesian model for temporal biclustering featuring nested partitions, where a time-invariant partition of… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  2. arXiv:2306.08230  [pdf, other

    cs.LG stat.ML

    Unbiased Learning of Deep Generative Models with Structured Discrete Representations

    Authors: Harry Bendekgey, Gabriel Hope, Erik B. Sudderth

    Abstract: By composing graphical models with deep learning architectures, we learn generative models with the strengths of both frameworks. The structured variational autoencoder (SVAE) inherits structure and interpretability from graphical models, and flexible likelihoods for high-dimensional data from deep learning, but poses substantial optimization challenges. We propose novel algorithms for learning SV… ▽ More

    Submitted 14 November, 2023; v1 submitted 13 June, 2023; originally announced June 2023.

    Comments: 38 pages, 7 figures

  3. arXiv:2012.06718  [pdf, other

    cs.LG cs.CV stat.ML

    Learning Consistent Deep Generative Models from Sparse Data via Prediction Constraints

    Authors: Gabriel Hope, Madina Abdrakhmanova, Xiaoyin Chen, Michael C. Hughes, Michael C. Hughes, Erik B. Sudderth

    Abstract: We develop a new framework for learning variational autoencoders and other deep generative models that balances generative and discriminative goals. Our framework optimizes model parameters to maximize a variational lower bound on the likelihood of observed data, subject to a task-specific prediction constraint that prevents model misspecification from leading to inaccurate predictions. We further… ▽ More

    Submitted 11 December, 2020; originally announced December 2020.

  4. arXiv:1712.00499  [pdf, other

    cs.LG stat.ML

    Prediction-Constrained Topic Models for Antidepressant Recommendation

    Authors: Michael C. Hughes, Gabriel Hope, Leah Weiner, Thomas H. McCoy, Roy H. Perlis, Erik B. Sudderth, Finale Doshi-Velez

    Abstract: Supervisory signals can help topic models discover low-dimensional data representations that are more interpretable for clinical tasks. We propose a framework for training supervised latent Dirichlet allocation that balances two goals: faithful generative explanations of high-dimensional data and accurate prediction of associated class labels. Existing approaches fail to balance these goals by not… ▽ More

    Submitted 1 December, 2017; originally announced December 2017.

    Comments: Accepted poster at NIPS 2017 Workshop on Machine Learning for Health (https://ml4health.github.io/2017/)

  5. arXiv:1711.03946  [pdf, other

    cs.CL cs.LG stat.ML

    Bayesian Paragraph Vectors

    Authors: Geng Ji, Robert Bamler, Erik B. Sudderth, Stephan Mandt

    Abstract: Word2vec (Mikolov et al., 2013) has proven to be successful in natural language processing by capturing the semantic relationships between different words. Built on top of single-word embeddings, paragraph vectors (Le and Mikolov, 2014) find fixed-length representations for pieces of text with arbitrary lengths, such as documents, paragraphs, and sentences. In this work, we propose a novel interpr… ▽ More

    Submitted 7 December, 2017; v1 submitted 10 November, 2017; originally announced November 2017.

    Comments: Presented at the NIPS 2017 workshop "Advances in Approximate Bayesian Inference"

  6. arXiv:1707.07341  [pdf, other

    stat.ML cs.AI cs.LG

    Prediction-Constrained Training for Semi-Supervised Mixture and Topic Models

    Authors: Michael C. Hughes, Leah Weiner, Gabriel Hope, Thomas H. McCoy Jr., Roy H. Perlis, Erik B. Sudderth, Finale Doshi-Velez

    Abstract: Supervisory signals have the potential to make low-dimensional data representations, like those learned by mixture and topic models, more interpretable and useful. We propose a framework for training latent variable models that explicitly balances two goals: recovery of faithful generative explanations of high-dimensional data, and accurate prediction of associated semantic labels. Existing approa… ▽ More

    Submitted 23 July, 2017; originally announced July 2017.

  7. arXiv:1609.07521  [pdf, ps, other

    stat.ML cs.AI cs.LG

    Fast Learning of Clusters and Topics via Sparse Posteriors

    Authors: Michael C. Hughes, Erik B. Sudderth

    Abstract: Mixture models and topic models generate each observation from a single cluster, but standard variational posteriors for each observation assign positive probability to all possible clusters. This requires dense storage and runtime costs that scale with the total number of clusters, even though typically only a few clusters have significant posterior mass for any data point. We propose a constrain… ▽ More

    Submitted 23 September, 2016; originally announced September 2016.

  8. arXiv:1308.4747  [pdf, ps, other

    stat.ME stat.ML

    Joint modeling of multiple time series via the beta process with application to motion capture segmentation

    Authors: Emily B. Fox, Michael C. Hughes, Erik B. Sudderth, Michael I. Jordan

    Abstract: We propose a Bayesian nonparametric approach to the problem of jointly modeling multiple related time series. Our model discovers a latent set of dynamical behaviors shared among the sequences, and segments each time series into regions defined by a subset of these behaviors. Using a beta process prior, the size of the behavior set and the sharing pattern are both inferred from data. We develop Ma… ▽ More

    Submitted 13 November, 2014; v1 submitted 21 August, 2013; originally announced August 2013.

    Comments: Published in at http://dx.doi.org/10.1214/14-AOAS742 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org). arXiv admin note: text overlap with arXiv:1111.4226

    Report number: IMS-AOAS-AOAS742

    Journal ref: Annals of Applied Statistics 2014, Vol. 8, No. 3, 1281-1313

  9. arXiv:1111.4226  [pdf, other

    stat.ME stat.ML

    Joint Modeling of Multiple Related Time Series via the Beta Process

    Authors: Emily B. Fox, Erik B. Sudderth, Michael I. Jordan, Alan S. Willsky

    Abstract: We propose a Bayesian nonparametric approach to the problem of jointly modeling multiple related time series. Our approach is based on the discovery of a set of latent, shared dynamical behaviors. Using a beta process prior, the size of the set and the sharing pattern are both inferred from data. We develop efficient Markov chain Monte Carlo methods based on the Indian buffet process representatio… ▽ More

    Submitted 17 November, 2011; originally announced November 2011.

    Comments: 33 pages, 8 figures

  10. Bayesian Nonparametric Inference of Switching Linear Dynamical Systems

    Authors: Emily B. Fox, Erik B. Sudderth, Michael I. Jordan, Alan S. Willsky

    Abstract: Many complex dynamical phenomena can be effectively modeled by a system that switches among a set of conditionally linear dynamical modes. We consider two such models: the switching linear dynamical system (SLDS) and the switching vector autoregressive (VAR) process. Our Bayesian nonparametric approach utilizes a hierarchical Dirichlet process prior to learn an unknown number of persistent, smoot… ▽ More

    Submitted 19 March, 2010; originally announced March 2010.

    Comments: 50 pages, 7 figures

  11. arXiv:0905.2592  [pdf, ps, other

    stat.ME stat.AP stat.ML

    A sticky HDP-HMM with application to speaker diarization

    Authors: Emily B. Fox, Erik B. Sudderth, Michael I. Jordan, Alan S. Willsky

    Abstract: We consider the problem of speaker diarization, the problem of segmenting an audio recording of a meeting into temporal segments corresponding to individual speakers. The problem is rendered particularly difficult by the fact that we are not allowed to assume knowledge of the number of people participating in the meeting. To address this problem, we take a Bayesian nonparametric approach to speake… ▽ More

    Submitted 16 August, 2011; v1 submitted 15 May, 2009; originally announced May 2009.

    Comments: Published in at http://dx.doi.org/10.1214/10-AOAS395 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS395

    Journal ref: Annals of Applied Statistics 2011, Vol. 5, No. 2A, 1020-1056