Skip to main content

Showing 1–4 of 4 results for author: Terhorst, J

Searching in archive stat. Search in all archives.
.
  1. arXiv:2403.01684  [pdf, other

    stat.ME stat.ML

    Dendrogram of mixing measures: Hierarchical clustering and model selection for finite mixture models

    Authors: Dat Do, Linh Do, Scott A. McKinley, Jonathan Terhorst, XuanLong Nguyen

    Abstract: We present a new way to summarize and select mixture models via the hierarchical clustering tree (dendrogram) constructed from an overfitted latent mixing measure. Our proposed method bridges agglomerative hierarchical clustering and mixture modeling. The dendrogram's construction is derived from the theory of convergence of the mixing measures, and as a result, we can both consistently select the… ▽ More

    Submitted 8 March, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

    Comments: 53 pages, 11 figures

  2. arXiv:2111.10841  [pdf, other

    stat.ME

    A linear adjustment based approach to posterior drift in transfer learning

    Authors: Subha Maity, Diptavo Dutta, Jonathan Terhorst, Yuekai Sun, Moulinath Banerjee

    Abstract: We present a new model and methods for the posterior drift problem where the regression function in the target domain is modeled as a linear adjustment (on an appropriate scale) of that in the source domain, an idea that inherits the simplicity and the usefulness of generalized linear models and accelerated failure time models from the classical statistics literature, and study the theoretical pro… ▽ More

    Submitted 12 December, 2021; v1 submitted 21 November, 2021; originally announced November 2021.

  3. arXiv:2003.01640  [pdf, other

    cs.LG stat.ML

    Explaining Groups of Points in Low-Dimensional Representations

    Authors: Gregory Plumb, Jonathan Terhorst, Sriram Sankararaman, Ameet Talwalkar

    Abstract: A common workflow in data exploration is to learn a low-dimensional representation of the data, identify groups of points in that representation, and examine the differences between the groups to determine what they represent. We treat this workflow as an interpretable machine learning problem by leveraging the model that learned the low-dimensional representation to help identify the key differen… ▽ More

    Submitted 14 August, 2020; v1 submitted 3 March, 2020; originally announced March 2020.

  4. arXiv:1409.1458  [pdf, ps, other

    cs.LG math.OC stat.ML

    Communication-Efficient Distributed Dual Coordinate Ascent

    Authors: Martin Jaggi, Virginia Smith, Martin Takáč, Jonathan Terhorst, Sanjay Krishnan, Thomas Hofmann, Michael I. Jordan

    Abstract: Communication remains the most significant bottleneck in the performance of distributed optimization algorithms for large-scale machine learning. In this paper, we propose a communication-efficient framework, CoCoA, that uses local computation in a primal-dual setting to dramatically reduce the amount of necessary communication. We provide a strong convergence rate analysis for this class of algor… ▽ More

    Submitted 29 September, 2014; v1 submitted 4 September, 2014; originally announced September 2014.

    Comments: NIPS 2014 version, including proofs. Published in Advances in Neural Information Processing Systems 27 (NIPS 2014)

    MSC Class: 90C25; 68W15 ACM Class: G.1.6; C.1.4