Sublinear Variational Optimization of Gaussian Mixture Models with Millions to Billions of Parameters

Salwig, Sebastian; Kahlke, Till; Hirschberger, Florian; Forster, Dennis; Lücke, Jörg

Statistics > Machine Learning

arXiv:2501.12299 (stat)

[Submitted on 21 Jan 2025]

Title:Sublinear Variational Optimization of Gaussian Mixture Models with Millions to Billions of Parameters

Authors:Sebastian Salwig, Till Kahlke, Florian Hirschberger, Dennis Forster, Jörg Lücke

View PDF HTML (experimental)

Abstract:Gaussian Mixture Models (GMMs) range among the most frequently used machine learning models. However, training large, general GMMs becomes computationally prohibitive for datasets with many data points $N$ of high-dimensionality $D$. For GMMs with arbitrary covariances, we here derive a highly efficient variational approximation, which is integrated with mixtures of factor analyzers (MFAs). For GMMs with $C$ components, our proposed algorithm significantly reduces runtime complexity per iteration from $\mathcal{O}(NCD^2)$ to a complexity scaling linearly with $D$ and remaining constant w.r.t. $C$. Numerical validation of this theoretical complexity reduction then shows the following: the distance evaluations required for the entire GMM optimization process scale sublinearly with $NC$. On large-scale benchmarks, this sublinearity results in speed-ups of an order-of-magnitude compared to the state-of-the-art. As a proof of concept, we train GMMs with over 10 billion parameters on about 100 million images, and observe training times of approximately nine hours on a single state-of-the-art CPU.

Comments:	22 pages, 6 figures (and 17 pages, 3 figures in Appendix)
Subjects:	Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2501.12299 [stat.ML]
	(or arXiv:2501.12299v1 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2501.12299

Submission history

From: Till Kahlke [view email]
[v1] Tue, 21 Jan 2025 17:11:25 UTC (2,085 KB)

Statistics > Machine Learning

Title:Sublinear Variational Optimization of Gaussian Mixture Models with Millions to Billions of Parameters

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Sublinear Variational Optimization of Gaussian Mixture Models with Millions to Billions of Parameters

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators