Skip to main content

Showing 1–14 of 14 results for author: Høgsgaard, M M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.14673  [pdf, ps, other

    stat.ML cs.LG

    Uniform Mean Estimation for Heavy-Tailed Distributions via Median-of-Means

    Authors: Mikael Møller Høgsgaard, Andrea Paudice

    Abstract: The Median of Means (MoM) is a mean estimator that has gained popularity in the context of heavy-tailed data. In this work, we analyze its performance in the task of simultaneously estimating the mean of each function in a class $\mathcal{F}$ when the data distribution possesses only the first $p$ moments for $p \in (1,2]$. We prove a new sample complexity bound using a novel symmetrization techni… ▽ More

    Submitted 19 June, 2025; v1 submitted 17 June, 2025; originally announced June 2025.

  2. arXiv:2503.09384  [pdf, ps, other

    cs.LG

    Revisiting Agnostic Boosting

    Authors: Arthur da Cunha, Mikael Møller Høgsgaard, Andrea Paudice, Yuxin Sun

    Abstract: Boosting is a key method in statistical learning, allowing for converting weak learners into strong ones. While well studied in the realizable case, the statistical properties of weak-to-strong learning remains less understood in the agnostic setting, where there are no assumptions on the distribution of the labels. In this work, we propose a new agnostic boosting algorithm with substantially impr… ▽ More

    Submitted 12 March, 2025; originally announced March 2025.

  3. arXiv:2502.16462  [pdf, ps, other

    cs.LG cs.DS math.ST stat.ML

    Improved Margin Generalization Bounds for Voting Classifiers

    Authors: Mikael Møller Høgsgaard, Kasper Green Larsen

    Abstract: In this paper we establish a new margin-based generalization bound for voting classifiers, refining existing results and yielding tighter generalization guarantees for widely used boosting algorithms such as AdaBoost (Freund and Schapire, 1997). Furthermore, the new margin-based generalization bound enables the derivation of an optimal weak-to-strong learner: a Majority-of-3 large-margin classifie… ▽ More

    Submitted 3 June, 2025; v1 submitted 23 February, 2025; originally announced February 2025.

  4. arXiv:2502.09496  [pdf, ps, other

    cs.LG stat.ML

    On Agnostic PAC Learning in the Small Error Regime

    Authors: Julian Asilis, Mikael Møller Høgsgaard, Grigoris Velegkas

    Abstract: Binary classification in the classic PAC model exhibits a curious phenomenon: Empirical Risk Minimization (ERM) learners are suboptimal in the realizable case yet optimal in the agnostic case. Roughly speaking, this owes itself to the fact that non-realizable distributions $\mathcal{D}$ are simply more difficult to learn than realizable distributions -- even when one discounts a learner's error by… ▽ More

    Submitted 13 February, 2025; originally announced February 2025.

    Comments: 44 pages

  5. arXiv:2502.03620  [pdf, other

    cs.LG

    Efficient Optimal PAC Learning

    Authors: Mikael Møller Høgsgaard

    Abstract: Recent advances in the binary classification setting by Hanneke [2016b] and Larsen [2023] have resulted in optimal PAC learners. These learners leverage, respectively, a clever deterministic subsampling scheme and the classic heuristic of bagging Breiman [1996]. Both optimal PAC learners use, as a subroutine, the natural algorithm of empirical risk minimization. Consequently, the computational cos… ▽ More

    Submitted 7 February, 2025; v1 submitted 5 February, 2025; originally announced February 2025.

  6. arXiv:2410.22749  [pdf, ps, other

    cs.LG math.ST stat.ML

    Understanding Aggregations of Proper Learners in Multiclass Classification

    Authors: Julian Asilis, Mikael Møller Høgsgaard, Grigoris Velegkas

    Abstract: Multiclass learnability is known to exhibit a properness barrier: there are learnable classes which cannot be learned by any proper learner. Binary classification faces no such barrier for learnability, but a similar one for optimal learning, which can in general only be achieved by improper learners. Fortunately, recent advances in binary classification have demonstrated that this requirement can… ▽ More

    Submitted 30 October, 2024; originally announced October 2024.

    Comments: 23 pages

  7. arXiv:2408.17148  [pdf, other

    cs.LG cs.DS

    The Many Faces of Optimal Weak-to-Strong Learning

    Authors: Mikael Møller Høgsgaard, Kasper Green Larsen, Markus Engelund Mathiasen

    Abstract: Boosting is an extremely successful idea, allowing one to combine multiple low accuracy classifiers into a much more accurate voting classifier. In this work, we present a new and surprisingly simple Boosting algorithm that obtains a provably optimal sample complexity. Sample optimal Boosting algorithms have only recently been developed, and our new algorithm has the fastest runtime among all such… ▽ More

    Submitted 30 August, 2024; originally announced August 2024.

  8. arXiv:2408.16653  [pdf, other

    cs.LG

    Optimal Parallelization of Boosting

    Authors: Arthur da Cunha, Mikael Møller Høgsgaard, Kasper Green Larsen

    Abstract: Recent works on the parallel complexity of Boosting have established strong lower bounds on the tradeoff between the number of training rounds $p$ and the total parallel work per round $t$. These works have also presented highly non-trivial parallel algorithms that shed light on different regions of this tradeoff. Despite these advancements, a significant gap persists between the theoretical lower… ▽ More

    Submitted 29 August, 2024; originally announced August 2024.

  9. arXiv:2403.08831  [pdf, ps, other

    stat.ML cs.LG math.ST

    Majority-of-Three: The Simplest Optimal Learner?

    Authors: Ishaq Aden-Ali, Mikael Møller Høgsgaard, Kasper Green Larsen, Nikita Zhivotovskiy

    Abstract: Developing an optimal PAC learning algorithm in the realizable setting, where empirical risk minimization (ERM) is suboptimal, was a major open problem in learning theory for decades. The problem was finally resolved by Hanneke a few years ago. Unfortunately, Hanneke's algorithm is quite complex as it returns the majority vote of many ERM classifiers that are trained on carefully selected subsets… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 22 pages

  10. arXiv:2302.06165  [pdf, ps, other

    cs.DS cs.LG

    Sparse Dimensionality Reduction Revisited

    Authors: Mikael Møller Høgsgaard, Lion Kamma, Kasper Green Larsen, Jelani Nelson, Chris Schwiegelshohn

    Abstract: The sparse Johnson-Lindenstrauss transform is one of the central techniques in dimensionality reduction. It supports embedding a set of $n$ points in $\mathbb{R}^d$ into $m=O(\varepsilon^{-2} \lg n)$ dimensions while preserving all pairwise distances to within $1 \pm \varepsilon$. Each input point $x$ is embedded to $Ax$, where $A$ is an $m \times d$ matrix having $s$ non-zeros per column, allowin… ▽ More

    Submitted 13 February, 2023; originally announced February 2023.

  11. arXiv:2302.03071  [pdf, ps, other

    cs.GT cs.DS

    Optimally Interpolating between Ex-Ante Fairness and Welfare

    Authors: Mikael Møller Høgsgaard, Panagiotis Karras, Wenyue Ma, Nidhi Rathi, Chris Schwiegelshohn

    Abstract: For the fundamental problem of allocating a set of resources among individuals with varied preferences, the quality of an allocation relates to the degree of fairness and the collective welfare achieved. Unfortunately, in many resource-allocation settings, it is computationally hard to maximize welfare while achieving fairness goals. In this work, we consider ex-ante notions of fairness; popular… ▽ More

    Submitted 6 February, 2023; originally announced February 2023.

  12. arXiv:2301.11571  [pdf, other

    cs.LG cs.CC cs.DS

    AdaBoost is not an Optimal Weak to Strong Learner

    Authors: Mikael Møller Høgsgaard, Kasper Green Larsen, Martin Ritzert

    Abstract: AdaBoost is a classic boosting algorithm for combining multiple inaccurate classifiers produced by a weak learner, to produce a strong learner with arbitrarily high accuracy when given enough training data. Determining the optimal number of samples necessary to obtain a given accuracy of the strong learner, is a basic learning theoretic question. Larsen and Ritzert (NeurIPS'22) recently presented… ▽ More

    Submitted 27 January, 2023; originally announced January 2023.

  13. arXiv:2207.03304  [pdf, ps, other

    cs.DS

    Barriers for Faster Dimensionality Reduction

    Authors: Ora Nova Fandina, Mikael Møller Høgsgaard, Kasper Green Larsen

    Abstract: The Johnson-Lindenstrauss transform allows one to embed a dataset of $n$ points in $\mathbb{R}^d$ into $\mathbb{R}^m,$ while preserving the pairwise distance between any pair of points up to a factor $(1 \pm \varepsilon)$, provided that $m = Ω(\varepsilon^{-2} \lg n)$. The transform has found an overwhelming number of algorithmic applications, allowing to speed up algorithms and reducing memory co… ▽ More

    Submitted 7 July, 2022; originally announced July 2022.

  14. arXiv:2204.01800  [pdf, ps, other

    cs.DS cs.LG

    The Fast Johnson-Lindenstrauss Transform is Even Faster

    Authors: Ora Nova Fandina, Mikael Møller Høgsgaard, Kasper Green Larsen

    Abstract: The seminal Fast Johnson-Lindenstrauss (Fast JL) transform by Ailon and Chazelle (SICOMP'09) embeds a set of $n$ points in $d$-dimensional Euclidean space into optimal $k=O(\varepsilon^{-2} \ln n)$ dimensions, while preserving all pairwise distances to within a factor $(1 \pm \varepsilon)$. The Fast JL transform supports computing the embedding of a data point in $O(d \ln d +k \ln^2 n)$ time, wher… ▽ More

    Submitted 4 April, 2022; originally announced April 2022.