Skip to main content

Showing 1–6 of 6 results for author: Hanchi, A E

.
  1. arXiv:2411.12029  [pdf, ps, other

    stat.ML cs.LG math.ST

    On the Efficiency of ERM in Feature Learning

    Authors: Ayoub El Hanchi, Chris J. Maddison, Murat A. Erdogdu

    Abstract: Given a collection of feature maps indexed by a set $\mathcal{T}$, we study the performance of empirical risk minimization (ERM) on regression problems with square loss over the union of the linear classes induced by these feature maps. This setup aims at capturing the simplest instance of feature learning, where the model is expected to jointly learn from the data an appropriate feature map and a… ▽ More

    Submitted 18 November, 2024; originally announced November 2024.

    Comments: 23 pages, 0 figures

  2. arXiv:2406.12145  [pdf, ps, other

    math.ST stat.ML

    Minimax Linear Regression under the Quantile Risk

    Authors: Ayoub El Hanchi, Chris J. Maddison, Murat A. Erdogdu

    Abstract: We study the problem of designing minimax procedures in linear regression under the quantile risk. We start by considering the realizable setting with independent Gaussian noise, where for any given noise level and distribution of inputs, we obtain the exact minimax quantile risk for a rich family of error functions and establish the minimaxity of OLS. This improves on the known lower bounds for t… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  3. arXiv:2310.12437  [pdf, ps, other

    math.ST stat.ML

    Optimal Excess Risk Bounds for Empirical Risk Minimization on $p$-Norm Linear Regression

    Authors: Ayoub El Hanchi, Murat A. Erdogdu

    Abstract: We study the performance of empirical risk minimization on the $p$-norm linear regression problem for $p \in (1, \infty)$. We show that, in the realizable case, under no moment assumptions, and up to a distribution-dependent constant, $O(d)$ samples are enough to exactly recover the target. Otherwise, for $p \in [2, \infty)$, and under weak moment assumptions on the target and the covariates, we p… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 October, 2023; originally announced October 2023.

    Comments: Corrected typos

  4. arXiv:2210.01883  [pdf, other

    cs.LG

    Contrastive Learning Can Find An Optimal Basis For Approximately View-Invariant Functions

    Authors: Daniel D. Johnson, Ayoub El Hanchi, Chris J. Maddison

    Abstract: Contrastive learning is a powerful framework for learning self-supervised representations that generalize well to downstream supervised tasks. We show that multiple existing contrastive learning methods can be reinterpreted as learning kernel functions that approximate a fixed positive-pair kernel. We then prove that a simple representation obtained by combining this kernel with PCA provably minim… ▽ More

    Submitted 14 February, 2023; v1 submitted 4 October, 2022; originally announced October 2022.

    Comments: Published at ICLR 2023

  5. arXiv:2103.12293  [pdf, other

    math.OC cs.LG stat.ML

    Stochastic Reweighted Gradient Descent

    Authors: Ayoub El Hanchi, David A. Stephens

    Abstract: Despite the strong theoretical guarantees that variance-reduced finite-sum optimization algorithms enjoy, their applicability remains limited to cases where the memory overhead they introduce (SAG/SAGA), or the periodic full gradient computation they require (SVRG/SARAH) are manageable. A promising approach to achieving variance reduction while avoiding these drawbacks is the use of importance sam… ▽ More

    Submitted 23 March, 2021; originally announced March 2021.

  6. arXiv:2103.12243  [pdf, other

    cs.LG math.OC stat.ML

    Adaptive Importance Sampling for Finite-Sum Optimization and Sampling with Decreasing Step-Sizes

    Authors: Ayoub El Hanchi, David A. Stephens

    Abstract: Reducing the variance of the gradient estimator is known to improve the convergence rate of stochastic gradient-based optimization and sampling algorithms. One way of achieving variance reduction is to design importance sampling strategies. Recently, the problem of designing such schemes was formulated as an online learning problem with bandit feedback, and algorithms with sub-linear static regret… ▽ More

    Submitted 22 March, 2021; originally announced March 2021.

    Comments: Advances in Neural Information Processing Systems, Dec 2020, Vancouver, Canada