Skip to main content

Showing 1–4 of 4 results for author: Worah, P

Searching in archive stat. Search in all archives.
.
  1. arXiv:2502.18463  [pdf, other

    cs.LG stat.ML

    Allocating Variance to Maximize Expectation

    Authors: Renato Purita Paes Leme, Cliff Stein, Yifeng Teng, Pratik Worah

    Abstract: We design efficient approximation algorithms for maximizing the expectation of the supremum of families of Gaussian random variables. In particular, let $\mathrm{OPT}:=\max_{σ_1,\cdots,σ_n}\mathbb{E}\left[\sum_{j=1}^{m}\max_{i\in S_j} X_i\right]$, where $X_i$ are Gaussian, $S_j\subset[n]$ and $\sum_iσ_i^2=1$, then our theoretical results include: - We characterize the optimal variance allocation… ▽ More

    Submitted 25 February, 2025; originally announced February 2025.

  2. arXiv:2401.11562  [pdf, other

    stat.ML cs.LG q-bio.QM

    Enhancing selectivity using Wasserstein distance based reweighing

    Authors: Pratik Worah

    Abstract: Given two labeled data-sets $\mathcal{S}$ and $\mathcal{T}$, we design a simple and efficient greedy algorithm to reweigh the loss function such that the limiting distribution of the neural network weights that result from training on $\mathcal{S}$ approaches the limiting distribution that would have resulted by training on $\mathcal{T}$. On the theoretical side, we prove that when the metric en… ▽ More

    Submitted 25 February, 2025; v1 submitted 21 January, 2024; originally announced January 2024.

  3. arXiv:2303.15634  [pdf, other

    cs.LG math.OC stat.ML

    Learning Rate Schedules in the Presence of Distribution Shift

    Authors: Matthew Fahrbach, Adel Javanmard, Vahab Mirrokni, Pratik Worah

    Abstract: We design learning rate schedules that minimize regret for SGD-based online learning in the presence of a changing data distribution. We fully characterize the optimal learning rate schedule for online linear regression via a novel analysis with stochastic differential equations. For general convex loss functions, we propose new learning rate schedules that are robust to distribution shift and we… ▽ More

    Submitted 20 August, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

    Comments: 33 pages, 6 figures

    Journal ref: Proceedings of the 40th International Conference on Machine Learning (ICML 2023) 9523-9546

  4. arXiv:2006.08667  [pdf, other

    math.OC cs.LG stat.ML

    The Landscape of the Proximal Point Method for Nonconvex-Nonconcave Minimax Optimization

    Authors: Benjamin Grimmer, Haihao Lu, Pratik Worah, Vahab Mirrokni

    Abstract: Minimax optimization has become a central tool in machine learning with applications in robust optimization, reinforcement learning, GANs, etc. These applications are often nonconvex-nonconcave, but the existing theory is unable to identify and deal with the fundamental difficulties this poses. In this paper, we study the classic proximal point method (PPM) applied to nonconvex-nonconcave minimax… ▽ More

    Submitted 1 April, 2021; v1 submitted 15 June, 2020; originally announced June 2020.

    Comments: Notably updated version that connects our theory with that of Attouch and Wets from the 80s and notably expands on our first posting to apply to generic minimax problems (rather than requiring bilinear interaction)

    MSC Class: 65K05; 65K10; 90C26; 90C15; 90C30