Showing 1–3 of 3 results for author: Gupte, A

  1. arXiv:2402.14645  [pdf, ps, other]

    cs.LG stat.ML

    Sparse Linear Regression and Lattice Problems

    Authors: Aparna Gupte, Neekon Vafa, Vinod Vaikuntanathan

    Abstract: Sparse linear regression (SLR) is a well-studied problem in statistics where one is given a design matrix $X\in\mathbb{R}^{m\times n}$ and a response vector $y=X\theta^*+w$ for a $k$-sparse vector $\theta^*$ (that is, $\|\theta^*\|_0\leq k$) and small, arbitrary noise $w$, and the goal is to find a $k$-sparse $\widehat{\theta} \in \mathbb{R}^n$ that minimizes the mean squared prediction error…

    Submitted 4 February, 2025; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: TCC 2024; minor edits

  2. arXiv:2206.05794  [pdf, other]

    cs.LG stat.ML

    SGD and Weight Decay Secretly Minimize the Rank of Your Neural Network

    Authors: Tomer Galanti, Zachary S. Siegel, Aparna Gupte, Tomaso Poggio

    Abstract: We investigate the inherent bias of Stochastic Gradient Descent (SGD) toward learning low-rank weight matrices during the training of deep neural networks. Our results demonstrate that training with mini-batch SGD and weight decay induces a bias toward rank minimization in the weight matrices. Specifically, we show both theoretically and empirically that this bias becomes more pronounced with smal…

    Submitted 18 October, 2024; v1 submitted 12 June, 2022; originally announced June 2022.

  3. arXiv:2106.03131  [pdf, ps, other]

    cs.LG stat.ML

    The Fine-Grained Hardness of Sparse Linear Regression

    Authors: Aparna Gupte, Vinod Vaikuntanathan

    Abstract: Sparse linear regression is the well-studied inference problem where one is given a design matrix $\mathbf{A} \in \mathbb{R}^{M\times N}$ and a response vector $\mathbf{b} \in \mathbb{R}^M$, and the goal is to find a solution $\mathbf{x} \in \mathbb{R}^{N}$ which is $k$-sparse (that is, it has at most $k$ non-zero coordinates) and minimizes the prediction error…

    Submitted 15 February, 2022; v1 submitted 6 June, 2021; originally announced June 2021.