Showing 1–7 of 7 results for author: Rangamani, A

  1. arXiv:2503.22059  [pdf, other]

    cs.LG eess.SP stat.ML

    Low Rank and Sparse Fourier Structure in Recurrent Networks Trained on Modular Addition

    Authors: Akshay Rangamani

    Abstract: Modular addition tasks serve as a useful test bed for observing empirical phenomena in deep learning, including the phenomenon of \emph{grokking}. Prior work has shown that one-layer transformer architectures learn Fourier Multiplication circuits to solve modular addition tasks. In this paper, we show that Recurrent Neural Networks (RNNs) trained on modular addition tasks also use a Fourier Multip…

    Submitted 27 March, 2025; originally announced March 2025.

    Comments: To appear at ICASSP 2025

  2. arXiv:2411.13733  [pdf, ps, other]

    cs.LG stat.ML

    On Generalization Bounds for Neural Networks with Low Rank Layers

    Authors: Andrea Pinto, Akshay Rangamani, Tomaso Poggio

    Abstract: While previous optimization results have suggested that deep neural networks tend to favour low-rank weight matrices, the implications of this inductive bias on generalization bounds remain underexplored. In this paper, we apply Maurer's chain rule for Gaussian complexity to analyze how low-rank layers in deep networks can prevent the accumulation of rank and dimensionality factors that typically…

    Submitted 20 November, 2024; originally announced November 2024.

    Comments: Published in the MIT DSpace repository: https://dspace.mit.edu/handle/1721.1/157263

  3. arXiv:2110.11536  [pdf, other]

    cs.AI cs.LG

    Neural-guided, Bidirectional Program Search for Abstraction and Reasoning

    Authors: Simon Alford, Anshula Gandhi, Akshay Rangamani, Andrzej Banburski, Tony Wang, Sylee Dandekar, John Chin, Tomaso Poggio, Peter Chin

    Abstract: One of the challenges facing artificial intelligence research today is designing systems capable of utilizing systematic reasoning to generalize to new tasks. The Abstraction and Reasoning Corpus (ARC) measures such a capability through a set of visual reasoning tasks. In this paper we report incremental progress on ARC and lay the foundations for two approaches to abstraction and reasoning not ba…

    Submitted 26 October, 2021; v1 submitted 21 October, 2021; originally announced October 2021.

    Comments: Published as a conference paper at Complex Networks 2021

  4. arXiv:2006.15522  [pdf, other]

    stat.ML cs.LG

    For interpolating kernel machines, minimizing the norm of the ERM solution minimizes stability

    Authors: Akshay Rangamani, Lorenzo Rosasco, Tomaso Poggio

    Abstract: We study the average $\mbox{CV}_{loo}$ stability of kernel ridge-less regression and derive corresponding risk bounds. We show that the interpolating solution with minimum norm minimizes a bound on $\mbox{CV}_{loo}$ stability, which in turn is controlled by the condition number of the empirical kernel matrix. The latter can be characterized in the asymptotic regime where both the dimension and car…

    Submitted 11 October, 2020; v1 submitted 28 June, 2020; originally announced June 2020.

  5. arXiv:1902.02434  [pdf, other]

    stat.ML cs.LG

    A Scale Invariant Flatness Measure for Deep Network Minima

    Authors: Akshay Rangamani, Nam H. Nguyen, Abhishek Kumar, Dzung Phan, Sang H. Chin, Trac D. Tran

    Abstract: It has been empirically observed that the flatness of minima obtained from training deep networks seems to correlate with better generalization. However, for deep networks with positively homogeneous activations, most measures of sharpness/flatness are not invariant to rescaling of the network parameters, corresponding to the same function. This means that the measure of flatness/sharpness can be…

    Submitted 6 February, 2019; originally announced February 2019.

  6. arXiv:1803.04497  [pdf, other]

    cs.SE cs.LG stat.ML

    Automated software vulnerability detection with machine learning

    Authors: Jacob A. Harer, Louis Y. Kim, Rebecca L. Russell, Onur Ozdemir, Leonard R. Kosta, Akshay Rangamani, Lei H. Hamilton, Gabriel I. Centeno, Jonathan R. Key, Paul M. Ellingwood, Erik Antelman, Alan Mackay, Marc W. McConley, Jeffrey M. Opper, Peter Chin, Tomo Lazovich

    Abstract: Thousands of security vulnerabilities are discovered in production software each year, either reported publicly to the Common Vulnerabilities and Exposures database or discovered internally in proprietary code. Vulnerabilities often manifest themselves in subtle ways that are not obvious to code reviewers or the developers themselves. With the wealth of open source code available for analysis, the…

    Submitted 2 August, 2018; v1 submitted 14 February, 2018; originally announced March 2018.

  7. arXiv:1708.03735  [pdf, other]

    cs.LG math.OC stat.ML

    Sparse Coding and Autoencoders

    Authors: Akshay Rangamani, Anirbit Mukherjee, Amitabh Basu, Tejaswini Ganapathy, Ashish Arora, Sang Chin, Trac D. Tran

    Abstract: In "Dictionary Learning" one tries to recover incoherent matrices $A^* \in \mathbb{R}^{n \times h}$ (typically overcomplete and whose columns are assumed to be normalized) and sparse vectors $x^* \in \mathbb{R}^h$ with a small support of size $h^p$ for some $0 < p < 1$ while having access to observations $y \in \mathbb{R}^n$ where $y = A^*x^*$. In this work we undertake a rigorous analysis of wheth…

    Submitted 20 October, 2017; v1 submitted 11 August, 2017; originally announced August 2017.

    Comments: In this new version of the paper, with a small change in the distributional assumptions, we are actually able to prove the asymptotic criticality of a neighbourhood of the ground truth dictionary for even just the standard squared loss of the ReLU autoencoder (unlike the regularized loss in the older version).