Skip to main content

Showing 1–3 of 3 results for author: Tumma, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.19561  [pdf, other

    cs.LG

    Quantifying Memory Utilization with Effective State-Size

    Authors: Rom N. Parnichkun, Neehal Tumma, Armin W. Thomas, Alessandro Moro, Qi An, Taiji Suzuki, Atsushi Yamashita, Michael Poli, Stefano Massaroli

    Abstract: The need to develop a general framework for architecture analysis is becoming increasingly important, given the expanding design space of sequence models. To this end, we draw insights from classical signal processing and control theory, to develop a quantitative measure of \textit{memory utilization}: the internal mechanisms through which a model stores past information to produce future outputs.… ▽ More

    Submitted 28 April, 2025; originally announced April 2025.

  2. arXiv:2310.03915  [pdf, other

    cs.LG

    Leveraging Low-Rank and Sparse Recurrent Connectivity for Robust Closed-Loop Control

    Authors: Neehal Tumma, Mathias Lechner, Noel Loo, Ramin Hasani, Daniela Rus

    Abstract: Developing autonomous agents that can interact with changing environments is an open challenge in machine learning. Robustness is particularly important in these settings as agents are often fit offline on expert demonstrations but deployed online where they must generalize to the closed feedback loop within the environment. In this work, we explore the application of recurrent neural networks to… ▽ More

    Submitted 30 November, 2023; v1 submitted 5 October, 2023; originally announced October 2023.

  3. arXiv:2204.03208  [pdf, other

    cs.IR cs.CL cs.LG stat.ML

    A Joint Learning Approach for Semi-supervised Neural Topic Modeling

    Authors: Jeffrey Chiu, Rajat Mittal, Neehal Tumma, Abhishek Sharma, Finale Doshi-Velez

    Abstract: Topic models are some of the most popular ways to represent textual data in an interpret-able manner. Recently, advances in deep generative models, specifically auto-encoding variational Bayes (AEVB), have led to the introduction of unsupervised neural topic models, which leverage deep generative models as opposed to traditional statistics-based topic models. We extend upon these neural topic mode… ▽ More

    Submitted 7 April, 2022; originally announced April 2022.

    Comments: To appear in the 6th ACL Workshop on Structured Prediction for NLP (SPNLP)