Showing 1–27 of 27 results for author: Garg, I

  1. arXiv:2502.15725  [pdf, other]

    cs.CL

    Town Hall Debate Prompting: Enhancing Logical Reasoning in LLMs through Multi-Persona Interaction

    Authors: Vivaan Sandwar, Bhav Jain, Rishan Thangaraj, Ishaan Garg, Michael Lam, Kevin Zhu

    Abstract: Debate is a commonly used form of human communication catered towards problem-solving because of its efficiency. Debate fundamentally allows multiple viewpoints to be brought up in problem-solving, and for complex problems, each viewpoint opens a new path for problem-solving. In this work, we apply this concept to LLM decision-making by proposing town hall-style debate prompting (THDP), a promptin…

    Submitted 28 January, 2025; originally announced February 2025.

    Comments: Accepted to SoCal NLP Symposium 2024

  2. arXiv:2407.04797  [pdf, other]

    cs.CV cs.LG

    Revealing the Utilized Rank of Subspaces of Learning in Neural Networks

    Authors: Isha Garg, Christian Koguchi, Eshan Verma, Daniel Ulbricht

    Abstract: In this work, we study how well the learned weights of a neural network utilize the space available to them. This notion is related to capacity, but additionally incorporates the interaction of the network architecture with the dataset. Most learned weights appear to be full rank, and are therefore not amenable to low rank decomposition. This deceptively implies that the weights are utilizing the…

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: Presented at Efficient Systems for Foundation Models Workshop at the International Conference on Machine Learning (ICML) 2024

  3. arXiv:2403.13082  [pdf, other]

    cs.ET cs.LG

    Pruning for Improved ADC Efficiency in Crossbar-based Analog In-memory Accelerators

    Authors: Timur Ibrayev, Isha Garg, Indranil Chakraborty, Kaushik Roy

    Abstract: Deep learning has proved successful in many applications but suffers from high computational demands and requires custom accelerators for deployment. Crossbar-based analog in-memory architectures are attractive for acceleration of deep neural networks (DNN), due to their high data reuse and high efficiency enabled by combining storage and computation in memory. However, they require analog-to-digi…

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: 11 pages, 5 figures

  4. arXiv:2307.05831  [pdf, other]

    cs.LG cs.AI

    Memorization Through the Lens of Curvature of Loss Function Around Samples

    Authors: Isha Garg, Deepak Ravikumar, Kaushik Roy

    Abstract: Deep neural networks are over-parameterized and easily overfit the datasets they train on. In the extreme case, it has been shown that these networks can memorize a training set with fully randomized labels. We propose using the curvature of loss function around each training sample, averaged over training epochs, as a measure of memorization of the sample. We use this metric to study the generali…

    Submitted 1 October, 2023; v1 submitted 11 July, 2023; originally announced July 2023.

    Comments: Preprint

  5. TOFU: Towards Obfuscated Federated Updates by Encoding Weight Updates into Gradients from Proxy Data

    Authors: Isha Garg, Manish Nagaraj, Kaushik Roy

    Abstract: Advances in Federated Learning and an abundance of user data have enabled rich collaborative learning between multiple clients, without sharing user data. This is done via a central server that aggregates learning in the form of weight updates. However, this comes at the cost of repeated expensive communication between the clients and the server, and concerns about compromised user privacy. The in…

    Submitted 20 January, 2022; originally announced January 2022.

    Comments: First two authors contributed equally to the paper

  6. arXiv:2112.10844  [pdf, other]

    cs.CV

    Encoding Hierarchical Information in Neural Networks helps in Subpopulation Shift

    Authors: Amitangshu Mukherjee, Isha Garg, Kaushik Roy

    Abstract: Over the past decade, deep neural networks have proven to be adept in image classification tasks, often surpassing humans in terms of accuracy. However, standard neural networks often fail to understand the concept of hierarchical structures and dependencies among different classes for vision related tasks. Humans, on the other hand, seem to intuitively learn categories conceptually, progressively…

    Submitted 13 June, 2022; v1 submitted 20 December, 2021; originally announced December 2021.

    Comments: 15 pages, 7 figures

  7. arXiv:2104.12528  [pdf, other]

    cs.LG eess.IV

    Spatio-Temporal Pruning and Quantization for Low-latency Spiking Neural Networks

    Authors: Sayeed Shafayet Chowdhury, Isha Garg, Kaushik Roy

    Abstract: Spiking Neural Networks (SNNs) are a promising alternative to traditional deep learning methods since they perform event-driven information processing. However, a major drawback of SNNs is high inference latency. The efficiency of SNNs could be enhanced using compression methods such as pruning and quantization. Notably, SNNs, unlike their non-spiking counterparts, consist of a temporal dimension,…

    Submitted 28 April, 2021; v1 submitted 26 April, 2021; originally announced April 2021.

  8. arXiv:2103.09762  [pdf, other]

    cs.LG cs.CV

    Gradient Projection Memory for Continual Learning

    Authors: Gobinda Saha, Isha Garg, Kaushik Roy

    Abstract: The ability to learn continually without forgetting the past tasks is a desired attribute for artificial learning systems. Existing approaches to enable such learning in artificial neural networks usually rely on network growth, importance based weight update or replay of old data from the memory. In contrast, we propose a novel approach where a neural network learns new tasks by taking gradient s…

    Submitted 17 March, 2021; originally announced March 2021.

    Comments: Accepted for Oral Presentation at ICLR 2021 https://openreview.net/forum?id=3AOj0RCNC2

    Journal ref: International Conference on Learning Representations (ICLR), 2021

  9. arXiv:2012.08398  [pdf, other]

    cs.LG cs.CV

    Exploring Vicinal Risk Minimization for Lightweight Out-of-Distribution Detection

    Authors: Deepak Ravikumar, Sangamesh Kodge, Isha Garg, Kaushik Roy

    Abstract: Deep neural networks have found widespread adoption in solving complex tasks ranging from image recognition to natural language processing. However, these networks make confident mispredictions when presented with data that does not belong to the training distribution, i.e. out-of-distribution (OoD) samples. In this paper we explore whether the property of Vicinal Risk Minimization (VRM) to smooth…

    Submitted 15 December, 2020; originally announced December 2020.

  10. arXiv:2010.01795  [pdf, other]

    cs.LG cs.CV stat.ML

    DCT-SNN: Using DCT to Distribute Spatial Information over Time for Learning Low-Latency Spiking Neural Networks

    Authors: Isha Garg, Sayeed Shafayet Chowdhury, Kaushik Roy

    Abstract: Spiking Neural Networks (SNNs) offer a promising alternative to traditional deep learning frameworks, since they provide higher computational efficiency due to event-driven information processing. SNNs distribute the analog values of pixel intensities into binary spikes over time. However, the most widely used input coding schemes, such as Poisson based rate-coding, do not leverage the additional…

    Submitted 5 October, 2020; originally announced October 2020.

    Comments: The first two authors contributed equally to this paper

  11. arXiv:2008.01524  [pdf, other]

    cs.LG stat.ML

    TREND: Transferability based Robust ENsemble Design

    Authors: Deepak Ravikumar, Sangamesh Kodge, Isha Garg, Kaushik Roy

    Abstract: Deep Learning models hold state-of-the-art performance in many fields, but their vulnerability to adversarial examples poses a threat to their ubiquitous deployment in practical settings. Additionally, adversarial inputs generated on one classifier have been shown to transfer to other classifiers trained on similar data, which makes the attacks possible even if model parameters are not revealed to t…

    Submitted 30 March, 2021; v1 submitted 4 August, 2020; originally announced August 2020.

  12. arXiv:2001.08650  [pdf, other]

    cs.LG cs.CV stat.ML

    SPACE: Structured Compression and Sharing of Representational Space for Continual Learning

    Authors: Gobinda Saha, Isha Garg, Aayush Ankit, Kaushik Roy

    Abstract: Humans learn adaptively and efficiently throughout their lives. However, incrementally learning tasks causes artificial neural networks to overwrite relevant information learned about older tasks, resulting in 'Catastrophic Forgetting'. Efforts to overcome this phenomenon often utilize resources poorly, for instance, by growing the network architecture or needing to save parametric importance scor…

    Submitted 3 February, 2021; v1 submitted 23 January, 2020; originally announced January 2020.

    Comments: The first two authors contributed equally to this paper

  13. Constructing Energy-efficient Mixed-precision Neural Networks through Principal Component Analysis for Edge Intelligence

    Authors: Indranil Chakraborty, Deboleena Roy, Isha Garg, Aayush Ankit, Kaushik Roy

    Abstract: The 'Internet of Things' has brought increased demand for AI-based edge computing in applications ranging from healthcare monitoring systems to autonomous vehicles. Quantization is a powerful tool to address the growing computational cost of such applications, and yields significant compression over full-precision networks. However, quantization can result in substantial loss of performance for co…

    Submitted 2 December, 2019; v1 submitted 4 June, 2019; originally announced June 2019.

    Comments: 14 pages, 4 figures, 8 tables

    Journal ref: Nature Machine Intelligence, 2, 43-55 (2020)

  14. A Low Effort Approach to Structured CNN Design Using PCA

    Authors: Isha Garg, Priyadarshini Panda, Kaushik Roy

    Abstract: Deep learning models hold state of the art performance in many fields, yet their design is still based on heuristics or grid search methods that often result in overparametrized networks. This work proposes a method to analyze a trained network and deduce an optimized, compressed architecture that preserves accuracy while keeping computational costs tractable. Model compression is an active field…

    Submitted 10 January, 2020; v1 submitted 14 December, 2018; originally announced December 2018.

    Comments: To be Published in IEEE Access, Volume 8, 2020

  15. Topological pseudodefects of a supersymmetric $SO(10)$ model and cosmology

    Authors: Ila Garg, Urjit A. Yajnik

    Abstract: Obtaining realistic supersymmetry preserving vacua in the minimal renormalizable supersymmetric $Spin(10)$ GUT model introduces considerations of the non-trivial topology of the vacuum manifold. The $D$-parity of low energy unification schemes gets lifted to a one-parameter subgroup $U(1)_D$ of $Spin(10)$. Yet, the choice of the fields signaling spontaneous symmetry breaking leads to disconnected…

    Submitted 29 September, 2018; v1 submitted 12 February, 2018; originally announced February 2018.

    Comments: 8 Pages, 2 figures, matches with the published version in PRD

    Journal ref: Phys. Rev. D 98, 063523 (2018)

  16. No-scale SUGRA Inflation and Type-I seesaw

    Authors: Ila Garg, Subhendra Mohanty

    Abstract: We show that MSSM with three right handed neutrinos incorporating a renormalizable Type-I seesaw superpotential and no-scale SUGRA Kähler potential can lead to a Starobinsky kind of inflation potential along a flat direction associated with a gauge invariant combination of Higgs, slepton and right handed sneutrino superfields. The inflation conditions put constraints on the Dirac Yukawa coupling and…

    Submitted 27 July, 2018; v1 submitted 6 November, 2017; originally announced November 2017.

    Comments: 16 Pages, Version accepted for publication in IJMPA, Journal-ref: IJMPA 33, 1850127 (2018)

  17. Electroweak vacuum stability in presence of singlet scalar dark matter in TeV scale seesaw models

    Authors: Ila Garg, Srubabati Goswami, Vishnudath K. N., Najimuddin Khan

    Abstract: We consider singlet extensions of the standard model, both in the fermion and the scalar sector, to account for the generation of neutrino mass at the TeV scale and the existence of dark matter respectively. For the neutrino sector we consider models with extra singlet fermions which can generate neutrino mass via the so called inverse or linear seesaw mechanism whereas a singlet scalar is introdu…

    Submitted 27 June, 2017; originally announced June 2017.

    Comments: 35 pages, 15 figures

    Journal ref: Phys. Rev. D 96, 055020 (2017)

  18. NMSGUT emergence and Trans-Unification RG flows

    Authors: Charanjit S. Aulakh, Ila Garg, Charanjit K. Khosa

    Abstract: Consistency of trans-unification RG evolution is used to discuss the domain of definition of the New Minimal Supersymmetric SO(10) GUT (NMSGUT). We compute the 1-loop RGE $β$ functions, simplifying generic formulae using constraints of gauge invariance and superpotential structure. We also calculate the 2 loop contributions to the gauge coupling and gaugino mass and indicate how to get full 2 loop…

    Submitted 22 October, 2018; v1 submitted 1 September, 2015; originally announced September 2015.

    Comments: 36 pages, 1 Figure, 4 Tables, 77 equations, 42 references, RevTeX4 PDFLateX. Version published in Phys. Rev. D

    Journal ref: Phys. Rev. D 98, 075006 (2018)

  19. arXiv:1506.05204  [pdf, other]

    hep-ph

    New minimal supersymmetric SO(10) GUT phenomenology and its cosmological implications

    Authors: Ila Garg

    Abstract: Supersymmetric GUTs based on SO(10) gauge group are leading contenders to describe particle physics beyond the Standard Model. Among these the "New minimal supersymmetric SO(10) grand unified theory" (NMSGUT) based on Higgs system 10+120+210+126+$\overline{126}$ has been developing since 1982. It now successfully fits the whole Standard Model gauge coupling, symmetry breaking and fermion mass-mixi…

    Submitted 4 September, 2015; v1 submitted 17 June, 2015; originally announced June 2015.

    Comments: Ph.D Thesis (Defended on 1 April, 2015), 187 Pages, 16 Figures, References added

  20. No scale SUGRA SO(10) derived Starobinsky Model of Inflation

    Authors: Ila Garg, Subhendra Mohanty

    Abstract: We show that a supersymmetric renormalizable theory based on gauge group SO(10) and Higgs system $\mathbf{10} \oplus \mathbf{210} \oplus \mathbf{126} \oplus \overline{\mathbf{126}}$ with no scale supergravity can lead to a Starobinsky kind of potential for inflation. Successful inflation is possible in the cases where the potential during inflation corresponds to…

    Submitted 6 October, 2015; v1 submitted 29 April, 2015; originally announced April 2015.

    Comments: 6 pages, 2 figures, Replaced with version to appear in Phys Lett B

  21. Baryon Stability on the Higgs Dissolution Edge : Threshold corrections and suppression of Baryon violation in the NMSGUT

    Authors: Charanjit S. Aulakh, Ila Garg, Charanjit K. Khosa

    Abstract: Superheavy threshold corrections to the matching condition between matter Yukawa couplings of the effective Minimal Supersymmetric Standard Model (MSSM) and the New Minimal Supersymmetric SO(10) GUT (NMSGUT) provide a novel and generic mechanism for reducing the long standing and generically problematic operator dimension 5 Baryon decay rates. In suitable regions of the parameter space strong wav…

    Submitted 25 March, 2014; v1 submitted 24 November, 2013; originally announced November 2013.

    Comments: PdfLatex. 50 pages. Version accepted for publication in Nuclear Phys.B(2014). Available online at http://dx.doi.org/10.1016/j.nuclphysb.2014.03.003. arXiv admin note: substantial text overlap with arXiv:1107.2963

    Journal ref: Nuclear Physics B882(2014) 397

  22. arXiv:1201.0519  [pdf, ps, other]

    hep-ph astro-ph.CO

    Supersymmetric Seesaw Inflation

    Authors: Charanjit S. Aulakh, Ila Garg

    Abstract: Supersymmetric Unified theories which incorporate a renormalizable Type I seesaw mechanism for small neutrino masses can also provide slow roll inflection point inflation along a flat direction associated with a gauge invariant combination of the Higgs, slepton and right handed sneutrino superfields. Inflationary parameters are related to the Majorana and Dirac couplings responsible for neutrino m…

    Submitted 20 September, 2012; v1 submitted 2 January, 2012; originally announced January 2012.

    Comments: Version published in Phys. Rev. D 86,065001 (2012), Sept. 15, 2012

    Journal ref: Phys. Rev. D 86, 065001 (2012)

  23. arXiv:1103.5601  [pdf, ps, other]

    cond-mat.stat-mech q-bio.BM

    Non-linear interaction in random matrix models of RNA

    Authors: Itty Garg, Pradeep Bhadola, N. Deo

    Abstract: A non-linear Penner type interaction is introduced and studied in the random matrix model of homo-RNA. The asymptotics in length of the partition function is discussed for small and large $N$ (size of matrix). The interaction doubles the coupling ($v$) between the bases and the dependence of the combinatoric factor on ($v,N$) is found. For small $N$, the effect of interaction changes the power law…

    Submitted 29 March, 2011; originally announced March 2011.

    Comments: 7 pages, 2 figures, 1 table

  24. arXiv:0911.3710  [pdf, ps, other]

    q-bio.BM cond-mat.stat-mech

    Scaling, phase transition and genus distribution functions in matrix models of RNA with linear external interactions

    Authors: I. Garg, N. Deo

    Abstract: A linear external perturbation is introduced in the action of the partition function of the random matrix model of RNA [G. Vernizzi, H. Orland and A. Zee, Phys. Rev. Lett. 94, 168103 (2005)]. It is seen that (i) the perturbation distinguishes between paired and unpaired bases in that there are structural changes, from unpaired and paired base structures ($0 \leq α < 1$) to completely paired base…

    Submitted 19 November, 2009; originally announced November 2009.

    Comments: 20 pages, 22 figures, 1 table

  25. arXiv:0908.3412  [pdf, ps, other]

    cond-mat.mes-hall

    Nitrogen clusters inside C60 cage and new nanoscale energetic materials

    Authors: Hitesh Sharma, Isha Garg, Keya Dharamvir, V. K. Jindal

    Abstract: We explore the possibility to trap polynitrogen clusters inside C60 fullerene cage, opening a new direction of developing nitrogen-rich high energy materials. We found that a maximum of 13 nitrogen atoms can be encapsulated in a C60 cage. The nitrogen clusters in confinement exhibit unique stable structures in polymeric form which possess a large component (~70-80%) of single bond character. Th…

    Submitted 24 August, 2009; originally announced August 2009.

    Comments: 25 pages. Submitted to Carbon

  26. RNA matrix models with external interactions and their asymptotic behaviour

    Authors: I. Garg, N. Deo

    Abstract: We study a matrix model of RNA in which an external perturbation acts on n nucleotides of the polymer chain. The effect of the perturbation appears in the exponential generating function of the partition function as a factor $(1-\frac{nα}{L})$ [where $α$ is the ratio of strengths of the original to the perturbed term and L is length of the chain]. The asymptotic behaviour of the genus distributi…

    Submitted 5 September, 2008; originally announced September 2008.

    Comments: 9 pages, 5 figures, 2 tables

  27. arXiv:0802.2440  [pdf, ps, other]

    cond-mat.stat-mech cond-mat.soft q-bio.BM

    Genus Distributions For Extended Matrix Models Of RNA

    Authors: Itty Garg, N. Deo

    Abstract: We construct and study an extended random matrix model of RNA (polymer) folding. A perturbation which acts on all the nucleotides in the chain is added to the action of the RNA partition function. The effect of this perturbation on the partition function and the Genus Distributions is studied. This perturbation distinguishes between the paired and unpaired bases. For example, for $α = 1$ (where…

    Submitted 23 April, 2008; v1 submitted 18 February, 2008; originally announced February 2008.

    Comments: 15 pages, 4 figures, 3 tables