Search | arXiv e-print repository

Measuring and Controlling Solution Degeneracy across Task-Trained Recurrent Neural Networks

Authors: Ann Huang, Satpreet H. Singh, Flavio Martinelli, Kanaka Rajan

Abstract: Task-trained recurrent neural networks (RNNs) are widely used in neuroscience and machine learning to model dynamical computations. To gain mechanistic insight into how neural systems solve tasks, prior work often reverse-engineers individual trained networks. However, different RNNs trained on the same task and achieving similar performance can exhibit strikingly different internal solutions-a ph… ▽ More Task-trained recurrent neural networks (RNNs) are widely used in neuroscience and machine learning to model dynamical computations. To gain mechanistic insight into how neural systems solve tasks, prior work often reverse-engineers individual trained networks. However, different RNNs trained on the same task and achieving similar performance can exhibit strikingly different internal solutions-a phenomenon known as solution degeneracy. Here, we develop a unified framework to systematically quantify and control solution degeneracy across three levels: behavior, neural dynamics, and weight space. We apply this framework to 3,400 RNNs trained on four neuroscience-relevant tasks-flip-flop memory, sine wave generation, delayed discrimination, and path integration-while systematically varying task complexity, learning regime, network size, and regularization. We find that higher task complexity and stronger feature learning reduce degeneracy in neural dynamics but increase it in weight space, with mixed effects on behavior. In contrast, larger networks and structural regularization reduce degeneracy at all three levels. These findings empirically validate the Contravariance Principle and provide practical guidance for researchers aiming to tailor RNN solutions-whether to uncover shared neural mechanisms or to model individual variability observed in biological systems. This work provides a principled framework for quantifying and controlling solution degeneracy in task-trained RNNs, offering new tools for building more interpretable and biologically grounded models of neural computation. △ Less

Submitted 28 May, 2025; v1 submitted 4 October, 2024; originally announced October 2024.

arXiv:2402.02656 [pdf, other]

RACER: An LLM-powered Methodology for Scalable Analysis of Semi-structured Mental Health Interviews

Authors: Satpreet Harcharan Singh, Kevin Jiang, Kanchan Bhasin, Ashutosh Sabharwal, Nidal Moukaddam, Ankit B Patel

Abstract: Semi-structured interviews (SSIs) are a commonly employed data-collection method in healthcare research, offering in-depth qualitative insights into subject experiences. Despite their value, the manual analysis of SSIs is notoriously time-consuming and labor-intensive, in part due to the difficulty of extracting and categorizing emotional responses, and challenges in scaling human evaluation for l… ▽ More Semi-structured interviews (SSIs) are a commonly employed data-collection method in healthcare research, offering in-depth qualitative insights into subject experiences. Despite their value, the manual analysis of SSIs is notoriously time-consuming and labor-intensive, in part due to the difficulty of extracting and categorizing emotional responses, and challenges in scaling human evaluation for large populations. In this study, we develop RACER, a Large Language Model (LLM) based expert-guided automated pipeline that efficiently converts raw interview transcripts into insightful domain-relevant themes and sub-themes. We used RACER to analyze SSIs conducted with 93 healthcare professionals and trainees to assess the broad personal and professional mental health impacts of the COVID-19 crisis. RACER achieves moderately high agreement with two human evaluators (72%), which approaches the human inter-rater agreement (77%). Interestingly, LLMs and humans struggle with similar content involving nuanced emotional, ambivalent/dialectical, and psychological statements. Our study highlights the opportunities and challenges in using LLMs to improve research efficiency and opens new avenues for scalable analysis of SSIs in healthcare research. △ Less

Submitted 4 February, 2024; originally announced February 2024.

arXiv:2109.12434 [pdf, other]

Emergent behavior and neural dynamics in artificial agents tracking turbulent plumes

Authors: Satpreet Harcharan Singh, Floris van Breugel, Rajesh P. N. Rao, Bingni Wen Brunton

Abstract: Tracking a turbulent plume to locate its source is a complex control problem because it requires multi-sensory integration and must be robust to intermittent odors, changing wind direction, and variable plume statistics. This task is routinely performed by flying insects, often over long distances, in pursuit of food or mates. Several aspects of this remarkable behavior have been studied in detail… ▽ More Tracking a turbulent plume to locate its source is a complex control problem because it requires multi-sensory integration and must be robust to intermittent odors, changing wind direction, and variable plume statistics. This task is routinely performed by flying insects, often over long distances, in pursuit of food or mates. Several aspects of this remarkable behavior have been studied in detail in many experimental studies. Here, we take a complementary in silico approach, using artificial agents trained with reinforcement learning to develop an integrated understanding of the behaviors and neural computations that support plume tracking. Specifically, we use deep reinforcement learning (DRL) to train recurrent neural network (RNN) agents to locate the source of simulated turbulent plumes. Interestingly, the agents' emergent behaviors resemble those of flying insects, and the RNNs learn to represent task-relevant variables, such as head direction and time since last odor encounter. Our analyses suggest an intriguing experimentally testable hypothesis for tracking plumes in changing wind direction -- that agents follow local plume shape rather than the current wind direction. While reflexive short-memory behaviors are sufficient for tracking plumes in constant wind, longer timescales of memory are essential for tracking plumes that switch direction. At the level of neural dynamics, the RNNs' population activity is low-dimensional and organized into distinct dynamical structures, with some correspondence to behavioral modules. Our in silico approach provides key intuitions for turbulent plume tracking strategies and motivates future targeted experimental and theoretical developments. △ Less

Submitted 17 December, 2021; v1 submitted 25 September, 2021; originally announced September 2021.

ACM Class: I.2.6; I.2.0; I.5.1

arXiv:2104.14005 [pdf]

Unlocking capacities of viral genomics for the COVID-19 pandemic response

Authors: Sergey Knyazev, Karishma Chhugani, Varuni Sarwal, Ram Ayyala, Harman Singh, Smruthi Karthikeyan, Dhrithi Deshpande, Zoia Comarova, Angela Lu, Yuri Porozov, Aiping Wu, Malak Abedalthagafi, Shivashankar Nagaraj, Adam Smith, Pavel Skums, Jason Ladner, Tommy Tsan-Yuk Lam, Nicholas Wu, Alex Zelikovsky, Rob Knight, Keith Crandall, Serghei Mangul

Abstract: More than any other infectious disease epidemic, the COVID-19 pandemic has been characterized by the generation of large volumes of viral genomic data at an incredible pace due to recent advances in high-throughput sequencing technologies, the rapid global spread of SARS-CoV-2, and its persistent threat to public health. However, distinguishing the most epidemiologically relevant information encod… ▽ More More than any other infectious disease epidemic, the COVID-19 pandemic has been characterized by the generation of large volumes of viral genomic data at an incredible pace due to recent advances in high-throughput sequencing technologies, the rapid global spread of SARS-CoV-2, and its persistent threat to public health. However, distinguishing the most epidemiologically relevant information encoded in these vast amounts of data requires substantial effort across the research and public health communities. Studies of SARS-CoV-2 genomes have been critical in tracking the spread of variants and understanding its epidemic dynamics, and may prove crucial for controlling future epidemics and alleviating significant public health burdens. Together, genomic data and bioinformatics methods enable broad-scale investigations of the spread of SARS-CoV-2 at the local, national, and global scales and allow researchers the ability to efficiently track the emergence of novel variants, reconstruct epidemic dynamics, and provide important insights into drug and vaccine development and disease control. Here, we discuss the tremendous opportunities that genomics offers to unlock the effective use of SARS-CoV-2 genomic data for efficient public health surveillance and guiding timely responses to COVID-19. △ Less

Submitted 4 June, 2021; v1 submitted 28 April, 2021; originally announced April 2021.

arXiv:2102.04400 [pdf]

doi 10.1364/JOSAA.415395

Rapid Classification of Glaucomatous Fundus Images

Authors: Hardit Singh, Simarjeet Saini, Vasudevan Lakshminarayanan

Abstract: We propose a new method for training convolutional neural networks which integrates reinforcement learning along with supervised learning and use ti for transfer learning for classification of glaucoma from colored fundus images. The training method uses hill climbing techniques via two different climber types, viz "random movment" and "random detection" integrated with supervised learning model t… ▽ More We propose a new method for training convolutional neural networks which integrates reinforcement learning along with supervised learning and use ti for transfer learning for classification of glaucoma from colored fundus images. The training method uses hill climbing techniques via two different climber types, viz "random movment" and "random detection" integrated with supervised learning model though stochastic gradient descent with momentum (SGDM) model. The model was trained and tested using the Drishti GS and RIM-ONE-r2 datasets having glaucomatous and normal fundus images. The performance metrics for prediction was tested by transfer learning on five CNN architectures, namely GoogLenet, DesnseNet-201, NASNet, VGG-19 and Inception-resnet-v2. A fivefold classification was used for evaluating the perfroamnace and high sensitivities while high maintaining high accuracies were achieved. Of the models tested, the denseNet-201 architecture performed the best in terms of sensitivity and area under the curve (AUC). This method of training allows transfer learning on small datasets and can be applied for tele-ophthalmology applications including training with local datasets. △ Less

Submitted 8 February, 2021; originally announced February 2021.

Comments: Submitted for publication in JOSA A: Optics and Image Science, currently under revision

MSC Class: 68 ACM Class: I.4.9; J.4

arXiv:2012.05454 [pdf, ps, other]

doi 10.1016/j.physd.2021.132988

A minimal model for synaptic integration in simple neurons

Authors: Adrian Joseph Alva, Harjinder Singh

Abstract: Synaptic integration is a prominent aspect of neuronal information processing. The detailed mechanisms that modulate synaptic inputs determine the computational properties of any given neuron. We study a simple model for the summation of excitatory inputs from synapses and illustrate its use by characterizing some functional properties of postsynaptic neurons. In this regard, we study the response… ▽ More Synaptic integration is a prominent aspect of neuronal information processing. The detailed mechanisms that modulate synaptic inputs determine the computational properties of any given neuron. We study a simple model for the summation of excitatory inputs from synapses and illustrate its use by characterizing some functional properties of postsynaptic neurons. In this regard, we study the response of postsynaptic neurons as defined by the model to two well known noise driven processes: stochastic and coherence resonance. The model requires a small number of parameters and is especially useful to isolate the role of integration mechanisms that rely on summation of inputs with little dendritic processing. △ Less

Submitted 14 July, 2021; v1 submitted 10 December, 2020; originally announced December 2020.

Comments: 25 pages, 8 figures

Journal ref: Physica D 426 (2021) 132988

arXiv:2009.13946 [pdf, other]

ChemoVerse: Manifold traversal of latent spaces for novel molecule discovery

Authors: Harshdeep Singh, Nicholas McCarthy, Qurrat Ul Ain, Jeremiah Hayes

Abstract: In order to design a more potent and effective chemical entity, it is essential to identify molecular structures with the desired chemical properties. Recent advances in generative models using neural networks and machine learning are being widely used by many emerging startups and researchers in this domain to design virtual libraries of drug-like compounds. Although these models can help a scien… ▽ More In order to design a more potent and effective chemical entity, it is essential to identify molecular structures with the desired chemical properties. Recent advances in generative models using neural networks and machine learning are being widely used by many emerging startups and researchers in this domain to design virtual libraries of drug-like compounds. Although these models can help a scientist to produce novel molecular structures rapidly, the challenge still exists in the intelligent exploration of the latent spaces of generative models, thereby reducing the randomness in the generative procedure. In this work we present a manifold traversal with heuristic search to explore the latent chemical space. Different heuristics and scores such as the Tanimoto coefficient, synthetic accessibility, binding activity, and QED drug-likeness can be incorporated to increase the validity and proximity for desired molecular properties of the generated molecules. For evaluating the manifold traversal exploration, we produce the latent chemical space using various generative models such as grammar variational autoencoders (with and without attention) as they deal with the randomized generation and validity of compounds. With this novel traversal method, we are able to find more unseen compounds and more specific regions to mine in the latent space. Finally, these components are brought together in a simple platform allowing users to perform search, visualization and selection of novel generated compounds. △ Less

Submitted 29 September, 2020; originally announced September 2020.

Comments: 5 pages, 2 figures, Presented in First workshop on Applied Deep Generative Networks - ECAI 2020 ("link for the workshop: https://sites.google.com/view/adgn-20/home")

arXiv:2001.08349 [pdf, other]

Investigating naturalistic hand movements by behavior mining in long-term video and neural recordings

Authors: Satpreet H. Singh, Steven M. Peterson, Rajesh P. N. Rao, Bingni W. Brunton

Abstract: Recent technological advances in brain recording and artificial intelligence are propelling a new paradigm in neuroscience beyond the traditional controlled experiment. Rather than focusing on cued, repeated trials, naturalistic neuroscience studies neural processes underlying spontaneous behaviors performed in unconstrained settings. However, analyzing such unstructured data lacking a priori expe… ▽ More Recent technological advances in brain recording and artificial intelligence are propelling a new paradigm in neuroscience beyond the traditional controlled experiment. Rather than focusing on cued, repeated trials, naturalistic neuroscience studies neural processes underlying spontaneous behaviors performed in unconstrained settings. However, analyzing such unstructured data lacking a priori experimental design remains a significant challenge, especially when the data is multi-modal and long-term. Here we describe an automated approach for analyzing simultaneously recorded long-term, naturalistic electrocorticography (ECoG) and naturalistic behavior video data. We take a behavior-first approach to analyzing the long-term recordings. Using a combination of computer vision, discrete latent-variable modeling, and string pattern-matching on the behavioral video data, we find and annotate spontaneous human upper-limb movement events. We show results from our approach applied to data collected for 12 human subjects over 7--9 days for each subject. Our pipeline discovers and annotates over 40,000 instances of naturalistic human upper-limb movement events in the behavioral videos. Analysis of the simultaneously recorded brain data reveals neural signatures of movement that corroborate prior findings from traditional controlled experiments. We also prototype a decoder for a movement initiation detection task to demonstrate the efficacy of our pipeline as a source of training data for brain-computer interfacing applications. Our work addresses the unique data analysis challenges in studying naturalistic human behaviors, and contributes methods that may generalize to other neural recording modalities beyond ECoG. We publicly release our curated dataset, providing a resource to study naturalistic neural and behavioral variability at a scale not previously available. △ Less

Submitted 19 June, 2020; v1 submitted 22 January, 2020; originally announced January 2020.

arXiv:2001.00114 [pdf]

Expertise and Task Pressure in fNIRS-based brain Connectomes

Authors: F. Deligianni, H. Singh, H. N. Modi, S. Jahani, M. Yucel, A. Darzi, D. R. Leff, G. Z. Yang

Abstract: Acquisition of bimanual motor skills, critical in several applications ranging from robotic teleoperations to surgery, is associated with a protracted learning curve. Brain connectivity based on functional Near Infrared Spectroscopy (fNIRS) data has shown promising results in distinguishing experts from novice surgeons. However, it is less well understood how expertise-related disparity in brain c… ▽ More Acquisition of bimanual motor skills, critical in several applications ranging from robotic teleoperations to surgery, is associated with a protracted learning curve. Brain connectivity based on functional Near Infrared Spectroscopy (fNIRS) data has shown promising results in distinguishing experts from novice surgeons. However, it is less well understood how expertise-related disparity in brain connectivity is modulated by dynamic temporal demands experienced during a surgical task. In this study, we use fNIRS to examine the interplay between frontal and motor brain regions in a cohort of surgical residents of varying expertise performing a laparoscopic surgical task under temporal demand. The results demonstrate that prefrontal-motor connectivity in senior residents is more resilient to time pressure. Furthermore, certain global characteristics of brain connectomes, such as the small-world index, may be used to detect the presence of an underlying stressor. △ Less

Submitted 31 December, 2019; originally announced January 2020.

arXiv:1503.08053 [pdf, ps, other]

doi 10.1063/1.4916899

Reliability of UPO based control strategies in biological systems

Authors: Nagender Mishra, Maria Hasse, B. Biswal, Harinder P. Singh

Abstract: Presence of recurrent and statistically significant unstable periodic orbits (UPOs) in time series obtained from biological systems are now routinely used as evidence for low dimensional chaos . Extracting accurate dynamical information from the detected UPO trajectories are vital for successful control strategies that either aim to stabilize the system near the fixed point or steer the system awa… ▽ More Presence of recurrent and statistically significant unstable periodic orbits (UPOs) in time series obtained from biological systems are now routinely used as evidence for low dimensional chaos . Extracting accurate dynamical information from the detected UPO trajectories are vital for successful control strategies that either aim to stabilize the system near the fixed point or steer the system away from the periodic orbits. A hybrid UPO detection method from return maps that combines topological recurrence criterion, matrix fit algorithm and stringent criterion for fixed point location gives accurate and statistically significant UPOs even in the presence of significant noise. Geometry of the return map, frequency of UPOs visiting the same trajectory, length of the data set, strength of the noise and degree of nonstationarity affect the efficacy of the proposed method. Results suggest that establishing determinism from unambiguous UPO detection is often possible in short data sets with significant noise, but derived dynamical properties are rarely accurate and adequate for controlling the dynamics around these UPOs. A repeat chaos control experiment on epileptic hippocampal slices through more stringent control strategy and adaptive UPO tracking is reinterpreted in this context through simulation of similar control experiments on an analogous but stochastic computer model of epileptic brain slices. Reproduction of equivalent results suggests that far more stringent criteria are needed for linking apparent success of control in such experiments with possible determinism in the underlying dynamics. △ Less

Submitted 27 March, 2015; originally announced March 2015.

Comments: Accepted for Publication in CHAOS

arXiv:1012.3354 [pdf, ps, other]

Intercellular synchronization of diffusively coupled astrocytes

Authors: Md. Jahoor Alam, Latika Bhayana, Gurumayum Reenaroy Devi, Heisnam Dinachandra Singh, R. K. Brojen Singh, B. Indrajit Sharma

Abstract: We examine the synchrony of the dynamics of localized [Ca^{2+}]_i oscillations in internal pool of astrocytes via diffusing coupling of a network of such cells in a certain topology where cytosolic Ca^{2+} and inositol 1,4,5-triphosphate (IP3) are coupling molecules; and possible long range interaction among the cells. Our numerical results claim that the cells exhibit fairly well coordinated beha… ▽ More We examine the synchrony of the dynamics of localized [Ca^{2+}]_i oscillations in internal pool of astrocytes via diffusing coupling of a network of such cells in a certain topology where cytosolic Ca^{2+} and inositol 1,4,5-triphosphate (IP3) are coupling molecules; and possible long range interaction among the cells. Our numerical results claim that the cells exhibit fairly well coordinated behaviour through this coupling mechanism. It is also seen in the results that as the number of coupling molecular species is increased, the rate of synchrony is also increased correspondingly. Apart from the topology of the cells taken, as the number of coupled cells around any one of the cells in the system is increased, the cell process information faster. △ Less

Submitted 14 December, 2010; originally announced December 2010.

Comments: 7 pages, 7 figures

MSC Class: 34-XX ACM Class: I.6.1

arXiv:1012.2990 [pdf, ps, other]

Measurement of phase synchrony of coupled segmentation clocks

Authors: Md. Jahoor Alam, Latika Bhayana, Gurumayum Reenaroy Devi, Heisnam Dinachandra Singh, R. K. Brojen Singh, B. Indrajit Sharma

Abstract: The temporal behaviour of segmentation clock oscillations show phase synchrony via mean field like coupling of delta protein restricting to nearest neighbours only, in a configuration of cells arranged in a regular three dimensional array. We found the increase of amplitudes of oscillating dynamical variables of the cells as the activation rate of delta-notch signaling is increased, however, the f… ▽ More The temporal behaviour of segmentation clock oscillations show phase synchrony via mean field like coupling of delta protein restricting to nearest neighbours only, in a configuration of cells arranged in a regular three dimensional array. We found the increase of amplitudes of oscillating dynamical variables of the cells as the activation rate of delta-notch signaling is increased, however, the frequencies of oscillations are decreased correspondingly. Our results show the phase transition from desynchronized to synchronized behaviour by identifying three regimes, namely, desynchronized, transition and synchronized regimes supported by various qualitative and quantitative measurements. △ Less

Submitted 14 December, 2010; originally announced December 2010.

Comments: 6 pages, 6 figures

MSC Class: 34Nxx ACM Class: J.3; I.6.0

Showing 1–12 of 12 results for author: Singh, H