Showing 101–150 of 950 results for author: Srikanth

  1. arXiv:2407.04525  [pdf]

    q-bio.NC cs.AI cs.LG

    Enhancing learning in spiking neural networks through neuronal heterogeneity and neuromodulatory signaling

    Authors: Alejandro Rodriguez-Garcia, Jie Mei, Srikanth Ramaswamy

    Abstract: Recent progress in artificial intelligence (AI) has been driven by insights from neuroscience, particularly with the development of artificial neural networks (ANNs). This has significantly enhanced the replication of complex cognitive tasks such as vision and natural language processing. Despite these advances, ANNs struggle with continual learning, adaptable knowledge transfer, robustness, and r…

    Submitted 11 November, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

    Comments: 30 pages, 4 figures, 3 boxes

    MSC Class: 92B20

  2. arXiv:2407.04444  [pdf, other]

    cs.CL cs.SD eess.AS

    TokenVerse: Towards Unifying Speech and NLP Tasks via Transducer-based ASR

    Authors: Shashi Kumar, Srikanth Madikeri, Juan Zuluaga-Gomez, Iuliia Thorbecke, Esaú Villatoro-Tello, Sergio Burdisso, Petr Motlicek, Karthik Pandia, Aravind Ganapathiraju

    Abstract: In traditional conversational intelligence from speech, a cascaded pipeline is used, involving tasks such as voice activity detection, diarization, transcription, and subsequent processing with different NLP models for tasks like semantic endpointing and named entity recognition (NER). Our paper introduces TokenVerse, a single Transducer-based model designed to handle multiple tasks. This is achie…

    Submitted 8 October, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

    Comments: Accepted at EMNLP 2024 (Main Conference)

  3. arXiv:2407.04439  [pdf, other]

    eess.AS

    XLSR-Transducer: Streaming ASR for Self-Supervised Pretrained Models

    Authors: Shashi Kumar, Srikanth Madikeri, Juan Zuluaga-Gomez, Esaú Villatoro-Tello, Iuliia Thorbecke, Petr Motlicek, Manjunath K E, Aravind Ganapathiraju

    Abstract: Self-supervised pretrained models exhibit competitive performance in automatic speech recognition on finetuning, even with limited in-domain supervised data. However, popular pretrained models are not suitable for streaming ASR because they are trained with full attention context. In this paper, we introduce XLSR-Transducer, where the XLSR-53 model is used as encoder in transducer setup. Our exper…

    Submitted 8 October, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

    Comments: 5 pages, double column

  4. Simulations of cluster ultra-diffuse galaxies in MOND

    Authors: Srikanth T. Nagesh, Jonathan Freundlich, Benoit Famaey, Michal Bílek, Graeme Candlish, Rodrigo Ibata, Oliver Müller

    Abstract: Ultra-diffuse galaxies (UDGs) in the Coma cluster have velocity dispersion profiles that are in full agreement with the predictions of Modified Newtonian Dynamics (MOND) in isolation. However, the external field effect (EFE) from the cluster seriously deteriorates this agreement. It has been suggested that this could be related to the fact that UDGs are out-of-equilibrium objects whose stars have…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 10 Pages, 9 Figures, accepted by A&A

    Journal ref: A&A 690, A149 (2024)

  5. arXiv:2407.03387  [pdf, other]

    cs.SE cs.AI cs.CL

    ConCodeEval: Evaluating Large Language Models for Code Constraints in Domain-Specific Languages

    Authors: Mehant Kammakomati, Sameer Pimparkhede, Srikanth Tamilselvam, Prince Kumar, Pushpak Bhattacharyya

    Abstract: Recent work shows Large Language Models (LLMs) struggle to understand natural language constraints for various text generation tasks in zero- and few-shot settings. While, in the code domain, there is wide usage of constraints in code format to maintain the integrity of code written in Domain-Specific Languages (DSLs) like JSON and YAML which are widely used for system-level programming tasks in e…

    Submitted 24 March, 2025; v1 submitted 3 July, 2024; originally announced July 2024.

  6. Sequential Editing for Lifelong Training of Speech Recognition Models

    Authors: Devang Kulshreshtha, Saket Dingliwal, Brady Houston, Nikolaos Pappas, Srikanth Ronanki

    Abstract: Automatic Speech Recognition (ASR) traditionally assumes known domains, but adding data from a new domain raises concerns about computational inefficiencies linked to retraining models on both existing and new domains. Fine-tuning solely on new domain risks Catastrophic Forgetting (CF). To address this, Lifelong Learning (LLL) algorithms have been proposed for ASR. Prior research has explored tech…

    Submitted 18 September, 2024; v1 submitted 25 June, 2024; originally announced June 2024.

    Comments: INTERSPEECH 2024

  7. arXiv:2406.11925  [pdf, other]

    cs.SE cs.AI cs.CL

    DocCGen: Document-based Controlled Code Generation

    Authors: Sameer Pimparkhede, Mehant Kammakomati, Srikanth Tamilselvam, Prince Kumar, Ashok Pon Kumar, Pushpak Bhattacharyya

    Abstract: Recent developments show that Large Language Models (LLMs) produce state-of-the-art performance on natural language (NL) to code generation for resource-rich general-purpose languages like C++, Java, and Python. However, their practical usage for structured domain-specific languages (DSLs) such as YAML, JSON is limited due to domain-specific schema, grammar, and customizations generally unseen by…

    Submitted 3 July, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

  8. arXiv:2406.08900  [pdf, other]

    eess.AS cs.SD eess.SP

    On Improving Error Resilience of Neural End-to-End Speech Coders

    Authors: Kishan Gupta, Nicola Pia, Srikanth Korse, Andreas Brendel, Guillaume Fuchs, Markus Multrus

    Abstract: Error resilient tools like Packet Loss Concealment (PLC) and Forward Error Correction (FEC) are essential to maintain a reliable speech communication for applications like Voice over Internet Protocol (VoIP), where packets are frequently delayed and lost. In recent times, end-to-end neural speech codecs have seen a significant rise, due to their ability to transmit speech signal at low bitrates bu…

    Submitted 13 June, 2024; originally announced June 2024.

  9. arXiv:2406.07568  [pdf, other]

    cs.AI cs.LG cs.RO

    Reinforcement Learning Based Escape Route Generation in Low Visibility Environments

    Authors: Hari Srikanth

    Abstract: Structure fires are responsible for the majority of fire-related deaths nationwide. In order to assist with the rapid evacuation of trapped people, this paper proposes the use of a system that determines optimal search paths for firefighters and exit paths for civilians in real time based on environmental measurements. Through the use of a LiDAR mapping system evaluated and verified by a trust ran…

    Submitted 27 May, 2024; originally announced June 2024.

  10. arXiv:2406.06835  [pdf, other]

    cs.SE

    Large language models for generating rules, yay or nay?

    Authors: Shangeetha Sivasothy, Scott Barnett, Rena Logothetis, Mohamed Abdelrazek, Zafaryab Rasool, Srikanth Thudumu, Zac Brannelly

    Abstract: Engineering safety-critical systems such as medical devices and digital health intervention systems is complex, where long-term engagement with subject-matter experts (SMEs) is needed to capture the systems' expected behaviour. In this paper, we present a novel approach that leverages Large Language Models (LLMs), such as GPT-3.5 and GPT-4, as a potential world model to accelerate the engineering…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 5 pages, 1 figure

  11. arXiv:2406.00314  [pdf, other]

    cs.CL cs.AI cs.LG

    CASE: Efficient Curricular Data Pre-training for Building Assistive Psychology Expert Models

    Authors: Sarthak Harne, Monjoy Narayan Choudhury, Madhav Rao, TK Srikanth, Seema Mehrotra, Apoorva Vashisht, Aarushi Basu, Manjit Sodhi

    Abstract: The limited availability of psychologists necessitates efficient identification of individuals requiring urgent mental healthcare. This study explores the use of Natural Language Processing (NLP) pipelines to analyze text data from online mental health forums used for consultations. By analyzing forum posts, these pipelines can flag users who may require immediate professional attention. A crucial…

    Submitted 2 October, 2024; v1 submitted 1 June, 2024; originally announced June 2024.

  12. arXiv:2405.20350  [pdf, other]

    cs.LG

    Linear Function Approximation as a Computationally Efficient Method to Solve Classical Reinforcement Learning Challenges

    Authors: Hari Srikanth

    Abstract: Neural Network based approximations of the Value function make up the core of leading Policy Based methods such as Trust Regional Policy Optimization (TRPO) and Proximal Policy Optimization (PPO). While this adds significant value when dealing with very complex environments, we note that in sufficiently low State and action space environments, a computationally expensive Neural Network architectur…

    Submitted 27 May, 2024; originally announced May 2024.

  13. arXiv:2405.13019  [pdf, other]

    cs.CL cs.AI

    A Comprehensive Survey of Accelerated Generation Techniques in Large Language Models

    Authors: Mahsa Khoshnoodi, Vinija Jain, Mingye Gao, Malavika Srikanth, Aman Chadha

    Abstract: Despite the crucial importance of accelerating text generation in large language models (LLMs) for efficiently producing content, the sequential nature of this process often leads to high inference latency, posing challenges for real-time applications. Various techniques have been proposed and developed to address these challenges and improve efficiency. This paper presents a comprehensive survey…

    Submitted 24 May, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

  14. arXiv:2405.08317  [pdf, other]

    cs.CL cs.SD eess.AS

    SpeechGuard: Exploring the Adversarial Robustness of Multimodal Large Language Models

    Authors: Raghuveer Peri, Sai Muralidhar Jayanthi, Srikanth Ronanki, Anshu Bhatia, Karel Mundnich, Saket Dingliwal, Nilaksh Das, Zejiang Hou, Goeric Huybrechts, Srikanth Vishnubhotla, Daniel Garcia-Romero, Sundararajan Srinivasan, Kyu J Han, Katrin Kirchhoff

    Abstract: Integrated Speech and Large Language Models (SLMs) that can follow speech instructions and generate relevant text responses have gained popularity lately. However, the safety and robustness of these models remains largely unclear. In this work, we investigate the potential vulnerabilities of such instruction-following speech-language models to adversarial attacks and jailbreaking. Specifically, we…

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: 9+6 pages, Submitted to ACL 2024

  15. arXiv:2405.08295  [pdf, other]

    cs.CL cs.SD eess.AS

    SpeechVerse: A Large-scale Generalizable Audio Language Model

    Authors: Nilaksh Das, Saket Dingliwal, Srikanth Ronanki, Rohit Paturi, Zhaocheng Huang, Prashant Mathur, Jie Yuan, Dhanush Bekal, Xing Niu, Sai Muralidhar Jayanthi, Xilai Li, Karel Mundnich, Monica Sunkara, Sravan Bodapati, Sundararajan Srinivasan, Kyu J Han, Katrin Kirchhoff

    Abstract: Large language models (LLMs) have shown incredible proficiency in performing tasks that require semantic understanding of natural language instructions. Recently, many works have further expanded this capability to perceive multimodal audio and text inputs, but their capabilities are often limited to specific fine-tuned tasks such as automatic speech recognition and translation. We therefore devel…

    Submitted 24 March, 2025; v1 submitted 13 May, 2024; originally announced May 2024.

    Comments: Single Column, 13 pages

  16. Untangling individual cation roles in rock salt high-entropy oxides

    Authors: Saeed S. I. Almishal, Jacob T. Sivak, George N. Kotsonis, Yueze Tan, Matthew Furst, Dhiya Srikanth, Vincent H. Crespi, Venkatraman Gopalan, John T. Heron, Long-Qing Chen, Christina M. Rost, Susan B. Sinnott, Jon-Paul Maria

    Abstract: We unravel the distinct roles each cation plays in phase evolution, stability, and properties within Mg1/5Co1/5Ni1/5Cu1/5Zn1/5O high-entropy oxide (HEO) by integrating experimental findings, thermodynamic analyses, and first-principles predictions. Our approach is through sequentially removing one cation at a time from the five-component high-entropy oxide to create five four-component derivatives…

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: Saeed S. I. Almishal and Jacob T. Sivak contributed equally to this work

  17. arXiv:2405.06149  [pdf, other]

    cs.AI cs.CV

    DisBeaNet: A Deep Neural Network to augment Unmanned Surface Vessels for maritime situational awareness

    Authors: Srikanth Vemula, Eulises Franco, Michael Frye

    Abstract: Intelligent detection and tracking of the vessels on the sea play a significant role in conducting traffic avoidance in unmanned surface vessels(USV). Current traffic avoidance software relies mainly on Automated Identification System (AIS) and radar to track other vessels to avoid collisions and acts as a typical perception system to detect targets. However, in a contested environment, emitting r…

    Submitted 17 May, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

  18. arXiv:2405.03657  [pdf, other]

    quant-ph

    Computational complexity and quantum interpretations

    Authors: Vivek Kumar, M. P. Singh, R. Srikanth

    Abstract: In computational complexity theory, it remains to be understood whether $\textbf{BQP}$ is the same as $\textbf{BPP}$. Prima facie, one would expect that this mathematical question is quite unrelated to the foundational question of whether the quantum state is an element of reality or of the observer's knowledge. By contrast, here we argue that the complexity of computation in a physical theory may…

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: 6 pages, 3 figures

  19. arXiv:2405.02347  [pdf, other]

    cs.LG cs.AI cs.CL

    COPAL: Continual Pruning in Large Language Generative Models

    Authors: Srikanth Malla, Joon Hee Choi, Chiho Choi

    Abstract: Adapting pre-trained large language models to different domains in natural language processing requires two key considerations: high computational demands and model's inability to continual adaptation. To simultaneously address both issues, this paper presents COPAL (COntinual Pruning in Adaptive Language settings), an algorithm developed for pruning large language generative models under a contin…

    Submitted 14 June, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

    Comments: ICML2024

  20. arXiv:2404.19422  [pdf, other]

    cs.DS

    Efficient Algorithms for Earliest and Fastest Paths in Public Transport Networks

    Authors: Mithinti Srikanth, G. Ramakrishna

    Abstract: Public transport administrators rely on efficient algorithms for various problems that arise in public transport networks. In particular, our study focused on designing linear-time algorithms for two fundamental path problems: the earliest arrival time (\textsc{eat}) and the fastest path duration (\textsc{fpd}) on public transportation data. We conduct a comparative analysis with state-of-the-art…

    Submitted 30 April, 2024; originally announced April 2024.

  21. arXiv:2404.16326  [pdf, other]

    cs.LG

    NeuroKoopman Dynamic Causal Discovery

    Authors: Rahmat Adesunkanmi, Balaji Sesha Srikanth Pokuri, Ratnesh Kumar

    Abstract: In many real-world applications where the system dynamics has an underlying interdependency among its variables (such as power grid, economics, neuroscience, omics networks, environmental ecosystems, and others), one is often interested in knowing whether the past values of one time series influences the future of another, known as Granger causality, and the associated underlying dynamics. This pa…

    Submitted 25 April, 2024; originally announced April 2024.

  22. arXiv:2404.14672  [pdf, ps, other]

    math.RT math.AC

    Locally dualisable modular representations and local regularity

    Authors: Dave Benson, Srikanth B. Iyengar, Henning Krause, Julia Pevtsova

    Abstract: This work concerns the stable module category of a finite group over a field of characteristic dividing the group order. The minimal localising tensor ideals correspond to the non-maximal homogeneous prime ideals in the cohomology ring of the group. Given such a prime ideal, a number of characterisations of the dualisable objects in the corresponding tensor ideal are given. One characterisation of…

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: 34 pages

    MSC Class: 20C20 (primary); 18G80; 20J06 (secondary)

  23. arXiv:2404.11819  [pdf, other]

    cs.CV

    Utilizing Adversarial Examples for Bias Mitigation and Accuracy Enhancement

    Authors: Pushkar Shukla, Dhruv Srikanth, Lee Cohen, Matthew Turk

    Abstract: We propose a novel approach to mitigate biases in computer vision models by utilizing counterfactual generation and fine-tuning. While counterfactuals have been used to analyze and address biases in DNN models, the counterfactuals themselves are often generated from biased generative models, which can introduce additional biases or spurious correlations. To address this issue, we propose using adv…

    Submitted 27 June, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

  24. arXiv:2404.11717  [pdf, other]

    cs.CL

    How often are errors in natural language reasoning due to paraphrastic variability?

    Authors: Neha Srikanth, Marine Carpuat, Rachel Rudinger

    Abstract: Large language models have been shown to behave inconsistently in response to meaning-preserving paraphrastic inputs. At the same time, researchers evaluate the knowledge and reasoning abilities of these models with test evaluations that do not disaggregate the effect of paraphrastic variability on performance. We propose a metric for evaluating the paraphrastic consistency of natural language rea…

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: accepted to TACL 2024 (pre-MIT Press publication version)

  25. arXiv:2403.20305  [pdf, ps, other]

    cs.CC

    Local Correction of Linear Functions over the Boolean Cube

    Authors: Prashanth Amireddy, Amik Raj Behera, Manaswi Paraashar, Srikanth Srinivasan, Madhu Sudan

    Abstract: We consider the task of locally correcting, and locally list-correcting, multivariate linear functions over the domain $\{0,1\}^n$ over arbitrary fields and more generally Abelian groups. Such functions form error-correcting codes of relative distance $1/2$ and we give local-correction algorithms correcting up to nearly $1/4$-fraction errors making $\widetilde{\mathcal{O}}(\log n)$ queries. This q…

    Submitted 25 April, 2024; v1 submitted 29 March, 2024; originally announced March 2024.

    Comments: 61 pages, To Appear in the Proceedings of the 56th Annual ACM Symposium on Theory of Computing, June 24-28 2024, Vancouver, Canada. Added a remark on local testing in the revision

  26. arXiv:2403.19816  [pdf, other]

    cs.LG eess.SP

    The State of Lithium-Ion Battery Health Prognostics in the CPS Era

    Authors: Gaurav Shinde, Rohan Mohapatra, Pooja Krishan, Harish Garg, Srikanth Prabhu, Sanchari Das, Mohammad Masum, Saptarshi Sengupta

    Abstract: Lithium-ion batteries (Li-ion) have revolutionized energy storage technology, becoming integral to our daily lives by powering a diverse range of devices and applications. Their high energy density, fast power response, recyclability, and mobility advantages have made them the preferred choice for numerous sectors. This paper explores the seamless integration of Prognostics and Health Management w…

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: 18 pages, 12 figures, 6 tables. arXiv admin note: text overlap with arXiv:2310.00023

    MSC Class: 68; ACM Class: B.8.1

  27. arXiv:2403.15566  [pdf, ps, other]

    math.AC math.AG

    Non-existence of Ulrich modules over Cohen-Macaulay local rings

    Authors: Srikanth B. Iyengar, Linquan Ma, Mark E. Walker, Ziquan Zhuang

    Abstract: Over a Cohen-Macaulay local ring, the minimal number of generators of a maximal Cohen-Macaulay module is bounded above by its multiplicity. In 1984 Ulrich asked whether there always exist modules for which equality holds; such modules are known nowadays as Ulrich modules. We answer this question in the negative by constructing families of two dimensional Cohen-Macaulay local rings that have no Ulr…

    Submitted 12 March, 2025; v1 submitted 22 March, 2024; originally announced March 2024.

    Comments: 13 pages. The Introduction has been expanded, and a few minor corrections have been made in the text. This is slated to appear in the Commun. Am. Math. Soc.

    MSC Class: 13C13 (primary); 13H10; 13C14; 14F06 (secondary)

  28. arXiv:2403.13188  [pdf, ps, other]

    cs.CV cs.RO eess.IV

    Reflectivity Is All You Need!: Advancing LiDAR Semantic Segmentation

    Authors: Kasi Viswanath, Peng Jiang, Srikanth Saripalli

    Abstract: LiDAR semantic segmentation frameworks predominantly use geometry-based features to differentiate objects within a scan. Although these methods excel in scenarios with clear boundaries and distinct shapes, their performance declines in environments where boundaries are indistinct, particularly in off-road contexts. To address this issue, recent advances in 3D segmentation algorithms have aimed to…

    Submitted 30 September, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

  29. arXiv:2403.11367  [pdf, other]

    cs.CV cs.GR cs.RO

    3DGS-ReLoc: 3D Gaussian Splatting for Map Representation and Visual ReLocalization

    Authors: Peng Jiang, Gaurav Pandey, Srikanth Saripalli

    Abstract: This paper presents a novel system designed for 3D mapping and visual relocalization using 3D Gaussian Splatting. Our proposed method uses LiDAR and camera data to create accurate and visually plausible representations of the environment. By leveraging LiDAR data to initiate the training of the 3D Gaussian Splatting map, our system constructs maps that are both detailed and geometrically accurate.…

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: 8 pages, 7 figures

  30. arXiv:2403.10205  [pdf, other]

    cs.CL cs.AI

    Read between the lines -- Functionality Extraction From READMEs

    Authors: Prince Kumar, Srikanth Tamilselvam, Dinesh Garg

    Abstract: While text summarization is a well-known NLP task, in this paper, we introduce a novel and useful variant of it called functionality extraction from Git README files. Though this task is a text2text generation at an abstract level, it involves its own peculiarities and challenges making existing text2text generation systems not very useful. The motivation behind this task stems from a recent surge…

    Submitted 15 March, 2024; originally announced March 2024.

  31. arXiv:2403.01762  [pdf, other]

    quant-ph

    Contextuality, superlocality and nonclassicality of supernoncontextuality

    Authors: Chellasamy Jebarathinam, R. Srikanth

    Abstract: Contextuality is a fundamental manifestation of nonclassicality, indicating that for certain quantum correlations, sets of jointly measurable variables cannot be pre-assigned values independently of the measurement context. In this work, we characterize nonclassical quantum correlation beyond contextuality, in terms of supernoncontextuality, namely the higher-than-quantum hidden-variable(HV) dimen…

    Submitted 20 November, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: v2 (submitted to another journal): error in "which range noisy Peres box contextual" corrected- nonclassicality based on supernoncontextuality termed as "semi-device-independent contextuality"- a criterion and quantification of semi-device-independent contextuality studied - removed contents related to information-theoretic measures of simultaneous correlations in MUBs. 16 pages

  32. arXiv:2402.16226  [pdf]

    cond-mat.soft cond-mat.mtrl-sci

    Entropic Cohesion in Vitrimers

    Authors: Rahul Karmakar, Himanshu, Srikanth Sastry, Sanat K Kumar, Tarak K Patra

    Abstract: Vitrimers are polymer networks that can undergo bond exchange reactions. They dynamically rearrange their structures while maintaining their overall integrity, thus resulting in unique properties such as self-healing, reprocessability, shape memory and adaptability. Here, we show that the introduction of dynamic bonds directly impacts the polymer density. For a limiting case, where the dynamic bon…

    Submitted 25 February, 2024; originally announced February 2024.

  33. arXiv:2402.11760  [pdf, other]

    cs.LG cs.CV

    Reinforcement Learning as a Parsimonious Alternative to Prediction Cascades: A Case Study on Image Segmentation

    Authors: Bharat Srikishan, Anika Tabassum, Srikanth Allu, Ramakrishnan Kannan, Nikhil Muralidhar

    Abstract: Deep learning architectures have achieved state-of-the-art (SOTA) performance on computer vision tasks such as object detection and image segmentation. This may be attributed to the use of over-parameterized, monolithic deep learning architectures executed on large datasets. Although such architectures lead to increased accuracy, this is usually accompanied by a large increase in computation and m…

    Submitted 18 February, 2024; originally announced February 2024.

  34. arXiv:2402.08769  [pdf, other]

    cs.LG cs.DC

    FLASH: Federated Learning Across Simultaneous Heterogeneities

    Authors: Xiangyu Chang, Sk Miraj Ahmed, Srikanth V. Krishnamurthy, Basak Guler, Ananthram Swami, Samet Oymak, Amit K. Roy-Chowdhury

    Abstract: The key premise of federated learning (FL) is to train ML models across a diverse set of data-owners (clients), without exchanging local data. An overarching challenge to this date is client heterogeneity, which may arise not only from variations in data distribution, but also in data quality, as well as compute/communication latency. An integrated view of these diverse and concurrent sources of h…

    Submitted 13 February, 2024; originally announced February 2024.

  35. arXiv:2402.07118  [pdf, other]

    cs.HC cs.AI cs.LG eess.IV eess.SP

    Next-Generation Teleophthalmology: AI-enabled Quality Assessment Aiding Remote Smartphone-based Consultation

    Authors: Dhruv Srikanth, Jayang Gurung, N Satya Deepika, Vineet Joshi, Lopamudra Giri, Pravin Vaddavalli, Soumya Jana

    Abstract: Blindness and other eye diseases are a global health concern, particularly in low- and middle-income countries like India. In this regard, during the COVID-19 pandemic, teleophthalmology became a lifeline, and the Grabi attachment for smartphone-based eye imaging gained in use. However, quality of user-captured image often remained inadequate, requiring clinician vetting and delays. In this backdr…

    Submitted 7 August, 2024; v1 submitted 11 February, 2024; originally announced February 2024.

    Comments: 4 pages, Presented at IEEE EMBC 2024

  36. arXiv:2402.01968  [pdf, other]

    cs.MA cs.AI cs.LG

    A Survey on Context-Aware Multi-Agent Systems: Techniques, Challenges and Future Directions

    Authors: Hung Du, Srikanth Thudumu, Rajesh Vasa, Kon Mouzakis

    Abstract: Research interest in autonomous agents is on the rise as an emerging topic. The notable achievements of Large Language Models (LLMs) have demonstrated the considerable potential to attain human-like intelligence in autonomous agents. However, the challenge lies in enabling these agents to learn, reason, and navigate uncertainties in dynamic environments. Context awareness emerges as a pivotal elem…

    Submitted 29 January, 2025; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: 11 pages, 1 figure

  37. arXiv:2401.18009  [pdf]

    cond-mat.mtrl-sci

    Tailoring magnetic and hyperthermia properties of biphase iron oxide nanocubes through post-annealing

    Authors: Supun B. Attanayake, Amit Chanda, Raja Das, Manh-Huong Phan, Hariharan Srikanth

    Abstract: Tailoring the magnetic properties of iron oxide nanosystems is essential to expand their biomedical applications. In this study, the 34 nm iron oxide nanocubes with two phases consisting of Fe3O4 and alpha-Fe2O3 were annealed for 2 hours in the presence of O2, N2, He, and Ar to tune the respective phase volume fractions and control the magnetic properties. X-ray diffraction and magnetic measuremen…

    Submitted 31 January, 2024; originally announced January 2024.

  38. arXiv:2401.08138  [pdf, other]

    cs.SE cs.AI

    LLMs for Test Input Generation for Semantic Caches

    Authors: Zafaryab Rasool, Scott Barnett, David Willie, Stefanus Kurniawan, Sherwin Balugo, Srikanth Thudumu, Mohamed Abdelrazek

    Abstract: Large language models (LLMs) enable state-of-the-art semantic capabilities to be added to software systems such as semantic search of unstructured documents and text generation. However, these models are computationally expensive. At scale, the cost of serving thousands of users increases massively affecting also user experience. To address this problem, semantic caches are used to check for answe…

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: Accepted in International Conference on AI Engineering Software Engineering (CAIN 2024)

  39. arXiv:2401.05856  [pdf, other]

    cs.SE cs.AI

    Seven Failure Points When Engineering a Retrieval Augmented Generation System

    Authors: Scott Barnett, Stefanus Kurniawan, Srikanth Thudumu, Zach Brannelly, Mohamed Abdelrazek

    Abstract: Software engineers are increasingly adding semantic search capabilities to applications using a strategy known as Retrieval Augmented Generation (RAG). A RAG system involves finding documents that semantically match a query and then passing the documents to a large language model (LLM) such as ChatGPT to extract the right answer using an LLM. RAG systems aim to: a) reduce the problem of hallucinat…

    Submitted 11 January, 2024; originally announced January 2024.

  40. arXiv:2401.04130  [pdf, other]

    cs.LG cs.AI

    Plug-and-Play Transformer Modules for Test-Time Adaptation

    Authors: Xiangyu Chang, Sk Miraj Ahmed, Srikanth V. Krishnamurthy, Basak Guler, Ananthram Swami, Samet Oymak, Amit K. Roy-Chowdhury

    Abstract: Parameter-efficient tuning (PET) methods such as LoRA, Adapter, and Visual Prompt Tuning (VPT) have found success in enabling adaptation to new domains by tuning small modules within a transformer model. However, the number of domains encountered during test time can be very large, and the data is usually unlabeled. Thus, adaptation to new domains is challenging; it is also impractical to generate…

    Submitted 8 February, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

  41. arXiv:2401.02350  [pdf, ps, other]

    math.AC math.RT

    Locally dualizable modules abound

    Authors: Jon F. Carlson, Srikanth B. Iyengar

    Abstract: It is proved that given any prime ideal $\mathfrak{p}$ of height at least 2 in a countable commutative noetherian ring $A$, there are uncountably many more dualizable objects in the $\mathfrak{p}$-local $\mathfrak{p}$-torsion stratum of the derived category of $A$ than those that are obtained as retracts of images of perfect $A$-complexes. An analogous result is established dealing with the stable…

    Submitted 4 January, 2024; originally announced January 2024.

    Comments: 7 pages

    MSC Class: 13D09 (primary); 18G80; 14F08 (secondary)

  42. Off-Road LiDAR Intensity Based Semantic Segmentation

    Authors: Kasi Viswanath, Peng Jiang, Sujit PB, Srikanth Saripalli

    Abstract: LiDAR is used in autonomous driving to provide 3D spatial information and enable accurate perception in off-road environments, aiding in obstacle detection, mapping, and path planning. Learning-based LiDAR semantic segmentation utilizes machine learning techniques to automatically classify objects and regions in LiDAR point clouds. Learning-based models struggle in off-road environments due to the…

    Submitted 2 January, 2024; originally announced January 2024.

    Comments: Accepted to ISER 2023

  43. arXiv:2312.14350  [pdf]

    physics.flu-dyn

    Convolution Neural Network Model Framework to Predict Microscale Drag Force for Turbulent Flow in Porous Media

    Authors: Vishal Srikanth, Andrey V. Kuznetsov

    Abstract: Convolution Neural Networks (CNN) are well-suited to model the nonlinear relationship between the microscale geometry of porous media and the corresponding flow distribution, thereby accurately and efficiently coupling the flow behavior at the micro- and macro- scale levels. In this paper, we have identified the challenges involved in implementing CNNs for macroscale model closure in the turbulent…

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: 22 pages, 11 figures

  44. Concatenating quantum error-correcting codes with decoherence-free subspaces and vice versa

    Authors: Nihar Ranjan Dash, Sanjoy Dutta, R. Srikanth, Subhashish Banerjee

    Abstract: Quantum error-correcting codes (QECCs) and decoherence-free subspace (DFS) codes provide active and passive means, respectively, to address certain types of errors that arise during quantum computation. The latter technique is suitable to correct correlated errors with certain symmetries and the former to correct independent errors. The concatenation of a QECC and a DFS code results in a degenerat…

    Submitted 1 July, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: Close to the published version; 13 pages, 8 figures

    Journal ref: Phys. Rev. A 109, 062411 (2024)

  45. arXiv:2312.04490  [pdf, other]

    cond-mat.soft physics.chem-ph

    Assembling PNIPAM-Capped Gold Nanoparticles in Aqueous Solutions

    Authors: Binay P. Nayak, Hyeong Jin Kim, Srikanth Nayak, Wenjie Wang, Wei Bu, Surya K. Mallapragada, David Vaknin

    Abstract: Employing small angle X-ray scattering (SAXS), we explore the conditions under which the assembly of gold nanoparticles (AuNPs) grafted with the thermo-sensitive polymer Poly(N-isopropylacrylamide) (PNIPAM) emerges. We find that short-range order assembly emerges by combining the addition of electrolytes or poly-electrolytes with raising the temperature of the suspensions above the lower-critical…

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: Published at ACS Macro Letters, DOI - https://doi.org/10.1021/acsmacrolett.3c00617

    Journal ref: ACS Macro Lett. 2023, 12, XXX, 1659 to 1664

  46. arXiv:2312.02200  [pdf, other]

    cs.CV cs.AI stat.AP

    An Empirical Study of Automated Mislabel Detection in Real World Vision Datasets

    Authors: Maya Srikanth, Jeremy Irvin, Brian Wesley Hill, Felipe Godoy, Ishan Sabane, Andrew Y. Ng

    Abstract: Major advancements in computer vision can primarily be attributed to the use of labeled datasets. However, acquiring labels for datasets often results in errors which can harm model performance. Recent works have proposed methods to automatically identify mislabeled images, but developing strategies to effectively implement them in real world datasets has been sparsely explored. Towards improved d…

    Submitted 2 December, 2023; originally announced December 2023.

  47. arXiv:2312.01459  [pdf, other]

    cond-mat.soft cond-mat.dis-nn

    Yielding behaviour of active particles in bulk and in confinement

    Authors: Yagyik Goswami, G. V. Shivashankar, Srikanth Sastry

    Abstract: The investigation of collective behaviour in dense assemblies of self-propelled active particles has been motivated by a wide range of biological phenomena. Of particular interest are dynamical transitions of cellular and sub-cellular biological assemblies, including the cytoskeleton and the cell nucleus. Motivated by observations of mechanically induced changes in the dynamics of such systems, an…

    Submitted 3 December, 2023; originally announced December 2023.

    Comments: 7 pages, 4 figures and 10 pages of supp. matt.

  48. arXiv:2312.00434  [pdf, other]

    cs.LG cs.AI cs.CY

    PEFTDebias : Capturing debiasing information using PEFTs

    Authors: Sumit Agarwal, Aditya Srikanth Veerubhotla, Srijan Bansal

    Abstract: The increasing use of foundation models highlights the urgent need to address and eliminate implicit biases present in them that arise during pretraining. In this paper, we introduce PEFTDebias, a novel approach that employs parameter-efficient fine-tuning (PEFT) to mitigate the biases within foundation models. PEFTDebias consists of two main phases: an upstream phase for acquiring debiasing param…

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: EMNLP 2023

  49. arXiv:2311.15072  [pdf, other]

    cs.CV cs.AI

    Introducing SSBD+ Dataset with a Convolutional Pipeline for detecting Self-Stimulatory Behaviours in Children using raw videos

    Authors: Vaibhavi Lokegaonkar, Vijay Jaisankar, Pon Deepika, Madhav Rao, T K Srikanth, Sarbani Mallick, Manjit Sodhi

    Abstract: Conventionally, evaluation for the diagnosis of Autism spectrum disorder is done by a trained specialist through questionnaire-based formal assessments and by observation of behavioral cues under various settings to capture the early warning signs of autism. These evaluation techniques are highly subjective and their accuracy relies on the experience of the specialist. In this regard, machine lear…

    Submitted 25 November, 2023; originally announced November 2023.

    Comments: Copyright 2023 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

  50. arXiv:2311.14786  [pdf, other]

    cs.CV cs.AI cs.RO

    GPT-4V Takes the Wheel: Promises and Challenges for Pedestrian Behavior Prediction

    Authors: Jia Huang, Peng Jiang, Alvika Gautam, Srikanth Saripalli

    Abstract: Predicting pedestrian behavior is the key to ensure safety and reliability of autonomous vehicles. While deep learning methods have been promising by learning from annotated video frame sequences, they often fail to fully grasp the dynamic interactions between pedestrians and traffic, crucial for accurate predictions. These models also lack nuanced common sense reasoning. Moreover, the manual anno…

    Submitted 25 January, 2024; v1 submitted 24 November, 2023; originally announced November 2023.