Skip to main content

Showing 1–50 of 267 results for author: Shyam

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.13932  [pdf, ps, other

    cs.SE cs.AI

    How Does LLM Reasoning Work for Code? A Survey and a Call to Action

    Authors: Ira Ceka, Saurabh Pujar, Irene Manotas, Gail Kaiser, Baishakhi Ray, Shyam Ramji

    Abstract: The rise of large language models (LLMs) has led to dramatic improvements across a wide range of natural language tasks. These advancements have extended into the domain of code, facilitating complex tasks such as code generation, translation, summarization, and repair. However, their utility for real-world deployment in-the-wild has only recently been studied, particularly on software engineering… ▽ More

    Submitted 16 June, 2025; originally announced June 2025.

  2. arXiv:2506.13805  [pdf, ps, other

    cs.CY cs.AI

    Dr. GPT Will See You Now, but Should It? Exploring the Benefits and Harms of Large Language Models in Medical Diagnosis using Crowdsourced Clinical Cases

    Authors: Bonam Mingole, Aditya Majumdar, Firdaus Ahmed Choudhury, Jennifer L. Kraschnewski, Shyam S. Sundar, Amulya Yadav

    Abstract: The proliferation of Large Language Models (LLMs) in high-stakes applications such as medical (self-)diagnosis and preliminary triage raises significant ethical and practical concerns about the effectiveness, appropriateness, and possible harmfulness of the use of these technologies for health-related concerns and queries. Some prior work has considered the effectiveness of LLMs in answering exper… ▽ More

    Submitted 13 June, 2025; originally announced June 2025.

  3. arXiv:2506.08311  [pdf, ps, other

    cs.SE cs.AI

    Understanding Software Engineering Agents Through the Lens of Traceability: An Empirical Study

    Authors: Ira Ceka, Saurabh Pujar, Shyam Ramji, Luca Buratti, Gail Kaiser, Baishakhi Ray

    Abstract: With the advent of large language models (LLMs), software engineering agents (SWE agents) have emerged as a powerful paradigm for automating a range of software tasks -- from code generation and repair to test case synthesis. These agents operate autonomously by interpreting user input and responding to environmental feedback. While various agent architectures have demonstrated strong empirical pe… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

  4. arXiv:2506.06756  [pdf, ps, other

    cs.SD eess.AS

    Can Quantized Audio Language Models Perform Zero-Shot Spoofing Detection?

    Authors: Bikash Dutta, Rishabh Ranjan, Shyam Sathvik, Mayank Vatsa, Richa Singh

    Abstract: Quantization is essential for deploying large audio language models (LALMs) efficiently in resource-constrained environments. However, its impact on complex tasks, such as zero-shot audio spoofing detection, remains underexplored. This study evaluates the zero-shot capabilities of five LALMs, GAMA, LTU-AS, MERaLiON, Qwen-Audio, and SALMONN, across three distinct datasets: ASVspoof2019, In-the-Wild… ▽ More

    Submitted 7 June, 2025; originally announced June 2025.

    Comments: Accepted in Interspeech 2025

  5. arXiv:2506.05321  [pdf, other

    cs.LG

    LSM-2: Learning from Incomplete Wearable Sensor Data

    Authors: Maxwell A. Xu, Girish Narayanswamy, Kumar Ayush, Dimitris Spathis, Shun Liao, Shyam A. Tailor, Ahmed Metwally, A. Ali Heydari, Yuwei Zhang, Jake Garrison, Samy Abdel-Ghaffar, Xuhai Xu, Ken Gu, Jacob Sunshine, Ming-Zher Poh, Yun Liu, Tim Althoff, Shrikanth Narayanan, Pushmeet Kohli, Mark Malhotra, Shwetak Patel, Yuzhe Yang, James M. Rehg, Xin Liu, Daniel McDuff

    Abstract: Foundation models, a cornerstone of recent advancements in machine learning, have predominantly thrived on complete and well-structured data. Wearable sensor data frequently suffers from significant missingness, posing a substantial challenge for self-supervised learning (SSL) models that typically assume complete data inputs. This paper introduces the second generation of Large Sensor Model (LSM-… ▽ More

    Submitted 5 June, 2025; originally announced June 2025.

    Comments: Xu and Narayanswamy are co-first authors. McDuff and Liu are co-last authors

  6. arXiv:2506.03910  [pdf, ps, other

    cs.LG

    Enhancing Experimental Efficiency in Materials Design: A Comparative Study of Taguchi and Machine Learning Methods

    Authors: Shyam Prabhu, P Akshay Kumar, Antov Selwinston, Pavan Taduvai, Shreya Bairi, Rohit Batra

    Abstract: Materials design problems often require optimizing multiple variables, rendering full factorial exploration impractical. Design of experiment (DOE) methods, such as Taguchi technique, are commonly used to efficiently sample the design space but they inherently lack the ability to capture non-linear dependency of process variables. In this work, we demonstrate how machine learning (ML) methods can… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: 7 pages, 3 figures

  7. arXiv:2505.24108  [pdf, ps, other

    cs.CV cs.LG

    Federated Foundation Model for GI Endoscopy Images

    Authors: Alina Devkota, Annahita Amireskandari, Joel Palko, Shyam Thakkar, Donald Adjeroh, Xiajun Jiang, Binod Bhattarai, Prashnna K. Gyawali

    Abstract: Gastrointestinal (GI) endoscopy is essential in identifying GI tract abnormalities in order to detect diseases in their early stages and improve patient outcomes. Although deep learning has shown success in supporting GI diagnostics and decision-making, these models require curated datasets with labels that are expensive to acquire. Foundation models offer a promising solution by learning general-… ▽ More

    Submitted 5 June, 2025; v1 submitted 29 May, 2025; originally announced May 2025.

    Comments: 11 pages, 11 figures, submitted to BHI2025

    ACM Class: I.2.10; I.4; I.5

  8. arXiv:2505.09610  [pdf, ps, other

    cs.AR cs.AI cs.CL cs.SE

    Customizing a Large Language Model for VHDL Design of High-Performance Microprocessors

    Authors: Nicolas Dupuis, Ravi Nair, Shyam Ramji, Sean McClintock, Nishant Chauhan, Priyanka Nagpal, Bart Blaner, Ken Valk, Leon Stok, Ruchir Puri

    Abstract: The use of Large Language Models (LLMs) in hardware design has taken off in recent years, principally through its incorporation in tools that increase chip designer productivity. There has been considerable discussion about the use of LLMs in RTL specifications of chip designs, for which the two most popular languages are Verilog and VHDL. LLMs and their use in Verilog design has received signific… ▽ More

    Submitted 14 May, 2025; originally announced May 2025.

  9. arXiv:2504.20435  [pdf, other

    cs.CV

    AI Assisted Cervical Cancer Screening for Cytology Samples in Developing Countries

    Authors: Love Panta, Suraj Prasai, Karishma Malla Vaidya, Shyam Shrestha, Suresh Manandhar

    Abstract: Cervical cancer remains a significant health challenge, with high incidence and mortality rates, particularly in transitioning countries. Conventional Liquid-Based Cytology(LBC) is a labor-intensive process, requires expert pathologists and is highly prone to errors, highlighting the need for more efficient screening methods. This paper introduces an innovative approach that integrates low-cost bi… ▽ More

    Submitted 29 April, 2025; originally announced April 2025.

  10. arXiv:2504.18715  [pdf, other

    cs.CL cs.SD eess.AS

    Spatial Speech Translation: Translating Across Space With Binaural Hearables

    Authors: Tuochao Chen, Qirui Wang, Runlin He, Shyam Gollakota

    Abstract: Imagine being in a crowded space where people speak a different language and having hearables that transform the auditory space into your native language, while preserving the spatial cues for all speakers. We introduce spatial speech translation, a novel concept for hearables that translate speakers in the wearer's environment, while maintaining the direction and unique voice characteristics of e… ▽ More

    Submitted 25 April, 2025; originally announced April 2025.

    Comments: Accepted by CHI2025

  11. arXiv:2504.04764  [pdf, other

    cs.CV cs.AI

    Enhancing Leaf Disease Classification Using GAT-GCN Hybrid Model

    Authors: Shyam Sundhar, Riya Sharma, Priyansh Maheshwari, Suvidha Rupesh Kumar, T. Sunil Kumar

    Abstract: Agriculture plays a critical role in the global economy, providing livelihoods and ensuring food security for billions. As innovative agricultural practices become more widespread, the risk of crop diseases has increased, highlighting the urgent need for efficient, low-intervention disease identification methods. This research presents a hybrid model combining Graph Attention Networks (GATs) and G… ▽ More

    Submitted 7 April, 2025; originally announced April 2025.

  12. arXiv:2503.21204  [pdf, other

    cs.RO

    Dimensional optimization of single-DOF planar rigid link-flapping mechanisms for high lift and low power

    Authors: Shyam Sunder Nishad, Anupam Saxena

    Abstract: Rigid link flapping mechanisms remain the most practical choice for flapping wing micro-aerial vehicles (MAVs) to carry useful payloads and onboard batteries for free flight due to their long-term durability and reliability. However, to achieve high agility and maneuverability-like insects-MAVs with these mechanisms require significant weight reduction. One approach involves using single-DOF plana… ▽ More

    Submitted 27 March, 2025; originally announced March 2025.

  13. arXiv:2503.16448  [pdf, ps, other

    cs.HC cs.CY cs.SI

    Towards Open Diversity-Aware Social Interactions

    Authors: Loizos Michael, Ivano Bison, Matteo Busso, Luca Cernuzzi, Amalia De Götzen, Shyam Diwakar, Kobi Gal, Amarsanaa Ganbold, George Gaskell, Daniel Gatica-Perez, Jessica Heesen, Daniele Miorandi, Salvador Ruiz-Correa, Laura Schelenz, Avi Segal, Carles Sierra, Hao Xu, Fausto Giunchiglia

    Abstract: Social Media and the Internet have catalyzed an unprecedented potential for exposure to human diversity in terms of demographics, talents, opinions, knowledge, and the like. However, this potential has not come with new, much needed, instruments and skills to harness it. This paper presents our work on promoting richer and deeper social relations through the design and development of the "Internet… ▽ More

    Submitted 17 February, 2025; originally announced March 2025.

  14. arXiv:2503.08796  [pdf, other

    cs.LG cs.AI

    Robust Multi-Objective Controlled Decoding of Large Language Models

    Authors: Seongho Son, William Bankes, Sangwoong Yoon, Shyam Sundhar Ramesh, Xiaohang Tang, Ilija Bogunovic

    Abstract: Test-time alignment of Large Language Models (LLMs) to human preferences offers a flexible way to generate responses aligned to diverse objectives without extensive retraining of LLMs. Existing methods achieve alignment to multiple objectives simultaneously (e.g., instruction-following, helpfulness, conciseness) by optimizing their corresponding reward functions. However, they often rely on predef… ▽ More

    Submitted 11 March, 2025; originally announced March 2025.

    Comments: 24 pages, 9 figures

  15. arXiv:2503.08437  [pdf, other

    cs.CV cs.AI cs.HC cs.RO

    ICPR 2024 Competition on Rider Intention Prediction

    Authors: Shankar Gangisetty, Abdul Wasi, Shyam Nandan Rai, C. V. Jawahar, Sajay Raj, Manish Prajapati, Ayesha Choudhary, Aaryadev Chandra, Dev Chandan, Shireen Chand, Suvaditya Mukherjee

    Abstract: The recent surge in the vehicle market has led to an alarming increase in road accidents. This underscores the critical importance of enhancing road safety measures, particularly for vulnerable road users like motorcyclists. Hence, we introduce the rider intention prediction (RIP) competition that aims to address challenges in rider safety by proactively predicting maneuvers before they occur, the… ▽ More

    Submitted 11 March, 2025; originally announced March 2025.

  16. arXiv:2503.04764  [pdf

    cs.CY

    Artificial intelligence for objective assessment of acrobatic movements: How to apply machine learning for identifying tumbling elements in cheer sports

    Authors: Sophia Wesely, Ella Hofer, Robin Curth, Shyam Paryani, Nicole Mills, Olaf Ueberschär, Julia Westermayr

    Abstract: Over the past four decades, cheerleading has evolved from a sideline activity at major sporting events into a professional, competitive sport with growing global popularity. Evaluating tumbling elements in cheerleading relies on both objective measures and subjective judgments, such as difficulty and execution quality. However, the complexity of tumbling - encompassing team synchronicity, ground i… ▽ More

    Submitted 11 February, 2025; originally announced March 2025.

    Comments: 17 pages, 11 figures

  17. arXiv:2502.10516  [pdf, ps, other

    cs.GT

    A new lower bound for multi-color discrepancy with applications to fair division

    Authors: Ioannis Caragiannis, Kasper Green Larsen, Sudarshan Shyam

    Abstract: A classical problem in combinatorics seeks colorings of low discrepancy. More concretely, the goal is to color the elements of a set system so that the number of appearances of any color among the elements in each set is as balanced as possible. We present a new lower bound for multi-color discrepancy, showing that there is a set system with $n$ subsets over a set of elements in which any $k$-colo… ▽ More

    Submitted 18 February, 2025; v1 submitted 14 February, 2025; originally announced February 2025.

  18. arXiv:2502.08073  [pdf

    cs.CY

    Large language models perpetuate bias in palliative care: development and analysis of the Palliative Care Adversarial Dataset (PCAD)

    Authors: Naomi Akhras, Fares Antaki, Fannie Mottet, Olivia Nguyen, Shyam Sawhney, Sabrina Bajwah, Joanna M Davies

    Abstract: Bias and inequity in palliative care disproportionately affect marginalised groups. Large language models (LLMs), such as GPT-4o, hold potential to enhance care but risk perpetuating biases present in their training data. This study aimed to systematically evaluate whether GPT-4o propagates biases in palliative care responses using adversarially designed datasets. In July 2024, GPT-4o was probed u… ▽ More

    Submitted 11 February, 2025; originally announced February 2025.

    Comments: The complete PCAD datasets are available on Figshare: dx.doi.org/10.6084/m9.figshare.28396016

  19. arXiv:2502.03950  [pdf, other

    cs.CV

    LR0.FM: Low-Res Benchmark and Improving Robustness for Zero-Shot Classification in Foundation Models

    Authors: Priyank Pathak, Shyam Marjit, Shruti Vyas, Yogesh S Rawat

    Abstract: Visual-language foundation Models (FMs) exhibit remarkable zero-shot generalization across diverse tasks, largely attributed to extensive pre-training on largescale datasets. However, their robustness on low-resolution/pixelated (LR) images, a common challenge in real-world scenarios, remains underexplored. We introduce LR0.FM, a comprehensive benchmark evaluating the impact of low resolution on t… ▽ More

    Submitted 18 May, 2025; v1 submitted 6 February, 2025; originally announced February 2025.

    Comments: Accepted to ICLR 2025

  20. arXiv:2502.03347  [pdf, other

    cs.CY cs.SI

    DiversityOne: A Multi-Country Smartphone Sensor Dataset for Everyday Life Behavior Modeling

    Authors: Matteo Busso, Andrea Bontempelli, Leonardo Javier Malcotti, Lakmal Meegahapola, Peter Kun, Shyam Diwakar, Chaitanya Nutakki, Marcelo Dario Rodas Britez, Hao Xu, Donglei Song, Salvador Ruiz Correa, Andrea-Rebeca Mendoza-Lara, George Gaskell, Sally Stares, Miriam Bidoglia, Amarsanaa Ganbold, Altangerel Chagnaa, Luca Cernuzzi, Alethia Hume, Ronald Chenu-Abente, Roy Alia Asiku, Ivan Kayongo, Daniel Gatica-Perez, Amalia de Götzen, Ivano Bison , et al. (1 additional authors not shown)

    Abstract: Understanding everyday life behavior of young adults through personal devices, e.g., smartphones and smartwatches, is key for various applications, from enhancing the user experience in mobile apps to enabling appropriate interventions in digital health apps. Towards this goal, previous studies have relied on datasets combining passive sensor data with human-provided annotations or self-reports. H… ▽ More

    Submitted 5 February, 2025; originally announced February 2025.

  21. arXiv:2502.01208  [pdf, ps, other

    cs.LG cs.CL

    On Almost Surely Safe Alignment of Large Language Models at Inference-Time

    Authors: Xiaotong Ji, Shyam Sundhar Ramesh, Matthieu Zimmer, Ilija Bogunovic, Jun Wang, Haitham Bou Ammar

    Abstract: We introduce a novel inference-time alignment approach for LLMs that aims to generate safe responses almost surely, i.e., with probability approaching one. Our approach models the generation of safe responses as a constrained Markov Decision Process (MDP) within the LLM's latent space. We augment a safety state that tracks the evolution of safety constraints and dynamically penalize unsafe generat… ▽ More

    Submitted 20 June, 2025; v1 submitted 3 February, 2025; originally announced February 2025.

  22. arXiv:2501.05108  [pdf, other

    cs.CV

    Optimizing Multitask Industrial Processes with Predictive Action Guidance

    Authors: Naval Kishore Mehta, Arvind, Shyam Sunder Prasad, Sumeet Saurav, Sanjay Singh

    Abstract: Monitoring complex assembly processes is critical for maintaining productivity and ensuring compliance with assembly standards. However, variability in human actions and subjective task preferences complicate accurate task anticipation and guidance. To address these challenges, we introduce the Multi-Modal Transformer Fusion and Recurrent Units (MMTFRU) Network for egocentric activity anticipation… ▽ More

    Submitted 9 January, 2025; originally announced January 2025.

  23. arXiv:2501.00078  [pdf, other

    cs.HC cs.AI cs.LG

    Human-like Bots for Tactical Shooters Using Compute-Efficient Sensors

    Authors: Niels Justesen, Maria Kaselimi, Sam Snodgrass, Miruna Vozaru, Matthew Schlegel, Jonas Wingren, Gabriella A. B. Barros, Tobias Mahlmann, Shyam Sudhakaran, Wesley Kerr, Albert Wang, Christoffer Holmgård, Georgios N. Yannakakis, Sebastian Risi, Julian Togelius

    Abstract: Artificial intelligence (AI) has enabled agents to master complex video games, from first-person shooters like Counter-Strike to real-time strategy games such as StarCraft II and racing games like Gran Turismo. While these achievements are notable, applying these AI methods in commercial video game production remains challenging due to computational constraints. In commercial scenarios, the majori… ▽ More

    Submitted 30 December, 2024; originally announced January 2025.

  24. arXiv:2412.18200  [pdf, other

    cs.NI

    Adapting Large Language Models for Improving TCP Fairness over WiFi

    Authors: Shyam Kumar Shrestha, Shiva Raj Pokhrel, Jonathan Kua

    Abstract: The new transmission control protocol (TCP) relies on Deep Learning (DL) for prediction and optimization, but requires significant manual effort to design deep neural networks (DNNs) and struggles with generalization in dynamic environments. Inspired by the success of large language models (LLMs), this study proposes TCP-LLM, a novel framework leveraging LLMs for TCP applications. TCP-LLM utilizes… ▽ More

    Submitted 24 December, 2024; originally announced December 2024.

  25. arXiv:2412.16482  [pdf, other

    cs.LG stat.ML

    Learn2Mix: Training Neural Networks Using Adaptive Data Integration

    Authors: Shyam Venkatasubramanian, Vahid Tarokh

    Abstract: Accelerating model convergence within resource-constrained environments is critical to ensure fast and efficient neural network training. This work presents learn2mix, a novel training strategy that adaptively adjusts class proportions within batches, focusing on classes with higher error rates. Unlike classical training methods that use static class proportions, learn2mix continually adapts class… ▽ More

    Submitted 13 February, 2025; v1 submitted 20 December, 2024; originally announced December 2024.

  26. arXiv:2411.18765  [pdf, ps, other

    cs.DS

    Near-Optimal Trace Reconstruction for Mildly Separated Strings

    Authors: Anders Aamand, Allen Liu, Shyam Narayanan

    Abstract: In the trace reconstruction problem our goal is to learn an unknown string $x\in \{0,1\}^n$ given independent traces of $x$. A trace is obtained by independently deleting each bit of $x$ with some probability $δ$ and concatenating the remaining bits. It is a major open question whether the trace reconstruction problem can be solved with a polynomial number of traces when the deletion probability… ▽ More

    Submitted 27 November, 2024; originally announced November 2024.

  27. arXiv:2411.15242  [pdf, other

    cs.LG cs.AI cs.CL

    The Zamba2 Suite: Technical Report

    Authors: Paolo Glorioso, Quentin Anthony, Yury Tokpanov, Anna Golubeva, Vasudev Shyam, James Whittington, Jonathan Pilault, Beren Millidge

    Abstract: In this technical report, we present the Zamba2 series -- a suite of 1.2B, 2.7B, and 7.4B parameter hybrid Mamba2-transformer models that achieve state of the art performance against the leading open-weights models of their class, while achieving substantial gains in inference latency, throughput, and memory efficiency. The Zamba2 series builds upon our initial work with Zamba1-7B, optimizing its… ▽ More

    Submitted 21 November, 2024; originally announced November 2024.

    Comments: 21/11/24 initial upload

  28. arXiv:2411.02298  [pdf, ps, other

    cs.LG cs.DS math.ST stat.ML

    Sample-Efficient Private Learning of Mixtures of Gaussians

    Authors: Hassan Ashtiani, Mahbod Majid, Shyam Narayanan

    Abstract: We study the problem of learning mixtures of Gaussians with approximate differential privacy. We prove that roughly $kd^2 + k^{1.5} d^{1.75} + k^2 d$ samples suffice to learn a mixture of $k$ arbitrary $d$-dimensional Gaussians up to low total variation distance, with differential privacy. Our work improves over the previous best result [AAL24b] (which required roughly $k^2 d^4$ samples) and is pr… ▽ More

    Submitted 4 November, 2024; originally announced November 2024.

    Comments: 52 pages. To appear in Neural Information Processing Systems (NeurIPS), 2024

  29. arXiv:2410.23087  [pdf, other

    cs.DS cs.LG stat.ML

    Statistical-Computational Trade-offs for Density Estimation

    Authors: Anders Aamand, Alexandr Andoni, Justin Y. Chen, Piotr Indyk, Shyam Narayanan, Sandeep Silwal, Haike Xu

    Abstract: We study the density estimation problem defined as follows: given $k$ distributions $p_1, \ldots, p_k$ over a discrete domain $[n]$, as well as a collection of samples chosen from a ``query'' distribution $q$ over $[n]$, output $p_i$ that is ``close'' to $q$. Recently~\cite{aamand2023data} gave the first and only known result that achieves sublinear bounds in {\em both} the sampling complexity and… ▽ More

    Submitted 30 October, 2024; originally announced October 2024.

    Comments: To appear at NeurIPS 2024

  30. arXiv:2410.19250  [pdf, other

    cs.CL

    Have LLMs Reopened the Pandora's Box of AI-Generated Fake News?

    Authors: Xinyu Wang, Wenbo Zhang, Sai Koneru, Hangzhi Guo, Bonam Mingole, S. Shyam Sundar, Sarah Rajtmajer, Amulya Yadav

    Abstract: With the rise of AI-generated content spewed at scale from large language models (LLMs), genuine concerns about the spread of fake news have intensified. The perceived ability of LLMs to produce convincing fake news at scale poses new challenges for both human and automated fake news detection systems. To address this gap, this paper presents the findings from a university-level competition that a… ▽ More

    Submitted 29 March, 2025; v1 submitted 24 October, 2024; originally announced October 2024.

  31. arXiv:2410.17863  [pdf, other

    eess.IV cs.CV cs.LG

    CASCRNet: An Atrous Spatial Pyramid Pooling and Shared Channel Residual based Network for Capsule Endoscopy

    Authors: K V Srinanda, M Manvith Prabhu, Shyam Lal

    Abstract: This manuscript summarizes work on the Capsule Vision Challenge 2024 by MISAHUB. To address the multi-class disease classification task, which is challenging due to the complexity and imbalance in the Capsule Vision challenge dataset, this paper proposes CASCRNet (Capsule endoscopy-Aspp-SCR-Network), a parameter-efficient and novel model that uses Shared Channel Residual (SCR) blocks and Atrous Sp… ▽ More

    Submitted 27 November, 2024; v1 submitted 23 October, 2024; originally announced October 2024.

    Comments: 8 pages, 4 figures

  32. arXiv:2410.15467  [pdf, other

    cs.CL cs.AI cs.HC

    Hey GPT, Can You be More Racist? Analysis from Crowdsourced Attempts to Elicit Biased Content from Generative AI

    Authors: Hangzhi Guo, Pranav Narayanan Venkit, Eunchae Jang, Mukund Srinath, Wenbo Zhang, Bonam Mingole, Vipul Gupta, Kush R. Varshney, S. Shyam Sundar, Amulya Yadav

    Abstract: The widespread adoption of large language models (LLMs) and generative AI (GenAI) tools across diverse applications has amplified the importance of addressing societal biases inherent within these technologies. While the NLP community has extensively studied LLM bias, research investigating how non-expert users perceive and interact with biases from these systems remains limited. As these technolo… ▽ More

    Submitted 20 October, 2024; originally announced October 2024.

  33. arXiv:2410.14643  [pdf, other

    cs.DS

    Instance-Optimality in I/O-Efficient Sampling and Sequential Estimation

    Authors: Shyam Narayanan, Václav Rozhoň, Jakub Tětek, Mikkel Thorup

    Abstract: Suppose we have a memory storing $0$s and $1$s and we want to estimate the frequency of $1$s by sampling. We want to do this I/O-efficiently, exploiting that each read gives a block of $B$ bits at unit cost; not just one bit. If the input consists of uniform blocks: either all 1s or all 0s, then sampling a whole block at a time does not reduce the number of samples needed for estimation. On the ot… ▽ More

    Submitted 18 October, 2024; originally announced October 2024.

    Comments: To appear at FOCS 2024

  34. arXiv:2410.13638  [pdf, other

    cs.LG cs.AI cs.HC

    Scaling Wearable Foundation Models

    Authors: Girish Narayanswamy, Xin Liu, Kumar Ayush, Yuzhe Yang, Xuhai Xu, Shun Liao, Jake Garrison, Shyam Tailor, Jake Sunshine, Yun Liu, Tim Althoff, Shrikanth Narayanan, Pushmeet Kohli, Jiening Zhan, Mark Malhotra, Shwetak Patel, Samy Abdel-Ghaffar, Daniel McDuff

    Abstract: Wearable sensors have become ubiquitous thanks to a variety of health tracking features. The resulting continuous and longitudinal measurements from everyday life generate large volumes of data; however, making sense of these observations for scientific and actionable insights is non-trivial. Inspired by the empirical success of generative modeling, where large neural networks learn powerful repre… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

  35. arXiv:2410.12219  [pdf, other

    cs.AI cs.CL cs.MM

    OmnixR: Evaluating Omni-modality Language Models on Reasoning across Modalities

    Authors: Lichang Chen, Hexiang Hu, Mingda Zhang, Yiwen Chen, Zifeng Wang, Yandong Li, Pranav Shyam, Tianyi Zhou, Heng Huang, Ming-Hsuan Yang, Boqing Gong

    Abstract: We introduce OmnixR, an evaluation suite designed to benchmark SoTA Omni-modality Language Models, such as GPT-4o and Gemini. Evaluating OLMs, which integrate multiple modalities such as text, vision, and audio, presents unique challenges. Particularly, the user message might often consist of multiple modalities, such that OLMs have to establish holistic understanding and reasoning across modaliti… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

    Comments: 19 pages, 6 figures, 12 tables

  36. arXiv:2410.05281  [pdf, other

    cs.CE cond-mat.mtrl-sci cs.LG physics.comp-ph

    Micrometer: Micromechanics Transformer for Predicting Mechanical Responses of Heterogeneous Materials

    Authors: Sifan Wang, Tong-Rui Liu, Shyam Sankaran, Paris Perdikaris

    Abstract: Heterogeneous materials, crucial in various engineering applications, exhibit complex multiscale behavior, which challenges the effectiveness of traditional computational methods. In this work, we introduce the Micromechanics Transformer ({\em Micrometer}), an artificial intelligence (AI) framework for predicting the mechanical response of heterogeneous materials, bridging the gap between advanced… ▽ More

    Submitted 23 September, 2024; originally announced October 2024.

    Comments: 36 pages, 12 figures, 9 tables

  37. arXiv:2410.05180  [pdf, other

    cs.CL

    Mitigating the Risk of Health Inequity Exacerbated by Large Language Models

    Authors: Yuelyu Ji, Wenhe Ma, Sonish Sivarajkumar, Hang Zhang, Eugene Mathew Sadhu, Zhuochun Li, Xizhi Wu, Shyam Visweswaran, Yanshan Wang

    Abstract: Recent advancements in large language models have demonstrated their potential in numerous medical applications, particularly in automating clinical trial matching for translational research and enhancing medical question answering for clinical decision support. However, our study shows that incorporating non decisive sociodemographic factors such as race, sex, income level, LGBT+ status, homeless… ▽ More

    Submitted 14 October, 2024; v1 submitted 7 October, 2024; originally announced October 2024.

  38. arXiv:2409.15255  [pdf, other

    cs.RO cs.CV

    ZeroSCD: Zero-Shot Street Scene Change Detection

    Authors: Shyam Sundar Kannan, Byung-Cheol Min

    Abstract: Scene Change Detection is a challenging task in computer vision and robotics that aims to identify differences between two images of the same scene captured at different times. Traditional change detection methods rely on training models that take these image pairs as input and estimate the changes, which requires large amounts of annotated data, a costly and time-consuming process. To overcome th… ▽ More

    Submitted 23 September, 2024; originally announced September 2024.

  39. arXiv:2409.12941  [pdf, other

    cs.CL

    Fact, Fetch, and Reason: A Unified Evaluation of Retrieval-Augmented Generation

    Authors: Satyapriya Krishna, Kalpesh Krishna, Anhad Mohananey, Steven Schwarcz, Adam Stambler, Shyam Upadhyay, Manaal Faruqui

    Abstract: Large Language Models (LLMs) have demonstrated significant performance improvements across various cognitive tasks. An emerging application is using LLMs to enhance retrieval-augmented generation (RAG) capabilities. These systems require LLMs to understand user queries, retrieve relevant information, and synthesize coherent and accurate responses. Given the increasing real-world deployment of such… ▽ More

    Submitted 24 January, 2025; v1 submitted 19 September, 2024; originally announced September 2024.

    Comments: Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics (NAACL), 2025

  40. arXiv:2409.10075  [pdf, other

    cs.LG

    Steinmetz Neural Networks for Complex-Valued Data

    Authors: Shyam Venkatasubramanian, Ali Pezeshki, Vahid Tarokh

    Abstract: We introduce a new approach to processing complex-valued data using DNNs consisting of parallel real-valued subnetworks with coupled outputs. Our proposed class of architectures, referred to as Steinmetz Neural Networks, incorporates multi-view learning to construct more interpretable representations in the latent space. Moreover, we present the Analytic Neural Network, which incorporates a consis… ▽ More

    Submitted 13 February, 2025; v1 submitted 16 September, 2024; originally announced September 2024.

  41. arXiv:2408.15878  [pdf, other

    cs.AR

    A Non-Traditional Approach to Assisting Data Address Translation

    Authors: Shyam Murthy, Gurindar S Sohi

    Abstract: This paper proposes a novel way to assist conventional data address translation. The approach, PC-Indexed Data Address Translation (PCAX), uses the PC of a load instruction, and not a data virtual address, to obtain the page table entry (PTE) for the data accessed by a load instruction. PCAX is intended to be used for a small subset of the static loads in a program. We observe that: (i) a small su… ▽ More

    Submitted 28 August, 2024; originally announced August 2024.

  42. Meta-Learning in Audio and Speech Processing: An End to End Comprehensive Review

    Authors: Athul Raimon, Shubha Masti, Shyam K Sateesh, Siyani Vengatagiri, Bhaskarjyoti Das

    Abstract: This survey overviews various meta-learning approaches used in audio and speech processing scenarios. Meta-learning is used where model performance needs to be maximized with minimum annotated samples, making it suitable for low-sample audio processing. Although the field has made some significant contributions, audio meta-learning still lacks the presence of comprehensive survey papers. We presen… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

    Comments: Survey Paper (15 pages, 1 figure)

  43. arXiv:2408.10328  [pdf, other

    cs.LG cs.AI cs.HC

    Decoding Human Emotions: Analyzing Multi-Channel EEG Data using LSTM Networks

    Authors: Shyam K Sateesh, Sparsh BK, Uma D

    Abstract: Emotion recognition from electroencephalogram (EEG) signals is a thriving field, particularly in neuroscience and Human-Computer Interaction (HCI). This study aims to understand and improve the predictive accuracy of emotional state classification through metrics such as valence, arousal, dominance, and likeness by applying a Long Short-Term Memory (LSTM) network to analyze EEG signals. Using a po… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

    Comments: 13 pages, 3 figures; accepted at ICDSA '24 Conference, Jaipur, India

  44. arXiv:2408.07832  [pdf, ps, other

    cs.CL cs.CV

    LADDER: Language-Driven Slice Discovery and Error Rectification in Vision Classifiers

    Authors: Shantanu Ghosh, Rayan Syed, Chenyu Wang, Vaibhav Choudhary, Binxu Li, Clare B. Poynton, Shyam Visweswaran, Kayhan Batmanghelich

    Abstract: Error slice discovery is crucial to diagnose and mitigate model errors. Current clustering or discrete attribute-based slice discovery methods face key limitations: 1) clustering results in incoherent slices, while assigning discrete attributes to slices leads to incomplete coverage of error patterns due to missing or insufficient attributes; 2) these methods lack complex reasoning, preventing the… ▽ More

    Submitted 29 May, 2025; v1 submitted 31 July, 2024; originally announced August 2024.

    Comments: ACL 2025 (Findings). Code: https://github.com/batmanlab/Ladder

  45. arXiv:2408.05190  [pdf, other

    cs.FL

    Parameterized Verification of Timed Networks with Clock Invariants

    Authors: Étienne André, Swen Jacobs, Shyam Lal Karra, Ocan Sankur

    Abstract: We consider parameterized verification problems for networks of timed automata (TAs) that communicate via disjunctive guards or lossy broadcast. To this end, we first consider disjunctive timed networks (DTNs), i.e., networks of TAs that communicate via location guards that enable a transition only if there is another process in a certain location. We solve for the first time the general case with… ▽ More

    Submitted 9 August, 2024; originally announced August 2024.

    Comments: 31 pages, 7 figures

    ACM Class: D.2.4; F.1.2

  46. arXiv:2408.04093  [pdf, other

    cs.LG cs.CL

    Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters

    Authors: Vasudev Shyam, Jonathan Pilault, Emily Shepperd, Quentin Anthony, Beren Millidge

    Abstract: Our formulation reveals that the reduction across the sequence axis can be efficiently computed in parallel through a tree reduction. Our algorithm, called Tree Attention, for parallelizing exact attention computation across multiple GPUs enables cross-device decoding to be performed asymptotically faster (up to 8x faster in our experiments) than state-of-the-art approaches such as Ring Attention,… ▽ More

    Submitted 9 February, 2025; v1 submitted 7 August, 2024; originally announced August 2024.

  47. arXiv:2408.00695  [pdf, other

    cs.LG cs.AI

    Accelerating Full Waveform Inversion By Transfer Learning

    Authors: Divya Shyam Singh, Leon Herrmann, Qing Sun, Tim Bürchner, Felix Dietrich, Stefan Kollmannsberger

    Abstract: Full waveform inversion (FWI) is a powerful tool for reconstructing material fields based on sparsely measured data obtained by wave propagation. For specific problems, discretizing the material field with a neural network (NN) improves the robustness and reconstruction quality of the corresponding optimization problem. We call this method NN-based FWI. Starting from an initial guess, the weights… ▽ More

    Submitted 1 August, 2024; originally announced August 2024.

  48. arXiv:2407.20284  [pdf

    cs.AI cs.LG

    MLtoGAI: Semantic Web based with Machine Learning for Enhanced Disease Prediction and Personalized Recommendations using Generative AI

    Authors: Shyam Dongre, Ritesh Chandra, Sonali Agarwal

    Abstract: In modern healthcare, addressing the complexities of accurate disease prediction and personalized recommendations is both crucial and challenging. This research introduces MLtoGAI, which integrates Semantic Web technology with Machine Learning (ML) to enhance disease prediction and offer user-friendly explanations through ChatGPT. The system comprises three key components: a reusable disease ontol… ▽ More

    Submitted 26 July, 2024; originally announced July 2024.

  49. Mapping the individual, social, and biospheric impacts of Foundation Models

    Authors: Andrés Domínguez Hernández, Shyam Krishna, Antonella Maia Perini, Michael Katell, SJ Bennett, Ann Borda, Youmna Hashem, Semeli Hadjiloizou, Sabeehah Mahomed, Smera Jayadeva, Mhairi Aitken, David Leslie

    Abstract: Responding to the rapid roll-out and large-scale commercialization of foundation models, large language models, and generative AI, an emerging body of work is shedding light on the myriad impacts these technologies are having across society. Such research is expansive, ranging from the production of discriminatory, fake and toxic outputs, and privacy and copyright violations, to the unjust extract… ▽ More

    Submitted 24 July, 2024; originally announced July 2024.

    Comments: ACM Conference on Fairness, Accountability, and Transparency (FAccT '24). Association for Computing Machinery, New York, NY, USA, 776-796

    Journal ref: In Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency (FAccT '24). Association for Computing Machinery, New York, NY, USA, 776-796

  50. arXiv:2407.11240  [pdf, other

    cs.AI cs.CL

    Making New Connections: LLMs as Puzzle Generators for The New York Times' Connections Word Game

    Authors: Tim Merino, Sam Earle, Ryan Sudhakaran, Shyam Sudhakaran, Julian Togelius

    Abstract: The Connections puzzle is a word association game published daily by The New York Times (NYT). In this game, players are asked to find groups of four words that are connected by a common theme. While solving a given Connections puzzle requires both semantic knowledge and abstract reasoning, generating novel puzzles additionally requires a form of metacognition: generators must be able to accuratel… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.