Skip to main content

Showing 1–50 of 812 results for author: Krishnan

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.14571  [pdf, ps, other

    eess.SP cs.LG eess.AS

    The Perception of Phase Intercept Distortion and its Application in Data Augmentation

    Authors: Venkatakrishnan Vaidyanathapuram Krishnan, Nathaniel Condit-Schultz

    Abstract: Phase distortion refers to the alteration of the phase relationships between frequencies in a signal, which can be perceptible. In this paper, we discuss a special case of phase distortion known as phase-intercept distortion, which is created by a frequency-independent phase shift. We hypothesize that, though this form of distortion changes a signal's waveform significantly, the distortion is impe… ▽ More

    Submitted 17 June, 2025; originally announced June 2025.

    Comments: Submitted to the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) 2025

  2. arXiv:2506.13538  [pdf, ps, other

    cs.SE cs.ET

    Model Context Protocol (MCP) at First Glance: Studying the Security and Maintainability of MCP Servers

    Authors: Mohammed Mehedi Hasan, Hao Li, Emad Fallahzadeh, Gopi Krishnan Rajbahadur, Bram Adams, Ahmed E. Hassan

    Abstract: Although Foundation Models (FMs), such as GPT-4, are increasingly used in domains like finance and software engineering, reliance on textual interfaces limits these models' real-world interaction. To address this, FM providers introduced tool calling-triggering a proliferation of frameworks with distinct tool interfaces. In late 2024, Anthropic introduced the Model Context Protocol (MCP) to standa… ▽ More

    Submitted 19 June, 2025; v1 submitted 16 June, 2025; originally announced June 2025.

  3. arXiv:2506.12103  [pdf, other

    cs.AI cs.CY cs.LG

    The Amazon Nova Family of Models: Technical Report and Model Card

    Authors: Amazon AGI, Aaron Langford, Aayush Shah, Abhanshu Gupta, Abhimanyu Bhatter, Abhinav Goyal, Abhinav Mathur, Abhinav Mohanty, Abhishek Kumar, Abhishek Sethi, Abi Komma, Abner Pena, Achin Jain, Adam Kunysz, Adam Opyrchal, Adarsh Singh, Aditya Rawal, Adok Achar Budihal Prasad, Adrià de Gispert, Agnika Kumar, Aishwarya Aryamane, Ajay Nair, Akilan M, Akshaya Iyengar, Akshaya Vishnu Kudlu Shanbhogue , et al. (761 additional authors not shown)

    Abstract: We present Amazon Nova, a new generation of state-of-the-art foundation models that deliver frontier intelligence and industry-leading price performance. Amazon Nova Pro is a highly-capable multimodal model with the best combination of accuracy, speed, and cost for a wide range of tasks. Amazon Nova Lite is a low-cost multimodal model that is lightning fast for processing images, video, documents… ▽ More

    Submitted 17 March, 2025; originally announced June 2025.

    Comments: 48 pages, 10 figures

    Report number: 20250317

  4. arXiv:2506.11315  [pdf, ps, other

    cs.LG

    Sampling Imbalanced Data with Multi-objective Bilevel Optimization

    Authors: Karen Medlin, Sven Leyffer, Krishnan Raghavan

    Abstract: Two-class classification problems are often characterized by an imbalance between the number of majority and minority datapoints resulting in poor classification of the minority class in particular. Traditional approaches, such as reweighting the loss function or naïve resampling, risk overfitting and subsequently fail to improve classification because they do not consider the diversity between ma… ▽ More

    Submitted 12 June, 2025; originally announced June 2025.

  5. arXiv:2506.10999  [pdf

    cs.SE cs.AI

    Automated Validation of COBOL to Java Transformation

    Authors: Atul Kumar, Diptikalyan Saha, Toshikai Yasue, Kohichi Ono, Saravanan Krishnan, Sandeep Hans, Fumiko Satoh, Gerald Mitchell, Sachin Kumar

    Abstract: Recent advances in Large Language Model (LLM) based Generative AI techniques have made it feasible to translate enterpriselevel code from legacy languages such as COBOL to modern languages such as Java or Python. While the results of LLM-based automatic transformation are encouraging, the resulting code cannot be trusted to correctly translate the original code. We propose a framework and a tool t… ▽ More

    Submitted 14 April, 2025; originally announced June 2025.

    Comments: arXiv admin note: text overlap with arXiv:2504.10548

    Journal ref: ASE 2024

  6. arXiv:2506.10200  [pdf, ps, other

    cs.LG

    DynaSubVAE: Adaptive Subgrouping for Scalable and Robust OOD Detection

    Authors: Tina Behrouzi, Sana Tonekaboni, Rahul G. Krishnan, Anna Goldenberg

    Abstract: Real-world observational data often contain existing or emerging heterogeneous subpopulations that deviate from global patterns. The majority of models tend to overlook these underrepresented groups, leading to inaccurate or even harmful predictions. Existing solutions often rely on detecting these samples as Out-of-domain (OOD) rather than adapting the model to new emerging patterns. We introduce… ▽ More

    Submitted 11 June, 2025; originally announced June 2025.

  7. arXiv:2506.07918  [pdf, ps, other

    cs.LG stat.ML

    CausalPFN: Amortized Causal Effect Estimation via In-Context Learning

    Authors: Vahid Balazadeh, Hamidreza Kamkari, Valentin Thomas, Benson Li, Junwei Ma, Jesse C. Cresswell, Rahul G. Krishnan

    Abstract: Causal effect estimation from observational data is fundamental across various applications. However, selecting an appropriate estimator from dozens of specialized methods demands substantial manual effort and domain expertise. We present CausalPFN, a single transformer that amortizes this workflow: trained once on a large library of simulated data-generating processes that satisfy ignorability, i… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

  8. arXiv:2506.05047  [pdf, ps, other

    cs.LG

    Reliably detecting model failures in deployment without labels

    Authors: Viet Nguyen, Changjian Shui, Vijay Giri, Siddarth Arya, Amol Verma, Fahad Razak, Rahul G. Krishnan

    Abstract: The distribution of data changes over time; models operating operating in dynamic environments need retraining. But knowing when to retrain, without access to labels, is an open challenge since some, but not all shifts degrade model performance. This paper formalizes and addresses the problem of post-deployment deterioration (PDD) monitoring. We propose D3M, a practical and efficient monitoring al… ▽ More

    Submitted 9 June, 2025; v1 submitted 5 June, 2025; originally announced June 2025.

    Comments: 36 pages, 6 figures, 7 tables, submitted to NeurIPS 2025, includes theoretical analysis and extensive empirical evaluation across benchmark and clinical datasets. Code available at https://github.com/teivng/d3m. Viet Nguyen and Changjian Shui contributed equally

  9. arXiv:2505.23027  [pdf, ps, other

    cs.LG cs.AI cs.CV

    Diverse Prototypical Ensembles Improve Robustness to Subpopulation Shift

    Authors: Minh Nguyen Nhat To, Paul F RWilson, Viet Nguyen, Mohamed Harmanani, Michael Cooper, Fahimeh Fooladgar, Purang Abolmaesumi, Parvin Mousavi, Rahul G. Krishnan

    Abstract: The subpopulationtion shift, characterized by a disparity in subpopulation distributibetween theween the training and target datasets, can significantly degrade the performance of machine learning models. Current solutions to subpopulation shift involve modifying empirical risk minimization with re-weighting strategies to improve generalization. This strategy relies on assumptions about the number… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

    Comments: ICML 2025 Paper

  10. arXiv:2505.22704  [pdf, other

    cs.CL cs.AI

    Training Language Models to Generate Quality Code with Program Analysis Feedback

    Authors: Feng Yao, Zilong Wang, Liyuan Liu, Junxia Cui, Li Zhong, Xiaohan Fu, Haohui Mai, Vish Krishnan, Jianfeng Gao, Jingbo Shang

    Abstract: Code generation with large language models (LLMs), often termed vibe coding, is increasingly adopted in production but fails to ensure code quality, particularly in security (e.g., SQL injection vulnerabilities) and maintainability (e.g., missing type annotations). Existing methods, such as supervised fine-tuning and rule-based post-processing, rely on labor-intensive annotations or brittle heuris… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

    Comments: 10 pages, 3 figures

  11. arXiv:2505.22568  [pdf

    eess.IV cs.CV

    Multipath cycleGAN for harmonization of paired and unpaired low-dose lung computed tomography reconstruction kernels

    Authors: Aravind R. Krishnan, Thomas Z. Li, Lucas W. Remedios, Michael E. Kim, Chenyu Gao, Gaurav Rudravaram, Elyssa M. McMaster, Adam M. Saunders, Shunxing Bao, Kaiwen Xu, Lianrui Zuo, Kim L. Sandler, Fabien Maldonado, Yuankai Huo, Bennett A. Landman

    Abstract: Reconstruction kernels in computed tomography (CT) affect spatial resolution and noise characteristics, introducing systematic variability in quantitative imaging measurements such as emphysema quantification. Choosing an appropriate kernel is therefore essential for consistent quantitative analysis. We propose a multipath cycleGAN model for CT kernel harmonization, trained on a mixture of paired… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

  12. arXiv:2505.19105  [pdf, ps, other

    cs.LG

    Latent Mamba Operator for Partial Differential Equations

    Authors: Karn Tiwari, Niladri Dutta, N M Anoop Krishnan, Prathosh A P

    Abstract: Neural operators have emerged as powerful data-driven frameworks for solving Partial Differential Equations (PDEs), offering significant speedups over numerical methods. However, existing neural operators struggle with scalability in high-dimensional spaces, incur high computational costs, and face challenges in capturing continuous and long-range dependencies in PDE dynamics. To address these lim… ▽ More

    Submitted 28 May, 2025; v1 submitted 25 May, 2025; originally announced May 2025.

    Comments: Proceedings of the 42 nd International Conference on Machine Learning, Vancouver, Canada. PMLR 267, 2025. Copyright 2025 by the author(s)

  13. arXiv:2505.18495  [pdf, ps, other

    cs.LG

    Beyond Masked and Unmasked: Discrete Diffusion Models via Partial Masking

    Authors: Chen-Hao Chao, Wei-Fang Sun, Hanwen Liang, Chun-Yi Lee, Rahul G. Krishnan

    Abstract: Masked diffusion models (MDM) are powerful generative models for discrete data that generate samples by progressively unmasking tokens in a sequence. Each token can take one of two states: masked or unmasked. We observe that token sequences often remain unchanged between consecutive sampling steps; consequently, the model repeatedly processes identical inputs, leading to redundant computation. To… ▽ More

    Submitted 24 May, 2025; originally announced May 2025.

  14. arXiv:2505.15020  [pdf, ps, other

    cs.DC

    COSMIC: Enabling Full-Stack Co-Design and Optimization of Distributed Machine Learning Systems

    Authors: Aditi Raju, Jared Ni, William Won, Changhai Man, Srivatsan Krishnan, Srinivas Sridharan, Amir Yazdanbakhsh, Tushar Krishna, Vijay Janapa Reddi

    Abstract: Large-scale machine learning models necessitate distributed systems, posing significant design challenges due to the large parameter space across distinct design stacks. Existing studies often focus on optimizing individual system aspects in isolation. This work challenges this limitation and introduces COSMIC, a full-stack distributed machine learning systems environment enabling end-to-end simul… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

    Comments: 11 pages (excluding references), 10 figures, 6 tables

  15. arXiv:2505.13559  [pdf, ps, other

    cs.CL cs.LG

    CS-Sum: A Benchmark for Code-Switching Dialogue Summarization and the Limits of Large Language Models

    Authors: Sathya Krishnan Suresh, Tanmay Surana, Lim Zhi Hao, Eng Siong Chng

    Abstract: Code-switching (CS) poses a significant challenge for Large Language Models (LLMs), yet its comprehensibility remains underexplored in LLMs. We introduce CS-Sum, to evaluate the comprehensibility of CS by the LLMs through CS dialogue to English summarization. CS-Sum is the first benchmark for CS dialogue summarization across Mandarin-English (EN-ZH), Tamil-English (EN-TA), and Malay-English (EN-MS… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

    Comments: 17 pages, 5 figures and 11 tables

  16. arXiv:2505.10640  [pdf, ps, other

    cs.SE cs.AI cs.LG

    The Hitchhikers Guide to Production-ready Trustworthy Foundation Model powered Software (FMware)

    Authors: Kirill Vasilevski, Benjamin Rombaut, Gopi Krishnan Rajbahadur, Gustavo A. Oliva, Keheliya Gallaba, Filipe R. Cogo, Jiahuei Lin, Dayi Lin, Haoxiang Zhang, Bouyan Chen, Kishanthan Thangarajah, Ahmed E. Hassan, Zhen Ming Jiang

    Abstract: Foundation Models (FMs) such as Large Language Models (LLMs) are reshaping the software industry by enabling FMware, systems that integrate these FMs as core components. In this KDD 2025 tutorial, we present a comprehensive exploration of FMware that combines a curated catalogue of challenges with real-world production concerns. We first discuss the state of research and practice in building FMwar… ▽ More

    Submitted 2 June, 2025; v1 submitted 15 May, 2025; originally announced May 2025.

  17. arXiv:2505.08638  [pdf, other

    cs.AI cs.CL

    TRAIL: Trace Reasoning and Agentic Issue Localization

    Authors: Darshan Deshpande, Varun Gangal, Hersh Mehta, Jitin Krishnan, Anand Kannappan, Rebecca Qian

    Abstract: The increasing adoption of agentic workflows across diverse domains brings a critical need to scalably and systematically evaluate the complex traces these systems generate. Current evaluation methods depend on manual, domain-specific human analysis of lengthy workflow traces - an approach that does not scale with the growing complexity and volume of agentic outputs. Error analysis in these settin… ▽ More

    Submitted 19 May, 2025; v1 submitted 13 May, 2025; originally announced May 2025.

    Comments: Dataset: https://huggingface.co/datasets/PatronusAI/TRAIL

  18. arXiv:2505.06885  [pdf

    cs.SE cs.IR

    Incremental Analysis of Legacy Applications Using Knowledge Graphs for Application Modernization

    Authors: Saravanan Krishnan, Amith Singhee, Keerthi Narayan Raghunath, Alex Mathai, Atul Kumar, David Wenk

    Abstract: Industries such as banking, telecom, and airlines - o6en have large so6ware systems that are several decades old. Many of these systems are written in old programming languages such as COBOL, PL/1, Assembler, etc. In many cases, the documentation is not updated, and those who developed/designed these systems are no longer around. Understanding these systems for either modernization or even regular… ▽ More

    Submitted 11 May, 2025; originally announced May 2025.

  19. arXiv:2505.06151  [pdf, ps, other

    cs.CL

    Estimating Quality in Therapeutic Conversations: A Multi-Dimensional Natural Language Processing Framework

    Authors: Alice Rueda, Argyrios Perivolaris, Niloy Roy, Dylan Weston, Sarmed Shaya, Zachary Cote, Martin Ivanov, Bazen G. Teferra, Yuqi Wu, Sirisha Rambhatla, Divya Sharma, Andrew Greenshaw, Rakesh Jetly, Yanbo Zhang, Bo Cao, Reza Samavi, Sridhar Krishnan, Venkat Bhat

    Abstract: Engagement between client and therapist is a critical determinant of therapeutic success. We propose a multi-dimensional natural language processing (NLP) framework that objectively classifies engagement quality in counseling sessions based on textual transcripts. Using 253 motivational interviewing transcripts (150 high-quality, 103 low-quality), we extracted 42 features across four domains: conv… ▽ More

    Submitted 9 May, 2025; originally announced May 2025.

    Comments: 12 pages, 4 figures, 7 tables

  20. arXiv:2505.05885  [pdf, ps, other

    cs.DB cs.IR

    Cost-Effective, Low Latency Vector Search with Azure Cosmos DB

    Authors: Nitish Upreti, Krishnan Sundaram, Hari Sudan Sundar, Samer Boshra, Balachandar Perumalswamy, Shivam Atri, Martin Chisholm, Revti Raman Singh, Greg Yang, Subramanyam Pattipaka, Tamara Hass, Nitesh Dudhey, James Codella, Mark Hildebrand, Magdalen Manohar, Jack Moffitt, Haiyang Xu, Naren Datha, Suryansh Gupta, Ravishankar Krishnaswamy, Prashant Gupta, Abhishek Sahu, Ritika Mor, Santosh Kulkarni, Hemeswari Varada , et al. (11 additional authors not shown)

    Abstract: Vector indexing enables semantic search over diverse corpora and has become an important interface to databases for both users and AI agents. Efficient vector search requires deep optimizations in database systems. This has motivated a new class of specialized vector databases that optimize for vector search quality and cost. Instead, we argue that a scalable, high-performance, and cost-efficient… ▽ More

    Submitted 9 May, 2025; originally announced May 2025.

    ACM Class: H.3.3

  21. arXiv:2505.05143  [pdf, ps, other

    cs.LG

    Sparse Training from Random Initialization: Aligning Lottery Ticket Masks using Weight Symmetry

    Authors: Mohammed Adnan, Rohan Jain, Ekansh Sharma, Rahul Krishnan, Yani Ioannou

    Abstract: The Lottery Ticket Hypothesis (LTH) suggests there exists a sparse LTH mask and weights that achieve the same generalization performance as the dense model while using significantly fewer parameters. However, finding a LTH solution is computationally expensive, and a LTH sparsity mask does not generalize to other random weight initializations. Recent work has suggested that neural networks trained… ▽ More

    Submitted 9 June, 2025; v1 submitted 8 May, 2025; originally announced May 2025.

    Comments: Accepted at ICML 2025

  22. arXiv:2505.01482  [pdf, other

    cs.AI

    Understanding LLM Scientific Reasoning through Promptings and Model's Explanation on the Answers

    Authors: Alice Rueda, Mohammed S. Hassan, Argyrios Perivolaris, Bazen G. Teferra, Reza Samavi, Sirisha Rambhatla, Yuqi Wu, Yanbo Zhang, Bo Cao, Divya Sharma, Sridhar Krishnan Venkat Bhat

    Abstract: Large language models (LLMs) have demonstrated remarkable capabilities in natural language understanding, reasoning, and problem-solving across various domains. However, their ability to perform complex, multi-step reasoning task-essential for applications in science, medicine, and law-remains an area of active investigation. This paper examines the reasoning capabilities of contemporary LLMs, ana… ▽ More

    Submitted 2 May, 2025; originally announced May 2025.

  23. arXiv:2505.00467  [pdf, ps, other

    cs.CL cs.AI

    Red Teaming Large Language Models for Healthcare

    Authors: Vahid Balazadeh, Michael Cooper, David Pellow, Atousa Assadi, Jennifer Bell, Mark Coastworth, Kaivalya Deshpande, Jim Fackler, Gabriel Funingana, Spencer Gable-Cook, Anirudh Gangadhar, Abhishek Jaiswal, Sumanth Kaja, Christopher Khoury, Amrit Krishnan, Randy Lin, Kaden McKeen, Sara Naimimohasses, Khashayar Namdar, Aviraj Newatia, Allan Pang, Anshul Pattoo, Sameer Peesapati, Diana Prepelita, Bogdana Rakova , et al. (10 additional authors not shown)

    Abstract: We present the design process and findings of the pre-conference workshop at the Machine Learning for Healthcare Conference (2024) entitled Red Teaming Large Language Models for Healthcare, which took place on August 15, 2024. Conference participants, comprising a mix of computational and clinical expertise, attempted to discover vulnerabilities -- realistic clinical prompts for which a large lang… ▽ More

    Submitted 1 May, 2025; originally announced May 2025.

  24. arXiv:2504.21030  [pdf, ps, other

    cs.MA cs.AI

    Advancing Multi-Agent Systems Through Model Context Protocol: Architecture, Implementation, and Applications

    Authors: Naveen Krishnan

    Abstract: Multi-agent systems represent a significant advancement in artificial intelligence, enabling complex problem-solving through coordinated specialized agents. However, these systems face fundamental challenges in context management, coordination efficiency, and scalable operation. This paper introduces a comprehensive framework for advancing multi-agent systems through Model Context Protocol (MCP),… ▽ More

    Submitted 25 April, 2025; originally announced April 2025.

  25. arXiv:2504.20250  [pdf, other

    cs.LG q-fin.GN q-fin.ST stat.AP stat.ML

    Financial Data Analysis with Robust Federated Logistic Regression

    Authors: Kun Yang, Nikhil Krishnan, Sanjeev R. Kulkarni

    Abstract: In this study, we focus on the analysis of financial data in a federated setting, wherein data is distributed across multiple clients or locations, and the raw data never leaves the local devices. Our primary focus is not only on the development of efficient learning frameworks (for protecting user data privacy) in the field of federated learning but also on the importance of designing models that… ▽ More

    Submitted 28 April, 2025; originally announced April 2025.

  26. arXiv:2504.20067  [pdf, other

    cs.DC

    Scalable and Performant Data Loading

    Authors: Moto Hira, Christian Puhrsch, Valentin Andrei, Roman Malinovskyy, Gael Le Lan, Abhinandan Krishnan, Joseph Cummings, Miguel Martin, Gokul Gunasekaran, Yuta Inoue, Alex J Turner, Raghuraman Krishnamoorthi

    Abstract: We present SPDL (Scalable and Performant Data Loading), an open-source, framework-agnostic library designed for efficiently loading array data to GPU. Data loading is often a bottleneck in AI applications, and is challenging to optimize because it requires coordination of network calls, CPU-bound tasks, and GPU device transfer. On top of that, Python's GIL (Global Interpreter Lock) makes it diffic… ▽ More

    Submitted 23 April, 2025; originally announced April 2025.

    Comments: For the latest version of the software please visit https://facebookresearch.github.io/spdl/main/

  27. arXiv:2504.19916  [pdf, ps, other

    cs.IT

    An Achievability Bound for Type-Based Unsourced Multiple Access

    Authors: Deekshith Pathayappilly Krishnan, Kaan Okumus, Khac-Hoang Ngo, Giuseppe Durisi

    Abstract: We derive an achievability bound to quantify the performance of a type-based unsourced multiple access system -- an information-theoretic model for grant-free multiple access with correlated messages. The bound extends available achievability results for the per-user error probability in the unsourced multiple access framework, where, different from our setup, message collisions are treated as err… ▽ More

    Submitted 28 April, 2025; originally announced April 2025.

    Comments: 8 pages, 1 figure. Extended version of a paper accepted for presentation at ISIT 2025

  28. arXiv:2504.17277  [pdf, other

    cs.LG cs.AI

    ExOSITO: Explainable Off-Policy Learning with Side Information for Intensive Care Unit Blood Test Orders

    Authors: Zongliang Ji, Andre Carlos Kajdacsy-Balla Amaral, Anna Goldenberg, Rahul G. Krishnan

    Abstract: Ordering a minimal subset of lab tests for patients in the intensive care unit (ICU) can be challenging. Care teams must balance between ensuring the availability of the right information and reducing the clinical burden and costs associated with each lab test order. Most in-patient settings experience frequent over-ordering of lab tests, but are now aiming to reduce this burden on both hospital r… ▽ More

    Submitted 24 April, 2025; originally announced April 2025.

    Comments: Accepted to the Conference on Health, Inference, and Learning (CHIL) 2025

  29. arXiv:2504.16916  [pdf, ps, other

    cs.RO eess.SY

    Zero-shot Sim-to-Real Transfer for Reinforcement Learning-based Visual Servoing of Soft Continuum Arms

    Authors: Hsin-Jung Yang, Mahsa Khosravi, Benjamin Walt, Girish Krishnan, Soumik Sarkar

    Abstract: Soft continuum arms (SCAs) soft and deformable nature presents challenges in modeling and control due to their infinite degrees of freedom and non-linear behavior. This work introduces a reinforcement learning (RL)-based framework for visual servoing tasks on SCAs with zero-shot sim-to-real transfer capabilities, demonstrated on a single section pneumatic manipulator capable of bending and twistin… ▽ More

    Submitted 23 April, 2025; originally announced April 2025.

    Comments: The 7th Annual Learning for Dynamics & Control Conference (L4DC) 2025

  30. arXiv:2504.16743  [pdf

    cs.SE cs.CR

    Implementing AI Bill of Materials (AI BOM) with SPDX 3.0: A Comprehensive Guide to Creating AI and Dataset Bill of Materials

    Authors: Karen Bennet, Gopi Krishnan Rajbahadur, Arthit Suriyawongkul, Kate Stewart

    Abstract: A Software Bill of Materials (SBOM) is becoming an increasingly important tool in regulatory and technical spaces to introduce more transparency and security into a project's software supply chain. Artificial intelligence (AI) projects face unique challenges beyond the security of their software, and thus require a more expansive approach to a bill of materials. In this report, we introduce the… ▽ More

    Submitted 23 April, 2025; originally announced April 2025.

    Comments: 71 pages, 11 tables, published on https://www.linuxfoundation.org/research/ai-bom

    ACM Class: D.2.9; K.6.3; K.6.4; I.2.m

  31. arXiv:2504.16556  [pdf, ps, other

    cs.GT

    How Irrationality Shapes Nash Equilibria: A Prospect-Theoretic Perspective

    Authors: Ashok Krishnan K. S., Hélène Le Cadre, Ana Bušić

    Abstract: Noncooperative games with uncertain payoffs have been classically studied under the expected-utility theory framework, which relies on the strong assumption that agents behave rationally. However, simple experiments on human decision makers found them to be not fully rational, due to their subjective risk perception. Prospect theory was proposed as an empirically-grounded model to incorporate irra… ▽ More

    Submitted 23 April, 2025; originally announced April 2025.

  32. arXiv:2504.10548  [pdf

    cs.SE cs.AI

    Automated Testing of COBOL to Java Transformation

    Authors: Sandeep Hans, Atul Kumar, Toshikai Yasue, Kouichi Ono, Saravanan Krishnan, Devika Sondhi, Fumiko Satoh, Gerald Mitchell, Sachin Kumar, Diptikalyan Saha

    Abstract: Recent advances in Large Language Model (LLM) based Generative AI techniques have made it feasible to translate enterprise-level code from legacy languages such as COBOL to modern languages such as Java or Python. While the results of LLM-based automatic transformation are encouraging, the resulting code cannot be trusted to correctly translate the original code, making manual validation of transl… ▽ More

    Submitted 14 April, 2025; originally announced April 2025.

  33. arXiv:2504.09430  [pdf, other

    eess.IV cs.CV

    Predicting ulcer in H&E images of inflammatory bowel disease using domain-knowledge-driven graph neural network

    Authors: Ruiwen Ding, Lin Li, Rajath Soans, Tosha Shah, Radha Krishnan, Marc Alexander Sze, Sasha Lukyanov, Yash Deshpande, Antong Chen

    Abstract: Inflammatory bowel disease (IBD) involves chronic inflammation of the digestive tract, with treatment options often burdened by adverse effects. Identifying biomarkers for personalized treatment is crucial. While immune cells play a key role in IBD, accurately identifying ulcer regions in whole slide images (WSIs) is essential for characterizing these cells and exploring potential therapeutics. Mu… ▽ More

    Submitted 13 April, 2025; originally announced April 2025.

    Comments: Work accepted at ISBI 2025

  34. arXiv:2504.06227  [pdf, other

    cs.CL

    LExT: Towards Evaluating Trustworthiness of Natural Language Explanations

    Authors: Krithi Shailya, Shreya Rajpal, Gokul S Krishnan, Balaraman Ravindran

    Abstract: As Large Language Models (LLMs) become increasingly integrated into high-stakes domains, there have been several approaches proposed toward generating natural language explanations. These explanations are crucial for enhancing the interpretability of a model, especially in sensitive domains like healthcare, where transparency and reliability are key. In light of such explanations being generated b… ▽ More

    Submitted 8 April, 2025; originally announced April 2025.

  35. arXiv:2504.05058  [pdf, other

    cs.CL

    Not All Data Are Unlearned Equally

    Authors: Aravind Krishnan, Siva Reddy, Marius Mosbach

    Abstract: Machine unlearning is concerned with the task of removing knowledge learned from particular data points from a trained model. In the context of large language models (LLMs), unlearning has recently received increased attention, particularly for removing knowledge about named entities from models for privacy purposes. While various approaches have been proposed to address the unlearning problem, mo… ▽ More

    Submitted 24 April, 2025; v1 submitted 7 April, 2025; originally announced April 2025.

  36. arXiv:2504.04717  [pdf, other

    cs.CL cs.AI

    Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language Models

    Authors: Yubo Li, Xiaobin Shen, Xinyu Yao, Xueying Ding, Yidi Miao, Ramayya Krishnan, Rema Padman

    Abstract: Recent advancements in large language models (LLMs) have revolutionized their ability to handle single-turn tasks, yet real-world applications demand sophisticated multi-turn interactions. This survey provides a comprehensive review of recent advancements in evaluating and enhancing multi-turn interactions in LLMs. Focusing on task-specific scenarios, from instruction following in diverse domains… ▽ More

    Submitted 13 May, 2025; v1 submitted 7 April, 2025; originally announced April 2025.

  37. arXiv:2504.04404  [pdf, other

    cs.NI cs.AR

    OffRAC: Offloading Through Remote Accelerator Calls

    Authors: Ziyi Yang, Krishnan B. Iyer, Yixi Chen, Ran Shu, Zsolt István, Marco Canini, Suhaib A. Fahmy

    Abstract: Modern applications increasingly demand ultra-low latency for data processing, often facilitated by host-controlled accelerators like GPUs and FPGAs. However, significant delays result from host involvement in accessing accelerators. To address this limitation, we introduce a novel paradigm we call Offloading through Remote Accelerator Calls (OffRAC), which elevates accelerators to first-class com… ▽ More

    Submitted 8 April, 2025; v1 submitted 6 April, 2025; originally announced April 2025.

    Comments: 19 pages

  38. arXiv:2503.22363  [pdf, ps, other

    cs.CV cs.AI

    ForcePose: A Deep Learning Approach for Force Calculation Based on Action Recognition Using MediaPipe Pose Estimation Combined with Object Detection

    Authors: Nandakishor M, Vrinda Govind V, Anuradha Puthalath, Anzy L, Swathi P S, Aswathi R, Devaprabha A R, Varsha Raj, Midhuna Krishnan K, Akhila Anilkumar T V, Yamuna P V

    Abstract: Force estimation in human-object interactions is crucial for various fields like ergonomics, physical therapy, and sports science. Traditional methods depend on specialized equipment such as force plates and sensors, which makes accurate assessments both expensive and restricted to laboratory settings. In this paper, we introduce ForcePose, a novel deep learning framework that estimates applied fo… ▽ More

    Submitted 28 March, 2025; originally announced March 2025.

  39. arXiv:2503.22353  [pdf, ps, other

    cs.CL cs.AI

    Firm or Fickle? Evaluating Large Language Models Consistency in Sequential Interactions

    Authors: Yubo Li, Yidi Miao, Xueying Ding, Ramayya Krishnan, Rema Padman

    Abstract: Large Language Models (LLMs) have shown remarkable capabilities across various tasks, but their deployment in high-stake domains requires consistent and coherent behavior across multiple rounds of user interaction. This paper introduces a comprehensive framework for evaluating and improving LLM response consistency, making three key contributions. Code and data are available at: https://github.com… ▽ More

    Submitted 5 June, 2025; v1 submitted 28 March, 2025; originally announced March 2025.

    Comments: 8 pages, 5 figures

  40. arXiv:2503.12687  [pdf, other

    cs.AI

    AI Agents: Evolution, Architecture, and Real-World Applications

    Authors: Naveen Krishnan

    Abstract: This paper examines the evolution, architecture, and practical applications of AI agents from their early, rule-based incarnations to modern sophisticated systems that integrate large language models with dedicated modules for perception, planning, and tool use. Emphasizing both theoretical foundations and real-world deployments, the paper reviews key agent paradigms, discusses limitations of curr… ▽ More

    Submitted 16 March, 2025; originally announced March 2025.

    Comments: 52 pages, 4 figures, comprehensive survey and analysis of AI agent evolution, architecture, evaluation frameworks, and applications

    MSC Class: 68T05; 68T20 ACM Class: I.2.6; I.2.8; I.2.11

  41. arXiv:2503.10352  [pdf, other

    cs.LG eess.SY math.OC

    Safe exploration in reproducing kernel Hilbert spaces

    Authors: Abdullah Tokmak, Kiran G. Krishnan, Thomas B. Schön, Dominik Baumann

    Abstract: Popular safe Bayesian optimization (BO) algorithms learn control policies for safety-critical systems in unknown environments. However, most algorithms make a smoothness assumption, which is encoded by a known bounded norm in a reproducing kernel Hilbert space (RKHS). The RKHS is a potentially infinite-dimensional space, and it remains unclear how to reliably obtain the RKHS norm of an unknown fun… ▽ More

    Submitted 13 March, 2025; originally announced March 2025.

    Comments: Accepted to AISTATS 2025

  42. arXiv:2503.09032  [pdf, other

    cs.LG cs.AI cs.CL

    Teaching LLMs How to Learn with Contextual Fine-Tuning

    Authors: Younwoo Choi, Muhammad Adil Asif, Ziwen Han, John Willes, Rahul G. Krishnan

    Abstract: Prompting Large Language Models (LLMs), or providing context on the expected model of operation, is an effective way to steer the outputs of such models to satisfy human desiderata after they have been trained. But in rapidly evolving domains, there is often need to fine-tune LLMs to improve either the kind of knowledge in their memory or their abilities to perform open ended reasoning in new doma… ▽ More

    Submitted 11 March, 2025; originally announced March 2025.

    Comments: ICLR 2025

  43. arXiv:2503.05675  [pdf, other

    cs.LG cs.DB

    Algorithmic Data Minimization for Machine Learning over Internet-of-Things Data Streams

    Authors: Ted Shaowang, Shinan Liu, Jonatas Marques, Nick Feamster, Sanjay Krishnan

    Abstract: Machine learning can analyze vast amounts of data generated by IoT devices to identify patterns, make predictions, and enable real-time decision-making. By processing sensor data, machine learning models can optimize processes, improve efficiency, and enhance personalized user experiences in smart systems. However, IoT systems are often deployed in sensitive environments such as households and off… ▽ More

    Submitted 7 March, 2025; originally announced March 2025.

    Comments: 9 pages, 18 figures

  44. arXiv:2503.03986  [pdf, other

    cs.LG cs.AI

    Training neural networks faster with minimal tuning using pre-computed lists of hyperparameters for NAdamW

    Authors: Sourabh Medapati, Priya Kasimbeg, Shankar Krishnan, Naman Agarwal, George Dahl

    Abstract: If we want to train a neural network using any of the most popular optimization algorithms, we are immediately faced with a dilemma: how to set the various optimization and regularization hyperparameters? When computational resources are abundant, there are a variety of methods for finding good hyperparameter settings, but when resources are limited the only realistic choices are using standard de… ▽ More

    Submitted 5 March, 2025; originally announced March 2025.

    Comments: Good defaults for NadamW Optimizer, generalizes well to unseen problems

  45. arXiv:2503.03056  [pdf, other

    cs.LG

    A2Perf: Real-World Autonomous Agents Benchmark

    Authors: Ikechukwu Uchendu, Jason Jabbour, Korneel Van den Berghe, Joel Runevic, Matthew Stewart, Jeffrey Ma, Srivatsan Krishnan, Izzeddin Gur, Austin Huang, Colton Bishop, Paige Bailey, Wenjie Jiang, Ebrahim M. Songhori, Sergio Guadarrama, Jie Tan, Jordan K. Terry, Aleksandra Faust, Vijay Janapa Reddi

    Abstract: Autonomous agents and systems cover a number of application areas, from robotics and digital assistants to combinatorial optimization, all sharing common, unresolved research challenges. It is not sufficient for agents to merely solve a given task; they must generalize to out-of-distribution tasks, perform reliably, and use hardware resources efficiently during training and inference, among other… ▽ More

    Submitted 4 March, 2025; originally announced March 2025.

    Comments: 32 pages, 12 figures, preprint

  46. arXiv:2503.01522  [pdf, ps, other

    cs.IT cs.CR

    Byzantine Distributed Function Computation

    Authors: Hari Krishnan P. Anilkumar, Neha Sangwan, Varun Narayanan, Vinod M. Prabhakaran

    Abstract: We study the distributed function computation problem with $k$ users of which at most $s$ may be controlled by an adversary and characterize the set of functions of the sources the decoder can reconstruct robustly in the following sense -- if the users behave honestly, the function is recovered with high probability (w.h.p.); if they behave adversarially, w.h.p, either one of the adversarial users… ▽ More

    Submitted 10 March, 2025; v1 submitted 3 March, 2025; originally announced March 2025.

  47. arXiv:2502.19782  [pdf, other

    cs.CV

    Open-Vocabulary Semantic Part Segmentation of 3D Human

    Authors: Keito Suzuki, Bang Du, Girish Krishnan, Kunyao Chen, Runfa Blark Li, Truong Nguyen

    Abstract: 3D part segmentation is still an open problem in the field of 3D vision and AR/VR. Due to limited 3D labeled data, traditional supervised segmentation methods fall short in generalizing to unseen shapes and categories. Recently, the advancement in vision-language models' zero-shot abilities has brought a surge in open-world 3D segmentation methods. While these methods show promising results for 3D… ▽ More

    Submitted 27 February, 2025; originally announced February 2025.

    Comments: 3DV 2025

  48. arXiv:2502.14976  [pdf, other

    cs.LG cs.CR cs.CV

    EigenShield: Causal Subspace Filtering via Random Matrix Theory for Adversarially Robust Vision-Language Models

    Authors: Nastaran Darabi, Devashri Naik, Sina Tayebati, Dinithi Jayasuriya, Ranganath Krishnan, Amit Ranjan Trivedi

    Abstract: Vision-Language Models (VLMs) inherit adversarial vulnerabilities of Large Language Models (LLMs), which are further exacerbated by their multimodal nature. Existing defenses, including adversarial training, input transformations, and heuristic detection, are computationally expensive, architecture-dependent, and fragile against adaptive attacks. We introduce EigenShield, an inference-time defense… ▽ More

    Submitted 20 February, 2025; originally announced February 2025.

  49. arXiv:2502.13010  [pdf, other

    cs.CL cs.MA

    Agentic Medical Knowledge Graphs Enhance Medical Question Answering: Bridging the Gap Between LLMs and Evolving Medical Knowledge

    Authors: Mohammad Reza Rezaei, Reza Saadati Fard, Rahul G. Krishnan, Milad Lankarany

    Abstract: Large Language Models (LLMs) have significantly advanced medical question-answering by leveraging extensive clinical data and medical literature. However, the rapid evolution of medical knowledge and the labor-intensive process of manually updating domain-specific resources pose challenges to the reliability of these systems. To address this, we introduce Agentic Medical Graph-RAG (AMG-RAG), a com… ▽ More

    Submitted 27 May, 2025; v1 submitted 18 February, 2025; originally announced February 2025.

  50. arXiv:2502.08972  [pdf, other

    cs.CL cs.AI

    Tuning-Free Personalized Alignment via Trial-Error-Explain In-Context Learning

    Authors: Hyundong Cho, Karishma Sharma, Nicolaas Jedema, Leonardo F. R. Ribeiro, Alessandro Moschitti, Ravi Krishnan, Jonathan May

    Abstract: Language models are aligned to the collective voice of many, resulting in generic outputs that do not align with specific users' styles. In this work, we present Trial-Error-Explain In-Context Learning (TICL), a tuning-free method that personalizes language models for text generation tasks with fewer than 10 examples per user. TICL iteratively expands an in-context learning prompt via a trial-erro… ▽ More

    Submitted 5 April, 2025; v1 submitted 13 February, 2025; originally announced February 2025.

    Comments: NAACL 2025 Findings