Skip to main content

Showing 1–50 of 587 results for author: Arvind

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.20689  [pdf

    eess.IV cs.AI cs.CV cs.LG

    U-R-VEDA: Integrating UNET, Residual Links, Edge and Dual Attention, and Vision Transformer for Accurate Semantic Segmentation of CMRs

    Authors: Racheal Mukisa, Arvind K. Bansal

    Abstract: Artificial intelligence, including deep learning models, will play a transformative role in automated medical image analysis for the diagnosis of cardiac disorders and their management. Automated accurate delineation of cardiac images is the first necessary initial step for the quantification and automated diagnosis of cardiac disorders. In this paper, we propose a deep learning based enhanced UNe… ▽ More

    Submitted 25 June, 2025; originally announced June 2025.

    Comments: 15 pages, 3 figures

    ACM Class: I.4.6; I.2; I.5.2; I.5.1

  2. arXiv:2506.20399  [pdf, ps, other

    cs.RO

    Multimodal Behaviour Trees for Robotic Laboratory Task Automation

    Authors: Hatem Fakhruldeen, Arvind Raveendran Nambiar, Satheeshkumar Veeramani, Bonilkumar Vijaykumar Tailor, Hadi Beyzaee Juneghani, Gabriella Pizzuto, Andrew Ian Cooper

    Abstract: Laboratory robotics offer the capability to conduct experiments with a high degree of precision and reproducibility, with the potential to transform scientific research. Trivial and repeatable tasks; e.g., sample transportation for analysis and vial capping are well-suited for robots; if done successfully and reliably, chemists could contribute their efforts towards more critical research activiti… ▽ More

    Submitted 25 June, 2025; originally announced June 2025.

    Comments: 7 pages, 5 figures, accepted and presented in ICRA 2025

  3. arXiv:2506.18405  [pdf, ps, other

    cs.IT

    $(\ell,δ)$-Diversity: Linkage-Robustness via a Composition Theorem

    Authors: V. Arvind Rameshwar, Anshoo Tandon

    Abstract: In this paper, we consider the problem of degradation of anonymity upon linkages of anonymized datasets. We work in the setting where an adversary links together $t\geq 2$ anonymized datasets in which a user of interest participates, based on the user's known quasi-identifiers, which motivates the use of $\ell$-diversity as the notion of dataset anonymity. We first argue that in the worst case, su… ▽ More

    Submitted 24 June, 2025; v1 submitted 23 June, 2025; originally announced June 2025.

    Comments: 10 pages, 2 tables, 2 figures

  4. arXiv:2506.17633  [pdf, ps, other

    cs.CV cs.AI

    Adaptive Multi-prompt Contrastive Network for Few-shot Out-of-distribution Detection

    Authors: Xiang Fang, Arvind Easwaran, Blaise Genest

    Abstract: Out-of-distribution (OOD) detection attempts to distinguish outlier samples to prevent models trained on the in-distribution (ID) dataset from producing unavailable outputs. Most OOD detection methods require many IID samples for training, which seriously limits their real-world applications. To this end, we target a challenging setting: few-shot OOD detection, where {Only a few {\em labeled ID} s… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

    Comments: ICML 2025

  5. arXiv:2506.16649  [pdf

    cs.CR

    Automated Energy Billing with Blockchain and the Prophet Forecasting Model: A Holistic Approach

    Authors: Ajesh Thangaraj Nadar, Soham Chandane, Gabriel Nixon Raj, Nihar Mahesh Pasi, Yash Arvind Patil

    Abstract: This paper presents a comprehensive approach to automated energy billing that leverages IoT-based smart meters, blockchain technology, and the Prophet time series forecasting model. The proposed system facilitates real-time power consumption monitoring via Wi-Fi-enabled ESP32 modules and a mobile application interface. It integrates Firebase and blockchain for secure, transparent billing processes… ▽ More

    Submitted 19 June, 2025; originally announced June 2025.

    Comments: 10 pages, 5 figures. Presented at IEEE International Conference on Multidisciplinary Research in Technology and Management MRTM 2023 held on 22 to 23 September 2023 at New Horizon College of Engineering India

    ACM Class: C.2.1; C.3; D.2.11

  6. arXiv:2506.15883  [pdf, ps, other

    cs.HC

    Semantic Scaffolding: Augmenting Textual Structures with Domain-Specific Groupings for Accessible Data Exploration

    Authors: Jonathan Zong, Isabella Pedraza Pineros, Mengzhu Katie Chen, Daniel Hajas, Arvind Satyanarayan

    Abstract: Drawing connections between interesting groupings of data and their real-world meaning is an important, yet difficult, part of encountering a new dataset. A lay reader might see an interesting visual pattern in a chart but lack the domain expertise to explain its meaning. Or, a reader might be familiar with a real-world concept but struggle to express it in terms of a dataset's fields. In response… ▽ More

    Submitted 18 June, 2025; originally announced June 2025.

  7. Energy-Efficient Real-Time Job Mapping and Resource Management in Mobile-Edge Computing

    Authors: Chuanchao Gao, Niraj Kumar, Arvind Easwaran

    Abstract: Mobile-edge computing (MEC) has emerged as a promising paradigm for enabling Internet of Things (IoT) devices to handle computation-intensive jobs. Due to the imperfect parallelization of algorithms for job processing on servers and the impact of IoT device mobility on data communication quality in wireless networks, it is crucial to jointly consider server resource allocation and IoT device mobil… ▽ More

    Submitted 14 June, 2025; originally announced June 2025.

    Journal ref: 2024 IEEE Real-Time Systems Symposium (RTSS)

  8. arXiv:2506.12103  [pdf, other

    cs.AI cs.CY cs.LG

    The Amazon Nova Family of Models: Technical Report and Model Card

    Authors: Amazon AGI, Aaron Langford, Aayush Shah, Abhanshu Gupta, Abhimanyu Bhatter, Abhinav Goyal, Abhinav Mathur, Abhinav Mohanty, Abhishek Kumar, Abhishek Sethi, Abi Komma, Abner Pena, Achin Jain, Adam Kunysz, Adam Opyrchal, Adarsh Singh, Aditya Rawal, Adok Achar Budihal Prasad, Adrià de Gispert, Agnika Kumar, Aishwarya Aryamane, Ajay Nair, Akilan M, Akshaya Iyengar, Akshaya Vishnu Kudlu Shanbhogue , et al. (761 additional authors not shown)

    Abstract: We present Amazon Nova, a new generation of state-of-the-art foundation models that deliver frontier intelligence and industry-leading price performance. Amazon Nova Pro is a highly-capable multimodal model with the best combination of accuracy, speed, and cost for a wide range of tasks. Amazon Nova Lite is a low-cost multimodal model that is lightning fast for processing images, video, documents… ▽ More

    Submitted 17 March, 2025; originally announced June 2025.

    Comments: 48 pages, 10 figures

    Report number: 20250317

  9. arXiv:2506.09661  [pdf, ps, other

    eess.IV cs.CV q-bio.TO

    A Cytology Dataset for Early Detection of Oral Squamous Cell Carcinoma

    Authors: Garima Jain, Sanghamitra Pati, Mona Duggal, Amit Sethi, Abhijeet Patil, Gururaj Malekar, Nilesh Kowe, Jitender Kumar, Jatin Kashyap, Divyajeet Rout, Deepali, Hitesh, Nishi Halduniya, Sharat Kumar, Heena Tabassum, Rupinder Singh Dhaliwal, Sucheta Devi Khuraijam, Sushma Khuraijam, Sharmila Laishram, Simmi Kharb, Sunita Singh, K. Swaminadtan, Ranjana Solanki, Deepika Hemranjani, Shashank Nath Singh , et al. (12 additional authors not shown)

    Abstract: Oral squamous cell carcinoma OSCC is a major global health burden, particularly in several regions across Asia, Africa, and South America, where it accounts for a significant proportion of cancer cases. Early detection dramatically improves outcomes, with stage I cancers achieving up to 90 percent survival. However, traditional diagnosis based on histopathology has limited accessibility in low-res… ▽ More

    Submitted 11 June, 2025; originally announced June 2025.

    Comments: 7 pages, 2 figurs

  10. arXiv:2506.00135  [pdf, ps, other

    cs.LG cs.DS stat.ML

    Tradeoffs between Mistakes and ERM Oracle Calls in Online and Transductive Online Learning

    Authors: Idan Attias, Steve Hanneke, Arvind Ramaswami

    Abstract: We study online and transductive online learning when the learner interacts with the concept class only via Empirical Risk Minimization (ERM) or weak consistency oracles on arbitrary instance subsets. This contrasts with standard online models, where the learner knows the entire class. The ERM oracle returns a hypothesis minimizing loss on a given subset, while the weak consistency oracle returns… ▽ More

    Submitted 30 May, 2025; originally announced June 2025.

  11. arXiv:2505.21767  [pdf, ps, other

    eess.IV cs.LG eess.SP

    Beyond 1D: Vision Transformers and Multichannel Signal Images for PPG-to-ECG Reconstruction

    Authors: Xiaoyan Li, Shixin Xu, Faisal Habib, Arvind Gupta, Huaxiong Huang

    Abstract: Reconstructing ECG from PPG is a promising yet challenging task. While recent advancements in generative models have significantly improved ECG reconstruction, accurately capturing fine-grained waveform features remains a key challenge. To address this, we propose a novel PPG-to-ECG reconstruction method that leverages a Vision Transformer (ViT) as the core network. Unlike conventional approaches… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

  12. arXiv:2505.11733  [pdf, ps, other

    cs.CL

    MedCaseReasoning: Evaluating and learning diagnostic reasoning from clinical case reports

    Authors: Kevin Wu, Eric Wu, Rahul Thapa, Kevin Wei, Angela Zhang, Arvind Suresh, Jacqueline J. Tao, Min Woo Sun, Alejandro Lozano, James Zou

    Abstract: Doctors and patients alike increasingly use Large Language Models (LLMs) to diagnose clinical cases. However, unlike domains such as math or coding, where correctness can be objectively defined by the final answer, medical diagnosis requires both the outcome and the reasoning process to be accurate. Currently, widely used medical benchmarks like MedQA and MMLU assess only accuracy in the final ans… ▽ More

    Submitted 20 May, 2025; v1 submitted 16 May, 2025; originally announced May 2025.

  13. arXiv:2505.09764  [pdf, ps, other

    cs.DC cs.NI

    FLASH: Fast All-to-All Communication in GPU Clusters

    Authors: Yiran Lei, Dongjoo Lee, Liangyu Zhao, Daniar Kurniawan, Chanmyeong Kim, Heetaek Jeong, Changsu Kim, Hyeonseong Choi, Liangcheng Yu, Arvind Krishnamurthy, Justine Sherry, Eriko Nurvitadhi

    Abstract: Scheduling All-to-All communications efficiently is fundamental to minimizing job completion times in distributed systems. Incast and straggler flows can slow down All-to-All transfers; and GPU clusters bring additional straggler challenges due to highly heterogeneous link capacities between technologies like NVLink and Ethernet. Existing schedulers all suffer high overheads relative to theoretica… ▽ More

    Submitted 14 May, 2025; originally announced May 2025.

  14. arXiv:2505.09254  [pdf, other

    cs.SI nlin.AO

    Moving towards informative and actionable social media research

    Authors: Joseph B. Bak-Coleman, Stephan Lewandowsky, Philipp Lorenz-Spreen, Arvind Narayanan, Amy Orben, Lisa Oswald

    Abstract: Social media is nearly ubiquitous in modern life, and concerns have been raised about its putative societal impacts, ranging from undermining mental health and exacerbating polarization to fomenting violence and disrupting democracy. Despite extensive research, consensus on these effects remains elusive, with observational studies often highlighting concerns while randomized controlled trials (RCT… ▽ More

    Submitted 14 May, 2025; originally announced May 2025.

  15. arXiv:2505.07634  [pdf, ps, other

    cs.RO cs.AI cs.CV

    Neural Brain: A Neuroscience-inspired Framework for Embodied Agents

    Authors: Jian Liu, Xiongtao Shi, Thai Duy Nguyen, Haitian Zhang, Tianxiang Zhang, Wei Sun, Yanjie Li, Athanasios V. Vasilakos, Giovanni Iacca, Arshad Ali Khan, Arvind Kumar, Jae Won Cho, Ajmal Mian, Lihua Xie, Erik Cambria, Lin Wang

    Abstract: The rapid evolution of artificial intelligence (AI) has shifted from static, data-driven models to dynamic systems capable of perceiving and interacting with real-world environments. Despite advancements in pattern recognition and symbolic reasoning, current AI systems, such as large language models, remain disembodied, unable to physically engage with the world. This limitation has driven the ris… ▽ More

    Submitted 14 May, 2025; v1 submitted 12 May, 2025; originally announced May 2025.

    Comments: 51 pages, 17 figures, 9 tables

  16. arXiv:2505.07069  [pdf, ps, other

    cs.HC

    HeedVision: Attention Awareness in Collaborative Immersive Analytics Environments

    Authors: Arvind Srinivasan, Niklas Elmqvist

    Abstract: Group awareness--the ability to perceive the activities of collaborators in a shared space--is a vital mechanism to support effective coordination and joint data analysis in collaborative visualization. We introduce collaborative attention-aware visualizations (CAAVs) that track, record, and revisualize the collective attention of multiple users over time. We implement this concept in HeedVision,… ▽ More

    Submitted 11 May, 2025; originally announced May 2025.

  17. arXiv:2505.05515  [pdf, other

    q-bio.NC cs.LG

    Nature's Insight: A Novel Framework and Comprehensive Analysis of Agentic Reasoning Through the Lens of Neuroscience

    Authors: Zinan Liu, Haoran Li, Jingyi Lu, Gaoyuan Ma, Xu Hong, Giovanni Iacca, Arvind Kumar, Shaojun Tang, Lin Wang

    Abstract: Autonomous AI is no longer a hard-to-reach concept, it enables the agents to move beyond executing tasks to independently addressing complex problems, adapting to change while handling the uncertainty of the environment. However, what makes the agents truly autonomous? It is agentic reasoning, that is crucial for foundation models to develop symbolic logic, statistical correlations, or large-scale… ▽ More

    Submitted 7 May, 2025; originally announced May 2025.

    Comments: 39 pages, 17 figures

  18. arXiv:2505.04846  [pdf, ps, other

    cs.IR cs.CE cs.CL cs.DC cs.LG

    HiPerRAG: High-Performance Retrieval Augmented Generation for Scientific Insights

    Authors: Ozan Gokdemir, Carlo Siebenschuh, Alexander Brace, Azton Wells, Brian Hsu, Kyle Hippe, Priyanka V. Setty, Aswathy Ajith, J. Gregory Pauloski, Varuni Sastry, Sam Foreman, Huihuo Zheng, Heng Ma, Bharat Kale, Nicholas Chia, Thomas Gibbs, Michael E. Papka, Thomas Brettin, Francis J. Alexander, Anima Anandkumar, Ian Foster, Rick Stevens, Venkatram Vishwanath, Arvind Ramanathan

    Abstract: The volume of scientific literature is growing exponentially, leading to underutilized discoveries, duplicated efforts, and limited cross-disciplinary collaboration. Retrieval Augmented Generation (RAG) offers a way to assist scientists by improving the factuality of Large Language Models (LLMs) in processing this influx of information. However, scaling RAG to handle millions of articles introduce… ▽ More

    Submitted 7 May, 2025; originally announced May 2025.

    Comments: This paper has been accepted at the Platform for Advanced Scientific Computing Conference (PASC 25), June 16-18, 2025, Brugg-Windisch, Switzerland

    ACM Class: H.3.3; I.2.7

  19. arXiv:2505.03132  [pdf, other

    cs.CV cs.AI cs.HC

    VISLIX: An XAI Framework for Validating Vision Models with Slice Discovery and Analysis

    Authors: Xinyuan Yan, Xiwei Xuan, Jorge Piazentin Ono, Jiajing Guo, Vikram Mohanty, Shekar Arvind Kumar, Liang Gou, Bei Wang, Liu Ren

    Abstract: Real-world machine learning models require rigorous evaluation before deployment, especially in safety-critical domains like autonomous driving and surveillance. The evaluation of machine learning models often focuses on data slices, which are subsets of the data that share a set of characteristics. Data slice finding automatically identifies conditions or data subgroups where models underperform,… ▽ More

    Submitted 5 May, 2025; originally announced May 2025.

  20. arXiv:2505.01435  [pdf, other

    cs.IR cs.CL cs.DC cs.LG

    AdaParse: An Adaptive Parallel PDF Parsing and Resource Scaling Engine

    Authors: Carlo Siebenschuh, Kyle Hippe, Ozan Gokdemir, Alexander Brace, Arham Khan, Khalid Hossain, Yadu Babuji, Nicholas Chia, Venkatram Vishwanath, Rick Stevens, Arvind Ramanathan, Ian Foster, Robert Underwood

    Abstract: Language models for scientific tasks are trained on text from scientific publications, most distributed as PDFs that require parsing. PDF parsing approaches range from inexpensive heuristics (for simple documents) to computationally intensive ML-driven systems (for complex or degraded ones). The choice of the "best" parser for a particular document depends on its computational cost and the accurac… ▽ More

    Submitted 23 April, 2025; originally announced May 2025.

    Comments: This paper has been accepted at the The Eighth Annual Conference on Machine Learning and Systems (MLSys 2025)

  21. arXiv:2504.17080  [pdf, other

    cs.RO eess.SY

    Geometric Formulation of Unified Force-Impedance Control on SE(3) for Robotic Manipulators

    Authors: Joohwan Seo, Nikhil Potu Surya Prakash, Soomi Lee, Arvind Kruthiventy, Megan Teng, Jongeun Choi, Roberto Horowitz

    Abstract: In this paper, we present an impedance control framework on the SE(3) manifold, which enables force tracking while guaranteeing passivity. Building upon the unified force-impedance control (UFIC) and our previous work on geometric impedance control (GIC), we develop the geometric unified force impedance control (GUFIC) to account for the SE(3) manifold structure in the controller formulation using… ▽ More

    Submitted 23 April, 2025; originally announced April 2025.

    Comments: Submitted to Control Decision Conference (CDC) 2025

  22. arXiv:2504.13863  [pdf

    cs.HC

    Utsarjan: A smartphone App for providing kidney care and real-time assistance to children with nephrotic syndrome

    Authors: Snigdha Tiwari, Sahil Sharma, Arvind Bagga, Aditi Sinha, Deepak Sharma

    Abstract: Background Telemedicine has the potential to provide secure and cost-effective healthcare at the touch of a button. Nephrotic syndrome is a chronic childhood illness involving frequent relapses and demands long/complex treatment. Hence, developing a remote means of doctor-patient interface will ensure the provision of quality healthcare to patients. Methods The Utsarjan mobile App framework was bu… ▽ More

    Submitted 26 March, 2025; originally announced April 2025.

    Comments: 16 pages, 3 figures

  23. arXiv:2504.13415  [pdf

    eess.IV cs.AI cs.CV cs.LG

    DADU: Dual Attention-based Deep Supervised UNet for Automated Semantic Segmentation of Cardiac Images

    Authors: Racheal Mukisa, Arvind K. Bansal

    Abstract: We propose an enhanced deep learning-based model for image segmentation of the left and right ventricles and myocardium scar tissue from cardiac magnetic resonance (CMR) images. The proposed technique integrates UNet, channel and spatial attention, edge-detection based skip-connection and deep supervised learning to improve the accuracy of the CMR image-segmentation. Images are processed using mul… ▽ More

    Submitted 17 April, 2025; originally announced April 2025.

    Comments: 20 pages, 8 figures

    ACM Class: I.4.6; I.2; I.5.2; I.5.1

  24. arXiv:2504.13391  [pdf

    eess.IV cs.AI cs.CV cs.LG

    Cardiac MRI Semantic Segmentation for Ventricles and Myocardium using Deep Learning

    Authors: Racheal Mukisa, Arvind K. Bansal

    Abstract: Automated noninvasive cardiac diagnosis plays a critical role in the early detection of cardiac disorders and cost-effective clinical management. Automated diagnosis involves the automated segmentation and analysis of cardiac images. Precise delineation of cardiac substructures and extraction of their morphological attributes are essential for evaluating the cardiac function, and diagnosing cardio… ▽ More

    Submitted 17 April, 2025; originally announced April 2025.

    Comments: 20 pages, 8 figures

    ACM Class: I.4.6; I.2; I.5.2; I.5.1

  25. arXiv:2504.11952  [pdf, other

    cs.CL cs.AI cs.LG

    Robust and Fine-Grained Detection of AI Generated Texts

    Authors: Ram Mohan Rao Kadiyala, Siddartha Pullakhandam, Kanwal Mehreen, Drishti Sharma, Siddhant Gupta, Jebish Purbey, Ashay Srivastava, Subhasya TippaReddy, Arvind Reddy Bobbili, Suraj Telugara Chandrashekhar, Modabbir Adeeb, Srinadh Vura, Hamza Farooq

    Abstract: An ideal detection system for machine generated content is supposed to work well on any generator as many more advanced LLMs come into existence day by day. Existing systems often struggle with accurately identifying AI-generated content over shorter texts. Further, not all texts might be entirely authored by a human or LLM, hence we focused more over partial cases i.e human-LLM co-authored texts.… ▽ More

    Submitted 22 May, 2025; v1 submitted 16 April, 2025; originally announced April 2025.

    Comments: 18 pages, 6 figures

  26. arXiv:2504.10191  [pdf, other

    cs.CL cs.AI

    Localized Cultural Knowledge is Conserved and Controllable in Large Language Models

    Authors: Veniamin Veselovsky, Berke Argin, Benedikt Stroebl, Chris Wendler, Robert West, James Evans, Thomas L. Griffiths, Arvind Narayanan

    Abstract: Just as humans display language patterns influenced by their native tongue when speaking new languages, LLMs often default to English-centric responses even when generating in other languages. Nevertheless, we observe that local cultural information persists within the models and can be readily activated for cultural customization. We first demonstrate that explicitly providing cultural context in… ▽ More

    Submitted 14 April, 2025; originally announced April 2025.

  27. arXiv:2504.08784  [pdf, other

    cs.DC cs.LG

    SLOs-Serve: Optimized Serving of Multi-SLO LLMs

    Authors: Siyuan Chen, Zhipeng Jia, Samira Khan, Arvind Krishnamurthy, Phillip B. Gibbons

    Abstract: This paper introduces SLOs-Serve, a system designed for serving multi-stage large language model (LLM) requests with application- and stage-specific service level objectives (SLOs). The key idea behind SLOs-Serve is to customize the allocation of tokens to meet these SLO requirements. SLOs-Serve uses a multi-SLO dynamic programming-based algorithm to continuously optimize token allocations under S… ▽ More

    Submitted 5 April, 2025; originally announced April 2025.

  28. arXiv:2504.04657  [pdf

    cs.LG

    ACE-RLHF: Automated Code Evaluation and Socratic Feedback Generation Tool using Large Language Models and Reinforcement Learning with Human Feedback

    Authors: Tasnia Rahman, Sathish A. P. Kumar, Sumit Jha, Arvind Ramanathan

    Abstract: Automated Program Repair tools are developed for generating feedback and suggesting a repair method for erroneous code. State of the art (SOTA) code repair methods rely on data-driven approaches and often fail to deliver solution for complicated programming questions. To interpret the natural language of unprecedented programming problems, using Large Language Models (LLMs) for code-feedback gener… ▽ More

    Submitted 6 April, 2025; originally announced April 2025.

    Comments: 9 pages, 3 figures

  29. arXiv:2504.03423  [pdf

    cs.LG cs.RO

    DML-RAM: Deep Multimodal Learning Framework for Robotic Arm Manipulation using Pre-trained Models

    Authors: Sathish Kumar, Swaroop Damodaran, Naveen Kumar Kuruba, Sumit Jha, Arvind Ramanathan

    Abstract: This paper presents a novel deep learning framework for robotic arm manipulation that integrates multimodal inputs using a late-fusion strategy. Unlike traditional end-to-end or reinforcement learning approaches, our method processes image sequences with pre-trained models and robot state data with machine learning algorithms, fusing their outputs to predict continuous action values for control. E… ▽ More

    Submitted 4 April, 2025; originally announced April 2025.

    Comments: 7 pages , 4 figures

  30. arXiv:2504.03163  [pdf

    cs.LG

    Enhanced Penalty-based Bidirectional Reinforcement Learning Algorithms

    Authors: Sai Gana Sandeep Pula, Sathish A. P. Kumar, Sumit Jha, Arvind Ramanathan

    Abstract: This research focuses on enhancing reinforcement learning (RL) algorithms by integrating penalty functions to guide agents in avoiding unwanted actions while optimizing rewards. The goal is to improve the learning process by ensuring that agents learn not only suitable actions but also which actions to avoid. Additionally, we reintroduce a bidirectional learning approach that enables agents to lea… ▽ More

    Submitted 4 April, 2025; originally announced April 2025.

    Comments: 16 pages, 13 Figures

  31. arXiv:2504.03153  [pdf

    cs.LG

    MORAL: A Multimodal Reinforcement Learning Framework for Decision Making in Autonomous Laboratories

    Authors: Natalie Tirabassi, Sathish A. P. Kumar, Sumit Jha, Arvind Ramanathan

    Abstract: We propose MORAL (a multimodal reinforcement learning framework for decision making in autonomous laboratories) that enhances sequential decision-making in autonomous robotic laboratories through the integration of visual and textual inputs. Using the BridgeData V2 dataset, we generate fine-tuned image captions with a pretrained BLIP-2 vision-language model and combine them with visual features th… ▽ More

    Submitted 4 April, 2025; originally announced April 2025.

    Comments: 9 pages, 14 figures and 3 tables

  32. arXiv:2503.22248  [pdf, other

    cs.LG cs.RO

    CRLLK: Constrained Reinforcement Learning for Lane Keeping in Autonomous Driving

    Authors: Xinwei Gao, Arambam James Singh, Gangadhar Royyuru, Michael Yuhas, Arvind Easwaran

    Abstract: Lane keeping in autonomous driving systems requires scenario-specific weight tuning for different objectives. We formulate lane-keeping as a constrained reinforcement learning problem, where weight coefficients are automatically learned along with the policy, eliminating the need for scenario-specific tuning. Empirically, our approach outperforms traditional RL in efficiency and reliability. Addit… ▽ More

    Submitted 28 March, 2025; originally announced March 2025.

    Comments: Accepted at AAMAS 2025 (Demonstration Track), 3 pages, 2 figures, 1 table

    ACM Class: I.2.6; I.2.9; I.5.1; C.3; I.2.11

  33. arXiv:2503.16861  [pdf, other

    cs.AI

    In-House Evaluation Is Not Enough: Towards Robust Third-Party Flaw Disclosure for General-Purpose AI

    Authors: Shayne Longpre, Kevin Klyman, Ruth E. Appel, Sayash Kapoor, Rishi Bommasani, Michelle Sahar, Sean McGregor, Avijit Ghosh, Borhane Blili-Hamelin, Nathan Butters, Alondra Nelson, Amit Elazari, Andrew Sellars, Casey John Ellis, Dane Sherrets, Dawn Song, Harley Geiger, Ilona Cohen, Lauren McIlvenny, Madhulika Srikumar, Mark M. Jaycox, Markus Anderljung, Nadine Farid Johnson, Nicholas Carlini, Nicolas Miailhe , et al. (9 additional authors not shown)

    Abstract: The widespread deployment of general-purpose AI (GPAI) systems introduces significant new risks. Yet the infrastructure, practices, and norms for reporting flaws in GPAI systems remain seriously underdeveloped, lagging far behind more established fields like software security. Based on a collaboration between experts from the fields of software security, machine learning, law, social science, and… ▽ More

    Submitted 25 March, 2025; v1 submitted 21 March, 2025; originally announced March 2025.

  34. arXiv:2503.16794  [pdf, other

    cs.DC cs.DM cs.DS

    Local Ratio based Real-time Job Offloading and Resource Allocation in Mobile Edge Computing

    Authors: Chuanchao Gao, Arvind Easwaran

    Abstract: Mobile Edge Computing (MEC) has emerged as a promising paradigm enabling vehicles to handle computation-intensive and time-sensitive applications for intelligent transportation. Due to the limited resources in MEC, effective resource management is crucial for improving system performance. While existing studies mostly focus on the job offloading problem and assume that job resource demands are fix… ▽ More

    Submitted 20 March, 2025; originally announced March 2025.

    Comments: accepted by The 4th Real-time And intelliGent Edge computing workshop, hold on May 6th, 2025 in Irvine, CA, USA

  35. arXiv:2503.12400  [pdf, ps, other

    cs.IT

    Secrecy Analysis of Energy-Harvesting Backscatter Communications with Tag Selection in Nakagami-m Fading

    Authors: Mohammad Nafees, Dharmendra Dixit, Arvind Kumar

    Abstract: Backscatter communication is an energy-efficient technique that enables sustainable wireless connectivity with a minimal environmental impact. In this paper, the secrecy performance of practical non-linear energy-harvesting backscatter communications with various tag selection schemes is analyzed in Nakagami-m fading channels. We consider four tag selection schemes: sub-optimal, minimal eaves-drop… ▽ More

    Submitted 16 March, 2025; originally announced March 2025.

  36. arXiv:2503.11870  [pdf, other

    cs.AI cs.LG

    Counterfactual Realizability

    Authors: Arvind Raghavan, Elias Bareinboim

    Abstract: It is commonly believed that, in a real-world environment, samples can only be drawn from observational and interventional distributions, corresponding to Layers 1 and 2 of the Pearl Causal Hierarchy. Layer 3, representing counterfactual distributions, is believed to be inaccessible by definition. However, Bareinboim, Forney, and Pearl (2015) introduced a procedure that allows an agent to sample d… ▽ More

    Submitted 14 March, 2025; originally announced March 2025.

    Comments: published at ICLR'25 (spotlight)

    Report number: Causal AI lab TR-113 ACM Class: F.4.1; G.3

  37. arXiv:2503.09829  [pdf, other

    cs.RO cs.LG eess.SY

    SE(3)-Equivariant Robot Learning and Control: A Tutorial Survey

    Authors: Joohwan Seo, Soochul Yoo, Junwoo Chang, Hyunseok An, Hyunwoo Ryu, Soomi Lee, Arvind Kruthiventy, Jongeun Choi, Roberto Horowitz

    Abstract: Recent advances in deep learning and Transformers have driven major breakthroughs in robotics by employing techniques such as imitation learning, reinforcement learning, and LLM-based multimodal perception and decision-making. However, conventional deep learning and Transformer models often struggle to process data with inherent symmetries and invariances, typically relying on large datasets or ex… ▽ More

    Submitted 23 April, 2025; v1 submitted 12 March, 2025; originally announced March 2025.

    Comments: Accepted to International Journcal of Control, Automation and Systems (IJCAS)

  38. arXiv:2503.07963  [pdf, other

    cs.RO cs.AI eess.SY

    Hierarchical Contact-Rich Trajectory Optimization for Multi-Modal Manipulation using Tight Convex Relaxations

    Authors: Yuki Shirai, Arvind Raghunathan, Devesh K. Jha

    Abstract: Designing trajectories for manipulation through contact is challenging as it requires reasoning of object \& robot trajectories as well as complex contact sequences simultaneously. In this paper, we present a novel framework for simultaneously designing trajectories of robots, objects, and contacts efficiently for contact-rich manipulation. We propose a hierarchical optimization framework where Mi… ▽ More

    Submitted 11 March, 2025; v1 submitted 10 March, 2025; originally announced March 2025.

    Comments: 2025 IEEE International Conference on Robotics and Automation (2025 ICRA)

  39. arXiv:2503.05238  [pdf, other

    cs.LG

    Guaranteeing Out-Of-Distribution Detection in Deep RL via Transition Estimation

    Authors: Mohit Prashant, Arvind Easwaran, Suman Das, Michael Yuhas

    Abstract: An issue concerning the use of deep reinforcement learning (RL) agents is whether they can be trusted to perform reliably when deployed, as training environments may not reflect real-life environments. Anticipating instances outside their training scope, learning-enabled systems are often equipped with out-of-distribution (OOD) detectors that alert when a trained system encounters a state it does… ▽ More

    Submitted 7 March, 2025; originally announced March 2025.

  40. Tactile Vega-Lite: Rapidly Prototyping Tactile Charts with Smart Defaults

    Authors: Mengzhu Katie Chen, Isabella Pedraza Pineros, Arvind Satyanarayan, Jonathan Zong

    Abstract: Tactile charts are essential for conveying data to blind and low vision (BLV) readers but are difficult for designers to construct. Non-expert designers face barriers to entry due to complex guidelines, while experts struggle with fragmented and time-consuming workflows that involve extensive customization. Inspired by formative interviews with expert tactile graphics designers, we created Tactile… ▽ More

    Submitted 3 March, 2025; v1 submitted 28 February, 2025; originally announced March 2025.

    Comments: ACM CHI 2025

  41. arXiv:2502.20969  [pdf, other

    cs.DC cs.LG

    TeleRAG: Efficient Retrieval-Augmented Generation Inference with Lookahead Retrieval

    Authors: Chien-Yu Lin, Keisuke Kamahori, Yiyu Liu, Xiaoxiang Shi, Madhav Kashyap, Yile Gu, Rulin Shao, Zihao Ye, Kan Zhu, Stephanie Wang, Arvind Krishnamurthy, Rohan Kadekodi, Luis Ceze, Baris Kasikci

    Abstract: Retrieval-augmented generation (RAG) extends large language models (LLMs) with external data sources to enhance factual correctness and domain coverage. Modern RAG pipelines rely on large datastores, leading to system challenges in latency-sensitive deployments, especially when limited GPU memory is available. To address these challenges, we propose TeleRAG, an efficient inference system that redu… ▽ More

    Submitted 28 February, 2025; originally announced February 2025.

  42. arXiv:2502.19625  [pdf, other

    cs.LG

    Revealing Treatment Non-Adherence Bias in Clinical Machine Learning Using Large Language Models

    Authors: Zhongyuan Liang, Arvind Suresh, Irene Y. Chen

    Abstract: Machine learning systems trained on electronic health records (EHRs) increasingly guide treatment decisions, but their reliability depends on the critical assumption that patients follow the prescribed treatments recorded in EHRs. Using EHR data from 3,623 hypertension patients, we investigate how treatment non-adherence introduces implicit bias that can fundamentally distort both causal inference… ▽ More

    Submitted 20 April, 2025; v1 submitted 26 February, 2025; originally announced February 2025.

  43. arXiv:2502.19320  [pdf, other

    cs.CL cs.AI cs.CR cs.LG stat.ML

    Shh, don't say that! Domain Certification in LLMs

    Authors: Cornelius Emde, Alasdair Paren, Preetham Arvind, Maxime Kayser, Tom Rainforth, Thomas Lukasiewicz, Bernard Ghanem, Philip H. S. Torr, Adel Bibi

    Abstract: Large language models (LLMs) are often deployed to perform constrained tasks, with narrow domains. For example, customer support bots can be built on top of LLMs, relying on their broad language understanding and capabilities to enhance performance. However, these LLMs are adversarially susceptible, potentially generating outputs outside the intended domain. To formalize, assess, and mitigate this… ▽ More

    Submitted 6 March, 2025; v1 submitted 26 February, 2025; originally announced February 2025.

    Comments: 10 pages, includes appendix Published in International Conference on Learning Representations (ICLR) 2025

    Journal ref: International Conference on Learning Representations (ICLR) 2025

  44. arXiv:2502.17536  [pdf, other

    eess.SP cs.LG

    CLEP-GAN: An Innovative Approach to Subject-Independent ECG Reconstruction from PPG Signals

    Authors: Xiaoyan Li, Shixin Xu, Faisal Habib, Neda Aminnejad, Arvind Gupta, Huaxiong Huang

    Abstract: This study addresses the challenge of reconstructing unseen ECG signals from PPG signals, a critical task for non-invasive cardiac monitoring. While numerous public ECG-PPG datasets are available, they lack the diversity seen in image datasets, and data collection processes often introduce noise, complicating ECG reconstruction from PPG even with advanced machine learning models. To tackle these c… ▽ More

    Submitted 24 February, 2025; originally announced February 2025.

  45. arXiv:2502.16154  [pdf, other

    cs.SE quant-ph

    Bridging Quantum Mechanics and Computing: A Primer for Software Engineers

    Authors: Arvind W Kiwelekar

    Abstract: Quantum mechanics, the fundamental theory that governs the behaviour of matter and energy at microscopic scales, forms the foundation of quantum computing and quantum information science. As quantum technologies progress, software engineers must develop a conceptual understanding of quantum mechanics to grasp its implications for computing. This article focuses on fundamental quantum mechanics pri… ▽ More

    Submitted 22 February, 2025; originally announced February 2025.

  46. arXiv:2502.12280  [pdf, other

    cs.DC cs.AI

    Connecting Large Language Model Agent to High Performance Computing Resource

    Authors: Heng Ma, Alexander Brace, Carlo Siebenschuh, Greg Pauloski, Ian Foster, Arvind Ramanathan

    Abstract: The Large Language Model agent workflow enables the LLM to invoke tool functions to increase the performance on specific scientific domain questions. To tackle large scale of scientific research, it requires access to computing resource and parallel computing setup. In this work, we implemented Parsl to the LangChain/LangGraph tool call setup, to bridge the gap between the LLM agent to the computi… ▽ More

    Submitted 17 February, 2025; originally announced February 2025.

    Comments: 7 pages, 4 figures

    ACM Class: I.2.11

  47. arXiv:2502.12216  [pdf, other

    cs.LG cs.AI cs.CL

    Tactic: Adaptive Sparse Attention with Clustering and Distribution Fitting for Long-Context LLMs

    Authors: Kan Zhu, Tian Tang, Qinyu Xu, Yile Gu, Zhichen Zeng, Rohan Kadekodi, Liangyu Zhao, Ang Li, Arvind Krishnamurthy, Baris Kasikci

    Abstract: Long-context models are essential for many applications but face inefficiencies in loading large KV caches during decoding. Prior methods enforce fixed token budgets for sparse attention, assuming a set number of tokens can approximate full attention. However, these methods overlook variations in the importance of attention across heads, layers, and contexts. To address these limitations, we propo… ▽ More

    Submitted 17 February, 2025; originally announced February 2025.

  48. arXiv:2502.07725  [pdf, other

    cs.HC

    Pluto: Authoring Semantically Aligned Text and Charts for Data-Driven Communication

    Authors: Arjun Srinivasan, Vidya Setlur, Arvind Satyanarayan

    Abstract: Textual content (including titles, annotations, and captions) plays a central role in helping readers understand a visualization by emphasizing, contextualizing, or summarizing the depicted data. Yet, existing visualization tools provide limited support for jointly authoring the two modalities of text and visuals such that both convey semantically-rich information and are cohesively integrated. In… ▽ More

    Submitted 11 February, 2025; originally announced February 2025.

    Comments: 18 pages, 11 figures, accepted to 2025 ACM Conference on Intelligent User Interfaces (ACM IUI)

    ACM Class: H.5

  49. arXiv:2502.07608  [pdf, other

    cs.LG cs.HC

    Time2Lang: Bridging Time-Series Foundation Models and Large Language Models for Health Sensing Beyond Prompting

    Authors: Arvind Pillai, Dimitris Spathis, Subigya Nepal, Amanda C Collins, Daniel M Mackin, Michael V Heinz, Tess Z Griffin, Nicholas C Jacobson, Andrew Campbell

    Abstract: Large language models (LLMs) show promise for health applications when combined with behavioral sensing data. Traditional approaches convert sensor data into text prompts, but this process is prone to errors, computationally expensive, and requires domain expertise. These challenges are particularly acute when processing extended time series data. While time series foundation models (TFMs) have re… ▽ More

    Submitted 28 April, 2025; v1 submitted 11 February, 2025; originally announced February 2025.

    Comments: Accepted to CHIL 2025. Code and models: https://github.com/arvind1609/time2lang

  50. arXiv:2502.04749  [pdf, ps, other

    cs.IT

    Bounding User Contributions for User-Level Differentially Private Mean Estimation

    Authors: V. Arvind Rameshwar, Anshoo Tandon

    Abstract: We revisit the problem of releasing the sample mean of bounded samples in a dataset, privately, under user-level $\varepsilon$-differential privacy (DP). We aim to derive the optimal method of preprocessing data samples, within a canonical class of processing strategies, in terms of the error in estimation. Typical error analyses of such \emph{bounding} (or \emph{clipping}) strategies in the liter… ▽ More

    Submitted 27 June, 2025; v1 submitted 7 February, 2025; originally announced February 2025.

    Comments: 7 pages, 3 figures, short technical note. Typos corrected