Skip to main content

Showing 1–50 of 731 results for author: kumar, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.13915  [pdf, ps, other

    cs.RO

    Sequence Modeling for Time-Optimal Quadrotor Trajectory Optimization with Sampling-based Robustness Analysis

    Authors: Katherine Mao, Hongzhan Yu, Ruipeng Zhang, Igor Spasojevic, M Ani Hsieh, Sicun Gao, Vijay Kumar

    Abstract: Time-optimal trajectories drive quadrotors to their dynamic limits, but computing such trajectories involves solving non-convex problems via iterative nonlinear optimization, making them prohibitively costly for real-time applications. In this work, we investigate learning-based models that imitate a model-based time-optimal trajectory planner to accelerate trajectory generation. Given a dataset o… ▽ More

    Submitted 16 June, 2025; originally announced June 2025.

  2. arXiv:2506.12103  [pdf, other

    cs.AI cs.CY cs.LG

    The Amazon Nova Family of Models: Technical Report and Model Card

    Authors: Amazon AGI, Aaron Langford, Aayush Shah, Abhanshu Gupta, Abhimanyu Bhatter, Abhinav Goyal, Abhinav Mathur, Abhinav Mohanty, Abhishek Kumar, Abhishek Sethi, Abi Komma, Abner Pena, Achin Jain, Adam Kunysz, Adam Opyrchal, Adarsh Singh, Aditya Rawal, Adok Achar Budihal Prasad, AdriĆ  de Gispert, Agnika Kumar, Aishwarya Aryamane, Ajay Nair, Akilan M, Akshaya Iyengar, Akshaya Vishnu Kudlu Shanbhogue , et al. (761 additional authors not shown)

    Abstract: We present Amazon Nova, a new generation of state-of-the-art foundation models that deliver frontier intelligence and industry-leading price performance. Amazon Nova Pro is a highly-capable multimodal model with the best combination of accuracy, speed, and cost for a wide range of tasks. Amazon Nova Lite is a low-cost multimodal model that is lightning fast for processing images, video, documents… ▽ More

    Submitted 17 March, 2025; originally announced June 2025.

    Comments: 48 pages, 10 figures

    Report number: 20250317

  3. arXiv:2506.08423  [pdf

    cond-mat.mtrl-sci cs.LG physics.ins-det

    Mic-hackathon 2024: Hackathon on Machine Learning for Electron and Scanning Probe Microscopy

    Authors: Utkarsh Pratiush, Austin Houston, Kamyar Barakati, Aditya Raghavan, Dasol Yoon, Harikrishnan KP, Zhaslan Baraissov, Desheng Ma, Samuel S. Welborn, Mikolaj Jakowski, Shawn-Patrick Barhorst, Alexander J. Pattison, Panayotis Manganaris, Sita Sirisha Madugula, Sai Venkata Gayathri Ayyagari, Vishal Kennedy, Ralph Bulanadi, Michelle Wang, Kieran J. Pang, Ian Addison-Smith, Willy Menacho, Horacio V. Guzman, Alexander Kiefer, Nicholas Furth, Nikola L. Kolev , et al. (48 additional authors not shown)

    Abstract: Microscopy is a primary source of information on materials structure and functionality at nanometer and atomic scales. The data generated is often well-structured, enriched with metadata and sample histories, though not always consistent in detail or format. The adoption of Data Management Plans (DMPs) by major funding agencies promotes preservation and access. However, deriving insights remains d… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

  4. arXiv:2506.01365  [pdf, ps, other

    cs.SD cs.CL eess.AS

    Attention Is Not Always the Answer: Optimizing Voice Activity Detection with Simple Feature Fusion

    Authors: Kumud Tripathi, Chowdam Venkata Kumar, Pankaj Wasnik

    Abstract: Voice Activity Detection (VAD) plays a key role in speech processing, often utilizing hand-crafted or neural features. This study examines the effectiveness of Mel-Frequency Cepstral Coefficients (MFCCs) and pre-trained model (PTM) features, including wav2vec 2.0, HuBERT, WavLM, UniSpeech, MMS, and Whisper. We propose FusionVAD, a unified framework that combines both feature types using three fusi… ▽ More

    Submitted 2 June, 2025; originally announced June 2025.

    Comments: Accepted at INTERSPEECH 2025, 5 pages, 4 figures, 2 tables

  5. arXiv:2505.23523  [pdf, ps, other

    cs.LG cs.DC

    Accelerating AllReduce with a Persistent Straggler

    Authors: Arjun Devraj, Eric Ding, Abhishek Vijaya Kumar, Robert Kleinberg, Rachee Singh

    Abstract: Distributed machine learning workloads use data and tensor parallelism for training and inference, both of which rely on the AllReduce collective to synchronize gradients or activations. However, bulk-synchronous AllReduce algorithms can be delayed by a persistent straggler that is slower to reach the synchronization barrier required to begin the collective. To address this challenge, we propose S… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

    Comments: 23 pages, 11 figures

  6. arXiv:2505.23105  [pdf, ps, other

    cs.LG cs.NI

    LUMION: Fast Fault Recovery for ML Jobs Using Programmable Optical Fabrics

    Authors: Abhishek Vijaya Kumar, Eric Ding, Arjun Devraj, Darius Bunandar, Rachee Singh

    Abstract: When accelerators fail in modern ML datacenters, operators migrate the affected ML training or inference jobs to entirely new racks. This approach, while preserving network performance, is highly inefficient, requiring datacenters to reserve full racks of idle accelerators for fault tolerance. In this paper, we address this resource inefficiency by introducing LUMION, a novel reconfigurable optica… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

  7. arXiv:2505.15157  [pdf, other

    cs.RO cs.LG

    Cascaded Diffusion Models for Neural Motion Planning

    Authors: Mohit Sharma, Adam Fishman, Vikash Kumar, Chris Paxton, Oliver Kroemer

    Abstract: Robots in the real world need to perceive and move to goals in complex environments without collisions. Avoiding collisions is especially difficult when relying on sensor perception and when goals are among clutter. Diffusion policies and other generative models have shown strong performance in solving local planning problems, but often struggle at avoiding all of the subtle constraint violations… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

    Comments: ICRA'25

  8. arXiv:2505.14061  [pdf, ps, other

    cs.DS cs.CC

    Linear Hashing Is Optimal

    Authors: Michael Jaber, Vinayak M. Kumar, David Zuckerman

    Abstract: We prove that hashing $n$ balls into $n$ bins via a random matrix over $\mathbf{F}_2$ yields expected maximum load $O(\log n / \log \log n)$. This matches the expected maximum load of a fully random function and resolves an open question posed by Alon, Dietzfelbinger, Miltersen, Petrank, and Tardos (STOC '97, JACM '99). More generally, we show that the maximum load exceeds… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

    Comments: 20 pages, 1 figure; to appear in STOC 2025

  9. arXiv:2505.13916  [pdf, other

    cs.RO

    Robotic Monitoring of Colorimetric Leaf Sensors for Precision Agriculture

    Authors: Malakhi Hopkins, Alice Kate Li, Shobhita Kramadhati, Jackson Arnold, Akhila Mallavarapu, Chavez Lawrence, Varun Murali, Sanjeev J. Koppal, Cherie Kagan, Vijay Kumar

    Abstract: Current remote sensing technologies that measure crop health e.g. RGB, multispectral, hyperspectral, and LiDAR, are indirect, and cannot capture plant stress indicators directly. Instead, low-cost leaf sensors that directly interface with the crop surface present an opportunity to advance real-time direct monitoring. To this end, we co-design a sensor-detector system, where the sensor is a novel c… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

    Comments: Accepted to the Novel Approaches for Precision Agriculture and Forestry with Autonomous Robots IEEE ICRA Workshop - 2025

  10. arXiv:2505.11794  [pdf, other

    cs.RO

    Gaussian Splatting as a Unified Representation for Autonomy in Unstructured Environments

    Authors: Dexter Ong, Yuezhan Tao, Varun Murali, Igor Spasojevic, Vijay Kumar, Pratik Chaudhari

    Abstract: In this work, we argue that Gaussian splatting is a suitable unified representation for autonomous robot navigation in large-scale unstructured outdoor environments. Such environments require representations that can capture complex structures while remaining computationally tractable for real-time navigation. We demonstrate that the dense geometric and photometric information provided by a Gaussi… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

  11. arXiv:2505.10228  [pdf, ps, other

    cs.RO eess.SY

    Quad-LCD: Layered Control Decomposition Enables Actuator-Feasible Quadrotor Trajectory Planning

    Authors: Anusha Srikanthan, Hanli Zhang, Spencer Folk, Vijay Kumar, Nikolai Matni

    Abstract: In this work, we specialize contributions from prior work on data-driven trajectory generation for a quadrotor system with motor saturation constraints. When motors saturate in quadrotor systems, there is an ``uncontrolled drift" of the vehicle that results in a crash. To tackle saturation, we apply a control decomposition and learn a tracking penalty from simulation data consisting of low, medium… ▽ More

    Submitted 15 May, 2025; originally announced May 2025.

    Comments: 4 pages, 4 figures

    Journal ref: ICRA 2025 Workshop on 25 Years of Aerial Robotics: Challenges and Opportunities

  12. arXiv:2505.09477  [pdf, ps, other

    cs.RO cs.AI

    Deploying Foundation Model-Enabled Air and Ground Robots in the Field: Challenges and Opportunities

    Authors: Zachary Ravichandran, Fernando Cladera, Jason Hughes, Varun Murali, M. Ani Hsieh, George J. Pappas, Camillo J. Taylor, Vijay Kumar

    Abstract: The integration of foundation models (FMs) into robotics has enabled robots to understand natural language and reason about the semantics in their environments. However, existing FM-enabled robots primary operate in closed-world settings, where the robot is given a full prior map or has a full view of its workspace. This paper addresses the deployment of FM-enabled robots in the field, where missi… ▽ More

    Submitted 14 May, 2025; originally announced May 2025.

    Comments: Accepted to the IEEE ICRA Workshop on Field Robotics 2025

  13. arXiv:2505.09108  [pdf, ps, other

    cs.RO cs.AI

    Air-Ground Collaboration for Language-Specified Missions in Unknown Environments

    Authors: Fernando Cladera, Zachary Ravichandran, Jason Hughes, Varun Murali, Carlos Nieto-Granda, M. Ani Hsieh, George J. Pappas, Camillo J. Taylor, Vijay Kumar

    Abstract: As autonomous robotic systems become increasingly mature, users will want to specify missions at the level of intent rather than in low-level detail. Language is an expressive and intuitive medium for such mission specification. However, realizing language-guided robotic teams requires overcoming significant technical hurdles. Interpreting and realizing language-specified missions requires advance… ▽ More

    Submitted 13 May, 2025; originally announced May 2025.

    Comments: 19 pages, 24 figures, 7 tables. Submitted to T-FR

  14. arXiv:2505.03093  [pdf, other

    cs.CV

    Estimating the Diameter at Breast Height of Trees in a Forest With a Single 360 Camera

    Authors: Siming He, Zachary Osman, Fernando Cladera, Dexter Ong, Nitant Rai, Patrick Corey Green, Vijay Kumar, Pratik Chaudhari

    Abstract: Forest inventories rely on accurate measurements of the diameter at breast height (DBH) for ecological monitoring, resource management, and carbon accounting. While LiDAR-based techniques can achieve centimeter-level precision, they are cost-prohibitive and operationally complex. We present a low-cost alternative that only needs a consumer-grade 360 video camera. Our semi-automated pipeline compri… ▽ More

    Submitted 15 May, 2025; v1 submitted 5 May, 2025; originally announced May 2025.

  15. arXiv:2504.21845  [pdf, ps, other

    quant-ph cs.IT

    On the Efficacy of the Peeling Decoder for the Quantum Expander Code

    Authors: Jefrin Sharmitha Prabhu, Abhinav Vaishya, Shobhit Bhatnagar, Aryaman Manish Kolhe, V. Lalitha, P. Vijay Kumar

    Abstract: The problem of recovering from qubit erasures has recently gained attention as erasures occur in many physical systems such as photonic systems, trapped ions, superconducting qubits and circuit quantum electrodynamics. While several linear-time decoders for error correction are known, their error-correcting capability is limited to half the minimum distance of the code, whereas erasure correction… ▽ More

    Submitted 29 May, 2025; v1 submitted 30 April, 2025; originally announced April 2025.

  16. Probing the Unknown: Exploring Student Interactions with Probeable Problems at Scale in Introductory Programming

    Authors: Paul Denny, Viraj Kumar, Stephen MacNeil, James Prather, Juho Leinonen

    Abstract: Introductory programming courses often rely on small code-writing exercises that have clearly specified problem statements. This limits opportunities for students to practice how to clarify ambiguous requirements -- a critical skill in real-world programming. In addition, the emerging capabilities of large language models (LLMs) to produce code from well-defined specifications may harm student eng… ▽ More

    Submitted 15 April, 2025; originally announced April 2025.

    Comments: Accepted at ITiCSE 2025

  17. arXiv:2503.18668  [pdf, other

    cs.LG

    Geometric Preference Elicitation for Minimax Regret Optimization in Uncertainty Matroids

    Authors: Aditya Sai Ellendula, Arun K Pujari, Vikas Kumar, Venkateswara Rao Kagita

    Abstract: This paper presents an efficient preference elicitation framework for uncertain matroid optimization, where precise weight information is unavailable, but insights into possible weight values are accessible. The core innovation of our approach lies in its ability to systematically elicit user preferences, aligning the optimization process more closely with decision-makers' objectives. By increment… ▽ More

    Submitted 25 March, 2025; v1 submitted 24 March, 2025; originally announced March 2025.

  18. Genicious: Contextual Few-shot Prompting for Insights Discovery

    Authors: Vineet Kumar, Ronald Tony, Darshita Rathore, Vipasha Rana, Bhuvanesh Mandora, Kanishka, Chetna Bansal, Anindya Moitra

    Abstract: Data and insights discovery is critical for decision-making in modern organizations. We present Genicious, an LLM-aided interface that enables users to interact with tabular datasets and ask complex queries in natural language. By benchmarking various prompting strategies and language models, we have developed an end-to-end tool that leverages contextual few-shot prompting, achieving superior perf… ▽ More

    Submitted 15 March, 2025; originally announced March 2025.

    Comments: 5 pages, 3 figures, CODS-COMAD Dec 24, Jodhpur, India

  19. arXiv:2503.11985  [pdf, other

    cs.CL cs.AI

    No LLM is Free From Bias: A Comprehensive Study of Bias Evaluation in Large Language Models

    Authors: Charaka Vinayak Kumar, Ashok Urlana, Gopichand Kanumolu, Bala Mallikarjunarao Garlapati, Pruthwik Mishra

    Abstract: Advancements in Large Language Models (LLMs) have increased the performance of different natural language understanding as well as generation tasks. Although LLMs have breached the state-of-the-art performance in various tasks, they often reflect different forms of bias present in the training data. In the light of this perceived limitation, we provide a unified evaluation of benchmarks using a se… ▽ More

    Submitted 27 May, 2025; v1 submitted 14 March, 2025; originally announced March 2025.

    Comments: 12 pages, 1 figure

  20. arXiv:2503.11665  [pdf, other

    cs.AR cs.DB cs.ET

    Towards Efficient Flash Caches with Emerging NVMe Flexible Data Placement SSDs

    Authors: Michael Allison, Arun George, Javier Gonzalez, Dan Helmick, Vikash Kumar, Roshan Nair, Vivek Shah

    Abstract: NVMe Flash-based SSDs are widely deployed in data centers to cache working sets of large-scale web services. As data centers face increasing sustainability demands, such as reduced carbon emissions, efficient management of Flash overprovisioning and endurance has become crucial. Our analysis demonstrates that mixing data with different lifetimes on Flash blocks results in high device garbage colle… ▽ More

    Submitted 21 February, 2025; originally announced March 2025.

    Comments: This is a pre-print version of the paper that will appear at Eurosys 2025

  21. arXiv:2503.07885  [pdf, other

    cs.RO cs.AI

    Safety Guardrails for LLM-Enabled Robots

    Authors: Zachary Ravichandran, Alexander Robey, Vijay Kumar, George J. Pappas, Hamed Hassani

    Abstract: Although the integration of large language models (LLMs) into robotics has unlocked transformative capabilities, it has also introduced significant safety concerns, ranging from average-case LLM errors (e.g., hallucinations) to adversarial jailbreaking attacks, which can produce harmful robot behavior in real-world settings. Traditional robot safety approaches do not address the novel vulnerabilit… ▽ More

    Submitted 10 March, 2025; originally announced March 2025.

  22. arXiv:2503.04615  [pdf, other

    cs.CL

    HalluCounter: Reference-free LLM Hallucination Detection in the Wild!

    Authors: Ashok Urlana, Gopichand Kanumolu, Charaka Vinayak Kumar, Bala Mallikarjunarao Garlapati, Rahul Mishra

    Abstract: Response consistency-based, reference-free hallucination detection (RFHD) methods do not depend on internal model states, such as generation probabilities or gradients, which Grey-box models typically rely on but are inaccessible in closed-source LLMs. However, their inability to capture query-response alignment patterns often results in lower detection accuracy. Additionally, the lack of large-sc… ▽ More

    Submitted 27 May, 2025; v1 submitted 6 March, 2025; originally announced March 2025.

    Comments: 30 pages, 3 figures

  23. arXiv:2503.02208  [pdf, other

    cs.RO

    ADMM-MCBF-LCA: A Layered Control Architecture for Safe Real-Time Navigation

    Authors: Anusha Srikanthan, Yifan Xue, Vijay Kumar, Nikolai Matni, Nadia Figueroa

    Abstract: We consider the problem of safe real-time navigation of a robot in a dynamic environment with moving obstacles of arbitrary smooth geometries and input saturation constraints. We assume that the robot detects and models nearby obstacle boundaries with a short-range sensor and that this detection is error-free. This problem presents three main challenges: i) input constraints, ii) safety, and iii)… ▽ More

    Submitted 7 March, 2025; v1 submitted 3 March, 2025; originally announced March 2025.

  24. arXiv:2503.01872  [pdf, other

    cs.LG cs.AI cs.CV

    FairGen: Controlling Sensitive Attributes for Fair Generations in Diffusion Models via Adaptive Latent Guidance

    Authors: Mintong Kang, Vinayshekhar Bannihatti Kumar, Shamik Roy, Abhishek Kumar, Sopan Khosla, Balakrishnan Murali Narayanaswamy, Rashmi Gangadharaiah

    Abstract: Text-to-image diffusion models often exhibit biases toward specific demographic groups, such as generating more males than females when prompted to generate images of engineers, raising ethical concerns and limiting their adoption. In this paper, we tackle the challenge of mitigating generation bias towards any target attribute value (e.g., "male" for "gender") in diffusion models while preserving… ▽ More

    Submitted 25 February, 2025; originally announced March 2025.

    Comments: Under submission

  25. arXiv:2502.20386  [pdf, other

    cs.RO

    ATLAS Navigator: Active Task-driven LAnguage-embedded Gaussian Splatting

    Authors: Dexter Ong, Yuezhan Tao, Varun Murali, Igor Spasojevic, Vijay Kumar, Pratik Chaudhari

    Abstract: We address the challenge of task-oriented navigation in unstructured and unknown environments, where robots must incrementally build and reason on rich, metric-semantic maps in real time. Since tasks may require clarification or re-specification, it is necessary for the information in the map to be rich enough to enable generalization across a wide range of tasks. To effectively execute tasks spec… ▽ More

    Submitted 27 February, 2025; originally announced February 2025.

  26. arXiv:2502.20204  [pdf, other

    cs.IR cs.CL

    Granite Embedding Models

    Authors: Parul Awasthy, Aashka Trivedi, Yulong Li, Mihaela Bornea, David Cox, Abraham Daniels, Martin Franz, Gabe Goodhart, Bhavani Iyer, Vishwajeet Kumar, Luis Lastras, Scott McCarley, Rudra Murthy, Vignesh P, Sara Rosenthal, Salim Roukos, Jaydeep Sen, Sukriti Sharma, Avirup Sil, Kate Soule, Arafat Sultan, Radu Florian

    Abstract: We introduce the Granite Embedding models, a family of encoder-based embedding models designed for retrieval tasks, spanning dense-retrieval and sparse retrieval architectures, with both English and Multilingual capabilities. This report provides the technical details of training these highly effective 12 layer embedding models, along with their efficient 6 layer distilled counterparts. Extensive… ▽ More

    Submitted 27 February, 2025; originally announced February 2025.

  27. arXiv:2502.15297  [pdf, other

    astro-ph.IM cs.LG quant-ph

    Comparative Analysis of Black Hole Mass Estimation in Type-2 AGNs: Classical vs. Quantum Machine Learning and Deep Learning Approaches

    Authors: Sathwik Narkedimilli, Venkata Sriram Amballa, N V Saran Kumar, R Arun Kumar, R Praneeth Reddy, Satvik Raghav, Manish M, Aswath Babu H

    Abstract: In the case of Type-2 AGNs, estimating the mass of the black hole is challenging. Understanding how galaxies form and evolve requires considerable insight into the mass of black holes. This work compared different classical and quantum machine learning (QML) algorithms for black hole mass estimation, wherein the classical algorithms are Linear Regression, XGBoost Regression, Random Forest Regresso… ▽ More

    Submitted 24 February, 2025; v1 submitted 21 February, 2025; originally announced February 2025.

    Comments: 29 pages, 12 Figures, 6 Tables

  28. arXiv:2502.13135  [pdf, ps, other

    cs.LG cs.AI cs.CL

    Sleepless Nights, Sugary Days: Creating Synthetic Users with Health Conditions for Realistic Coaching Agent Interactions

    Authors: Taedong Yun, Eric Yang, Mustafa Safdari, Jong Ha Lee, Vaishnavi Vinod Kumar, S. Sara Mahdavi, Jonathan Amar, Derek Peyton, Reut Aharony, Andreas Michaelides, Logan Schneider, Isaac Galatzer-Levy, Yugang Jia, John Canny, Arthur Gretton, Maja Matarić

    Abstract: We present an end-to-end framework for generating synthetic users for evaluating interactive agents designed to encourage positive behavior changes, such as in health and lifestyle coaching. The synthetic users are grounded in health and lifestyle conditions, specifically sleep and diabetes management in this study, to ensure realistic interactions with the health coaching agent. Synthetic users a… ▽ More

    Submitted 4 June, 2025; v1 submitted 18 February, 2025; originally announced February 2025.

    Comments: Accepted to the 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025)

    ACM Class: I.2.7

  29. arXiv:2502.12723  [pdf, other

    cs.CV

    myEye2Wheeler: A Two-Wheeler Indian Driver Real-World Eye-Tracking Dataset

    Authors: Bhaiya Vaibhaw Kumar, Deepti Rawat, Tanvi Kandalla, Aarnav Nagariya, Kavita Vemuri

    Abstract: This paper presents the myEye2Wheeler dataset, a unique resource of real-world gaze behaviour of two-wheeler drivers navigating complex Indian traffic. Most datasets are from four-wheeler drivers on well-planned roads and homogeneous traffic. Our dataset offers a critical lens into the unique visual attention patterns and insights into the decision-making of Indian two-wheeler drivers. The analysi… ▽ More

    Submitted 18 February, 2025; originally announced February 2025.

  30. arXiv:2502.10481  [pdf, other

    cs.LG

    Chronic Diseases Prediction Using ML

    Authors: Sri Varsha Mulakala, G. Neeharika, P. Vinay Kumar, A. Bhargava Kiran

    Abstract: The recent increase in morbidity is primarily due to chronic diseases including Diabetes, Heart disease, Lung cancer, and brain tumours. The results for patients can be improved, and the financial burden on the healthcare system can be lessened, through the early detection and prevention of certain disorders. In this study, we built a machine-learning model for predicting the existence of numerous… ▽ More

    Submitted 13 February, 2025; originally announced February 2025.

  31. arXiv:2502.03127  [pdf, other

    cs.SE

    A Framework for Measuring the Quality of Infrastructure-as-Code Scripts

    Authors: Pandu Ranga Reddy Konala, Vimal Kumar, David Bainbridge, Junaid Haseeb

    Abstract: Infrastructure as Code (IaC) has become integral to modern software development, enabling automated and consistent configuration of computing environments. The rapid proliferation of IaC scripts has highlighted the need for better code quality assessment methods. This paper proposes a new IaC code quality framework specifically showcased for Ansible repositories as a foundation. By analyzing a com… ▽ More

    Submitted 5 February, 2025; originally announced February 2025.

  32. arXiv:2502.01653  [pdf, other

    quant-ph cs.NI

    Quantum Internet: Technologies, Protocols, and Research Challenges

    Authors: Vinay Kumar, Claudio Cicconetti, Marco Conti, Andrea Passarella

    Abstract: As the field of the quantum internet advances, a comprehensive guide to navigate its complexities has become increasingly crucial. While quantum computing shares foundational principles with the quantum internet, distinguishing between the two is essential for further development and deeper understanding. This work systematically introduces the quantum internet by discussing its importance, core c… ▽ More

    Submitted 18 March, 2025; v1 submitted 30 January, 2025; originally announced February 2025.

    Comments: 50 pages, 11 figures

  33. arXiv:2501.18169  [pdf, other

    cs.NI

    Chip-to-chip photonic connectivity in multi-accelerator servers for ML

    Authors: Abhishek Vijaya Kumar, Arjun Devraj, Darius Bunandar, Rachee Singh

    Abstract: We present a rack-scale compute architecture for ML using multi-accelerator servers connected via chip-to-chip silicon photonic components. Our architecture achieves (1) multi-tenanted resource slicing without fragmentation, (2) 74% faster rack-scale collective communication, and (3) 1.7X speedup in end-to-end ML training throughput.

    Submitted 30 January, 2025; originally announced January 2025.

    Comments: Accepted at OFC 2025, https://www.ofcconference.org/en-us/home/program-speakers/symposia/advanced-packaging-and-integrated-optics/

  34. arXiv:2501.02653  [pdf, ps, other

    cs.CC

    New Pseudorandom Generators and Correlation Bounds Using Extractors

    Authors: Vinayak M. Kumar

    Abstract: We establish new correlation bounds and pseudorandom generators for a collection of computation models. These models are all natural generalizations of structured low-degree $F_2$-polynomials that we did not have correlation bounds for before. In particular: 1. We construct a PRG for width-2 $poly(n)$-length branching programs which read $d$ bits at a time with seed length… ▽ More

    Submitted 5 January, 2025; originally announced January 2025.

    Comments: 34 Pages, ITCS 2025

  35. arXiv:2501.01934  [pdf, other

    cs.LG physics.flu-dyn

    Fusion-DeepONet: A Data-Efficient Neural Operator for Geometry-Dependent Hypersonic and Supersonic Flows

    Authors: Ahmad Peyvan, Varun Kumar, George Em Karniadakis

    Abstract: Shape optimization is essential in aerospace vehicle design, including reentry systems, and propulsion system components, as it directly influences aerodynamic efficiency, structural integrity, and overall mission success. Rapid and accurate prediction of external and internal flows accelerates design iterations. To this end, we develop a new variant of DeepONet, called Fusion-DeepONet as a fast s… ▽ More

    Submitted 23 May, 2025; v1 submitted 3 January, 2025; originally announced January 2025.

  36. arXiv:2412.18344  [pdf, ps, other

    math.DS cs.LG

    Predator Prey Scavenger Model using Holling's Functional Response of Type III and Physics-Informed Deep Neural Networks

    Authors: Aneesh Panchal, Kirti Beniwal, Vivek Kumar

    Abstract: Nonlinear mathematical models introduce the relation between various physical and biological interactions present in nature. One of the most famous models is the Lotka-Volterra model which defined the interaction between predator and prey species present in nature. However, predators, scavengers, and prey populations coexist in a natural system where scavengers can additionally rely on the dead bo… ▽ More

    Submitted 24 December, 2024; originally announced December 2024.

  37. arXiv:2412.12981  [pdf, other

    cs.CL

    Unlocking LLMs: Addressing Scarce Data and Bias Challenges in Mental Health

    Authors: Vivek Kumar, Eirini Ntoutsi, Pushpraj Singh Rajawat, Giacomo Medda, Diego Reforgiato Recupero

    Abstract: Large language models (LLMs) have shown promising capabilities in healthcare analysis but face several challenges like hallucinations, parroting, and bias manifestation. These challenges are exacerbated in complex, sensitive, and low-resource domains. Therefore, in this work we introduce IC-AnnoMI, an expert-annotated motivational interviewing (MI) dataset built upon AnnoMI by generating in-contex… ▽ More

    Submitted 17 December, 2024; originally announced December 2024.

    Comments: International Conference on Natural Language Processing and Artificial Intelligence for Cyber Security (NLPAICS) 2024

  38. arXiv:2412.12602  [pdf, other

    cs.RO cs.HC

    Don't Yell at Your Robot: Physical Correction as the Collaborative Interface for Language Model Powered Robots

    Authors: Chuye Zhang, Yifei Simon Shao, Harshil Parekh, Junyao Shi, Pratik Chaudhari, Vijay Kumar, Nadia Figueroa

    Abstract: We present a novel approach for enhancing human-robot collaboration using physical interactions for real-time error correction of large language model (LLM) powered robots. Unlike other methods that rely on verbal or text commands, the robot leverages an LLM to proactively executes 6 DoF linear Dynamical System (DS) commands using a description of the scene in natural language. During motion, a hu… ▽ More

    Submitted 17 December, 2024; originally announced December 2024.

    Comments: 7 pages, 3 figures; Generative Modeling meets HRI - RSS'24 Workshop

  39. arXiv:2412.11967  [pdf, other

    cs.LG eess.SY

    A Digital twin for Diesel Engines: Operator-infused PINNs with Transfer Learning for Engine Health Monitoring

    Authors: Kamaljyoti Nath, Varun Kumar, Daniel J. Smith, George Em Karniadakis

    Abstract: Improving diesel engine efficiency and emission reduction have been critical research topics. Recent government regulations have shifted this focus to another important area related to engine health and performance monitoring. Although the advancements in the use of deep learning methods for system monitoring have shown promising results in this direction, designing efficient methods suitable for… ▽ More

    Submitted 16 December, 2024; originally announced December 2024.

  40. arXiv:2412.11474  [pdf, other

    cs.CR

    Noise-Resilient Homomorphic Encryption: A Framework for Secure Data Processing in Health care Domain

    Authors: B. Shuriya, S. Vimal Kumar, K. Bagyalakshmi

    Abstract: In this paper, we introduce the Fully Homomorphic Integrity Model (HIM), a novel approach designed to enhance security, efficiency, and reliability in encrypted data processing, primarily within the health care industry. HIM addresses the key challenges that noise accumulation, computational overheads, and data integrity pose during homomorphic operations. Our contribution of HIM: advances in nois… ▽ More

    Submitted 16 December, 2024; originally announced December 2024.

    Comments: Confirmed

  41. arXiv:2412.06842  [pdf, other

    cs.LG

    Partition of Unity Physics-Informed Neural Networks (POU-PINNs): An Unsupervised Framework for Physics-Informed Domain Decomposition and Mixtures of Experts

    Authors: Arturo Rodriguez, Ashesh Chattopadhyay, Piyush Kumar, Luis F. Rodriguez, Vinod Kumar

    Abstract: Physics-informed neural networks (PINNs) commonly address ill-posed inverse problems by uncovering unknown physics. This study presents a novel unsupervised learning framework that identifies spatial subdomains with specific governing physics. It uses the partition of unity networks (POUs) to divide the space into subdomains, assigning unique nonlinear model parameters to each, which are integrate… ▽ More

    Submitted 7 December, 2024; originally announced December 2024.

    Comments: 26 pages

  42. arXiv:2412.03573  [pdf, other

    cs.IR cs.AI cs.CL

    Improving Tool Retrieval by Leveraging Large Language Models for Query Generation

    Authors: Mohammad Kachuee, Sarthak Ahuja, Vaibhav Kumar, Puyang Xu, Xiaohu Liu

    Abstract: Using tools by Large Language Models (LLMs) is a promising avenue to extend their reach beyond language or conversational settings. The number of tools can scale to thousands as they enable accessing sensory information, fetching updated factual knowledge, or taking actions in the real world. In such settings, in-context learning by providing a short list of relevant tools in the prompt is a viabl… ▽ More

    Submitted 16 November, 2024; originally announced December 2024.

    Journal ref: COLING 2025

  43. arXiv:2412.02647  [pdf, other

    cs.IT

    Quaternary and Component-Binary Spreading Codes with Low Correlation for Navigation Systems

    Authors: P. Vijay Kumar, Sugandh Mishra, Dileep Dharmappa

    Abstract: In the first part of this two-part paper, we construct a family MFD$_2$ of low-correlation quaternary spreading codes having period $2046$. By quaternary, we mean that the spreading code symbols are drawn from $Z_4$ and are designed to be used in conjunction with QPSK modulation. Apart from low auto and crosscorrelation properties, we also require in addition, to our knowledge for the first time,… ▽ More

    Submitted 3 December, 2024; originally announced December 2024.

  44. arXiv:2412.01248  [pdf, other

    cs.CV

    Multimodal Fusion Learning with Dual Attention for Medical Imaging

    Authors: Joy Dhar, Nayyar Zaidi, Maryam Haghighat, Puneet Goyal, Sudipta Roy, Azadeh Alavi, Vikas Kumar

    Abstract: Multimodal fusion learning has shown significant promise in classifying various diseases such as skin cancer and brain tumors. However, existing methods face three key limitations. First, they often lack generalizability to other diagnosis tasks due to their focus on a particular disease. Second, they do not fully leverage multiple health records from diverse modalities to learn robust complementa… ▽ More

    Submitted 2 December, 2024; originally announced December 2024.

    Comments: 10 pages

    Journal ref: IEEE/CVF Winter Conference on Applications of Computer Vision WACV 2025

  45. arXiv:2411.03303  [pdf, other

    cs.RO

    Monocular Event-Based Vision for Obstacle Avoidance with a Quadrotor

    Authors: Anish Bhattacharya, Marco Cannici, Nishanth Rao, Yuezhan Tao, Vijay Kumar, Nikolai Matni, Davide Scaramuzza

    Abstract: We present the first static-obstacle avoidance method for quadrotors using just an onboard, monocular event camera. Quadrotors are capable of fast and agile flight in cluttered environments when piloted manually, but vision-based autonomous flight in unknown environments is difficult in part due to the sensor limitations of traditional onboard cameras. Event cameras, however, promise nearly zero m… ▽ More

    Submitted 5 November, 2024; originally announced November 2024.

    Comments: 18 pages with supplementary

    Journal ref: Conference on Robot Learning (CoRL), Munich, Germany, 2024

  46. arXiv:2411.02538  [pdf, other

    cs.CL

    MILU: A Multi-task Indic Language Understanding Benchmark

    Authors: Sshubam Verma, Mohammed Safi Ur Rahman Khan, Vishwajeet Kumar, Rudra Murthy, Jaydeep Sen

    Abstract: Evaluating Large Language Models (LLMs) in low-resource and linguistically diverse languages remains a significant challenge in NLP, particularly for languages using non-Latin scripts like those spoken in India. Existing benchmarks predominantly focus on English, leaving substantial gaps in assessing LLM capabilities in these languages. We introduce MILU, a Multi task Indic Language Understanding… ▽ More

    Submitted 4 February, 2025; v1 submitted 4 November, 2024; originally announced November 2024.

  47. arXiv:2410.22143  [pdf, other

    cs.CL

    AmpleGCG-Plus: A Strong Generative Model of Adversarial Suffixes to Jailbreak LLMs with Higher Success Rates in Fewer Attempts

    Authors: Vishal Kumar, Zeyi Liao, Jaylen Jones, Huan Sun

    Abstract: Although large language models (LLMs) are typically aligned, they remain vulnerable to jailbreaking through either carefully crafted prompts in natural language or, interestingly, gibberish adversarial suffixes. However, gibberish tokens have received relatively less attention despite their success in attacking aligned LLMs. Recent work, AmpleGCG~\citep{liao2024amplegcg}, demonstrates that a gener… ▽ More

    Submitted 29 October, 2024; originally announced October 2024.

  48. arXiv:2410.19865  [pdf, other

    cs.LG cs.AI physics.ao-ph

    Evaluating Deep Learning Approaches for Predictions in Unmonitored Basins with Continental-scale Stream Temperature Models

    Authors: Jared D. Willard, Fabio Ciulla, Helen Weierbach, Vipin Kumar, Charuleka Varadharajan

    Abstract: The prediction of streamflows and other environmental variables in unmonitored basins is a grand challenge in hydrology. Recent machine learning (ML) models can harness vast datasets for accurate predictions at large spatial scales. However, there are open questions regarding model design and data needed for inputs and training to improve performance. This study explores these questions while demo… ▽ More

    Submitted 23 October, 2024; originally announced October 2024.

    Comments: 47 pages, 12 figures, 7 tables, submitted to Water Resources Research

  49. arXiv:2410.14137  [pdf, other

    cs.LG

    Hierarchical Conditional Multi-Task Learning for Streamflow Modeling

    Authors: Shaoming Xu, Arvind Renganathan, Ankush Khandelwal, Rahul Ghosh, Xiang Li, Licheng Liu, Kshitij Tayal, Peter Harrington, Xiaowei Jia, Zhenong Jin, Jonh Nieber, Vipin Kumar

    Abstract: Streamflow, vital for water resource management, is governed by complex hydrological systems involving intermediate processes driven by meteorological forces. While deep learning models have achieved state-of-the-art results of streamflow prediction, their end-to-end single-task learning approach often fails to capture the causal relationships within these systems. To address this, we propose Hier… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

  50. arXiv:2410.13691  [pdf, other

    cs.RO cs.AI

    Jailbreaking LLM-Controlled Robots

    Authors: Alexander Robey, Zachary Ravichandran, Vijay Kumar, Hamed Hassani, George J. Pappas

    Abstract: The recent introduction of large language models (LLMs) has revolutionized the field of robotics by enabling contextual reasoning and intuitive human-robot interaction in domains as varied as manipulation, locomotion, and self-driving vehicles. When viewed as a stand-alone technology, LLMs are known to be vulnerable to jailbreaking attacks, wherein malicious prompters elicit harmful text by bypass… ▽ More

    Submitted 9 November, 2024; v1 submitted 17 October, 2024; originally announced October 2024.