Showing 1–50 of 253 results for author: Agarwal, R

  1. arXiv:2510.23554  [pdf, ps, other]

    cs.LG cs.CL cs.CV

    A U-Net and Transformer Pipeline for Multilingual Image Translation

    Authors: Siddharth Sahay, Radhika Agarwal

    Abstract: This paper presents an end-to-end multilingual translation pipeline that integrates a custom U-Net for text detection, the Tesseract engine for text recognition, and a from-scratch sequence-to-sequence (Seq2Seq) Transformer for Neural Machine Translation (NMT). Our approach first utilizes a U-Net model, trained on a synthetic dataset, to accurately segment and detect text regions from an image. T…

    Submitted 27 October, 2025; originally announced October 2025.

    Comments: 6 pages, 3 figures, 5 tables, and 2 algorithms. Prepared in IEEE double-column format

  2. arXiv:2510.23510  [pdf, ps, other]

    cs.NI

    How to build a sovereign network? -- A proposal to measure network sovereignty

    Authors: Shakthivelu Janardhanan, Ritanshi Agarwal, Wolfgang Kellerer, Carmen Mas-Machuca

    Abstract: Network sovereignty is a network operator's ability to reduce the dependency on component manufacturers to minimize the impact of manufacturer failures. Network operators now face new design challenges to increase network sovereignty and avoid vendor lock-in problems because a high dependency on a manufacturer corresponds to low survivability if that manufacturer is unavailable. The main contribut…

    Submitted 27 October, 2025; originally announced October 2025.

  3. arXiv:2510.22395  [pdf, ps, other]

    cs.CL

    Confabulations from ACL Publications (CAP): A Dataset for Scientific Hallucination Detection

    Authors: Federica Gamba, Aman Sinha, Timothee Mickus, Raul Vazquez, Patanjali Bhamidipati, Claudio Savelli, Ahana Chattopadhyay, Laura A. Zanella, Yash Kankanampati, Binesh Arakkal Remesh, Aryan Ashok Chandramania, Rohit Agarwal, Chuyuan Li, Ioana Buhnila, Radhika Mamidi

    Abstract: We introduce the CAP (Confabulations from ACL Publications) dataset, a multilingual resource for studying hallucinations in large language models (LLMs) within scientific text generation. CAP focuses on the scientific domain, where hallucinations can distort factual knowledge, as they frequently do. In this domain, however, the presence of specialized terminology, statistical reasoning, and contex…

    Submitted 25 October, 2025; originally announced October 2025.

  4. arXiv:2510.13786  [pdf, ps, other]

    cs.LG cs.AI

    The Art of Scaling Reinforcement Learning Compute for LLMs

    Authors: Devvrit Khatri, Lovish Madaan, Rishabh Tiwari, Rachit Bansal, Sai Surya Duvvuri, Manzil Zaheer, Inderjit S. Dhillon, David Brandfonbrener, Rishabh Agarwal

    Abstract: Reinforcement learning (RL) has become central to training large language models (LLMs), yet the field lacks predictive scaling methodologies comparable to those established for pre-training. Despite rapidly rising compute budgets, there is no principled understanding of how to evaluate algorithmic improvements for scaling RL compute. We present the first large-scale systematic study, amounting to…

    Submitted 15 October, 2025; originally announced October 2025.

    Comments: 28 pages, 20 figures

  5. arXiv:2510.03997  [pdf, ps, other]

    cs.CL

    Mapping Patient-Perceived Physician Traits from Nationwide Online Reviews with LLMs

    Authors: Junjie Luo, Rui Han, Arshana Welivita, Zeleikun Di, Jingfu Wu, Xuzhe Zhi, Ritu Agarwal, Gordon Gao

    Abstract: Understanding how patients perceive their physicians is essential to improving trust, communication, and satisfaction. We present a large language model (LLM)-based pipeline that infers Big Five personality traits and five patient-oriented subjective judgments. The analysis encompasses 4.1 million patient reviews of 226,999 U.S. physicians from an initial pool of one million. We validate the metho…

    Submitted 4 October, 2025; originally announced October 2025.

  6. arXiv:2509.24263  [pdf, ps, other]

    cs.AI cs.CL

    PAME-AI: Patient Messaging Creation and Optimization using Agentic AI

    Authors: Junjie Luo, Yihong Guo, Anqi Liu, Ritu Agarwal, Gordon Gao

    Abstract: Messaging patients is a critical part of healthcare communication, helping to improve things like medication adherence and healthy behaviors. However, traditional mobile message design has significant limitations due to its inability to explore the high-dimensional design space. We develop PAME-AI, a novel approach for Patient Messaging Creation and Optimization using Agentic AI. Built on the Data…

    Submitted 30 September, 2025; v1 submitted 29 September, 2025; originally announced September 2025.

  7. arXiv:2509.23952  [pdf, ps, other]

    physics.optics cond-mat.mes-hall

    General Framework for Twisted Bilayer Photonic Crystal with Interlayer Coupling and Far-Field Response

    Authors: Shupeng Xu, Dun Wang, Ritesh Agarwal

    Abstract: We develop a general theory for twisted bilayer photonic crystals that takes into account both far-field response and near-field coupling. The theory is based on the framework of a generalized Rayleigh-Schrödinger perturbation theory for non-Hermitian Hamiltonians. A universal form for interlayer coupling is derived, which relates the hopping strength to the Fourier transforms of the Wannier funct…

    Submitted 28 September, 2025; originally announced September 2025.

  8. arXiv:2509.12592  [pdf]

    cs.AI cs.CL

    Match Chat: Real Time Generative AI and Generative Computing for Tennis

    Authors: Aaron Baughman, Gozde Akay, Eduardo Morales, Rahul Agarwal, Preetika Srivastava

    Abstract: We present Match Chat, a real-time, agent-driven assistant designed to enhance the tennis fan experience by delivering instant, accurate responses to match-related queries. Match Chat integrates Generative Artificial Intelligence (GenAI) with Generative Computing (GenComp) techniques to synthesize key insights during live tennis singles matches. The system debuted at the 2025 Wimbledon Championshi…

    Submitted 15 September, 2025; originally announced September 2025.

    Comments: 12 pages, 5 Figures, 4 Tables

  9. arXiv:2509.00106  [pdf, ps, other]

    eess.AS cs.SD

    Quantum-Enhanced Analysis and Grading of Vocal Performance

    Authors: Rohan Agarwal

    Abstract: We present QuantumMelody, a hybrid quantum-classical method for objective singing assessment. Grouped vocal features (pitch stability, dynamics, timbre) are encoded into a small simulated quantum circuit; all nine qubits are initialized with a Hadamard on each qubit and then receive Rx, Ry, and Rz rotations, with intra- and cross-group entanglement. The circuit measurement probabilities are fused…

    Submitted 27 August, 2025; originally announced September 2025.

    Comments: 4 pages, 5 figures. Hybrid quantum-classical feasibility study; simulator-only results

    ACM Class: H.5.5; I.2.6; I.5.4

  10. arXiv:2508.09484  [pdf, ps, other]

    physics.plasm-ph hep-ph

    Exact expressions for nonperturbative guiding center theory in symmetric fields

    Authors: I. Hollas, R. Agarwal, J. W. Burby, A. J. Brizard

    Abstract: We apply a recently-developed nonperturbative guiding center formalism to charged particle dynamics in fields with two-parameter continuous symmetry groups. This entails finding exact constants of motion, valid in the nonperturbative regime, that agree with Kruskal's adiabatic invariant series to all orders in the perturbative regime, when the field scale length is large compared with a typical gy…

    Submitted 13 August, 2025; originally announced August 2025.

    Comments: 30 pages, 3 figures

  11. arXiv:2508.04301  [pdf, ps, other]

    cs.CE math.DS nlin.CD

    Extreme Event Precursor Prediction in Turbulent Dynamical Systems via CNN-Augmented Recurrence Analysis

    Authors: Rahul Agarwal, Mustafa A. Mohamad

    Abstract: We present a general framework to predict precursors to extreme events in turbulent dynamical systems. The approach combines phase-space reconstruction techniques with recurrence matrices and convolutional neural networks to identify precursors to extreme events. We evaluate the framework across three distinct testbed systems: a triad turbulent interaction model, a prototype stochastic anisotropic…

    Submitted 6 August, 2025; originally announced August 2025.

  12. arXiv:2507.16217  [pdf, ps, other]

    cs.CL cs.AI cs.LG

    Towards Compute-Optimal Many-Shot In-Context Learning

    Authors: Shahriar Golchin, Yanfei Chen, Rujun Han, Manan Gandhi, Tianli Yu, Swaroop Mishra, Mihai Surdeanu, Rishabh Agarwal, Chen-Yu Lee, Tomas Pfister

    Abstract: Long-context large language models (LLMs) are able to process inputs containing up to several million tokens. In the scope of in-context learning (ICL), this translates into using hundreds/thousands of demonstrations in the input prompt, enabling many-shot ICL. In practice, a fixed set of demonstrations is often selected at random in many-shot settings due to (1) high inference costs, (2) the bene…

    Submitted 29 August, 2025; v1 submitted 22 July, 2025; originally announced July 2025.

    Comments: Final version; accepted at COLM 2025

  13. arXiv:2507.07229  [pdf, ps, other]

    cs.CL

    SynthTextEval: Synthetic Text Data Generation and Evaluation for High-Stakes Domains

    Authors: Krithika Ramesh, Daniel Smolyak, Zihao Zhao, Nupoor Gandhi, Ritu Agarwal, Margrét Bjarnadóttir, Anjalie Field

    Abstract: We present SynthTextEval, a toolkit for conducting comprehensive evaluations of synthetic text. The fluency of large language model (LLM) outputs has made synthetic text potentially viable for numerous applications, such as reducing the risks of privacy violations in the development and deployment of AI systems in high-stakes domains. Realizing this potential, however, requires principled consiste…

    Submitted 9 July, 2025; originally announced July 2025.

  14. arXiv:2507.06261  [pdf, ps, other]

    cs.CL cs.AI

    Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities

    Authors: Gheorghe Comanici, Eric Bieber, Mike Schaekermann, Ice Pasupat, Noveen Sachdeva, Inderjit Dhillon, Marcel Blistein, Ori Ram, Dan Zhang, Evan Rosen, Luke Marris, Sam Petulla, Colin Gaffney, Asaf Aharoni, Nathan Lintz, Tiago Cardal Pais, Henrik Jacobsson, Idan Szpektor, Nan-Jiang Jiang, Krishna Haridasan, Ahmed Omran, Nikunj Saunshi, Dara Bahri, Gaurav Mishra, Eric Chu , et al. (3410 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 2.X model family: Gemini 2.5 Pro and Gemini 2.5 Flash, as well as our earlier Gemini 2.0 Flash and Flash-Lite models. Gemini 2.5 Pro is our most capable model yet, achieving SoTA performance on frontier coding and reasoning benchmarks. In addition to its incredible coding and reasoning skills, Gemini 2.5 Pro is a thinking model that excels at multimodal unde…

    Submitted 16 October, 2025; v1 submitted 7 July, 2025; originally announced July 2025.

    Comments: 72 pages, 17 figures

  15. arXiv:2506.20804  [pdf, ps, other]

    cs.RO

    Online Planning for Cooperative Air-Ground Robot Systems with Unknown Fuel Requirements

    Authors: Ritvik Agarwal, Behnoushsadat Hatami, Alvika Gautam, Parikshit Maini

    Abstract: We consider an online variant of the fuel-constrained UAV routing problem with a ground-based mobile refueling station (FCURP-MRS), where targets incur unknown fuel costs. We develop a two-phase solution: an offline heuristic-based planner computes initial UAV and UGV paths, and a novel online planning algorithm that dynamically adjusts rendezvous points based on real-time fuel consumption during…

    Submitted 25 June, 2025; originally announced June 2025.

    Comments: Submitted to RSS (MRS Workshop)

  16. FEWSim: A Visual Analytic Framework for Exploring the Nexus of Food-Energy-Water Simulations

    Authors: Fan Lei, David A. Sampson, Jiayi Hong, Yuxin Ma, Giuseppe Mascaro, Dave White, Rimjhim Agarwal, Ross Maciejewski

    Abstract: The interdependencies of food, energy, and water (FEW) systems create a nexus opportunity to explore the strengths and vulnerabilities of individual and cross-sector interactions within FEW systems. However, the variables quantifying nexus interactions are hard to observe, which hinders the cross-sector analysis. To overcome such challenges, we present FEWSim, a visual analytics framework designed…

    Submitted 16 June, 2025; originally announced June 2025.

    Comments: Accepted by IEEE Computer Graphics and Applications (CG&A)

  17. arXiv:2506.06798  [pdf, ps, other]

    cs.RO

    SARAL-Bot: Autonomous Robot for Strawberry Plant Care

    Authors: Arif Ahmed, Ritvik Agarwal, Gaurav Srikar, Nathaniel Rose, Parikshit Maini

    Abstract: Strawberry farming demands intensive labor for monitoring and maintaining plant health. To address this, Team SARAL develops an autonomous robot for the 2024 ASABE Student Robotics Challenge, capable of navigation, unhealthy leaf detection, and removal. The system addresses labor shortages, reduces costs, and supports sustainable farming through vision-based plant assessment. This work demonstrate…

    Submitted 7 June, 2025; originally announced June 2025.

    Comments: Awarded Best Written Report @ Robotics Design Challenge (Advanced), ASABE 2024

  18. arXiv:2506.02887  [pdf, ps, other]

    cs.LG cs.DC

    Overcoming Challenges of Partial Client Participation in Federated Learning: A Comprehensive Review

    Authors: Mrinmay Sen, Shruti Aparna, Rohit Agarwal, Chalavadi Krishna Mohan

    Abstract: Federated Learning (FL) is a learning mechanism that falls under the distributed training umbrella, which collaboratively trains a shared global model without disclosing the raw data from different clients. This paper presents an extensive survey on the impact of partial client participation in federated learning. While much of the existing research focuses on addressing issues such as generalizat…

    Submitted 6 June, 2025; v1 submitted 3 June, 2025; originally announced June 2025.

    Comments: 15 pages, 6 tables, comprehensive survey of federated learning with partial client participation

  19. arXiv:2505.23231  [pdf, ps, other]

    cs.CY

    REDDIX-NET: A Novel Dataset and Benchmark for Moderating Online Explicit Services

    Authors: MSVPJ Sathvik, Manan Roy Choudhury, Rishita Agarwal, Sathwik Narkedimilli, Vivek Gupta

    Abstract: The rise of online platforms has enabled covert illicit activities, including online prostitution, to pose challenges for detection and regulation. In this study, we introduce REDDIX-NET, a novel benchmark dataset specifically designed for moderating online sexual services and going beyond traditional NSFW filters. The dataset is derived from thousands of web-scraped NSFW posts on Reddit and categ…

    Submitted 29 May, 2025; originally announced May 2025.

    Comments: 29 pages, 15 figures

  20. arXiv:2505.09024  [pdf]

    cs.AI cs.CL cs.LG

    Automated Meta Prompt Engineering for Alignment with the Theory of Mind

    Authors: Aaron Baughman, Rahul Agarwal, Eduardo Morales, Gozde Akay

    Abstract: We introduce a method of meta-prompting that jointly produces fluent text for complex tasks while optimizing the similarity of neural states between a human's mental expectation and a Large Language Model's (LLM) neural processing. A technique of agentic reinforcement learning is applied, in which an LLM as a Judge (LLMaaJ) teaches another LLM, through in-context learning, how to produce content b…

    Submitted 13 May, 2025; originally announced May 2025.

    Comments: 9 pages, 6 figures, 3 tables

  21. arXiv:2505.04842  [pdf, other]

    cs.LG cs.AI

    Putting the Value Back in RL: Better Test-Time Scaling by Unifying LLM Reasoners With Verifiers

    Authors: Kusha Sareen, Morgane M Moss, Alessandro Sordoni, Rishabh Agarwal, Arian Hosseini

    Abstract: Prevalent reinforcement learning (RL) methods for fine-tuning LLM reasoners, such as GRPO or Leave-one-out PPO, abandon the learned value function in favor of empirically estimated returns. This hinders test-time compute scaling that relies on using the value-function for verification. In this work, we propose RL$^V$ that augments any "value-free" RL method by jointly training the LLM as both a…

    Submitted 7 May, 2025; originally announced May 2025.

  22. arXiv:2505.00035  [pdf, other]

    cs.CL cs.AI

    Linguistic Complexity and Socio-cultural Patterns in Hip-Hop Lyrics

    Authors: Aayam Bansal, Raghav Agarwal, Kaashvi Jain

    Abstract: This paper presents a comprehensive computational framework for analyzing linguistic complexity and socio-cultural trends in hip-hop lyrics. Using a dataset of 3,814 songs from 146 influential artists spanning four decades (1980-2020), we employ natural language processing techniques to quantify multiple dimensions of lyrical complexity. Our analysis reveals a 23.7% increase in vocabulary diversit…

    Submitted 29 April, 2025; originally announced May 2025.

    Comments: 12 pages

  23. arXiv:2504.21417  [pdf, other]

    physics.acc-ph hep-ex hep-ph physics.ins-det

    The Muon Collider

    Authors: Carlotta Accettura, Simon Adrian, Rohit Agarwal, Claudia Ahdida, Chiara Aimé, Avni Aksoy, Gian Luigi Alberghi, Siobhan Alden, Luca Alfonso, Muhammad Ali, Anna Rita Altamura, Nicola Amapane, Kathleen Amm, David Amorim, Paolo Andreetto, Fabio Anulli, Ludovica Aperio Bella, Rob Appleby, Artur Apresyan, Pouya Asadi, Mohammed Attia Mahmoud, Bernhard Auchmann, John Back, Anthony Badea, Kyu Jung Bae, et al. (433 additional authors not shown)

    Abstract: Muons offer a unique opportunity to build a compact high-energy electroweak collider at the 10 TeV scale. A Muon Collider enables direct access to the underlying simplicity of the Standard Model and unparalleled reach beyond it. It will be a paradigm-shifting tool for particle physics representing the first collider to combine the high-energy reach of a proton collider and the high precision of an…

    Submitted 30 April, 2025; originally announced April 2025.

    Comments: 406 pages, supplementary report to the European Strategy for Particle Physics - 2026 update

  24. arXiv:2504.16828  [pdf, ps, other]

    cs.LG cs.AI cs.CL

    Process Reward Models That Think

    Authors: Muhammad Khalifa, Rishabh Agarwal, Lajanugen Logeswaran, Jaekyeom Kim, Hao Peng, Moontae Lee, Honglak Lee, Lu Wang

    Abstract: Step-by-step verifiers -- also known as process reward models (PRMs) -- are a key ingredient for test-time scaling. PRMs require step-level supervision, making them expensive to train. This work aims to build data-efficient PRMs as verbalized step-wise reward models that verify every step in the solution by generating a verification chain-of-thought (CoT). We propose ThinkPRM, a long CoT verifier…

    Submitted 25 September, 2025; v1 submitted 23 April, 2025; originally announced April 2025.

    Comments: New results on Qwen3, compute-matched analysis and more

  25. arXiv:2504.02912  [pdf, other]

    cs.CV cs.AI cs.ET cs.LG

    Haphazard Inputs as Images in Online Learning

    Authors: Rohit Agarwal, Aryan Dessai, Arif Ahmed Sekh, Krishna Agarwal, Alexander Horsch, Dilip K. Prasad

    Abstract: The field of varying feature space in online learning settings, also known as haphazard inputs, is very prominent nowadays due to its applicability in various fields. However, the current solutions to haphazard inputs are model-dependent and cannot benefit from the existing advanced deep-learning methods, which necessitate inputs of fixed dimensions. Therefore, we propose to transform the varying…

    Submitted 3 April, 2025; originally announced April 2025.

    Comments: Accepted at IJCNN 2025

  26. Finding Interest Needle in Popularity Haystack: Improving Retrieval by Modeling Item Exposure

    Authors: Rahul Agarwal, Amit Jaspal, Saurabh Gupta, Omkar Vichare

    Abstract: Recommender systems operate in closed feedback loops, where user interactions reinforce popularity bias, leading to over-recommendation of already popular items while under-exposing niche or novel content. Existing bias mitigation methods, such as Inverse Propensity Scoring (IPS) and Off-Policy Correction (OPC), primarily operate at the ranking stage or during training, lacking explicit real-time…

    Submitted 8 June, 2025; v1 submitted 30 March, 2025; originally announced March 2025.

    Comments: 2 pages. UMAP '25: 33rd ACM Conference on User Modeling, Adaptation and Personalization, New York City, USA, June 2025

  27. arXiv:2503.20545  [pdf, other]

    cond-mat.mes-hall cond-mat.mtrl-sci physics.app-ph

    Symmetry Enhanced Unconventional Spin Current Anisotropy in a Collinear Antiferromagnet

    Authors: Pankhuri Gupta, Kacho Imtiyaz Ali Khan, Akash Kumar, Rekha Agarwal, Nidhi Kandwal, Ram Singh Yadav, Johan Åkerman, Pranaba Kishor Muduli

    Abstract: Spin-orbit torque (SOT) presents a promising avenue for energy-efficient spintronics devices, surpassing the limitations of spin transfer torque. While extensively studied in heavy metals, SOT in antiferromagnetic quantum materials remains largely unexplored. Here, we investigate SOT in epitaxial FeSn, a collinear antiferromagnet with a kagome lattice. FeSn exhibits intriguing topological quantum…

    Submitted 31 March, 2025; v1 submitted 26 March, 2025; originally announced March 2025.

  28. arXiv:2503.19786  [pdf, other]

    cs.CL cs.AI

    Gemma 3 Technical Report

    Authors: Gemma Team, Aishwarya Kamath, Johan Ferret, Shreya Pathak, Nino Vieillard, Ramona Merhej, Sarah Perrin, Tatiana Matejovicova, Alexandre Ramé, Morgane Rivière, Louis Rouillard, Thomas Mesnard, Geoffrey Cideron, Jean-bastien Grill, Sabela Ramos, Edouard Yvinec, Michelle Casbon, Etienne Pot, Ivo Penchev, Gaël Liu, Francesco Visin, Kathleen Kenealy, Lucas Beyer, Xiaohai Zhai, Anton Tsitsulin , et al. (191 additional authors not shown)

    Abstract: We introduce Gemma 3, a multimodal addition to the Gemma family of lightweight open models, ranging in scale from 1 to 27 billion parameters. This version introduces vision understanding abilities, a wider coverage of languages and longer context - at least 128K tokens. We also change the architecture of the model to reduce the KV-cache memory that tends to explode with long context. This is achie…

    Submitted 25 March, 2025; originally announced March 2025.

  29. arXiv:2502.05740  [pdf, other]

    cs.HC cs.AI

    RECOVER: Designing a Large Language Model-based Remote Patient Monitoring System for Postoperative Gastrointestinal Cancer Care

    Authors: Ziqi Yang, Yuxuan Lu, Jennifer Bagdasarian, Vedant Das Swain, Ritu Agarwal, Collin Campbell, Waddah Al-Refaire, Jehan El-Bayoumi, Guodong Gao, Dakuo Wang, Bingsheng Yao, Nawar Shara

    Abstract: Cancer surgery is a key treatment for gastrointestinal (GI) cancers, a group of cancers that account for more than 35% of cancer-related deaths worldwide, but postoperative complications are unpredictable and can be life-threatening. In this paper, we investigate how recent advancements in large language models (LLMs) can benefit remote patient monitoring (RPM) systems through clinical integration…

    Submitted 8 February, 2025; originally announced February 2025.

  30. arXiv:2501.18837  [pdf, other]

    cs.CL cs.AI cs.CR cs.LG

    Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming

    Authors: Mrinank Sharma, Meg Tong, Jesse Mu, Jerry Wei, Jorrit Kruthoff, Scott Goodfriend, Euan Ong, Alwin Peng, Raj Agarwal, Cem Anil, Amanda Askell, Nathan Bailey, Joe Benton, Emma Bluemke, Samuel R. Bowman, Eric Christiansen, Hoagy Cunningham, Andy Dau, Anjali Gopal, Rob Gilson, Logan Graham, Logan Howard, Nimit Kalra, Taesung Lee, Kevin Lin , et al. (18 additional authors not shown)

    Abstract: Large language models (LLMs) are vulnerable to universal jailbreaks: prompting strategies that systematically bypass model safeguards and enable users to carry out harmful processes that require many model interactions, like manufacturing illegal substances at scale. To defend against these attacks, we introduce Constitutional Classifiers: safeguards trained on synthetic data, generated by promptin…

    Submitted 30 January, 2025; originally announced January 2025.

  31. arXiv:2412.19274  [pdf]

    physics.optics physics.app-ph

    Flat panel laser displays enabled by large-scale visible photonic integrated circuits

    Authors: Zhujun Shi, Risheng Cheng, Guohua Wei, Steven A. Hickman, Min Chul Shin, Peter Topalian, Lei Wang, Dusan Coso, Brian Le, Lizzy Lee, Sean Braxton, Alexander Koshelev, Maxwell F. Parsons, Rahul Agarwal, Barry Silverstein, Yun Wang, Giuseppe Calafiore

    Abstract: Laser-based displays are highly sought after for their superior brightness and color performance, especially in advanced applications like augmented reality (AR). However, their broader adoption has been hindered by bulky projector designs and complex optical module assemblies. Here, we introduce a new laser display architecture enabled by large-scale visible photonic integrated circuits (PICs) to…

    Submitted 26 December, 2024; originally announced December 2024.

  32. arXiv:2412.16656  [pdf, other]

    cs.CV cs.AI

    Generalizable Articulated Object Perception with Superpoints

    Authors: Qiaojun Yu, Ce Hao, Xibin Yuan, Li Zhang, Liu Liu, Yukang Huo, Rohit Agarwal, Cewu Lu

    Abstract: Manipulating articulated objects with robotic arms is challenging due to the complex kinematic structure, which requires precise part segmentation for efficient manipulation. In this work, we introduce a novel superpoint-based perception method designed to improve part segmentation in 3D point clouds of articulated objects. We propose a learnable, part-aware superpoint generation technique that ef…

    Submitted 21 December, 2024; originally announced December 2024.

  33. arXiv:2412.16335  [pdf, other]

    cs.LG cs.CY

    Improving Equity in Health Modeling with GPT4-Turbo Generated Synthetic Data: A Comparative Study

    Authors: Daniel Smolyak, Arshana Welivita, Margrét V. Bjarnadóttir, Ritu Agarwal

    Abstract: Objective. Demographic groups are often represented at different rates in medical datasets. These differences can create bias in machine learning algorithms, with higher levels of performance for better-represented groups. One promising solution to this problem is to generate synthetic data to mitigate potential adverse effects of non-representative data sets. Methods. We build on recent advance…

    Submitted 20 December, 2024; originally announced December 2024.

    Comments: 26 pages, 4 figures

  34. arXiv:2412.15287  [pdf, other]

    cs.CL cs.AI cs.LG

    Inference-Aware Fine-Tuning for Best-of-N Sampling in Large Language Models

    Authors: Yinlam Chow, Guy Tennenholtz, Izzeddin Gur, Vincent Zhuang, Bo Dai, Sridhar Thiagarajan, Craig Boutilier, Rishabh Agarwal, Aviral Kumar, Aleksandra Faust

    Abstract: Recent studies have indicated that effectively utilizing inference-time compute is crucial for attaining better performance from large language models (LLMs). In this work, we propose a novel inference-aware fine-tuning paradigm, in which the model is fine-tuned in a manner that directly optimizes the performance of the inference-time strategy. We study this paradigm using the simple yet effective…

    Submitted 18 December, 2024; originally announced December 2024.

  35. arXiv:2412.09727  [pdf]

    q-bio.QM cs.AI cs.LG

    A Large Sensor Foundation Model Pretrained on Continuous Glucose Monitor Data for Diabetes Management

    Authors: Junjie Luo, Abhimanyu Kumbara, Mansur Shomali, Rui Han, Anand Iyer, Ritu Agarwal, Gordon Gao

    Abstract: Continuous glucose monitoring (CGM) combined with AI offers new opportunities for proactive diabetes management through real-time glucose forecasting. However, most existing models are task-specific and lack generalization across patient populations. Inspired by the autoregressive paradigm of large language models, we introduce CGM-LSM, a Transformer decoder-based Large Sensor Model (LSM) pretrain…

    Submitted 1 August, 2025; v1 submitted 12 December, 2024; originally announced December 2024.

  36. arXiv:2412.04705  [pdf, ps, other]

    quant-ph

    QuTiP 5: The Quantum Toolbox in Python

    Authors: Neill Lambert, Eric Giguère, Paul Menczel, Boxi Li, Patrick Hopf, Gerardo Suárez, Marc Gali, Jake Lishman, Rushiraj Gadhvi, Rochisha Agarwal, Asier Galicia, Nathan Shammah, Paul Nation, J. R. Johansson, Shahnawaz Ahmed, Simon Cross, Alexander Pitchford, Franco Nori

    Abstract: QuTiP, the Quantum Toolbox in Python, has been at the forefront of open-source quantum software for the past 13 years. It is used as a research, teaching, and industrial tool, and has been downloaded millions of times by users around the world. Here we introduce the latest developments in QuTiP v5, which are set to have a large impact on the future of QuTiP and enable it to be a modern, continuous…

    Submitted 1 October, 2025; v1 submitted 5 December, 2024; originally announced December 2024.

    Comments: 82 pages, 25 figures. Updated with additional tables, information on environment class, and formatting fixes

  37. arXiv:2411.16096  [pdf, other]

    cs.CV cs.AI cs.MM

    ENCLIP: Ensembling and Clustering-Based Contrastive Language-Image Pretraining for Fashion Multimodal Search with Limited Data and Low-Quality Images

    Authors: Prithviraj Purushottam Naik, Rohit Agarwal

    Abstract: Multimodal search has revolutionized the fashion industry, providing a seamless and intuitive way for users to discover and explore fashion items. Based on their preferences, style, or specific attributes, users can search for products by combining text and image information. Text-to-image searches enable users to find visually similar items or describe products using natural language. This paper…

    Submitted 25 November, 2024; originally announced November 2024.

  38. arXiv:2411.09228  [pdf, ps, other]

    cs.CR

    Injection Attacks Against End-to-End Encrypted Applications

    Authors: Andrés Fábrega, Carolina Ortega Pérez, Armin Namavari, Ben Nassi, Rachit Agarwal, Thomas Ristenpart

    Abstract: We explore an emerging threat model for end-to-end (E2E) encrypted applications: an adversary sends chosen messages to a target client, thereby "injecting" adversarial content into the application state. Such state is subsequently encrypted and synchronized to an adversarially-visible storage. By observing the lengths of the resulting cloud-stored ciphertexts, the attacker backs out confidential i…

    Submitted 14 November, 2024; originally announced November 2024.

    Comments: Published in IEEE Security and Privacy 2024

  39. MuCol Milestone Report No. 5: Preliminary Parameters

    Authors: Carlotta Accettura, Simon Adrian, Rohit Agarwal, Claudia Ahdida, Chiara Aimé, Avni Aksoy, Gian Luigi Alberghi, Siobhan Alden, Luca Alfonso, Nicola Amapane, David Amorim, Paolo Andreetto, Fabio Anulli, Rob Appleby, Artur Apresyan, Pouya Asadi, Mohammed Attia Mahmoud, Bernhard Auchmann, John Back, Anthony Badea, Kyu Jung Bae, E. J. Bahng, Lorenzo Balconi, Fabrice Balli, Laura Bandiera , et al. (369 additional authors not shown)

    Abstract: This document comprises a collection of updated preliminary parameters for the key parts of the muon collider. The updated preliminary parameters follow on from the October 2023 Tentative Parameters Report. Particular attention has been given to regions of the facility that are believed to hold greater technical uncertainty in their design and that have a strong impact on the cost and power…

    Submitted 5 November, 2024; originally announced November 2024.

  40. arXiv:2411.00062  [pdf, other]

    cs.CL cs.AI physics.data-an stat.ML

    Scalable Reinforcement Post-Training Beyond Static Human Prompts: Evolving Alignment via Asymmetric Self-Play

    Authors: Ziyu Ye, Rishabh Agarwal, Tianqi Liu, Rishabh Joshi, Sarmishta Velury, Quoc V. Le, Qijun Tan, Yuan Liu

    Abstract: Current reinforcement learning (RL) frameworks for large language models (LLM) post-training typically assume a fixed prompt distribution, which is sub-optimal and bottlenecks scalability. Prior works have explored prompt evolving, but are often limited to the supervised fine-tuning stage, and prompts are sampled and evolved uniformly without signals. This empirical work presents a paradigm shift:…

    Submitted 9 April, 2025; v1 submitted 31 October, 2024; originally announced November 2024.

    Comments: Spotlight at the NeurIPS Language Gamification Workshop. Updated the problem description and added new online RL experiments in this version

  41. arXiv:2410.18252  [pdf, other]

    cs.LG cs.AI cs.CL

    Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models

    Authors: Michael Noukhovitch, Shengyi Huang, Sophie Xhonneux, Arian Hosseini, Rishabh Agarwal, Aaron Courville

    Abstract: The dominant paradigm for RLHF is online and on-policy RL: synchronously generating from the large language model (LLM) policy, labelling with a reward model, and learning using feedback on the LLM's own outputs. While performant, this paradigm is computationally inefficient. Inspired by classical deep RL literature, we propose separating generation and learning in RLHF. This enables asynchronous…

    Submitted 26 April, 2025; v1 submitted 23 October, 2024; originally announced October 2024.

    Comments: accepted at ICLR 2025, code at https://github.com/mnoukhov/async_rlhf, integrated into the open-instruct library https://github.com/allenai/open-instruct

  42. arXiv:2410.17394  [pdf, other]

    cs.LG cs.AI

    packetLSTM: Dynamic LSTM Framework for Streaming Data with Varying Feature Space

    Authors: Rohit Agarwal, Karaka Prasanth Naidu, Alexander Horsch, Krishna Agarwal, Dilip K. Prasad

    Abstract: We study the online learning problem characterized by the varying input feature space of streaming data. Although LSTMs have been employed to effectively capture the temporal nature of streaming data, they cannot handle the dimension-varying streams in an online learning setting. Therefore, we propose a dynamic LSTM-based novel method, called packetLSTM, to model the dimension-varying streams. The…

    Submitted 22 October, 2024; originally announced October 2024.

  43. arXiv:2410.11325  [pdf, other]

    cs.CL cs.AI

    Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling

    Authors: Wenda Xu, Rujun Han, Zifeng Wang, Long T. Le, Dhruv Madeka, Lei Li, William Yang Wang, Rishabh Agarwal, Chen-Yu Lee, Tomas Pfister

    Abstract: Recent advances in knowledge distillation (KD) have enabled smaller student models to approach the performance of larger teacher models. However, popular methods such as supervised KD and on-policy KD, are adversely impacted by the knowledge gaps between teacher-student in practical scenarios. Supervised KD suffers from a distribution mismatch between training with a static dataset and inference o…

    Submitted 27 April, 2025; v1 submitted 15 October, 2024; originally announced October 2024.

    Comments: ICLR2025

  44. arXiv:2410.08146  [pdf, other]

    cs.LG cs.CL

    Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning

    Authors: Amrith Setlur, Chirag Nagpal, Adam Fisch, Xinyang Geng, Jacob Eisenstein, Rishabh Agarwal, Alekh Agarwal, Jonathan Berant, Aviral Kumar

    Abstract: A promising approach for improving reasoning in large language models is to use process reward models (PRMs). PRMs provide feedback at each step of a multi-step reasoning trace, potentially improving credit assignment over outcome reward models (ORMs) that only provide feedback at the final step. However, collecting dense, per-step human labels is not scalable, and training PRMs from automatically…

    Submitted 10 October, 2024; originally announced October 2024.

  45. arXiv:2410.01748  [pdf, other]

    cs.LG

    Not All LLM Reasoners Are Created Equal

    Authors: Arian Hosseini, Alessandro Sordoni, Daniel Toyama, Aaron Courville, Rishabh Agarwal

    Abstract: We study the depth of grade-school math (GSM) problem-solving capabilities of LLMs. To this end, we evaluate their performance on pairs of existing math word problems together so that the answer to the second problem depends on correctly answering the first problem. Our findings reveal a significant reasoning gap in most LLMs, that is performance difference between solving the compositional pairs…

    Submitted 2 October, 2024; originally announced October 2024.

  46. arXiv:2409.16291  [pdf, other]

    cs.HC cs.AI

    Beyond Following: Mixing Active Initiative into Computational Creativity

    Authors: Zhiyu Lin, Upol Ehsan, Rohan Agarwal, Samihan Dani, Vidushi Vashishth, Mark Riedl

    Abstract: Generative Artificial Intelligence (AI) encounters limitations in efficiency and fairness within the realm of Procedural Content Generation (PCG) when human creators solely drive and bear responsibility for the generative process. Alternative setups, such as Mixed-Initiative Co-Creative (MI-CC) systems, exhibited their promise. Still, the potential of an active mixed initiative, where AI takes a r…

    Submitted 6 September, 2024; originally announced September 2024.

    Comments: 11 pages, 4 figures

  47. arXiv:2409.12917  [pdf, other]

    cs.LG

    Training Language Models to Self-Correct via Reinforcement Learning

    Authors: Aviral Kumar, Vincent Zhuang, Rishabh Agarwal, Yi Su, John D Co-Reyes, Avi Singh, Kate Baumli, Shariq Iqbal, Colton Bishop, Rebecca Roelofs, Lei M Zhang, Kay McKinney, Disha Shrivastava, Cosmin Paduraru, George Tucker, Doina Precup, Feryal Behbahani, Aleksandra Faust

    Abstract: Self-correction is a highly desirable capability of large language models (LLMs), yet it has consistently been found to be largely ineffective in modern LLMs. Current methods for training self-correction typically depend on either multiple models, a more advanced model, or additional forms of supervision. To address these shortcomings, we develop a multi-turn online reinforcement learning (RL) app…

    Submitted 4 October, 2024; v1 submitted 19 September, 2024; originally announced September 2024.

  48. arXiv:2409.10242  [pdf, other]

    cs.LG cs.AI

    Hedging Is Not All You Need: A Simple Baseline for Online Learning Under Haphazard Inputs

    Authors: Himanshu Buckchash, Momojit Biswas, Rohit Agarwal, Dilip K. Prasad

    Abstract: Handling haphazard streaming data, such as data from edge devices, presents a challenging problem. Over time, the incoming data becomes inconsistent, with missing, faulty, or new inputs reappearing. Therefore, it requires models that are reliable. Recent methods to solve this problem depend on a hedging-based solution and require specialized elements like auxiliary dropouts, forked architectures,…

    Submitted 30 December, 2024; v1 submitted 16 September, 2024; originally announced September 2024.

  49. arXiv:2408.16737  [pdf, other]

    cs.CL cs.AI

    Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling

    Authors: Hritik Bansal, Arian Hosseini, Rishabh Agarwal, Vinh Q. Tran, Mehran Kazemi

    Abstract: Training on high-quality synthetic data from strong language models (LMs) is a common strategy to improve the reasoning performance of LMs. In this work, we revisit whether this strategy is compute-optimal under a fixed inference budget (e.g., FLOPs). To do so, we investigate the trade-offs between generating synthetic data using a stronger but more expensive (SE) model versus a weaker but cheaper…

    Submitted 7 October, 2024; v1 submitted 29 August, 2024; originally announced August 2024.

  50. arXiv:2408.15575  [pdf, other]

    cs.IR

    Lyrically Speaking: Exploring the Link Between Lyrical Emotions, Themes and Depression Risk

    Authors: Pavani Chowdary, Bhavyajeet Singh, Rajat Agarwal, Vinoo Alluri

    Abstract: Lyrics play a crucial role in affecting and reinforcing emotional states by providing meaning and emotional connotations that interact with the acoustic properties of the music. Specific lyrical themes and emotions may intensify existing negative states in listeners and may lead to undesirable outcomes, especially in listeners with mood disorders such as depression. Hence, it is important for such…

    Submitted 28 August, 2024; originally announced August 2024.

    Comments: Accepted at the 25th International Society for Music Information Retrieval Conference (ISMIR) 2024, San Francisco, United States