Showing 1–50 of 477 results for author: Shruti

  1. arXiv:2506.13724  [pdf, ps, other]

    quant-ph physics.atom-ph

    Leveraging erasure errors in logical qubits with metastable $^{171}$Yb atoms

    Authors: Bichen Zhang, Genyue Liu, Guillaume Bornet, Sebastian P. Horvath, Pai Peng, Shuo Ma, Shilin Huang, Shruti Puri, Jeff D. Thompson

    Abstract: Implementing large-scale quantum algorithms with practical advantage will require fault-tolerance achieved through quantum error correction, but the associated overhead is a significant cost. The overhead can be reduced by engineering physical qubits with fewer errors, and by shaping the residual errors to be more easily correctable. In this work, we demonstrate quantum error correcting codes and…

    Submitted 16 June, 2025; originally announced June 2025.

  2. arXiv:2506.06324  [pdf]

    cs.AI

    Mapping Human-Agent Co-Learning and Co-Adaptation: A Scoping Review

    Authors: Shruti Kumar, Xiaoyu Chen, Xiaomei Wang

    Abstract: Several papers have delved into the challenges of human-AI-robot co-learning and co-adaptation. It has been noted that the terminology used to describe this collaborative relationship in existing studies needs to be more consistent. For example, the prefix "co" is used interchangeably to represent both "collaborative" and "mutual," and the terms "co-learning" and "co-adaptation" are sometimes used…

    Submitted 29 May, 2025; originally announced June 2025.

    Comments: Abstract accepted to HFES 2024 Annual Meeting

  3. arXiv:2506.04084  [pdf, ps, other]

    quant-ph

    A Unitary Encoder for Surface Codes

    Authors: Pei-Kai Tsai, Shruti Puri

    Abstract: The surface code is a promising candidate for fault-tolerant quantum computation and has been implemented in many quantum hardware platforms. In this work, we propose a new non-local unitary circuit to encode a surface code state based on a code conversion between rotated and regular surface codes, which halves the gate count of the fastest encoder known previously. While the unitary encoders can…

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: 10 pages, 11 figures

  4. arXiv:2506.02887  [pdf, ps, other]

    cs.LG cs.DC

    Overcoming Challenges of Partial Client Participation in Federated Learning : A Comprehensive Review

    Authors: Mrinmay Sen, Shruti Aparna, Rohit Agarwal, Chalavadi Krishna Mohan

    Abstract: Federated Learning (FL) is a learning mechanism that falls under the distributed training umbrella, which collaboratively trains a shared global model without disclosing the raw data from different clients. This paper presents an extensive survey on the impact of partial client participation in federated learning. While much of the existing research focuses on addressing issues such as generalizat…

    Submitted 6 June, 2025; v1 submitted 3 June, 2025; originally announced June 2025.

    Comments: 15 pages, 6 tables, comprehensive survey of federated learning with partial client participation

  5. arXiv:2505.23643  [pdf, ps, other]

    cs.CR cs.AI

    Securing AI Agents with Information-Flow Control

    Authors: Manuel Costa, Boris Köpf, Aashish Kolluri, Andrew Paverd, Mark Russinovich, Ahmed Salem, Shruti Tople, Lukas Wutschitz, Santiago Zanella-Béguelin

    Abstract: As AI agents become increasingly autonomous and capable, ensuring their security against vulnerabilities such as prompt injection becomes critical. This paper explores the use of information-flow control (IFC) to provide security guarantees for AI agents. We present a formal model to reason about the security and expressiveness of agent planners. Using this model, we characterize the class of prop…

    Submitted 29 May, 2025; originally announced May 2025.

  6. arXiv:2505.23030  [pdf]

    cs.CL

    Can Modern NLP Systems Reliably Annotate Chest Radiography Exams? A Pre-Purchase Evaluation and Comparative Study of Solutions from AWS, Google, Azure, John Snow Labs, and Open-Source Models on an Independent Pediatric Dataset

    Authors: Shruti Hegde, Mabon Manoj Ninan, Jonathan R. Dillman, Shireen Hayatghaibi, Lynn Babcock, Elanchezhian Somasundaram

    Abstract: General-purpose clinical natural language processing (NLP) tools are increasingly used for the automatic labeling of clinical reports. However, independent evaluations for specific tasks, such as pediatric chest radiograph (CXR) report labeling, are limited. This study compares four commercial clinical NLP systems - Amazon Comprehend Medical (AWS), Google Healthcare NLP (GC), Azure Clinical NLP (A…

    Submitted 28 May, 2025; originally announced May 2025.

  7. arXiv:2505.18058  [pdf]

    eess.IV cs.CV

    A Foundation Model Framework for Multi-View MRI Classification of Extramural Vascular Invasion and Mesorectal Fascia Invasion in Rectal Cancer

    Authors: Yumeng Zhang, Zohaib Salahuddin, Danial Khan, Shruti Atul Mali, Henry C. Woodruff, Sina Amirrajab, Eduardo Ibor-Crespo, Ana Jimenez-Pastor, Luis Marti-Bonmati, Philippe Lambin

    Abstract: Background: Accurate MRI-based identification of extramural vascular invasion (EVI) and mesorectal fascia invasion (MFI) is pivotal for risk-stratified management of rectal cancer, yet visual assessment is subjective and vulnerable to inter-institutional variability. Purpose: To develop and externally evaluate a multicenter, foundation-model-driven framework that automatically classifies EVI and M…

    Submitted 23 May, 2025; originally announced May 2025.

    Comments: 22 pages, 8 figures

  8. arXiv:2505.17971  [pdf]

    eess.IV cs.CV

    Explainable Anatomy-Guided AI for Prostate MRI: Foundation Models and In Silico Clinical Trials for Virtual Biopsy-based Risk Assessment

    Authors: Danial Khan, Zohaib Salahuddin, Yumeng Zhang, Sheng Kuang, Shruti Atul Mali, Henry C. Woodruff, Sina Amirrajab, Rachel Cavill, Eduardo Ibor-Crespo, Ana Jimenez-Pastor, Adrian Galiana-Bordera, Paula Jimenez Gomez, Luis Marti-Bonmati, Philippe Lambin

    Abstract: We present a fully automated, anatomically guided deep learning pipeline for prostate cancer (PCa) risk stratification using routine MRI. The pipeline integrates three key components: an nnU-Net module for segmenting the prostate gland and its zones on axial T2-weighted MRI; a classification module based on the UMedPT Swin Transformer foundation model, fine-tuned on 3D patches with optional anatom…

    Submitted 23 May, 2025; originally announced May 2025.

  9. arXiv:2505.17893  [pdf]

    cs.CV

    Pixels to Prognosis: Harmonized Multi-Region CT-Radiomics and Foundation-Model Signatures Across Multicentre NSCLC Data

    Authors: Shruti Atul Mali, Zohaib Salahuddin, Danial Khan, Yumeng Zhang, Henry C. Woodruff, Eduardo Ibor-Crespo, Ana Jimenez-Pastor, Luis Marti-Bonmati, Philippe Lambin

    Abstract: Purpose: To evaluate the impact of harmonization and multi-region CT image feature integration on survival prediction in non-small cell lung cancer (NSCLC) patients, using handcrafted radiomics, pretrained foundation model (FM) features, and clinical data from a multicenter dataset. Methods: We analyzed CT scans and clinical data from 876 NSCLC patients (604 training, 272 test) across five cente…

    Submitted 23 May, 2025; originally announced May 2025.

  10. arXiv:2505.17238  [pdf, other]

    cs.CL

    Personalizing Student-Agent Interactions Using Log-Contextualized Retrieval Augmented Generation (RAG)

    Authors: Clayton Cohn, Surya Rayala, Caitlin Snyder, Joyce Fonteles, Shruti Jain, Naveeduddin Mohammed, Umesh Timalsina, Sarah K. Burriss, Ashwin T S, Namrata Srivastava, Menton Deweese, Angela Eeds, Gautam Biswas

    Abstract: Collaborative dialogue offers rich insights into students' learning and critical thinking. This is essential for adapting pedagogical agents to students' learning and problem-solving skills in STEM+C settings. While large language models (LLMs) facilitate dynamic pedagogical interactions, potential hallucinations can undermine confidence, trust, and instructional value. Retrieval-augmented generat…

    Submitted 22 May, 2025; originally announced May 2025.

    Comments: Submitted to the International Conference on Artificial Intelligence in Education (AIED) Workshop on Epistemics and Decision-Making in AI-Supported Education

  11. arXiv:2505.09094  [pdf, ps, other]

    cs.HC

    PLanet: Formalizing Experimental Design

    Authors: London Bielicke, Anna Zhang, Shruti Tyagi, Emery Berger, Adam Chlipala, Eunice Jun

    Abstract: Carefully constructed experimental designs are essential for drawing valid, generalizable conclusions from scientific studies. Unfortunately, experimental design plans can be difficult to specify, communicate clearly, and relate to alternatives. In response, we introduce a grammar of experimental design that provides composable operators for constructing assignment procedures (e.g., Latin square).…

    Submitted 13 May, 2025; originally announced May 2025.

    Comments: 14 pages, 4 tables, 6 figures, human-computer interaction, domain specific language, experimental design

  12. arXiv:2505.02520  [pdf, other]

    hep-th

    Smooth Splitting and Zeros from On-Shell Recursion

    Authors: Callum R. T. Jones, Shruti Paranjape

    Abstract: We describe a new approach to understanding the origins of recently discovered "hidden zeros" and "smooth splitting" of tree-level amplitudes in $\text{Tr}φ^3$, Non-Linear Sigma Model (NLSM), Yang-Mills-Scalar (YMS) and the special Galileon. Introducing a new type of linear shift in kinematic space we demonstrate that the mysterious splitting formulae follow from a simple contour integration argume…

    Submitted 5 May, 2025; originally announced May 2025.

    Comments: 32 pages, 9 figures

  13. arXiv:2505.00768  [pdf, other]

    quant-ph

    Optomechanical resource for fault-tolerant quantum computing

    Authors: Margaret Pavlovich, Peter Rakich, Shruti Puri

    Abstract: Fusion-based quantum computing with dual-rail qubits is a leading candidate for scalable quantum computing using linear optics. This paradigm requires single photons which are entangled into small resource states before being fed into a fusion network. The most common sources for single optical photons and for small entangled states are probabilistic and heralded. The realization of a single relia…

    Submitted 1 May, 2025; originally announced May 2025.

    Comments: 19 pages, 9 figures. Supplement 29 pages, 7 figures

  14. arXiv:2504.11863  [pdf, other]

    cond-mat.str-el cond-mat.mtrl-sci

    Understanding the evolution of the magnetic ground state in Ba$_4$NaRu$_3$O$_{12}$

    Authors: Shruti Chakravarty, Pascal Manuel, Antonio Cervellino, Sunil Nair

    Abstract: We report a comprehensive investigation of the quadruple perovskite Ba$_4$NaRu$_3$O$_{12}$, in which we discover a robust spin-lattice coupled ground state characterized by a long-range antiferromagnetic ordering at $T_N \sim$ 257 K. The system's unique structural motif of three symmetrically distinct magnetic ions, including Ru dimers separated by non-magnetic layers, is intimately correlated wit…

    Submitted 16 April, 2025; originally announced April 2025.

  15. arXiv:2504.11253  [pdf, other]

    hep-th

    Large deformations of Tr($Φ^3$) and the world at infinity

    Authors: Shruti Paranjape, Marcos Skowronek, Marcus Spradlin, Anastasia Volovich

    Abstract: The amplitudes of the non-linear sigma model can be obtained from those of Tr($Φ^3$) theory by sending the kinematic (Mandelstam) variables to infinity in a certain direction. In this paper we characterize the behavior of Tr($Φ^3$) amplitudes under a general class of large kinematic shifts called $g$-vector shifts. The objects that live in this world at infinity retain certain key amplitude-like p…

    Submitted 29 April, 2025; v1 submitted 15 April, 2025; originally announced April 2025.

    Comments: 25 pages, 14 figures, v2 minor corrections

  16. arXiv:2504.02781  [pdf, other]

    cs.LG cs.AI cs.NE eess.SP

    Towards Green AI-Native Networks: Evaluation of Neural Circuit Policy for Estimating Energy Consumption of Base Stations

    Authors: Selim Ickin, Shruti Bothe, Aman Raparia, Nitin Khanna, Erik Sanders

    Abstract: Optimization of radio hardware and AI-based network management software yield significant energy savings in radio access networks. The execution of underlying Machine Learning (ML) models, which enable energy savings through recommended actions, may require additional compute and energy, highlighting the opportunity to explore and adopt accurate and energy-efficient ML technologies. This work eval…

    Submitted 3 April, 2025; originally announced April 2025.

    Comments: 15 pages, 9 figures

  17. arXiv:2504.01669  [pdf, other]

    astro-ph.CO gr-qc hep-ph

    The CosmoVerse White Paper: Addressing observational tensions in cosmology with systematics and fundamental physics

    Authors: Eleonora Di Valentino, Jackson Levi Said, Adam Riess, Agnieszka Pollo, Vivian Poulin, Adrià Gómez-Valent, Amanda Weltman, Antonella Palmese, Caroline D. Huang, Carsten van de Bruck, Chandra Shekhar Saraf, Cheng-Yu Kuo, Cora Uhlemann, Daniela Grandón, Dante Paz, Dominique Eckert, Elsa M. Teixeira, Emmanuel N. Saridakis, Eoin Ó Colgáin, Florian Beutler, Florian Niedermann, Francesco Bajardi, Gabriela Barenboim, Giulia Gubitosi, Ilaria Musella, et al. (513 additional authors not shown)

    Abstract: The standard model of cosmology has provided a good phenomenological description of a wide range of observations both at astrophysical and cosmological scales for several decades. This concordance model is constructed by a universal cosmological constant and supported by a matter sector described by the standard model of particle physics and a cold dark matter contribution, as well as very early-t…

    Submitted 15 May, 2025; v1 submitted 2 April, 2025; originally announced April 2025.

    Comments: 416 pages, 81 figures, accepted in PotDU

  18. arXiv:2504.01162  [pdf, ps, other]

    cs.IR

    Information Retrieval for Climate Impact

    Authors: Maarten de Rijke, Bart van den Hurk, Flora Salim, Alaa Al Khourdajie, Nan Bai, Renato Calzone, Declan Curran, Getnet Demil, Lesley Frew, Noah Gießing, Mukesh Kumar Gupta, Maria Heuss, Sanaa Hobeichi, David Huard, Jingwei Kang, Ana Lucic, Tanwi Mallick, Shruti Nath, Andrew Okem, Barbara Pernici, Thilina Rajapakse, Hira Saleem, Harry Scells, Nicole Schneider, Damiano Spina, et al. (6 additional authors not shown)

    Abstract: The purpose of the MANILA24 Workshop on information retrieval for climate impact was to bring together researchers from academia, industry, governments, and NGOs to identify and discuss core research problems in information retrieval to assess climate change impacts. The workshop aimed to foster collaboration by bringing communities together that have so far not been very well connected -- informa…

    Submitted 1 April, 2025; originally announced April 2025.

    Comments: Report on the MANILA24 Workshop

    ACM Class: H.3.3

  19. arXiv:2503.23579  [pdf, other]

    hep-th

    Hidden Zeros of the Cosmological Wavefunction

    Authors: Shounak De, Shruti Paranjape, Andrzej Pokraka, Marcus Spradlin, Anastasia Volovich

    Abstract: Motivated by the recent discovery of hidden zeros in particle and string amplitudes, we characterize zeros of individual graph contributions to the cosmological wavefunction of a scalar field theory. We demonstrate that these contributions split near these zeros for all tree graphs and provide evidence that this extends to loop graphs as well. We explicitly construct polytopal realizations of the…

    Submitted 30 March, 2025; originally announced March 2025.

    Comments: 26 pages, 14 figures

  20. arXiv:2503.19786  [pdf, other]

    cs.CL cs.AI

    Gemma 3 Technical Report

    Authors: Gemma Team, Aishwarya Kamath, Johan Ferret, Shreya Pathak, Nino Vieillard, Ramona Merhej, Sarah Perrin, Tatiana Matejovicova, Alexandre Ramé, Morgane Rivière, Louis Rouillard, Thomas Mesnard, Geoffrey Cideron, Jean-bastien Grill, Sabela Ramos, Edouard Yvinec, Michelle Casbon, Etienne Pot, Ivo Penchev, Gaël Liu, Francesco Visin, Kathleen Kenealy, Lucas Beyer, Xiaohai Zhai, Anton Tsitsulin, et al. (191 additional authors not shown)

    Abstract: We introduce Gemma 3, a multimodal addition to the Gemma family of lightweight open models, ranging in scale from 1 to 27 billion parameters. This version introduces vision understanding abilities, a wider coverage of languages and longer context - at least 128K tokens. We also change the architecture of the model to reduce the KV-cache memory that tends to explode with long context. This is achie…

    Submitted 25 March, 2025; originally announced March 2025.

  21. arXiv:2503.15148  [pdf, other]

    physics.ao-ph

    Climatic Phase Transitions Unravel the Onset and Withdrawal of Indian Monsoon

    Authors: Yogenraj Patil, Gaurav Chopra, Shruti Tandon, B. N. Goswami, R. I. Sujith

    Abstract: The livelihood and food security of more than a billion people depend on the Indian monsoon (IM). Yet, a universal definition of the large-scale season and progress of IM is missing. Even though IM is a planetary-scale convectively coupled system arising largely from seasonal migration of the Intertropical Convergence Zone (ITCZ), the definitions of its onset and progression are based on local wea…

    Submitted 20 March, 2025; v1 submitted 19 March, 2025; originally announced March 2025.

    Comments: 32, 14 Figures

  22. arXiv:2503.11945  [pdf, other]

    cs.CV cs.CR cs.LG

    Your Text Encoder Can Be An Object-Level Watermarking Controller

    Authors: Naresh Kumar Devulapally, Mingzhen Huang, Vishal Asnani, Shruti Agarwal, Siwei Lyu, Vishnu Suresh Lokhande

    Abstract: Invisible watermarking of AI-generated images can help with copyright protection, enabling detection and identification of AI-generated media. In this work, we present a novel approach to watermark images of T2I Latent Diffusion Models (LDMs). By only fine-tuning text token embeddings $W_*$, we enable watermarking in selected objects or parts of the image, offering greater flexibility compared to…

    Submitted 14 March, 2025; originally announced March 2025.

  23. arXiv:2502.21228  [pdf, other]

    cs.CL cs.AI

    ECLeKTic: a Novel Challenge Set for Evaluation of Cross-Lingual Knowledge Transfer

    Authors: Omer Goldman, Uri Shaham, Dan Malkin, Sivan Eiger, Avinatan Hassidim, Yossi Matias, Joshua Maynez, Adi Mayrav Gilady, Jason Riesa, Shruti Rijhwani, Laura Rimell, Idan Szpektor, Reut Tsarfaty, Matan Eyal

    Abstract: To achieve equitable performance across languages, multilingual large language models (LLMs) must be able to abstract knowledge beyond the language in which it was acquired. However, the current literature lacks reliable ways to measure LLMs' capability of cross-lingual knowledge transfer. To that end, we present ECLeKTic, a multilingual closed-book QA (CBQA) dataset that Evaluates Cross-Lingual K…

    Submitted 3 March, 2025; v1 submitted 28 February, 2025; originally announced February 2025.

  24. arXiv:2502.19322  [pdf, other]

    hep-th

    Bosonisation and BTZ Black Hole Microstates

    Authors: Suvankar Dutta, Shruti Menon, Aayush Srivastav

    Abstract: When the boundary dynamics of \(AdS_3\) gravity is governed by the collective field theory Hamiltonian proposed by Jevicki and Sakita, its asymptotic symmetry algebra becomes the centerless \(U(1)\) Kac-Moody algebra. We quantize this system using the quantum bosonization of relativistic free fermions and relate these to the dynamical fields of \(AdS_3\) gravity. This leads to a correspondence whe…

    Submitted 20 March, 2025; v1 submitted 26 February, 2025; originally announced February 2025.

    Comments: 22 pages, 1 figure

  25. arXiv:2502.14921  [pdf, ps, other]

    cs.CL cs.CR cs.LG

    The Canary's Echo: Auditing Privacy Risks of LLM-Generated Synthetic Text

    Authors: Matthieu Meeus, Lukas Wutschitz, Santiago Zanella-Béguelin, Shruti Tople, Reza Shokri

    Abstract: How much information about training samples can be leaked through synthetic data generated by Large Language Models (LLMs)? Overlooking the subtleties of information flow in synthetic data generation pipelines can lead to a false sense of privacy. In this paper, we assume an adversary has access to some synthetic data generated by an LLM. We design membership inference attacks (MIAs) that target th…

    Submitted 6 June, 2025; v1 submitted 19 February, 2025; originally announced February 2025.

    Comments: 42nd International Conference on Machine Learning (ICML 2025)

  26. arXiv:2502.12404  [pdf, other]

    cs.CL

    WMT24++: Expanding the Language Coverage of WMT24 to 55 Languages & Dialects

    Authors: Daniel Deutsch, Eleftheria Briakou, Isaac Caswell, Mara Finkelstein, Rebecca Galor, Juraj Juraska, Geza Kovacs, Alison Lui, Ricardo Rei, Jason Riesa, Shruti Rijhwani, Parker Riley, Elizabeth Salesky, Firas Trabelsi, Stephanie Winkler, Biao Zhang, Markus Freitag

    Abstract: As large language models (LLM) become more and more capable in languages other than English, it is important to collect benchmark datasets in order to evaluate their multilingual performance, including on tasks like machine translation (MT). In this work, we extend the WMT24 dataset to cover 55 languages by collecting new human-written references and post-edits for 46 new languages and dialects in…

    Submitted 17 February, 2025; originally announced February 2025.

  27. arXiv:2502.12179  [pdf, other]

    cs.LG cs.AI cs.CL

    Identifiable Steering via Sparse Autoencoding of Multi-Concept Shifts

    Authors: Shruti Joshi, Andrea Dittadi, Sébastien Lachapelle, Dhanya Sridhar

    Abstract: Steering methods manipulate the representations of large language models (LLMs) to induce responses that have desired properties, e.g., truthfulness, offering a promising approach for LLM alignment without the need for fine-tuning. Traditionally, steering has relied on supervision, such as from contrastive pairs of prompts that vary in a single target concept, which is costly to obtain and limits…

    Submitted 14 February, 2025; originally announced February 2025.

    Comments: 27 pages, 9 figures

  28. arXiv:2502.08107  [pdf]

    cs.GR

    Machine Learning-Driven Volumetric Cloud Rendering: Procedural Shader Optimization and Dynamic Lighting in Unreal Engine for Realistic Atmospheric Simulation

    Authors: Shruti Singh, Shantanu Kumar

    Abstract: This study advances real-time volumetric cloud rendering in Computer Graphics (CG) by developing a specialized shader in Unreal Engine (UE), focusing on realistic cloud modeling and lighting. By leveraging ray-casting-based lighting algorithms, this work demonstrates the practical application of a dual-layered procedural noise model, eliminating the need for conventional two-dimensional (2D) weath…

    Submitted 11 February, 2025; originally announced February 2025.

  29. arXiv:2502.07634  [pdf]

    cs.LG cs.MM

    Efficient Distributed Training through Gradient Compression with Sparsification and Quantization Techniques

    Authors: Shruti Singh, Shantanu Kumar

    Abstract: This study investigates the impact of gradient compression on distributed training performance, focusing on sparsification and quantization techniques, including top-k, DGC, and QSGD. In baseline experiments, random-k compression results in severe performance degradation, highlighting its inefficacy. In contrast, using top-k and DGC at 50 times compression yields performance improvements, reducing…

    Submitted 7 December, 2024; originally announced February 2025.

  30. arXiv:2502.06507  [pdf, other]

    physics.atom-ph

    A Continuous Pump-Probe Experiment to Observe Rydberg Wave Packet Dynamics

    Authors: Kevin L. Romans, Kyle Foster, Shruti Majumdar, Bishnu P. Acharya, Onyx Russ, A. H. N. C. De Silva, Daniel Fischer

    Abstract: Rydberg atoms remain in the limelight due to their applications in quantum optics and information technologies. In this work, the dynamics of Rydberg atoms stored in a momentum spectrometer by an all-optical trap is studied by ionizing them in the field of a continuous wave optical dipole trap. While the addition of the optical dipole trap allows to further cool the atoms, it comes at the expense…

    Submitted 10 February, 2025; originally announced February 2025.

  31. arXiv:2502.03950  [pdf, other]

    cs.CV

    LR0.FM: Low-Res Benchmark and Improving Robustness for Zero-Shot Classification in Foundation Models

    Authors: Priyank Pathak, Shyam Marjit, Shruti Vyas, Yogesh S Rawat

    Abstract: Visual-language foundation Models (FMs) exhibit remarkable zero-shot generalization across diverse tasks, largely attributed to extensive pre-training on large-scale datasets. However, their robustness on low-resolution/pixelated (LR) images, a common challenge in real-world scenarios, remains underexplored. We introduce LR0.FM, a comprehensive benchmark evaluating the impact of low resolution on t…

    Submitted 18 May, 2025; v1 submitted 6 February, 2025; originally announced February 2025.

    Comments: Accepted to ICLR 2025

  32. arXiv:2501.18490  [pdf, ps, other]

    cs.RO cs.AI

    Curriculum-based Sample Efficient Reinforcement Learning for Robust Stabilization of a Quadrotor

    Authors: Fausto Mauricio Lagos Suarez, Akshit Saradagi, Vidya Sumathy, Shruti Kotpaliwar, George Nikolakopoulos

    Abstract: This article introduces a curriculum learning approach to develop a reinforcement learning-based robust stabilizing controller for a Quadrotor that meets predefined performance criteria. The learning objective is to achieve desired positions from random initial conditions while adhering to both transient and steady-state performance specifications. This objective is challenging for conventional on…

    Submitted 17 April, 2025; v1 submitted 30 January, 2025; originally announced January 2025.

    Comments: 8 pages, 7 figures

  33. arXiv:2501.17356  [pdf, other]

    cs.CV cs.AI cs.CY

    On the Coexistence and Ensembling of Watermarks

    Authors: Aleksandar Petrov, Shruti Agarwal, Philip H. S. Torr, Adel Bibi, John Collomosse

    Abstract: Watermarking, the practice of embedding imperceptible information into media such as images, videos, audio, and text, is essential for intellectual property protection, content provenance and attribution. The growing complexity of digital ecosystems necessitates watermarks for different uses to be embedded in the same media. However, to detect and decode all watermarks, they need to coexist well w…

    Submitted 28 January, 2025; originally announced January 2025.

  34. arXiv:2501.07242  [pdf, other]

    quant-ph

    Characterization of Entanglement in Higher Dimensional Bipartite as well as Multipartite Quantum System and its Application

    Authors: Shruti Aggarwal

    Abstract: In recent years considerable progress has been made towards developing a general theory of quantum entanglement. In particular, criteria to decide whether a given quantum state is entangled are of high theoretical and practical interest. This problem is additionally complicated by the existence of bound entanglement, which are weak entangled states and hard to detect. In this thesis, we have worke…

    Submitted 13 January, 2025; originally announced January 2025.

    Comments: 192 pages

    Report number: PhD Thesis, 2024

  35. arXiv:2501.04018  [pdf, other]

    physics.ao-ph cs.LG stat.AP

    MERCURY: A fast and versatile multi-resolution based global emulator of compound climate hazards

    Authors: Shruti Nath, Julie Carreau, Kai Kornhuber, Peter Pfleiderer, Carl-Friedrich Schleussner, Philippe Naveau

    Abstract: High-impact climate damages are often driven by compounding climate conditions. For example, elevated heat stress conditions can arise from a combination of high humidity and temperature. To explore future changes in compounding hazards under a range of climate scenarios and with large ensembles, climate emulators can provide light-weight, data-driven complements to Earth System Models. Yet, only…

    Submitted 23 December, 2024; originally announced January 2025.

  36. TexAVi: Generating Stereoscopic VR Video Clips from Text Descriptions

    Authors: Vriksha Srihari, R. Bhavya, Shruti Jayaraman, V. Mary Anita Rajam

    Abstract: While generative models such as text-to-image, large language models and text-to-video have seen significant progress, the extension to text-to-virtual-reality remains largely unexplored, due to a deficit in training data and the complexity of achieving realistic depth and motion in virtual environments. This paper proposes an approach to coalesce existing generative systems to form a stereoscopic…

    Submitted 10 March, 2025; v1 submitted 2 January, 2025; originally announced January 2025.

    Comments: Co-authors do not consent to publishing on Arxiv

    ACM Class: I.2

    Journal ref: TexAVi: Generating Stereoscopic VR Video Clips from Text Descriptions, 2024 IEEE International Conference on Computer Vision and Machine Intelligence (CVMI), Prayagraj, India, 2024, pp. 1-6

  37. arXiv:2501.00209  [pdf, other]

    quant-ph

    Unraveling the switching dynamics in a quantum double-well potential

    Authors: Qile Su, Rodrigo G. Cortiñas, Jayameenakshi Venkatraman, Shruti Puri

    Abstract: The spontaneous switching of a quantum particle between the wells of a double-well potential is a phenomenon of general interest to physics and chemistry. It was broadly believed that the switching rate decreases steadily with the size of the energy barrier. This view was challenged by a recent experiment on a driven superconducting Kerr nonlinear oscillator (often called the Kerr-cat qubit or the…

    Submitted 30 December, 2024; originally announced January 2025.

  38. WavePulse: Real-time Content Analytics of Radio Livestreams

    Authors: Govind Mittal, Sarthak Gupta, Shruti Wagle, Chirag Chopra, Anthony J DeMattee, Nasir Memon, Mustaque Ahamad, Chinmay Hegde

    Abstract: Radio remains a pervasive medium for mass information dissemination, with AM/FM stations reaching more Americans than either smartphone-based social networking or live television. Increasingly, radio broadcasts are also streamed online and accessed over the Internet. We present WavePulse, a framework that records, documents, and analyzes radio content in real-time. While our framework is generally…

    Submitted 29 January, 2025; v1 submitted 23 December, 2024; originally announced December 2024.

    Comments: To appear at The Web Conference (WWW) 2025. 20 Pages, 24 figures. Access code and dataset at https://wave-pulse.io

  39. arXiv:2412.08713  [pdf, ps, other]

    hep-th

    Uniqueness of MHV Gravity Amplitudes

    Authors: Joris Koefler, Umut Oktem, Shruti Paranjape, Jaroslav Trnka, Bailee Zacovic

    Abstract: We investigate MHV tree-level gravity amplitudes as defined on the spinor-helicity variety. Unlike their gluon counterparts, the gravity amplitudes do not have logarithmic singularities and do not admit Amplituhedron-like construction. Importantly, they are not determined just by their singularities, but rather their numerators have interesting zeroes. We make a conjecture about the uniqueness of…

    Submitted 11 December, 2024; originally announced December 2024.

    Comments: 18 pages

    Journal ref: Special volume on Positive Geometry, Le Matematiche 80 (1) (2025), 347 - 364

  40. arXiv:2412.01456  [pdf, other]

    cs.CV eess.IV

    Phaseformer: Phase-based Attention Mechanism for Underwater Image Restoration and Beyond

    Authors: MD Raqib Khan, Anshul Negi, Ashutosh Kulkarni, Shruti S. Phutke, Santosh Kumar Vipparthi, Subrahmanyam Murala

    Abstract: Quality degradation is observed in underwater images due to the effects of light refraction and absorption by water, leading to issues like color cast, haziness, and limited visibility. This degradation negatively affects the performance of autonomous underwater vehicles used in marine applications. To address these challenges, we propose a lightweight phase-based transformer network with 1.77M pa…

    Submitted 2 December, 2024; originally announced December 2024.

    Comments: 8 pages, 8 figures, conference

  41. arXiv:2411.18913  [pdf, other]

    cs.RO

    Planning Shorter Paths in Graphs of Convex Sets by Undistorting Parametrized Configuration Spaces

    Authors: Shruti Garg, Thomas Cohn, Russ Tedrake

    Abstract: Optimization based motion planning provides a useful modeling framework through various costs and constraints. Using Graph of Convex Sets (GCS) for trajectory optimization gives guarantees of feasibility and optimality by representing configuration space as the finite union of convex sets. Nonlinear parametrizations can be used to extend this technique to handle cases such as kinematic loops, but…

    Submitted 13 April, 2025; v1 submitted 28 November, 2024; originally announced November 2024.

    Comments: 8 pages, 6 figures, accepted to Robotics and Automation Letters in April 2025

  42. arXiv:2411.18577  [pdf, other]

    cs.CL cs.LG

    On Importance of Code-Mixed Embeddings for Hate Speech Identification

    Authors: Shruti Jagdale, Omkar Khade, Gauri Takalikar, Mihir Inamdar, Raviraj Joshi

    Abstract: Code-mixing is the practice of using two or more languages in a single sentence, which often occurs in multilingual communities such as India where people commonly speak multiple languages. Classic NLP tools, trained on monolingual data, face challenges when dealing with code-mixed data. Extracting meaningful information from sentences containing multiple languages becomes difficult, particularly…

    Submitted 27 November, 2024; originally announced November 2024.

  43. arXiv:2411.18571  [pdf, other]

    cs.CL cs.LG

    Challenges in Adapting Multilingual LLMs to Low-Resource Languages using LoRA PEFT Tuning

    Authors: Omkar Khade, Shruti Jagdale, Abhishek Phaltankar, Gauri Takalikar, Raviraj Joshi

    Abstract: Large Language Models (LLMs) have demonstrated remarkable multilingual capabilities, yet challenges persist in adapting these models for low-resource languages. In this study, we investigate the effects of Low-Rank Adaptation (LoRA) Parameter-Efficient Fine-Tuning (PEFT) on multilingual Gemma models for Marathi, a language with limited resources. Using a translated Alpaca dataset with 52,000 instr…

    Submitted 27 November, 2024; originally announced November 2024.

  44. arXiv:2411.15612  [pdf, other]

    quant-ph

    Faulty towers: recovering a functioning quantum random access memory in the presence of defective routers

    Authors: D. K. Weiss, Shifan Xu, Shruti Puri, Yongshan Ding, S. M. Girvin

    Abstract: Proposals for quantum random access memory (QRAM) generally have a binary-tree structure, and thus require hardware that is exponential in the depth of the QRAM. For solid-state based devices, a fabrication yield that is less than $100\%$ implies that certain addresses at the bottom of the tree become inaccessible if a router in the unique path to that address is faulty. We discuss how to recover…

    Submitted 23 November, 2024; originally announced November 2024.

    Comments: 13 pages, 11 figures, associated code available https://github.com/dkweiss31/QRAMfaultyrouters

  45. arXiv:2411.13802  [pdf, other]

    cs.CL

    SemiKong: Curating, Training, and Evaluating A Semiconductor Industry-Specific Large Language Model

    Authors: Christopher Nguyen, William Nguyen, Atsushi Suzuki, Daisuke Oku, Hong An Phan, Sang Dinh, Zooey Nguyen, Anh Ha, Shruti Raghavan, Huy Vo, Thang Nguyen, Lan Nguyen, Yoshikuni Hirayama

    Abstract: Large Language Models (LLMs) have demonstrated the potential to address some issues within the semiconductor industry. However, they are often general-purpose models that lack the specialized knowledge needed to tackle the unique challenges of this sector, such as the intricate physics and chemistry of semiconductor devices and processes. SemiKong, the first industry-specific LLM for the semicondu…

    Submitted 21 November, 2024; v1 submitted 20 November, 2024; originally announced November 2024.

    Comments: On-going work

  46. arXiv:2411.13006  [pdf]

    eess.IV cs.AI cs.CV

    Automating Sonologists USG Commands with AI and Voice Interface

    Authors: Emad Mohamed, Shruti Tiwari, Sheena Christabel Pravin

    Abstract: This research presents an advanced AI-powered ultrasound imaging system that incorporates real-time image processing, organ tracking, and voice commands to enhance the efficiency and accuracy of diagnoses in clinical practice. Traditional ultrasound diagnostics often require significant time and introduce a degree of subjectivity due to user interaction. The goal of this innovative solution is to…

    Submitted 19 November, 2024; originally announced November 2024.

  47. arXiv:2411.05605  [pdf, ps, other]

    hep-th

    Logarithmic corrections to entropy of 3D cosmological solutions from celestial dual

    Authors: Arindam Bhattacharjee, Shruti Menon, Muktajyoti Saha

    Abstract: Recently a one-dimensional Schwarzian type theory was proposed as an effective dual theory of pure gravity in (2+1) dimensional asymptotically flat spacetimes \cite{Bhattacharjee:2023sfd}. This codimension-two `celestial' dual captures the Bekenstein-Hawking entropy of bulk flat cosmologies in semiclassical limit. In this paper, we extend this analysis beyond semiclassical approximation and evalua…

    Submitted 27 November, 2024; v1 submitted 8 November, 2024; originally announced November 2024.

    Comments: 19 pages, minor changes and references added

  48. arXiv:2411.05338  [pdf, other]

    cs.CL

    SciDQA: A Deep Reading Comprehension Dataset over Scientific Papers

    Authors: Shruti Singh, Nandan Sarkar, Arman Cohan

    Abstract: Scientific literature is typically dense, requiring significant background knowledge and deep comprehension for effective engagement. We introduce SciDQA, a new dataset for reading comprehension that challenges LLMs for a deep understanding of scientific articles, consisting of 2,937 QA pairs. Unlike other scientific QA datasets, SciDQA sources questions from peer reviews by domain experts and ans…

    Submitted 8 November, 2024; originally announced November 2024.

    Comments: 18 pages, Accepted to EMNLP 2024

  49. arXiv:2410.18037  [pdf, other]

    quant-ph physics.optics

    Quantum optomechanical control of long-lived bulk acoustic phonons

    Authors: Hilel Hagai Diamandi, Yizhi Luo, David Mason, Tevfik Bulent Kanmaz, Sayan Ghosh, Margaret Pavlovich, Taekwan Yoon, Ryan Behunin, Shruti Puri, Jack G. E. Harris, Peter T. Rakich

    Abstract: High-fidelity quantum optomechanical control of a mechanical oscillator requires the ability to perform efficient, low-noise operations on long-lived phononic excitations. Microfabricated high-overtone bulk acoustic wave resonators ($\mathrm{\mu}$HBARs) have been shown to support high-frequency (> 10 GHz) mechanical modes with exceptionally long coherence times (> 1.5 ms), making them a compelling res…

    Submitted 23 October, 2024; originally announced October 2024.

  50. arXiv:2410.15553  [pdf, other]

    cs.CL

    Multi-IF: Benchmarking LLMs on Multi-Turn and Multilingual Instructions Following

    Authors: Yun He, Di Jin, Chaoqi Wang, Chloe Bi, Karishma Mandyam, Hejia Zhang, Chen Zhu, Ning Li, Tengyu Xu, Hongjiang Lv, Shruti Bhosale, Chenguang Zhu, Karthik Abinav Sankararaman, Eryk Helenowski, Melanie Kambadur, Aditya Tayade, Hao Ma, Han Fang, Sinong Wang

    Abstract: Large Language Models (LLMs) have demonstrated impressive capabilities in various tasks, including instruction following, which is crucial for aligning model outputs with user expectations. However, evaluating LLMs' ability to follow instructions remains challenging due to the complexity and subjectivity of human language. Current benchmarks primarily focus on single-turn, monolingual instructions…

    Submitted 12 November, 2024; v1 submitted 20 October, 2024; originally announced October 2024.