Skip to main content

Showing 51–100 of 1,098 results for author: Aayush

.
  1. arXiv:2505.05885  [pdf, ps, other

    cs.DB cs.IR

    Cost-Effective, Low Latency Vector Search with Azure Cosmos DB

    Authors: Nitish Upreti, Krishnan Sundaram, Hari Sudan Sundar, Samer Boshra, Balachandar Perumalswamy, Shivam Atri, Martin Chisholm, Revti Raman Singh, Greg Yang, Subramanyam Pattipaka, Tamara Hass, Nitesh Dudhey, James Codella, Mark Hildebrand, Magdalen Manohar, Jack Moffitt, Haiyang Xu, Naren Datha, Suryansh Gupta, Ravishankar Krishnaswamy, Prashant Gupta, Abhishek Sahu, Ritika Mor, Santosh Kulkarni, Hemeswari Varada , et al. (11 additional authors not shown)

    Abstract: Vector indexing enables semantic search over diverse corpora and has become an important interface to databases for both users and AI agents. Efficient vector search requires deep optimizations in database systems. This has motivated a new class of specialized vector databases that optimize for vector search quality and cost. Instead, we argue that a scalable, high-performance, and cost-efficient… ▽ More

    Submitted 9 May, 2025; originally announced May 2025.

    ACM Class: H.3.3

  2. arXiv:2505.03839  [pdf, other

    cs.IR cs.CL

    An Adaptive Data-Resilient Multi-Modal Framework for Hierarchical Multi-Label Book Genre Identification

    Authors: Utsav Kumar Nareti, Soumi Chattopadhyay, Prolay Mallick, Suraj Kumar, Ayush Vikas Daga, Chandranath Adak, Adarsh Wase, Arjab Roy

    Abstract: Identifying the finer details of a book's genres enhances user experience by enabling efficient book discovery and personalized recommendations, ultimately improving reader engagement and satisfaction. It also provides valuable insights into market trends and consumer preferences, allowing publishers and marketers to make data-driven decisions regarding book production and marketing strategies. Wh… ▽ More

    Submitted 5 May, 2025; originally announced May 2025.

  3. arXiv:2505.03189  [pdf, other

    cs.AI cs.HC

    Patterns and Mechanisms of Contrastive Activation Engineering

    Authors: Yixiong Hao, Ayush Panda, Stepan Shabalin, Sheikh Abdur Raheem Ali

    Abstract: Controlling the behavior of Large Language Models (LLMs) remains a significant challenge due to their inherent complexity and opacity. While techniques like fine-tuning can modify model behavior, they typically require extensive computational resources. Recent work has introduced a class of contrastive activation engineering (CAE) techniques as promising approaches for steering LLM outputs through… ▽ More

    Submitted 6 May, 2025; originally announced May 2025.

    Comments: Published at the ICLR 2025 Bi-Align, HAIC, and Building Trust workshops

  4. arXiv:2505.03173  [pdf, other

    cs.CV cs.AI

    RAVU: Retrieval Augmented Video Understanding with Compositional Reasoning over Graph

    Authors: Sameer Malik, Moyuru Yamada, Ayush Singh, Dishank Aggarwal

    Abstract: Comprehending long videos remains a significant challenge for Large Multi-modal Models (LMMs). Current LMMs struggle to process even minutes to hours videos due to their lack of explicit memory and retrieval mechanisms. To address this limitation, we propose RAVU (Retrieval Augmented Video Understanding), a novel framework for video understanding enhanced by retrieval with compositional reasoning… ▽ More

    Submitted 6 May, 2025; originally announced May 2025.

  5. arXiv:2505.03098  [pdf, other

    eess.SP cs.IT

    USF Spectral Estimation: Prevalence of Gaussian Cramér-Rao Bounds Despite Modulo Folding

    Authors: Ruiming Guo, Ayush Bhandari

    Abstract: Spectral Estimation (SpecEst) is a core area of signal processing with a history spanning two centuries and applications across various fields. With the advent of digital acquisition, SpecEst algorithms have been widely applied to tasks like frequency super-resolution. However, conventional digital acquisition imposes a trade-off: for a fixed bit budget, one can optimize either signal dynamic rang… ▽ More

    Submitted 5 May, 2025; originally announced May 2025.

    Comments: 2 Figs, to appear in Proc. of 2025 IEEE Statistical Signal Processing (SSP) Workshop

  6. arXiv:2505.01700  [pdf, other

    cs.LG q-bio.QM

    PoseX: AI Defeats Physics Approaches on Protein-Ligand Cross Docking

    Authors: Yize Jiang, Xinze Li, Yuanyuan Zhang, Jin Han, Youjun Xu, Ayush Pandit, Zaixi Zhang, Mengdi Wang, Mengyang Wang, Chong Liu, Guang Yang, Yejin Choi, Wu-Jun Li, Tianfan Fu, Fang Wu, Junhong Liu

    Abstract: Existing protein-ligand docking studies typically focus on the self-docking scenario, which is less practical in real applications. Moreover, some studies involve heavy frameworks requiring extensive training, posing challenges for convenient and efficient assessment of docking methods. To fill these gaps, we design PoseX, an open-source benchmark to evaluate both self-docking and cross-docking, e… ▽ More

    Submitted 21 May, 2025; v1 submitted 3 May, 2025; originally announced May 2025.

  7. arXiv:2504.19395  [pdf, other

    cs.CL

    ICL CIPHERS: Quantifying "Learning'' in In-Context Learning via Substitution Ciphers

    Authors: Zhouxiang Fang, Aayush Mishra, Muhan Gao, Anqi Liu, Daniel Khashabi

    Abstract: Recent works have suggested that In-Context Learning (ICL) operates in dual modes, i.e. task retrieval (remember learned patterns from pre-training) and task learning (inference-time ``learning'' from demonstrations). However, disentangling these the two modes remains a challenging goal. We introduce ICL CIPHERS, a class of task reformulations based on substitution ciphers borrowed from classic cr… ▽ More

    Submitted 27 April, 2025; originally announced April 2025.

  8. arXiv:2504.17950  [pdf, other

    cs.MA cs.CL

    Collaborating Action by Action: A Multi-agent LLM Framework for Embodied Reasoning

    Authors: Isadora White, Kolby Nottingham, Ayush Maniar, Max Robinson, Hansen Lillemark, Mehul Maheshwari, Lianhui Qin, Prithviraj Ammanabrolu

    Abstract: Collaboration is ubiquitous and essential in day-to-day life -- from exchanging ideas, to delegating tasks, to generating plans together. This work studies how LLMs can adaptively collaborate to perform complex embodied reasoning tasks. To this end we introduce MINDcraft, an easily extensible platform built to enable LLM agents to control characters in the open-world game of Minecraft; and MineCol… ▽ More

    Submitted 24 April, 2025; originally announced April 2025.

    Comments: 9 pages of main paper with 6 main figures, overall 28 pages

  9. arXiv:2504.17656  [pdf, ps, other

    cs.CE cond-mat.mtrl-sci cs.LG

    polyGen: A Learning Framework for Atomic-level Polymer Structure Generation

    Authors: Ayush Jain, Rampi Ramprasad

    Abstract: Synthetic polymeric materials underpin fundamental technologies in the energy, electronics, consumer goods, and medical sectors, yet their development still suffers from prolonged design timelines. Although polymer informatics tools have supported speedup, polymer simulation protocols continue to face significant challenges in the on-demand generation of realistic 3D atomic structures that respect… ▽ More

    Submitted 10 June, 2025; v1 submitted 24 April, 2025; originally announced April 2025.

  10. arXiv:2504.17140  [pdf, other

    cs.LG cs.AI

    Scalable Permutation-Aware Modeling for Temporal Set Prediction

    Authors: Ashish Ranjan, Ayush Agarwal, Shalin Barot, Sushant Kumar

    Abstract: Temporal set prediction involves forecasting the elements that will appear in the next set, given a sequence of prior sets, each containing a variable number of elements. Existing methods often rely on intricate architectures with substantial computational overhead, which hampers their scalability. In this work, we introduce a novel and scalable framework that leverages permutation-equivariant and… ▽ More

    Submitted 23 April, 2025; originally announced April 2025.

  11. arXiv:2504.14151  [pdf, other

    cs.CV cs.AI cs.RO

    Locate 3D: Real-World Object Localization via Self-Supervised Learning in 3D

    Authors: Sergio Arnaud, Paul McVay, Ada Martin, Arjun Majumdar, Krishna Murthy Jatavallabhula, Phillip Thomas, Ruslan Partsey, Daniel Dugas, Abha Gejji, Alexander Sax, Vincent-Pierre Berges, Mikael Henaff, Ayush Jain, Ang Cao, Ishita Prasad, Mrinal Kalakrishnan, Michael Rabbat, Nicolas Ballas, Mido Assran, Oleksandr Maksymets, Aravind Rajeswaran, Franziska Meier

    Abstract: We present LOCATE 3D, a model for localizing objects in 3D scenes from referring expressions like "the small coffee table between the sofa and the lamp." LOCATE 3D sets a new state-of-the-art on standard referential grounding benchmarks and showcases robust generalization capabilities. Notably, LOCATE 3D operates directly on sensor observation streams (posed RGB-D frames), enabling real-world depl… ▽ More

    Submitted 18 April, 2025; originally announced April 2025.

    ACM Class: I.2.10; I.2.6; I.2.9; I.3.7; I.4.6; I.4.8

  12. arXiv:2504.12515  [pdf, other

    cs.CV

    Event Quality Score (EQS): Assessing the Realism of Simulated Event Camera Streams via Distances in Latent Space

    Authors: Kaustav Chanda, Aayush Atul Verma, Arpitsinh Vaghela, Yezhou Yang, Bharatesh Chakravarthi

    Abstract: Event cameras promise a paradigm shift in vision sensing with their low latency, high dynamic range, and asynchronous nature of events. Unfortunately, the scarcity of high-quality labeled datasets hinders their widespread adoption in deep learning-driven computer vision. To mitigate this, several simulators have been proposed to generate synthetic event data for training models for detection and e… ▽ More

    Submitted 20 April, 2025; v1 submitted 16 April, 2025; originally announced April 2025.

    Comments: Accepted at 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW); Fifth International Workshop on Event-Based Vision

  13. arXiv:2504.12389  [pdf, other

    quant-ph cs.LG

    Predictive control of blast furnace temperature in steelmaking with hybrid depth-infused quantum neural networks

    Authors: Nayoung Lee, Minsoo Shin, Asel Sagingalieva, Ayush Joshi Tripathi, Karan Pinto, Alexey Melnikov

    Abstract: Accurate prediction and stabilization of blast furnace temperatures are crucial for optimizing the efficiency and productivity of steel production. Traditional methods often struggle with the complex and non-linear nature of the temperature fluctuations within blast furnaces. This paper proposes a novel approach that combines hybrid quantum machine learning with pulverized coal injection control t… ▽ More

    Submitted 16 April, 2025; originally announced April 2025.

  14. arXiv:2504.11673  [pdf, ps, other

    cs.CL

    Deep Binding of Language Model Virtual Personas: a Study on Approximating Political Partisan Misperceptions

    Authors: Minwoo Kang, Suhong Moon, Seung Hyeong Lee, Ayush Raj, Joseph Suh, David M. Chan

    Abstract: Large language models (LLMs) are increasingly capable of simulating human behavior, offering cost-effective ways to estimate user responses to various surveys and polls. However, the questions in these surveys usually reflect socially understood attitudes: the patterns of attitudes of old/young, liberal/conservative, as understood by both members and non-members of those groups. It is not clear wh… ▽ More

    Submitted 12 June, 2025; v1 submitted 15 April, 2025; originally announced April 2025.

  15. arXiv:2504.09249  [pdf, other

    cs.CV cs.IR cs.LG cs.MM

    NoTeS-Bank: Benchmarking Neural Transcription and Search for Scientific Notes Understanding

    Authors: Aniket Pal, Sanket Biswas, Alloy Das, Ayush Lodh, Priyanka Banerjee, Soumitri Chattopadhyay, Dimosthenis Karatzas, Josep Llados, C. V. Jawahar

    Abstract: Understanding and reasoning over academic handwritten notes remains a challenge in document AI, particularly for mathematical equations, diagrams, and scientific notations. Existing visual question answering (VQA) benchmarks focus on printed or structured handwritten text, limiting generalization to real-world note-taking. To address this, we introduce NoTeS-Bank, an evaluation benchmark for Neura… ▽ More

    Submitted 12 April, 2025; originally announced April 2025.

  16. arXiv:2504.08169  [pdf, other

    cs.LG cs.AI stat.AP stat.ML

    On the Practice of Deep Hierarchical Ensemble Network for Ad Conversion Rate Prediction

    Authors: Jinfeng Zhuang, Yinrui Li, Runze Su, Ke Xu, Zhixuan Shao, Kungang Li, Ling Leng, Han Sun, Meng Qi, Yixiong Meng, Yang Tang, Zhifang Liu, Qifei Shen, Aayush Mudgal, Caleb Lu, Jie Liu, Hongda Shen

    Abstract: The predictions of click through rate (CTR) and conversion rate (CVR) play a crucial role in the success of ad-recommendation systems. A Deep Hierarchical Ensemble Network (DHEN) has been proposed to integrate multiple feature crossing modules and has achieved great success in CTR prediction. However, its performance for CVR prediction is unclear in the conversion ads setting, where an ad bids for… ▽ More

    Submitted 23 April, 2025; v1 submitted 10 April, 2025; originally announced April 2025.

    Comments: Accepted by WWW 2025

  17. arXiv:2504.07949  [pdf, other

    cs.CV

    InteractAvatar: Modeling Hand-Face Interaction in Photorealistic Avatars with Deformable Gaussians

    Authors: Kefan Chen, Sergiu Oprea, Justin Theiss, Sreyas Mohan, Srinath Sridhar, Aayush Prakash

    Abstract: With the rising interest from the community in digital avatars coupled with the importance of expressions and gestures in communication, modeling natural avatar behavior remains an important challenge across many industries such as teleconferencing, gaming, and AR/VR. Human hands are the primary tool for interacting with the environment and essential for realistic human behavior modeling, yet exis… ▽ More

    Submitted 10 April, 2025; originally announced April 2025.

  18. arXiv:2504.06219  [pdf, other

    cs.CL cs.LG

    Can Performant LLMs Be Ethical? Quantifying the Impact of Web Crawling Opt-Outs

    Authors: Dongyang Fan, Vinko Sabolčec, Matin Ansaripour, Ayush Kumar Tarun, Martin Jaggi, Antoine Bosselut, Imanol Schlag

    Abstract: The increasing adoption of web crawling opt-outs by copyright holders of online content raises critical questions about the impact of data compliance on large language model (LLM) performance. However, little is known about how these restrictions (and the resultant filtering of pretraining datasets) affect the capabilities of models trained using these corpora. In this work, we conceptualize this… ▽ More

    Submitted 8 April, 2025; originally announced April 2025.

  19. arXiv:2504.05781  [pdf, other

    cs.HC cs.CY

    Building Proactive and Instant-Reactive Safety Designs to Address Harassment in Social Virtual Reality

    Authors: Zhehui Liao, Hanwen Zhao, Ayush Kulkarni, Shaan Singh Chattrath, Amy X. Zhang

    Abstract: Social Virtual Reality (VR) games offer immersive socialization experiences but pose significant challenges of harassment. Common solutions, such as reporting and moderation, address harassment after it happens but fail to prevent or stop harassment in the moment. In this study, we explore and design proactive and instant-reactive safety designs to mitigate harassment in social VR. Proactive desig… ▽ More

    Submitted 8 April, 2025; originally announced April 2025.

    Comments: 37 pages, 11 figures

  20. arXiv:2504.05506  [pdf, other

    cs.CL

    ChartQAPro: A More Diverse and Challenging Benchmark for Chart Question Answering

    Authors: Ahmed Masry, Mohammed Saidul Islam, Mahir Ahmed, Aayush Bajaj, Firoz Kabir, Aaryaman Kartha, Md Tahmid Rahman Laskar, Mizanur Rahman, Shadikur Rahman, Mehrad Shahmohammadi, Megh Thakkar, Md Rizwan Parvez, Enamul Hoque, Shafiq Joty

    Abstract: Charts are ubiquitous, as people often use them to analyze data, answer questions, and discover critical insights. However, performing complex analytical tasks with charts requires significant perceptual and cognitive effort. Chart Question Answering (CQA) systems automate this process by enabling models to interpret and reason with visual representations of data. However, existing benchmarks like… ▽ More

    Submitted 10 April, 2025; v1 submitted 7 April, 2025; originally announced April 2025.

  21. arXiv:2504.05405  [pdf, other

    cs.LG cs.AI stat.ML

    The Role of Environment Access in Agnostic Reinforcement Learning

    Authors: Akshay Krishnamurthy, Gene Li, Ayush Sekhari

    Abstract: We study Reinforcement Learning (RL) in environments with large state spaces, where function approximation is required for sample-efficient learning. Departing from a long history of prior work, we consider the weakest possible form of function approximation, called agnostic policy learning, where the learner seeks to find the best policy in a given class $Π$, with no guarantee that $Π$ contains a… ▽ More

    Submitted 7 April, 2025; originally announced April 2025.

    Comments: comments welcome

  22. arXiv:2504.05192  [pdf, other

    astro-ph.HE astro-ph.IM

    CHIME/FRB Outriggers: Design Overview

    Authors: The CHIME/FRB Collaboration, Mandana Amiri, Bridget C. Andersen, Shion Andrew, Kevin Bandura, Mohit Bhardwaj, Kalyani Bhopi, Vadym Bidula, P. J. Boyle, Charanjot Brar, Mark Carlson, Tomas Cassanelli, Alyssa Cassity, Shami Chatterjee, Jean-François Cliche, Alice P. Curtin, Rachel Darlinger, David R. DeBoer, Matt Dobbs, Fengqiu Adam Dong, Gwendolyn Eadie, Emmanuel Fonseca, B. M. Gaensler, Nina Gusinskaia, Mark Halpern , et al. (44 additional authors not shown)

    Abstract: The Canadian Hydrogen Intensity Mapping Experiment (CHIME) has emerged as the world's premier facility for studying fast radio bursts (FRBs) through its fast transient search backend CHIME/FRB\@. The CHIME/FRB Outriggers project will augment this high detection rate of 2--3 FRBs per day with the ability to precisely localize them using very long baseline interferometry (VLBI). Using three strategi… ▽ More

    Submitted 7 April, 2025; originally announced April 2025.

    Comments: 32 pages, 7 figures, submitted to ApJ

  23. arXiv:2504.03624  [pdf, other

    cs.CL cs.AI cs.LG

    Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models

    Authors: NVIDIA, :, Aaron Blakeman, Aarti Basant, Abhinav Khattar, Adithya Renduchintala, Akhiad Bercovich, Aleksander Ficek, Alexis Bjorlin, Ali Taghibakhshi, Amala Sanjay Deshmukh, Ameya Sunil Mahabaleshwarkar, Andrew Tao, Anna Shors, Ashwath Aithal, Ashwin Poojary, Ayush Dattagupta, Balaram Buddharaju, Bobby Chen, Boris Ginsburg, Boxin Wang, Brandon Norick, Brian Butterfield, Bryan Catanzaro, Carlo del Mundo , et al. (176 additional authors not shown)

    Abstract: As inference-time scaling becomes critical for enhanced reasoning capabilities, it is increasingly becoming important to build models that are efficient to infer. We introduce Nemotron-H, a family of 8B and 56B/47B hybrid Mamba-Transformer models designed to reduce inference cost for a given accuracy level. To achieve this goal, we replace the majority of self-attention layers in the common Transf… ▽ More

    Submitted 15 April, 2025; v1 submitted 4 April, 2025; originally announced April 2025.

  24. arXiv:2504.01669  [pdf, other

    astro-ph.CO gr-qc hep-ph

    The CosmoVerse White Paper: Addressing observational tensions in cosmology with systematics and fundamental physics

    Authors: Eleonora Di Valentino, Jackson Levi Said, Adam Riess, Agnieszka Pollo, Vivian Poulin, Adrià Gómez-Valent, Amanda Weltman, Antonella Palmese, Caroline D. Huang, Carsten van de Bruck, Chandra Shekhar Saraf, Cheng-Yu Kuo, Cora Uhlemann, Daniela Grandón, Dante Paz, Dominique Eckert, Elsa M. Teixeira, Emmanuel N. Saridakis, Eoin Ó Colgáin, Florian Beutler, Florian Niedermann, Francesco Bajardi, Gabriela Barenboim, Giulia Gubitosi, Ilaria Musella , et al. (513 additional authors not shown)

    Abstract: The standard model of cosmology has provided a good phenomenological description of a wide range of observations both at astrophysical and cosmological scales for several decades. This concordance model is constructed by a universal cosmological constant and supported by a matter sector described by the standard model of particle physics and a cold dark matter contribution, as well as very early-t… ▽ More

    Submitted 15 May, 2025; v1 submitted 2 April, 2025; originally announced April 2025.

    Comments: 416 pages, 81 figures, accepted in PotDU

  25. arXiv:2504.00030  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Token-Driven GammaTune: Adaptive Calibration for Enhanced Speculative Decoding

    Authors: Aayush Gautam, Susav Shrestha, Narasimha Reddy

    Abstract: Speculative decoding accelerates large language model (LLM) inference by using a smaller draft model to propose tokens, which are then verified by a larger target model. However, selecting an optimal speculation length is critical for maximizing speedup while minimizing wasted computation. We introduce \textit{GammaTune} and \textit{GammaTune+}, training-free adaptive algorithms that dynamically a… ▽ More

    Submitted 4 June, 2025; v1 submitted 28 March, 2025; originally announced April 2025.

    Comments: 6 pages, 2 figures, 1 table

  26. arXiv:2503.21772  [pdf, other

    cs.CV

    LOCORE: Image Re-ranking with Long-Context Sequence Modeling

    Authors: Zilin Xiao, Pavel Suma, Ayush Sachdeva, Hao-Jen Wang, Giorgos Kordopatis-Zilos, Giorgos Tolias, Vicente Ordonez

    Abstract: We introduce LOCORE, Long-Context Re-ranker, a model that takes as input local descriptors corresponding to an image query and a list of gallery images and outputs similarity scores between the query and each gallery image. This model is used for image retrieval, where typically a first ranking is performed with an efficient similarity measure, and then a shortlist of top-ranked images is re-ranke… ▽ More

    Submitted 27 March, 2025; originally announced March 2025.

    Comments: CVPR 2025

  27. arXiv:2503.21232  [pdf, other

    cs.AI

    Knowledge Graphs as World Models for Semantic Material-Aware Obstacle Handling in Autonomous Vehicles

    Authors: Ayush Bheemaiah, Seungyong Yang

    Abstract: The inability of autonomous vehicles (AVs) to infer the material properties of obstacles limits their decision-making capacity. While AVs rely on sensor systems such as cameras, LiDAR, and radar to detect obstacles, this study suggests combining sensors with a knowledge graph (KG)-based world model to improve AVs' comprehension of physical material qualities. Beyond sensor data, AVs can infer qual… ▽ More

    Submitted 27 March, 2025; originally announced March 2025.

  28. arXiv:2503.19328  [pdf, other

    cs.CL cs.AI

    Substance over Style: Evaluating Proactive Conversational Coaching Agents

    Authors: Vidya Srinivas, Xuhai Xu, Xin Liu, Kumar Ayush, Isaac Galatzer-Levy, Shwetak Patel, Daniel McDuff, Tim Althoff

    Abstract: While NLP research has made strides in conversational tasks, many approaches focus on single-turn responses with well-defined objectives or evaluation criteria. In contrast, coaching presents unique challenges with initially undefined goals that evolve through multi-turn interactions, subjective evaluation criteria, mixed-initiative dialogue. In this work, we describe and implement five multi-turn… ▽ More

    Submitted 24 March, 2025; originally announced March 2025.

  29. arXiv:2503.19072  [pdf, other

    quant-ph hep-ph

    Entanglement Witnesses Mediated Via Axion-Like Particles

    Authors: Pablo Guillermo Carmona Rufo, Ayush Kumar, Carlos Sabín, Anupam Mazumdar

    Abstract: Entanglement is solely a quantum property and it can be extremely helpful to test the physics beyond the Standard Model in tabletop experiments with the advent of future quantum technologies. In this work, we provide an entanglement-based partial positive transpose (PPT) witness for Yukawa-type potentials in the infrared regime between pairs of neutral/charged particles in a spatial quantum superp… ▽ More

    Submitted 24 March, 2025; originally announced March 2025.

    Comments: 9 pages, 5 figures

  30. arXiv:2503.13512  [pdf, other

    stat.ML cs.DM cs.LG cs.SC math.CO math.FA

    Positivity sets of hinge functions

    Authors: Josef Schicho, Ayush Kumar Tewari, Audie Warren

    Abstract: In this paper we investigate which subsets of the real plane are realisable as the set of points on which a one-layer ReLU neural network takes a positive value. In the case of cones we give a full characterisation of such sets. Furthermore, we give a necessary condition for any subset of $\mathbb R^d$. We give various examples of such one-layer neural networks.

    Submitted 14 March, 2025; originally announced March 2025.

  31. arXiv:2503.11538  [pdf, other

    cs.CV cs.AI cs.LG physics.ao-ph physics.optics

    FLASHμ: Fast Localizing And Sizing of Holographic Microparticles

    Authors: Ayush Paliwal, Oliver Schlenczek, Birte Thiede, Manuel Santos Pereira, Katja Stieger, Eberhard Bodenschatz, Gholamhossein Bagheri, Alexander Ecker

    Abstract: Reconstructing the 3D location and size of microparticles from diffraction images - holograms - is a computationally expensive inverse problem that has traditionally been solved using physics-based reconstruction methods. More recently, researchers have used machine learning methods to speed up the process. However, for small particles in large sample volumes the performance of these methods falls… ▽ More

    Submitted 14 March, 2025; originally announced March 2025.

  32. arXiv:2503.11195  [pdf, other

    cs.CV

    Provenance Detection for AI-Generated Images: Combining Perceptual Hashing, Homomorphic Encryption, and AI Detection Models

    Authors: Shree Singhi, Aayan Yadav, Aayush Gupta, Shariar Ebrahimi, Parisa Hassanizadeh

    Abstract: As AI-generated sensitive images become more prevalent, identifying their source is crucial for distinguishing them from real images. Conventional image watermarking methods are vulnerable to common transformations like filters, lossy compression, and screenshots, often applied during social media sharing. Watermarks can also be faked or removed if models are open-sourced or leaked since images ca… ▽ More

    Submitted 14 March, 2025; originally announced March 2025.

  33. arXiv:2503.10970  [pdf, other

    cs.AI cs.LG

    TxAgent: An AI Agent for Therapeutic Reasoning Across a Universe of Tools

    Authors: Shanghua Gao, Richard Zhu, Zhenglun Kong, Ayush Noori, Xiaorui Su, Curtis Ginder, Theodoros Tsiligkaridis, Marinka Zitnik

    Abstract: Precision therapeutics require multimodal adaptive models that generate personalized treatment recommendations. We introduce TxAgent, an AI agent that leverages multi-step reasoning and real-time biomedical knowledge retrieval across a toolbox of 211 tools to analyze drug interactions, contraindications, and patient-specific treatment strategies. TxAgent evaluates how drugs interact at molecular,… ▽ More

    Submitted 13 March, 2025; originally announced March 2025.

    Comments: Project page: https://zitniklab.hms.harvard.edu/TxAgent TxAgent code: https://github.com/mims-harvard/TxAgent ToolUniverse code: https://github.com/mims-harvard/ToolUniverse

  34. arXiv:2503.10745  [pdf, ps, other

    cs.CV cs.AI cs.RO

    Unifying 2D and 3D Vision-Language Understanding

    Authors: Ayush Jain, Alexander Swerdlow, Yuzhou Wang, Sergio Arnaud, Ada Martin, Alexander Sax, Franziska Meier, Katerina Fragkiadaki

    Abstract: Progress in 3D vision-language learning has been hindered by the scarcity of large-scale 3D datasets. We introduce UniVLG, a unified architecture for 2D and 3D vision-language understanding that bridges the gap between existing 2D-centric models and the rich 3D sensory data available in embodied systems. Our approach initializes most model weights from pre-trained 2D models and trains on both 2D a… ▽ More

    Submitted 8 June, 2025; v1 submitted 13 March, 2025; originally announced March 2025.

    Comments: The first two authors contributed equally

  35. arXiv:2503.09705  [pdf, other

    cond-mat.str-el

    Altermagnets with topological order in Kitaev bilayers

    Authors: Aayush Vijayvargia, Ezra Day-Roberts, Antia S. Botana, Onur Erten

    Abstract: Building on recent advancements in altermagnetism, we develop a highly-frustrated magnetic model with Kitaev-like interactions that integrates key aspects of both quantum spin liquids and altermagnets. While the ground state is a gapless quantum spin liquid, our analysis indicates that an altermagnetic local order emerges upon the introduction of additional interactions that gap the excitation spe… ▽ More

    Submitted 12 March, 2025; originally announced March 2025.

    Comments: 7 pages, 5 figures

  36. arXiv:2503.09670  [pdf, other

    quant-ph physics.chem-ph

    On the generalized eigenvalue problem in subspace-based excited state methods for quantum computers

    Authors: Prince Frederick Kwao, Srivathsan Poyyapakkam Sundar, Brajesh Gupt, Ayush Asthana

    Abstract: Solving challenging problems in quantum chemistry is one of the leading promised applications of quantum computers. Within the quantum algorithms proposed for problems in excited state quantum chemistry, subspace-based quantum algorithms, including quantum subspace expansion (QSE), quantum equation of motion (qEOM) and quantum self-consistent equation-of-motion (q-sc-EOM), are promising for pre-fa… ▽ More

    Submitted 12 March, 2025; originally announced March 2025.

  37. arXiv:2503.08814  [pdf

    cs.HC

    An Iterative, User-Centered Design of a Clinical Decision Support System for Critical Care Assessments: Co-Design Sessions with ICU Clinical Providers

    Authors: Andrea E. Davidson, Jessica M. Ray, Ayush K. Patel, Yulia Strekalova Levites, Parisa Rashidi, Azra Bihorac

    Abstract: This study reports the findings of qualitative interview sessions conducted with ICU clinicians for the co-design of a system user interface of an artificial intelligence (AI)-driven clinical decision support (CDS) system. This system integrates medical record data with wearable sensor, video, and environmental data into a real-time dynamic model that quantifies patients' risk of clinical decompen… ▽ More

    Submitted 11 March, 2025; originally announced March 2025.

  38. arXiv:2503.08732  [pdf

    q-bio.QM cs.AI

    Quantifying Circadian Desynchrony in ICU Patients and Its Association with Delirium

    Authors: Yuanfang Ren, Andrea E. Davidson, Jiaqing Zhang, Miguel Contreras, Ayush K. Patel, Michelle Gumz, Tezcan Ozrazgat-Baslanti, Parisa Rashidi, Azra Bihorac

    Abstract: Background: Circadian desynchrony characterized by the misalignment between an individual's internal biological rhythms and external environmental cues, significantly affects various physiological processes and health outcomes. Quantifying circadian desynchrony often requires prolonged and frequent monitoring, and currently, an easy tool for this purpose is missing. Additionally, its association w… ▽ More

    Submitted 10 March, 2025; originally announced March 2025.

  39. arXiv:2503.04401  [pdf, ps, other

    math.GR math.RT

    Representations of Skew Left Braces of order $pq$

    Authors: Nishant Rathee, Ayush Udeep

    Abstract: In this paper, we study the irreducible representations of skew braces of order \( pq \), which is equivalent to studying the representation theory of groups of order \( p^2q^2 \) arising from skew left braces, where \( p > q \) are primes. To achieve this, we classify all semidirect product groups \( Λ_A \) associated with skew left braces $A$ of order \( pq \), up to isomorphism.

    Submitted 6 March, 2025; originally announced March 2025.

    Comments: 14 pages

    MSC Class: 16T25; 20C15; 20D10

  40. Atomistically informed phase field study of austenite grain growth

    Authors: Ayush Suhane, Daniel Scheiber, Vsevolod I. Razumovskiy, Matthias Militzer

    Abstract: Atomistically-informed phase field simulations have been performed to investigate the effect of five common alloying elements (Nb, Ti, Mo, V, Mn) on austenite grain growth. The anisotropic simulations based on the segregation energy profiles of the solutes to four different grain boundary (GB) types from density functional theory calculations suggest a secondary role of solute drag anisotropy on g… ▽ More

    Submitted 5 March, 2025; originally announced March 2025.

    Journal ref: Computational Materials Science, Volume 228, September 2023, 112300

  41. arXiv:2503.02092  [pdf, other

    cs.CV cs.RO

    Data Augmentation for NeRFs in the Low Data Limit

    Authors: Ayush Gaggar, Todd D. Murphey

    Abstract: Current methods based on Neural Radiance Fields fail in the low data limit, particularly when training on incomplete scene data. Prior works augment training data only in next-best-view applications, which lead to hallucinations and model collapse with sparse data. In contrast, we propose adding a set of views during training by rejection sampling from a posterior uncertainty distribution, generat… ▽ More

    Submitted 3 March, 2025; originally announced March 2025.

    Comments: To be published in 2025 IEEE International Conference on Robotics and Automation (ICRA 2025)

  42. arXiv:2503.01307  [pdf, other

    cs.CL cs.LG

    Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRs

    Authors: Kanishk Gandhi, Ayush Chakravarthy, Anikait Singh, Nathan Lile, Noah D. Goodman

    Abstract: Test-time inference has emerged as a powerful paradigm for enabling language models to ``think'' longer and more carefully about complex challenges, much like skilled human experts. While reinforcement learning (RL) can drive self-improvement in language models on verifiable tasks, some models exhibit substantial gains while others quickly plateau. For instance, we find that Qwen-2.5-3B far exceed… ▽ More

    Submitted 3 March, 2025; originally announced March 2025.

  43. arXiv:2503.01287  [pdf, other

    cs.LG cs.AI stat.ML

    Robust Simulation-Based Inference under Missing Data via Neural Processes

    Authors: Yogesh Verma, Ayush Bharti, Vikas Garg

    Abstract: Simulation-based inference (SBI) methods typically require fully observed data to infer parameters of models with intractable likelihood functions. However, datasets often contain missing values due to incomplete observations, data corruptions (common in astrophysics), or instrument limitations (e.g., in high-energy physics applications). In such scenarios, missing data must be imputed before appl… ▽ More

    Submitted 3 March, 2025; originally announced March 2025.

    Comments: Accepted at ICLR 2025

  44. arXiv:2502.20420  [pdf, other

    cs.CL cs.CV

    Chitranuvad: Adapting Multi-Lingual LLMs for Multimodal Translation

    Authors: Shaharukh Khan, Ayush Tarun, Ali Faraz, Palash Kamble, Vivek Dahiya, Praveen Pokala, Ashish Kulkarni, Chandra Khatri, Abhinav Ravi, Shubham Agarwal

    Abstract: In this work, we provide the system description of our submission as part of the English to Lowres Multimodal Translation Task at the Workshop on Asian Translation (WAT2024). We introduce Chitranuvad, a multimodal model that effectively integrates Multilingual LLM and a vision module for Multimodal Translation. Our method uses a ViT image encoder to extract visual representations as visual token e… ▽ More

    Submitted 27 February, 2025; originally announced February 2025.

    Journal ref: https://aclanthology.org/2024.wmt-1.80/

  45. arXiv:2502.20389  [pdf, ps, other

    cs.CV

    From Thousands to Billions: 3D Visual Language Grounding via Render-Supervised Distillation from 2D VLMs

    Authors: Ang Cao, Sergio Arnaud, Oleksandr Maksymets, Jianing Yang, Ayush Jain, Sriram Yenamandra, Ada Martin, Vincent-Pierre Berges, Paul McVay, Ruslan Partsey, Aravind Rajeswaran, Franziska Meier, Justin Johnson, Jeong Joon Park, Alexander Sax

    Abstract: 3D vision-language grounding faces a fundamental data bottleneck: while 2D models train on billions of images, 3D models have access to only thousands of labeled scenes--a six-order-of-magnitude gap that severely limits performance. We introduce $\textbf{LIFT-GS}$, a practical distillation technique that overcomes this limitation by using differentiable rendering to bridge 3D and 2D supervision. L… ▽ More

    Submitted 9 June, 2025; v1 submitted 27 February, 2025; originally announced February 2025.

    Comments: Project page: https://liftgs.github.io

  46. arXiv:2502.20375  [pdf, other

    cs.LG

    When does a predictor know its own loss?

    Authors: Aravind Gollakota, Parikshit Gopalan, Aayush Karan, Charlotte Peale, Udi Wieder

    Abstract: Given a predictor and a loss function, how well can we predict the loss that the predictor will incur on an input? This is the problem of loss prediction, a key computational task associated with uncertainty estimation for a predictor. In a classification setting, a predictor will typically predict a distribution over labels and hence have its own estimate of the loss that it will incur, given by… ▽ More

    Submitted 27 February, 2025; originally announced February 2025.

  47. arXiv:2502.19781  [pdf, other

    cs.CV

    RANGE: Retrieval Augmented Neural Fields for Multi-Resolution Geo-Embeddings

    Authors: Aayush Dhakal, Srikumar Sastry, Subash Khanal, Adeel Ahmad, Eric Xing, Nathan Jacobs

    Abstract: The choice of representation for geographic location significantly impacts the accuracy of models for a broad range of geospatial tasks, including fine-grained species classification, population density estimation, and biome classification. Recent works like SatCLIP and GeoCLIP learn such representations by contrastively aligning geolocation with co-located images. While these methods work excepti… ▽ More

    Submitted 3 April, 2025; v1 submitted 27 February, 2025; originally announced February 2025.

    Comments: Accepted to CVPR 2025

  48. arXiv:2502.19322  [pdf, other

    hep-th

    Bosonisation and BTZ Black Hole Microstates

    Authors: Suvankar Dutta, Shruti Menon, Aayush Srivastav

    Abstract: When the boundary dynamics of \(AdS_3\) gravity is governed by the collective field theory Hamiltonian proposed by Jevicki and Sakita, its asymptotic symmetry algebra becomes the centerless \(U(1)\) Kac-Moody algebra. We quantize this system using the quantum bosonization of relativistic free fermions and relate these to the dynamical fields of \(AdS_3\) gravity. This leads to a correspondence whe… ▽ More

    Submitted 20 March, 2025; v1 submitted 26 February, 2025; originally announced February 2025.

    Comments: 22 pages, 1 figure

  49. arXiv:2502.18832  [pdf, other

    cs.OS cs.PL

    Safe and usable kernel extensions with Rex

    Authors: Jinghao Jia, Ruowen Qin, Milo Craun, Egor Lukiyanov, Ayush Bansal, Michael V. Le, Hubertus Franke, Hani Jamjoom, Tianyin Xu, Dan Williams

    Abstract: Safe kernel extensions have gained significant traction, evolving from simple packet filters to large, complex programs that customize storage, networking, and scheduling. Existing kernel extension mechanisms like eBPF rely on in-kernel verifiers to ensure safety of kernel extensions by static verification using symbolic execution. We identify significant usability issues -- safe extensions being… ▽ More

    Submitted 28 April, 2025; v1 submitted 26 February, 2025; originally announced February 2025.

    ACM Class: D.4.5

  50. arXiv:2502.17955  [pdf, other

    cs.CL cs.AI

    Language Models' Factuality Depends on the Language of Inquiry

    Authors: Tushar Aggarwal, Kumar Tanmay, Ayush Agrawal, Kumar Ayush, Hamid Palangi, Paul Pu Liang

    Abstract: Multilingual language models (LMs) are expected to recall factual knowledge consistently across languages, yet they often fail to transfer knowledge between languages even when they possess the correct information in one of the languages. For example, we find that an LM may correctly identify Rashed Al Shashai as being from Saudi Arabia when asked in Arabic, but consistently fails to do so when as… ▽ More

    Submitted 25 February, 2025; originally announced February 2025.