Showing 1–50 of 217 results for author: Aadithya

Searching in archive cs.

  1. arXiv:2506.17121  [pdf, ps, other]

    cs.CL

    Cache Me If You Can: How Many KVs Do You Need for Effective Long-Context LMs?

    Authors: Adithya Bhaskar, Alexander Wettig, Tianyu Gao, Yihe Dong, Danqi Chen

    Abstract: Language models handle increasingly long contexts for tasks such as book summarization, but this leads to growing memory costs for the key-value (KV) cache. Many prior works have proposed ways of discarding KVs from memory, but their approaches are tailored to favorable settings, obscuring caveats like high peak memory and performance degradation, and a fair comparison between methods is difficult… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

    Comments: We release our code publicly at https://github.com/princeton-pli/PruLong

  2. arXiv:2506.06211  [pdf, other]

    cs.CL cs.AI cs.CV

    PuzzleWorld: A Benchmark for Multimodal, Open-Ended Reasoning in Puzzlehunts

    Authors: Hengzhi Li, Brendon Jiang, Alexander Naehu, Regan Song, Justin Zhang, Megan Tjandrasuwita, Chanakya Ekbote, Steven-Shine Chen, Adithya Balachandran, Wei Dai, Rebecca Chang, Paul Pu Liang

    Abstract: Puzzlehunts are a genre of complex, multi-step puzzles lacking well-defined problem definitions. In contrast to conventional reasoning benchmarks consisting of tasks with clear instructions, puzzlehunts require models to discover the underlying problem structure from multimodal evidence and iterative reasoning, mirroring real-world domains such as scientific discovery, exploratory data analysis, o… ▽ More

    Submitted 6 June, 2025; originally announced June 2025.

  3. arXiv:2506.04531  [pdf, ps, other]

    cs.LG

    HALoS: Hierarchical Asynchronous Local SGD over Slow Networks for Geo-Distributed Large Language Model Training

    Authors: Geon-Woo Kim, Junbo Li, Shashidhar Gandham, Omar Baldonado, Adithya Gangidi, Pavan Balaji, Zhangyang Wang, Aditya Akella

    Abstract: Training large language models (LLMs) increasingly relies on geographically distributed accelerators, causing prohibitive communication costs across regions and uneven utilization of heterogeneous hardware. We propose HALoS, a hierarchical asynchronous optimization framework that tackles these issues by introducing local parameter servers (LPSs) within each region and a global parameter server (GP… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

  4. arXiv:2506.00569  [pdf, ps, other]

    cs.LG

    AutoMixAlign: Adaptive Data Mixing for Multi-Task Preference Optimization in LLMs

    Authors: Nicholas E. Corrado, Julian Katz-Samuels, Adithya Devraj, Hyokun Yun, Chao Zhang, Yi Xu, Yi Pan, Bing Yin, Trishul Chilimbi

    Abstract: When aligning large language models (LLMs), their performance on various tasks (such as being helpful, harmless, and honest) depends heavily on the composition of their training data. However, selecting a data mixture that achieves strong performance across all tasks is challenging. Existing approaches rely on large ablation studies, heuristics, or human intuition, but these can be prohibitively e… ▽ More

    Submitted 31 May, 2025; originally announced June 2025.

    Comments: ACL 2025, Main Conference

  5. arXiv:2505.24705  [pdf, ps, other]

    cs.CV

    RT-X Net: RGB-Thermal cross attention network for Low-Light Image Enhancement

    Authors: Raman Jha, Adithya Lenka, Mani Ramanagopal, Aswin Sankaranarayanan, Kaushik Mitra

    Abstract: In nighttime conditions, high noise levels and bright illumination sources degrade image quality, making low-light image enhancement challenging. Thermal images provide complementary information, offering richer textures and structural details. We propose RT-X Net, a cross-attention network that fuses RGB and thermal images for nighttime image enhancement. We leverage self-attention networks for f… ▽ More

    Submitted 30 May, 2025; originally announced May 2025.

    Comments: Accepted at ICIP 2025

  6. arXiv:2505.17931  [pdf, ps, other]

    cs.CV cs.AI cs.LG

    AutoMiSeg: Automatic Medical Image Segmentation via Test-Time Adaptation of Foundation Models

    Authors: Xingjian Li, Qifeng Wu, Colleen Que, Yiran Ding, Adithya S. Ubaradka, Jianhua Xing, Tianyang Wang, Min Xu

    Abstract: Medical image segmentation is vital for clinical diagnosis, yet current deep learning methods often demand extensive expert effort, i.e., either through annotating large training datasets or providing prompts at inference time for each new case. This paper introduces a zero-shot and automatic segmentation pipeline that combines off-the-shelf vision-language and segmentation foundation models. Give… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

  7. arXiv:2505.09738  [pdf, other]

    cs.CL cs.AI

    Achieving Tokenizer Flexibility in Language Models through Heuristic Adaptation and Supertoken Learning

    Authors: Shaurya Sharthak, Vinayak Pahalwan, Adithya Kamath, Adarsh Shirawalmath

    Abstract: Pretrained language models (LLMs) are often constrained by their fixed tokenization schemes, leading to inefficiencies and performance limitations, particularly for multilingual or specialized applications. This tokenizer lock-in presents significant challenges. Standard methods to overcome this often require prohibitive computational resources. Although tokenizer replacement with heuristic initia… ▽ More

    Submitted 14 May, 2025; originally announced May 2025.

  8. arXiv:2505.08182  [pdf]

    cs.IT

    Semantic De-boosting in e-commerce Query Autocomplete

    Authors: Adithya Rajan, Weiqi Tong, Greg Sharp, Prateek Verma, Kevin Li

    Abstract: In e-commerce search, query autocomplete plays a critical role in helping users on their shopping journey. Often, query autocomplete presents users with semantically similar queries, which can impede the user's ability to find diverse and relevant results. This paper proposes a novel strategy to enhance this service by refining the presentation of typeahead suggestions based on their semantic si… ▽ More

    Submitted 12 May, 2025; originally announced May 2025.

  9. arXiv:2505.04651  [pdf, other]

    cs.CL cs.LG

    Scientific Hypothesis Generation and Validation: Methods, Datasets, and Future Directions

    Authors: Adithya Kulkarni, Fatimah Alotaibi, Xinyue Zeng, Longfeng Wu, Tong Zeng, Barry Menglong Yao, Minqian Liu, Shuaicheng Zhang, Lifu Huang, Dawei Zhou

    Abstract: Large Language Models (LLMs) are transforming scientific hypothesis generation and validation by enabling information synthesis, latent relationship discovery, and reasoning augmentation. This survey provides a structured overview of LLM-driven approaches, including symbolic frameworks, generative models, hybrid systems, and multi-agent architectures. We examine techniques such as retrieval-augmen… ▽ More

    Submitted 6 May, 2025; originally announced May 2025.

  10. arXiv:2504.12972  [pdf, other]

    cs.CL

    Estimating Optimal Context Length for Hybrid Retrieval-augmented Multi-document Summarization

    Authors: Adithya Pratapa, Teruko Mitamura

    Abstract: Recent advances in long-context reasoning abilities of language models led to interesting applications in large-scale multi-document summarization. However, prior work has shown that these long-context models are not effective at their claimed context windows. To this end, retrieval-augmented systems provide an efficient and effective alternative. However, their performance can be highly sensitive… ▽ More

    Submitted 17 April, 2025; originally announced April 2025.

  11. arXiv:2504.06996  [pdf, other]

    cs.AR cs.HC cs.LG

    Neural Signal Compression using RAMAN tinyML Accelerator for BCI Applications

    Authors: Adithya Krishna, Sohan Debnath, André van Schaik, Mahesh Mehendale, Chetan Singh Thakur

    Abstract: High-quality, multi-channel neural recording is indispensable for neuroscience research and clinical applications. Large-scale brain recordings often produce vast amounts of data that must be wirelessly transmitted for subsequent offline analysis and decoding, especially in brain-computer interfaces (BCIs) utilizing high-density intracortical recordings with hundreds or thousands of electrodes. Ho… ▽ More

    Submitted 9 April, 2025; originally announced April 2025.

  12. arXiv:2504.06141  [pdf, other]

    cs.LG

    Adversarial Training of Reward Models

    Authors: Alexander Bukharin, Haifeng Qian, Shengyang Sun, Adithya Renduchintala, Soumye Singhal, Zhilin Wang, Oleksii Kuchaiev, Olivier Delalleau, Tuo Zhao

    Abstract: Reward modeling has emerged as a promising approach for the scalable alignment of language models. However, contemporary reward models (RMs) often lack robustness, awarding high rewards to low-quality, out-of-distribution (OOD) samples. This can lead to reward hacking, where policies exploit unintended shortcuts to maximize rewards, undermining alignment. To address this challenge, we introduce Ad… ▽ More

    Submitted 11 April, 2025; v1 submitted 8 April, 2025; originally announced April 2025.

    Comments: 16 pages, 7 figures

  13. arXiv:2504.05747  [pdf, other]

    cs.CL

    SEA-LION: Southeast Asian Languages in One Network

    Authors: Raymond Ng, Thanh Ngan Nguyen, Yuli Huang, Ngee Chia Tai, Wai Yi Leong, Wei Qi Leong, Xianbin Yong, Jian Gang Ngui, Yosephine Susanto, Nicholas Cheng, Hamsawardhini Rengarajan, Peerat Limkonchotiwat, Adithya Venkatadri Hulagadri, Kok Wai Teng, Yeo Yeow Tong, Bryan Siow, Wei Yi Teo, Wayne Lau, Choon Meng Tan, Brandon Ong, Zhi Hao Ong, Jann Railey Montalan, Adwin Chan, Sajeban Antonyrex, Ren Lee, et al. (6 additional authors not shown)

    Abstract: Recently, Large Language Models (LLMs) have dominated much of the artificial intelligence scene with their ability to process and generate natural languages. However, the majority of LLM research and development remains English-centric, leaving low-resource languages such as those in the Southeast Asian (SEA) region under-represented. To address this representation gap, we introduce Llama-SEA-LION… ▽ More

    Submitted 15 April, 2025; v1 submitted 8 April, 2025; originally announced April 2025.

    Comments: We released our model at https://huggingface.co/collections/aisingapore/sea-lionv3-672589a39cdadd6a5b199581

  14. arXiv:2504.03624  [pdf, other]

    cs.CL cs.AI cs.LG

    Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models

    Authors: NVIDIA: Aaron Blakeman, Aarti Basant, Abhinav Khattar, Adithya Renduchintala, Akhiad Bercovich, Aleksander Ficek, Alexis Bjorlin, Ali Taghibakhshi, Amala Sanjay Deshmukh, Ameya Sunil Mahabaleshwarkar, Andrew Tao, Anna Shors, Ashwath Aithal, Ashwin Poojary, Ayush Dattagupta, Balaram Buddharaju, Bobby Chen, Boris Ginsburg, Boxin Wang, Brandon Norick, Brian Butterfield, Bryan Catanzaro, Carlo del Mundo, et al. (176 additional authors not shown)

    Abstract: As inference-time scaling becomes critical for enhanced reasoning capabilities, it is increasingly becoming important to build models that are efficient to infer. We introduce Nemotron-H, a family of 8B and 56B/47B hybrid Mamba-Transformer models designed to reduce inference cost for a given accuracy level. To achieve this goal, we replace the majority of self-attention layers in the common Transf… ▽ More

    Submitted 15 April, 2025; v1 submitted 4 April, 2025; originally announced April 2025.

  15. An Iterative Algorithm to Symbolically Derive Generalized n-Trailer Vehicle Kinematics

    Authors: Yuvraj Singh, Adithya Jayakumar, Giorgio Rizzoni

    Abstract: Articulated multi-axle vehicles are interesting from a control-theoretic perspective due to their peculiar kinematic offtracking characteristics, instability modes, and singularities. Holonomic and nonholonomic constraints affecting the kinematic behavior are investigated in order to develop control-oriented kinematic models representative of these peculiarities. Then, the structure of these constr… ▽ More

    Submitted 31 March, 2025; originally announced April 2025.

  16. arXiv:2503.23088  [pdf, other]

    cs.CL cs.AI

    UNITYAI-GUARD: Pioneering Toxicity Detection Across Low-Resource Indian Languages

    Authors: Himanshu Beniwal, Reddybathuni Venkat, Rohit Kumar, Birudugadda Srivibhav, Daksh Jain, Pavan Doddi, Eshwar Dhande, Adithya Ananth, Kuldeep, Heer Kubadia, Pratham Sharda, Mayank Singh

    Abstract: This work introduces UnityAI-Guard, a framework for binary toxicity classification targeting low-resource Indian languages. While existing systems predominantly cater to high-resource languages, UnityAI-Guard addresses this critical gap by developing state-of-the-art models for identifying toxic content across diverse Brahmic/Indic scripts. Our approach achieves an impressive average F1-score of 8… ▽ More

    Submitted 29 March, 2025; originally announced March 2025.

  17. arXiv:2503.20925  [pdf, other]

    cs.CV cs.AI cs.LG

    Prototype Guided Backdoor Defense

    Authors: Venkat Adithya Amula, Sunayana Samavedam, Saurabh Saini, Avani Gupta, Narayanan P J

    Abstract: Deep learning models are susceptible to backdoor attacks involving malicious attackers perturbing a small subset of training data with a trigger to cause misclassifications. Various triggers have been used, including semantic triggers that are easily realizable without requiring the attacker to manipulate the image. The emergence of generative AI has eased the generation of varied poi… ▽ More

    Submitted 26 March, 2025; originally announced March 2025.

  18. Attacking and Improving the Tor Directory Protocol

    Authors: Zhongtang Luo, Adithya Bhat, Kartik Nayak, Aniket Kate

    Abstract: The Tor network enhances clients' privacy by routing traffic through an overlay network of volunteered intermediate relays. Tor employs a distributed protocol among nine hard-coded Directory Authority (DA) servers to securely disseminate information about these relays to produce a new consensus document every hour. With a straightforward voting mechanism to ensure consistency, the protocol is expe… ▽ More

    Submitted 24 March, 2025; originally announced March 2025.

    Journal ref: 2024 IEEE Symposium on Security and Privacy (SP), San Francisco, CA, USA, 2024, pp. 3221-3237

  19. AI Rivalry as a Craft: How Resisting and Embracing Generative AI Reshape Writing Professions

    Authors: Rama Adithya Varanasi, Batia Mishan Wiesenfeld, Oded Nov

    Abstract: Generative AI (GAI) technologies are disrupting professional writing, challenging traditional practices. Recent studies explore GAI adoption experiences of creative practitioners, but we know little about how these experiences evolve into established practices and how GAI resistance alters these practices. To address this gap, we conducted 25 semi-structured interviews with writing professionals w… ▽ More

    Submitted 12 March, 2025; originally announced March 2025.

  20. arXiv:2503.08930  [pdf, other]

    eess.SP cs.CV cs.RO

    Acoustic Neural 3D Reconstruction Under Pose Drift

    Authors: Tianxiang Lin, Mohamad Qadri, Kevin Zhang, Adithya Pediredla, Christopher A. Metzler, Michael Kaess

    Abstract: We consider the problem of optimizing neural implicit surfaces for 3D reconstruction using acoustic images collected with drifting sensor poses. The accuracy of current state-of-the-art 3D acoustic modeling algorithms is highly dependent on accurate pose estimation; small errors in sensor pose can lead to severe reconstruction artifacts. In this paper, we propose an algorithm that jointly optimize… ▽ More

    Submitted 11 March, 2025; originally announced March 2025.

    Comments: 8 pages, 8 figures. This paper is under review

  21. arXiv:2503.04549  [pdf, other]

    cs.CR

    Lite-PoT: Practical Powers-of-Tau Setup Ceremony

    Authors: Lucien K. L. Ng, Pedro Moreno-Sanchez, Mohsen Minaei, Panagiotis Chatzigiannis, Adithya Bhat, Duc V. Le

    Abstract: Zero-Knowledge Succinct Non-Interactive Argument of Knowledge (zk-SNARK) schemes have gained significant adoption in privacy-preserving applications, decentralized systems (e.g., blockchain), and verifiable computation due to their efficiency. However, the most efficient zk-SNARKs often rely on a one-time trusted setup to generate a public parameter, often known as the "Powers of Tau" (PoT) strin… ▽ More

    Submitted 6 March, 2025; originally announced March 2025.

  22. arXiv:2503.00245  [pdf, other]

    cs.LG cs.CL

    CoSMoEs: Compact Sparse Mixture of Experts

    Authors: Patrick Huber, Akshat Shrivastava, Ernie Chang, Chinnadhurai Sankar, Ahmed Aly, Adithya Sagar

    Abstract: Sparse Mixture of Expert (MoE) models are popular foundational architectures at large scale but remain under-explored at smaller sizes. Here, we show how to enable Compact Sparse Mixture of Experts (CoSMoEs) for on-device inference. Specifically, we tackle the three main on-device dimensions: Quality, Memory and Latency. Along the quality axis, we show that in a fair evaluation (removing confoundin… ▽ More

    Submitted 28 February, 2025; originally announced March 2025.

    Comments: 11 pages, 8 figures

  23. arXiv:2502.14301  [pdf, ps, other]

    cs.CL cs.AI

    SEA-HELM: Southeast Asian Holistic Evaluation of Language Models

    Authors: Yosephine Susanto, Adithya Venkatadri Hulagadri, Jann Railey Montalan, Jian Gang Ngui, Xian Bin Yong, Weiqi Leong, Hamsawardhini Rengarajan, Peerat Limkonchotiwat, Yifan Mai, William Chandra Tjhi

    Abstract: With the rapid emergence of novel capabilities in Large Language Models (LLMs), the need for rigorous multilingual and multicultural benchmarks that are integrated has become more pronounced. Though existing LLM benchmarks are capable of evaluating specific capabilities of LLMs in English as well as in various mid- to low-resource languages, including those in the Southeast Asian (SEA) region, a c… ▽ More

    Submitted 2 June, 2025; v1 submitted 20 February, 2025; originally announced February 2025.

  24. arXiv:2502.08854  [pdf, ps, other]

    cs.HC

    Generative AI & Changing Work: Systematic Review of Practitioner-led Work Transformations through the Lens of Job Crafting

    Authors: Matthew Law, Rama Adithya Varanasi

    Abstract: Widespread integration of Generative AI tools is transforming white-collar work, reshaping how workers define their roles, manage their tasks, and collaborate with peers. This has created a need to develop an overarching understanding of common worker-driven patterns around these transformations. To fill this gap, we conducted a systematic literature review of 23 studies from the ACM Digital Libra… ▽ More

    Submitted 12 February, 2025; originally announced February 2025.

  25. arXiv:2502.07854  [pdf, other]

    cs.LG cs.CV

    Advancing Heat Demand Forecasting with Attention Mechanisms: Opportunities and Challenges

    Authors: Adithya Ramachandran, Thorkil Flensmark B. Neergaard, Andreas Maier, Siming Bayer

    Abstract: Global leaders and policymakers are unified in their unequivocal commitment to decarbonization efforts in support of Net-Zero agreements. District Heating Systems (DHS), while contributing to carbon emissions due to the continued reliance on fossil fuels for heat production, are embracing more sustainable practices albeit with some sense of vulnerability as it could constrain their ability to adap… ▽ More

    Submitted 11 February, 2025; originally announced February 2025.

  26. arXiv:2502.06617  [pdf, other]

    cs.CL

    Scaling Multi-Document Event Summarization: Evaluating Compression vs. Full-Text Approaches

    Authors: Adithya Pratapa, Teruko Mitamura

    Abstract: Automatically summarizing large text collections is a valuable tool for document research, with applications in journalism, academic research, legal work, and many other fields. In this work, we contrast two classes of systems for large-scale multi-document summarization (MDS): compression and full-text. Compression-based methods use a multi-stage pipeline and often lead to lossy summaries. Full-t… ▽ More

    Submitted 10 February, 2025; originally announced February 2025.

    Comments: NAACL 2025 camera-ready version

  27. arXiv:2502.04630  [pdf, other]

    cs.CV cs.GR

    High-Speed Dynamic 3D Imaging with Sensor Fusion Splatting

    Authors: Zihao Zou, Ziyuan Qu, Xi Peng, Vivek Boominathan, Adithya Pediredla, Praneeth Chakravarthula

    Abstract: Capturing and reconstructing high-speed dynamic 3D scenes has numerous applications in computer graphics, vision, and interdisciplinary fields such as robotics, aerodynamics, and evolutionary biology. However, achieving this using a single imaging modality remains challenging. For instance, traditional RGB cameras suffer from low frame rates, limited exposure times, and narrow baselines. To addres… ▽ More

    Submitted 6 February, 2025; originally announced February 2025.

  28. arXiv:2502.00203  [pdf, other]

    cs.LG cs.CL

    Reward-aware Preference Optimization: A Unified Mathematical Framework for Model Alignment

    Authors: Shengyang Sun, Yian Zhang, Alexander Bukharin, David Mosallanezhad, Jiaqi Zeng, Soumye Singhal, Gerald Shen, Adithya Renduchintala, Tugrul Konuk, Yi Dong, Zhilin Wang, Dmitry Chichkov, Olivier Delalleau, Oleksii Kuchaiev

    Abstract: The rapid development of large language model (LLM) alignment algorithms has resulted in a complex and fragmented landscape, with limited clarity on the effectiveness of different methods and their inter-connections. This paper introduces Reward-Aware Preference Optimization (RPO), a mathematical framework that unifies popular preference optimization techniques in LLM alignment, including DPO, IPO… ▽ More

    Submitted 7 February, 2025; v1 submitted 31 January, 2025; originally announced February 2025.

    Comments: 8 pages, 4 figures; update author names

  29. arXiv:2501.16344  [pdf, ps, other]

    eess.AS cs.AI cs.CL cs.SD

    WhiSPA: Semantically and Psychologically Aligned Whisper with Self-Supervised Contrastive and Student-Teacher Learning

    Authors: Rajath Rao, Adithya Ganesan, Oscar Kjell, Jonah Luby, Akshay Raghavan, Scott Feltman, Whitney Ringwald, Ryan L. Boyd, Benjamin Luft, Camilo Ruggero, Neville Ryant, Roman Kotov, H. Andrew Schwartz

    Abstract: Current speech encoding pipelines often rely on an additional text-based LM to get robust representations of human communication, even though SotA speech-to-text models often have an LM within. This work proposes an approach to improve the LM within an audio model such that the subsequent text-LM is unnecessary. We introduce WhiSPA (Whisper with Semantic and Psychological Alignment), which leverage… ▽ More

    Submitted 31 May, 2025; v1 submitted 15 January, 2025; originally announced January 2025.

    Comments: 16 pages, 8 figures, ACL 2025

  30. arXiv:2501.14249  [pdf, other]

    cs.LG cs.AI cs.CL

    Humanity's Last Exam

    Authors: Long Phan, Alice Gatti, Ziwen Han, Nathaniel Li, Josephina Hu, Hugh Zhang, Chen Bo Calvin Zhang, Mohamed Shaaban, John Ling, Sean Shi, Michael Choi, Anish Agrawal, Arnav Chopra, Adam Khoja, Ryan Kim, Richard Ren, Jason Hausenloy, Oliver Zhang, Mantas Mazeika, Dmitry Dodonov, Tung Nguyen, Jaeho Lee, Daron Anderson, Mikhail Doroshenko, Alun Cennyth Stokes, et al. (1084 additional authors not shown)

    Abstract: Benchmarks are important tools for tracking the rapid advancements in large language model (LLM) capabilities. However, benchmarks are not keeping pace in difficulty: LLMs now achieve over 90% accuracy on popular benchmarks like MMLU, limiting informed measurement of state-of-the-art LLM capabilities. In response, we introduce Humanity's Last Exam (HLE), a multi-modal benchmark at the frontier of… ▽ More

    Submitted 19 April, 2025; v1 submitted 24 January, 2025; originally announced January 2025.

    Comments: 29 pages, 6 figures

  31. arXiv:2412.16882  [pdf, other]

    cs.AI cs.CL

    PsychAdapter: Adapting LLM Transformers to Reflect Traits, Personality and Mental Health

    Authors: Huy Vu, Huy Anh Nguyen, Adithya V Ganesan, Swanie Juhng, Oscar N. E. Kjell, Joao Sedoc, Margaret L. Kern, Ryan L. Boyd, Lyle Ungar, H. Andrew Schwartz, Johannes C. Eichstaedt

    Abstract: Artificial intelligence-based language generators are now a part of most people's lives. However, by default, they tend to generate "average" language without reflecting the ways in which people differ. Here, we propose a lightweight modification to the standard language model transformer architecture - "PsychAdapter" - that uses empirically derived trait-language patterns to generate natural lang… ▽ More

    Submitted 31 December, 2024; v1 submitted 22 December, 2024; originally announced December 2024.

  32. arXiv:2412.16270  [pdf, other]

    cs.AI cs.HC

    MetaScientist: A Human-AI Synergistic Framework for Automated Mechanical Metamaterial Design

    Authors: Jingyuan Qi, Zian Jia, Minqian Liu, Wangzhi Zhan, Junkai Zhang, Xiaofei Wen, Jingru Gan, Jianpeng Chen, Qin Liu, Mingyu Derek Ma, Bangzheng Li, Haohui Wang, Adithya Kulkarni, Muhao Chen, Dawei Zhou, Ling Li, Wei Wang, Lifu Huang

    Abstract: The discovery of novel mechanical metamaterials, whose properties are dominated by their engineered structures rather than chemical composition, is a knowledge-intensive and resource-demanding process. To accelerate the design of novel metamaterials, we present MetaScientist, a human-in-the-loop system that integrates advanced AI capabilities with expert oversight with two primary phases: (1) hypo… ▽ More

    Submitted 20 December, 2024; originally announced December 2024.

  33. arXiv:2412.06191  [pdf, other]

    cs.CV

    Event fields: Capturing light fields at high speed, resolution, and dynamic range

    Authors: Ziyuan Qu, Zihao Zou, Vivek Boominathan, Praneeth Chakravarthula, Adithya Pediredla

    Abstract: Event cameras, which feature pixels that independently respond to changes in brightness, are becoming increasingly popular in high-speed applications due to their lower latency, reduced bandwidth requirements, and enhanced dynamic range compared to traditional frame-based cameras. Numerous imaging and vision techniques have leveraged event cameras for high-speed scene understanding by capturing hi… ▽ More

    Submitted 8 December, 2024; originally announced December 2024.

  34. arXiv:2411.18597  [pdf, other]

    cs.CV

    Structured light with a million light planes per second

    Authors: Dhawal Sirikonda, Praneeth Chakravarthula, Ioannis Gkioulekas, Adithya Pediredla

    Abstract: We introduce a structured light system that captures full-frame depth at rates of a thousand frames per second, four times faster than the previous state of the art. Our key innovation to this end is the design of an acousto-optic light scanning device that can scan light planes at rates up to two million planes per second. We combine this device with an event camera for structured light, using th… ▽ More

    Submitted 27 November, 2024; originally announced November 2024.

  35. arXiv:2411.15592  [pdf, other]

    eess.IV cs.CV

    Classifier Enhanced Deep Learning Model for Erythroblast Differentiation with Limited Data

    Authors: Buddhadev Goswami, Adithya B. Somaraj, Prantar Chakrabarti, Ravindra Gudi, Nirmal Punjabi

    Abstract: Hematological disorders, which involve a variety of malignant conditions and genetic diseases affecting blood formation, present significant diagnostic challenges. One such major challenge in clinical settings is differentiating Erythroblast from WBCs. Our approach evaluates the efficacy of various machine learning (ML) classifiers: SVM, XG-Boost, KNN, and Random Forest… ▽ More

    Submitted 25 November, 2024; v1 submitted 23 November, 2024; originally announced November 2024.

    Comments: 14 pages, Accepted for the 27th International Conference on Pattern Recognition (ICPR 2024)

  36. arXiv:2411.13800  [pdf, other]

    cs.CL

    Explaining GPT-4's Schema of Depression Using Machine Behavior Analysis

    Authors: Adithya V Ganesan, Vasudha Varadarajan, Yash Kumar Lal, Veerle C. Eijsbroek, Katarina Kjell, Oscar N. E. Kjell, Tanuja Dhanasekaran, Elizabeth C. Stade, Johannes C. Eichstaedt, Ryan L. Boyd, H. Andrew Schwartz, Lucie Flek

    Abstract: Use of large language models such as ChatGPT (GPT-4) for mental health support has grown rapidly, emerging as a promising route to assess and help people with mood disorders, like depression. However, we have a limited understanding of GPT-4's schema of mental disorders, that is, how it internally associates and interprets symptoms. In this work, we leveraged contemporary measurement theory to dec… ▽ More

    Submitted 20 November, 2024; originally announced November 2024.

    Comments: 21 pages, 3 tables, 6 figures, 1 supplementary table, 83 references

  37. arXiv:2411.12619  [pdf]

    cs.HC cs.CL

    Leveraging Virtual Reality and AI Tutoring for Language Learning: A Case Study of a Virtual Campus Environment with OpenAI GPT Integration with Unity 3D

    Authors: Adithya TG, Abhinavaram N, Gowri Srinivasa

    Abstract: This paper presents a new approach to multiple language learning, with Hindi as the language to be learnt in our case, by integrating virtual reality environments and AI-enabled tutoring systems using OpenAI's GPT API calls. We have developed a scenario with a virtual campus environment, built in Unity, which focuses on a detailed representation of our university building's 11th floor, wh… ▽ More

    Submitted 19 November, 2024; originally announced November 2024.

    Comments: 5 pages, 2 tables, 8 figures

  38. arXiv:2411.07175  [pdf, other]

    cs.CL

    Continual Memorization of Factoids in Language Models

    Authors: Howard Chen, Jiayi Geng, Adithya Bhaskar, Dan Friedman, Danqi Chen

    Abstract: As new knowledge rapidly accumulates, language models (LMs) with pretrained knowledge quickly become obsolete. A common approach to updating LMs is fine-tuning them directly on new knowledge. However, recent studies have shown that fine-tuning for memorization may be ineffective in storing knowledge or may exacerbate hallucinations. In this work, we introduce a setting we call continual memorizati… ▽ More

    Submitted 27 February, 2025; v1 submitted 11 November, 2024; originally announced November 2024.

  39. arXiv:2411.04299  [pdf, other]

    cs.SE

    An Empirical Study on Automatically Detecting AI-Generated Source Code: How Far Are We?

    Authors: Hyunjae Suh, Mahan Tafreshipour, Jiawei Li, Adithya Bhattiprolu, Iftekhar Ahmed

    Abstract: Artificial Intelligence (AI) techniques, especially Large Language Models (LLMs), have started gaining popularity among researchers and software developers for generating source code. However, LLMs have been shown to generate code with quality issues and also incurred copyright/licensing infringements. Therefore, detecting whether a piece of source code is written by humans or AI has become necess… ▽ More

    Submitted 6 November, 2024; originally announced November 2024.

    Comments: Accepted at The 47th IEEE/ACM International Conference on Software Engineering (ICSE 2025)

  40. arXiv:2410.21680  [pdf, other]

    cs.DC cs.LG

    Revisiting Reliability in Large-Scale Machine Learning Research Clusters

    Authors: Apostolos Kokolis, Michael Kuchnik, John Hoffman, Adithya Kumar, Parth Malani, Faye Ma, Zachary DeVito, Shubho Sengupta, Kalyan Saladi, Carole-Jean Wu

    Abstract: Reliability is a fundamental challenge in operating large-scale machine learning (ML) infrastructures, particularly as the scale of ML models and training clusters continues to grow. Despite decades of research on infrastructure failures, the impact of job failures across different scales remains unclear. This paper presents a view of managing two large, multi-tenant ML clusters, providing quantit… ▽ More

    Submitted 6 February, 2025; v1 submitted 28 October, 2024; originally announced October 2024.

  41. Enhancing CNN Classification with Lamarckian Memetic Algorithms and Local Search

    Authors: Akhilbaran Ghosh, Rama Sai Adithya Kalidindi

    Abstract: Optimization is critical for optimal performance in deep neural networks (DNNs). Traditional gradient-based methods often face challenges like local minima entrapment. This paper explores population-based metaheuristic optimization algorithms for image classification networks. We propose a novel approach integrating a two-stage training technique with population-based optimization algorithms incor…

    Submitted 30 October, 2024; v1 submitted 26 October, 2024; originally announced October 2024.

    Comments: Accepted in IEEE SPARC 2024

    Journal ref: 2024 IEEE International Conference on Signal Processing and Advance Research in Computing (SPARC)

  42. arXiv:2410.20199  [pdf, other]

    cs.AI

    Rethinking the Uncertainty: A Critical Review and Analysis in the Era of Large Language Models

    Authors: Mohammad Beigi, Sijia Wang, Ying Shen, Zihao Lin, Adithya Kulkarni, Jianfeng He, Feng Chen, Ming Jin, Jin-Hee Cho, Dawei Zhou, Chang-Tien Lu, Lifu Huang

    Abstract: In recent years, Large Language Models (LLMs) have become fundamental to a broad spectrum of artificial intelligence applications. As the use of LLMs expands, precisely estimating the uncertainty in their predictions has become crucial. Current methods often struggle to accurately identify, measure, and address the true uncertainty, with many focusing primarily on estimating model confidence. This…

    Submitted 26 October, 2024; originally announced October 2024.

  43. arXiv:2410.08847  [pdf, other]

    cs.LG cs.AI cs.CL stat.ML

    Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization

    Authors: Noam Razin, Sadhika Malladi, Adithya Bhaskar, Danqi Chen, Sanjeev Arora, Boris Hanin

    Abstract: Direct Preference Optimization (DPO) and its variants are increasingly used for aligning language models with human preferences. Although these methods are designed to teach a model to generate preferred responses more frequently relative to dispreferred responses, prior work has observed that the likelihood of preferred responses often decreases during training. The current work sheds light on th…

    Submitted 27 April, 2025; v1 submitted 11 October, 2024; originally announced October 2024.

    Comments: Accepted to ICLR 2025; Code available at https://github.com/princeton-nlp/unintentional-unalignment

  44. arXiv:2410.03555  [pdf, other]

    cs.RO cs.CV

    Enhancing Autonomous Navigation by Imaging Hidden Objects using Single-Photon LiDAR

    Authors: Aaron Young, Nevindu M. Batagoda, Harry Zhang, Akshat Dave, Adithya Pediredla, Dan Negrut, Ramesh Raskar

    Abstract: Robust autonomous navigation in environments with limited visibility remains a critical challenge in robotics. We present a novel approach that leverages Non-Line-of-Sight (NLOS) sensing using single-photon LiDAR to improve visibility and enhance autonomous navigation. Our method enables mobile robots to "see around corners" by utilizing multi-bounce light information, effectively expanding their…

    Submitted 11 March, 2025; v1 submitted 4 October, 2024; originally announced October 2024.

    Comments: Project webpage: https://camera-culture.github.io/nlos-aided-autonomous-navigation

  45. arXiv:2409.19771  [pdf, other]

    cs.RO

    Learning Wheelchair Tennis Navigation from Broadcast Videos with Domain Knowledge Transfer and Diffusion Motion Planning

    Authors: Zixuan Wu, Zulfiqar Zaidi, Adithya Patil, Qingyu Xiao, Matthew Gombolay

    Abstract: In this paper, we propose a novel and generalizable zero-shot knowledge transfer framework that distills expert sports navigation strategies from web videos into robotic systems with adversarial constraints and out-of-distribution image trajectories. Our pipeline enables diffusion-based imitation learning by reconstructing the full 3D task space from multiple partial views, warping it into 2D imag…

    Submitted 8 May, 2025; v1 submitted 29 September, 2024; originally announced September 2024.

    Comments: This manuscript has been accepted by 2025 IEEE International Conference on Robotics & Automation (ICRA)

  46. arXiv:2409.09550  [pdf, other]

    cs.MA cs.RO

    Swarm Algorithms for Dynamic Task Allocation in Unknown Environments

    Authors: Adithya Balachandran, Noble Harasha, Nancy Lynch

    Abstract: Robot swarms, systems of many robots that operate in a distributed fashion, have many applications in areas such as search-and-rescue, natural disaster response, and self-assembly. Several of these applications can be abstracted to the general problem of task allocation in an environment, in which robots must assign themselves to and complete tasks. While several algorithms for task allocation hav…

    Submitted 14 September, 2024; originally announced September 2024.

    Comments: 14 pages, 10 figures

  47. arXiv:2409.00795  [pdf, other]

    q-bio.PE cs.LG math.DS

    Analysis of a mathematical model for malaria using data-driven approach

    Authors: Adithya Rajnarayanan, Manoj Kumar, Abdessamad Tridane

    Abstract: Malaria is one of the deadliest diseases in the world, every year millions of people become victims of this disease and many even lose their lives. Medical professionals and the government could take accurate measures to protect the people only when the disease dynamics are understood clearly. In this work, we propose a compartmental model to study the dynamics of malaria. We consider the transmis…

    Submitted 31 December, 2024; v1 submitted 1 September, 2024; originally announced September 2024.

  48. arXiv:2408.11219  [pdf, other]

    cs.CL cs.AI

    CoDi: Conversational Distillation for Grounded Question Answering

    Authors: Patrick Huber, Arash Einolghozati, Rylan Conway, Kanika Narang, Matt Smith, Waqar Nayyar, Adithya Sagar, Ahmed Aly, Akshat Shrivastava

    Abstract: Distilling conversational skills into Small Language Models (SLMs) with approximately 1 billion parameters presents significant challenges. Firstly, SLMs have limited capacity in their model parameters to learn extensive knowledge compared to larger models. Secondly, high-quality conversational datasets are often scarce, small, and domain-specific. Addressing these challenges, we introduce a novel…

    Submitted 20 August, 2024; originally announced August 2024.

    Comments: 13 pages

  49. arXiv:2408.07759  [pdf, other]

    cs.IR

    SWaT: Statistical Modeling of Video Watch Time through User Behavior Analysis

    Authors: Shentao Yang, Haichuan Yang, Linna Du, Adithya Ganesh, Bo Peng, Boying Liu, Serena Li, Ji Liu

    Abstract: The significance of estimating video watch time has been highlighted by the rising importance of (short) video recommendation, which has become a core product of mainstream social media platforms. Modeling video watch time, however, has been challenged by the complexity of user-video interaction, such as different user behavior modes in watching the recommended videos and varying watching probabil…

    Submitted 23 February, 2025; v1 submitted 14 August, 2024; originally announced August 2024.

    Comments: Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.1 (KDD '25)

  50. arXiv:2408.02802  [pdf, other]

    cs.LG

    Deciphering Air Travel Disruptions: A Machine Learning Approach

    Authors: Aravinda Jatavallabha, Jacob Gerlach, Aadithya Naresh

    Abstract: This research investigates flight delay trends by examining factors such as departure time, airline, and airport. It employs regression machine learning methods to predict the contributions of various sources to delays. Time-series models, including LSTM, Hybrid LSTM, and Bi-LSTM, are compared with baseline regression models such as Multiple Regression, Decision Tree Regression, Random Forest Regr…

    Submitted 5 August, 2024; originally announced August 2024.

    Comments: 10 pages, 11 figures, 6 tables