Skip to main content

Showing 1–50 of 486 results for author: Hari

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.16412  [pdf, ps, other

    cs.SI cs.CL cs.CY

    Unpacking Generative AI in Education: Computational Modeling of Teacher and Student Perspectives in Social Media Discourse

    Authors: Paulina DeVito, Akhil Vallala, Sean Mcmahon, Yaroslav Hinda, Benjamin Thaw, Hanqi Zhuang, Hari Kalva

    Abstract: Generative AI (GAI) technologies are quickly reshaping the educational landscape. As adoption accelerates, understanding how students and educators perceive these tools is essential. This study presents one of the most comprehensive analyses to date of stakeholder discourse dynamics on GAI in education using social media data. Our dataset includes 1,199 Reddit posts and 13,959 corresponding top-le… ▽ More

    Submitted 19 June, 2025; originally announced June 2025.

    Comments: This work has been submitted to IEEE Transactions on Computational Social Systems for possible publication

  2. arXiv:2506.12332  [pdf, ps, other

    cs.HC

    TermSight: Making Service Contracts Approachable

    Authors: Ziheng Huang, Tal August, Hari Sundaram

    Abstract: Terms of Service (ToS) are ubiquitous, legally binding contracts that govern consumers' digital interactions. However, ToS are not designed to be read: they are filled with pages of ambiguous and complex legal terminology that burden potential users. We introduce TermSight, an intelligent reading interface designed to make ToS more approachable. TermSight offers visual summaries that highlight the… ▽ More

    Submitted 13 June, 2025; originally announced June 2025.

  3. arXiv:2506.10968  [pdf, ps, other

    cs.RO cs.CV

    Eye, Robot: Learning to Look to Act with a BC-RL Perception-Action Loop

    Authors: Justin Kerr, Kush Hari, Ethan Weber, Chung Min Kim, Brent Yi, Tyler Bonnen, Ken Goldberg, Angjoo Kanazawa

    Abstract: Humans do not passively observe the visual world -- we actively look in order to act. Motivated by this principle, we introduce EyeRobot, a robotic system with gaze behavior that emerges from the need to complete real-world tasks. We develop a mechanical eyeball that can freely rotate to observe its surroundings and train a gaze policy to control it using reinforcement learning. We accomplish this… ▽ More

    Submitted 12 June, 2025; originally announced June 2025.

    Comments: Project page: https://www.eyerobot.net/

  4. arXiv:2506.04434  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Grokking and Generalization Collapse: Insights from \texttt{HTSR} theory

    Authors: Hari K. Prakash, Charles H. Martin

    Abstract: We study the well-known grokking phenomena in neural networks (NNs) using a 3-layer MLP trained on 1 k-sample subset of MNIST, with and without weight decay, and discover a novel third phase -- \emph{anti-grokking} -- that occurs very late in training and resembles but is distinct from the familiar \emph{pre-grokking} phases: test accuracy collapses while training accuracy stays perfect. This late… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: 15 pages,7 figs

  5. arXiv:2505.20630  [pdf, ps, other

    cs.SE cs.CL

    SV-TrustEval-C: Evaluating Structure and Semantic Reasoning in Large Language Models for Source Code Vulnerability Analysis

    Authors: Yansong Li, Paula Branco, Alexander M. Hoole, Manish Marwah, Hari Manassery Koduvely, Guy-Vincent Jourdan, Stephan Jou

    Abstract: As Large Language Models (LLMs) evolve in understanding and generating code, accurately evaluating their reliability in analyzing source code vulnerabilities becomes increasingly vital. While studies have examined LLM capabilities in tasks like vulnerability detection and repair, they often overlook the importance of both structure and semantic reasoning crucial for trustworthy vulnerability analy… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

    Journal ref: 2025 IEEE Symposium on Security and Privacy (SP), 2025, pp. 2791-2809

  6. arXiv:2505.19445  [pdf, other

    cs.LG

    MetaGMT: Improving Actionable Interpretability of Graph Multilinear Networks via Meta-Learning Filtration

    Authors: Rishabh Bhattacharya, Hari Shankar, Vaishnavi Shivkumar, Ponnurangam Kumaraguru

    Abstract: The growing adoption of Graph Neural Networks (GNNs) in high-stakes domains like healthcare and finance demands reliable explanations of their decision-making processes. While inherently interpretable GNN architectures like Graph Multi-linear Networks (GMT) have emerged, they remain vulnerable to generating explanations based on spurious correlations, potentially undermining trust in critical appl… ▽ More

    Submitted 25 May, 2025; originally announced May 2025.

    Comments: 8 Pages Main Content, 10 Pages including Appendix. 1 Figure, 7 Tables

  7. arXiv:2505.15558  [pdf, ps, other

    cs.RO cs.AI cs.DB cs.LG

    Robo-DM: Data Management For Large Robot Datasets

    Authors: Kaiyuan Chen, Letian Fu, David Huang, Yanxiang Zhang, Lawrence Yunliang Chen, Huang Huang, Kush Hari, Ashwin Balakrishna, Ted Xiao, Pannag R Sanketi, John Kubiatowicz, Ken Goldberg

    Abstract: Recent results suggest that very large datasets of teleoperated robot demonstrations can be used to train transformer-based models that have the potential to generalize to new scenes, robots, and tasks. However, curating, distributing, and loading large datasets of robot trajectories, which typically consist of video, textual, and numerical modalities - including streams from multiple cameras - re… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

    Comments: Best paper finalist of IEEE ICRA 2025

  8. arXiv:2505.14900  [pdf

    cs.DB

    Implementing Decentralized Per-Partition Automatic Failover in Azure Cosmos DB

    Authors: Josh Rowe, Mikael Horal, Hari Sudan Sundar, Muthukumaran Arumugam, Burak Kose, Sravani Mitra Palivela, Geni Marsh, Varun Jain, Abhishek Kumar, Dhaval Patel

    Abstract: Azure Cosmos DB is a cloud-native distributed database, operating at a massive scale, powering Microsoft Cloud. Think 10s of millions of database partitions (replica-sets), 100+ PBs of data under management, 20M+ vCores. Failovers are an integral part of distributed databases to provide data availability during outages (partial or full regional outages). While failovers within a replica-set within… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

    ACM Class: H.2.4; H.2.7

  9. arXiv:2505.14536  [pdf, ps, other

    cs.CL

    Breaking Bad Tokens: Detoxification of LLMs Using Sparse Autoencoders

    Authors: Agam Goyal, Vedant Rathi, William Yeh, Yian Wang, Yuen Chen, Hari Sundaram

    Abstract: Large language models (LLMs) are now ubiquitous in user-facing applications, yet they still generate undesirable toxic outputs, including profanity, vulgarity, and derogatory remarks. Although numerous detoxification methods exist, most apply broad, surface-level fixes and can therefore easily be circumvented by jailbreak attacks. In this paper we leverage sparse autoencoders (SAEs) to identify to… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

    Comments: Preprint: 19 pages, 7 figures, 1 table

  10. arXiv:2505.13287  [pdf, ps, other

    cs.AI quant-ph

    Level Generation with Quantum Reservoir Computing

    Authors: João S. Ferreira, Pierre Fromholz, Hari Shaji, James R. Wootton

    Abstract: Reservoir computing is a form of machine learning particularly suited for time series analysis, including forecasting predictions. We take an implementation of \emph{quantum} reservoir computing that was initially designed to generate variants of musical scores and adapt it to create levels of Super Mario Bros. Motivated by our analysis of these levels, we develop a new Roblox \textit{obby} where… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

  11. arXiv:2505.12072  [pdf, ps, other

    cs.RO

    L2D2: Robot Learning from 2D Drawings

    Authors: Shaunak A. Mehta, Heramb Nemlekar, Hari Sumant, Dylan P. Losey

    Abstract: Robots should learn new tasks from humans. But how do humans convey what they want the robot to do? Existing methods largely rely on humans physically guiding the robot arm throughout their intended task. Unfortunately -- as we scale up the amount of data -- physical guidance becomes prohibitively burdensome. Not only do humans need to operate robot hardware but also modify the environment (e.g.,… ▽ More

    Submitted 17 May, 2025; originally announced May 2025.

  12. arXiv:2505.05885  [pdf, ps, other

    cs.DB cs.IR

    Cost-Effective, Low Latency Vector Search with Azure Cosmos DB

    Authors: Nitish Upreti, Krishnan Sundaram, Hari Sudan Sundar, Samer Boshra, Balachandar Perumalswamy, Shivam Atri, Martin Chisholm, Revti Raman Singh, Greg Yang, Subramanyam Pattipaka, Tamara Hass, Nitesh Dudhey, James Codella, Mark Hildebrand, Magdalen Manohar, Jack Moffitt, Haiyang Xu, Naren Datha, Suryansh Gupta, Ravishankar Krishnaswamy, Prashant Gupta, Abhishek Sahu, Ritika Mor, Santosh Kulkarni, Hemeswari Varada , et al. (11 additional authors not shown)

    Abstract: Vector indexing enables semantic search over diverse corpora and has become an important interface to databases for both users and AI agents. Efficient vector search requires deep optimizations in database systems. This has motivated a new class of specialized vector databases that optimize for vector search quality and cost. Instead, we argue that a scalable, high-performance, and cost-efficient… ▽ More

    Submitted 9 May, 2025; originally announced May 2025.

    ACM Class: H.3.3

  13. arXiv:2505.01064  [pdf, ps, other

    cs.CV cs.LG

    Efficient Vocabulary-Free Fine-Grained Visual Recognition in the Age of Multimodal LLMs

    Authors: Hari Chandana Kuchibhotla, Sai Srinivas Kancheti, Abbavaram Gowtham Reddy, Vineeth N Balasubramanian

    Abstract: Fine-grained Visual Recognition (FGVR) involves distinguishing between visually similar categories, which is inherently challenging due to subtle inter-class differences and the need for large, expert-annotated datasets. In domains like medical imaging, such curated datasets are unavailable due to issues like privacy concerns and high annotation costs. In such scenarios lacking labeled data, an FG… ▽ More

    Submitted 2 May, 2025; originally announced May 2025.

    Comments: preprint; earlier version accepted at NeurIPS 2024 Workshop on Adaptive Foundation Models

  14. arXiv:2505.00195  [pdf, other

    cs.CY cs.GT cs.LG

    Algorithmic Collective Action with Two Collectives

    Authors: Aditya Karan, Nicholas Vincent, Karrie Karahalios, Hari Sundaram

    Abstract: Given that data-dependent algorithmic systems have become impactful in more domains of life, the need for individuals to promote their own interests and hold algorithms accountable has grown. To have meaningful influence, individuals must band together to engage in collective action. Groups that engage in such algorithmic collective action are likely to vary in size, membership characteristics, an… ▽ More

    Submitted 30 April, 2025; originally announced May 2025.

  15. arXiv:2504.19674  [pdf, other

    cs.CR cs.AI

    $\texttt{SAGE}$: A Generic Framework for LLM Safety Evaluation

    Authors: Madhur Jindal, Hari Shrawgi, Parag Agrawal, Sandipan Dandapat

    Abstract: Safety evaluation of Large Language Models (LLMs) has made progress and attracted academic interest, but it remains challenging to keep pace with the rapid integration of LLMs across diverse applications. Different applications expose users to various harms, necessitating application-specific safety evaluations with tailored harms and policies. Another major gap is the lack of focus on the dynamic… ▽ More

    Submitted 28 April, 2025; originally announced April 2025.

    Comments: 24 pages, 9 main pages excluding references and appendix

  16. arXiv:2504.16047  [pdf

    cs.CV cs.AI

    Evaluating Vision Language Models (VLMs) for Radiology: A Comprehensive Analysis

    Authors: Frank Li, Hari Trivedi, Bardia Khosravi, Theo Dapamede, Mohammadreza Chavoshi, Abdulhameed Dere, Rohan Satya Isaac, Aawez Mansuri, Janice Newsome, Saptarshi Purkayastha, Judy Gichoya

    Abstract: Foundation models, trained on vast amounts of data using self-supervised techniques, have emerged as a promising frontier for advancing artificial intelligence (AI) applications in medicine. This study evaluates three different vision-language foundation models (RAD-DINO, CheXagent, and BiomedCLIP) on their ability to capture fine-grained imaging features for radiology tasks. The models were asses… ▽ More

    Submitted 22 April, 2025; originally announced April 2025.

  17. arXiv:2504.14857  [pdf, other

    cs.RO

    SuFIA-BC: Generating High Quality Demonstration Data for Visuomotor Policy Learning in Surgical Subtasks

    Authors: Masoud Moghani, Nigel Nelson, Mohamed Ghanem, Andres Diaz-Pinto, Kush Hari, Mahdi Azizian, Ken Goldberg, Sean Huver, Animesh Garg

    Abstract: Behavior cloning facilitates the learning of dexterous manipulation skills, yet the complexity of surgical environments, the difficulty and expense of obtaining patient data, and robot calibration errors present unique challenges for surgical robot learning. We provide an enhanced surgical digital twin with photorealistic human anatomical organs, integrated into a comprehensive simulator designed… ▽ More

    Submitted 21 April, 2025; originally announced April 2025.

  18. arXiv:2504.14157  [pdf, other

    physics.optics cs.LG

    DeepPD: Joint Phase and Object Estimation from Phase Diversity with Neural Calibration of a Deformable Mirror

    Authors: Magdalena C. Schneider, Courtney Johnson, Cedric Allier, Larissa Heinrich, Diane Adjavon, Joren Husic, Patrick La Rivière, Stephan Saalfeld, Hari Shroff

    Abstract: Sample-induced aberrations and optical imperfections limit the resolution of fluorescence microscopy. Phase diversity is a powerful technique that leverages complementary phase information in sequentially acquired images with deliberately introduced aberrations--the phase diversities--to enable phase and object reconstruction and restore diffraction-limited resolution. These phase diversities are… ▽ More

    Submitted 18 April, 2025; originally announced April 2025.

  19. arXiv:2504.11302  [pdf, ps, other

    math.CA cs.LG math.MG

    Limits of Discrete Energy of Families of Increasing Sets

    Authors: Hari Nathan

    Abstract: The Hausdorff dimension of a set can be detected using the Riesz energy. Here, we consider situations where a sequence of points, $\{x_n\}$, ``fills in'' a set $E \subset \mathbb{R}^d$ in an appropriate sense and investigate the degree to which the discrete analog to the Riesz energy of these sets can be used to bound the Hausdorff dimension of $E$. We also discuss applications to data science and… ▽ More

    Submitted 15 April, 2025; originally announced April 2025.

  20. arXiv:2504.06406  [pdf, other

    cs.NI

    Scalable Routing in a City-Scale Wi-Fi Network for Disaster Recovery

    Authors: Ziqian Liu, Om Chabra, James Lynch, Chenning Li, Manya Ghobadi, Hari Balakrishnan

    Abstract: In this paper, we present a new city-scale decentralized mesh network system suited for disaster recovery and emergencies. When wide-area connectivity is unavailable or significantly degraded, our system, MapMesh, enables static access points and mobile devices equipped with Wi-Fi in a city to route packets via each other for intra-city connectivity and to/from any nodes that might have Internet a… ▽ More

    Submitted 8 April, 2025; originally announced April 2025.

  21. arXiv:2504.06293  [pdf, other

    q-fin.RM cs.LG

    Generative AI Enhanced Financial Risk Management Information Retrieval

    Authors: Amin Haeri, Jonathan Vitrano, Mahdi Ghelichi

    Abstract: Risk management in finance involves recognizing, evaluating, and addressing financial risks to maintain stability and ensure regulatory compliance. Extracting relevant insights from extensive regulatory documents is a complex challenge requiring advanced retrieval and language models. This paper introduces RiskData, a dataset specifically curated for finetuning embedding models in risk management,… ▽ More

    Submitted 9 April, 2025; v1 submitted 4 April, 2025; originally announced April 2025.

    Comments: 10 pages, 3 figures, 2 tables, 1 equation

  22. arXiv:2504.05636  [pdf, other

    eess.IV cs.CV cs.LG

    A Multi-Modal AI System for Screening Mammography: Integrating 2D and 3D Imaging to Improve Breast Cancer Detection in a Prospective Clinical Study

    Authors: Jungkyu Park, Jan Witowski, Yanqi Xu, Hari Trivedi, Judy Gichoya, Beatrice Brown-Mulry, Malte Westerhoff, Linda Moy, Laura Heacock, Alana Lewin, Krzysztof J. Geras

    Abstract: Although digital breast tomosynthesis (DBT) improves diagnostic performance over full-field digital mammography (FFDM), false-positive recalls remain a concern in breast cancer screening. We developed a multi-modal artificial intelligence system integrating FFDM, synthetic mammography, and DBT to provide breast-level predictions and bounding-box localizations of suspicious findings. Our AI system,… ▽ More

    Submitted 11 April, 2025; v1 submitted 7 April, 2025; originally announced April 2025.

  23. arXiv:2504.04635  [pdf, other

    cs.CL

    Steering off Course: Reliability Challenges in Steering Language Models

    Authors: Patrick Queiroz Da Silva, Hari Sethuraman, Dheeraj Rajagopal, Hannaneh Hajishirzi, Sachin Kumar

    Abstract: Steering methods for language models (LMs) have gained traction as lightweight alternatives to fine-tuning, enabling targeted modifications to model activations. However, prior studies primarily report results on a few models, leaving critical gaps in understanding the robustness of these methods. In this work, we systematically examine three prominent steering methods -- DoLa, function vectors, a… ▽ More

    Submitted 6 April, 2025; originally announced April 2025.

  24. arXiv:2504.01694  [pdf, other

    quant-ph cs.ET

    Iterative Interpolation Schedules for Quantum Approximate Optimization Algorithm

    Authors: Anuj Apte, Shree Hari Sureshbabu, Ruslan Shaydulin, Sami Boulebnane, Zichang He, Dylan Herman, James Sud, Marco Pistoia

    Abstract: Quantum Approximate Optimization Algorithm (QAOA) is a promising quantum optimization heuristic with empirical evidence of speedup over classical state-of-the-art for some problems. QAOA solves optimization problems using a parameterized circuit with $p$ layers, with higher $p$ leading to better solutions. Existing methods require optimizing $2p$ independent parameters which is challenging for lar… ▽ More

    Submitted 2 April, 2025; originally announced April 2025.

    Comments: 11 pages, 7 figures

  25. Large-scale Evaluation of Notebook Checkpointing with AI Agents

    Authors: Hanxi Fang, Supawit Chockchowwat, Hari Sundaram, Yongjoo Park

    Abstract: Saving, or checkpointing, intermediate results during interactive data exploration can potentially boost user productivity. However, existing studies on this topic are limited, as they primarily rely on small-scale experiments with human participants - a fundamental constraint of human subject studies. To address this limitation, we employ AI agents to simulate a large number of complex data explo… ▽ More

    Submitted 2 April, 2025; originally announced April 2025.

    Comments: Extended Abstracts of the CHI Conference on Human Factors in Computing Systems, 2025, Yokohama, Japan

  26. Enhancing Computational Notebooks with Code+Data Space Versioning

    Authors: Hanxi Fang, Supawit Chockchowwat, Hari Sundaram, Yongjoo Park

    Abstract: There is a gap between how people explore data and how Jupyter-like computational notebooks are designed. People explore data nonlinearly, using execution undos, branching, and/or complete reverts, whereas notebooks are designed for sequential exploration. Recent works like ForkIt are still insufficient to support these multiple modes of nonlinear exploration in a unified way. In this work, we add… ▽ More

    Submitted 2 April, 2025; originally announced April 2025.

    Comments: 17 pages, CHI 2025, Yokohama, Japan

  27. arXiv:2503.23258  [pdf, other

    cs.SD cs.LG eess.AS eess.SP

    Joint Source-Environment Adaptation of Data-Driven Underwater Acoustic Source Ranging Based on Model Uncertainty

    Authors: Dariush Kari, Hari Vishnu, Andrew C. Singer

    Abstract: Adapting pre-trained deep learning models to new and unknown environments is a difficult challenge in underwater acoustic localization. We show that although pre-trained models have performance that suffers from mismatch between the training and test data, they generally exhibit a higher ``implied uncertainty'' in environments where there is more mismatch. Leveraging this notion of implied uncerta… ▽ More

    Submitted 29 March, 2025; originally announced March 2025.

  28. arXiv:2503.21588  [pdf, other

    cs.LG physics.ao-ph

    Generalizable Implicit Neural Representations via Parameterized Latent Dynamics for Baroclinic Ocean Forecasting

    Authors: Guang Zhao, Xihaier Luo, Seungjun Lee, Yihui Ren, Shinjae Yoo, Luke Van Roekel, Balu Nadiga, Sri Hari Krishna Narayanan, Yixuan Sun, Wei Xu

    Abstract: Mesoscale ocean dynamics play a critical role in climate systems, governing heat transport, hurricane genesis, and drought patterns. However, simulating these processes at high resolution remains computationally prohibitive due to their nonlinear, multiscale nature and vast spatiotemporal domains. Implicit neural representations (INRs) reduce the computational costs as resolution-independent surro… ▽ More

    Submitted 27 March, 2025; originally announced March 2025.

  29. arXiv:2503.18403  [pdf, other

    cs.CV cs.AI cs.LG

    Knowledge Graph Enhanced Generative Multi-modal Models for Class-Incremental Learning

    Authors: Xusheng Cao, Haori Lu, Linlan Huang, Fei Yang, Xialei Liu, Ming-Ming Cheng

    Abstract: Continual learning in computer vision faces the critical challenge of catastrophic forgetting, where models struggle to retain prior knowledge while adapting to new tasks. Although recent studies have attempted to leverage the generalization capabilities of pre-trained models to mitigate overfitting on current tasks, models still tend to forget details of previously learned categories as tasks pro… ▽ More

    Submitted 24 March, 2025; originally announced March 2025.

  30. arXiv:2503.16793  [pdf, other

    cs.CV

    Restoring Forgotten Knowledge in Non-Exemplar Class Incremental Learning through Test-Time Semantic Evolution

    Authors: Haori Lu, Xusheng Cao, Linlan Huang, Enguang Wang, Fei Yang, Xialei Liu

    Abstract: Continual learning aims to accumulate knowledge over a data stream while mitigating catastrophic forgetting. In Non-exemplar Class Incremental Learning (NECIL), forgetting arises during incremental optimization because old classes are inaccessible, hindering the retention of prior knowledge. To solve this, previous methods struggle in achieving the stability-plasticity balance in the training stag… ▽ More

    Submitted 20 March, 2025; originally announced March 2025.

  31. arXiv:2503.16782  [pdf, other

    cs.CV cs.AI cs.LG

    Learning Part Knowledge to Facilitate Category Understanding for Fine-Grained Generalized Category Discovery

    Authors: Enguang Wang, Zhimao Peng, Zhengyuan Xie, Haori Lu, Fei Yang, Xialei Liu

    Abstract: Generalized Category Discovery (GCD) aims to classify unlabeled data containing both seen and novel categories. Although existing methods perform well on generic datasets, they struggle in fine-grained scenarios. We attribute this difficulty to their reliance on contrastive learning over global image features to automatically capture discriminative cues, which fails to capture the subtle local dif… ▽ More

    Submitted 20 March, 2025; originally announced March 2025.

  32. arXiv:2503.15636  [pdf, ps, other

    cs.SC math.CO

    A computational approach to rational summability and its applications via discrete residues

    Authors: Carlos E. Arreche, Hari P. Sitaula

    Abstract: A rational function $f(x)$ is rationally summable if there exists a rational function $g(x)$ such that $f(x)=g(x+1)-g(x)$. Detecting whether a given rational function is summable is an important and basic computational subproblem that arises in algorithms to study diverse aspects of shift difference equations. The discrete residues introduced by Chen and Singer in 2012 enjoy the obstruction-theore… ▽ More

    Submitted 19 March, 2025; originally announced March 2025.

    Comments: Submitted. arXiv admin note: substantial text overlap with arXiv:2402.07328

    MSC Class: 39A06; 33F10; 68W30; 40C15; 11Y50 ACM Class: I.1.2; F.2.1

  33. arXiv:2503.14552  [pdf, other

    cs.CV cs.AI

    Fire and Smoke Datasets in 20 Years: An In-depth Review

    Authors: Sayed Pedram Haeri Boroujeni, Niloufar Mehrabi, Fatemeh Afghah, Connor Peter McGrath, Danish Bhatkar, Mithilesh Anil Biradar, Abolfazl Razi

    Abstract: Fire and smoke phenomena pose a significant threat to the natural environment, ecosystems, and global economy, as well as human lives and wildlife. In this particular circumstance, there is a demand for more sophisticated and advanced technologies to implement an effective strategy for early detection, real-time monitoring, and minimizing the overall impacts of fires on ecological balance and publ… ▽ More

    Submitted 17 March, 2025; originally announced March 2025.

  34. arXiv:2503.14550  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    Novel AI-Based Quantification of Breast Arterial Calcification to Predict Cardiovascular Risk

    Authors: Theodorus Dapamede, Aisha Urooj, Vedant Joshi, Gabrielle Gershon, Frank Li, Mohammadreza Chavoshi, Beatrice Brown-Mulry, Rohan Satya Isaac, Aawez Mansuri, Chad Robichaux, Chadi Ayoub, Reza Arsanjani, Laurence Sperling, Judy Gichoya, Marly van Assen, Charles W. ONeill, Imon Banerjee, Hari Trivedi

    Abstract: Women are underdiagnosed and undertreated for cardiovascular disease. Automatic quantification of breast arterial calcification on screening mammography can identify women at risk for cardiovascular disease and enable earlier treatment and management of disease. In this retrospective study of 116,135 women from two healthcare systems, a transformer-based neural network quantified BAC severity (no… ▽ More

    Submitted 17 March, 2025; originally announced March 2025.

  35. arXiv:2503.13581  [pdf, other

    eess.IV cs.CV

    Subgroup Performance of a Commercial Digital Breast Tomosynthesis Model for Breast Cancer Detection

    Authors: Beatrice Brown-Mulry, Rohan Satya Isaac, Sang Hyup Lee, Ambika Seth, KyungJee Min, Theo Dapamede, Frank Li, Aawez Mansuri, MinJae Woo, Christian Allison Fauria-Robinson, Bhavna Paryani, Judy Wawira Gichoya, Hari Trivedi

    Abstract: While research has established the potential of AI models for mammography to improve breast cancer screening outcomes, there have not been any detailed subgroup evaluations performed to assess the strengths and weaknesses of commercial models for digital breast tomosynthesis (DBT) imaging. This study presents a granular evaluation of the Lunit INSIGHT DBT model on a large retrospective cohort of 1… ▽ More

    Submitted 17 March, 2025; originally announced March 2025.

    Comments: 14 pages, 7 figures (plus 7 figures in supplement), 3 tables (plus 1 table in supplement)

  36. arXiv:2503.07510  [pdf, other

    cs.CY cs.CL

    Sometimes the Model doth Preach: Quantifying Religious Bias in Open LLMs through Demographic Analysis in Asian Nations

    Authors: Hari Shankar, Vedanta S P, Tejas Cavale, Ponnurangam Kumaraguru, Abhijnan Chakraborty

    Abstract: Large Language Models (LLMs) are capable of generating opinions and propagating bias unknowingly, originating from unrepresentative and non-diverse data collection. Prior research has analysed these opinions with respect to the West, particularly the United States. However, insights thus produced may not be generalized in non-Western populations. With the widespread usage of LLM systems by users a… ▽ More

    Submitted 10 March, 2025; originally announced March 2025.

  37. arXiv:2503.05189  [pdf, other

    cs.RO

    Persistent Object Gaussian Splat (POGS) for Tracking Human and Robot Manipulation of Irregularly Shaped Objects

    Authors: Justin Yu, Kush Hari, Karim El-Refai, Arnav Dalal, Justin Kerr, Chung Min Kim, Richard Cheng, Muhammad Zubair Irshad, Ken Goldberg

    Abstract: Tracking and manipulating irregularly-shaped, previously unseen objects in dynamic environments is important for robotic applications in manufacturing, assembly, and logistics. Recently introduced Gaussian Splats efficiently model object geometry, but lack persistent state estimation for task-oriented manipulation. We present Persistent Object Gaussian Splat (POGS), a system that embeds semantics,… ▽ More

    Submitted 7 March, 2025; originally announced March 2025.

    Comments: Accepted to ICRA 2025

  38. Organize, Then Vote: Exploring Cognitive Load in Quadratic Survey Interfaces

    Authors: Ti-Chung Cheng, Yutong Zhang, Yi-Hung Chou, Vinay Koshy, Tiffany Wenting Li, Karrie Karahalios, Hari Sundaram

    Abstract: Quadratic Surveys (QSs) elicit more accurate preferences than traditional methods like Likert-scale surveys. However, the cognitive load associated with QSs has hindered their adoption in digital surveys for collective decision-making. We introduce a two-phase "organize-then-vote" QS to reduce cognitive load. As interface design significantly impacts survey results and accuracy, our design scaffol… ▽ More

    Submitted 16 May, 2025; v1 submitted 6 March, 2025; originally announced March 2025.

    ACM Class: H.5.2

    Journal ref: Proceedings of the 2025 CHI Conference on Human Factors in Computing Systems (CHI '25), Article 475, 35 pages, ACM, New York, NY, USA

  39. arXiv:2503.01522  [pdf, ps, other

    cs.IT cs.CR

    Byzantine Distributed Function Computation

    Authors: Hari Krishnan P. Anilkumar, Neha Sangwan, Varun Narayanan, Vinod M. Prabhakaran

    Abstract: We study the distributed function computation problem with $k$ users of which at most $s$ may be controlled by an adversary and characterize the set of functions of the sources the decoder can reconstruct robustly in the following sense -- if the users behave honestly, the function is recovered with high probability (w.h.p.); if they behave adversarially, w.h.p, either one of the adversarial users… ▽ More

    Submitted 10 March, 2025; v1 submitted 3 March, 2025; originally announced March 2025.

  40. arXiv:2503.00917  [pdf, other

    cs.LG cs.CR

    AMUN: Adversarial Machine UNlearning

    Authors: Ali Ebrahimpour-Boroojeny, Hari Sundaram, Varun Chandrasekaran

    Abstract: Machine unlearning, where users can request the deletion of a forget dataset, is becoming increasingly important because of numerous privacy regulations. Initial works on ``exact'' unlearning (e.g., retraining) incur large computational overheads. However, while computationally inexpensive, ``approximate'' methods have fallen short of reaching the effectiveness of exact unlearning: models produced… ▽ More

    Submitted 1 May, 2025; v1 submitted 2 March, 2025; originally announced March 2025.

  41. arXiv:2502.12552  [pdf, other

    cs.CY cs.AI

    LLM Safety for Children

    Authors: Prasanjit Rath, Hari Shrawgi, Parag Agrawal, Sandipan Dandapat

    Abstract: This paper analyzes the safety of Large Language Models (LLMs) in interactions with children below age of 18 years. Despite the transformative applications of LLMs in various aspects of children's lives such as education and therapy, there remains a significant gap in understanding and mitigating potential content harms specific to this demographic. The study acknowledges the diverse nature of chi… ▽ More

    Submitted 18 February, 2025; originally announced February 2025.

  42. arXiv:2502.12477  [pdf, other

    cs.CL

    Savaal: Scalable Concept-Driven Question Generation to Enhance Human Learning

    Authors: Kimia Noorbakhsh, Joseph Chandler, Pantea Karimi, Mohammad Alizadeh, Hari Balakrishnan

    Abstract: Assessing and enhancing human learning through question-answering is vital, yet automating this process remains challenging. While large language models (LLMs) excel at summarization and query responses, their ability to generate meaningful questions for learners is underexplored. We propose Savaal, a scalable question-generation system with three objectives: (i) scalability, enabling question g… ▽ More

    Submitted 21 February, 2025; v1 submitted 17 February, 2025; originally announced February 2025.

    Comments: Kimia Noorbakhsh, Joseph Chandler, and Pantea Karimi contributed equally to the work

  43. arXiv:2502.10638  [pdf, other

    cs.HC

    Script&Shift: A Layered Interface Paradigm for Integrating Content Development and Rhetorical Strategy with LLM Writing Assistants

    Authors: Momin Siddiqui, Roy Pea, Hari Subramonyam

    Abstract: Good writing is a dynamic process of knowledge transformation, where writers refine and evolve ideas through planning, translating, and reviewing. Generative AI-powered writing tools can enhance this process but may also disrupt the natural flow of writing, such as when using LLMs for complex tasks like restructuring content across different sections or creating smooth transitions. We introduce Sc… ▽ More

    Submitted 14 February, 2025; originally announced February 2025.

  44. arXiv:2502.09854  [pdf, other

    cs.CL cs.AI cs.LG

    Efficient Multitask Learning in Small Language Models Through Upside-Down Reinforcement Learning

    Authors: Yu-Chen Lin, Sanat Sharma, Hari Manikandan, Jayant Kumar, Tracy Holloway King, Jing Zheng

    Abstract: In this work, we demonstrate that small language models (SLMs), specifically a 100M parameter GPT-2 model, can achieve competitive performance in multitask prompt generation tasks while requiring only a fraction of the computational resources needed by large language models (LLMs). Through a novel combination of upside-down reinforcement learning and synthetic data distillation from a powerful LLM… ▽ More

    Submitted 13 February, 2025; originally announced February 2025.

  45. arXiv:2502.01676  [pdf, other

    cs.CL cs.CY

    Benchmark on Peer Review Toxic Detection: A Challenging Task with a New Dataset

    Authors: Man Luo, Bradley Peterson, Rafael Gan, Hari Ramalingame, Navya Gangrade, Ariadne Dimarogona, Imon Banerjee, Phillip Howard

    Abstract: Peer review is crucial for advancing and improving science through constructive criticism. However, toxic feedback can discourage authors and hinder scientific progress. This work explores an important but underexplored area: detecting toxicity in peer reviews. We first define toxicity in peer reviews across four distinct categories and curate a dataset of peer reviews from the OpenReview platform… ▽ More

    Submitted 1 February, 2025; originally announced February 2025.

    Comments: Accepted to WiML workshop @Neurips 2024

  46. arXiv:2502.00416  [pdf, other

    cs.CE

    GO-GAN: Geometry Optimization Generative Adversarial Network for Achieving Optimized Structures with Targeted Physical Properties

    Authors: A. Padmaprabhan, Shriram Hari, Nived Philip Thomas, Khaish Singh Chadha, Sai Sidhardh, Viswanath Chinthapenta, Prabhat Kumar

    Abstract: This paper presents GO-GAN, a novel Generative Adversarial Network (GAN) architecture for geometry optimization (GO), specifically to generate structures based on user-specified input parameters. The architecture for GO-GAN proposed here combines a \texttt{Pix2Pix} GAN with a new input mechanism, involving a dynamic batch gradient descent-based training loop that leverages dataset symmetries. The… ▽ More

    Submitted 1 February, 2025; originally announced February 2025.

    Comments: iNCMDAO 2024

  47. arXiv:2501.19232  [pdf, other

    cs.IR cs.AI

    A Zero-Shot Generalization Framework for LLM-Driven Cross-Domain Sequential Recommendation

    Authors: Yunzhe Li, Junting Wang, Hari Sundaram, Zhining Liu

    Abstract: Zero-shot cross-domain sequential recommendation (ZCDSR) enables predictions in unseen domains without the need for additional training or fine-tuning, making it particularly valuable in data-sparse environments where traditional models struggle. Recent advancements in large language models (LLMs) have greatly improved ZCDSR by leveraging rich pretrained representations to facilitate cross-domain… ▽ More

    Submitted 31 January, 2025; originally announced January 2025.

    Comments: 11 pages

  48. arXiv:2501.15743  [pdf, other

    eess.IV cs.CV

    Z-Stack Scanning can Improve AI Detection of Mitosis: A Case Study of Meningiomas

    Authors: Hongyan Gu, Ellie Onstott, Wenzhong Yan, Tengyou Xu, Ruolin Wang, Zida Wu, Xiang 'Anthony' Chen, Mohammad Haeri

    Abstract: Z-stack scanning is an emerging whole slide imaging technology that captures multiple focal planes alongside the z-axis of a glass slide. Because z-stacking can offer enhanced depth information compared to the single-layer whole slide imaging, this technology can be particularly useful in analyzing small-scaled histopathological patterns. However, its actual clinical impact remains debated with mi… ▽ More

    Submitted 26 January, 2025; originally announced January 2025.

    Comments: To appear 2025 IEEE 22nd International Symposium on Biomedical Imaging (ISBI)

  49. arXiv:2501.10733  [pdf, other

    cs.CV

    A CNN-Transformer for Classification of Longitudinal 3D MRI Images -- A Case Study on Hepatocellular Carcinoma Prediction

    Authors: Jakob Nolte, Maureen M. J. Guichelaar, Donald E. Bouman, Stephanie M. van den Berg, Maryam Amir Haeri

    Abstract: Longitudinal MRI analysis is crucial for predicting disease outcomes, particularly in chronic conditions like hepatocellular carcinoma (HCC), where early detection can significantly influence treatment strategies and patient prognosis. Yet, due to challenges like limited data availability, subtle parenchymal changes, and the irregular timing of medical screenings, current approaches have so far fo… ▽ More

    Submitted 22 January, 2025; v1 submitted 18 January, 2025; originally announced January 2025.

    Comments: Submitted for publication to Biomedical Signal Processing and Control; Incorrect notation corrected

    ACM Class: I.4.9; I.2.1

  50. arXiv:2501.10374  [pdf

    cs.CY

    Artificial Intelligence in Mental Health and Well-Being: Evolution, Current Applications, Future Challenges, and Emerging Evidence

    Authors: Hari Mohan Pandey

    Abstract: Artificial Intelligence (AI) is a broad field that is upturning mental health care in many ways, from addressing anxiety, depression, and stress to increasing access, personalization of treatment, and real-time monitoring that enhances patient outcomes. The current paper discusses the evolution, present application, and future challenges in the field of AI for mental health and well-being. From th… ▽ More

    Submitted 13 December, 2024; originally announced January 2025.