Skip to main content

Showing 1–50 of 320 results for author: Ra, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.01329  [pdf

    cs.CL cs.AI

    Evaluating Large Language Models in Crisis Detection: A Real-World Benchmark from Psychological Support Hotlines

    Authors: Guifeng Deng, Shuyin Rao, Tianyu Lin, Anlu Dai, Pan Wang, Junyi Xie, Haidong Song, Ke Zhao, Dongwu Xu, Zhengdong Cheng, Tao Li, Haiteng Jiang

    Abstract: Psychological support hotlines are critical for crisis intervention but face significant challenges due to rising demand. Large language models (LLMs) could support crisis assessments, yet their capabilities in emotionally sensitive contexts remain unclear. We introduce PsyCrisisBench, a benchmark of 540 annotated transcripts from the Hangzhou Psychological Assistance Hotline, assessing four tasks… ▽ More

    Submitted 2 June, 2025; originally announced June 2025.

    Comments: 30 pages, 8 figures

  2. arXiv:2504.20676  [pdf, ps, other

    cs.AI cs.CY cs.IT

    The Limits of AI Explainability: An Algorithmic Information Theory Approach

    Authors: Shrisha Rao

    Abstract: This paper establishes a theoretical foundation for understanding the fundamental limits of AI explainability through algorithmic information theory. We formalize explainability as the approximation of complex models by simpler ones, quantifying both approximation error and explanation complexity using Kolmogorov complexity. Our key theoretical contributions include: (1) a complexity gap theorem p… ▽ More

    Submitted 29 April, 2025; originally announced April 2025.

    MSC Class: 68Q30; 68T01 ACM Class: I.2.0; H.1.1; K.4.1

  3. Analyzing Value Functions of States in Parametric Markov Chains

    Authors: Kasper Engelen, Guillermo A. Pérez, Shrisha Rao

    Abstract: Parametric Markov chains (pMC) are used to model probabilistic systems with unknown or partially known probabilities. Although (universal) pMC verification for reachability properties is known to be coETR-complete, there have been efforts to approach it using potentially easier-to-check properties such as asking whether the pMC is monotonic in certain parameters. In this paper, we first reduce mon… ▽ More

    Submitted 26 April, 2025; v1 submitted 23 April, 2025; originally announced April 2025.

    Comments: Published as part of the book "Principles of Verification: Cycling the Probabilistic Landscape: Essays Dedicated to Joost-Pieter Katoen on the Occasion of His 60th Birthday, Part II"

  4. arXiv:2504.11056  [pdf, other

    math.NA cs.CE

    A study of troubled-cell indicators applied to finite volume methods using a novel monotonicity parameter

    Authors: R. Shivananda Rao, M. Ramakrishna

    Abstract: We adapt a troubled-cell indicator developed for discontinuous Galerkin (DG) methods to the finite volume method (FVM) framework for solving hyperbolic conservation laws. This indicator depends solely on the cell-average data of the target cell and its immediate neighbours. Once the troubled-cells are identified, we apply the limiter only in these cells instead of applying in all computational cel… ▽ More

    Submitted 15 April, 2025; originally announced April 2025.

  5. arXiv:2504.10861  [pdf, other

    cs.CL

    Ai2 Scholar QA: Organized Literature Synthesis with Attribution

    Authors: Amanpreet Singh, Joseph Chee Chang, Chloe Anastasiades, Dany Haddad, Aakanksha Naik, Amber Tanaka, Angele Zamarron, Cecile Nguyen, Jena D. Hwang, Jason Dunkleberger, Matt Latzke, Smita Rao, Jaron Lochner, Rob Evans, Rodney Kinney, Daniel S. Weld, Doug Downey, Sergey Feldman

    Abstract: Retrieval-augmented generation is increasingly effective in answering scientific questions from literature, but many state-of-the-art systems are expensive and closed-source. We introduce Ai2 Scholar QA, a free online scientific question answering application. To facilitate research, we make our entire pipeline public: as a customizable open-source Python package and interactive web app, along wit… ▽ More

    Submitted 15 April, 2025; originally announced April 2025.

    Comments: 7 pages

  6. arXiv:2504.08952  [pdf, other

    cs.SE cs.HC

    RiskRAG: A Data-Driven Solution for Improved AI Model Risk Reporting

    Authors: Pooja S. B. Rao, Sanja Šćepanović, Ke Zhou, Edyta Paulina Bogucka, Daniele Quercia

    Abstract: Risk reporting is essential for documenting AI models, yet only 14% of model cards mention risks, out of which 96% copying content from a small set of cards, leading to a lack of actionable insights. Existing proposals for improving model cards do not resolve these issues. To address this, we introduce RiskRAG, a Retrieval Augmented Generation based risk reporting solution guided by five design re… ▽ More

    Submitted 11 April, 2025; originally announced April 2025.

  7. arXiv:2503.22613   

    cs.DS cs.CC

    Randomized $\tilde{O}(m\sqrt{n})$ Bellman-Ford from Fineman and the Boilermakers

    Authors: Satish Rao

    Abstract: A classical algorithm by Bellman and Ford from the 1950's computes shortest paths in weighted graphs on $n$ vertices and $m$ edges with possibly negative weights in $O(mn)$ time. Indeed, this algorithm is taught regularly in undergraduate Algorithms courses. In 2023, after nearly 70 years, Fineman \cite{fineman2024single} developed an $\tilde{O}(m n^{8/9})$ expected time algorithm for this probl… ▽ More

    Submitted 9 April, 2025; v1 submitted 28 March, 2025; originally announced March 2025.

    Comments: This paper is incorrect. The negative sandwich needs to be done after the betweenness reduction in the the recursive version. That is, the negative sandwich and the full betweenness reduction needs to be done every step which makes the runtime be what was in the work of Huang, Jin, and Quanrud

  8. arXiv:2503.17332  [pdf, other

    cs.CR cs.AI

    CVE-Bench: A Benchmark for AI Agents' Ability to Exploit Real-World Web Application Vulnerabilities

    Authors: Yuxuan Zhu, Antony Kellermann, Dylan Bowman, Philip Li, Akul Gupta, Adarsh Danda, Richard Fang, Conner Jensen, Eric Ihli, Jason Benn, Jet Geronimo, Avi Dhir, Sudhit Rao, Kaicheng Yu, Twm Stone, Daniel Kang

    Abstract: Large language model (LLM) agents are increasingly capable of autonomously conducting cyberattacks, posing significant threats to existing applications. This growing risk highlights the urgent need for a real-world benchmark to evaluate the ability of LLM agents to exploit web application vulnerabilities. However, existing benchmarks fall short as they are limited to abstracted Capture the Flag co… ▽ More

    Submitted 10 April, 2025; v1 submitted 21 March, 2025; originally announced March 2025.

    Comments: 15 pages, 4 figures, 5 tables

    ACM Class: I.2.1; I.2.7

  9. arXiv:2503.16672  [pdf, other

    cs.LG cs.AI

    Accelerating Transformer Inference and Training with 2:4 Activation Sparsity

    Authors: Daniel Haziza, Timothy Chou, Dhruv Choudhary, Luca Wehrstedt, Francisco Massa, Jiecao Yu, Geonhwa Jeong, Supriya Rao, Patrick Labatut, Jesse Cai

    Abstract: In this paper, we demonstrate how to leverage 2:4 sparsity, a popular hardware-accelerated GPU sparsity pattern, to activations to accelerate large language model training and inference. Crucially we exploit the intrinsic sparsity found in Squared-ReLU activations to provide this acceleration with no accuracy loss. Our approach achieves up to 1.3x faster Feed Forward Network (FFNs) in both the for… ▽ More

    Submitted 20 March, 2025; originally announced March 2025.

    MSC Class: I.2

  10. arXiv:2503.13507  [pdf, other

    cs.CL cs.AI

    NeurIPS 2023 LLM Efficiency Fine-tuning Competition

    Authors: Mark Saroufim, Yotam Perlitz, Leshem Choshen, Luca Antiga, Greg Bowyer, Christian Puhrsch, Driss Guessous, Supriya Rao, Geeta Chauhan, Ashvini Kumar, Jindal Pawan Kumar, Rajpoot Ankur Parikh, Joe Isaacson, Weiwei Yang

    Abstract: Our analysis of the NeurIPS 2023 large language model (LLM) fine-tuning competition revealed the following trend: top-performing models exhibit significant overfitting on benchmark datasets, mirroring the broader issue of benchmark overfitting on popular leaderboards and that data curation is essential in order to get a high performing LLM. The competition, which consisted of two stages - an open… ▽ More

    Submitted 13 March, 2025; originally announced March 2025.

    Comments: 11 pages, 10 figures

  11. arXiv:2503.12317  [pdf

    cs.AI

    A Transformer-based survival model for prediction of all-cause mortality in heart failure patients: a multi-cohort study

    Authors: Shishir Rao, Nouman Ahmed, Gholamreza Salimi-Khorshidi, Christopher Yau, Huimin Su, Nathalie Conrad, Folkert W Asselbergs, Mark Woodward, Rod Jackson, John GF Cleland, Kazem Rahimi

    Abstract: We developed and validated TRisk, a Transformer-based AI model predicting 36-month mortality in heart failure patients by analysing temporal patient journeys from UK electronic health records (EHR). Our study included 403,534 heart failure patients (ages 40-90) from 1,418 English general practices, with 1,063 practices for model derivation and 355 for external validation. TRisk was compared agains… ▽ More

    Submitted 15 March, 2025; originally announced March 2025.

  12. arXiv:2503.05473  [pdf, other

    cs.NE cs.AI

    The Society of HiveMind: Multi-Agent Optimization of Foundation Model Swarms to Unlock the Potential of Collective Intelligence

    Authors: Noah Mamie, Susie Xi Rao

    Abstract: Multi-agent systems address issues of accessibility and scalability of artificial intelligence (AI) foundation models, which are often represented by large language models. We develop a framework - the "Society of HiveMind" (SOHM) - that orchestrates the interaction between multiple AI foundation models, imitating the observed behavior of animal swarms in nature by following modern evolutionary th… ▽ More

    Submitted 13 March, 2025; v1 submitted 7 March, 2025; originally announced March 2025.

    Comments: 11 pages (excl. appendix)

  13. arXiv:2502.12992  [pdf, other

    cs.CL cs.AI

    B-cos LM: Efficiently Transforming Pre-trained Language Models for Improved Explainability

    Authors: Yifan Wang, Sukrut Rao, Ji-Ung Lee, Mayank Jobanputra, Vera Demberg

    Abstract: Post-hoc explanation methods for black-box models often struggle with faithfulness and human interpretability due to the lack of explainability in current neural models. Meanwhile, B-cos networks have been introduced to improve model explainability through architectural and computational adaptations, but their application has so far been limited to computer vision models and their associated train… ▽ More

    Submitted 18 February, 2025; originally announced February 2025.

    Comments: 20 pages, 15 figures

  14. arXiv:2502.09189  [pdf, other

    cs.LO cs.DS cs.FL

    Data Structures for Finite Downsets of Natural Vectors: Theory and Practice

    Authors: Michaël Cadilhac, Vanessa Flügel, Guillermo A. Pérez, Shrisha Rao

    Abstract: Manipulating downward-closed sets of vectors forms the basis of so-called antichain-based algorithms in verification. In that context, the dimension of the vectors is intimately tied to the size of the input structure to be verified. In this work, we formally analyze the complexity of classical list-based algorithms to manipulate antichains as well as that of Zampuniéris's sharing trees and tradit… ▽ More

    Submitted 13 February, 2025; originally announced February 2025.

  15. arXiv:2502.07281  [pdf, other

    cs.LG

    Supervised Contrastive Block Disentanglement

    Authors: Taro Makino, Ji Won Park, Natasa Tagasovska, Takamasa Kudo, Paula Coelho, Jan-Christian Huetter, Heming Yao, Burkhard Hoeckendorf, Ana Carolina Leote, Stephen Ra, David Richmond, Kyunghyun Cho, Aviv Regev, Romain Lopez

    Abstract: Real-world datasets often combine data collected under different experimental conditions. This yields larger datasets, but also introduces spurious correlations that make it difficult to model the phenomena of interest. We address this by learning two embeddings to independently represent the phenomena of interest and the spurious correlations. The embedding representing the phenomena of interest… ▽ More

    Submitted 11 February, 2025; originally announced February 2025.

  16. arXiv:2501.09201  [pdf, other

    cs.PL cs.SC

    Towards Semantics Lifting for Scientific Computing: A Case Study on FFT

    Authors: Naifeng Zhang, Sanil Rao, Mike Franusich, Franz Franchetti

    Abstract: The rise of automated code generation tools, such as large language models (LLMs), has introduced new challenges in ensuring the correctness and efficiency of scientific software, particularly in complex kernels, where numerical stability, domain-specific optimizations, and precise floating-point arithmetic are critical. We propose a stepwise semantics lifting approach using an extended SPIRAL fra… ▽ More

    Submitted 15 January, 2025; originally announced January 2025.

    Comments: Accepted at the Theory and Practice of Static Analysis Workshop (TPSA), in conjunction with the ACM SIGPLAN Symposium on Principles of Programming Languages (POPL), 2025

  17. Autonomous Electrochemistry Platform with Real-Time Normality Testing of Voltammetry Measurements Using ML

    Authors: Anees Al-Najjar, Nageswara S. V. Rao, Craig A. Bridges, Sheng Dai, Alex Walters

    Abstract: Electrochemistry workflows utilize various instruments and computing systems to execute workflows consisting of electrocatalyst synthesis, testing and evaluation tasks. The heterogeneity of the software and hardware of these ecosystems makes it challenging to orchestrate a complete workflow from production to characterization by automating its tasks. We propose an autonomous electrochemistry compu… ▽ More

    Submitted 13 January, 2025; originally announced January 2025.

    Comments: 10 pages, 14 figures, accepted in the IEEE 20th International Conference on e-Science (e-Science), 2024

  18. arXiv:2501.05765  [pdf, ps, other

    cs.AI cs.LO

    Deontic Temporal Logic for Formal Verification of AI Ethics

    Authors: Priya T. V., Shrisha Rao

    Abstract: Ensuring ethical behavior in Artificial Intelligence (AI) systems amidst their increasing ubiquity and influence is a major concern the world over. The use of formal methods in AI ethics is a possible crucial approach for specifying and verifying the ethical behavior of AI systems. This paper proposes a formalization based on deontic logic to define and evaluate the ethical behavior of AI systems,… ▽ More

    Submitted 14 May, 2025; v1 submitted 10 January, 2025; originally announced January 2025.

    ACM Class: I.2.m; F.4.1

  19. arXiv:2501.01963  [pdf, other

    cs.LG cs.AI cs.IT math.PR math.ST stat.ML

    Statistical learning does not always entail knowledge

    Authors: Daniel Andrés Díaz-Pachón, H. Renata Gallegos, Ola Hössjer, J. Sunil Rao

    Abstract: In this paper, we study learning and knowledge acquisition (LKA) of an agent about a proposition that is either true or false. We use a Bayesian approach, where the agent receives data to update his beliefs about the proposition according to a posterior distribution. The LKA is formulated in terms of active information, with data representing external or exogenous information that modifies the age… ▽ More

    Submitted 17 December, 2024; originally announced January 2025.

    Comments: 30 pages, 1 figure

    MSC Class: 60A99 62A01 68T01 62B10

  20. arXiv:2501.00273  [pdf, other

    cs.CL

    Echoes in AI: Quantifying Lack of Plot Diversity in LLM Outputs

    Authors: Weijia Xu, Nebojsa Jojic, Sudha Rao, Chris Brockett, Bill Dolan

    Abstract: With rapid advances in large language models (LLMs), there has been an increasing application of LLMs in creative content ideation and generation. A critical question emerges: can current LLMs provide ideas that are diverse enough to truly bolster the collective creativity? We examine two state-of-the-art LLMs, GPT-4 and LLaMA-3, on story generation and discover that LLM-generated stories often co… ▽ More

    Submitted 30 December, 2024; originally announced January 2025.

  21. arXiv:2412.01273  [pdf, other

    cs.HC cs.CV

    AR-Facilitated Safety Inspection and Fall Hazard Detection on Construction Sites

    Authors: Jiazhou Liu, Aravinda S. Rao, Fucai Ke, Tim Dwyer, Benjamin Tag, Pari Delir Haghighi

    Abstract: Together with industry experts, we are exploring the potential of head-mounted augmented reality to facilitate safety inspections on high-rise construction sites. A particular concern in the industry is inspecting perimeter safety screens on higher levels of construction sites, intended to prevent falls of people and objects. We aim to support workers performing this inspection task by tracking wh… ▽ More

    Submitted 2 December, 2024; originally announced December 2024.

    Comments: 2 pages, 1 figure, ISMAR24 Workshop Paper

  22. arXiv:2411.12516  [pdf, other

    cond-mat.mes-hall cs.CV cs.ET cs.LG quant-ph

    Modular Autonomous Virtualization System for Two-Dimensional Semiconductor Quantum Dot Arrays

    Authors: Anantha S. Rao, Donovan Buterakos, Barnaby van Straaten, Valentin John, Cécile X. Yu, Stefan D. Oosterhout, Lucas Stehouwer, Giordano Scappucci, Menno Veldhorst, Francesco Borsoi, Justyna P. Zwolak

    Abstract: Arrays of gate-defined semiconductor quantum dots are among the leading candidates for building scalable quantum processors. High-fidelity initialization, control, and readout of spin qubit registers require exquisite and targeted control over key Hamiltonian parameters that define the electrostatic environment. However, due to the tight gate pitch, capacitive crosstalk between gates hinders indep… ▽ More

    Submitted 6 May, 2025; v1 submitted 19 November, 2024; originally announced November 2024.

    Comments: 14 pages, 5 figures, 9 pages of supplemental material

    Journal ref: Phys. Rev. X 15, 021034 (2025)

  23. arXiv:2411.04057  [pdf, ps, other

    quant-ph cs.DS

    A unified approach to quantum de Finetti theorems and SoS rounding via geometric quantization

    Authors: Sujit Rao

    Abstract: The sum-of-squares hierarchy of semidefinite programs has become a common tool for algorithm design in theoretical computer science, including problems in quantum information. In this work we study a connection between a Hermitian version of the SoS hierarchy, related to the quantum de Finetti theorem, and geometric quantization of compact Kähler manifolds (such as complex projective space… ▽ More

    Submitted 6 November, 2024; originally announced November 2024.

    Comments: 46 pages

  24. arXiv:2411.02714  [pdf, other

    cs.CL cs.AI cs.HC

    Game Plot Design with an LLM-powered Assistant: An Empirical Study with Game Designers

    Authors: Seyed Hossein Alavi, Weijia Xu, Nebojsa Jojic, Daniel Kennett, Raymond T. Ng, Sudha Rao, Haiyan Zhang, Bill Dolan, Vered Shwartz

    Abstract: We introduce GamePlot, an LLM-powered assistant that supports game designers in crafting immersive narratives for turn-based games, and allows them to test these games through a collaborative game play and refine the plot throughout the process. Our user study with 14 game designers shows high levels of both satisfaction with the generated game plots and sense of ownership over the narratives, but… ▽ More

    Submitted 4 November, 2024; originally announced November 2024.

  25. arXiv:2411.00715  [pdf, other

    cs.CV cs.AI cs.LG

    B-cosification: Transforming Deep Neural Networks to be Inherently Interpretable

    Authors: Shreyash Arya, Sukrut Rao, Moritz Böhle, Bernt Schiele

    Abstract: B-cos Networks have been shown to be effective for obtaining highly human interpretable explanations of model decisions by architecturally enforcing stronger alignment between inputs and weight. B-cos variants of convolutional networks (CNNs) and vision transformers (ViTs), which primarily replace linear layers with B-cos transformations, perform competitively to their respective standard variants… ▽ More

    Submitted 24 January, 2025; v1 submitted 1 November, 2024; originally announced November 2024.

    Comments: 31 pages, 9 figures, 12 tables, Neural Information Processing Systems (NeurIPS) 2024; added references, corrected typos

  26. arXiv:2410.21627  [pdf, other

    cs.CL cs.AI

    MCPDial: A Minecraft Persona-driven Dialogue Dataset

    Authors: Seyed Hossein Alavi, Sudha Rao, Ashutosh Adhikari, Gabriel A DesGarennes, Akanksha Malhotra, Chris Brockett, Mahmoud Adada, Raymond T. Ng, Vered Shwartz, Bill Dolan

    Abstract: We propose a novel approach that uses large language models (LLMs) to generate persona-driven conversations between Players and Non-Player Characters (NPC) in games. Showcasing the application of our methodology, we introduce the Minecraft Persona-driven Dialogue dataset (MCPDial). Starting with a small seed of expert-written conversations, we employ our method to generate hundreds of additional c… ▽ More

    Submitted 28 October, 2024; originally announced October 2024.

  27. arXiv:2410.20773  [pdf, other

    cs.SD cs.LG eess.AS

    An Ensemble Approach to Music Source Separation: A Comparative Analysis of Conventional and Hierarchical Stem Separation

    Authors: Saarth Vardhan, Pavani R Acharya, Samarth S Rao, Oorjitha Ratna Jasthi, S Natarajan

    Abstract: Music source separation (MSS) is a task that involves isolating individual sound sources, or stems, from mixed audio signals. This paper presents an ensemble approach to MSS, combining several state-of-the-art architectures to achieve superior separation performance across traditional Vocal, Drum, and Bass (VDB) stems, as well as expanding into second-level hierarchical separation for sub-stems li… ▽ More

    Submitted 28 October, 2024; originally announced October 2024.

  28. arXiv:2410.16842  [pdf, other

    cs.CL cs.AI

    Assessment of Transformer-Based Encoder-Decoder Model for Human-Like Summarization

    Authors: Sindhu Nair, Y. S. Rao, Radha Shankarmani

    Abstract: In recent times, extracting valuable information from large text is making significant progress. Especially in the current era of social media, people expect quick bites of information. Automatic text summarization seeks to tackle this by slimming large texts down into more manageable summaries. This important research area can aid in decision-making by digging out salient content from large text.… ▽ More

    Submitted 22 October, 2024; originally announced October 2024.

    Comments: Pre-print

    ACM Class: I.2.7

  29. arXiv:2410.15998  [pdf, other

    cs.CL cs.AI cs.LG

    1024m at SMM4H 2024: Tasks 3, 5 & 6 -- Ensembles of Transformers and Large Language Models for Medical Text Classification

    Authors: Ram Mohan Rao Kadiyala, M. V. P. Chandra Sekhara Rao

    Abstract: Social media is a great source of data for users reporting information and regarding their health and how various things have had an effect on them. This paper presents various approaches using Transformers and Large Language Models and their ensembles, their performance along with advantages and drawbacks for various tasks of SMM4H'24 - Classifying texts on impact of nature and outdoor spaces on… ▽ More

    Submitted 21 October, 2024; originally announced October 2024.

    Comments: short paper , acl 2024

  30. Leveraging Internet Principles to Build a Quantum Network

    Authors: Leonardo Bacciottini, Matheus Guedes De Andrade, Shahrooz Pouryousef, Emily A. Van Milligen, Aparimit Chandra, Nitish K. Panigrahy, Nageswara S. V. Rao, Gayane Vardoyan, Don Towsley

    Abstract: Designing an operational architecture for the Quantum Internet is challenging in light of both fundamental limits imposed by physics laws and technological constraints. Here, we propose a method to abstract away most of the quantum-specific elements and formulate a best-effort quantum network architecture based on packet switching, akin to that of the classical Internet. This reframing provides an… ▽ More

    Submitted 29 April, 2025; v1 submitted 11 October, 2024; originally announced October 2024.

    Comments: 9 pages, 5 figures

  31. arXiv:2410.07625  [pdf, other

    cs.CV

    MorCode: Face Morphing Attack Generation using Generative Codebooks

    Authors: Aravinda Reddy PN, Raghavendra Ramachandra, Sushma Venkatesh, Krothapalli Sreenivasa Rao, Pabitra Mitra, Rakesh Krishna

    Abstract: Face recognition systems (FRS) can be compromised by face morphing attacks, which blend textural and geometric information from multiple facial images. The rapid evolution of generative AI, especially Generative Adversarial Networks (GAN) or Diffusion models, where encoded images are interpolated to generate high-quality face morphing images. In this work, we present a novel method for the automat… ▽ More

    Submitted 10 October, 2024; originally announced October 2024.

  32. arXiv:2410.06543  [pdf, other

    cs.CR cs.SD eess.AS

    Gumbel Rao Monte Carlo based Bi-Modal Neural Architecture Search for Audio-Visual Deepfake Detection

    Authors: Aravinda Reddy PN, Raghavendra Ramachandra, Krothapalli Sreenivasa Rao, Pabitra Mitra Vinod Rathod

    Abstract: Deepfakes pose a critical threat to biometric authentication systems by generating highly realistic synthetic media. Existing multimodal deepfake detectors often struggle to adapt to diverse data and rely on simple fusion methods. To address these challenges, we propose Gumbel-Rao Monte Carlo Bi-modal Neural Architecture Search (GRMC-BMNAS), a novel architecture search framework that employs Gumbe… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

  33. arXiv:2409.19109  [pdf, other

    cs.NI

    Trust, But Verify, Operator-Reported Geolocation

    Authors: Katherine Izhikevich, Ben Du, Sumanth Rao, Alisha Ukani, Liz Izhikevich

    Abstract: Geolocation plays a critical role in understanding the Internet. In this work, we provide an in-depth analysis of operator-misreported geolocation. Using a bandwidth-efficient methodology, we find in May 2024 that only a small percentage (1.5%) of vantage points in the largest community-vantage point collection, RIPE Atlas, do not respond from their operator-reported geolocation. However, misrepor… ▽ More

    Submitted 9 October, 2024; v1 submitted 27 September, 2024; originally announced September 2024.

  34. arXiv:2409.05477  [pdf, other

    cs.LG

    Retrofitting Temporal Graph Neural Networks with Transformer

    Authors: Qiang Huang, Xiao Yan, Xin Wang, Susie Xi Rao, Zhichao Han, Fangcheng Fu, Wentao Zhang, Jiawei Jiang

    Abstract: Temporal graph neural networks (TGNNs) outperform regular GNNs by incorporating time information into graph-based operations. However, TGNNs adopt specialized models (e.g., TGN, TGAT, and APAN ) and require tailored training frameworks (e.g., TGL and ETC). In this paper, we propose TF-TGN, which uses Transformer decoder as the backbone model for TGNN to enjoy Transformer's codebase for efficient t… ▽ More

    Submitted 18 September, 2024; v1 submitted 9 September, 2024; originally announced September 2024.

    Comments: conference Under review

  35. arXiv:2408.14169  [pdf, other

    cs.DC cs.AI

    Dynamic Pricing for Electric Vehicle Charging

    Authors: Arun Kumar Kalakanti, Shrisha Rao

    Abstract: Dynamic pricing is a promising strategy to address the challenges of smart charging, as traditional time-of-use (ToU) rates and stationary pricing (SP) do not dynamically react to changes in operating conditions, reducing revenue for charging station (CS) vendors and affecting grid stability. Previous studies evaluated single objectives or linear combinations of objectives for EV CS pricing soluti… ▽ More

    Submitted 26 August, 2024; originally announced August 2024.

    Comments: 12 pages

  36. arXiv:2408.10148  [pdf, other

    cs.GT cs.MA

    Auctioning Escape Permits for Multiple Correlated Pollutants Using CMRA

    Authors: Keshav Goyal, Sooraj Sathish, Shrisha Rao

    Abstract: In the context of increasingly complex environmental challenges, effective pollution control mechanisms are crucial. By extending the state of the art auction mechanisms, we aim to develop an efficient approach for allocating pollution abatement resources in a multi-pollutant setting with pollutants affecting each other's reduction costs. We modify the Combinatorial Multi-Round Ascending Auction f… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

  37. arXiv:2408.04362  [pdf, other

    cs.SD eess.AS

    NeuralMultiling: A Novel Neural Architecture Search for Smartphone based Multilingual Speaker Verification

    Authors: Aravinda Reddy PN, Raghavendra Ramachandra, K. Sreenivasa Rao, Pabitra Mitra

    Abstract: Multilingual speaker verification introduces the challenge of verifying a speaker in multiple languages. Existing systems were built using i-vector/x-vector approaches along with Bi-LSTMs, which were trained to discriminate speakers, irrespective of the language. Instead of exploring the design space manually, we propose a neural architecture search for multilingual speaker verification suitable f… ▽ More

    Submitted 8 August, 2024; originally announced August 2024.

  38. arXiv:2407.21075  [pdf, other

    cs.AI cs.CL cs.LG

    Apple Intelligence Foundation Language Models

    Authors: Tom Gunter, Zirui Wang, Chong Wang, Ruoming Pang, Andy Narayanan, Aonan Zhang, Bowen Zhang, Chen Chen, Chung-Cheng Chiu, David Qiu, Deepak Gopinath, Dian Ang Yap, Dong Yin, Feng Nan, Floris Weers, Guoli Yin, Haoshuo Huang, Jianyu Wang, Jiarui Lu, John Peebles, Ke Ye, Mark Lee, Nan Du, Qibin Chen, Quentin Keunebroek , et al. (130 additional authors not shown)

    Abstract: We present foundation language models developed to power Apple Intelligence features, including a ~3 billion parameter model designed to run efficiently on devices and a large server-based language model designed for Private Cloud Compute. These models are designed to perform a wide range of tasks efficiently, accurately, and responsibly. This report describes the model architecture, the data used… ▽ More

    Submitted 29 July, 2024; originally announced July 2024.

  39. arXiv:2407.21028  [pdf, other

    q-bio.BM cs.LG

    Antibody DomainBed: Out-of-Distribution Generalization in Therapeutic Protein Design

    Authors: Nataša Tagasovska, Ji Won Park, Matthieu Kirchmeyer, Nathan C. Frey, Andrew Martin Watkins, Aya Abdelsalam Ismail, Arian Rokkum Jamasb, Edith Lee, Tyler Bryson, Stephen Ra, Kyunghyun Cho

    Abstract: Machine learning (ML) has demonstrated significant promise in accelerating drug design. Active ML-guided optimization of therapeutic molecules typically relies on a surrogate model predicting the target property of interest. The model predictions are used to determine which designs to evaluate in the lab, and the model is updated on the new measurements to inform the next cycle of decisions. A key… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

  40. arXiv:2407.14499  [pdf, other

    cs.CV cs.AI cs.LG

    Discover-then-Name: Task-Agnostic Concept Bottlenecks via Automated Concept Discovery

    Authors: Sukrut Rao, Sweta Mahajan, Moritz Böhle, Bernt Schiele

    Abstract: Concept Bottleneck Models (CBMs) have recently been proposed to address the 'black-box' problem of deep neural networks, by first mapping images to a human-understandable concept space and then linearly combining concepts for classification. Such models typically require first coming up with a set of concepts relevant to the task and then aligning the representations of a feature extractor to map… ▽ More

    Submitted 12 August, 2024; v1 submitted 19 July, 2024; originally announced July 2024.

    Comments: 40 pages, 21 figures, 6 tables, European Conference on Computer Vision (ECCV) 2024

  41. arXiv:2407.04976  [pdf, other

    cs.DS

    Congestion-Approximators from the Bottom Up

    Authors: Jason Li, Satish Rao, Di Wang

    Abstract: We develop a novel algorithm to construct a congestion-approximator with polylogarithmic quality on a capacitated, undirected graph in nearly-linear time. Our approach is the first *bottom-up* hierarchical construction, in contrast to previous *top-down* approaches including that of Racke, Shah, and Taubig (SODA 2014), the only other construction achieving polylogarithmic quality that is implement… ▽ More

    Submitted 13 January, 2025; v1 submitted 6 July, 2024; originally announced July 2024.

    Comments: SODA 2025, 46 pages. Fix error in Lemma 4.7

  42. arXiv:2407.03460  [pdf, other

    cs.CL cs.AI

    Collaborative Quest Completion with LLM-driven Non-Player Characters in Minecraft

    Authors: Sudha Rao, Weijia Xu, Michael Xu, Jorge Leandro, Ken Lobb, Gabriel DesGarennes, Chris Brockett, Bill Dolan

    Abstract: The use of generative AI in video game development is on the rise, and as the conversational and other capabilities of large language models continue to improve, we expect LLM-driven non-player characters (NPCs) to become widely deployed. In this paper, we seek to understand how human players collaborate with LLM-driven NPCs to accomplish in-game goals. We design a minigame within Minecraft where… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: Accepted at Wordplay workshop at ACL 2024

    Journal ref: ACL 2024

  43. arXiv:2406.14373  [pdf, other

    cs.AI cs.CL cs.CY cs.HC cs.MA

    Artificial Leviathan: Exploring Social Evolution of LLM Agents Through the Lens of Hobbesian Social Contract Theory

    Authors: Gordon Dai, Weijia Zhang, Jinhan Li, Siqi Yang, Chidera Onochie lbe, Srihas Rao, Arthur Caetano, Misha Sra

    Abstract: The emergence of Large Language Models (LLMs) and advancements in Artificial Intelligence (AI) offer an opportunity for computational social science research at scale. Building upon prior explorations of LLM agent design, our work introduces a simulated agent society where complex social relationships dynamically form and evolve over time. Agents are imbued with psychological drives and placed in… ▽ More

    Submitted 1 July, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

  44. arXiv:2406.13384  [pdf, other

    cs.SD cs.CV cs.MM eess.AS

    Straight Through Gumbel Softmax Estimator based Bimodal Neural Architecture Search for Audio-Visual Deepfake Detection

    Authors: Aravinda Reddy PN, Raghavendra Ramachandra, Krothapalli Sreenivasa Rao, Pabitra Mitra, Vinod Rathod

    Abstract: Deepfakes are a major security risk for biometric authentication. This technology creates realistic fake videos that can impersonate real people, fooling systems that rely on facial features and voice patterns for identification. Existing multimodal deepfake detectors rely on conventional fusion methods, such as majority rule and ensemble voting, which often struggle to adapt to changing data char… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  45. arXiv:2406.07559  [pdf, other

    cs.CR eess.SY

    After the Breach: Incident Response within Enterprises

    Authors: Sumanth Rao

    Abstract: Enterprises are constantly under attack from sophisticated adversaries. These adversaries use a variety of techniques to first gain access to the enterprise, then spread laterally inside its networks, establish persistence, and finally exfiltrate sensitive data, or hold it for ransom. While historically, enterprises have used different Incident Response systems that monitor hosts, servers, or netw… ▽ More

    Submitted 13 June, 2024; v1 submitted 30 April, 2024; originally announced June 2024.

  46. arXiv:2406.04482  [pdf, other

    cs.CL cs.AI cs.HC cs.SE

    Automatic Bug Detection in LLM-Powered Text-Based Games Using LLMs

    Authors: Claire Jin, Sudha Rao, Xiangyu Peng, Portia Botchway, Jessica Quaye, Chris Brockett, Bill Dolan

    Abstract: Advancements in large language models (LLMs) are revolutionizing interactive game design, enabling dynamic plotlines and interactions between players and non-player characters (NPCs). However, LLMs may exhibit flaws such as hallucinations, forgetfulness, or misinterpretations of prompts, causing logical inconsistencies and unexpected deviations from intended designs. Automated techniques for detec… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Accepted for publication in Findings of the Association for Computational Linguistics: ACL 2024

  47. arXiv:2405.11070  [pdf, other

    cs.AI cs.CL cs.LG

    Jill Watson: A Virtual Teaching Assistant powered by ChatGPT

    Authors: Karan Taneja, Pratyusha Maiti, Sandeep Kakar, Pranav Guruprasad, Sanjeev Rao, Ashok K. Goel

    Abstract: Conversational AI agents often require extensive datasets for training that are not publicly released, are limited to social chit-chat or handling a specific domain, and may not be easily extended to accommodate the latest advances in AI technologies. This paper introduces Jill Watson, a conversational Virtual Teaching Assistant (VTA) leveraging the capabilities of ChatGPT. Jill Watson based on Ch… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

  48. arXiv:2405.08927  [pdf, ps, other

    cs.DS

    Expanderizing Higher Order Random Walks

    Authors: Vedat Levi Alev, Shravas Rao

    Abstract: We study a variant of the down-up and up-down walks over an $n$-partite simplicial complex, which we call expanderized higher order random walks -- where the sequence of updated coordinates correspond to the sequence of vertices visited by a random walk over an auxiliary expander graph $H$. When $H$ is the clique, this random walk reduces to the usual down-up walk and when $H$ is the directed cycl… ▽ More

    Submitted 3 June, 2024; v1 submitted 14 May, 2024; originally announced May 2024.

  49. arXiv:2404.17027  [pdf, other

    cs.CL cs.AI

    Player-Driven Emergence in LLM-Driven Game Narrative

    Authors: Xiangyu Peng, Jessica Quaye, Sudha Rao, Weijia Xu, Portia Botchway, Chris Brockett, Nebojsa Jojic, Gabriel DesGarennes, Ken Lobb, Michael Xu, Jorge Leandro, Claire Jin, Bill Dolan

    Abstract: We explore how interaction with large language models (LLMs) can give rise to emergent behaviors, empowering players to participate in the evolution of game narratives. Our testbed is a text-adventure game in which players attempt to solve a mystery under a fixed narrative premise, but can freely interact with non-player characters generated by GPT-4, a large language model. We recruit 28 gamers t… ▽ More

    Submitted 3 June, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: Accepted at IEEE Conference on Games 2024

    Journal ref: IEEE Conference on Games 2024

  50. arXiv:2404.12679  [pdf, other

    cs.CV cs.CR

    MLSD-GAN -- Generating Strong High Quality Face Morphing Attacks using Latent Semantic Disentanglement

    Authors: Aravinda Reddy PN, Raghavendra Ramachandra, Krothapalli Sreenivasa Rao, Pabitra Mitra

    Abstract: Face-morphing attacks are a growing concern for biometric researchers, as they can be used to fool face recognition systems (FRS). These attacks can be generated at the image level (supervised) or representation level (unsupervised). Previous unsupervised morphing attacks have relied on generative adversarial networks (GANs). More recently, researchers have used linear interpolation of StyleGAN-en… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.