Skip to main content

Showing 1–50 of 480 results for author: Varshney

Searching in archive cs. Search in all archives.
.
  1. arXiv:2509.18213  [pdf, ps, other

    math.OC cs.LG

    Joint Cooperative and Non-Cooperative Localization in WSNs with Distributed Scaled Proximal ADMM Algorithms

    Authors: Qiaojia Zhu, Xiaojing Shen, Haiqi Liu, Pramod K. Varshney

    Abstract: Cooperative and non-cooperative localization frequently arise together in wireless sensor networks, particularly when sensor positions are uncertain and targets are unable to communicate with the network. While joint processing can eliminate the delay in target estimation found in sequential approaches, it introduces complex variable coupling, posing challenges in both modeling and optimization. T… ▽ More

    Submitted 21 September, 2025; originally announced September 2025.

  2. arXiv:2509.13332  [pdf, ps, other

    cs.AI cs.CL

    Explicit Reasoning Makes Better Judges: A Systematic Study on Accuracy, Efficiency, and Robustness

    Authors: Pratik Jayarao, Himanshu Gupta, Neeraj Varshney, Chaitanya Dwivedi

    Abstract: As Large Language Models (LLMs) are increasingly adopted as automated judges in benchmarking and reward modeling, ensuring their reliability, efficiency, and robustness has become critical. In this work, we present a systematic comparison of "thinking" and "non-thinking" LLMs in the LLM-as-a-judge paradigm using open-source Qwen 3 models of relatively small sizes (0.6B, 1.7B, and 4B parameters). W… ▽ More

    Submitted 9 September, 2025; originally announced September 2025.

  3. arXiv:2509.08422  [pdf, ps, other

    cs.CV cs.LG

    LD-ViCE: Latent Diffusion Model for Video Counterfactual Explanations

    Authors: Payal Varshney, Adriano Lucieri, Christoph Balada, Sheraz Ahmed, Andreas Dengel

    Abstract: Video-based AI systems are increasingly adopted in safety-critical domains such as autonomous driving and healthcare. However, interpreting their decisions remains challenging due to the inherent spatiotemporal complexity of video data and the opacity of deep learning models. Existing explanation techniques often suffer from limited temporal coherence, insufficient robustness, and a lack of action… ▽ More

    Submitted 23 September, 2025; v1 submitted 10 September, 2025; originally announced September 2025.

    Comments: 30 pages

  4. arXiv:2508.15025  [pdf, ps, other

    cs.LG eess.SY

    Federated Nonlinear System Identification

    Authors: Omkar Tupe, Max Hartman, Lav R. Varshney, Saurav Prakash

    Abstract: We consider federated learning of linearly-parameterized nonlinear systems. We establish theoretical guarantees on the effectiveness of federated nonlinear system identification compared to centralized approaches, demonstrating that the convergence rate improves as the number of clients increases. Although the convergence rates in the linear and nonlinear cases differ only by a constant, this cons… ▽ More

    Submitted 24 August, 2025; v1 submitted 20 August, 2025; originally announced August 2025.

  5. arXiv:2508.14444  [pdf, ps, other

    cs.CL cs.AI cs.LG

    NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

    Authors: NVIDIA, :, Aarti Basant, Abhijit Khairnar, Abhijit Paithankar, Abhinav Khattar, Adithya Renduchintala, Aditya Malte, Akhiad Bercovich, Akshay Hazare, Alejandra Rico, Aleksander Ficek, Alex Kondratenko, Alex Shaposhnikov, Alexander Bukharin, Ali Taghibakhshi, Amelia Barton, Ameya Sunil Mahabaleshwarkar, Amy Shen, Andrew Tao, Ann Guan, Anna Shors, Anubhav Mandarwal, Arham Mehta, Arun Venkatesan , et al. (192 additional authors not shown)

    Abstract: We introduce Nemotron-Nano-9B-v2, a hybrid Mamba-Transformer language model designed to increase throughput for reasoning workloads while achieving state-of-the-art accuracy compared to similarly-sized models. Nemotron-Nano-9B-v2 builds on the Nemotron-H architecture, in which the majority of the self-attention layers in the common Transformer architecture are replaced with Mamba-2 layers, to achi… ▽ More

    Submitted 2 September, 2025; v1 submitted 20 August, 2025; originally announced August 2025.

  6. arXiv:2508.10649  [pdf, ps, other

    cs.LG cs.CV

    Geospatial Diffusion for Land Cover Imperviousness Change Forecasting

    Authors: Debvrat Varshney, Vibhas Vats, Bhartendu Pandey, Christa Brelsford, Philipe Dias

    Abstract: Land cover, both present and future, has a significant effect on several important Earth system processes. For example, impervious surfaces heat up and speed up surface water runoff and reduce groundwater infiltration, with concomitant effects on regional hydrology and flood risk. While regional Earth System models have increasing skill at forecasting hydrologic and atmospheric processes at high r… ▽ More

    Submitted 14 August, 2025; originally announced August 2025.

  7. arXiv:2507.14492  [pdf, ps, other

    cs.LG stat.ML

    Glitches in Decision Tree Ensemble Models

    Authors: Satyankar Chandra, Ashutosh Gupta, Kaushik Mallik, Krishna Shankaranarayanan, Namrita Varshney

    Abstract: Many critical decision-making tasks are now delegated to machine-learned models, and it is imperative that their decisions are trustworthy and reliable, and their outputs are consistent across similar inputs. We identify a new source of unreliable behaviors-called glitches-which may significantly impair the reliability of AI models having steep decision boundaries. Roughly speaking, glitches are s… ▽ More

    Submitted 19 July, 2025; originally announced July 2025.

  8. arXiv:2507.12185  [pdf, ps, other

    cs.CR

    Exploiting Jailbreaking Vulnerabilities in Generative AI to Bypass Ethical Safeguards for Facilitating Phishing Attacks

    Authors: Rina Mishra, Gaurav Varshney

    Abstract: The advent of advanced Generative AI (GenAI) models such as DeepSeek and ChatGPT has significantly reshaped the cybersecurity landscape, introducing both promising opportunities and critical risks. This study investigates how GenAI powered chatbot services can be exploited via jailbreaking techniques to bypass ethical safeguards, enabling the generation of phishing content, recommendation of hacki… ▽ More

    Submitted 16 July, 2025; originally announced July 2025.

  9. arXiv:2507.10813  [pdf, ps, other

    cs.HC

    Static or Temporal? Semantic Scene Simplification to Aid Wayfinding in Immersive Simulations of Bionic Vision

    Authors: Justin M. Kasowski, Apurv Varshney, Michael Beyeler

    Abstract: Visual neuroprostheses (bionic eye) aim to restore a rudimentary form of vision by translating camera input into patterns of electrical stimulation. To improve scene understanding under extreme resolution and bandwidth constraints, prior work has explored computer vision techniques such as semantic segmentation and depth estimation. However, presenting all task-relevant information simultaneously… ▽ More

    Submitted 14 July, 2025; originally announced July 2025.

  10. arXiv:2507.09564  [pdf, ps, other

    cs.CR

    A Login Page Transparency and Visual Similarity Based Zero Day Phishing Defense Protocol

    Authors: Gaurav Varshney, Akanksha Raj, Divya Sangwan, Sharif Abuadbba, Rina Mishra, Yansong Gao

    Abstract: Phishing is a prevalent cyberattack that uses look-alike websites to deceive users into revealing sensitive information. Numerous efforts have been made by the Internet community and security organizations to detect, prevent, or train users to avoid falling victim to phishing attacks. Most of this research over the years has been highly diverse and application-oriented, often serving as standalone… ▽ More

    Submitted 13 July, 2025; originally announced July 2025.

  11. arXiv:2507.08019  [pdf

    cs.CL econ.GN

    Signal or Noise? Evaluating Large Language Models in Resume Screening Across Contextual Variations and Human Expert Benchmarks

    Authors: Aryan Varshney, Venkat Ram Reddy Ganuthula

    Abstract: This study investigates whether large language models (LLMs) exhibit consistent behavior (signal) or random variation (noise) when screening resumes against job descriptions, and how their performance compares to human experts. Using controlled datasets, we tested three LLMs (Claude, GPT, and Gemini) across contexts (No Company, Firm1 [MNC], Firm2 [Startup], Reduced Context) with identical and ran… ▽ More

    Submitted 7 July, 2025; originally announced July 2025.

  12. arXiv:2507.02745  [pdf, ps, other

    cs.HC

    Who's Sorry Now: User Preferences Among Rote, Empathic, and Explanatory Apologies from LLM Chatbots

    Authors: Zahra Ashktorab, Alessandra Buccella, Jason D'Cruz, Zoe Fowler, Andrew Gill, Kei Yan Leung, P. D. Magnus, John Richards, Kush R. Varshney

    Abstract: As chatbots driven by large language models (LLMs) are increasingly deployed in everyday contexts, their ability to recover from errors through effective apologies is critical to maintaining user trust and satisfaction. In a preregistered study with Prolific workers (N=162), we examine user preferences for three types of apologies (rote, explanatory, and empathic) issued in response to three categ… ▽ More

    Submitted 3 July, 2025; originally announced July 2025.

  13. arXiv:2507.00814  [pdf, ps, other

    cs.CL cs.AI cs.CY

    Many LLMs Are More Utilitarian Than One

    Authors: Anita Keshmirian, Razan Baltaji, Babak Hemmatian, Hadi Asghari, Lav R. Varshney

    Abstract: Moral judgment is integral to large language model (LLM) alignment and social reasoning. As multi-agent systems gain prominence, it becomes crucial to understand how LLMs function collectively during collaboration, compared to individual agents. In human moral judgment, group deliberation leads to a utilitarian boost: a tendency to endorse norm violations that maximize benefits for the greatest nu… ▽ More

    Submitted 1 July, 2025; originally announced July 2025.

    Comments: 9 pages, 8 Figures, 7 tables

    ACM Class: I.2.7; I.2.11

  14. arXiv:2507.00004  [pdf, ps, other

    cs.LG cs.AI cs.CY cs.PF

    A Theory of Inference Compute Scaling: Reasoning through Directed Stochastic Skill Search

    Authors: Austin R. Ellis-Mohr, Anuj K. Nayak, Lav R. Varshney

    Abstract: Large language models (LLMs) demand considerable computational, energy, and financial resources during both training and deployment. While scaling laws for training have guided much of the field's recent progress, inference costs now represent a significant and growing component of the overall resource burden, particularly for reasoning-focused models. Existing characterizations of compute-optimal… ▽ More

    Submitted 10 July, 2025; v1 submitted 10 June, 2025; originally announced July 2025.

  15. arXiv:2506.20916  [pdf, ps, other

    cs.LG

    Explainable AI for Radar Resource Management: Modified LIME in Deep Reinforcement Learning

    Authors: Ziyang Lu, M. Cenk Gursoy, Chilukuri K. Mohan, Pramod K. Varshney

    Abstract: Deep reinforcement learning has been extensively studied in decision-making processes and has demonstrated superior performance over conventional approaches in various fields, including radar resource management (RRM). However, a notable limitation of neural networks is their ``black box" nature and recent research work has increasingly focused on explainable AI (XAI) techniques to describe the ra… ▽ More

    Submitted 25 June, 2025; originally announced June 2025.

  16. arXiv:2506.20853  [pdf, ps, other

    cs.LG eess.SP

    Multi-Objective Reinforcement Learning for Cognitive Radar Resource Management

    Authors: Ziyang Lu, Subodh Kalia, M. Cenk Gursoy, Chilukuri K. Mohan, Pramod K. Varshney

    Abstract: The time allocation problem in multi-function cognitive radar systems focuses on the trade-off between scanning for newly emerging targets and tracking the previously detected targets. We formulate this as a multi-objective optimization problem and employ deep reinforcement learning to find Pareto-optimal solutions and compare deep deterministic policy gradient (DDPG) and soft actor-critic (SAC) a… ▽ More

    Submitted 25 June, 2025; originally announced June 2025.

  17. arXiv:2506.20849  [pdf, ps, other

    cs.LG

    Learning-Based Resource Management in Integrated Sensing and Communication Systems

    Authors: Ziyang Lu, M. Cenk Gursoy, Chilukuri K. Mohan, Pramod K. Varshney

    Abstract: In this paper, we tackle the task of adaptive time allocation in integrated sensing and communication systems equipped with radar and communication units. The dual-functional radar-communication system's task involves allocating dwell times for tracking multiple targets and utilizing the remaining time for data transmission towards estimated target locations. We introduce a novel constrained deep… ▽ More

    Submitted 25 June, 2025; originally announced June 2025.

  18. arXiv:2506.05586  [pdf, ps, other

    cs.LG cs.AI

    CoFrNets: Interpretable Neural Architecture Inspired by Continued Fractions

    Authors: Isha Puri, Amit Dhurandhar, Tejaswini Pedapati, Kartikeyan Shanmugam, Dennis Wei, Kush R. Varshney

    Abstract: In recent years there has been a considerable amount of research on local post hoc explanations for neural networks. However, work on building interpretable neural architectures has been relatively sparse. In this paper, we present a novel neural architecture, CoFrNet, inspired by the form of continued fractions which are known to have many attractive properties in number theory, such as fast conv… ▽ More

    Submitted 5 June, 2025; originally announced June 2025.

    Journal ref: Advances in Neural Information Processing Systems (NeurIPS) 2021, vol 34, pp 21668-21690

  19. arXiv:2506.02546  [pdf, ps, other

    cs.CR

    Attention Knows Whom to Trust: Attention-based Trust Management for LLM Multi-Agent Systems

    Authors: Pengfei He, Zhenwei Dai, Xianfeng Tang, Yue Xing, Hui Liu, Jingying Zeng, Qiankun Peng, Shrivats Agrawal, Samarth Varshney, Suhang Wang, Jiliang Tang, Qi He

    Abstract: Large Language Model-based Multi-Agent Systems (LLM-MAS) have demonstrated strong capabilities in solving complex tasks but remain vulnerable when agents receive unreliable messages. This vulnerability stems from a fundamental gap: LLM agents treat all incoming messages equally without evaluating their trustworthiness. While some existing studies approach the trustworthiness, they focus on a singl… ▽ More

    Submitted 3 June, 2025; originally announced June 2025.

  20. arXiv:2506.01813  [pdf, ps, other

    cs.AI

    The Ultimate Test of Superintelligent AI Agents: Can an AI Balance Care and Control in Asymmetric Relationships?

    Authors: Djallel Bouneffouf, Matthew Riemer, Kush Varshney

    Abstract: This paper introduces the Shepherd Test, a new conceptual test for assessing the moral and relational dimensions of superintelligent artificial agents. The test is inspired by human interactions with animals, where ethical considerations about care, manipulation, and consumption arise in contexts of asymmetric power and self-preservation. We argue that AI crosses an important, and potentially dang… ▽ More

    Submitted 27 July, 2025; v1 submitted 2 June, 2025; originally announced June 2025.

  21. arXiv:2506.00359  [pdf, ps, other

    cs.CR

    Keeping an Eye on LLM Unlearning: The Hidden Risk and Remedy

    Authors: Jie Ren, Zhenwei Dai, Xianfeng Tang, Yue Xing, Shenglai Zeng, Hui Liu, Jingying Zeng, Qiankun Peng, Samarth Varshney, Suhang Wang, Qi He, Charu C. Aggarwal, Hui Liu

    Abstract: Although Large Language Models (LLMs) have demonstrated impressive capabilities across a wide range of tasks, growing concerns have emerged over the misuse of sensitive, copyrighted, or harmful data during training. To address these concerns, unlearning techniques have been developed to remove the influence of specific data without retraining from scratch. However, this paper reveals a critical vu… ▽ More

    Submitted 30 May, 2025; originally announced June 2025.

  22. arXiv:2505.20841  [pdf, ps, other

    cs.CL

    Concealment of Intent: A Game-Theoretic Analysis

    Authors: Xinbo Wu, Abhishek Umrawal, Lav R. Varshney

    Abstract: As large language models (LLMs) grow more capable, concerns about their safe deployment have also grown. Although alignment mechanisms have been introduced to deter misuse, they remain vulnerable to carefully designed adversarial prompts. In this work, we present a scalable attack strategy: intent-hiding adversarial prompting, which conceals malicious intent through the composition of skills. We d… ▽ More

    Submitted 18 August, 2025; v1 submitted 27 May, 2025; originally announced May 2025.

  23. arXiv:2505.20737  [pdf, ps, other

    cs.AI

    RRO: LLM Agent Optimization Through Rising Reward Trajectories

    Authors: Zilong Wang, Jingfeng Yang, Sreyashi Nag, Samarth Varshney, Xianfeng Tang, Haoming Jiang, Jingbo Shang, Sheikh Muhammad Sarwar

    Abstract: Large language models (LLMs) have exhibited extraordinary performance in a variety of tasks while it remains challenging for them to solve complex multi-step tasks as agents. In practice, agents sensitive to the outcome of certain key steps which makes them likely to fail the task because of a subtle mistake in the planning trajectory. Recent approaches resort to calibrating the reasoning process… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

    Comments: preprint

  24. arXiv:2505.19430  [pdf, ps, other

    cs.CL cs.AI

    Deriving Strategic Market Insights with Large Language Models: A Benchmark for Forward Counterfactual Generation

    Authors: Keane Ong, Rui Mao, Deeksha Varshney, Paul Pu Liang, Erik Cambria, Gianmarco Mengaldo

    Abstract: Counterfactual reasoning typically involves considering alternatives to actual events. While often applied to understand past events, a distinct form-forward counterfactual reasoning-focuses on anticipating plausible future developments. This type of reasoning is invaluable in dynamic financial markets, where anticipating market developments can powerfully unveil potential risks and opportunities… ▽ More

    Submitted 5 June, 2025; v1 submitted 25 May, 2025; originally announced May 2025.

  25. arXiv:2505.18422  [pdf

    cs.CY

    A Task-Driven Human-AI Collaboration: When to Automate, When to Collaborate, When to Challenge

    Authors: Saleh Afroogh, Kush R. Varshney, Jason D'Cruz

    Abstract: According to several empirical investigations, despite enhancing human capabilities, human-AI cooperation frequently falls short of expectations and fails to reach true synergy. We propose a task-driven framework that reverses prevalent approaches by assigning AI roles according to how the task's requirements align with the capabilities of AI technology. Three major AI roles are identified through… ▽ More

    Submitted 3 July, 2025; v1 submitted 23 May, 2025; originally announced May 2025.

  26. arXiv:2505.07073  [pdf, ps, other

    cs.CV cs.LG

    Discovering Concept Directions from Diffusion-based Counterfactuals via Latent Clustering

    Authors: Payal Varshney, Adriano Lucieri, Christoph Balada, Andreas Dengel, Sheraz Ahmed

    Abstract: Concept-based explanations have emerged as an effective approach within Explainable Artificial Intelligence, enabling interpretable insights by aligning model decisions with human-understandable concepts. However, existing methods rely on computationally intensive procedures and struggle to efficiently capture complex, semantic concepts. Recently, the Concept Discovery through Latent Diffusion-bas… ▽ More

    Submitted 11 May, 2025; originally announced May 2025.

  27. arXiv:2505.00949  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Llama-Nemotron: Efficient Reasoning Models

    Authors: Akhiad Bercovich, Itay Levy, Izik Golan, Mohammad Dabbah, Ran El-Yaniv, Omri Puny, Ido Galil, Zach Moshe, Tomer Ronen, Najeeb Nabwani, Ido Shahaf, Oren Tropp, Ehud Karpas, Ran Zilberstein, Jiaqi Zeng, Soumye Singhal, Alexander Bukharin, Yian Zhang, Tugrul Konuk, Gerald Shen, Ameya Sunil Mahabaleshwarkar, Bilal Kartal, Yoshi Suhara, Olivier Delalleau, Zijia Chen , et al. (111 additional authors not shown)

    Abstract: We introduce the Llama-Nemotron series of models, an open family of heterogeneous reasoning models that deliver exceptional reasoning capabilities, inference efficiency, and an open license for enterprise use. The family comes in three sizes -- Nano (8B), Super (49B), and Ultra (253B) -- and performs competitively with state-of-the-art reasoning models such as DeepSeek-R1 while offering superior i… ▽ More

    Submitted 9 September, 2025; v1 submitted 1 May, 2025; originally announced May 2025.

  28. arXiv:2505.00786  [pdf, other

    cs.CV

    AI-ready Snow Radar Echogram Dataset (SRED) for climate change monitoring

    Authors: Oluwanisola Ibikunle, Hara Talasila, Debvrat Varshney, Jilu Li, John Paden, Maryam Rahnemoonfar

    Abstract: Tracking internal layers in radar echograms with high accuracy is essential for understanding ice sheet dynamics and quantifying the impact of accelerated ice discharge in Greenland and other polar regions due to contemporary global climate warming. Deep learning algorithms have become the leading approach for automating this task, but the absence of a standardized and well-annotated echogram data… ▽ More

    Submitted 1 May, 2025; originally announced May 2025.

  29. arXiv:2504.20090  [pdf, other

    cs.AI cs.IR cs.LG

    Spark: A System for Scientifically Creative Idea Generation

    Authors: Aishik Sanyal, Samuel Schapiro, Sumuk Shashidhar, Royce Moon, Lav R. Varshney, Dilek Hakkani-Tur

    Abstract: Recently, large language models (LLMs) have shown promising abilities to generate novel research ideas in science, a direction which coincides with many foundational principles in computational creativity (CC). In light of these developments, we present an idea generation system named Spark that couples retrieval-augmented idea generation using LLMs with a reviewer model named Judge trained on 600… ▽ More

    Submitted 21 May, 2025; v1 submitted 25 April, 2025; originally announced April 2025.

    Comments: Accepted at ICCC 2025

  30. arXiv:2504.19345  [pdf, other

    cs.HC

    Beyond Physical Reach: Comparing Head- and Cane-Mounted Cameras for Last-Mile Navigation by Blind Users

    Authors: Apurv Varshney, Lucas Nadolskis, Tobias Höllerer, Michael Beyeler

    Abstract: Blind individuals face persistent challenges in last-mile navigation, including locating entrances, identifying obstacles, and navigating complex or cluttered spaces. Although wearable cameras are increasingly used in assistive systems, there has been no systematic, vantage-focused comparison to guide their design. This paper addresses that gap through a two-part investigation. First, we surveyed… ▽ More

    Submitted 27 April, 2025; originally announced April 2025.

  31. arXiv:2504.19327  [pdf, ps, other

    cs.CV cs.AI

    DeepInsert: Early Layer Bypass for Efficient and Performant Multimodal Understanding

    Authors: Moulik Choraria, Xinbo Wu, Akhil Bhimaraju, Nitesh Sekhar, Yue Wu, Xu Zhang, Prateek Singhal, Lav R. Varshney

    Abstract: The hyperscaling of data and parameter count in transformer models is yielding diminishing performance improvement, especially when weighed against training costs. Such plateauing underlines a growing need for more efficient finetuning and inference, without sacrificing performance. This is particularly pressing for multimodal learning, where the overhead of processing multimodal tokens alongside… ▽ More

    Submitted 21 September, 2025; v1 submitted 27 April, 2025; originally announced April 2025.

  32. arXiv:2504.19066  [pdf, other

    cs.CL cs.AI cs.LG physics.ao-ph

    ClimaEmpact: Domain-Aligned Small Language Models and Datasets for Extreme Weather Analytics

    Authors: Deeksha Varshney, Keane Ong, Rui Mao, Erik Cambria, Gianmarco Mengaldo

    Abstract: Accurate assessments of extreme weather events are vital for research and policy, yet localized and granular data remain scarce in many parts of the world. This data gap limits our ability to analyze potential outcomes and implications of extreme weather events, hindering effective decision-making. Large Language Models (LLMs) can process vast amounts of unstructured text data, extract meaningful… ▽ More

    Submitted 26 April, 2025; originally announced April 2025.

  33. arXiv:2504.18687  [pdf, ps, other

    cs.AI

    Transformational Creativity in Science: A Graphical Theory

    Authors: Samuel Schapiro, Jonah Black, Lav R. Varshney

    Abstract: Creative processes are typically divided into three types: combinatorial, exploratory, and transformational. Here, we provide a graphical theory of transformational scientific creativity, synthesizing Boden's insight that transformational creativity arises from changes in the "enabling constraints" of a conceptual space and Kuhn's structure of scientific revolutions as resulting from paradigm shif… ▽ More

    Submitted 20 May, 2025; v1 submitted 25 April, 2025; originally announced April 2025.

    Comments: Accepted at ICCC 2025

  34. arXiv:2504.16140  [pdf, other

    cs.LG cs.AI

    SparseJEPA: Sparse Representation Learning of Joint Embedding Predictive Architectures

    Authors: Max Hartman, Lav Varshney

    Abstract: Joint Embedding Predictive Architectures (JEPA) have emerged as a powerful framework for learning general-purpose representations. However, these models often lack interpretability and suffer from inefficiencies due to dense embedding representations. We propose SparseJEPA, an extension that integrates sparse representation learning into the JEPA framework to enhance the quality of learned represe… ▽ More

    Submitted 21 April, 2025; originally announced April 2025.

  35. arXiv:2504.14143  [pdf, other

    cs.LG

    Predicting Stress and Damage in Carbon Fiber-Reinforced Composites Deformation Process using Composite U-Net Surrogate Model

    Authors: Zeping Chen, Marwa Yacouti, Maryam Shakiba, Jian-Xun Wang, Tengfei Luo, Vikas Varshney

    Abstract: Carbon fiber-reinforced composites (CFRC) are pivotal in advanced engineering applications due to their exceptional mechanical properties. A deep understanding of CFRC behavior under mechanical loading is essential for optimizing performance in demanding applications such as aerospace structures. While traditional Finite Element Method (FEM) simulations, including advanced techniques like Interfac… ▽ More

    Submitted 18 April, 2025; originally announced April 2025.

  36. arXiv:2504.03624  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models

    Authors: NVIDIA, :, Aaron Blakeman, Aarti Basant, Abhinav Khattar, Adithya Renduchintala, Akhiad Bercovich, Aleksander Ficek, Alexis Bjorlin, Ali Taghibakhshi, Amala Sanjay Deshmukh, Ameya Sunil Mahabaleshwarkar, Andrew Tao, Anna Shors, Ashwath Aithal, Ashwin Poojary, Ayush Dattagupta, Balaram Buddharaju, Bobby Chen, Boris Ginsburg, Boxin Wang, Brandon Norick, Brian Butterfield, Bryan Catanzaro, Carlo del Mundo , et al. (176 additional authors not shown)

    Abstract: As inference-time scaling becomes critical for enhanced reasoning capabilities, it is increasingly becoming important to build models that are efficient to infer. We introduce Nemotron-H, a family of 8B and 56B/47B hybrid Mamba-Transformer models designed to reduce inference cost for a given accuracy level. To achieve this goal, we replace the majority of self-attention layers in the common Transf… ▽ More

    Submitted 5 September, 2025; v1 submitted 4 April, 2025; originally announced April 2025.

  37. arXiv:2503.13518  [pdf, other

    cs.CL cs.AI

    Examples as the Prompt: A Scalable Approach for Efficient LLM Adaptation in E-Commerce

    Authors: Jingying Zeng, Zhenwei Dai, Hui Liu, Samarth Varshney, Zhiji Liu, Chen Luo, Zhen Li, Qi He, Xianfeng Tang

    Abstract: Prompting LLMs offers an efficient way to guide output generation without explicit model training. In the e-commerce domain, prompting-based applications are widely used for tasks such as query understanding, recommender systems, and customer support. However, adapting LLMs to different tasks often requires extensive prompt engineering by domain experts, along with frequent updates to align with e… ▽ More

    Submitted 14 March, 2025; originally announced March 2025.

  38. arXiv:2503.06487  [pdf, other

    cs.CR

    A Study of Effectiveness of Brand Domain Identification Features for Phishing Detection in 2025

    Authors: Rina Mishra, Gaurav Varshney

    Abstract: Phishing websites continue to pose a significant security challenge, making the development of robust detection mechanisms essential. Brand Domain Identification (BDI) serves as a crucial step in many phishing detection approaches. This study systematically evaluates the effectiveness of features employed over the past decade for BDI, focusing on their weighted importance in phishing detection as… ▽ More

    Submitted 9 March, 2025; originally announced March 2025.

  39. arXiv:2503.05780  [pdf, ps, other

    cs.CY cs.HC

    AI Risk Atlas: Taxonomy and Tooling for Navigating AI Risks and Resources

    Authors: Frank Bagehorn, Kristina Brimijoin, Elizabeth M. Daly, Jessica He, Michael Hind, Luis Garces-Erice, Christopher Giblin, Ioana Giurgiu, Jacquelyn Martino, Rahul Nair, David Piorkowski, Ambrish Rawat, John Richards, Sean Rooney, Dhaval Salwala, Seshu Tirupathi, Peter Urbanetz, Kush R. Varshney, Inge Vejsbjerg, Mira L. Wolf-Bauwens

    Abstract: The rapid evolution of generative AI has expanded the breadth of risks associated with AI systems. While various taxonomies and frameworks exist to classify these risks, the lack of interoperability between them creates challenges for researchers, practitioners, and policymakers seeking to operationalise AI governance. To address this gap, we introduce the AI Risk Atlas, a structured taxonomy that… ▽ More

    Submitted 9 July, 2025; v1 submitted 26 February, 2025; originally announced March 2025.

    Comments: 4.5 page main text, 22 page supporting material, 2 figures

  40. arXiv:2503.04830  [pdf, other

    cs.CL cs.AI

    Cite Before You Speak: Enhancing Context-Response Grounding in E-commerce Conversational LLM-Agents

    Authors: Jingying Zeng, Hui Liu, Zhenwei Dai, Xianfeng Tang, Chen Luo, Samarth Varshney, Zhen Li, Qi He

    Abstract: With the advancement of conversational large language models (LLMs), several LLM-based Conversational Shopping Agents (CSA) have been developed to help customers smooth their online shopping. The primary objective in building an engaging and trustworthy CSA is to ensure the agent's responses about product factoids are accurate and factually grounded. However, two challenges remain. First, LLMs pro… ▽ More

    Submitted 13 May, 2025; v1 submitted 5 March, 2025; originally announced March 2025.

  41. arXiv:2503.04292  [pdf, other

    cs.CR

    A Study on Malicious Browser Extensions in 2025

    Authors: Shreya Singh, Gaurav Varshney, Tarun Kumar Singh, Vidhi Mishra

    Abstract: Browser extensions are additional tools developed by third parties that integrate with web browsers to extend their functionality beyond standard capabilities. However, the browser extension platform is increasingly being exploited by hackers to launch sophisticated cyber threats. These threats encompass a wide range of malicious activities, including but not limited to phishing, spying, Distribut… ▽ More

    Submitted 6 March, 2025; originally announced March 2025.

  42. arXiv:2503.01395  [pdf, ps, other

    cs.CR

    Jailbreaking Generative AI: Empowering Novices to Conduct Phishing Attacks

    Authors: Rina Mishra, Gaurav Varshney, Shreya Singh

    Abstract: The rapid advancements in generative AI models, such as ChatGPT, have introduced both significant benefits and new risks within the cybersecurity landscape. This paper investigates the potential misuse of the latest AI model, ChatGPT-4o Mini, in facilitating social engineering attacks, with a particular focus on phishing, one of the most pressing cybersecurity threats today. While existing literat… ▽ More

    Submitted 3 March, 2025; originally announced March 2025.

  43. arXiv:2503.00237  [pdf, other

    cs.AI

    Agentic AI Needs a Systems Theory

    Authors: Erik Miehling, Karthikeyan Natesan Ramamurthy, Kush R. Varshney, Matthew Riemer, Djallel Bouneffouf, John T. Richards, Amit Dhurandhar, Elizabeth M. Daly, Michael Hind, Prasanna Sattigeri, Dennis Wei, Ambrish Rawat, Jasmina Gajcin, Werner Geyer

    Abstract: The endowment of AI with reasoning capabilities and some degree of agency is widely viewed as a path toward more capable and generalizable systems. Our position is that the current development of agentic AI requires a more holistic, systems-theoretic perspective in order to fully understand their capabilities and mitigate any emergent risks. The primary motivation for our position is that AI devel… ▽ More

    Submitted 28 February, 2025; originally announced March 2025.

  44. arXiv:2502.15821  [pdf, ps, other

    cs.CL cs.AI

    Towards Robust ESG Analysis Against Greenwashing Risks: Aspect-Action Analysis with Cross-Category Generalization

    Authors: Keane Ong, Rui Mao, Deeksha Varshney, Erik Cambria, Gianmarco Mengaldo

    Abstract: Sustainability reports are key for evaluating companies' environmental, social and governance, ESG performance, but their content is increasingly obscured by greenwashing - sustainability claims that are misleading, exaggerated, and fabricated. Yet, existing NLP approaches for ESG analysis lack robustness against greenwashing risks, often extracting insights that reflect misleading or exaggerated… ▽ More

    Submitted 5 June, 2025; v1 submitted 19 February, 2025; originally announced February 2025.

    Comments: Proceedings of the Association for Computational Linguistics Main Conference (ACL 2025)

  45. arXiv:2502.15436  [pdf, other

    cs.LG cs.AI cs.CL cs.DC

    Fed-SB: A Silver Bullet for Extreme Communication Efficiency and Performance in (Private) Federated LoRA Fine-Tuning

    Authors: Raghav Singhal, Kaustubh Ponkshe, Rohit Vartak, Lav R. Varshney, Praneeth Vepakomma

    Abstract: Low-Rank Adaptation (LoRA) has become ubiquitous for efficiently fine-tuning foundation models. However, federated fine-tuning using LoRA is challenging due to suboptimal updates arising from traditional federated averaging of individual adapters. Existing solutions either incur prohibitively high communication cost that scales linearly with the number of clients or suffer from performance degrada… ▽ More

    Submitted 21 February, 2025; originally announced February 2025.

    Comments: Raghav Singhal and Kaustubh Ponkshe contributed equally to this work

  46. arXiv:2502.15427  [pdf, other

    cs.CR cs.LG

    Adversarial Prompt Evaluation: Systematic Benchmarking of Guardrails Against Prompt Input Attacks on LLMs

    Authors: Giulio Zizzo, Giandomenico Cornacchia, Kieran Fraser, Muhammad Zaid Hameed, Ambrish Rawat, Beat Buesser, Mark Purcell, Pin-Yu Chen, Prasanna Sattigeri, Kush Varshney

    Abstract: As large language models (LLMs) become integrated into everyday applications, ensuring their robustness and security is increasingly critical. In particular, LLMs can be manipulated into unsafe behaviour by prompts known as jailbreaks. The variety of jailbreak styles is growing, necessitating the use of external defences known as guardrails. While many jailbreak defences have been proposed, not al… ▽ More

    Submitted 21 February, 2025; originally announced February 2025.

    Comments: NeurIPS 2024, Safe Generative AI Workshop

  47. arXiv:2502.12762  [pdf, other

    cs.LG eess.SP

    One-bit Compressed Sensing using Generative Models

    Authors: Swatantra Kafle, Geethu Joseph, Pramod K. Varshney

    Abstract: This paper addresses the classical problem of one-bit compressed sensing using a deep learning-based reconstruction algorithm that leverages a trained generative model to enhance the signal reconstruction performance. The generator, a pre-trained neural network, learns to map from a low-dimensional latent space to a higher-dimensional set of sparse vectors. This generator is then used to reconstru… ▽ More

    Submitted 18 February, 2025; originally announced February 2025.

  48. arXiv:2502.10562  [pdf, other

    cs.CV cs.LG

    Detecting and Monitoring Bias for Subgroups in Breast Cancer Detection AI

    Authors: Amit Kumar Kundu, Florence X. Doo, Vaishnavi Patil, Amitabh Varshney, Joseph Jaja

    Abstract: Automated mammography screening plays an important role in early breast cancer detection. However, current machine learning models, developed on some training datasets, may exhibit performance degradation and bias when deployed in real-world settings. In this paper, we analyze the performance of high-performing AI models on two mammography datasets-the Emory Breast Imaging Dataset (EMBED) and the… ▽ More

    Submitted 14 February, 2025; originally announced February 2025.

  49. arXiv:2502.05352  [pdf, other

    cs.AI cs.DC cs.MA

    ITBench: Evaluating AI Agents across Diverse Real-World IT Automation Tasks

    Authors: Saurabh Jha, Rohan Arora, Yuji Watanabe, Takumi Yanagawa, Yinfang Chen, Jackson Clark, Bhavya Bhavya, Mudit Verma, Harshit Kumar, Hirokuni Kitahara, Noah Zheutlin, Saki Takano, Divya Pathak, Felix George, Xinbo Wu, Bekir O. Turkkan, Gerard Vanloo, Michael Nidd, Ting Dai, Oishik Chatterjee, Pranjal Gupta, Suranjana Samanta, Pooja Aggarwal, Rong Lee, Pavankumar Murali , et al. (18 additional authors not shown)

    Abstract: Realizing the vision of using AI agents to automate critical IT tasks depends on the ability to measure and understand effectiveness of proposed solutions. We introduce ITBench, a framework that offers a systematic methodology for benchmarking AI agents to address real-world IT automation tasks. Our initial release targets three key areas: Site Reliability Engineering (SRE), Compliance and Securit… ▽ More

    Submitted 7 February, 2025; originally announced February 2025.

  50. arXiv:2502.05148  [pdf, ps, other

    cs.CY cs.CL

    An Annotated Reading of 'The Singer of Tales' in the LLM Era

    Authors: Kush R. Varshney

    Abstract: The Parry-Lord oral-formulaic theory was a breakthrough in understanding how oral narrative poetry is learned, composed, and transmitted by illiterate bards. In this paper, we provide an annotated reading of the mechanism underlying this theory from the lens of large language models (LLMs) and generative artificial intelligence (AI). We point out the the similarities and differences between oral c… ▽ More

    Submitted 7 February, 2025; originally announced February 2025.