Skip to main content

Showing 1–50 of 275 results for author: Neha

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.13845  [pdf, ps, other

    cs.CY cs.AI cs.HC

    Students' Reliance on AI in Higher Education: Identifying Contributing Factors

    Authors: Griffin Pitts, Neha Rani, Weedguet Mildort, Eva-Marie Cook

    Abstract: The increasing availability and use of artificial intelligence (AI) tools in educational settings has raised concerns about students' overreliance on these technologies. Overreliance occurs when individuals accept incorrect AI-generated recommendations, often without critical evaluation, leading to flawed problem solutions and undermining learning outcomes. This study investigates potential factor… ▽ More

    Submitted 16 June, 2025; originally announced June 2025.

    ACM Class: K.3; K.4; I.2.6

  2. arXiv:2506.12103  [pdf, other

    cs.AI cs.CY cs.LG

    The Amazon Nova Family of Models: Technical Report and Model Card

    Authors: Amazon AGI, Aaron Langford, Aayush Shah, Abhanshu Gupta, Abhimanyu Bhatter, Abhinav Goyal, Abhinav Mathur, Abhinav Mohanty, Abhishek Kumar, Abhishek Sethi, Abi Komma, Abner Pena, Achin Jain, Adam Kunysz, Adam Opyrchal, Adarsh Singh, Aditya Rawal, Adok Achar Budihal Prasad, Adrià de Gispert, Agnika Kumar, Aishwarya Aryamane, Ajay Nair, Akilan M, Akshaya Iyengar, Akshaya Vishnu Kudlu Shanbhogue , et al. (761 additional authors not shown)

    Abstract: We present Amazon Nova, a new generation of state-of-the-art foundation models that deliver frontier intelligence and industry-leading price performance. Amazon Nova Pro is a highly-capable multimodal model with the best combination of accuracy, speed, and cost for a wide range of tasks. Amazon Nova Lite is a low-cost multimodal model that is lightning fast for processing images, video, documents… ▽ More

    Submitted 17 March, 2025; originally announced June 2025.

    Comments: 48 pages, 10 figures

    Report number: 20250317

  3. arXiv:2506.10910  [pdf, ps, other

    cs.CL

    Magistral

    Authors: Mistral-AI, :, Abhinav Rastogi, Albert Q. Jiang, Andy Lo, Gabrielle Berrada, Guillaume Lample, Jason Rute, Joep Barmentlo, Karmesh Yadav, Kartik Khandelwal, Khyathi Raghavi Chandu, Léonard Blier, Lucile Saulnier, Matthieu Dinot, Maxime Darrin, Neha Gupta, Roman Soletskyi, Sagar Vaze, Teven Le Scao, Yihan Wang, Adam Yang, Alexander H. Liu, Alexandre Sablayrolles, Amélie Héliou , et al. (76 additional authors not shown)

    Abstract: We introduce Magistral, Mistral's first reasoning model and our own scalable reinforcement learning (RL) pipeline. Instead of relying on existing implementations and RL traces distilled from prior models, we follow a ground up approach, relying solely on our own models and infrastructure. Notably, we demonstrate a stack that enabled us to explore the limits of pure RL training of LLMs, present a s… ▽ More

    Submitted 12 June, 2025; originally announced June 2025.

  4. arXiv:2506.05739  [pdf, ps, other

    cs.CR cs.AI

    To Protect the LLM Agent Against the Prompt Injection Attack with Polymorphic Prompt

    Authors: Zhilong Wang, Neha Nagaraja, Lan Zhang, Hayretdin Bahsi, Pawan Patil, Peng Liu

    Abstract: LLM agents are widely used as agents for customer support, content generation, and code assistance. However, they are vulnerable to prompt injection attacks, where adversarial inputs manipulate the model's behavior. Traditional defenses like input sanitization, guard models, and guardrails are either cumbersome or ineffective. In this paper, we propose a novel, lightweight defense mechanism called… ▽ More

    Submitted 6 June, 2025; originally announced June 2025.

    Comments: To appear in the Industry Track of the 55th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN 2025)

  5. arXiv:2505.09970  [pdf, ps, other

    cs.AI

    Pre-Act: Multi-Step Planning and Reasoning Improves Acting in LLM Agents

    Authors: Mrinal Rawat, Ambuje Gupta, Rushil Goomer, Alessandro Di Bari, Neha Gupta, Roberto Pieraccini

    Abstract: The ReAct (Reasoning + Action) capability in large language models (LLMs) has become the foundation of modern agentic systems. Recent LLMs, such as DeepSeek-R1 and OpenAI o1/o3, exemplify this by emphasizing reasoning through the generation of ample intermediate tokens, which help build a strong premise before producing the final output tokens. In this paper, we introduce Pre-Act, a novel approach… ▽ More

    Submitted 18 May, 2025; v1 submitted 15 May, 2025; originally announced May 2025.

  6. arXiv:2504.21187  [pdf, other

    cs.LG

    LIFT: LLM-Based Pragma Insertion for HLS via GNN Supervised Fine-Tuning

    Authors: Neha Prakriya, Zijian Ding, Yizhou Sun, Jason Cong

    Abstract: FPGAs are increasingly adopted in datacenter environments for their reconfigurability and energy efficiency. High-Level Synthesis (HLS) tools have eased FPGA programming by raising the abstraction level from RTL to untimed C/C++, yet attaining high performance still demands expert knowledge and iterative manual insertion of optimization pragmas to modify the microarchitecture. To address this chal… ▽ More

    Submitted 29 April, 2025; originally announced April 2025.

  7. arXiv:2504.19457  [pdf, other

    cs.CL cs.AI

    Towards Long Context Hallucination Detection

    Authors: Siyi Liu, Kishaloy Halder, Zheng Qi, Wei Xiao, Nikolaos Pappas, Phu Mon Htut, Neha Anna John, Yassine Benajiba, Dan Roth

    Abstract: Large Language Models (LLMs) have demonstrated remarkable performance across various tasks. However, they are prone to contextual hallucination, generating information that is either unsubstantiated or contradictory to the given context. Although many studies have investigated contextual hallucinations in LLMs, addressing them in long-context inputs remains an open problem. In this work, we take a… ▽ More

    Submitted 27 April, 2025; originally announced April 2025.

  8. arXiv:2504.16942  [pdf, other

    cs.SI cs.AI cs.CV

    S2Vec: Self-Supervised Geospatial Embeddings

    Authors: Shushman Choudhury, Elad Aharoni, Chandrakumari Suvarna, Iveel Tsogsuren, Abdul Rahman Kreidieh, Chun-Ta Lu, Neha Arora

    Abstract: Scalable general-purpose representations of the built environment are crucial for geospatial artificial intelligence applications. This paper introduces S2Vec, a novel self-supervised framework for learning such geospatial embeddings. S2Vec uses the S2 Geometry library to partition large areas into discrete S2 cells, rasterizes built environment feature vectors within cells as images, and applies… ▽ More

    Submitted 10 April, 2025; originally announced April 2025.

    Comments: To be submitted to ACM Transactions on Spatial Algorithms and Systems

  9. arXiv:2504.16277  [pdf, other

    cs.LG cs.AI

    DataS^3: Dataset Subset Selection for Specialization

    Authors: Neha Hulkund, Alaa Maalouf, Levi Cai, Daniel Yang, Tsun-Hsuan Wang, Abigail O'Neil, Timm Haucke, Sandeep Mukherjee, Vikram Ramaswamy, Judy Hansen Shen, Gabriel Tseng, Mike Walmsley, Daniela Rus, Ken Goldberg, Hannah Kerner, Irene Chen, Yogesh Girdhar, Sara Beery

    Abstract: In many real-world machine learning (ML) applications (e.g. detecting broken bones in x-ray images, detecting species in camera traps), in practice models need to perform well on specific deployments (e.g. a specific hospital, a specific national park) rather than the domain broadly. However, deployments often have imbalanced, unique data distributions. Discrepancy between the training distributio… ▽ More

    Submitted 22 April, 2025; originally announced April 2025.

  10. arXiv:2504.06011  [pdf, other

    cs.CL

    Llama-3-Nanda-10B-Chat: An Open Generative Large Language Model for Hindi

    Authors: Monojit Choudhury, Shivam Chauhan, Rocktim Jyoti Das, Dhruv Sahnan, Xudong Han, Haonan Li, Aaryamonvikram Singh, Alok Anil Jadhav, Utkarsh Agarwal, Mukund Choudhary, Debopriyo Banerjee, Fajri Koto, Junaid Bhat, Awantika Shukla, Samujjwal Ghosh, Samta Kamboj, Onkar Pandit, Lalit Pradhan, Rahul Pal, Sunil Sahu, Soundar Doraiswamy, Parvez Mullah, Ali El Filali, Neha Sengupta, Gokul Ramakrishnan , et al. (5 additional authors not shown)

    Abstract: Developing high-quality large language models (LLMs) for moderately resourced languages presents unique challenges in data availability, model adaptation, and evaluation. We introduce Llama-3-Nanda-10B-Chat, or Nanda for short, a state-of-the-art Hindi-centric instruction-tuned generative LLM, designed to push the boundaries of open-source Hindi language models. Built upon Llama-3-8B, Nanda incorp… ▽ More

    Submitted 8 April, 2025; originally announced April 2025.

  11. arXiv:2504.02823  [pdf, other

    cs.CV eess.IV

    STING-BEE: Towards Vision-Language Model for Real-World X-ray Baggage Security Inspection

    Authors: Divya Velayudhan, Abdelfatah Ahmed, Mohamad Alansari, Neha Gour, Abderaouf Behouch, Taimur Hassan, Syed Talal Wasim, Nabil Maalej, Muzammal Naseer, Juergen Gall, Mohammed Bennamoun, Ernesto Damiani, Naoufel Werghi

    Abstract: Advancements in Computer-Aided Screening (CAS) systems are essential for improving the detection of security threats in X-ray baggage scans. However, current datasets are limited in representing real-world, sophisticated threats and concealment tactics, and existing approaches are constrained by a closed-set paradigm with predefined labels. To address these challenges, we introduce STCray, the fir… ▽ More

    Submitted 3 April, 2025; originally announced April 2025.

    Comments: Accepted at CVPR 2025

  12. arXiv:2503.19114  [pdf, other

    cs.CL cs.IR cs.LG

    Understanding and Improving Information Preservation in Prompt Compression for LLMs

    Authors: Weronika Łajewska, Momchil Hardalov, Laura Aina, Neha Anna John, Hang Su, Lluís Màrquez

    Abstract: Recent advancements in large language models (LLMs) have enabled their successful application to a broad range of tasks. However, in information-intensive tasks, the prompt length can grow fast, leading to increased computational requirements, performance degradation, and induced biases from irrelevant or redundant information. Recently, various prompt compression techniques have been introduced t… ▽ More

    Submitted 24 March, 2025; originally announced March 2025.

    Comments: 21 pages, 6 figures, 23 tables

  13. arXiv:2503.17114  [pdf, ps, other

    cs.CC

    Range Avoidance in Boolean Circuits via Turan-type Bounds

    Authors: Neha Kuntewar, Jayalal Sarma

    Abstract: Given a circuit $C : \{0,1\}^n \to \{0,1\}^m$ from a circuit class $F$, with $m > n$, finding a $y \in \{0,1\}^m$ such that $\forall x \in \{0,1\}^n$, $C(x) \ne y$, is the range avoidance problem (denoted by $F$-$avoid$). Deterministic polynomial time algorithms (even with access to $NP$ oracles) solving this problem is known to imply explicit constructions of various pseudorandom objects like har… ▽ More

    Submitted 21 March, 2025; originally announced March 2025.

    Comments: 31 pages, abstract shortened to fit in arxiv requirements

  14. arXiv:2503.12370  [pdf, other

    cs.CL

    Understanding Common Ground Misalignment in Goal-Oriented Dialog: A Case-Study with Ubuntu Chat Logs

    Authors: Rupak Sarkar, Neha Srikanth, Taylor Hudson, Rachel Rudinger, Claire Bonial, Philip Resnik

    Abstract: While it is commonly accepted that maintaining common ground plays a role in conversational success, little prior research exists connecting conversational grounding to success in task-oriented conversations. We study failures of grounding in the Ubuntu IRC dataset, where participants use text-only communication to resolve technical issues. We find that disruptions in conversational flow often ste… ▽ More

    Submitted 16 March, 2025; originally announced March 2025.

    Comments: 8 pages

  15. arXiv:2503.01522  [pdf, ps, other

    cs.IT cs.CR

    Byzantine Distributed Function Computation

    Authors: Hari Krishnan P. Anilkumar, Neha Sangwan, Varun Narayanan, Vinod M. Prabhakaran

    Abstract: We study the distributed function computation problem with $k$ users of which at most $s$ may be controlled by an adversary and characterize the set of functions of the sources the decoder can reconstruct robustly in the following sense -- if the users behave honestly, the function is recovered with high probability (w.h.p.); if they behave adversarially, w.h.p, either one of the adversarial users… ▽ More

    Submitted 10 March, 2025; v1 submitted 3 March, 2025; originally announced March 2025.

  16. arXiv:2503.01493  [pdf, other

    cs.CL

    Llama-3.1-Sherkala-8B-Chat: An Open Large Language Model for Kazakh

    Authors: Fajri Koto, Rituraj Joshi, Nurdaulet Mukhituly, Yuxia Wang, Zhuohan Xie, Rahul Pal, Daniil Orel, Parvez Mullah, Diana Turmakhan, Maiya Goloburda, Mohammed Kamran, Samujjwal Ghosh, Bokang Jia, Jonibek Mansurov, Mukhammed Togmanov, Debopriyo Banerjee, Nurkhan Laiyk, Akhmed Sakip, Xudong Han, Ekaterina Kochmar, Alham Fikri Aji, Aaryamonvikram Singh, Alok Anil Jadhav, Satheesh Katipomu, Samta Kamboj , et al. (10 additional authors not shown)

    Abstract: Llama-3.1-Sherkala-8B-Chat, or Sherkala-Chat (8B) for short, is a state-of-the-art instruction-tuned open generative large language model (LLM) designed for Kazakh. Sherkala-Chat (8B) aims to enhance the inclusivity of LLM advancements for Kazakh speakers. Adapted from the LLaMA-3.1-8B model, Sherkala-Chat (8B) is trained on 45.3B tokens across Kazakh, English, Russian, and Turkish. With 8 billion… ▽ More

    Submitted 3 March, 2025; originally announced March 2025.

    Comments: Technical Report

  17. arXiv:2502.19528  [pdf, other

    cs.ET eess.SY

    Improving Simulation-Based Origin-Destination Demand Calibration Using Sample Segment Counts Data

    Authors: Arwa Alanqary, Chao Zhang, Yechen Li, Neha Arora, Carolina Osorio

    Abstract: This paper introduces a novel approach to demand estimation that utilizes partial observations of segment-level track counts. Building on established simulation-based demand estimation methods, we present a modified formulation that integrates sample track counts as a regularization term. This approach effectively addresses the underdetermination challenge in demand estimation, moving beyond the c… ▽ More

    Submitted 26 February, 2025; originally announced February 2025.

    Comments: Published in the 12th Triennial Symposium on Transportation Analysis conference (TRISTAN XII)

  18. arXiv:2502.16008  [pdf, other

    stat.ML cs.IT cs.LG

    Exact Recovery of Sparse Binary Vectors from Generalized Linear Measurements

    Authors: Arya Mazumdar, Neha Sangwan

    Abstract: We consider the problem of exact recovery of a $k$-sparse binary vector from generalized linear measurements (such as logistic regression). We analyze the linear estimation algorithm (Plan, Vershynin, Yudovina, 2017), and also show information theoretic lower bounds on the number of required measurements. As a consequence of our results, for noisy one bit quantized linear measurements (… ▽ More

    Submitted 21 February, 2025; originally announced February 2025.

  19. arXiv:2502.15978  [pdf, other

    cs.CY cs.HC

    "Who Has the Time?": Understanding Receptivity to Health Chatbots among Underserved Women in India

    Authors: Manvi S, Roshini Deva, Neha Madhiwalla, Azra Ismail

    Abstract: Access to health information and services among women continues to be a major challenge in many communities globally. In recent years, there has been a growing interest in the potential of chatbots to address this information and access gap. We conducted interviews and focus group discussions with underserved women in urban India to understand their receptivity towards the use of chatbots for mate… ▽ More

    Submitted 21 February, 2025; originally announced February 2025.

    Comments: This is a preprint of the paper accepted at the International Conference on Gender and Technology 2025. The final version will be available in the IEEE Xplore Digital Library

  20. arXiv:2502.14270  [pdf, other

    cs.LG

    Predicting Fetal Birthweight from High Dimensional Data using Advanced Machine Learning

    Authors: Nachiket Kapure, Harsh Joshi, Rajeshwari Mistri, Parul Kumari, Manasi Mali, Seema Purohit, Neha Sharma, Mrityunjoy Panday, Chittaranjan S. Yajnik

    Abstract: Birth weight serves as a fundamental indicator of neonatal health, closely linked to both early medical interventions and long-term developmental risks. Traditional predictive models, often constrained by limited feature selection and incomplete datasets, struggle to achieve overlooking complex maternal and fetal interactions in diverse clinical settings. This research explores machine learning to… ▽ More

    Submitted 8 April, 2025; v1 submitted 20 February, 2025; originally announced February 2025.

  21. arXiv:2502.08080  [pdf, other

    cs.CL

    NLI under the Microscope: What Atomic Hypothesis Decomposition Reveals

    Authors: Neha Srikanth, Rachel Rudinger

    Abstract: Decomposition of text into atomic propositions is a flexible framework allowing for the closer inspection of input and output text. We use atomic decomposition of hypotheses in two natural language reasoning tasks, traditional NLI and defeasible NLI, to form atomic sub-problems, or granular inferences that models must weigh when solving the overall problem. These atomic sub-problems serve as a too… ▽ More

    Submitted 7 March, 2025; v1 submitted 11 February, 2025; originally announced February 2025.

    Comments: Accepted to NAACL 2025

  22. arXiv:2502.05227  [pdf, other

    cs.RO cs.AI cs.CL

    Robotouille: An Asynchronous Planning Benchmark for LLM Agents

    Authors: Gonzalo Gonzalez-Pumariega, Leong Su Yean, Neha Sunkara, Sanjiban Choudhury

    Abstract: Effective asynchronous planning, or the ability to efficiently reason and plan over states and actions that must happen in parallel or sequentially, is essential for agents that must account for time delays, reason over diverse long-horizon tasks, and collaborate with other agents. While large language model (LLM) agents show promise in high-level task planning, current benchmarks focus primarily… ▽ More

    Submitted 6 February, 2025; originally announced February 2025.

    Comments: 11 pages (not including references or appendix); 41 figures (7 main paper, 34 appendix); (v1) preprint

  23. arXiv:2501.06306  [pdf, other

    cs.ET eess.SP physics.soc-ph

    On How Traffic Signals Impact the Fundamental Diagrams of Urban Roads

    Authors: Chao Zhang, Yechen Li, Neha Arora, Carolina Osorio

    Abstract: Being widely adopted by the transportation and planning practitioners, the fundamental diagram (FD) is the primary tool used to relate the key macroscopic traffic variables of speed, flow, and density. We empirically analyze the relation between vehicular space-mean speeds and flows given different signal settings and postulate a parsimonious parametric function form of the traditional FD where it… ▽ More

    Submitted 10 January, 2025; originally announced January 2025.

    Comments: Published in the 4th Symposium on Management of Future Motorway and Urban Traffic Systems (MFTS)

  24. arXiv:2501.06126  [pdf, other

    cs.CL cs.LG

    Merging Feed-Forward Sublayers for Compressed Transformers

    Authors: Neha Verma, Kenton Murray, Kevin Duh

    Abstract: With the rise and ubiquity of larger deep learning models, the need for high-quality compression techniques is growing in order to deploy these models widely. The sheer parameter count of these models makes it difficult to fit them into the memory constraints of different hardware. In this work, we present a novel approach to model compression by merging similar parameter groups within a model, ra… ▽ More

    Submitted 28 March, 2025; v1 submitted 10 January, 2025; originally announced January 2025.

  25. arXiv:2501.04783  [pdf, other

    cs.ET eess.SY

    Traffic Simulations: Multi-City Calibration of Metropolitan Highway Networks

    Authors: Chao Zhang, Yechen Li, Neha Arora, Damien Pierce, Carolina Osorio

    Abstract: This paper proposes an approach to perform travel demand calibration for high-resolution stochastic traffic simulators. It employs abundant travel times at the path-level, departing from the standard practice of resorting to scarce segment-level sensor counts. The proposed approach is shown to tackle high-dimensional instances in a sample-efficient way. For the first time, case studies on 6 metrop… ▽ More

    Submitted 8 January, 2025; originally announced January 2025.

    Comments: Published on the 27th IEEE International Conference on Intelligent Transportation Systems (ITSC) (2024)

  26. arXiv:2412.17899  [pdf, other

    math.ST cs.DS stat.ML

    A mixing time bound for Gibbs sampling from log-smooth log-concave distributions

    Authors: Neha S. Wadia

    Abstract: The Gibbs sampler, also known as the coordinate hit-and-run algorithm, is a Markov chain that is widely used to draw samples from probability distributions in arbitrary dimensions. At each iteration of the algorithm, a randomly selected coordinate is resampled from the distribution that results from conditioning on all the other coordinates. We study the behavior of the Gibbs sampler on the class… ▽ More

    Submitted 23 December, 2024; originally announced December 2024.

    Comments: 22 pages, 4 figures

  27. On the Use of Abundant Road Speed Data for Travel Demand Calibration of Urban Traffic Simulators

    Authors: Suyash Vishnoi, Akhil Shetty, Iveel Tsogsuren, Neha Arora, Carolina Osorio

    Abstract: This work develops a compute-efficient algorithm to tackle a fundamental problem in transportation: that of urban travel demand estimation. It focuses on the calibration of origin-destination travel demand input parameters for high-resolution traffic simulation models. It considers the use of abundant traffic road speed data. The travel demand calibration problem is formulated as a continuous, hig… ▽ More

    Submitted 18 December, 2024; originally announced December 2024.

    Comments: 4 pages

    Journal ref: Proceedings of the 31st ACM International Conference on Advances in Geographic Information Systems, pp. 1-4. 2023

  28. arXiv:2412.09411  [pdf, other

    cs.DB

    Resilience for Regular Path Queries: Towards a Complexity Classification

    Authors: Antoine Amarilli, Wolfgang Gatterbauer, Neha Makhija, Mikaël Monet

    Abstract: The resilience problem for a query and an input set or bag database is to compute the minimum number of facts to remove from the database to make the query false. In this paper, we study how to compute the resilience of Regular Path Queries (RPQs) over graph databases. Our goal is to characterize the regular languages $L$ for which it is tractable to compute the resilience of the existentially-qua… ▽ More

    Submitted 23 March, 2025; v1 submitted 12 December, 2024; originally announced December 2024.

    Comments: 48 pages, 17 figures. Minor updates relative to version 1. This version includes the appendices with all proofs, and all reviewer feedback: it is identical to the PODS'25 publication up to minor formatting differences

  29. arXiv:2412.09230  [pdf, other

    cs.CV cs.AI

    Foundation Models and Adaptive Feature Selection: A Synergistic Approach to Video Question Answering

    Authors: Sai Bhargav Rongali, Mohamad Hassan N C, Ankit Jha, Neha Bhargava, Saurabh Prasad, Biplab Banerjee

    Abstract: This paper tackles the intricate challenge of video question-answering (VideoQA). Despite notable progress, current methods fall short of effectively integrating questions with video frames and semantic object-level abstractions to create question-aware video representations. We introduce Local-Global Question Aware Video Embedding (LGQAVE), which incorporates three major innovations to integrate… ▽ More

    Submitted 12 December, 2024; originally announced December 2024.

    Journal ref: WACV2025

  30. arXiv:2412.05724  [pdf, other

    cs.CV cs.AI

    A Tiered GAN Approach for Monet-Style Image Generation

    Authors: FNU Neha, Deepshikha Bhati, Deepak Kumar Shukla, Md Amiruzzaman

    Abstract: Generative Adversarial Networks (GANs) have proven to be a powerful tool in generating artistic images, capable of mimicking the styles of renowned painters, such as Claude Monet. This paper introduces a tiered GAN model to progressively refine image quality through a multi-stage process, enhancing the generated images at each step. The model transforms random noise into detailed artistic represen… ▽ More

    Submitted 7 December, 2024; originally announced December 2024.

  31. arXiv:2412.05686  [pdf, other

    cs.NE cs.AI cs.CV cs.LG

    Neural network interpretability with layer-wise relevance propagation: novel techniques for neuron selection and visualization

    Authors: Deepshikha Bhati, Fnu Neha, Md Amiruzzaman, Angela Guercio, Deepak Kumar Shukla, Ben Ward

    Abstract: Interpreting complex neural networks is crucial for understanding their decision-making processes, particularly in applications where transparency and accountability are essential. This proposed method addresses this need by focusing on layer-wise Relevance Propagation (LRP), a technique used in explainable artificial intelligence (XAI) to attribute neural network outputs to input features through… ▽ More

    Submitted 7 December, 2024; originally announced December 2024.

  32. arXiv:2412.05252  [pdf, other

    cs.CV cs.AI cs.LG

    From classical techniques to convolution-based models: A review of object detection algorithms

    Authors: Fnu Neha, Deepshikha Bhati, Deepak Kumar Shukla, Md Amiruzzaman

    Abstract: Object detection is a fundamental task in computer vision and image understanding, with the goal of identifying and localizing objects of interest within an image while assigning them corresponding class labels. Traditional methods, which relied on handcrafted features and shallow models, struggled with complex visual data and showed limited performance. These methods combined low-level features w… ▽ More

    Submitted 6 December, 2024; originally announced December 2024.

  33. arXiv:2412.03933  [pdf, other

    cs.AI cs.HC cs.LG

    Exploring AI Text Generation, Retrieval-Augmented Generation, and Detection Technologies: a Comprehensive Overview

    Authors: Fnu Neha, Deepshikha Bhati, Deepak Kumar Shukla, Angela Guercio, Ben Ward

    Abstract: The rapid development of Artificial Intelligence (AI) has led to the creation of powerful text generation models, such as large language models (LLMs), which are widely used for diverse applications. However, concerns surrounding AI-generated content, including issues of originality, bias, misinformation, and accountability, have become increasingly prominent. This paper offers a comprehensive ove… ▽ More

    Submitted 5 December, 2024; originally announced December 2024.

  34. arXiv:2412.02242  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    U-Net in Medical Image Segmentation: A Review of Its Applications Across Modalities

    Authors: Fnu Neha, Deepshikha Bhati, Deepak Kumar Shukla, Sonavi Makarand Dalvi, Nikolaos Mantzou, Safa Shubbar

    Abstract: Medical imaging is essential in healthcare to provide key insights into patient anatomy and pathology, aiding in diagnosis and treatment. Non-invasive techniques such as X-ray, Magnetic Resonance Imaging (MRI), Computed Tomography (CT), and Ultrasound (US), capture detailed images of organs, tissues, and abnormalities. Effective analysis of these images requires precise segmentation to delineate r… ▽ More

    Submitted 3 December, 2024; originally announced December 2024.

  35. arXiv:2412.02166  [pdf, other

    cs.AI

    Analyzing the Impact of AI Tools on Student Study Habits and Academic Performance

    Authors: Ben Ward, Deepshikha Bhati, Fnu Neha, Angela Guercio

    Abstract: This study explores the effectiveness of AI tools in enhancing student learning, specifically in improving study habits, time management, and feedback mechanisms. The research focuses on how AI tools can support personalized learning, adaptive test adjustments, and provide real-time classroom analysis. Student feedback revealed strong support for these features, and the study found a significant r… ▽ More

    Submitted 2 December, 2024; originally announced December 2024.

  36. arXiv:2411.17603  [pdf, ps, other

    cs.DB

    Is Integer Linear Programming All You Need for Deletion Propagation? A Unified and Practical Approach for Generalized Deletion Propagation

    Authors: Neha Makhija, Wolfgang Gatterbauer

    Abstract: Deletion Propagation (DP) refers to a family of database problems rooted in the classical view-update problem: how to propagate intended deletions in a view (query output) back to the source database while satisfying constraints and minimizing side effects. Although studied for over 40 years, DP variants, their complexities, and practical algorithms have been typically explored in isolation. Thi… ▽ More

    Submitted 16 June, 2025; v1 submitted 26 November, 2024; originally announced November 2024.

    Comments: 19 pages, 12 figures

  37. arXiv:2411.14437  [pdf

    cs.CY

    Transforming Business with Generative AI: Research, Innovation, Market Deployment and Future Shifts in Business Models

    Authors: Narotam Singh, Vaibhav Chaudhary, Nimisha Singh, Neha Soni, Amita Kapoor

    Abstract: This paper explores the transformative impact of Generative AI (GenAI) on the business landscape, examining its role in reshaping traditional business models, intensifying market competition, and fostering innovation. By applying the principles of Neo-Schumpeterian economics, the research analyses how GenAI is driving a new wave of "creative destruction," leading to the emergence of novel business… ▽ More

    Submitted 4 November, 2024; originally announced November 2024.

    Comments: 30 pages, 12 figures, original submission

    MSC Class: 68T05 (Primary); 68T50; 91B84; 91B69; 62H30 (Secondary) ACM Class: I.2.7; I.2.6; K.4.1; K.6.1; J.4; H.1.2

  38. arXiv:2411.10548  [pdf, ps, other

    cs.LG q-bio.BM

    BioNeMo Framework: a modular, high-performance library for AI model development in drug discovery

    Authors: Peter St. John, Dejun Lin, Polina Binder, Malcolm Greaves, Vega Shah, John St. John, Adrian Lange, Patrick Hsu, Rajesh Illango, Arvind Ramanathan, Anima Anandkumar, David H Brookes, Akosua Busia, Abhishaike Mahajan, Stephen Malina, Neha Prasad, Sam Sinai, Lindsay Edwards, Thomas Gaudelet, Cristian Regep, Martin Steinegger, Burkhard Rost, Alexander Brace, Kyle Hippe, Luca Naef , et al. (68 additional authors not shown)

    Abstract: Artificial Intelligence models encoding biology and chemistry are opening new routes to high-throughput and high-quality in-silico drug development. However, their training increasingly relies on computational scale, with recent protein language models (pLM) training on hundreds of graphical processing units (GPUs). We introduce the BioNeMo Framework to facilitate the training of computational bio… ▽ More

    Submitted 12 June, 2025; v1 submitted 15 November, 2024; originally announced November 2024.

  39. arXiv:2410.20231  [pdf, other

    cs.CV

    CAVE-Net: Classifying Abnormalities in Video Capsule Endoscopy

    Authors: Ishita Harish, Saurav Mishra, Neha Bhadoria, Rithik Kumar, Madhav Arora, Syed Rameem Zahra, Ankur Gupta

    Abstract: Accurate classification of medical images is critical for detecting abnormalities in the gastrointestinal tract, a domain where misclassification can significantly impact patient outcomes. We propose an ensemble-based approach to improve diagnostic accuracy in analyzing complex image datasets. Using a Convolutional Block Attention Module along with a Deep Neural Network, we leverage the unique fea… ▽ More

    Submitted 30 December, 2024; v1 submitted 26 October, 2024; originally announced October 2024.

  40. arXiv:2410.19656  [pdf, other

    cs.RO

    APRICOT: Active Preference Learning and Constraint-Aware Task Planning with LLMs

    Authors: Huaxiaoyue Wang, Nathaniel Chin, Gonzalo Gonzalez-Pumariega, Xiangwan Sun, Neha Sunkara, Maximus Adrian Pace, Jeannette Bohg, Sanjiban Choudhury

    Abstract: Home robots performing personalized tasks must adeptly balance user preferences with environmental affordances. We focus on organization tasks within constrained spaces, such as arranging items into a refrigerator, where preferences for placement collide with physical limitations. The robot must infer user preferences based on a small set of demonstrations, which is easier for users to provide tha… ▽ More

    Submitted 25 October, 2024; originally announced October 2024.

    Comments: Conference on Robot Learning (CoRL) 2024

  41. arXiv:2410.15472  [pdf

    cs.AI cs.CV cs.LG

    Multi-Layer Feature Fusion with Cross-Channel Attention-Based U-Net for Kidney Tumor Segmentation

    Authors: Fnu Neha, Arvind K. Bansal

    Abstract: Renal tumors, especially renal cell carcinoma (RCC), show significant heterogeneity, posing challenges for diagnosis using radiology images such as MRI, echocardiograms, and CT scans. U-Net based deep learning techniques are emerging as a promising approach for automated medical image segmentation for minimally invasive diagnosis of renal tumors. However, current techniques need further improvemen… ▽ More

    Submitted 21 October, 2024; v1 submitted 20 October, 2024; originally announced October 2024.

    Comments: 8 pages

    Journal ref: Proceedings of the 10th World Congress on Electrical Engineering and Computer Systems and Science, Avestia Publishing, ISSN = 2369-811X, 2024

  42. arXiv:2410.15229  [pdf

    cs.CV cs.LG physics.app-ph physics.med-ph

    Deep Learning-based Detection of Bacterial Swarm Motion Using a Single Image

    Authors: Yuzhu Li, Hao Li, Weijie Chen, Keelan O'Riordan, Neha Mani, Yuxuan Qi, Tairan Liu, Sridhar Mani, Aydogan Ozcan

    Abstract: Distinguishing between swarming and swimming, the two principal forms of bacterial movement, holds significant conceptual and clinical relevance. This is because bacteria that exhibit swarming capabilities often possess unique properties crucial to the pathogenesis of infectious diseases and may also have therapeutic potential. Here, we report a deep learning-based swarming classifier that rapidly… ▽ More

    Submitted 19 October, 2024; originally announced October 2024.

    Comments: 17 Pages, 4 Figures

    Journal ref: Gut Microbes (2025)

  43. arXiv:2410.12311  [pdf, other

    cs.CL cs.AI

    Open Domain Question Answering with Conflicting Contexts

    Authors: Siyi Liu, Qiang Ning, Kishaloy Halder, Wei Xiao, Zheng Qi, Phu Mon Htut, Yi Zhang, Neha Anna John, Bonan Min, Yassine Benajiba, Dan Roth

    Abstract: Open domain question answering systems frequently rely on information retrieved from large collections of text (such as the Web) to answer questions. However, such collections of text often contain conflicting information, and indiscriminately depending on this information may result in untruthful and inaccurate answers. To understand the gravity of this problem, we collect a human-annotated datas… ▽ More

    Submitted 27 April, 2025; v1 submitted 16 October, 2024; originally announced October 2024.

  44. arXiv:2410.11967  [pdf

    cs.CV cs.LG

    Integrating Artificial Intelligence Models and Synthetic Image Data for Enhanced Asset Inspection and Defect Identification

    Authors: Reddy Mandati, Vladyslav Anderson, Po-chen Chen, Ankush Agarwal, Tatjana Dokic, David Barnard, Michael Finn, Jesse Cromer, Andrew Mccauley, Clay Tutaj, Neha Dave, Bobby Besharati, Jamie Barnett, Timothy Krall

    Abstract: In the past utilities relied on in-field inspections to identify asset defects. Recently, utilities have started using drone-based inspections to enhance the field-inspection process. We consider a vast repository of drone images, providing a wealth of information about asset health and potential issues. However, making the collected imagery data useful for automated defect detection requires sign… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

  45. arXiv:2410.09047  [pdf, other

    cs.CL cs.AI cs.LG

    Unraveling and Mitigating Safety Alignment Degradation of Vision-Language Models

    Authors: Qin Liu, Chao Shang, Ling Liu, Nikolaos Pappas, Jie Ma, Neha Anna John, Srikanth Doss, Lluis Marquez, Miguel Ballesteros, Yassine Benajiba

    Abstract: The safety alignment ability of Vision-Language Models (VLMs) is prone to be degraded by the integration of the vision module compared to its LLM backbone. We investigate this phenomenon, dubbed as ''safety alignment degradation'' in this paper, and show that the challenge arises from the representation gap that emerges when introducing vision modality to VLMs. In particular, we show that the repr… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

    Comments: Preprint

  46. arXiv:2410.00260  [pdf, other

    cs.CL cs.AI cs.LG

    DoPAMine: Domain-specific Pre-training Adaptation from seed-guided data Mining

    Authors: Vinayak Arannil, Neha Narwal, Sourav Sanjukta Bhabesh, Sai Nikhil Thirandas, Darren Yow-Bang Wang, Graham Horwood, Alex Anto Chirayath, Gouri Pandeshwar

    Abstract: Large Language Models (LLMs) have shown remarkable ability to generalize effectively across numerous industry domains while executing a range of tasks. Many of these competencies are obtained from the data utilized during the pre-training phase of the Language Models (LMs). However, these models exhibit limitations when tasked with performing in specialized or low-resource industry domains. More r… ▽ More

    Submitted 9 October, 2024; v1 submitted 30 September, 2024; originally announced October 2024.

  47. arXiv:2409.16560  [pdf, other

    cs.AI

    Dynamic-Width Speculative Beam Decoding for Efficient LLM Inference

    Authors: Zongyue Qin, Zifan He, Neha Prakriya, Jason Cong, Yizhou Sun

    Abstract: Large language models (LLMs) have shown outstanding performance across numerous real-world tasks. However, the autoregressive nature of these models makes the inference process slow and costly. Speculative decoding has emerged as a promising solution, leveraging a smaller auxiliary model to draft future tokens, which are then validated simultaneously by the larger model, achieving a speed-up of 1-… ▽ More

    Submitted 14 March, 2025; v1 submitted 24 September, 2024; originally announced September 2024.

  48. arXiv:2409.06131  [pdf, other

    cs.CL cs.AI

    Accelerating Large Language Model Pretraining via LFR Pedagogy: Learn, Focus, and Review

    Authors: Neha Prakriya, Jui-Nan Yen, Cho-Jui Hsieh, Jason Cong

    Abstract: Traditional Large Language Model (LLM) pretraining relies on autoregressive language modeling with randomly sampled data from web-scale datasets. Inspired by human learning techniques like spaced repetition, we hypothesize that random sampling leads to high training costs, lower-quality models, and significant data forgetting. To address these inefficiencies, we propose the Learn-Focus-Review (LFR… ▽ More

    Submitted 28 January, 2025; v1 submitted 9 September, 2024; originally announced September 2024.

  49. arXiv:2409.04880  [pdf, other

    cs.CR cs.AI

    Towards identifying Source credibility on Information Leakage in Digital Gadget Market

    Authors: Neha Kumaru, Garvit Gupta, Shreyas Mongia, Shubham Singh, Ponnurangam Kumaraguru, Arun Balaji Buduru

    Abstract: The use of Social media to share content is on a constant rise. One of the capsize effect of information sharing on Social media includes the spread of sensitive information on the public domain. With the digital gadget market becoming highly competitive and ever-evolving, the trend of an increasing number of sensitive posts leaking information on devices in social media is observed. Many web-blog… ▽ More

    Submitted 7 September, 2024; originally announced September 2024.

  50. arXiv:2408.10239  [pdf, ps, other

    cs.CY cs.AI cs.LG cs.SE

    A Conceptual Framework for Ethical Evaluation of Machine Learning Systems

    Authors: Neha R. Gupta, Jessica Hullman, Hari Subramonyam

    Abstract: Research in Responsible AI has developed a range of principles and practices to ensure that machine learning systems are used in a manner that is ethical and aligned with human values. However, a critical yet often neglected aspect of ethical ML is the ethical implications that appear when designing evaluations of ML systems. For instance, teams may have to balance a trade-off between highly infor… ▽ More

    Submitted 4 August, 2024; originally announced August 2024.