Skip to main content

Showing 1–50 of 268 results for author: Singh, A K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2509.19925  [pdf, ps, other

    cs.AI

    CON-QA: Privacy-Preserving QA using cloud LLMs in Contract Domain

    Authors: Ajeet Kumar Singh, Rajsabi Surya, Anurag Tripathi, Santanu Choudhury, Sudhir Bisane

    Abstract: As enterprises increasingly integrate cloud-based large language models (LLMs) such as ChatGPT and Gemini into their legal document workflows, protecting sensitive contractual information - including Personally Identifiable Information (PII) and commercially sensitive clauses - has emerged as a critical challenge. In this work, we propose CON-QA, a hybrid privacy-preserving framework designed spec… ▽ More

    Submitted 24 September, 2025; originally announced September 2025.

  2. arXiv:2508.14946  [pdf, ps, other

    cs.LG

    HHNAS-AM: Hierarchical Hybrid Neural Architecture Search using Adaptive Mutation Policies

    Authors: Anurag Tripathi, Ajeet Kumar Singh, Rajsabi Surya, Aum Gupta, Sahiinii Lemaina Veikho, Dorien Herremans, Sudhir Bisane

    Abstract: Neural Architecture Search (NAS) has garnered significant research interest due to its capability to discover architectures superior to manually designed ones. Learning text representation is crucial for text classification and other language-related tasks. The NAS model used in text classification does not have a Hybrid hierarchical structure, and there is no restriction on the architecture struc… ▽ More

    Submitted 20 August, 2025; originally announced August 2025.

  3. arXiv:2508.07387  [pdf, ps, other

    cs.RO

    MonoMPC: Monocular Vision Based Navigation with Learned Collision Model and Risk-Aware Model Predictive Control

    Authors: Basant Sharma, Prajyot Jadhav, Pranjal Paul, K. Madhava Krishna, Arun Kumar Singh

    Abstract: Navigating unknown environments with a single RGB camera is challenging, as the lack of depth information prevents reliable collision-checking. While some methods use estimated depth to build collision maps, we found that depth estimates from vision foundation models are too noisy for zero-shot navigation in cluttered environments. We propose an alternative approach: instead of using noisy estim… ▽ More

    Submitted 10 August, 2025; originally announced August 2025.

  4. arXiv:2508.06387  [pdf, ps, other

    cs.LG cs.AI

    End-to-End Text-to-SQL with Dataset Selection: Leveraging LLMs for Adaptive Query Generation

    Authors: Anurag Tripathi, Vaibhav Patle, Abhinav Jain, Ayush Pundir, Sairam Menon, Ajeet Kumar Singh, Dorien Herremans

    Abstract: Text-to-SQL bridges the gap between natural language and structured database language, thus allowing non-technical users to easily query databases. Traditional approaches model text-to-SQL as a direct translation task, where a given Natural Language Query (NLQ) is mapped to an SQL command. Recent advances in large language models (LLMs) have significantly improved translation accuracy, however, th… ▽ More

    Submitted 11 August, 2025; v1 submitted 8 August, 2025; originally announced August 2025.

    Comments: Accepted in IJCNN25

  5. arXiv:2508.05661  [pdf, ps, other

    cs.IR cs.AI

    Zero-Shot Retrieval for Scalable Visual Search in a Two-Sided Marketplace

    Authors: Andre Rusli, Shoma Ishimoto, Sho Akiyama, Aman Kumar Singh

    Abstract: Visual search offers an intuitive way for customers to explore diverse product catalogs, particularly in consumer-to-consumer (C2C) marketplaces where listings are often unstructured and visually driven. This paper presents a scalable visual search system deployed in Mercari's C2C marketplace, where end-users act as buyers and sellers. We evaluate recent vision-language models for zero-shot image… ▽ More

    Submitted 31 July, 2025; originally announced August 2025.

    Comments: 6 pages, KDD 2025 Workshop on Two-sided Marketplace Optimization: Search, Pricing, Matching & Growth (TSMO)

  6. arXiv:2507.18763  [pdf, ps, other

    cs.CV cs.RO

    Diffusion-FS: Multimodal Free-Space Prediction via Diffusion for Autonomous Driving

    Authors: Keshav Gupta, Tejas S. Stanley, Pranjal Paul, Arun K. Singh, K. Madhava Krishna

    Abstract: Drivable Free-space prediction is a fundamental and crucial problem in autonomous driving. Recent works have addressed the problem by representing the entire non-obstacle road regions as the free-space. In contrast our aim is to estimate the driving corridors that are a navigable subset of the entire road region. Unfortunately, existing corridor estimation methods directly assume a BEV-centric rep… ▽ More

    Submitted 24 July, 2025; originally announced July 2025.

    Comments: 8 pages, 7 figures, IROS 2025

  7. arXiv:2507.17654  [pdf, ps, other

    cs.IT cs.DM cs.IR

    On Function-Correcting Codes in the Lee Metric

    Authors: Gyanendra K. Verma, Abhay Kumar Singh

    Abstract: Function-correcting codes are a coding framework designed to minimize redundancy while ensuring that specific functions or computations of encoded data can be reliably recovered, even in the presence of errors. The choice of metric is crucial in designing such codes, as it determines which computations must be protected and how errors are measured and corrected. Previous work by Liu and Liu [6] st… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

  8. A Comprehensively Adaptive Architectural Optimization-Ingrained Quantum Neural Network Model for Cloud Workloads Prediction

    Authors: Jitendra Kumar, Deepika Saxena, Kishu Gupta, Satyam Kumar, Ashutosh Kumar Singh

    Abstract: Accurate workload prediction and advanced resource reservation are indispensably crucial for managing dynamic cloud services. Traditional neural networks and deep learning models frequently encounter challenges with diverse, high-dimensional workloads, especially during sudden resource demand changes, leading to inefficiencies. This issue arises from their limited optimization during training, rel… ▽ More

    Submitted 11 July, 2025; originally announced July 2025.

  9. arXiv:2506.13935  [pdf, ps, other

    cs.LG cs.DC cs.ET

    ReinDSplit: Reinforced Dynamic Split Learning for Pest Recognition in Precision Agriculture

    Authors: Vishesh Kumar Tanwar, Soumik Sarkar, Asheesh K. Singh, Sajal K. Das

    Abstract: To empower precision agriculture through distributed machine learning (DML), split learning (SL) has emerged as a promising paradigm, partitioning deep neural networks (DNNs) between edge devices and servers to reduce computational burdens and preserve data privacy. However, conventional SL frameworks' one-split-fits-all strategy is a critical limitation in agricultural ecosystems where edge insec… ▽ More

    Submitted 16 June, 2025; originally announced June 2025.

  10. arXiv:2506.13253  [pdf, ps, other

    cs.LG cs.AI cs.CL

    Distinct Computations Emerge From Compositional Curricula in In-Context Learning

    Authors: Jin Hwa Lee, Andrew K. Lampinen, Aaditya K. Singh, Andrew M. Saxe

    Abstract: In-context learning (ICL) research often considers learning a function in-context through a uniform sample of input-output pairs. Here, we investigate how presenting a compositional subtask curriculum in context may alter the computations a transformer learns. We design a compositional algorithmic task based on the modular exponential-a double exponential task composed of two single exponential su… ▽ More

    Submitted 16 June, 2025; originally announced June 2025.

  11. arXiv:2506.06023  [pdf, ps, other

    cs.CV

    Restereo: Diffusion stereo video generation and restoration

    Authors: Xingchang Huang, Ashish Kumar Singh, Florian Dubost, Cristina Nader Vasconcelos, Sakar Khattar, Liang Shi, Christian Theobalt, Cengiz Oztireli, Gurprit Singh

    Abstract: Stereo video generation has been gaining increasing attention with recent advancements in video diffusion models. However, most existing methods focus on generating 3D stereoscopic videos from monocular 2D videos. These approaches typically assume that the input monocular video is of high quality, making the task primarily about inpainting occluded regions in the warped video while preserving diso… ▽ More

    Submitted 6 June, 2025; originally announced June 2025.

    Comments: 12 pages, 5 figures

  12. arXiv:2506.03182  [pdf, ps, other

    cs.CV cs.LG

    TerraIncognita: A Dynamic Benchmark for Species Discovery Using Frontier Models

    Authors: Shivani Chiranjeevi, Hossein Zaremehrjerdi, Zi K. Deng, Talukder Z. Jubery, Ari Grele, Arti Singh, Asheesh K Singh, Soumik Sarkar, Nirav Merchant, Harold F. Greeney, Baskar Ganapathysubramanian, Chinmay Hegde

    Abstract: The rapid global loss of biodiversity, particularly among insects, represents an urgent ecological crisis. Current methods for insect species discovery are manual, slow, and severely constrained by taxonomic expertise, hindering timely conservation actions. We introduce TerraIncognita, a dynamic benchmark designed to evaluate state-of-the-art multimodal models for the challenging problem of identi… ▽ More

    Submitted 29 May, 2025; originally announced June 2025.

  13. arXiv:2506.00450  [pdf, ps, other

    cs.IR cs.LG

    DV365: Extremely Long User History Modeling at Instagram

    Authors: Wenhan Lyu, Devashish Tyagi, Yihang Yang, Ziwei Li, Ajay Somani, Karthikeyan Shanmugasundaram, Nikola Andrejevic, Ferdi Adeputra, Curtis Zeng, Arun K. Singh, Maxime Ransan, Sagar Jain

    Abstract: Long user history is highly valuable signal for recommendation systems, but effectively incorporating it often comes with high cost in terms of data center power consumption and GPU. In this work, we chose offline embedding over end-to-end sequence length optimization methods to enable extremely long user sequence modeling as a cost-effective solution, and propose a new user embedding learning str… ▽ More

    Submitted 31 May, 2025; originally announced June 2025.

    Comments: SIGKDD 2025 accepted

  14. arXiv:2505.19259  [pdf, ps, other

    cs.LG cs.AI

    Towards Large Reasoning Models for Agriculture

    Authors: Hossein Zaremehrjerdi, Shreyan Ganguly, Ashlyn Rairdin, Elizabeth Tranel, Benjamin Feuer, Juan Ignacio Di Salvo, Srikanth Panthulugiri, Hernan Torres Pacin, Victoria Moser, Sarah Jones, Joscif G Raigne, Yanben Shen, Heidi M. Dornath, Aditya Balu, Adarsh Krishnamurthy, Asheesh K Singh, Arti Singh, Baskar Ganapathysubramanian, Chinmay Hegde, Soumik Sarkar

    Abstract: Agricultural decision-making involves complex, context-specific reasoning, where choices about crops, practices, and interventions depend heavily on geographic, climatic, and economic conditions. Traditional large language models (LLMs) often fall short in navigating this nuanced problem due to limited reasoning capacity. We hypothesize that recent advances in large reasoning models (LRMs) can bet… ▽ More

    Submitted 27 May, 2025; v1 submitted 25 May, 2025; originally announced May 2025.

  15. arXiv:2505.18930  [pdf, other

    cs.CV cs.AI

    WeedNet: A Foundation Model-Based Global-to-Local AI Approach for Real-Time Weed Species Identification and Classification

    Authors: Yanben Shen, Timilehin T. Ayanlade, Venkata Naresh Boddepalli, Mojdeh Saadati, Ashlyn Rairdin, Zi K. Deng, Muhammad Arbab Arshad, Aditya Balu, Daren Mueller, Asheesh K Singh, Wesley Everman, Nirav Merchant, Baskar Ganapathysubramanian, Meaghan Anderson, Soumik Sarkar, Arti Singh

    Abstract: Early identification of weeds is essential for effective management and control, and there is growing interest in automating the process using computer vision techniques coupled with AI methods. However, challenges associated with training AI-based weed identification models, such as limited expert-verified data and complexity and variability in morphological features, have hindered progress. To a… ▽ More

    Submitted 24 May, 2025; originally announced May 2025.

  16. arXiv:2505.14751  [pdf, ps, other

    cs.LG cs.AI cs.ET

    Self Distillation via Iterative Constructive Perturbations

    Authors: Maheak Dave, Aniket Kumar Singh, Aryan Pareek, Harshita Jha, Debasis Chaudhuri, Manish Pratap Singh

    Abstract: Deep Neural Networks have achieved remarkable achievements across various domains, however balancing performance and generalization still remains a challenge while training these networks. In this paper, we propose a novel framework that uses a cyclic optimization strategy to concurrently optimize the model and its input data for better training, rethinking the traditional training paradigm. Centr… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

  17. arXiv:2505.09473  [pdf, ps, other

    cs.IT

    Function-Correcting $b$-symbol Codes for Locally $(λ, ρ,b)$-Functions

    Authors: Gyanendra K. Verma, Anamika Singh, Abhay Kumar Singh

    Abstract: The family of functions plays a central role in the design and effectiveness of function-correcting codes. By focusing on a well-defined family of functions, function-correcting codes can be constructed with minimal length while still ensuring full error detection and correction within that family. In this work, we explore the concept of locally $(λ,ρ)$-functions for $b$-symbol read channels and i… ▽ More

    Submitted 29 September, 2025; v1 submitted 14 May, 2025; originally announced May 2025.

  18. A Self-Healing and Fault-Tolerant Cloud-based Digital Twin Processing Management Model

    Authors: Deepika Saxena, Ashutosh Kumar Singh

    Abstract: Digital twins, integral to cloud platforms, bridge physical and virtual worlds, fostering collaboration among stakeholders in manufacturing and processing. However, the cloud platforms face challenges like service outages, vulnerabilities, and resource contention, hindering critical digital twin application development. The existing research works have limited focus on reliability and fault tolera… ▽ More

    Submitted 2 May, 2025; originally announced May 2025.

    Comments: 10 pages, 7 figures

    Journal ref: IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS 2025

  19. arXiv:2504.14674  [pdf, ps, other

    cs.IT

    Binary cyclic codes from permutation polynomials over $\mathbb{F}_{2^m}$

    Authors: Mrinal Kanti Bose, Udaya Parampalli, Abhay Kumar Singh

    Abstract: Binary cyclic codes having large dimensions and minimum distances close to the square-root bound are highly valuable in applications where high-rate transmission and robust error correction are both essential. They provide an optimal trade-off between these two factors, making them suitable for demanding communication and storage systems, post-quantum cryptography, radar and sonar systems, wireles… ▽ More

    Submitted 20 April, 2025; originally announced April 2025.

    MSC Class: 94B15; 11T71; 11T06 ACM Class: B.4.1; H.1.1

  20. $π$-MPPI: A Projection-based Model Predictive Path Integral Scheme for Smooth Optimal Control of Fixed-Wing Aerial Vehicles

    Authors: Edvin Martin Andrejev, Amith Manoharan, Karl-Eerik Unt, Arun Kumar Singh

    Abstract: Model Predictive Path Integral (MPPI) is a popular sampling-based Model Predictive Control (MPC) algorithm for nonlinear systems. It optimizes trajectories by sampling control sequences and averaging them. However, a key issue with MPPI is the non-smoothness of the optimal control sequence, leading to oscillations in systems like fixed-wing aerial vehicles (FWVs). Existing solutions use post-hoc s… ▽ More

    Submitted 16 April, 2025; v1 submitted 15 April, 2025; originally announced April 2025.

    Comments: 8 pages, 4 figures, submitted to IEEE RA-L

    Journal ref: IEEE ROBOTICS AND AUTOMATION LETTERS, VOL. 10, NO. 6, JUNE 2025

  21. arXiv:2504.10088  [pdf, ps, other

    cs.IT

    Code size constraints in b-symbol read channels: A bound analysis

    Authors: Gyanendra K. Verma, Nupur Patanker, Abhay Kumar Singh

    Abstract: In classical coding theory, error-correcting codes are designed to protect against errors occurring at individual symbol positions in a codeword. However, in practical storage and communication systems, errors often affect multiple adjacent symbols rather than single symbols independently. To address this, symbol-pair read channels were introduced \cite{Yuval2011}, and later generalized to $b$-sym… ▽ More

    Submitted 22 April, 2025; v1 submitted 14 April, 2025; originally announced April 2025.

  22. arXiv:2504.05918  [pdf, other

    cs.RO cs.LG

    Deep RL-based Autonomous Navigation of Micro Aerial Vehicles (MAVs) in a complex GPS-denied Indoor Environment

    Authors: Amit Kumar Singh, Prasanth Kumar Duba, P. Rajalakshmi

    Abstract: The Autonomy of Unmanned Aerial Vehicles (UAVs) in indoor environments poses significant challenges due to the lack of reliable GPS signals in enclosed spaces such as warehouses, factories, and indoor facilities. Micro Aerial Vehicles (MAVs) are preferred for navigating in these complex, GPS-denied scenarios because of their agility, low power consumption, and limited computational capabilities. I… ▽ More

    Submitted 8 April, 2025; originally announced April 2025.

  23. arXiv:2504.03850  [pdf, other

    cs.CV cs.AI cs.CR cs.LG stat.ML

    Detection Limits and Statistical Separability of Tree Ring Watermarks in Rectified Flow-based Text-to-Image Generation Models

    Authors: Ved Umrajkar, Aakash Kumar Singh

    Abstract: Tree-Ring Watermarking is a significant technique for authenticating AI-generated images. However, its effectiveness in rectified flow-based models remains unexplored, particularly given the inherent challenges of these models with noise latent inversion. Through extensive experimentation, we evaluated and compared the detection and separability of watermarks between SD 2.1 and FLUX.1-dev models.… ▽ More

    Submitted 4 April, 2025; originally announced April 2025.

  24. arXiv:2503.19508  [pdf, other

    cs.CV cs.LG

    Improved Alignment of Modalities in Large Vision Language Models

    Authors: Kartik Jangra, Aman Kumar Singh, Yashwani Mann, Geetanjali Rathee

    Abstract: Recent advancements in vision-language models have achieved remarkable results in making language models understand vision inputs. However, a unified approach to align these models across diverse tasks such as image captioning and visual question answering remains a challenge. Existing methods either require very big language models or very big datasets which is not efficient in utilizing existing… ▽ More

    Submitted 25 March, 2025; originally announced March 2025.

  25. arXiv:2503.17985  [pdf, other

    cs.RO cs.AI

    Optimizing Navigation And Chemical Application in Precision Agriculture With Deep Reinforcement Learning And Conditional Action Tree

    Authors: Mahsa Khosravi, Zhanhong Jiang, Joshua R Waite, Sarah Jonesc, Hernan Torres, Arti Singh, Baskar Ganapathysubramanian, Asheesh Kumar Singh, Soumik Sarkar

    Abstract: This paper presents a novel reinforcement learning (RL)-based planning scheme for optimized robotic management of biotic stresses in precision agriculture. The framework employs a hierarchical decision-making structure with conditional action masking, where high-level actions direct the robot's exploration, while low-level actions optimize its navigation and efficient chemical spraying in affected… ▽ More

    Submitted 23 March, 2025; originally announced March 2025.

    Comments: 32 pages, 9 figures

  26. arXiv:2503.12894  [pdf, other

    cs.IT

    Function-Correcting Codes for b-Symbol Read Channels

    Authors: Anamika Singh, Abhay Kumar Singh, Eitan Yaakobi

    Abstract: Function-correcting codes are an innovative class of codes that are designed to protect a function evaluation of the data against errors or corruptions. Due to its usefulness in machine learning applications and archival data storage, where preserving the integrity of computation is crucial, Lenz et al. recently introduced function-correcting codes for binary symmetric channels to safeguard functi… ▽ More

    Submitted 17 March, 2025; originally announced March 2025.

    Comments: 28 pages, 4 figures

  27. arXiv:2503.10486  [pdf, ps, other

    cs.CL cs.AI

    LLMs in Disease Diagnosis: A Comparative Study of DeepSeek-R1 and O3 Mini Across Chronic Health Conditions

    Authors: Gaurav Kumar Gupta, Pranal Pande, Nirajan Acharya, Aniket Kumar Singh, Suman Niroula

    Abstract: Large Language Models (LLMs) are revolutionizing medical diagnostics by enhancing both disease classification and clinical decision-making. In this study, we evaluate the performance of two LLM- based diagnostic tools, DeepSeek R1 and O3 Mini, using a structured dataset of symptoms and diagnoses. We assessed their predictive accuracy at both the disease and category levels, as well as the reliabil… ▽ More

    Submitted 19 June, 2025; v1 submitted 13 March, 2025; originally announced March 2025.

    Comments: 12 pages, 3 figures

  28. arXiv:2503.05631  [pdf, other

    cs.LG

    Strategy Coopetition Explains the Emergence and Transience of In-Context Learning

    Authors: Aaditya K. Singh, Ted Moskovitz, Sara Dragutinovic, Felix Hill, Stephanie C. Y. Chan, Andrew M. Saxe

    Abstract: In-context learning (ICL) is a powerful ability that emerges in transformer models, enabling them to learn from context without weight updates. Recent work has established emergent ICL as a transient phenomenon that can sometimes disappear after long training times. In this work, we sought a mechanistic understanding of these transient dynamics. Firstly, we find that, after the disappearance of IC… ▽ More

    Submitted 10 March, 2025; v1 submitted 7 March, 2025; originally announced March 2025.

    Comments: 20 pages, 18 figures

  29. arXiv:2502.18339  [pdf, other

    cs.CL cs.LG

    Correlating and Predicting Human Evaluations of Language Models from Natural Language Processing Benchmarks

    Authors: Rylan Schaeffer, Punit Singh Koura, Binh Tang, Ranjan Subramanian, Aaditya K Singh, Todor Mihaylov, Prajjwal Bhargava, Lovish Madaan, Niladri S. Chatterji, Vedanuj Goswami, Sergey Edunov, Dieuwke Hupkes, Sanmi Koyejo, Sharan Narang

    Abstract: The explosion of high-performing conversational language models (LMs) has spurred a shift from classic natural language processing (NLP) benchmarks to expensive, time-consuming and noisy human evaluations - yet the relationship between these two evaluation strategies remains hazy. In this paper, we conduct a large-scale study of four Chat Llama 2 models, comparing their performance on 160 standard… ▽ More

    Submitted 23 February, 2025; originally announced February 2025.

  30. Secure Resource Management in Cloud Computing: Challenges, Strategies and Meta-Analysis

    Authors: Deepika Saxena, Smruti Rekha Swain, Jatinder Kumar, Sakshi Patni, Kishu Gupta, Ashutosh Kumar Singh, Volker Lindenstruth

    Abstract: Secure resource management (SRM) within a cloud computing environment is a critical yet infrequently studied research topic. This paper provides a comprehensive survey and comparative performance evaluation of potential cyber threat countermeasure strategies that address security challenges during cloud workload execution and resource management. Cybersecurity is explored specifically in the conte… ▽ More

    Submitted 5 February, 2025; originally announced February 2025.

    Comments: 16 Pages, 12 Figures, 6 Tables, in IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2025

  31. Trajectory Optimization Under Stochastic Dynamics Leveraging Maximum Mean Discrepancy

    Authors: Basant Sharma, Arun Kumar Singh

    Abstract: This paper addresses sampling-based trajectory optimization for risk-aware navigation under stochastic dynamics. Typically such approaches operate by computing $\tilde{N}$ perturbed rollouts around the nominal dynamics to estimate the collision risk associated with a sequence of control commands. We consider a setting where it is expensive to estimate risk using perturbed rollouts, for example, du… ▽ More

    Submitted 10 April, 2025; v1 submitted 31 January, 2025; originally announced January 2025.

    Comments: https://github.com/Basant1861/MPC-MMD

  32. arXiv:2501.19042  [pdf, other

    cs.RO cs.AI

    Swarm-Gen: Fast Generation of Diverse Feasible Swarm Behaviors

    Authors: Simon Idoko, B. Bhanu Teja, K. Madhava Krishna, Arun Kumar Singh

    Abstract: Coordination behavior in robot swarms is inherently multi-modal in nature. That is, there are numerous ways in which a swarm of robots can avoid inter-agent collisions and reach their respective goals. However, the problem of generating diverse and feasible swarm behaviors in a scalable manner remains largely unaddressed. In this paper, we fill this gap by combining generative models with a safety… ▽ More

    Submitted 31 January, 2025; originally announced January 2025.

    Comments: Submitted to RAL

  33. arXiv:2501.16265  [pdf, other

    cs.LG

    Training Dynamics of In-Context Learning in Linear Attention

    Authors: Yedi Zhang, Aaditya K. Singh, Peter E. Latham, Andrew Saxe

    Abstract: While attention-based models have demonstrated the remarkable ability of in-context learning (ICL), the theoretical understanding of how these models acquired this ability through gradient descent training is still preliminary. Towards answering this question, we study the gradient descent dynamics of multi-head linear self-attention trained for in-context linear regression. We examine two paramet… ▽ More

    Submitted 27 May, 2025; v1 submitted 27 January, 2025; originally announced January 2025.

    Comments: ICML 2025 Spotlight

  34. arXiv:2501.11477  [pdf

    cs.NE

    Image Classification Method using Dynamic Quantum Inspired Genetic Algorithm

    Authors: Akhilesh Kumar Singh, Kirankumar R. Hiremath

    Abstract: This study presents a dynamic Quantum-Inspired Genetic Algorithm (D-QIGA) for feature selection, leveraging quantum principles like superposition and rotation gates to enhance exploration and exploitation. D-QIGA introduces adaptive mechanisms and a lengthening chromosome strategy to avoid local optima and improve optimization. Tested on benchmark and real-world problems, it significantly outperfo… ▽ More

    Submitted 4 April, 2025; v1 submitted 20 January, 2025; originally announced January 2025.

  35. FedMUP: Federated Learning driven Malicious User Prediction Model for Secure Data Distribution in Cloud Environments

    Authors: Kishu Gupta, Deepika Saxena, Rishabh Gupta, Jatinder Kumar, Ashutosh Kumar Singh

    Abstract: Cloud computing is flourishing at a rapid pace. Significant consequences related to data security appear as a malicious user may get unauthorized access to sensitive data which may be misused, further. This raises an alarm-ringing situation to tackle the crucial issue related to data security and proactive malicious user prediction. This article proposes a Federated learning driven Malicious User… ▽ More

    Submitted 18 December, 2024; originally announced December 2024.

    Comments: 33 pages, 9 figures

    Journal ref: Fedmup: Federated learning driven malicious user prediction model for secure data distribution in cloud environments, Applied Soft Computing, vol. 157, p. 111519, 2024

  36. MAIDS: Malicious Agent Identification-based Data Security Model for Cloud Environments

    Authors: Kishu Gupta, Deepika Saxena, Rishabh Gupta, Ashutosh Kumar Singh

    Abstract: With the vigorous development of cloud computing, most organizations have shifted their data and applications to the cloud environment for storage, computation, and sharing purposes. During storage and data sharing across the participating entities, a malicious agent may gain access to outsourced data from the cloud environment. A malicious agent is an entity that deliberately breaches the data. T… ▽ More

    Submitted 18 December, 2024; originally announced December 2024.

    Comments: 28 pages, 10 figures

    Journal ref: Cluster Comput 27, 6167 to 6184, (2024)

  37. arXiv:2412.09696  [pdf, other

    cs.CV

    Soybean Maturity Prediction using 2D Contour Plots from Drone based Time Series Imagery

    Authors: Bitgoeul Kim, Samuel W. Blair, Talukder Z. Jubery, Soumik Sarkar, Arti Singh, Asheesh K. Singh, Baskar Ganapathysubramanian

    Abstract: Plant breeding programs require assessments of days to maturity for accurate selection and placement of entries in appropriate tests. In the early stages of the breeding pipeline, soybean breeding programs assign relative maturity ratings to experimental varieties that indicate their suitable maturity zones. Traditionally, the estimation of maturity value for breeding varieties has involved breede… ▽ More

    Submitted 12 December, 2024; originally announced December 2024.

  38. arXiv:2412.09121  [pdf, ps, other

    cs.LG cs.RO

    MMD-OPT : Maximum Mean Discrepancy Based Sample Efficient Collision Risk Minimization for Autonomous Driving

    Authors: Basant Sharma, Arun Kumar Singh

    Abstract: We propose MMD-OPT: a sample-efficient approach for minimizing the risk of collision under arbitrary prediction distribution of the dynamic obstacles. MMD-OPT is based on embedding distribution in Reproducing Kernel Hilbert Space (RKHS) and the associated Maximum Mean Discrepancy (MMD). We show how these two concepts can be used to define a sample efficient surrogate for collision risk estimate. W… ▽ More

    Submitted 7 July, 2025; v1 submitted 12 December, 2024; originally announced December 2024.

  39. arXiv:2412.08819  [pdf, other

    cs.LG

    HARP: A challenging human-annotated math reasoning benchmark

    Authors: Albert S. Yue, Lovish Madaan, Ted Moskovitz, DJ Strouse, Aaditya K. Singh

    Abstract: Math reasoning is becoming an ever increasing area of focus as we scale large language models. However, even the previously-toughest evals like MATH are now close to saturated by frontier models (90.0% for o1-mini and 86.5% for Gemini 1.5 Pro). We introduce HARP, Human Annotated Reasoning Problems (for Math), consisting of 5,409 problems from the US national math competitions (A(J)HSME, AMC, AIME,… ▽ More

    Submitted 11 December, 2024; originally announced December 2024.

    Comments: 28 pages, 17 figures

  40. arXiv:2412.03782  [pdf, ps, other

    cs.CL cs.LG

    The broader spectrum of in-context learning

    Authors: Andrew Kyle Lampinen, Stephanie C. Y. Chan, Aaditya K. Singh, Murray Shanahan

    Abstract: The ability of language models to learn a task from a few examples in context has generated substantial interest. Here, we provide a perspective that situates this type of supervised few-shot learning within a much broader spectrum of meta-learned in-context learning. Indeed, we suggest that any distribution of sequences in which context non-trivially decreases loss on subsequent predictions can b… ▽ More

    Submitted 5 June, 2025; v1 submitted 4 December, 2024; originally announced December 2024.

  41. arXiv:2412.02642  [pdf, other

    cs.CV

    Robust soybean seed yield estimation using high-throughput ground robot videos

    Authors: Jiale Feng, Samuel W. Blair, Timilehin Ayanlade, Aditya Balu, Baskar Ganapathysubramanian, Arti Singh, Soumik Sarkar, Asheesh K Singh

    Abstract: We present a novel method for soybean (Glycine max (L.) Merr.) yield estimation leveraging high throughput seed counting via computer vision and deep learning techniques. Traditional methods for collecting yield data are labor-intensive, costly, prone to equipment failures at critical data collection times, and require transportation of equipment across field sites. Computer vision, the field of t… ▽ More

    Submitted 3 December, 2024; originally announced December 2024.

    Comments: 23 pages, 12 figures, 2 tables

  42. arXiv:2412.01354  [pdf

    cs.CV cs.AI

    Integrative CAM: Adaptive Layer Fusion for Comprehensive Interpretation of CNNs

    Authors: Aniket K. Singh, Debasis Chaudhuri, Manish P. Singh, Samiran Chattopadhyay

    Abstract: With the growing demand for interpretable deep learning models, this paper introduces Integrative CAM, an advanced Class Activation Mapping (CAM) technique aimed at providing a holistic view of feature importance across Convolutional Neural Networks (CNNs). Traditional gradient-based CAM methods, such as Grad-CAM and Grad-CAM++, primarily use final layer activations to highlight regions of interes… ▽ More

    Submitted 2 December, 2024; originally announced December 2024.

  43. arXiv:2411.03923  [pdf, other

    cs.CL

    Evaluation data contamination in LLMs: how do we measure it and (when) does it matter?

    Authors: Aaditya K. Singh, Muhammed Yusuf Kocyigit, Andrew Poulton, David Esiobu, Maria Lomeli, Gergely Szilvasy, Dieuwke Hupkes

    Abstract: Hampering the interpretation of benchmark scores, evaluation data contamination has become a growing concern in the evaluation of LLMs, and an active area of research studies its effects. While evaluation data contamination is easily understood intuitively, it is surprisingly difficult to define precisely which samples should be considered contaminated and, consequently, how it impacts benchmark s… ▽ More

    Submitted 6 November, 2024; originally announced November 2024.

  44. arXiv:2410.19712  [pdf, other

    cs.RO

    DA-VIL: Adaptive Dual-Arm Manipulation with Reinforcement Learning and Variable Impedance Control

    Authors: Md Faizal Karim, Shreya Bollimuntha, Mohammed Saad Hashmi, Autrio Das, Gaurav Singh, Srinath Sridhar, Arun Kumar Singh, Nagamanikandan Govindan, K Madhava Krishna

    Abstract: Dual-arm manipulation is an area of growing interest in the robotics community. Enabling robots to perform tasks that require the coordinated use of two arms, is essential for complex manipulation tasks such as handling large objects, assembling components, and performing human-like interactions. However, achieving effective dual-arm manipulation is challenging due to the need for precise coordina… ▽ More

    Submitted 25 October, 2024; originally announced October 2024.

  45. arXiv:2410.18751  [pdf, ps, other

    cs.LO q-fin.TR

    Double Auctions: Formalization and Automated Checkers

    Authors: Mohit Garg, N. Raja, Suneel Sarswat, Abhishek Kr Singh

    Abstract: Double auctions are widely used in financial markets, such as those for stocks, derivatives, currencies, and commodities, to match demand and supply. Once all buyers and sellers have placed their trade requests, the exchange determines how these requests are to be matched. The two most common objectives for determining the matching are maximizing trade volume at a uniform price and maximizing trad… ▽ More

    Submitted 24 October, 2024; originally announced October 2024.

    Comments: 23 pages, Preliminary version of this work was published in ITP 2021

    ACM Class: F.3.1; K.4.4

  46. arXiv:2410.18494  [pdf, other

    cs.SE cs.LG cs.PL

    Assured Automatic Programming via Large Language Models

    Authors: Martin Mirchev, Andreea Costea, Abhishek Kr Singh, Abhik Roychoudhury

    Abstract: With the advent of AI-based coding engines, it is possible to convert natural language requirements to executable code in standard programming languages. However, AI-generated code can be unreliable, and the natural language requirements driving this code may be ambiguous. In other words, the intent may not be accurately captured in the code generated from AI-coding engines like Copilot. The goal… ▽ More

    Submitted 4 November, 2024; v1 submitted 24 October, 2024; originally announced October 2024.

  47. arXiv:2410.09339  [pdf

    cs.CV cs.AI cs.LG

    Advanced Gesture Recognition for Autism Spectrum Disorder Detection: Integrating YOLOv7, Video Augmentation, and VideoMAE for Naturalistic Video Analysis

    Authors: Amit Kumar Singh, Vrijendra Singh

    Abstract: Deep learning and contactless sensing technologies have significantly advanced the automated assessment of human behaviors in healthcare. In the context of autism spectrum disorder (ASD), repetitive motor behaviors such as spinning, head banging, and arm flapping are key indicators for diagnosis. This study focuses on distinguishing between children with ASD and typically developed (TD) peers by a… ▽ More

    Submitted 17 August, 2025; v1 submitted 11 October, 2024; originally announced October 2024.

    Comments: Change Note for Version 3 - Extended Study (ASD vs TD Classification) This version extends v2 from 3-class gesture recognition to binary ASD vs TD detection, using expanded SSBD variants, a new TD class, improved preprocessing, and updated metrics (95% acc, 0.93 prec, 0.94 rec, 0.94 F1). Methodology remains YOLOv7 + VideoMAE + augmentation

  48. A Global Medical Data Security and Privacy Preserving Standards Identification Framework for Electronic Healthcare Consumers

    Authors: Vinaytosh Mishra, Kishu Gupta, Deepika Saxena, Ashutosh Kumar Singh

    Abstract: Electronic Health Records (EHR) are crucial for the success of digital healthcare, with a focus on putting consumers at the center of this transformation. However, the digitalization of healthcare records brings along security and privacy risks for personal data. The major concern is that different countries have varying standards for the security and privacy of medical data. This paper proposed a… ▽ More

    Submitted 4 October, 2024; originally announced October 2024.

    Journal ref: A Global Medical Data Security and Privacy Preserving Standards Identification Framework for Electronic Healthcare Consumers, in IEEE Transactions on Consumer Electronics, vol. 70, no. 1, pp. 4379-4387, Feb. 2024

  49. An Intelligent Quantum Cyber-Security Framework for Healthcare Data Management

    Authors: Kishu Gupta, Deepika Saxena, Pooja Rani, Jitendra Kumar, Aaisha Makkar, Ashutosh Kumar Singh, Chung-Nan Lee

    Abstract: Digital healthcare is essential to facilitate consumers to access and disseminate their medical data easily for enhanced medical care services. However, the significant concern with digitalization across healthcare systems necessitates for a prompt, productive, and secure storage facility along with a vigorous communication strategy, to stimulate sensitive digital healthcare data sharing and proac… ▽ More

    Submitted 4 October, 2024; originally announced October 2024.

    Journal ref: IEEE Transactions on Automation Science and Engineering (2024)

  50. arXiv:2409.16011  [pdf, other

    cs.RO math.OC

    CrowdSurfer: Sampling Optimization Augmented with Vector-Quantized Variational AutoEncoder for Dense Crowd Navigation

    Authors: Naman Kumar, Antareep Singha, Laksh Nanwani, Dhruv Potdar, Tarun R, Fatemeh Rastgar, Simon Idoko, Arun Kumar Singh, K. Madhava Krishna

    Abstract: Navigation amongst densely packed crowds remains a challenge for mobile robots. The complexity increases further if the environment layout changes, making the prior computed global plan infeasible. In this paper, we show that it is possible to dramatically enhance crowd navigation by just improving the local planner. Our approach combines generative modelling with inference time optimization to ge… ▽ More

    Submitted 7 March, 2025; v1 submitted 24 September, 2024; originally announced September 2024.

    Comments: Accepted at IEEE ICRA 2025