
Showing 1–50 of 273 results for author: Atul

  1. arXiv:2506.23111  [pdf, ps, other]

    cs.CL

    FairI Tales: Evaluation of Fairness in Indian Contexts with a Focus on Bias and Stereotypes

    Authors: Janki Atul Nawale, Mohammed Safi Ur Rahman Khan, Janani D, Mansi Gupta, Danish Pruthi, Mitesh M. Khapra

    Abstract: Existing studies on fairness are largely Western-focused, making them inadequate for culturally diverse countries such as India. To address this gap, we introduce INDIC-BIAS, a comprehensive India-centric benchmark designed to evaluate fairness of LLMs across 85 identity groups encompassing diverse castes, religions, regions, and tribes. We first consult domain experts to curate over 1,800 socio-c…

    Submitted 29 June, 2025; originally announced June 2025.

    Comments: Accepted in ACL 2025

  2. arXiv:2506.19863  [pdf, ps, other]

    physics.comp-ph cs.AI

    Exploring the Capabilities of the Frontier Large Language Models for Nuclear Energy Research

    Authors: Ahmed Almeldein, Mohammed Alnaggar, Rick Archibald, Tom Beck, Arpan Biswas, Rike Bostelmann, Wes Brewer, Chris Bryan, Christopher Calle, Cihangir Celik, Rajni Chahal, Jong Youl Choi, Arindam Chowdhury, Mark Cianciosa, Franklin Curtis, Gregory Davidson, Sebastian De Pascuale, Lisa Fassino, Ana Gainaru, Yashika Ghai, Luke Gibson, Qian Gong, Christopher Greulich, Scott Greenwood, Cory Hauck, et al. (25 additional authors not shown)

    Abstract: The AI for Nuclear Energy workshop at Oak Ridge National Laboratory evaluated the potential of Large Language Models (LLMs) to accelerate fusion and fission research. Fourteen interdisciplinary teams explored diverse nuclear science challenges using ChatGPT, Gemini, Claude, and other AI models over a single day. Applications ranged from developing foundation models for fusion reactor control to au…

    Submitted 26 June, 2025; v1 submitted 10 June, 2025; originally announced June 2025.

  3. arXiv:2506.19583  [pdf, ps, other]

    cs.LG physics.plasm-ph

    ConStellaration: A dataset of QI-like stellarator plasma boundaries and optimization benchmarks

    Authors: Santiago A. Cadena, Andrea Merlo, Emanuel Laude, Alexander Bauer, Atul Agrawal, Maria Pascu, Marija Savtchouk, Enrico Guiraud, Lukas Bonauer, Stuart Hudson, Markus Kaiser

    Abstract: Stellarators are magnetic confinement devices under active development to deliver steady-state carbon-free fusion energy. Their design involves a high-dimensional, constrained optimization problem that requires expensive physics simulations and significant domain expertise. Recent advances in plasma physics and open-source tools have made stellarator optimization more accessible. However, broader…

    Submitted 24 June, 2025; originally announced June 2025.

  4. arXiv:2506.18789  [pdf, ps, other]

    cs.LG cs.AI

    Shift Happens: Mixture of Experts based Continual Adaptation in Federated Learning

    Authors: Rahul Atul Bhope, K. R. Jayaram, Praveen Venkateswaran, Nalini Venkatasubramanian

    Abstract: Federated Learning (FL) enables collaborative model training across decentralized clients without sharing raw data, yet faces significant challenges in real-world settings where client data distributions evolve dynamically over time. This paper tackles the critical problem of covariate and label shifts in streaming FL environments, where non-stationary data distributions degrade model performance…

    Submitted 23 June, 2025; originally announced June 2025.

  5. arXiv:2506.14900  [pdf, ps, other]

    cs.CL

    Adverse Event Extraction from Discharge Summaries: A New Dataset, Annotation Scheme, and Initial Findings

    Authors: Imane Guellil, Salomé Andres, Atul Anand, Bruce Guthrie, Huayu Zhang, Abul Hasan, Honghan Wu, Beatrice Alex

    Abstract: In this work, we present a manually annotated corpus for Adverse Event (AE) extraction from discharge summaries of elderly patients, a population often underrepresented in clinical NLP resources. The dataset includes 14 clinically significant AEs, such as falls, delirium, and intracranial haemorrhage, along with contextual attributes like negation, diagnosis type, and in-hospital occurrence. Unique…

    Submitted 17 June, 2025; originally announced June 2025.

    Comments: Accepted and will be published at ACL2025 (main conference)

  6. arXiv:2506.13912  [pdf, ps, other]

    cs.SI cs.LG

    Density-aware Walks for Coordinated Campaign Detection

    Authors: Atul Anand Gopalakrishnan, Jakir Hossain, Tuğrulcan Elmas, Ahmet Erdem Sarıyüce

    Abstract: Coordinated campaigns frequently exploit social media platforms by artificially amplifying topics, making inauthentic trends appear organic, and misleading users into engagement. Distinguishing these coordinated efforts from genuine public discourse remains a significant challenge due to the sophisticated nature of such attacks. Our work focuses on detecting coordinated campaigns by modeling the p…

    Submitted 16 June, 2025; originally announced June 2025.

    Comments: 16 Pages. Accepted at ECML-PKDD 2025

  7. arXiv:2506.10999  [pdf]

    cs.SE cs.AI

    Automated Validation of COBOL to Java Transformation

    Authors: Atul Kumar, Diptikalyan Saha, Toshikai Yasue, Kohichi Ono, Saravanan Krishnan, Sandeep Hans, Fumiko Satoh, Gerald Mitchell, Sachin Kumar

    Abstract: Recent advances in Large Language Model (LLM) based Generative AI techniques have made it feasible to translate enterprise-level code from legacy languages such as COBOL to modern languages such as Java or Python. While the results of LLM-based automatic transformation are encouraging, the resulting code cannot be trusted to correctly translate the original code. We propose a framework and a tool t…

    Submitted 14 April, 2025; originally announced June 2025.

    Comments: arXiv admin note: text overlap with arXiv:2504.10548

    Journal ref: ASE 2024

  8. arXiv:2506.08504  [pdf, ps, other]

    cs.CL cs.AI cs.LG

    CoMuMDR: Code-mixed Multi-modal Multi-domain corpus for Discourse paRsing in conversations

    Authors: Divyaksh Shukla, Ritesh Baviskar, Dwijesh Gohil, Aniket Tiwari, Atul Shree, Ashutosh Modi

    Abstract: Discourse parsing is an important task useful for NLU applications such as summarization, machine comprehension, and emotion recognition. The current discourse parsing datasets based on conversations consist of written English dialogues restricted to a single domain. In this resource paper, we introduce CoMuMDR: Code-mixed Multi-modal Multi-domain corpus for Discourse paRsing in conversations. Th…

    Submitted 10 June, 2025; originally announced June 2025.

    Comments: Accepted at ACL Findings 2025 (16 pages: 5 pages main content + 3 pages references + 8 pages appendix)

  9. arXiv:2506.06003  [pdf, ps, other]

    cs.LG cs.CR

    What Really is a Member? Discrediting Membership Inference via Poisoning

    Authors: Neal Mangaokar, Ashish Hooda, Zhuohang Li, Bradley A. Malin, Kassem Fawaz, Somesh Jha, Atul Prakash, Amrita Roy Chowdhury

    Abstract: Membership inference tests aim to determine whether a particular data point was included in a language model's training set. However, recent works have shown that such tests often fail under the strict definition of membership based on exact matching, and have suggested relaxing this definition to include semantic neighbors as members as well. In this work, we show that membership inference tests…

    Submitted 6 June, 2025; originally announced June 2025.

  10. arXiv:2506.00316  [pdf, ps, other]

    cs.LG math.ST stat.ML

    Active Learning via Regression Beyond Realizability

    Authors: Atul Ganju, Shashaank Aiyer, Ved Sriraman, Karthik Sridharan

    Abstract: We present a new active learning framework for multiclass classification based on surrogate risk minimization that operates beyond the standard realizability assumption. Existing surrogate-based active learning algorithms crucially rely on realizability (the assumption that the optimal surrogate predictor lies within the model class), limiting their applicability in pr…

    Submitted 30 May, 2025; originally announced June 2025.

  11. arXiv:2505.22550  [pdf]

    cs.IR

    Domain specific ontologies from Linked Open Data (LOD)

    Authors: Rosario Uceda-Sosa, Nandana Mihindukulasooriya, Atul Kumar, Sahil Bansal, Seema Nagar

    Abstract: Logical and probabilistic reasoning tasks that require a deeper knowledge of semantics are increasingly relying on general purpose ontologies such as Wikidata and DBpedia. However, tasks such as entity disambiguation and linking may benefit from domain specific knowledge graphs, which make it more efficient to consume the knowledge and easier to extend with proprietary content. We discuss our expe…

    Submitted 28 May, 2025; originally announced May 2025.

  12. arXiv:2505.18058  [pdf]

    eess.IV cs.CV

    A Foundation Model Framework for Multi-View MRI Classification of Extramural Vascular Invasion and Mesorectal Fascia Invasion in Rectal Cancer

    Authors: Yumeng Zhang, Zohaib Salahuddin, Danial Khan, Shruti Atul Mali, Henry C. Woodruff, Sina Amirrajab, Eduardo Ibor-Crespo, Ana Jimenez-Pastor, Luis Marti-Bonmati, Philippe Lambin

    Abstract: Background: Accurate MRI-based identification of extramural vascular invasion (EVI) and mesorectal fascia invasion (MFI) is pivotal for risk-stratified management of rectal cancer, yet visual assessment is subjective and vulnerable to inter-institutional variability. Purpose: To develop and externally evaluate a multicenter, foundation-model-driven framework that automatically classifies EVI and M…

    Submitted 23 May, 2025; originally announced May 2025.

    Comments: 22 pages, 8 figures

  13. arXiv:2505.18019  [pdf, other]

    cs.SE cs.AI

    LLM assisted web application functional requirements generation: A case study of four popular LLMs over a Mess Management System

    Authors: Rashmi Gupta, Aditya K Gupta, Aarav Jain, Avinash C Pandey, Atul Gupta

    Abstract: Like any other discipline, Large Language Models (LLMs) have significantly impacted software engineering by helping developers generate the required artifacts across various phases of software development. This paper presents a case study comparing the performance of popular LLMs GPT, Claude, Gemini, and DeepSeek in generating functional specifications that include use cases, business rules, and c…

    Submitted 23 May, 2025; originally announced May 2025.

    Comments: 11 pages, 12 figures, Accepted in EASE 2025 https://conf.researchr.org/details/ease-2025/ease-2025-ai-models---data/11/LLM-assisted-web-application-functional-requirements-generation-A-case-study-of-fou

  14. arXiv:2505.17971  [pdf]

    eess.IV cs.CV

    Explainable Anatomy-Guided AI for Prostate MRI: Foundation Models and In Silico Clinical Trials for Virtual Biopsy-based Risk Assessment

    Authors: Danial Khan, Zohaib Salahuddin, Yumeng Zhang, Sheng Kuang, Shruti Atul Mali, Henry C. Woodruff, Sina Amirrajab, Rachel Cavill, Eduardo Ibor-Crespo, Ana Jimenez-Pastor, Adrian Galiana-Bordera, Paula Jimenez Gomez, Luis Marti-Bonmati, Philippe Lambin

    Abstract: We present a fully automated, anatomically guided deep learning pipeline for prostate cancer (PCa) risk stratification using routine MRI. The pipeline integrates three key components: an nnU-Net module for segmenting the prostate gland and its zones on axial T2-weighted MRI; a classification module based on the UMedPT Swin Transformer foundation model, fine-tuned on 3D patches with optional anatom…

    Submitted 23 May, 2025; originally announced May 2025.

  15. arXiv:2505.17893  [pdf]

    cs.CV

    Pixels to Prognosis: Harmonized Multi-Region CT-Radiomics and Foundation-Model Signatures Across Multicentre NSCLC Data

    Authors: Shruti Atul Mali, Zohaib Salahuddin, Danial Khan, Yumeng Zhang, Henry C. Woodruff, Eduardo Ibor-Crespo, Ana Jimenez-Pastor, Luis Marti-Bonmati, Philippe Lambin

    Abstract: Purpose: To evaluate the impact of harmonization and multi-region CT image feature integration on survival prediction in non-small cell lung cancer (NSCLC) patients, using handcrafted radiomics, pretrained foundation model (FM) features, and clinical data from a multicenter dataset. Methods: We analyzed CT scans and clinical data from 876 NSCLC patients (604 training, 272 test) across five cente…

    Submitted 23 May, 2025; originally announced May 2025.

  16. arXiv:2505.06885  [pdf]

    cs.SE cs.IR

    Incremental Analysis of Legacy Applications Using Knowledge Graphs for Application Modernization

    Authors: Saravanan Krishnan, Amith Singhee, Keerthi Narayan Raghunath, Alex Mathai, Atul Kumar, David Wenk

    Abstract: Industries such as banking, telecom, and airlines often have large software systems that are several decades old. Many of these systems are written in old programming languages such as COBOL, PL/1, Assembler, etc. In many cases, the documentation is not updated, and those who developed/designed these systems are no longer around. Understanding these systems for either modernization or even regular…

    Submitted 11 May, 2025; originally announced May 2025.

  17. arXiv:2504.20988  [pdf, other]

    cs.LG cs.AI cs.DC

    Hubs and Spokes Learning: Efficient and Scalable Collaborative Machine Learning

    Authors: Atul Sharma, Kavindu Herath, Saurabh Bagchi, Chaoyue Liu, Somali Chaterji

    Abstract: We introduce the Hubs and Spokes Learning (HSL) framework, a novel paradigm for collaborative machine learning that combines the strengths of Federated Learning (FL) and Decentralized Learning (P2PL). HSL employs a two-tier communication structure that avoids the single point of failure inherent in FL and outperforms the state-of-the-art P2PL framework, Epidemic Learning Local (ELL). At equal comm…

    Submitted 29 April, 2025; originally announced April 2025.

  18. arXiv:2504.12515  [pdf, other]

    cs.CV

    Event Quality Score (EQS): Assessing the Realism of Simulated Event Camera Streams via Distances in Latent Space

    Authors: Kaustav Chanda, Aayush Atul Verma, Arpitsinh Vaghela, Yezhou Yang, Bharatesh Chakravarthi

    Abstract: Event cameras promise a paradigm shift in vision sensing with their low latency, high dynamic range, and asynchronous nature of events. Unfortunately, the scarcity of high-quality labeled datasets hinders their widespread adoption in deep learning-driven computer vision. To mitigate this, several simulators have been proposed to generate synthetic event data for training models for detection and e…

    Submitted 20 April, 2025; v1 submitted 16 April, 2025; originally announced April 2025.

    Comments: Accepted at 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW); Fifth International Workshop on Event-Based Vision

  19. arXiv:2504.10548  [pdf]

    cs.SE cs.AI

    Automated Testing of COBOL to Java Transformation

    Authors: Sandeep Hans, Atul Kumar, Toshikai Yasue, Kouichi Ono, Saravanan Krishnan, Devika Sondhi, Fumiko Satoh, Gerald Mitchell, Sachin Kumar, Diptikalyan Saha

    Abstract: Recent advances in Large Language Model (LLM) based Generative AI techniques have made it feasible to translate enterprise-level code from legacy languages such as COBOL to modern languages such as Java or Python. While the results of LLM-based automatic transformation are encouraging, the resulting code cannot be trusted to correctly translate the original code, making manual validation of transl…

    Submitted 14 April, 2025; originally announced April 2025.

  20. arXiv:2504.10335  [pdf]

    cs.CL

    MorphTok: Morphologically Grounded Tokenization for Indian Languages

    Authors: Maharaj Brahma, N J Karthika, Atul Singh, Devaraj Adiga, Smruti Bhate, Ganesh Ramakrishnan, Rohit Saluja, Maunendra Sankar Desarkar

    Abstract: Tokenization is a crucial step in NLP, especially with the rise of large language models (LLMs), impacting downstream performance, computational cost, and efficiency. Existing LLMs rely on the classical Byte-pair Encoding (BPE) algorithm for subword tokenization that greedily merges frequent character bigrams. This often leads to segmentation that does not align with linguistically meaningful unit…

    Submitted 14 April, 2025; originally announced April 2025.

  21. arXiv:2504.09877  [pdf]

    cs.IR cs.AI

    Constructing Micro Knowledge Graphs from Technical Support Documents

    Authors: Atul Kumar, Nisha Gupta, Saswati Dana

    Abstract: Short technical support pages such as IBM Technotes are quite common in the technical support domain. These pages can be very useful as knowledge sources for technical support applications such as chatbots, search engines and question-answering (QA) systems. Information extracted from documents to drive technical support applications is often stored in the form of a Knowledge Graph (KG). Building KG…

    Submitted 14 April, 2025; originally announced April 2025.

  22. arXiv:2504.06622  [pdf, other]

    quant-ph cs.LG

    Quantum neural networks facilitating quantum state classification

    Authors: Diksha Sharma, Vivek Balasaheb Sabale, Thirumalai M., Atul Kumar

    Abstract: The classification of quantum states into distinct classes poses a significant challenge. In this study, we address this problem using quantum neural networks in combination with a problem-inspired circuit and customised as well as predefined ansätz. To facilitate the resource-efficient quantum state classification, we construct the dataset of quantum states using the proposed problem-inspired cir…

    Submitted 9 April, 2025; originally announced April 2025.

  23. arXiv:2503.14095  [pdf, other]

    physics.ao-ph cs.LG

    Towards Location-Specific Precipitation Projections Using Deep Neural Networks

    Authors: Bipin Kumar, Bhvisy Kumar Yadav, Soumypdeep Mukhopadhyay, Rakshit Rohan, Bhupendra Bahadur Singh, Rajib Chattopadhyay, Nagraju Chilukoti, Atul Kumar Sahai

    Abstract: Accurate precipitation estimates at individual locations are crucial for weather forecasting and spatial analysis. This study presents a paradigm shift by leveraging Deep Neural Networks (DNNs) to surpass traditional methods like Kriging for station-specific precipitation approximation. We propose two innovative NN architectures: one utilizing precipitation, elevation, and location, and another in…

    Submitted 18 March, 2025; originally announced March 2025.

    Comments: 21 pages, 9 figures

  24. arXiv:2503.00599  [pdf, other]

    cs.SI cs.LG

    Large Engagement Networks for Classifying Coordinated Campaigns and Organic Twitter Trends

    Authors: Atul Anand Gopalakrishnan, Jakir Hossain, Tugrulcan Elmas, Ahmet Erdem Sariyuce

    Abstract: Social media users and inauthentic accounts, such as bots, may coordinate in promoting their topics. Such topics may give the impression that they are organically popular among the public, even though they are astroturfing campaigns that are centrally managed. It is challenging to predict if a topic is organic or a coordinated campaign due to the lack of reliable ground truth. In this paper, we cr…

    Submitted 28 March, 2025; v1 submitted 1 March, 2025; originally announced March 2025.

    Comments: 14 Pages

    Journal ref: ICWSM 2025

  25. arXiv:2502.11306  [pdf, other]

    cs.CL cs.LG

    Smoothing Out Hallucinations: Mitigating LLM Hallucination with Smoothed Knowledge Distillation

    Authors: Hieu Nguyen, Zihao He, Shoumik Atul Gandre, Ujjwal Pasupulety, Sharanya Kumari Shivakumar, Kristina Lerman

    Abstract: Large language models (LLMs) often suffer from hallucination, generating factually incorrect or ungrounded content, which limits their reliability in high-stakes applications. A key factor contributing to hallucination is the use of hard labels during training, which enforce deterministic supervision, encourage overconfidence, and disregard the uncertainty inherent in natural language. To address…

    Submitted 16 February, 2025; originally announced February 2025.

  26. arXiv:2502.11287  [pdf, other]

    cs.CV

    MC-BEVRO: Multi-Camera Bird Eye View Road Occupancy Detection for Traffic Monitoring

    Authors: Arpitsinh Vaghela, Duo Lu, Aayush Atul Verma, Bharatesh Chakravarthi, Hua Wei, Yezhou Yang

    Abstract: Single camera 3D perception for traffic monitoring faces significant challenges due to occlusion and limited field of view. Moreover, fusing information from multiple cameras at the image feature level is difficult because of different view angles. Further, the necessity for practical implementation and compatibility with existing traffic infrastructure compounds these challenges. To address these…

    Submitted 16 February, 2025; originally announced February 2025.

  27. arXiv:2502.10953  [pdf, other]

    cs.SE cs.AI

    Empirical evaluation of LLMs in predicting fixes of Configuration bugs in Smart Home System

    Authors: Sheikh Moonwara Anjum Monisha, Atul Bharadwaj

    Abstract: This empirical study evaluates the effectiveness of Large Language Models (LLMs) in predicting fixes for configuration bugs in smart home systems. The research analyzes three prominent LLMs - GPT-4, GPT-4o (GPT-4 Turbo), and Claude 3.5 Sonnet - using four distinct prompt designs to assess their ability to identify appropriate fix strategies and generate correct solutions. The study utilized a data…

    Submitted 15 February, 2025; originally announced February 2025.

  28. arXiv:2502.04718  [pdf]

    cs.CL

    Evaluating Text Style Transfer Evaluation: Are There Any Reliable Metrics?

    Authors: Sourabrata Mukherjee, Atul Kr. Ojha, John P. McCrae, Ondrej Dusek

    Abstract: Text style transfer (TST) is the task of transforming a text to reflect a particular style while preserving its original content. Evaluating TST outputs is a multidimensional challenge, requiring the assessment of style transfer accuracy, content preservation, and naturalness. Using human evaluation is ideal but costly, as is common in other natural language processing (NLP) tasks; however, automa…

    Submitted 23 April, 2025; v1 submitted 7 February, 2025; originally announced February 2025.

    Comments: Accepted at NAACL SRW 2025

  29. arXiv:2502.03472  [pdf, ps, other]

    cs.CY

    Powering LLM Regulation through Data: Bridging the Gap from Compute Thresholds to Customer Experiences

    Authors: Wesley Pasfield

    Abstract: The rapid advancement of Large Language Models (LLMs) has created a critical gap in consumer protection due to the lack of standardized certification processes for LLM-powered Artificial Intelligence (AI) systems. This paper argues that current regulatory approaches, which focus on compute-level thresholds and generalized model evaluations, are insufficient to ensure the safety and effectiveness o…

    Submitted 12 January, 2025; originally announced February 2025.

    Comments: Presented at the 2nd Workshop on Regulatable ML at NeurIPS 2024

  30. arXiv:2502.03470  [pdf, other]

    cs.CY

    Responsible Artificial Intelligence (RAI) in U.S. Federal Government : Principles, Policies, and Practices

    Authors: Atul Rawal, Katie Johnson, Curtis Mitchell, Michael Walton, Diamond Nwankwo

    Abstract: Artificial intelligence (AI) and machine learning (ML) have made tremendous advancements in the past decades. From simple recommendation systems to more complex tumor identification systems, AI/ML systems have been utilized in a plethora of applications. This rapid growth of AI/ML and its proliferation in numerous private and public sector applications, while successful, has also opened new challe…

    Submitted 12 January, 2025; originally announced February 2025.

    Comments: Presented at the 2nd Workshop on Regulatable ML at NeurIPS 2024

  31. Harmful Terms and Where to Find Them: Measuring and Modeling Unfavorable Financial Terms and Conditions in Shopping Websites at Scale

    Authors: Elisa Tsai, Neal Mangaokar, Boyuan Zheng, Haizhong Zheng, Atul Prakash

    Abstract: Terms and conditions for online shopping websites often contain terms that can have significant financial consequences for customers. Despite their impact, there is currently no comprehensive understanding of the types and potential risks associated with unfavorable financial terms. Furthermore, there are no publicly available detection systems or datasets to systematically identify or mitigate th…

    Submitted 3 February, 2025; originally announced February 2025.

    Comments: This paper has been accepted to The Web Conference 2025 (WWW '25)

    ACM Class: H.3.3; K.4.1; K.4.2; I.2.7

  32. arXiv:2501.15030  [pdf, other]

    cs.LG cs.AI cs.CL cs.PF

    OptiSeq: Ordering Examples On-The-Fly for In-Context Learning

    Authors: Rahul Atul Bhope, Praveen Venkateswaran, K. R. Jayaram, Vatche Isahagian, Vinod Muthusamy, Nalini Venkatasubramanian

    Abstract: Developers using LLMs and LLM-based agents in their applications have provided plenty of anecdotal evidence that in-context-learning (ICL) is fragile. In this paper, we show that in addition to the quantity and quality of examples, the order in which the in-context examples are listed in the prompt affects the output of the LLM and, consequently, their performance. While prior work has explored im…

    Submitted 18 February, 2025; v1 submitted 24 January, 2025; originally announced January 2025.

  33. arXiv:2501.11978  [pdf, ps, other]

    cs.IT math.CO

    Weight Distribution of the Weighted Coordinates Poset Block Space and Singleton Bound

    Authors: Atul Kumar Shriwastva, R. S. Selvaraj

    Abstract: In this paper, we determine the complete weight distribution of the space $ \mathbb{F}_q^N $ endowed by the weighted coordinates poset block metric ($(P,w,π)$-metric), also known as the $(P,w,π)$-space, thereby obtaining it for $(P,w)$-space, $(P,π)$-space, $π$-space, and $P$-space as special cases. Further, when $P$ is a chain, the resulting space is called as Niederreiter-Rosenbloom-Tsfasman (NR…

    Submitted 21 January, 2025; originally announced January 2025.

    Comments: 28 pages. arXiv admin note: substantial text overlap with arXiv:2210.12183

    MSC Class: 94B05; 15A03; 06A06

  34. arXiv:2501.09359  [pdf, other]

    cs.IR

    A Multi-tiered Solution for Personalized Baggage Item Recommendations using FastText and Association Rule Mining

    Authors: Mudavath Ravi, Atul Negi

    Abstract: This paper introduces an intelligent baggage item recommendation system to optimize packing for air travelers by providing tailored suggestions based on specific travel needs and destinations. Using FastText word embeddings and Association Rule Mining (ARM), the system ensures efficient luggage space utilization, compliance with weight limits, and an enhanced travel experience. The methodology com…

    Submitted 16 January, 2025; originally announced January 2025.

  35. arXiv:2501.01464  [pdf, other]

    eess.IV cs.CV cs.LG physics.med-ph

    Estimation of 3T MR images from 1.5T images regularized with Physics based Constraint

    Authors: Prabhjot Kaur, Atul Singh Minhas, Chirag Kamal Ahuja, Anil Kumar Sao

    Abstract: Limited accessibility to high field MRI scanners (such as 7T, 11T) has motivated the development of post-processing methods to improve low field images. Several existing post-processing methods have shown the feasibility to improve 3T images to produce 7T-like images [3,18]. It has been observed that improving lower field (LF, <=1.5T) images comes with additional challenges due to poor image quali…

    Submitted 31 December, 2024; originally announced January 2025.

    Comments: conference paper

    Journal ref: Medical Image Computing and Computer Assisted Intervention - MICCAI 2023. Lecture Notes in Computer Science, vol 14229. Springer, Cham

  36. arXiv:2412.16607  [pdf, other]

    cs.CR

    Improving Discovery of Known Software Vulnerability For Enhanced Cybersecurity

    Authors: Devesh Sawant, Manjesh K. Hanawal, Atul Kabra

    Abstract: Software vulnerabilities are commonly exploited as attack vectors in cyberattacks. Hence, it is crucial to identify vulnerable software configurations early to apply preventive measures. Effective vulnerability detection relies on identifying software vulnerabilities through standardized identifiers such as Common Platform Enumeration (CPE) strings. However, non-standardized CPE strings issued by…

    Submitted 21 December, 2024; originally announced December 2024.

  37. arXiv:2412.05026  [pdf, ps, other]

    quant-ph cs.CR

    Quantum Security Analysis of the Key-Alternating Ciphers

    Authors: Chen Bai, Mehdi Esmaili, Atul Mantri

    Abstract: In this work, we study the quantum security of key-alternating ciphers (KAC), a natural multi-round generalization of the Even-Mansour (EM) cipher underlying many block cipher constructions, including AES. While the classical security of KAC and the quantum security of the $1$-round KAC (i.e. Even-Mansour) cipher are well understood, the quantum resistance of multi-round KAC remains largely unexpl…

    Submitted 23 May, 2025; v1 submitted 6 December, 2024; originally announced December 2024.

    Comments: (v2) Added new lower bound results for 2-KAC in the Q1 and Q2 models. Improved presentation throughout

  38. arXiv:2412.02052  [pdf, other]

    eess.IV cs.CV

    FoveaSPAD: Exploiting Depth Priors for Adaptive and Efficient Single-Photon 3D Imaging

    Authors: Justin Folden, Atul Ingle, Sanjeev J. Koppal

    Abstract: Fast, efficient, and accurate depth-sensing is important for safety-critical applications such as autonomous vehicles. Direct time-of-flight LiDAR has the potential to fulfill these demands, thanks to its ability to provide high-precision depth measurements at long standoff distances. While conventional LiDAR relies on avalanche photodiodes (APDs), single-photon avalanche diodes (SPADs) are an eme…

    Submitted 2 December, 2024; originally announced December 2024.

  39. arXiv:2411.14611  [pdf, other]

    cs.SE cs.LG

    CodeSAM: Source Code Representation Learning by Infusing Self-Attention with Multi-Code-View Graphs

    Authors: Alex Mathai, Kranthi Sedamaki, Debeshee Das, Noble Saji Mathews, Srikanth Tamilselvam, Sridhar Chimalakonda, Atul Kumar

    Abstract: Machine Learning (ML) for software engineering (SE) has gained prominence due to its ability to significantly enhance the performance of various SE applications. This progress is largely attributed to the development of generalizable source code representations that effectively capture the syntactic and semantic characteristics of code. In recent years, pre-trained transformer-based models, inspir…

    Submitted 21 November, 2024; originally announced November 2024.

  40. arXiv:2411.10817  [pdf, other]

    cs.LG q-bio.QM stat.ML

    Conformation Generation using Transformer Flows

    Authors: Sohil Atul Shah, Vladlen Koltun

    Abstract: Estimating three-dimensional conformations of a molecular graph allows insight into the molecule's biological and chemical functions. Fast generation of valid conformations is thus central to molecular modeling. Recent advances in graph-based deep networks have accelerated conformation generation from hours to seconds. However, current network architectures do not scale well to large molecules. He…

    Submitted 16 February, 2025; v1 submitted 16 November, 2024; originally announced November 2024.

    Comments: This paper was completed in December 2022. Code available at https://github.com/IntelLabs/ConfFlow

  41. arXiv:2411.08400  [pdf, other]

    cs.RO cs.AI

    BAMAX: Backtrack Assisted Multi-Agent Exploration using Reinforcement Learning

    Authors: Geetansh Kalra, Amit Patel, Atul Chaudhari, Divye Singh

    Abstract: Autonomous robots collaboratively exploring an unknown environment is still an open problem. The problem has its roots in coordination among non-stationary agents, each with only a partial view of information. The problem is compounded when the multiple robots must completely explore the environment. In this paper, we introduce Backtrack Assisted Multi-Agent Exploration using Reinforcement Learnin…

    Submitted 13 November, 2024; originally announced November 2024.

  42. arXiv:2411.05088  [pdf]

    cs.CL

    Findings of the IWSLT 2024 Evaluation Campaign

    Authors: Ibrahim Said Ahmad, Antonios Anastasopoulos, Ondřej Bojar, Claudia Borg, Marine Carpuat, Roldano Cattoni, Mauro Cettolo, William Chen, Qianqian Dong, Marcello Federico, Barry Haddow, Dávid Javorský, Mateusz Krubiński, Tsz Kin Lam, Xutai Ma, Prashant Mathur, Evgeny Matusov, Chandresh Maurya, John McCrae, Kenton Murray, Satoshi Nakamura, Matteo Negri, Jan Niehues, Xing Niu, Atul Kr. Ojha, et al. (20 additional authors not shown)

    Abstract: This paper reports on the shared tasks organized by the 21st IWSLT Conference. The shared tasks address 7 scientific challenges in spoken language translation: simultaneous and offline translation, automatic subtitling and dubbing, speech-to-speech translation, dialect and low-resource speech translation, and Indic languages. The shared tasks attracted 18 teams whose submissions are documented in…

    Submitted 7 November, 2024; originally announced November 2024.

    Comments: IWSLT 2024; 59 pages

  43. arXiv:2411.02993  [pdf]

    cs.DL

    Empowering Library Users: Creative Strategies for Engagement and Innovation

    Authors: Snehasish Paul, Shivali Chauhan, Atul Kumar Pal

    Abstract: This study investigated the integration of cutting-edge technologies and methodologies for creating dynamic, user-centered library environments. In creative strategies for engagement and innovation, library users must be empowered to undertake the new role of modernizing library services and enhancing user experiences. It also enhances the information management and user engagement. This can be at…

    Submitted 5 November, 2024; originally announced November 2024.

  44. A Unified Solution to Diverse Heterogeneities in One-shot Federated Learning

    Authors: Jun Bai, Yiliao Song, Di Wu, Atul Sajjanhar, Yong Xiang, Wei Zhou, Xiaohui Tao, Yan Li, Yue Li

    Abstract: One-Shot Federated Learning (OSFL) restricts communication between the server and clients to a single round, significantly reducing communication costs and minimizing privacy leakage risks compared to traditional Federated Learning (FL), which requires multiple rounds of communication. However, existing OSFL frameworks remain vulnerable to distributional heterogeneity, as they primarily focus on m…

    Submitted 1 June, 2025; v1 submitted 28 October, 2024; originally announced October 2024.

    Comments: Accepted version to KDD 2025

  45. arXiv:2410.08938  [pdf, other]

    q-bio.QM cs.LG

    KinDEL: DNA-Encoded Library Dataset for Kinase Inhibitors

    Authors: Benson Chen, Tomasz Danel, Patrick J. McEnaney, Nikhil Jain, Kirill Novikov, Spurti Umesh Akki, Joshua L. Turnbull, Virja Atul Pandya, Boris P. Belotserkovskii, Jared Bryce Weaver, Ankita Biswas, Dat Nguyen, Gabriel H. S. Dreiman, Mohammad Sultan, Nathaniel Stanley, Daniel M Whalen, Divya Kanichar, Christoph Klein, Emily Fox, R. Edward Watts

    Abstract: DNA-Encoded Libraries (DEL) are combinatorial small molecule libraries that offer an efficient way to characterize diverse chemical spaces. Selection experiments using DELs are pivotal to drug discovery efforts, enabling high-throughput screens for hit finding. However, limited availability of public DEL datasets hinders the advancement of computational techniques designed to process such data. To…

    Submitted 11 October, 2024; originally announced October 2024.

  46. arXiv:2409.18337  [pdf, other]

    eess.IV cs.CV physics.ins-det

    Photon Inhibition for Energy-Efficient Single-Photon Imaging

    Authors: Lucas J. Koerner, Shantanu Gupta, Atul Ingle, Mohit Gupta

    Abstract: Single-photon cameras (SPCs) are emerging as sensors of choice for various challenging imaging applications. One class of SPCs based on the single-photon avalanche diode (SPAD) detects individual photons using an avalanche process; the raw photon data can then be processed to extract scene information under extremely low light, high dynamic range, and rapid motion. Yet, single-photon sensitivity i…

    Submitted 26 September, 2024; originally announced September 2024.

    Comments: Accepted for ECCV 2024. Supplementary material and code available at https://wisionlab.com/project/inhibition

  47. arXiv:2409.17460  [pdf, other]

    cs.IR

    Towards More Relevant Product Search Ranking Via Large Language Models: An Empirical Study

    Authors: Qi Liu, Atul Singh, Jingbo Liu, Cun Mu, Zheng Yan

    Abstract: Training Learning-to-Rank models for e-commerce product search ranking can be challenging due to the lack of a gold standard of ranking relevance. In this paper, we decompose ranking relevance into content-based and engagement-based aspects, and we propose to leverage Large Language Models (LLMs) for both label and feature generation in model training, primarily aiming to improve the model's predi…

    Submitted 25 September, 2024; originally announced September 2024.

    Comments: To be published in CIKM 2024 GenAIECommerce Workshop

  48. arXiv:2409.17456  [pdf, other]

    cs.IR

    Long or Short or Both? An Exploration on Lookback Time Windows of Behavioral Features in Product Search Ranking

    Authors: Qi Liu, Atul Singh, Jingbo Liu, Cun Mu, Zheng Yan, Jan Pedersen

    Abstract: Customer shopping behavioral features are core to product search ranking models in eCommerce. In this paper, we investigate the effect of lookback time windows when aggregating these features at the (query, product) level over history. By studying the pros and cons of using long and short time windows, we propose a novel approach to integrating these historical behavioral features of different tim…

    Submitted 25 September, 2024; originally announced September 2024.

    Comments: Published in ACM SIGIR Workshop on eCommerce 2024

  49. arXiv:2409.17176  [pdf]

    cs.CR

    XDC Gasless Subnet: Gasless Subnet Staking dApp for XDC Network

    Authors: Mohuya Chakraborty, Atul Khekade

    Abstract: With a delegated proof-of-stake (XDPoS) consensus mechanism, the XDC Network is an enterprise-focused blockchain platform that combines the strength of public and private blockchains to provide quick transaction times, low energy consumption, and economical gas fees. XDC is designed for interoperability and supports decentralized apps (dApps) and integrates smoothly with financial systems. It is p…

    Submitted 21 September, 2024; originally announced September 2024.

    Comments: 24 pages, 5 figures, 3 tables, 16 references

    Report number: Sep 2024

  50. arXiv:2409.13083  [pdf, other]

    cs.CR cs.AI cs.DC

    FedAT: Federated Adversarial Training for Distributed Insider Threat Detection

    Authors: R G Gayathri, Atul Sajjanhar, Md Palash Uddin, Yong Xiang

    Abstract: Insider threats usually occur from within the workplace, where the attacker is an entity closely associated with the organization. The sequence of actions the entities take on the resources to which they have access rights allows us to identify the insiders. Insider Threat Detection (ITD) using Machine Learning (ML)-based approaches gained attention in the last few years. However, most techniques…

    Submitted 19 September, 2024; originally announced September 2024.

    Comments: 10 pages, 7 figures