Skip to main content

Showing 1–50 of 1,075 results for author: khan, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.20869  [pdf, ps, other

    cs.SE cs.AI cs.IR

    Engineering RAG Systems for Real-World Applications: Design, Development, and Evaluation

    Authors: Md Toufique Hasan, Muhammad Waseem, Kai-Kristian Kemell, Ayman Asad Khan, Mika Saari, Pekka Abrahamsson

    Abstract: Retrieval-Augmented Generation (RAG) systems are emerging as a key approach for grounding Large Language Models (LLMs) in external knowledge, addressing limitations in factual accuracy and contextual relevance. However, there is a lack of empirical studies that report on the development of RAG-based implementations grounded in real-world use cases, evaluated through general user involvement, and a… ▽ More

    Submitted 25 June, 2025; originally announced June 2025.

    Comments: Accepted as a full paper to the 51st Euromicro Conference on Software Engineering and Advanced Applications (SEAA 2025). 9 pages, 4 figures. This is the preprint version and not the final camera ready version

    ACM Class: D.2.11; I.2.6; H.3.3

  2. arXiv:2506.20685  [pdf, ps, other

    cs.LG cs.AI

    Progressive Size-Adaptive Federated Learning: A Comprehensive Framework for Heterogeneous Multi-Modal Data Systems

    Authors: Sajid Hussain, Muhammad Sohail, Nauman Ali Khan, Naima Iltaf, Ihtesham ul Islam

    Abstract: Federated Learning (FL) has emerged as a transformative paradigm for distributed machine learning while preserving data privacy. However, existing approaches predominantly focus on model heterogeneity and aggregation techniques, largely overlooking the fundamental impact of dataset size characteristics on federated training dynamics. This paper introduces Size-Based Adaptive Federated Learning (SA… ▽ More

    Submitted 24 June, 2025; originally announced June 2025.

  3. arXiv:2506.19870  [pdf

    cs.CR cs.AI cs.LG

    Secure Energy Transactions Using Blockchain Leveraging AI for Fraud Detection and Energy Market Stability

    Authors: Md Asif Ul Hoq Khan, MD Zahedul Islam, Istiaq Ahmed, Md Masud Karim Rabbi, Farhana Rahman Anonna, MD Abdul Fahim Zeeshan, Mehedi Hasan Ridoy, Bivash Ranjan Chowdhury, Md Nazmul Shakir Rabbi, GM Alamin Sadnan

    Abstract: Peer-to-peer trading and the move to decentralized grids have reshaped the energy markets in the United States. Notwithstanding, such developments lead to new challenges, mainly regarding the safety and authenticity of energy trade. This study aimed to develop and build a secure, intelligent, and efficient energy transaction system for the decentralized US energy market. This research interlinks t… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

  4. arXiv:2506.18777  [pdf, ps, other

    cs.AI cs.CL cs.LG

    Programming by Backprop: LLMs Acquire Reusable Algorithmic Abstractions During Code Training

    Authors: Jonathan Cook, Silvia Sapora, Arash Ahmadian, Akbir Khan, Tim Rocktaschel, Jakob Foerster, Laura Ruis

    Abstract: Training large language models (LLMs) on source code significantly enhances their general-purpose reasoning abilities, but the mechanisms underlying this generalisation are poorly understood. In this paper, we propose Programming by Backprop (PBB) as a potential driver of this effect - teaching a model to evaluate a program for inputs by training on its source code alone, without ever seeing I/O e… ▽ More

    Submitted 23 June, 2025; originally announced June 2025.

  5. arXiv:2506.17977  [pdf, ps, other

    cs.LG cs.DB

    SliceGX: Layer-wise GNN Explanation with Model-slicing

    Authors: Tingting Zhu, Tingyang Chen, Yinghui Wu, Arijit Khan, Xiangyu Ke

    Abstract: Ensuring the trustworthiness of graph neural networks (GNNs) as black-box models requires effective explanation methods. Existing GNN explanations typically apply input perturbations to identify subgraphs that are responsible for the occurrence of the final output of GNNs. However, such approaches lack finer-grained, layer-wise analysis of how intermediate representations contribute to the final r… ▽ More

    Submitted 22 June, 2025; originally announced June 2025.

  6. arXiv:2506.17297  [pdf, ps, other

    cs.LG cs.AI

    SafeRL-Lite: A Lightweight, Explainable, and Constrained Reinforcement Learning Library

    Authors: Satyam Mishra, Phung Thao Vi, Shivam Mishra, Vishwanath Bijalwan, Vijay Bhaskar Semwal, Abdul Manan Khan

    Abstract: We introduce SafeRL-Lite, an open-source Python library for building reinforcement learning (RL) agents that are both constrained and explainable. Existing RL toolkits often lack native mechanisms for enforcing hard safety constraints or producing human-interpretable rationales for decisions. SafeRL-Lite provides modular wrappers around standard Gym environments and deep Q-learning agents to enabl… ▽ More

    Submitted 17 June, 2025; originally announced June 2025.

    Comments: 10 pages, 7 figures, open-source library, PyPI installable: pip install saferl-lite

    MSC Class: 68T05 ACM Class: I.2.6; I.2.8

  7. arXiv:2506.15562  [pdf, ps, other

    eess.IV cs.CV

    Automated MRI Tumor Segmentation using hybrid U-Net with Transformer and Efficient Attention

    Authors: Syed Haider Ali, Asrar Ahmad, Muhammad Ali, Asifullah Khan, Muhammad Shahban, Nadeem Shaukat

    Abstract: Cancer is an abnormal growth with potential to invade locally and metastasize to distant organs. Accurate auto-segmentation of the tumor and surrounding normal tissues is required for radiotherapy treatment plan optimization. Recent AI-based segmentation models are generally trained on large public datasets, which lack the heterogeneity of local patient populations. While these studies advance AI-… ▽ More

    Submitted 18 June, 2025; originally announced June 2025.

    Comments: 16 pages, 5 figures

    ACM Class: I.4.6; I.2.6; I.4.9

  8. arXiv:2506.13704  [pdf, ps, other

    cs.RO

    HARMONI: Haptic-Guided Assistance for Unified Robotic Tele-Manipulation and Tele-Navigation

    Authors: V. Sripada, A. Khan, J. Föcker, S. Parsa, Susmitha P, H Maior, A. Ghalamzan-E

    Abstract: Shared control, which combines human expertise with autonomous assistance, is critical for effective teleoperation in complex environments. While recent advances in haptic-guided teleoperation have shown promise, they are often limited to simplified tasks involving 6- or 7-DoF manipulators and rely on separate control strategies for navigation and manipulation. This increases both cognitive load a… ▽ More

    Submitted 16 June, 2025; originally announced June 2025.

    Comments: To appear in IEEE CASE 2025

  9. arXiv:2506.13201  [pdf, ps, other

    cs.CV

    A Comprehensive Survey on Deep Learning Solutions for 3D Flood Mapping

    Authors: Wenfeng Jia, Bin Liang, Yuxi Liu, Muhammad Arif Khan, Lihong Zheng

    Abstract: Flooding remains a major global challenge, worsened by climate change and urbanization, demanding advanced solutions for effective disaster management. While traditional 2D flood mapping techniques provide limited insights, 3D flood mapping, powered by deep learning (DL), offers enhanced capabilities by integrating flood extent and depth. This paper presents a comprehensive survey of deep learning… ▽ More

    Submitted 16 June, 2025; originally announced June 2025.

  10. arXiv:2506.13116  [pdf

    cs.LG cs.CL

    Crime Hotspot Prediction Using Deep Graph Convolutional Networks

    Authors: Tehreem Zubair, Syeda Kisaa Fatima, Noman Ahmed, Asifullah Khan

    Abstract: Crime hotspot prediction is critical for ensuring urban safety and effective law enforcement, yet it remains challenging due to the complex spatial dependencies inherent in criminal activity. The previous approaches tended to use classical algorithms such as the KDE and SVM to model data distributions and decision boundaries. The methods often fail to capture these spatial relationships, treating… ▽ More

    Submitted 16 June, 2025; originally announced June 2025.

  11. arXiv:2506.12365  [pdf

    cs.CL cs.DB

    Advances in LLMs with Focus on Reasoning, Adaptability, Efficiency and Ethics

    Authors: Asifullah khan, Muhammad Zaeem Khan, Saleha Jamshed, Sadia Ahmad, Aleesha Zainab, Kaynat Khatib, Faria Bibi, Abdul Rehman

    Abstract: This survey paper outlines the key developments in the field of Large Language Models (LLMs), such as enhancing their reasoning skills, adaptability to various tasks, increased computational efficiency, and ability to make ethical decisions. The techniques that have been most effective in bridging the gap between human and machine communications include the Chain-of-Thought prompting, Instruction… ▽ More

    Submitted 14 June, 2025; originally announced June 2025.

  12. arXiv:2506.12103  [pdf, other

    cs.AI cs.CY cs.LG

    The Amazon Nova Family of Models: Technical Report and Model Card

    Authors: Amazon AGI, Aaron Langford, Aayush Shah, Abhanshu Gupta, Abhimanyu Bhatter, Abhinav Goyal, Abhinav Mathur, Abhinav Mohanty, Abhishek Kumar, Abhishek Sethi, Abi Komma, Abner Pena, Achin Jain, Adam Kunysz, Adam Opyrchal, Adarsh Singh, Aditya Rawal, Adok Achar Budihal Prasad, Adrià de Gispert, Agnika Kumar, Aishwarya Aryamane, Ajay Nair, Akilan M, Akshaya Iyengar, Akshaya Vishnu Kudlu Shanbhogue , et al. (761 additional authors not shown)

    Abstract: We present Amazon Nova, a new generation of state-of-the-art foundation models that deliver frontier intelligence and industry-leading price performance. Amazon Nova Pro is a highly-capable multimodal model with the best combination of accuracy, speed, and cost for a wide range of tasks. Amazon Nova Lite is a low-cost multimodal model that is lightning fast for processing images, video, documents… ▽ More

    Submitted 17 March, 2025; originally announced June 2025.

    Comments: 48 pages, 10 figures

    Report number: 20250317

  13. arXiv:2506.11475  [pdf

    cs.MA cs.CL cs.CV

    AutoGen Driven Multi Agent Framework for Iterative Crime Data Analysis and Prediction

    Authors: Syeda Kisaa Fatima, Tehreem Zubair, Noman Ahmed, Asifullah Khan

    Abstract: This paper introduces LUCID-MA (Learning and Understanding Crime through Dialogue of Multiple Agents), an innovative AI powered framework where multiple AI agents collaboratively analyze and understand crime data. Our system that consists of three core components: an analysis assistant that highlights spatiotemporal crime patterns, a feedback component that reviews and refines analytical results a… ▽ More

    Submitted 13 June, 2025; originally announced June 2025.

  14. arXiv:2506.10125  [pdf, ps, other

    cs.CR cs.SE

    D-LiFT: Improving LLM-based Decompiler Backend via Code Quality-driven Fine-tuning

    Authors: Muqi Zou, Hongyu Cai, Hongwei Wu, Zion Leonahenahe Basque, Arslan Khan, Berkay Celik, Dave, Tian, Antonio Bianchi, Ruoyu, Wang, Dongyan Xu

    Abstract: Decompilers, which reconstruct human-readable source code from binary executables, are vital to many security tasks. Yet, despite recent advances, their output often suffers from syntactic and semantic errors and remains difficult to read. Recently, with the advent of large language models (LLMs), researchers began to explore the potential of LLMs to refine decompiler output. Nevertheless, our stu… ▽ More

    Submitted 11 June, 2025; originally announced June 2025.

  15. We Are AI: Taking Control of Technology

    Authors: Julia Stoyanovich, Armanda Lewis, Eric Corbett, Lucius E. J. Bynum, Lucas Rosenblatt, Falaah Arif Khan

    Abstract: Responsible AI (RAI) is the science and practice of ensuring the design, development, use, and oversight of AI are socially sustainable--benefiting diverse stakeholders while controlling the risks. Achieving this goal requires active engagement and participation from the broader public. This paper introduces "We are AI: Taking Control of Technology," a public education course that brings the topic… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

    Journal ref: Proceedings of the AAAI Conference on Artificial Intelligence, 2025

  16. arXiv:2506.05411  [pdf, ps, other

    cs.CR cs.CV

    QA-HFL: Quality-Aware Hierarchical Federated Learning for Resource-Constrained Mobile Devices with Heterogeneous Image Quality

    Authors: Sajid Hussain, Muhammad Sohail, Nauman Ali Khan

    Abstract: This paper introduces QA-HFL, a quality-aware hierarchical federated learning framework that efficiently handles heterogeneous image quality across resource-constrained mobile devices. Our approach trains specialized local models for different image quality levels and aggregates their features using a quality-weighted fusion mechanism, while incorporating differential privacy protection. Experimen… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

  17. arXiv:2506.04987  [pdf, ps, other

    cs.SE cs.AI

    A Multi-Dataset Evaluation of Models for Automated Vulnerability Repair

    Authors: Zanis Ali Khan, Aayush Garg, Qiang Tang

    Abstract: Software vulnerabilities pose significant security threats, requiring effective mitigation. While Automated Program Repair (APR) has advanced in fixing general bugs, vulnerability patching, a security-critical aspect of APR remains underexplored. This study investigates pre-trained language models, CodeBERT and CodeT5, for automated vulnerability patching across six datasets and four languages. We… ▽ More

    Submitted 5 June, 2025; originally announced June 2025.

    Comments: Preprint has been accepted in ARES AI&CCPS (International Workshop on Artificial Intelligence, Cyber and Cyber-Physical Security)

  18. arXiv:2506.02509  [pdf, ps, other

    cs.DB

    In-context Clustering-based Entity Resolution with Large Language Models: A Design Space Exploration

    Authors: Jiajie Fu, Haitong Tang, Arijit Khan, Sharad Mehrotra, Xiangyu Ke, Yunjun Gao

    Abstract: Entity Resolution (ER) is a fundamental data quality improvement task that identifies and links records referring to the same real-world entity. Traditional ER approaches often rely on pairwise comparisons, which can be costly in terms of time and monetary resources, especially with large datasets. Recently, Large Language Models (LLMs) have shown promising results in ER tasks. However, existing m… ▽ More

    Submitted 3 June, 2025; originally announced June 2025.

    Comments: Accept by SIGMOD26

  19. arXiv:2506.01587  [pdf, other

    cs.CL

    Unified Large Language Models for Misinformation Detection in Low-Resource Linguistic Settings

    Authors: Muhammad Islam, Javed Ali Khan, Mohammed Abaker, Ali Daud, Azeem Irshad

    Abstract: The rapid expansion of social media platforms has significantly increased the dissemination of forged content and misinformation, making the detection of fake news a critical area of research. Although fact-checking efforts predominantly focus on English-language news, there is a noticeable gap in resources and strategies to detect news in regional languages, such as Urdu. Advanced Fake News Detec… ▽ More

    Submitted 2 June, 2025; originally announced June 2025.

  20. arXiv:2506.01571  [pdf, ps, other

    cs.DS

    A Ranking Framework for Network Resource Allocation and Scheduling via Hypergraphs

    Authors: Rajpreet Singh, Novak Boškov, Aditya Gudal, Manzoor A. Khan

    Abstract: Resource allocation and scheduling are a common problem in various distributed systems. Although widely studied, the state-of-the-art solutions either do not scale or lack the expressive power to capture the most complex instances of the problem. To that end, we present a mathematical framework for hypergraph ranking and analysis, unifying graph theory, lattice theory, and semantic analysis. In ou… ▽ More

    Submitted 2 June, 2025; originally announced June 2025.

    Comments: 12 pages, 9 figures

  21. arXiv:2505.23996  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Is Your Model Fairly Certain? Uncertainty-Aware Fairness Evaluation for LLMs

    Authors: Yinong Oliver Wang, Nivedha Sivakumar, Falaah Arif Khan, Rin Metcalf Susa, Adam Golinski, Natalie Mackraz, Barry-John Theobald, Luca Zappella, Nicholas Apostoloff

    Abstract: The recent rapid adoption of large language models (LLMs) highlights the critical need for benchmarking their fairness. Conventional fairness metrics, which focus on discrete accuracy-based evaluations (i.e., prediction correctness), fail to capture the implicit impact of model uncertainty (e.g., higher model confidence about one group over another despite similar accuracy). To address this limita… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

    Comments: 9 pages, 8 figures, and 1 table in main paper. Supplementary appendix attached. Accepted at ICML 2025

  22. arXiv:2505.23984  [pdf, ps, other

    eess.IV cs.HC

    Improved Accuracy in Pelvic Tumor Resections Using a Real-Time Vision-Guided Surgical System

    Authors: Vahid Danesh, Paul Arauz, Maede Boroji, Andrew Zhu, Mia Cottone, Elaine Gould, Fazel A. Khan, Imin Kao

    Abstract: Pelvic bone tumor resections remain significantly challenging due to complex three-dimensional anatomy and limited surgical visualization. Current navigation systems and patient-specific instruments, while accurate, present limitations including high costs, radiation exposure, workflow disruption, long production time, and lack of reusability. This study evaluates a real-time vision-guided surgica… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

    Comments: 9 Pages, 5 figures, Submitted to Journal of Orthopaedic Research

  23. arXiv:2505.23801  [pdf, ps, other

    cs.CL cs.AI cs.LG

    SEMFED: Semantic-Aware Resource-Efficient Federated Learning for Heterogeneous NLP Tasks

    Authors: Sajid Hussain, Muhammad Sohail, Nauman Ali Khan

    Abstract: Background: Federated Learning (FL) has emerged as a promising paradigm for training machine learning models while preserving data privacy. However, applying FL to Natural Language Processing (NLP) tasks presents unique challenges due to semantic heterogeneity across clients, vocabulary mismatches, and varying resource constraints on edge devices. Objectives: This paper introduces SEMFED, a novel… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: 13 pages

  24. arXiv:2505.22232  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Judging Quality Across Languages: A Multilingual Approach to Pretraining Data Filtering with Language Models

    Authors: Mehdi Ali, Manuel Brack, Max Lübbering, Elias Wendt, Abbas Goher Khan, Richard Rutmann, Alex Jude, Maurice Kraus, Alexander Arno Weber, David Kaczér, Florian Mai, Lucie Flek, Rafet Sifa, Nicolas Flores-Herr, Joachim Köhler, Patrick Schramowski, Michael Fromm, Kristian Kersting

    Abstract: High-quality multilingual training data is essential for effectively pretraining large language models (LLMs). Yet, the availability of suitable open-source multilingual datasets remains limited. Existing state-of-the-art datasets mostly rely on heuristic filtering methods, restricting both their cross-lingual transferability and scalability. Here, we introduce JQL, a systematic approach that effi… ▽ More

    Submitted 31 May, 2025; v1 submitted 28 May, 2025; originally announced May 2025.

    Comments: Project page available at https://huggingface.co/spaces/Jackal-AI/JQL

  25. arXiv:2505.21657   

    cs.CL cs.AI cs.LG

    Explainability of Large Language Models using SMILE: Statistical Model-agnostic Interpretability with Local Explanations

    Authors: Zeinab Dehghani, Mohammed Naveed Akram, Koorosh Aslansefat, Adil Khan

    Abstract: Large language models like GPT, LLAMA, and Claude have become incredibly powerful at generating text, but they are still black boxes, so it is hard to understand how they decide what to say. That lack of transparency can be problematic, especially in fields where trust and accountability matter. To help with this, we introduce SMILE, a new method that explains how these models respond to different… ▽ More

    Submitted 26 June, 2025; v1 submitted 27 May, 2025; originally announced May 2025.

    Comments: The submission contains incorrect references that require substantial revision

  26. arXiv:2505.20251  [pdf, other

    cs.LG cs.CL

    Learning Extrapolative Sequence Transformations from Markov Chains

    Authors: Sophia Hager, Aleem Khan, Andrew Wang, Nicholas Andrews

    Abstract: Most successful applications of deep learning involve similar training and test conditions. However, tasks such as biological sequence design involve searching for sequences that improve desirable properties beyond previously known values, which requires novel hypotheses that \emph{extrapolate} beyond training data. In these settings, extrapolation may be achieved by using random search methods su… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: To be published at the Forty-Second International Conference on Machine Learning

  27. arXiv:2505.20099  [pdf, ps, other

    cs.CL cs.AI cs.IR

    Large Language Models Meet Knowledge Graphs for Question Answering: Synthesis and Opportunities

    Authors: Chuangtao Ma, Yongrui Chen, Tianxing Wu, Arijit Khan, Haofen Wang

    Abstract: Large language models (LLMs) have demonstrated remarkable performance on question-answering (QA) tasks because of their superior capabilities in natural language understanding and generation. However, LLM-based QA struggles with complex QA tasks due to poor reasoning capacity, outdated knowledge, and hallucinations. Several recent works synthesize LLMs and knowledge graphs (KGs) for QA to address… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: Under Review

  28. arXiv:2505.19249  [pdf, ps, other

    astro-ph.GA cs.CV

    RGC-Bent: A Novel Dataset for Bent Radio Galaxy Classification

    Authors: Mir Sazzat Hossain, Khan Muhammad Bin Asad, Payaswini Saikia, Adrita Khan, Md Akil Raihan Iftee, Rakibul Hasan Rajib, Arshad Momen, Md Ashraful Amin, Amin Ahsan Ali, AKM Mahbubur Rahman

    Abstract: We introduce a novel machine learning dataset tailored for the classification of bent radio active galactic nuclei (AGN) in astronomical observations. Bent radio AGN, distinguished by their curved jet structures, provide critical insights into galaxy cluster dynamics, interactions within the intracluster medium, and the broader physics of AGN. Despite their astrophysical significance, the classifi… ▽ More

    Submitted 25 May, 2025; originally announced May 2025.

    Comments: 6 pages, 3 figures, 2 tables, Accepted In ICIP 2025

  29. arXiv:2505.18450  [pdf, other

    cs.CL

    BRIT: Bidirectional Retrieval over Unified Image-Text Graph

    Authors: Ainulla Khan, Yamada Moyuru, Srinidhi Akella

    Abstract: Retrieval-Augmented Generation (RAG) has emerged as a promising technique to enhance the quality and relevance of responses generated by large language models. While recent advancements have mainly focused on improving RAG for text-based queries, RAG on multi-modal documents containing both texts and images has not been fully explored. Especially when fine-tuning does not work. This paper proposes… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

  30. arXiv:2505.17421  [pdf, ps, other

    cs.IT eess.SP

    Adaptive Implicit-Based Deep Learning Channel Estimation for 6G Communications

    Authors: Zhen Qiao, Jiang Xue, Junkai Zhang, Guanzhang Liu, Xiaoqin Ma, Runhua Li, Faheem A. Khan, John S. Thompson, Zongben Xu

    Abstract: With the widespread deployment of fifth-generation (5G) wireless networks, research on sixth-generation (6G) technology is gaining momentum. Artificial Intelligence (AI) is anticipated to play a significant role in 6G, particularly through integration with the physical layer for tasks such as channel estimation. Considering resource limitations in real systems, the AI algorithm should be designed… ▽ More

    Submitted 22 May, 2025; originally announced May 2025.

  31. arXiv:2505.16477  [pdf

    cs.AI

    Advancing the Scientific Method with Large Language Models: From Hypothesis to Discovery

    Authors: Yanbo Zhang, Sumeer A. Khan, Adnan Mahmud, Huck Yang, Alexander Lavin, Michael Levin, Jeremy Frey, Jared Dunnmon, James Evans, Alan Bundy, Saso Dzeroski, Jesper Tegner, Hector Zenil

    Abstract: With recent Nobel Prizes recognising AI contributions to science, Large Language Models (LLMs) are transforming scientific research by enhancing productivity and reshaping the scientific method. LLMs are now involved in experimental design, data analysis, and workflows, particularly in chemistry and biology. However, challenges such as hallucinations and reliability persist. In this contribution,… ▽ More

    Submitted 22 May, 2025; originally announced May 2025.

    Comments: 45 pages

    Journal ref: npj Artificial Intelligence, 2025

  32. arXiv:2505.15063  [pdf, ps, other

    cs.CL

    UrduFactCheck: An Agentic Fact-Checking Framework for Urdu with Evidence Boosting and Benchmarking

    Authors: Sarfraz Ahmad, Hasan Iqbal, Momina Ahsan, Numaan Naeem, Muhammad Ahsan Riaz Khan, Arham Riaz, Muhammad Arslan Manzoor, Yuxia Wang, Preslav Nakov

    Abstract: The rapid use of large language models (LLMs) has raised critical concerns regarding the factual reliability of their outputs, especially in low-resource languages such as Urdu. Existing automated fact-checking solutions overwhelmingly focus on English, leaving a significant gap for the 200+ million Urdu speakers worldwide. In this work, we introduce UrduFactCheck, the first comprehensive, modular… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

    Comments: 16 pages, 10 figures, 4 tables, Submitted to ARR May 2025

    ACM Class: I.2.7

  33. arXiv:2505.13966  [pdf, other

    eess.SP cs.IT

    Waveform for Next Generation Communication Systems: Comparing Zak-OTFS with OFDM

    Authors: Imran Ali Khan, Saif Khan Mohammed, Ronny Hadani, Ananthanarayanan Chockalingam, Robert Calderbank, Anton Monk, Shachar Kons, Shlomo Rakib, Yoav Hebron

    Abstract: Across the world, there is growing interest in new waveforms, Zak-OTFS in particular, and over-the-air implementations are starting to appear. The choice between OFDM and Zak-OTFS is not so much a choice between waveforms as it is an architectural choice between preventing inter-carrier interference (ICI) and embracing ICI. In OFDM, once the Input-Output (I/O) relation is known, equalization is re… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

    Comments: This work has been submitted to the IEEE for possible publication

  34. Finding Counterfactual Evidences for Node Classification

    Authors: Dazhuo Qiu, Jinwen Chen, Arijit Khan, Yan Zhao, Francesco Bonchi

    Abstract: Counterfactual learning is emerging as an important paradigm, rooted in causality, which promises to alleviate common issues of graph neural networks (GNNs), such as fairness and interpretability. However, as in many real-world application domains where conducting randomized controlled trials is impractical, one has to rely on available observational (factual) data to detect counterfactuals. In th… ▽ More

    Submitted 2 June, 2025; v1 submitted 16 May, 2025; originally announced May 2025.

    Comments: Accepted by KDD 2025

  35. arXiv:2505.10879  [pdf, ps, other

    cs.SD cs.LG eess.AS

    Multi-Stage Speaker Diarization for Noisy Classrooms

    Authors: Ali Sartaz Khan, Tolulope Ogunremi, Ahmed Adel Attia, Dorottya Demszky

    Abstract: Speaker diarization, the process of identifying "who spoke when" in audio recordings, is essential for understanding classroom dynamics. However, classroom settings present distinct challenges, including poor recording quality, high levels of background noise, overlapping speech, and the difficulty of accurately capturing children's voices. This study investigates the effectiveness of multi-stage… ▽ More

    Submitted 27 May, 2025; v1 submitted 16 May, 2025; originally announced May 2025.

  36. arXiv:2505.10055  [pdf, ps, other

    cs.CV cs.AI

    PsOCR: Benchmarking Large Multimodal Models for Optical Character Recognition in Low-resource Pashto Language

    Authors: Ijazul Haq, Yingjie Zhang, Irfan Ali Khan

    Abstract: This paper evaluates the performance of Large Multimodal Models (LMMs) on Optical Character Recognition (OCR) in the low-resource Pashto language. Natural Language Processing (NLP) in Pashto faces several challenges due to the cursive nature of its script and a scarcity of structured datasets. To address this, we developed a synthetic Pashto OCR dataset, PsOCR, consisting of one million images ann… ▽ More

    Submitted 15 May, 2025; originally announced May 2025.

  37. arXiv:2505.09894  [pdf, ps, other

    cs.SE

    Advancing Mobile UI Testing by Learning Screen Usage Semantics

    Authors: Safwat Ali Khan

    Abstract: The demand for quality in mobile applications has increased greatly given users' high reliance on them for daily tasks. Developers work tirelessly to ensure that their applications are both functional and user-friendly. In pursuit of this, Automated Input Generation (AIG) tools have emerged as a promising solution for testing mobile applications by simulating user interactions and exploring app fu… ▽ More

    Submitted 14 May, 2025; originally announced May 2025.

  38. arXiv:2505.07635  [pdf, ps, other

    cs.LG cs.DB

    Generating Skyline Explanations for Graph Neural Networks

    Authors: Dazhuo Qiu, Haolai Che, Arijit Khan, Yinghui Wu

    Abstract: This paper proposes a novel approach to generate subgraph explanations for graph neural networks GNNs that simultaneously optimize multiple measures for explainability. Existing GNN explanation methods often compute subgraphs (called ``explanatory subgraphs'') that optimize a pre-defined, single explainability measure, such as fidelity or conciseness. This can lead to biased explanations that cann… ▽ More

    Submitted 12 May, 2025; originally announced May 2025.

  39. arXiv:2505.07634  [pdf, ps, other

    cs.RO cs.AI cs.CV

    Neural Brain: A Neuroscience-inspired Framework for Embodied Agents

    Authors: Jian Liu, Xiongtao Shi, Thai Duy Nguyen, Haitian Zhang, Tianxiang Zhang, Wei Sun, Yanjie Li, Athanasios V. Vasilakos, Giovanni Iacca, Arshad Ali Khan, Arvind Kumar, Jae Won Cho, Ajmal Mian, Lihua Xie, Erik Cambria, Lin Wang

    Abstract: The rapid evolution of artificial intelligence (AI) has shifted from static, data-driven models to dynamic systems capable of perceiving and interacting with real-world environments. Despite advancements in pattern recognition and symbolic reasoning, current AI systems, such as large language models, remain disembodied, unable to physically engage with the world. This limitation has driven the ris… ▽ More

    Submitted 14 May, 2025; v1 submitted 12 May, 2025; originally announced May 2025.

    Comments: 51 pages, 17 figures, 9 tables

  40. arXiv:2505.06229  [pdf, ps, other

    cs.LG math.NA

    Neural Network Operator-Based Fractal Approximation: Smoothness Preservation and Convergence Analysis

    Authors: Aaqib Ayoub Bhat, Asif Khan, M. Mursaleen

    Abstract: This paper presents a new approach of constructing $α$-fractal interpolation functions (FIFs) using neural network operators, integrating concepts from approximation theory. Initially, we construct $α$-fractals utilizing neural network-based operators, providing an approach to generating fractal functions with interpolation properties. Based on the same foundation, we have developed fractal interp… ▽ More

    Submitted 22 March, 2025; originally announced May 2025.

    Comments: 18 pages

    MSC Class: 28A80; 41A05; 41A25; 41A29; 41A30; 65D05

  41. arXiv:2505.04318  [pdf, other

    cs.LG cs.AI eess.IV

    Detecting Concept Drift in Neural Networks Using Chi-squared Goodness of Fit Testing

    Authors: Jacob Glenn Ayers, Buvaneswari A. Ramanan, Manzoor A. Khan

    Abstract: As the adoption of deep learning models has grown beyond human capacity for verification, meta-algorithms are needed to ensure reliable model inference. Concept drift detection is a field dedicated to identifying statistical shifts that is underutilized in monitoring neural networks that may encounter inference data with distributional characteristics diverging from their training data. Given the… ▽ More

    Submitted 7 May, 2025; originally announced May 2025.

    Comments: 8 pages, 6 figures, 1 table

  42. arXiv:2505.03931  [pdf, other

    cs.RO

    NMPC-Lander: Nonlinear MPC with Barrier Function for UAV Landing on a Mobile Platform

    Authors: Amber Batool, Faryal Batool, Roohan Ahmed Khan, Muhammad Ahsan Mustafa, Aleksey Fedoseev, Dzmitry Tsetserukou

    Abstract: Quadcopters are versatile aerial robots gaining popularity in numerous critical applications. However, their operational effectiveness is constrained by limited battery life and restricted flight range. To address these challenges, autonomous drone landing on stationary or mobile charging and battery-swapping stations has become an essential capability. In this study, we present NMPC-Lander, a nov… ▽ More

    Submitted 6 May, 2025; originally announced May 2025.

    Comments: This manuscript has been submitted to the IEEE International Conference on Systems, Man, and Cybernetics (SMC), 2025

  43. arXiv:2505.03787  [pdf, other

    cs.LG cs.AI eess.SP

    ArrhythmiaVision: Resource-Conscious Deep Learning Models with Visual Explanations for ECG Arrhythmia Classification

    Authors: Zuraiz Baig, Sidra Nasir, Rizwan Ahmed Khan, Muhammad Zeeshan Ul Haque

    Abstract: Cardiac arrhythmias are a leading cause of life-threatening cardiac events, highlighting the urgent need for accurate and timely detection. Electrocardiography (ECG) remains the clinical gold standard for arrhythmia diagnosis; however, manual interpretation is time-consuming, dependent on clinical expertise, and prone to human error. Although deep learning has advanced automated ECG analysis, many… ▽ More

    Submitted 30 April, 2025; originally announced May 2025.

    Comments: 14 pages and 08 figures

  44. arXiv:2505.03406  [pdf, other

    cs.CL cs.AI

    Lightweight Clinical Decision Support System using QLoRA-Fine-Tuned LLMs and Retrieval-Augmented Generation

    Authors: Mohammad Shoaib Ansari, Mohd Sohail Ali Khan, Shubham Revankar, Aditya Varma, Anil S. Mokhade

    Abstract: This research paper investigates the application of Large Language Models (LLMs) in healthcare, specifically focusing on enhancing medical decision support through Retrieval-Augmented Generation (RAG) integrated with hospital-specific data and fine-tuning using Quantized Low-Rank Adaptation (QLoRA). The system utilizes Llama 3.2-3B-Instruct as its foundation model. By embedding and retrieving cont… ▽ More

    Submitted 6 May, 2025; originally announced May 2025.

    Comments: 12 pages

  45. arXiv:2505.01435  [pdf, other

    cs.IR cs.CL cs.DC cs.LG

    AdaParse: An Adaptive Parallel PDF Parsing and Resource Scaling Engine

    Authors: Carlo Siebenschuh, Kyle Hippe, Ozan Gokdemir, Alexander Brace, Arham Khan, Khalid Hossain, Yadu Babuji, Nicholas Chia, Venkatram Vishwanath, Rick Stevens, Arvind Ramanathan, Ian Foster, Robert Underwood

    Abstract: Language models for scientific tasks are trained on text from scientific publications, most distributed as PDFs that require parsing. PDF parsing approaches range from inexpensive heuristics (for simple documents) to computationally intensive ML-driven systems (for complex or degraded ones). The choice of the "best" parser for a particular document depends on its computational cost and the accurac… ▽ More

    Submitted 23 April, 2025; originally announced May 2025.

    Comments: This paper has been accepted at the The Eighth Annual Conference on Machine Learning and Systems (MLSys 2025)

  46. arXiv:2504.21831  [pdf, other

    cs.CV cs.AI

    Early Exit and Multi Stage Knowledge Distillation in VLMs for Video Summarization

    Authors: Anas Anwarul Haq Khan, Utkarsh Verma, Prateek Chanda, Ganesh Ramakrishnan

    Abstract: We introduce DEEVISum (Distilled Early Exit Vision language model for Summarization), a lightweight, efficient, and scalable vision language model designed for segment wise video summarization. Leveraging multi modal prompts that combine textual and audio derived signals, DEEVISum incorporates Multi Stage Knowledge Distillation (MSKD) and Early Exit (EE) to strike a balance between performance and… ▽ More

    Submitted 30 April, 2025; originally announced April 2025.

  47. arXiv:2504.19461  [pdf

    cs.SE

    The Role of Generative AI in Strengthening Secure Software Coding Practices: A Systematic Perspective

    Authors: Hathal S. Alwageed, Rafiq Ahmad Khan

    Abstract: As software security threats continue to evolve, the demand for innovative ways of securing coding has tremendously grown. The integration of Generative AI (GenAI) into software development holds significant potential for improving secure coding practices. This paper aims at systematically studying the impact of GenAI in enhancing secure coding practices from improving software security, setting f… ▽ More

    Submitted 28 April, 2025; originally announced April 2025.

    Comments: 1-6 pages

  48. arXiv:2504.19271  [pdf, other

    cs.CV

    Leveraging Multi-Modal Saliency and Fusion for Gaze Target Detection

    Authors: Athul M. Mathew, Arshad Ali Khan, Thariq Khalid, Faroq AL-Tam, Riad Souissi

    Abstract: Gaze target detection (GTD) is the task of predicting where a person in an image is looking. This is a challenging task, as it requires the ability to understand the relationship between the person's head, body, and eyes, as well as the surrounding environment. In this paper, we propose a novel method for GTD that fuses multiple pieces of information extracted from an image. First, we project the… ▽ More

    Submitted 27 April, 2025; originally announced April 2025.

    Comments: accepted at NeurIPS 2023 Gaze Meets ML Workshop

  49. arXiv:2504.18856  [pdf, other

    cs.CV

    Multi-Resolution Pathology-Language Pre-training Model with Text-Guided Visual Representation

    Authors: Shahad Albastaki, Anabia Sohail, Iyyakutti Iyappan Ganapathi, Basit Alawode, Asim Khan, Sajid Javed, Naoufel Werghi, Mohammed Bennamoun, Arif Mahmood

    Abstract: In Computational Pathology (CPath), the introduction of Vision-Language Models (VLMs) has opened new avenues for research, focusing primarily on aligning image-text pairs at a single magnification level. However, this approach might not be sufficient for tasks like cancer subtype classification, tissue phenotyping, and survival analysis due to the limited level of detail that a single-resolution i… ▽ More

    Submitted 26 April, 2025; originally announced April 2025.

  50. arXiv:2504.15995  [pdf, other

    cs.LG cs.AI

    OPUS-VFL: Incentivizing Optimal Privacy-Utility Tradeoffs in Vertical Federated Learning

    Authors: Sindhuja Madabushi, Ahmad Faraz Khan, Haider Ali, Jin-Hee Cho

    Abstract: Vertical Federated Learning (VFL) enables organizations with disjoint feature spaces but shared user bases to collaboratively train models without sharing raw data. However, existing VFL systems face critical limitations: they often lack effective incentive mechanisms, struggle to balance privacy-utility tradeoffs, and fail to accommodate clients with heterogeneous resource capabilities. These cha… ▽ More

    Submitted 22 April, 2025; originally announced April 2025.