Showing 51–100 of 1,913 results for author: Khan, A

  1. arXiv:2505.20099  [pdf, ps, other]

    cs.CL cs.AI cs.IR

    Large Language Models Meet Knowledge Graphs for Question Answering: Synthesis and Opportunities

    Authors: Chuangtao Ma, Yongrui Chen, Tianxing Wu, Arijit Khan, Haofen Wang

    Abstract: Large language models (LLMs) have demonstrated remarkable performance on question-answering (QA) tasks because of their superior capabilities in natural language understanding and generation. However, LLM-based QA struggles with complex QA tasks due to poor reasoning capacity, outdated knowledge, and hallucinations. Several recent works synthesize LLMs and knowledge graphs (KGs) for QA to address…

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: Under Review

  2. arXiv:2505.19249  [pdf, ps, other]

    astro-ph.GA cs.CV

    RGC-Bent: A Novel Dataset for Bent Radio Galaxy Classification

    Authors: Mir Sazzat Hossain, Khan Muhammad Bin Asad, Payaswini Saikia, Adrita Khan, Md Akil Raihan Iftee, Rakibul Hasan Rajib, Arshad Momen, Md Ashraful Amin, Amin Ahsan Ali, AKM Mahbubur Rahman

    Abstract: We introduce a novel machine learning dataset tailored for the classification of bent radio active galactic nuclei (AGN) in astronomical observations. Bent radio AGN, distinguished by their curved jet structures, provide critical insights into galaxy cluster dynamics, interactions within the intracluster medium, and the broader physics of AGN. Despite their astrophysical significance, the classifi…

    Submitted 25 May, 2025; originally announced May 2025.

    Comments: 6 pages, 3 figures, 2 tables, Accepted In ICIP 2025

  3. arXiv:2505.18450  [pdf, other]

    cs.CL

    BRIT: Bidirectional Retrieval over Unified Image-Text Graph

    Authors: Ainulla Khan, Yamada Moyuru, Srinidhi Akella

    Abstract: Retrieval-Augmented Generation (RAG) has emerged as a promising technique to enhance the quality and relevance of responses generated by large language models. While recent advancements have mainly focused on improving RAG for text-based queries, RAG on multi-modal documents containing both texts and images has not been fully explored, especially when fine-tuning does not work. This paper proposes…

    Submitted 23 May, 2025; originally announced May 2025.

  4. arXiv:2505.17421  [pdf, ps, other]

    cs.IT eess.SP

    Adaptive Implicit-Based Deep Learning Channel Estimation for 6G Communications

    Authors: Zhen Qiao, Jiang Xue, Junkai Zhang, Guanzhang Liu, Xiaoqin Ma, Runhua Li, Faheem A. Khan, John S. Thompson, Zongben Xu

    Abstract: With the widespread deployment of fifth-generation (5G) wireless networks, research on sixth-generation (6G) technology is gaining momentum. Artificial Intelligence (AI) is anticipated to play a significant role in 6G, particularly through integration with the physical layer for tasks such as channel estimation. Considering resource limitations in real systems, the AI algorithm should be designed…

    Submitted 22 May, 2025; originally announced May 2025.

  5. arXiv:2505.16477  [pdf]

    cs.AI

    Advancing the Scientific Method with Large Language Models: From Hypothesis to Discovery

    Authors: Yanbo Zhang, Sumeer A. Khan, Adnan Mahmud, Huck Yang, Alexander Lavin, Michael Levin, Jeremy Frey, Jared Dunnmon, James Evans, Alan Bundy, Saso Dzeroski, Jesper Tegner, Hector Zenil

    Abstract: With recent Nobel Prizes recognising AI contributions to science, Large Language Models (LLMs) are transforming scientific research by enhancing productivity and reshaping the scientific method. LLMs are now involved in experimental design, data analysis, and workflows, particularly in chemistry and biology. However, challenges such as hallucinations and reliability persist. In this contribution,…

    Submitted 22 May, 2025; originally announced May 2025.

    Comments: 45 pages

    Journal ref: npj Artificial Intelligence, 2025

  6. arXiv:2505.15063  [pdf, ps, other]

    cs.CL

    UrduFactCheck: An Agentic Fact-Checking Framework for Urdu with Evidence Boosting and Benchmarking

    Authors: Sarfraz Ahmad, Hasan Iqbal, Momina Ahsan, Numaan Naeem, Muhammad Ahsan Riaz Khan, Arham Riaz, Muhammad Arslan Manzoor, Yuxia Wang, Preslav Nakov

    Abstract: The rapid adoption of large language models (LLMs) has raised critical concerns regarding the factual reliability of their outputs, especially in low-resource languages such as Urdu. Existing automated fact-checking solutions overwhelmingly focus on English, leaving a significant gap for the 200+ million Urdu speakers worldwide. In this work, we introduce UrduFactCheck, the first comprehensive, modular…

    Submitted 20 May, 2025; originally announced May 2025.

    Comments: 16 pages, 10 figures, 4 tables, Submitted to ARR May 2025

    ACM Class: I.2.7

  7. arXiv:2505.13966  [pdf, other]

    eess.SP cs.IT

    Waveform for Next Generation Communication Systems: Comparing Zak-OTFS with OFDM

    Authors: Imran Ali Khan, Saif Khan Mohammed, Ronny Hadani, Ananthanarayanan Chockalingam, Robert Calderbank, Anton Monk, Shachar Kons, Shlomo Rakib, Yoav Hebron

    Abstract: Across the world, there is growing interest in new waveforms, Zak-OTFS in particular, and over-the-air implementations are starting to appear. The choice between OFDM and Zak-OTFS is not so much a choice between waveforms as it is an architectural choice between preventing inter-carrier interference (ICI) and embracing ICI. In OFDM, once the Input-Output (I/O) relation is known, equalization is re…

    Submitted 20 May, 2025; originally announced May 2025.

    Comments: This work has been submitted to the IEEE for possible publication

  8. Finding Counterfactual Evidences for Node Classification

    Authors: Dazhuo Qiu, Jinwen Chen, Arijit Khan, Yan Zhao, Francesco Bonchi

    Abstract: Counterfactual learning is emerging as an important paradigm, rooted in causality, which promises to alleviate common issues of graph neural networks (GNNs), such as fairness and interpretability. However, as in many real-world application domains where conducting randomized controlled trials is impractical, one has to rely on available observational (factual) data to detect counterfactuals. In th…

    Submitted 2 June, 2025; v1 submitted 16 May, 2025; originally announced May 2025.

    Comments: Accepted by KDD 2025

  9. arXiv:2505.10879  [pdf, ps, other]

    cs.SD cs.LG eess.AS

    Multi-Stage Speaker Diarization for Noisy Classrooms

    Authors: Ali Sartaz Khan, Tolulope Ogunremi, Ahmed Adel Attia, Dorottya Demszky

    Abstract: Speaker diarization, the process of identifying "who spoke when" in audio recordings, is essential for understanding classroom dynamics. However, classroom settings present distinct challenges, including poor recording quality, high levels of background noise, overlapping speech, and the difficulty of accurately capturing children's voices. This study investigates the effectiveness of multi-stage…

    Submitted 27 May, 2025; v1 submitted 16 May, 2025; originally announced May 2025.

  10. arXiv:2505.10055  [pdf, ps, other]

    cs.CV cs.AI

    PsOCR: Benchmarking Large Multimodal Models for Optical Character Recognition in Low-resource Pashto Language

    Authors: Ijazul Haq, Yingjie Zhang, Irfan Ali Khan

    Abstract: This paper evaluates the performance of Large Multimodal Models (LMMs) on Optical Character Recognition (OCR) in the low-resource Pashto language. Natural Language Processing (NLP) in Pashto faces several challenges due to the cursive nature of its script and a scarcity of structured datasets. To address this, we developed a synthetic Pashto OCR dataset, PsOCR, consisting of one million images ann…

    Submitted 15 May, 2025; originally announced May 2025.

  11. arXiv:2505.09894  [pdf, ps, other]

    cs.SE

    Advancing Mobile UI Testing by Learning Screen Usage Semantics

    Authors: Safwat Ali Khan

    Abstract: The demand for quality in mobile applications has increased greatly given users' high reliance on them for daily tasks. Developers work tirelessly to ensure that their applications are both functional and user-friendly. In pursuit of this, Automated Input Generation (AIG) tools have emerged as a promising solution for testing mobile applications by simulating user interactions and exploring app fu…

    Submitted 14 May, 2025; originally announced May 2025.

  12. arXiv:2505.07929  [pdf, ps, other]

    quant-ph

    Evidence that the Quantum Approximate Optimization Algorithm Optimizes the Sherrington-Kirkpatrick Model Efficiently in the Average Case

    Authors: Sami Boulebnane, Abid Khan, Minzhao Liu, Jeffrey Larson, Dylan Herman, Ruslan Shaydulin, Marco Pistoia

    Abstract: The Sherrington-Kirkpatrick (SK) model serves as a foundational framework for understanding disordered systems. The Quantum Approximate Optimization Algorithm (QAOA) is a quantum optimization algorithm whose performance monotonically improves with its depth $p$. We analyze QAOA applied to the SK model in the infinite-size limit and provide numerical evidence that it obtains a $(1-ε)$ approximation…

    Submitted 12 May, 2025; originally announced May 2025.

    Comments: 17 pages, 5 figures

  13. arXiv:2505.07635  [pdf, ps, other]

    cs.LG cs.DB

    Interpreting Graph Inference with Skyline Explanations

    Authors: Dazhuo Qiu, Haolai Che, Arijit Khan, Yinghui Wu

    Abstract: Inference queries have been routinely issued to graph machine learning models such as graph neural networks (GNNs) for various network analytical tasks. Nevertheless, GNN outputs are often hard to interpret comprehensively. Existing methods typically compromise to individual pre-defined explainability measures (such as fidelity), which often leads to biased, "one-sided" interpretations. This pa…

    Submitted 3 July, 2025; v1 submitted 12 May, 2025; originally announced May 2025.

  14. arXiv:2505.07634  [pdf, ps, other]

    cs.RO cs.AI cs.CV

    Neural Brain: A Neuroscience-inspired Framework for Embodied Agents

    Authors: Jian Liu, Xiongtao Shi, Thai Duy Nguyen, Haitian Zhang, Tianxiang Zhang, Wei Sun, Yanjie Li, Athanasios V. Vasilakos, Giovanni Iacca, Arshad Ali Khan, Arvind Kumar, Jae Won Cho, Ajmal Mian, Lihua Xie, Erik Cambria, Lin Wang

    Abstract: The rapid evolution of artificial intelligence (AI) has shifted from static, data-driven models to dynamic systems capable of perceiving and interacting with real-world environments. Despite advancements in pattern recognition and symbolic reasoning, current AI systems, such as large language models, remain disembodied, unable to physically engage with the world. This limitation has driven the ris…

    Submitted 14 May, 2025; v1 submitted 12 May, 2025; originally announced May 2025.

    Comments: 51 pages, 17 figures, 9 tables

  15. arXiv:2505.07435  [pdf, ps, other]

    hep-ph

    Predictions for Identified Hadron ($π^\pm$, $K^\pm$ and $p(\overline{p})$) Production and Collective Dynamics in Oxygen-Oxygen Collisions at $\sqrt{s_{NN}}$= 7 TeV with EPOS4, AMPT-SM, and Angantyr in Pythia 8

    Authors: Rabia Bashir, Ramoona Shehzadi, M. U. Ashraf, A. M. Khan

    Abstract: We study the dynamics of identified hadron ($π^\pm$, $K^\pm$ and $p(\overline{p})$) production in $O+O$ collisions at $\sqrt{s_{\mathrm{NN}}} = 7$ TeV using the recently updated version of EPOS4, the string melting version of A Multi-Phase Transport Model (AMPT-SM), and the Angantyr model incorporated within Pythia 8. We examine the interplay between different mechanisms implemented in these models. Predictio…

    Submitted 12 May, 2025; originally announced May 2025.

    Comments: 10 pages, 5 figures

  16. Temperature-dependent nuclear partition functions and abundances in stellar interior

    Authors: Jameel-Un Nabi, Abdel Nasser Tawfik, Nada Ezzelarab, Ali Abas Khan

    Abstract: We calculate temperature-dependent nuclear partition functions (TDNPFs) and nuclear abundances for 728 nuclei assuming nuclear statistical equilibrium (NSE). The theories of stellar evolution support NSE. Discrete nuclear energy levels have been calculated microscopically, using the pn-QRPA theory, up to an excitation energy of 10 MeV in the calculation of TDNPFs. This feature of our…

    Submitted 10 May, 2025; originally announced May 2025.

    Comments: 42 pages, 21 tables, 8 figures

    Journal ref: Physica Scripta, 91(5), 055301 (2016)

  17. arXiv:2505.06229  [pdf, ps, other]

    cs.LG math.NA

    Neural Network Operator-Based Fractal Approximation: Smoothness Preservation and Convergence Analysis

    Authors: Aaqib Ayoub Bhat, Asif Khan, M. Mursaleen

    Abstract: This paper presents a new approach to constructing $α$-fractal interpolation functions (FIFs) using neural network operators, integrating concepts from approximation theory. Initially, we construct $α$-fractals utilizing neural network-based operators, providing an approach to generating fractal functions with interpolation properties. Based on the same foundation, we have developed fractal interp…

    Submitted 22 March, 2025; originally announced May 2025.

    Comments: 18 pages

    MSC Class: 28A80; 41A05; 41A25; 41A29; 41A30; 65D05

  18. arXiv:2505.06128  [pdf, other]

    cond-mat.mtrl-sci cond-mat.mes-hall

    Above-room-temperature ferromagnetism in large-area epitaxial Fe3GaTe2/graphene van der Waals heterostructures

    Authors: Tauqir Shinwari, Kacho Imtiyaz Ali Khan, Hua Lv, Atekelte Abebe Kassa, Frans Munnik, Simon Josephy, Achim Trampert, Victor Ukleev, Chen Luo, Florin Radu, Jens Herfort, Michael Hanke, Joao Marcelo Jordao Lopes

    Abstract: Fe3GaTe2 (FGaT), a two-dimensional (2D) layered ferromagnetic metal, exhibits a high Curie temperature (TC) ~ 360 K along with strong perpendicular magnetic anisotropy (PMA), making it a promising material candidate for next-generation energy-efficient magnetic devices. However, the vast majority of studies on FGaT to date have been limited to millimeter-sized bulk crystals and exfoliated flakes,…

    Submitted 9 May, 2025; originally announced May 2025.

  19. arXiv:2505.04318  [pdf, other]

    cs.LG cs.AI eess.IV

    Detecting Concept Drift in Neural Networks Using Chi-squared Goodness of Fit Testing

    Authors: Jacob Glenn Ayers, Buvaneswari A. Ramanan, Manzoor A. Khan

    Abstract: As the adoption of deep learning models has grown beyond human capacity for verification, meta-algorithms are needed to ensure reliable model inference. Concept drift detection is a field dedicated to identifying statistical shifts that is underutilized in monitoring neural networks that may encounter inference data with distributional characteristics diverging from their training data. Given the…

    Submitted 7 May, 2025; originally announced May 2025.

    Comments: 8 pages, 6 figures, 1 table

  20. arXiv:2505.03931  [pdf, other]

    cs.RO

    NMPC-Lander: Nonlinear MPC with Barrier Function for UAV Landing on a Mobile Platform

    Authors: Amber Batool, Faryal Batool, Roohan Ahmed Khan, Muhammad Ahsan Mustafa, Aleksey Fedoseev, Dzmitry Tsetserukou

    Abstract: Quadcopters are versatile aerial robots gaining popularity in numerous critical applications. However, their operational effectiveness is constrained by limited battery life and restricted flight range. To address these challenges, autonomous drone landing on stationary or mobile charging and battery-swapping stations has become an essential capability. In this study, we present NMPC-Lander, a nov…

    Submitted 6 May, 2025; originally announced May 2025.

    Comments: This manuscript has been submitted to the IEEE International Conference on Systems, Man, and Cybernetics (SMC), 2025

  21. arXiv:2505.03787  [pdf, other]

    cs.LG cs.AI eess.SP

    ArrhythmiaVision: Resource-Conscious Deep Learning Models with Visual Explanations for ECG Arrhythmia Classification

    Authors: Zuraiz Baig, Sidra Nasir, Rizwan Ahmed Khan, Muhammad Zeeshan Ul Haque

    Abstract: Cardiac arrhythmias are a leading cause of life-threatening cardiac events, highlighting the urgent need for accurate and timely detection. Electrocardiography (ECG) remains the clinical gold standard for arrhythmia diagnosis; however, manual interpretation is time-consuming, dependent on clinical expertise, and prone to human error. Although deep learning has advanced automated ECG analysis, many…

    Submitted 30 April, 2025; originally announced May 2025.

    Comments: 14 pages and 8 figures

  22. arXiv:2505.03406  [pdf, other]

    cs.CL cs.AI

    Lightweight Clinical Decision Support System using QLoRA-Fine-Tuned LLMs and Retrieval-Augmented Generation

    Authors: Mohammad Shoaib Ansari, Mohd Sohail Ali Khan, Shubham Revankar, Aditya Varma, Anil S. Mokhade

    Abstract: This research paper investigates the application of Large Language Models (LLMs) in healthcare, specifically focusing on enhancing medical decision support through Retrieval-Augmented Generation (RAG) integrated with hospital-specific data and fine-tuning using Quantized Low-Rank Adaptation (QLoRA). The system utilizes Llama 3.2-3B-Instruct as its foundation model. By embedding and retrieving cont…

    Submitted 6 May, 2025; originally announced May 2025.

    Comments: 12 pages

  23. arXiv:2505.03267  [pdf, other]

    astro-ph.SR physics.plasm-ph

    Solar Coronal Heating: Role of Kinetic and Inertial Alfvén Waves in Heating and Charged Particle Acceleration

    Authors: Syed Ayaz, Gary P. Zank, Imran A. Khan, Yeimy J. Rivera, Andreas Shalchi, L. -L. Zhao

    Abstract: A comprehensive understanding of solar coronal heating and charged particle acceleration remains one of the most critical challenges in space and astrophysical plasma physics. In this study, we explore the contribution of Alfvén waves, both in their kinetic (KAWs) and inertial (IAWs) regimes, to particle acceleration processes that ultimately lead to coronal heating. Using a kinetic plasma framewo…

    Submitted 6 May, 2025; originally announced May 2025.

    Comments: Submitted to the Monthly Notices of the Royal Astronomical Society (MNRAS)

  24. arXiv:2505.02946  [pdf, ps, other]

    math.NA

    A variational multiscale approach to goal-oriented error estimation in finite element analysis of convection-diffusion-reaction equation problems

    Authors: Sheraz Ahmed Khan, Ramon Codina, Hauke Gravenkamp

    Abstract: This paper presents a goal-oriented a posteriori error estimation framework for linear functionals in the stabilized finite element discretization of the stationary convection-diffusion-reaction (CDR) equation. The theoretical framework for error estimation is based on the variational multiscale (VMS) concept, where the solution is decomposed into resolved (finite element) and unresolved (sub-grid…

    Submitted 5 May, 2025; originally announced May 2025.

  25. arXiv:2505.02531  [pdf, ps, other]

    math.NA

    A posteriori error estimates for the finite element approximation of the convection-diffusion-reaction equation based on the variational multiscale concept

    Authors: Ramon Codina, Hauke Gravenkamp, Sheraz Ahmed Khan

    Abstract: In this study, we employ the variational multiscale (VMS) concept to develop a posteriori error estimates for the stationary convection-diffusion-reaction equation. The variational multiscale method is based on splitting the continuous part of the problem into a resolved scale (coarse scale) and an unresolved scale (fine scale). The unresolved scale (also known as the sub-grid scale) is modeled by…

    Submitted 5 May, 2025; originally announced May 2025.

  26. arXiv:2505.01863  [pdf, other]

    quant-ph

    Quantum Energy Teleportation across Multi-Qubit Systems using W-State Entanglement

    Authors: Alif Elham Khan, Humayra Anjum, Mahdy Rahman Chowdhury

    Abstract: Quantum-energy teleportation (QET) has so far only been realised on a two-qubit platform. Real-world communication, however, typically involves multiple parties. Here we design and experimentally demonstrate the first multi-qubit QET protocol using a robust W-state multipartite entanglement. Three-, four- and five-qubit circuits were executed both on noiseless simulators and on IBM superconducting…

    Submitted 3 May, 2025; originally announced May 2025.

  27. arXiv:2505.01435  [pdf, other]

    cs.IR cs.CL cs.DC cs.LG

    AdaParse: An Adaptive Parallel PDF Parsing and Resource Scaling Engine

    Authors: Carlo Siebenschuh, Kyle Hippe, Ozan Gokdemir, Alexander Brace, Arham Khan, Khalid Hossain, Yadu Babuji, Nicholas Chia, Venkatram Vishwanath, Rick Stevens, Arvind Ramanathan, Ian Foster, Robert Underwood

    Abstract: Language models for scientific tasks are trained on text from scientific publications, most distributed as PDFs that require parsing. PDF parsing approaches range from inexpensive heuristics (for simple documents) to computationally intensive ML-driven systems (for complex or degraded ones). The choice of the "best" parser for a particular document depends on its computational cost and the accurac…

    Submitted 23 April, 2025; originally announced May 2025.

    Comments: This paper has been accepted at the Eighth Annual Conference on Machine Learning and Systems (MLSys 2025)

  28. arXiv:2504.21831  [pdf, other]

    cs.CV cs.AI

    Early Exit and Multi Stage Knowledge Distillation in VLMs for Video Summarization

    Authors: Anas Anwarul Haq Khan, Utkarsh Verma, Prateek Chanda, Ganesh Ramakrishnan

    Abstract: We introduce DEEVISum (Distilled Early Exit Vision language model for Summarization), a lightweight, efficient, and scalable vision language model designed for segment wise video summarization. Leveraging multi modal prompts that combine textual and audio derived signals, DEEVISum incorporates Multi Stage Knowledge Distillation (MSKD) and Early Exit (EE) to strike a balance between performance and…

    Submitted 30 April, 2025; originally announced April 2025.

  29. arXiv:2504.21745  [pdf, other]

    quant-ph

    Exponential advantage in quantum sensing of correlated parameters

    Authors: Sridhar Prabhu, Vladimir Kremenetski, Saeed A. Khan, Ryotatsu Yanagimoto, Peter L. McMahon

    Abstract: Conventionally in quantum sensing, the goal is to estimate one or more unknown parameters that are assumed to be deterministic - that is, they do not change between shots of the quantum-sensing protocol. We instead consider the setting where the parameters are stochastic: each shot of the quantum-sensing protocol senses parameter values that come from independent random draws. In this work, we exp…

    Submitted 30 April, 2025; originally announced April 2025.

  30. arXiv:2504.20235  [pdf, other]

    math.OC math.AP

    Dynamic output-based feedback stabilizability for linear parabolic equations with memory

    Authors: Arbaz Khan, Sumit Mahajan, Sérgio S. Rodrigues

    Abstract: The stabilizability of a general class of linear parabolic equations with a memory term is achieved by explicit output feedback. The control input is given as a function of a state-estimate provided by an exponential dynamic Luenberger observer based on the output of sensor measurements. The numbers of actuators and sensors are finite. The feedback input and output injection operators are given ex…

    Submitted 28 April, 2025; originally announced April 2025.

    Comments: 15 figures

  31. arXiv:2504.19461  [pdf]

    cs.SE

    The Role of Generative AI in Strengthening Secure Software Coding Practices: A Systematic Perspective

    Authors: Hathal S. Alwageed, Rafiq Ahmad Khan

    Abstract: As software security threats continue to evolve, the demand for innovative ways of securing code has grown tremendously. The integration of Generative AI (GenAI) into software development holds significant potential for improving secure coding practices. This paper aims at systematically studying the impact of GenAI in enhancing secure coding practices from improving software security, setting f…

    Submitted 28 April, 2025; originally announced April 2025.

    Comments: 1-6 pages

  32. arXiv:2504.19271  [pdf, other]

    cs.CV

    Leveraging Multi-Modal Saliency and Fusion for Gaze Target Detection

    Authors: Athul M. Mathew, Arshad Ali Khan, Thariq Khalid, Faroq AL-Tam, Riad Souissi

    Abstract: Gaze target detection (GTD) is the task of predicting where a person in an image is looking. This is a challenging task, as it requires the ability to understand the relationship between the person's head, body, and eyes, as well as the surrounding environment. In this paper, we propose a novel method for GTD that fuses multiple pieces of information extracted from an image. First, we project the…

    Submitted 27 April, 2025; originally announced April 2025.

    Comments: accepted at NeurIPS 2023 Gaze Meets ML Workshop

  33. arXiv:2504.18856  [pdf, other]

    cs.CV

    Multi-Resolution Pathology-Language Pre-training Model with Text-Guided Visual Representation

    Authors: Shahad Albastaki, Anabia Sohail, Iyyakutti Iyappan Ganapathi, Basit Alawode, Asim Khan, Sajid Javed, Naoufel Werghi, Mohammed Bennamoun, Arif Mahmood

    Abstract: In Computational Pathology (CPath), the introduction of Vision-Language Models (VLMs) has opened new avenues for research, focusing primarily on aligning image-text pairs at a single magnification level. However, this approach might not be sufficient for tasks like cancer subtype classification, tissue phenotyping, and survival analysis due to the limited level of detail that a single-resolution i…

    Submitted 26 April, 2025; originally announced April 2025.

  34. arXiv:2504.15995  [pdf, other]

    cs.LG cs.AI

    OPUS-VFL: Incentivizing Optimal Privacy-Utility Tradeoffs in Vertical Federated Learning

    Authors: Sindhuja Madabushi, Ahmad Faraz Khan, Haider Ali, Jin-Hee Cho

    Abstract: Vertical Federated Learning (VFL) enables organizations with disjoint feature spaces but shared user bases to collaboratively train models without sharing raw data. However, existing VFL systems face critical limitations: they often lack effective incentive mechanisms, struggle to balance privacy-utility tradeoffs, and fail to accommodate clients with heterogeneous resource capabilities. These cha…

    Submitted 22 April, 2025; originally announced April 2025.

  35. arXiv:2504.13534  [pdf, other]

    cs.CL cs.AI

    CoT-RAG: Integrating Chain of Thought and Retrieval-Augmented Generation to Enhance Reasoning in Large Language Models

    Authors: Feiyang Li, Peng Fang, Zhan Shi, Arijit Khan, Fang Wang, Dan Feng, Weihao Wang, Xin Zhang, Yongjian Cui

    Abstract: Chain-of-thought (CoT) reasoning boosts large language models' (LLMs) performance on complex tasks but faces two key limitations: a lack of reliability when solely relying on LLM-generated reasoning chains and interference from natural language reasoning steps with the models' inference process, also known as the inference logic of LLMs. To address these issues, we propose CoT-RAG, a novel reasoni…

    Submitted 18 May, 2025; v1 submitted 18 April, 2025; originally announced April 2025.

  36. arXiv:2504.13242  [pdf, other]

    cs.CV

    Dynamic Memory-enhanced Transformer for Hyperspectral Image Classification

    Authors: Muhammad Ahmad, Manuel Mazzara, Salvatore Distefano, Adil Mehmood Khan

    Abstract: Hyperspectral image (HSI) classification remains a challenging task due to the intricate spatial-spectral correlations. Existing transformer models excel in capturing long-range dependencies but often suffer from information redundancy and attention inefficiencies, limiting their ability to model fine-grained relationships crucial for HSI classification. To overcome these limitations, this work pr…

    Submitted 17 April, 2025; originally announced April 2025.

  37. arXiv:2504.13041  [pdf, other]

    quant-ph math.OC

    QI-MPC: A Hybrid Quantum-Inspired Model Predictive Control for Learning Optimal Policies

    Authors: Muhammad Al-Zafar Khan, Jamal Al-Karaki

    Abstract: In this paper, we present Quantum-Inspired Model Predictive Control (QIMPC), an approach that uses Variational Quantum Circuits (VQCs) to learn control policies in MPC problems. The viability of the approach is tested in five experiments: A target-tracking control strategy, energy-efficient building climate control, autonomous vehicular dynamics, the simple pendulum, and the compound pendulum. Thre…

    Submitted 17 April, 2025; originally announced April 2025.

    Comments: 41 pages, 21 figures

  38. arXiv:2504.12399  [pdf, other]

    quant-ph

    A tensor network approach to sensing quantum light-matter interactions

    Authors: Aiman Khan, Francesco Albarelli, Animesh Datta

    Abstract: We present the fundamental limits to the precision of estimating parameters of a quantum matter system probed by light, even when some of the light is lost. This practically inevitable scenario leads to a tripartite quantum system of matter, and light -- detected and lost. Evaluating fundamental information theoretic quantities such as the quantum Fisher information of only the detected light was…

    Submitted 16 April, 2025; originally announced April 2025.

    Comments: 21 pages, 5 figures. See related work by D. Yang et al

  39. arXiv:2504.12088  [pdf, ps, other]

    cs.CV cs.AI cs.LG

    AttentionDrop: A Novel Regularization Method for Transformer Models

    Authors: Mirza Samad Ahmed Baig, Syeda Anshrah Gillani, Abdul Akbar Khan, Shahid Munir Shah

    Abstract: Transformer-based architectures achieve state-of-the-art performance across a wide range of tasks in natural language processing, computer vision, and speech. However, their immense capacity often leads to overfitting, especially when training data is limited or noisy. We propose AttentionDrop, a unified family of stochastic regularization techniques that operate directly on the self-attention dis…

    Submitted 16 April, 2025; originally announced April 2025.

    Comments: 26 pages

  40. arXiv:2504.10964  [pdf, other]

    eess.SY

    Distributed Optimization with Gradient Tracking over Heterogeneous Delay-Prone Directed Networks

    Authors: Evagoras Makridis, Gabriele Oliva, Kasagatta Ramesh Narahari, Mohammadreza Doostmohammadian, Usman A. Khan, Themistoklis Charalambous

    Abstract: In this paper, we address the distributed optimization problem over unidirectional networks with possibly time-invariant heterogeneous bounded transmission delays. In particular, we propose a modified version of the Accelerated Distributed Directed OPTimization (ADD-OPT) algorithm, herein called Robustified ADD-OPT (R-ADD-OPT), which is able to solve the distributed optimization problem, even when…

    Submitted 16 April, 2025; v1 submitted 15 April, 2025; originally announced April 2025.

  41. arXiv:2504.10677  [pdf, other]

    cs.LG cs.AI cs.MA

    Achieving Optimal Tissue Repair Through MARL with Reward Shaping and Curriculum Learning

    Authors: Muhammad Al-Zafar Khan, Jamal Al-Karaki

    Abstract: In this paper, we present a multi-agent reinforcement learning (MARL) framework for optimizing tissue repair processes using engineered biological agents. Our approach integrates: (1) stochastic reaction-diffusion systems modeling molecular signaling, (2) neural-like electrochemical communication with Hebbian plasticity, and (3) a biologically informed reward function combining chemical gradient t…

    Submitted 14 April, 2025; originally announced April 2025.

    Comments: 14 pages, 4 figures, submitted to the 10th International Conference on Information and Communication Technology for Intelligent Systems (ICTIS)

  42. arXiv:2504.10374  [pdf, other]

    cs.LG

    Ctrl-Z: Controlling AI Agents via Resampling

    Authors: Aryan Bhatt, Cody Rushing, Adam Kaufman, Tyler Tracy, Vasil Georgiev, David Matolcsi, Akbir Khan, Buck Shlegeris

    Abstract: Control evaluations measure whether monitoring and security protocols for AI systems prevent intentionally subversive AI models from causing harm. Our work presents the first control evaluation performed in an agent environment. We construct BashBench, a dataset of 257 challenging multi-step system administration tasks, and evaluate whether various safety measures can prevent an adversarially cons…

    Submitted 14 April, 2025; originally announced April 2025.

    Comments: bashcontrol.com

  43. arXiv:2504.09713  [pdf, other]

    cs.ET

    A Full Spectrum of 3D Ferroelectric Memory Architectures Shaped by Polarization Sensing

    Authors: Jiahui Duan, Asif Khan, Xiao Gong, Vijaykrishnan Narayanan, Kai Ni

    Abstract: Ferroelectric memories have attracted significant interest due to their non-volatile storage, energy efficiency, and fast operation, making them prime candidates for future memory technologies. As commercial Dynamic Random Access Memory (DRAM) and NAND flash memory are transiting or have moved toward three-dimensional (3D) integration, 3D ferroelectric memory architectures are also emerging, provi…

    Submitted 13 April, 2025; originally announced April 2025.

    Comments: 65 pages, 5 figures

  44. arXiv:2504.08340  [pdf, other]

    cs.ET cs.AR

    All-in-Memory Stochastic Computing using ReRAM

    Authors: João Paulo C. de Lima, Mehran Shoushtari Moghadam, Sercan Aygun, Jeronimo Castrillon, M. Hassan Najafi, Asif Ali Khan

    Abstract: As the demand for efficient, low-power computing in embedded and edge devices grows, traditional computing methods are becoming less effective for handling complex tasks. Stochastic computing (SC) offers a promising alternative by approximating complex arithmetic operations, such as addition and multiplication, using simple bitwise operations, like majority or AND, on random bit-streams. While SC…

    Submitted 11 April, 2025; originally announced April 2025.

    Comments: 7 pages, 5 figures, To appear in DAC 2025

  45. arXiv:2504.08208  [pdf, other]

    cs.IR cs.AI

    How Good Are Large Language Models for Course Recommendation in MOOCs?

    Authors: Boxuan Ma, Md Akib Zabed Khan, Tianyuan Yang, Agoritsa Polyzou, Shin'ichi Konomi

    Abstract: Large Language Models (LLMs) have made significant strides in natural language processing and are increasingly being integrated into recommendation systems. However, their potential in educational recommendation systems has yet to be fully explored. This paper investigates the use of LLMs as a general-purpose recommendation model, leveraging their vast knowledge derived from large-scale corpora fo…

    Submitted 10 April, 2025; originally announced April 2025.

  46. arXiv:2504.05809  [pdf, other]

    physics.optics

    Loss-free enhancement of photonic spin Hall shift by electromagnetically induced transparency

    Authors: Kezhou Du, Aizaz Khan, Lei Gao, Muzamil Shah, Xinxing Zhou, Dongliang Gao

    Abstract: The photonic spin Hall effect (PSHE), a result of spin-orbit interaction, has attracted significant interest because of its fundamental importance and potential applications. Optical losses are ubiquitous, which inherently suppress the photonic spin Hall shift (PSHS). In this work, we consider an atomic medium that exhibits both absorption and transparency to investigate and mitigate the effects o…

    Submitted 8 April, 2025; originally announced April 2025.

  47. arXiv:2504.04722  [pdf, other]

    cs.CV

    TactileNet: Bridging the Accessibility Gap with AI-Generated Tactile Graphics for Individuals with Vision Impairment

    Authors: Adnan Khan, Alireza Choubineh, Mai A. Shaaban, Abbas Akkasi, Majid Komeili

    Abstract: Tactile graphics are essential for providing access to visual information for the 43 million people globally living with vision loss. Traditional methods for creating these graphics are labor-intensive and cannot meet growing demand. We introduce TactileNet, the first comprehensive dataset and AI-driven framework for generating embossing-ready 2D tactile templates using text-to-image Stable Diffus…

    Submitted 15 May, 2025; v1 submitted 7 April, 2025; originally announced April 2025.

  48. arXiv:2504.04372  [pdf, other]

    cs.SE cs.AI cs.LG

    How Accurately Do Large Language Models Understand Code?

    Authors: Sabaat Haroon, Ahmad Faraz Khan, Ahmad Humayun, Waris Gill, Abdul Haddi Amjad, Ali R. Butt, Mohammad Taha Khan, Muhammad Ali Gulzar

    Abstract: Large Language Models (LLMs) are increasingly used in post-development tasks such as code repair and testing. A key factor in these tasks' success is the model's deep understanding of code. However, the extent to which LLMs truly understand code remains largely unevaluated. Quantifying code comprehension is challenging due to its abstract nature and the lack of a standardized metric. Previously, t…

    Submitted 9 April, 2025; v1 submitted 6 April, 2025; originally announced April 2025.

    Comments: This paper is currently Under Review. It consists of 11 pages, 12 Figures, and 5 Tables

  49. arXiv:2504.04235  [pdf, other]

    quant-ph

    Quantum parallel information exchange (QPIE) hybrid network with transfer learning

    Authors: Ziqing Guo, Alex Khan, Victor S. Sheng, Shabnam Jabeen, Ziwen Pan

    Abstract: Quantum machine learning (QML) has emerged as an innovative framework with the potential to uncover complex patterns by leveraging quantum systems' ability to simulate and exploit high-dimensional latent spaces, particularly in learning tasks. Quantum neural network (QNN) frameworks are inherently sensitive to the precision of gradient calculations and the computational limitations of current quant…

    Submitted 5 April, 2025; originally announced April 2025.

  50. arXiv:2504.04124  [pdf, other]

    cs.CV

    EMF: Event Meta Formers for Event-based Real-time Traffic Object Detection

    Authors: Muhammad Ahmed Ullah Khan, Abdul Hannan Khan, Andreas Dengel

    Abstract: Event cameras have higher temporal resolution and require less storage and bandwidth than traditional RGB cameras. However, due to the relatively lagging performance of event-based approaches, event cameras have not yet replaced traditional cameras in performance-critical applications like autonomous driving. Recent approaches in event-based object detection try to bridge this gap by employing…

    Submitted 5 April, 2025; originally announced April 2025.

    Comments: 10 pages, 2 figures