Showing 1–48 of 48 results for author: Khan, S U

Searching in archive cs.

  1. arXiv:2507.07839  [pdf, ps, other]

    eess.IV cs.CV

    MeD-3D: A Multimodal Deep Learning Framework for Precise Recurrence Prediction in Clear Cell Renal Cell Carcinoma (ccRCC)

    Authors: Hasaan Maqsood, Saif Ur Rehman Khan

    Abstract: Accurate prediction of recurrence in clear cell renal cell carcinoma (ccRCC) remains a major clinical challenge due to the disease's complex molecular, pathological, and clinical heterogeneity. Traditional prognostic models, which rely on single data modalities such as radiology, histopathology, or genomics, often fail to capture the full spectrum of disease complexity, resulting in suboptimal predi…

    Submitted 10 July, 2025; originally announced July 2025.

  2. arXiv:2507.06763  [pdf, ps, other]

    cs.CV cs.AI

    FOLC-Net: A Federated-Optimized Lightweight Architecture for Enhanced MRI Disease Diagnosis across Axial, Coronal, and Sagittal Views

    Authors: Saif Ur Rehman Khan, Muhammad Nabeel Asim, Sebastian Vollmer, Andreas Dengel

    Abstract: The framework is designed to improve performance in the analysis of combined as well as single anatomical perspectives for MRI disease diagnosis. It specifically addresses the performance degradation observed in state-of-the-art (SOTA) models, particularly when processing axial, coronal, and sagittal anatomical planes. The paper introduces the FOLC-Net framework, which incorporates a novel federat…

    Submitted 9 July, 2025; originally announced July 2025.

  3. arXiv:2507.05190  [pdf, ps, other]

    quant-ph cs.CV

    QMoE: A Quantum Mixture of Experts Framework for Scalable Quantum Neural Networks

    Authors: Hoang-Quan Nguyen, Xuan-Bac Nguyen, Sankalp Pandey, Samee U. Khan, Ilya Safro, Khoa Luu

    Abstract: Quantum machine learning (QML) has emerged as a promising direction in the noisy intermediate-scale quantum (NISQ) era, offering computational and memory advantages by harnessing superposition and entanglement. However, QML models often face challenges in scalability and expressiveness due to hardware constraints. In this paper, we propose quantum mixture of experts (QMoE), a novel quantum archite…

    Submitted 7 July, 2025; originally announced July 2025.

  4. arXiv:2507.04249  [pdf, ps, other]

    cs.CY cs.ET

    Ethics by Design: A Lifecycle Framework for Trustworthy AI in Medical Imaging From Transparent Data Governance to Clinically Validated Deployment

    Authors: Umer Sadiq Khan, Saif Ur Rehman Khan

    Abstract: The integration of artificial intelligence (AI) in medical imaging raises crucial ethical concerns at every stage of its development, from data collection to deployment. Addressing these concerns is essential for ensuring that AI systems are developed and implemented in a manner that respects patient rights and promotes fairness. This study aims to explore the ethical implications of AI in medical…

    Submitted 6 July, 2025; originally announced July 2025.

  5. arXiv:2506.23111  [pdf, ps, other]

    cs.CL

    FairI Tales: Evaluation of Fairness in Indian Contexts with a Focus on Bias and Stereotypes

    Authors: Janki Atul Nawale, Mohammed Safi Ur Rahman Khan, Janani D, Mansi Gupta, Danish Pruthi, Mitesh M. Khapra

    Abstract: Existing studies on fairness are largely Western-focused, making them inadequate for culturally diverse countries such as India. To address this gap, we introduce INDIC-BIAS, a comprehensive India-centric benchmark designed to evaluate fairness of LLMs across 85 identity groups encompassing diverse castes, religions, regions, and tribes. We first consult domain experts to curate over 1,800 socio-c…

    Submitted 29 June, 2025; originally announced June 2025.

    Comments: Accepted in ACL 2025

  6. arXiv:2505.06381  [pdf, ps, other]

    cs.CV

    Robust & Precise Knowledge Distillation-based Novel Context-Aware Predictor for Disease Detection in Brain and Gastrointestinal

    Authors: Saif Ur Rehman Khan, Muhammad Nabeel Asim, Sebastian Vollmer, Andreas Dengel

    Abstract: Medical disease prediction, particularly through imaging, remains a challenging task due to the complexity and variability of medical data, including noise, ambiguity, and differing image quality. Recent deep learning models, including Knowledge Distillation (KD) methods, have shown promising results in brain tumor image identification but still face limitations in handling uncertainty and general…

    Submitted 9 May, 2025; originally announced May 2025.

  7. arXiv:2504.08110  [pdf, other]

    cs.CV

    Towards Unconstrained 2D Pose Estimation of the Human Spine

    Authors: Muhammad Saif Ullah Khan, Stephan Krauß, Didier Stricker

    Abstract: We present SpineTrack, the first comprehensive dataset for 2D spine pose estimation in unconstrained settings, addressing a crucial need in sports analytics, healthcare, and realistic animation. Existing pose datasets often simplify the spine to a single rigid segment, overlooking the nuanced articulation required for accurate motion analysis. In contrast, SpineTrack annotates nine detailed spinal…

    Submitted 10 April, 2025; originally announced April 2025.

    Comments: Accepted for publication in CVPRW 2025

  8. arXiv:2503.24301  [pdf, other]

    cs.ET

    QUADRO: A Hybrid Quantum Optimization Framework for Drone Delivery

    Authors: James B. Holliday, Darren Blount, Hoang Quan Nguyen, Samee U. Khan, Khoa Luu

    Abstract: Quantum computing holds transformative potential for optimizing large-scale drone fleet operations, yet its near-term limitations necessitate hybrid approaches blending classical and quantum techniques. This work introduces Quantum Unmanned Aerial Delivery Routing Optimization (QUADRO), a novel hybrid framework addressing the Energy-Constrained Capacitated Unmanned Aerial Vehicle Routing Problem a…

    Submitted 31 March, 2025; originally announced March 2025.

    Comments: submitted to QCE 2025

  9. arXiv:2503.14209  [pdf, other]

    cs.CV

    AI-Driven Diabetic Retinopathy Diagnosis Enhancement through Image Processing and Salp Swarm Algorithm-Optimized Ensemble Network

    Authors: Saif Ur Rehman Khan, Muhammad Nabeel Asim, Sebastian Vollmer, Andreas Dengel

    Abstract: Diabetic retinopathy is a leading cause of blindness in diabetic patients and early detection plays a crucial role in preventing vision loss. Traditional diagnostic methods are often time-consuming and prone to errors. The emergence of deep learning techniques has provided innovative solutions to improve diagnostic efficiency. However, single deep learning models frequently face issues related to…

    Submitted 18 March, 2025; originally announced March 2025.

  10. arXiv:2502.17226  [pdf, other]

    cs.LG

    Electrical Load Forecasting over Multihop Smart Metering Networks with Federated Learning

    Authors: Ratun Rahman, Pablo Moriano, Samee U. Khan, Dinh C. Nguyen

    Abstract: Electric load forecasting is essential for power management and stability in smart grids. This is mainly achieved via advanced metering infrastructure, where smart meters (SMs) record household energy data. Traditional machine learning (ML) methods are often employed for load forecasting but require data sharing which raises data privacy concerns. Federated learning (FL) can address this issue by…

    Submitted 24 February, 2025; originally announced February 2025.

    Comments: arXiv admin note: text overlap with arXiv:2411.10619

  11. arXiv:2501.11310  [pdf, other]

    cs.CV

    Anomaly Detection for Industrial Applications, Its Challenges, Solutions, and Future Directions: A Review

    Authors: Abdelrahman Alzarooni, Ehtesham Iqbal, Samee Ullah Khan, Sajid Javed, Brain Moyo, Yusra Abdulrahman

    Abstract: Anomaly detection from images captured using camera sensors is one of the mainstream applications at the industrial level. Particularly, it maintains the quality and optimizes the efficiency in production processes across diverse industrial tasks, including advanced manufacturing and aerospace engineering. Traditional anomaly detection workflow is based on a manual inspection by human operators, w…

    Submitted 20 January, 2025; originally announced January 2025.

  12. arXiv:2501.07244  [pdf, other]

    cs.CV cs.CL

    Can Vision-Language Models Evaluate Handwritten Math?

    Authors: Oikantik Nath, Hanani Bathina, Mohammed Safi Ur Rahman Khan, Mitesh M. Khapra

    Abstract: Recent advancements in Vision-Language Models (VLMs) have opened new possibilities in automatic grading of handwritten student responses, particularly in mathematics. However, a comprehensive study to test the ability of VLMs to evaluate and reason over handwritten content remains absent. To address this gap, we introduce FERMAT, a benchmark designed to assess the ability of VLMs to detect, locali…

    Submitted 12 March, 2025; v1 submitted 13 January, 2025; originally announced January 2025.

  13. arXiv:2411.19096  [pdf, other]

    cs.CL

    Pralekha: An Indic Document Alignment Evaluation Benchmark

    Authors: Sanjay Suryanarayanan, Haiyue Song, Mohammed Safi Ur Rahman Khan, Anoop Kunchukuttan, Mitesh M. Khapra, Raj Dabre

    Abstract: Mining parallel document pairs poses a significant challenge because existing sentence embedding models often have limited context windows, preventing them from effectively capturing document-level information. Another overlooked issue is the lack of concrete evaluation benchmarks comprising high-quality parallel document pairs for assessing document-level mining approaches, particularly for Indic…

    Submitted 28 November, 2024; originally announced November 2024.

    Comments: Work in Progress

  14. arXiv:2411.13378  [pdf, other]

    cs.CV

    Quantum-Brain: Quantum-Inspired Neural Network Approach to Vision-Brain Understanding

    Authors: Hoang-Quan Nguyen, Xuan-Bac Nguyen, Hugh Churchill, Arabinda Kumar Choudhary, Pawan Sinha, Samee U. Khan, Khoa Luu

    Abstract: Vision-brain understanding aims to extract semantic information about brain signals from human perceptions. Existing deep learning methods for vision-brain understanding are usually introduced in a traditional learning paradigm missing the ability to learn the connectivities between brain regions. Meanwhile, the quantum computing theory offers a new paradigm for designing deep learning models. Mot…

    Submitted 20 November, 2024; originally announced November 2024.

  15. arXiv:2411.04699  [pdf, ps, other]

    cs.CL

    Towards Building Large Scale Datasets and State-of-the-Art Automatic Speech Translation Systems for 14 Indian Languages

    Authors: Ashwin Sankar, Sparsh Jain, Nikhil Narasimhan, Devilal Choudhary, Dhairya Suman, Mohammed Safi Ur Rahman Khan, Anoop Kunchukuttan, Mitesh M Khapra, Raj Dabre

    Abstract: Speech translation for Indian languages remains a challenging task due to the scarcity of large-scale, publicly available datasets that capture the linguistic diversity and domain coverage essential for real-world applications. Existing datasets cover a fraction of Indian languages and lack the breadth needed to train robust models that generalize beyond curated benchmarks. To bridge this gap, we…

    Submitted 31 May, 2025; v1 submitted 7 November, 2024; originally announced November 2024.

    Comments: Accepted at ACL (Main) 2025

  16. arXiv:2411.02538  [pdf, other]

    cs.CL

    MILU: A Multi-task Indic Language Understanding Benchmark

    Authors: Sshubam Verma, Mohammed Safi Ur Rahman Khan, Vishwajeet Kumar, Rudra Murthy, Jaydeep Sen

    Abstract: Evaluating Large Language Models (LLMs) in low-resource and linguistically diverse languages remains a significant challenge in NLP, particularly for languages using non-Latin scripts like those spoken in India. Existing benchmarks predominantly focus on English, leaving substantial gaps in assessing LLM capabilities in these languages. We introduce MILU, a Multi-task Indic Language Understanding…

    Submitted 4 February, 2025; v1 submitted 4 November, 2024; originally announced November 2024.

  17. arXiv:2410.13394  [pdf, other]

    cs.CL

    Cross-Lingual Auto Evaluation for Assessing Multilingual LLMs

    Authors: Sumanth Doddapaneni, Mohammed Safi Ur Rahman Khan, Dilip Venkatesh, Raj Dabre, Anoop Kunchukuttan, Mitesh M. Khapra

    Abstract: Evaluating machine-generated text remains a significant challenge in NLP, especially for non-English languages. Current methodologies, including automated metrics, human assessments, and LLM-based evaluations, predominantly focus on English, revealing a significant gap in multilingual evaluation frameworks. We introduce the Cross Lingual Auto Evaluation (CIA) Suite, an extensible framework that in…

    Submitted 17 October, 2024; originally announced October 2024.

  18. arXiv:2409.20469  [pdf, other]

    cs.CV

    Continual Human Pose Estimation for Incremental Integration of Keypoints and Pose Variations

    Authors: Muhammad Saif Ullah Khan, Muhammad Ahmed Ullah Khan, Muhammad Zeshan Afzal, Didier Stricker

    Abstract: This paper reformulates cross-dataset human pose estimation as a continual learning task, aiming to integrate new keypoints and pose variations into existing models without losing accuracy on previously learned datasets. We benchmark this formulation against established regularization-based methods for mitigating catastrophic forgetting, including EWC, LFL, and LwF. Moreover, we propose a novel re…

    Submitted 30 September, 2024; originally announced September 2024.

  19. arXiv:2409.20237  [pdf, other]

    cs.CV

    Classroom-Inspired Multi-Mentor Distillation with Adaptive Learning Strategies

    Authors: Shalini Sarode, Muhammad Saif Ullah Khan, Tahira Shehzadi, Didier Stricker, Muhammad Zeshan Afzal

    Abstract: We propose ClassroomKD, a novel multi-mentor knowledge distillation framework inspired by classroom environments to enhance knowledge transfer between the student and multiple mentors with different knowledge levels. Unlike traditional methods that rely on fixed mentor-student relationships, our framework dynamically selects and adapts the teaching strategies of diverse mentors based on their effe…

    Submitted 17 March, 2025; v1 submitted 30 September, 2024; originally announced September 2024.

    Comments: Accepted in IntelliSys 2025

    ACM Class: I.2.6

  20. arXiv:2408.03596  [pdf, other]

    quant-ph cs.CV

    Hierarchical Quantum Control Gates for Functional MRI Understanding

    Authors: Xuan-Bac Nguyen, Hoang-Quan Nguyen, Hugh Churchill, Samee U. Khan, Khoa Luu

    Abstract: Quantum computing has emerged as a powerful tool for solving complex problems intractable for classical computers, particularly in popular fields such as cryptography, optimization, and neurocomputing. In this paper, we present a new quantum-based approach named the Hierarchical Quantum Control Gates (HQCG) method for efficient understanding of Functional Magnetic Resonance Imaging (fMRI) data. Th…

    Submitted 22 September, 2024; v1 submitted 7 August, 2024; originally announced August 2024.

    Comments: Accepted to IEEE Workshop on Signal Processing Systems (SiPS 2024)

  21. Shape2.5D: A Dataset of Texture-less Surfaces for Depth and Normals Estimation

    Authors: Muhammad Saif Ullah Khan, Sankalp Sinha, Didier Stricker, Marcus Liwicki, Muhammad Zeshan Afzal

    Abstract: Reconstructing texture-less surfaces poses unique challenges in computer vision, primarily due to the lack of specialized datasets that cater to the nuanced needs of depth and normals estimation in the absence of textural information. We introduce "Shape2.5D," a novel, large-scale dataset designed to address this gap. Comprising 1.17 million frames spanning over 39,772 3D models and 48 unique obje…

    Submitted 5 November, 2024; v1 submitted 22 June, 2024; originally announced June 2024.

    Comments: Accepted for publication in IEEE Access

  22. arXiv:2406.14370  [pdf, other]

    cs.CV

    Enhanced Bank Check Security: Introducing a Novel Dataset and Transformer-Based Approach for Detection and Verification

    Authors: Muhammad Saif Ullah Khan, Tahira Shehzadi, Rabeya Noor, Didier Stricker, Muhammad Zeshan Afzal

    Abstract: Automated signature verification on bank checks is critical for fraud prevention and ensuring transaction authenticity. This task is challenging due to the coexistence of signatures with other textual and graphical elements on real-world documents. Verification systems must first detect the signature and then validate its authenticity, a dual challenge often overlooked by current datasets and meth…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Accepted for publication in 16th IAPR International Workshop on Document Analysis Systems 2024

  23. arXiv:2406.13439  [pdf, other]

    cs.CL

    Finding Blind Spots in Evaluator LLMs with Interpretable Checklists

    Authors: Sumanth Doddapaneni, Mohammed Safi Ur Rahman Khan, Sshubam Verma, Mitesh M. Khapra

    Abstract: Large Language Models (LLMs) are increasingly relied upon to evaluate text outputs of other LLMs, thereby influencing leaderboards and development decisions. However, concerns persist over the accuracy of these assessments and the potential for misleading conclusions. In this work, we investigate the effectiveness of LLMs as evaluators for text generation tasks. We propose FBI, a novel framework d…

    Submitted 26 November, 2024; v1 submitted 19 June, 2024; originally announced June 2024.

    Comments: EMNLP 2024

  24. SituationalLLM: Proactive language models with scene awareness for dynamic, contextual task guidance

    Authors: Muhammad Saif Ullah Khan, Muhammad Zeshan Afzal, Didier Stricker

    Abstract: Large language models (LLMs) have achieved remarkable success in text-based tasks but often struggle to provide actionable guidance in real-world physical environments. This is because of their inability to recognize their limited understanding of the user's physical context. We present SituationalLLM, a novel approach that integrates structured scene information into an LLM to deliver proactive,…

    Submitted 31 January, 2025; v1 submitted 19 June, 2024; originally announced June 2024.

    Comments: Revised Submission to Open Research Europe

    Journal ref: Open Research Europe, 5 (2025) 1-14

  25. arXiv:2406.00843  [pdf, other]

    quant-ph cs.LG

    Diffusion-Inspired Quantum Noise Mitigation in Parameterized Quantum Circuits

    Authors: Hoang-Quan Nguyen, Xuan Bac Nguyen, Samuel Yen-Chi Chen, Hugh Churchill, Nicholas Borys, Samee U. Khan, Khoa Luu

    Abstract: Parameterized Quantum Circuits (PQCs) have been acknowledged as a leading strategy to utilize near-term quantum advantages in multiple problems, including machine learning and combinatorial optimization. When applied to specific tasks, the parameters in the quantum circuits are trained to minimize the target function. Although there have been comprehensive studies to improve the performance of the…

    Submitted 22 February, 2025; v1 submitted 2 June, 2024; originally announced June 2024.

  26. arXiv:2405.20084  [pdf, other]

    cs.CV

    Estimating Human Poses Across Datasets: A Unified Skeleton and Multi-Teacher Distillation Approach

    Authors: Muhammad Saif Ullah Khan, Dhavalkumar Limbachiya, Didier Stricker, Muhammad Zeshan Afzal

    Abstract: Human pose estimation is a key task in computer vision with various applications such as activity recognition and interactive systems. However, the lack of consistency in the annotated skeletons across different datasets poses challenges in developing universally applicable models. To address this challenge, we propose a novel approach integrating multi-teacher knowledge distillation with a unifie…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 15 pages (with references)

  27. arXiv:2405.19725  [pdf, other]

    quant-ph cs.CV

    Quantum Visual Feature Encoding Revisited

    Authors: Xuan-Bac Nguyen, Hoang-Quan Nguyen, Hugh Churchill, Samee U. Khan, Khoa Luu

    Abstract: Although quantum machine learning has been around for a while, its applications in computer vision are still limited. This paper, therefore, revisits quantum visual encoding strategies, the initial step in quantum machine learning. Investigating the root cause, we uncover that the existing quantum encoding design fails to ensure information preservation of the visual features after the enc…

    Submitted 20 August, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

    Comments: Accepted to Quantum Machine Intelligence

  28. arXiv:2405.19722  [pdf, other]

    cs.CV

    QClusformer: A Quantum Transformer-based Framework for Unsupervised Visual Clustering

    Authors: Xuan-Bac Nguyen, Hoang-Quan Nguyen, Samuel Yen-Chi Chen, Samee U. Khan, Hugh Churchill, Khoa Luu

    Abstract: Unsupervised vision clustering, a cornerstone in computer vision, has been studied for decades, yielding significant outcomes across numerous vision tasks. However, these algorithms involve substantial computational demands when confronted with vast amounts of unlabeled data. Conversely, quantum computing holds promise in expediting unsupervised algorithms when handling large-scale databases. In t…

    Submitted 7 August, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

  29. arXiv:2405.18808  [pdf, other]

    cs.CV

    BRACTIVE: A Brain Activation Approach to Human Visual Brain Learning

    Authors: Xuan-Bac Nguyen, Hojin Jang, Xin Li, Samee U. Khan, Pawan Sinha, Khoa Luu

    Abstract: The human brain is a highly efficient processing unit, and understanding how it works can inspire new algorithms and architectures in machine learning. In this work, we introduce a novel framework named Brain Activation Network (BRACTIVE), a transformer-based approach to studying the human visual brain. The main objective of BRACTIVE is to align the visual features of subjects with corresponding b…

    Submitted 26 November, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

  30. arXiv:2405.03660  [pdf, other]

    cs.CV

    CICA: Content-Injected Contrastive Alignment for Zero-Shot Document Image Classification

    Authors: Sankalp Sinha, Muhammad Saif Ullah Khan, Talha Uddin Sheikh, Didier Stricker, Muhammad Zeshan Afzal

    Abstract: Zero-shot learning has been extensively investigated in the broader field of visual recognition, attracting significant interest recently. However, the current work on zero-shot learning in document image classification remains scarce. The existing studies either focus exclusively on zero-shot inference, or their evaluation does not align with the established criteria of zero-shot evaluation in th…

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: 18 Pages, 4 Figures and Accepted in ICDAR 2024

  31. arXiv:2404.03363  [pdf]

    cs.RO

    Space Physiology and Technology: Musculoskeletal Adaptations, Countermeasures, and Opportunities for Wearable Systems

    Authors: Shamas Ul Ebad Khan, Rejin John Varghese, Panagiotis Kassanos, Dario Farina, Etienne Burdet

    Abstract: Space poses significant challenges for humans, leading to physiological adaptations in response to an environment vastly different from Earth. A comprehensive understanding of these physiological adaptations is needed to devise effective countermeasures to support human life in space. This narrative review first focuses on the impact of the environment in space on the musculoskeletal system. It hi…

    Submitted 6 January, 2025; v1 submitted 4 April, 2024; originally announced April 2024.

    Comments: 50 pages (including references), 8 figures, 2 tables and 297 references

  32. arXiv:2403.06904  [pdf, other]

    cs.CV

    Human Pose Descriptions and Subject-Focused Attention for Improved Zero-Shot Transfer in Human-Centric Classification Tasks

    Authors: Muhammad Saif Ullah Khan, Muhammad Ferjad Naeem, Federico Tombari, Luc Van Gool, Didier Stricker, Muhammad Zeshan Afzal

    Abstract: We present a novel LLM-based pipeline for creating contextual descriptions of human body poses in images using only auxiliary attributes. This approach facilitates the creation of the MPII Pose Descriptions dataset, which includes natural language annotations for 17,367 images containing people engaged in 410 distinct activities. We demonstrate the effectiveness of our pose descriptions in enablin…

    Submitted 28 October, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

  33. IndicLLMSuite: A Blueprint for Creating Pre-training and Fine-Tuning Datasets for Indian Languages

    Authors: Mohammed Safi Ur Rahman Khan, Priyam Mehta, Ananth Sankar, Umashankar Kumaravelan, Sumanth Doddapaneni, Suriyaprasaad B, Varun Balan G, Sparsh Jain, Anoop Kunchukuttan, Pratyush Kumar, Raj Dabre, Mitesh M. Khapra

    Abstract: Despite the considerable advancements in English LLMs, the progress in building comparable models for other languages has been hindered due to the scarcity of tailored resources. Our work aims to bridge this divide by introducing an expansive suite of resources specifically designed for the development of Indic LLMs, covering 22 languages, containing a total of 251B tokens and 74.8M instruction-re…

    Submitted 28 November, 2024; v1 submitted 10 March, 2024; originally announced March 2024.

    Comments: ACL-2024 Outstanding Paper

  34. arXiv:2402.04326  [pdf, other]

    cs.HC cs.LG eess.SP

    Personality Trait Recognition using ECG Spectrograms and Deep Learning

    Authors: Muhammad Mohsin Altaf, Saadat Ullah Khan, Muhammad Majd, Syed Muhammad Anwar

    Abstract: This paper presents an innovative approach to recognizing personality traits using deep learning (DL) methods applied to electrocardiogram (ECG) signals. Within the framework of detecting the big five personality traits model encompassing extraversion, neuroticism, agreeableness, conscientiousness, and openness, the research explores the potential of ECG-derived spectrograms as informative featur…

    Submitted 6 February, 2024; originally announced February 2024.

  35. arXiv:2401.15006  [pdf, other]

    cs.CL cs.AI

    Airavata: Introducing Hindi Instruction-tuned LLM

    Authors: Jay Gala, Thanmay Jayakumar, Jaavid Aktar Husain, Aswanth Kumar M, Mohammed Safi Ur Rahman Khan, Diptesh Kanojia, Ratish Puduppully, Mitesh M. Khapra, Raj Dabre, Rudra Murthy, Anoop Kunchukuttan

    Abstract: We announce the initial release of "Airavata," an instruction-tuned LLM for Hindi. Airavata was created by fine-tuning OpenHathi with diverse, instruction-tuning Hindi datasets to make it better suited for assistive tasks. Along with the model, we also share the IndicInstruct dataset, which is a collection of diverse instruction-tuning datasets to enable further research for Indic LLMs. Additional…

    Submitted 26 February, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

    Comments: Work in progress

  36. arXiv:2312.00236  [pdf, other]

    cs.CV

    Brainformer: Mimic Human Visual Brain Functions to Machine Vision Models via fMRI

    Authors: Xuan-Bac Nguyen, Xin Li, Pawan Sinha, Samee U. Khan, Khoa Luu

    Abstract: Human perception plays a vital role in forming beliefs and understanding reality. A deeper understanding of brain functionality will lead to the development of novel deep neural networks. In this work, we introduce a novel framework named Brainformer, a straightforward yet effective Transformer-based framework, to analyze Functional Magnetic Resonance Imaging (fMRI) patterns in the human perceptio…

    Submitted 26 November, 2024; v1 submitted 30 November, 2023; originally announced December 2023.

  37. arXiv:2309.09907  [pdf, other]

    quant-ph cs.CV

    Quantum Vision Clustering

    Authors: Xuan Bac Nguyen, Hugh Churchill, Khoa Luu, Samee U. Khan

    Abstract: Unsupervised visual clustering has garnered significant attention in recent times, aiming to characterize distributions of unlabeled visual images through clustering based on a parameterized appearance approach. Alternatively, clustering algorithms can be viewed as assignment problems, often characterized as NP-hard, yet precisely solvable for small instances on contemporary hardware. Adiabatic qu…

    Submitted 17 February, 2025; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: arXiv admin note: text overlap with arXiv:2202.08837 by other authors

  38. arXiv:2306.09613  [pdf, other]

    cs.CV

    UTOPIA: Unconstrained Tracking Objects without Preliminary Examination via Cross-Domain Adaptation

    Authors: Pha Nguyen, Kha Gia Quach, John Gauch, Samee U. Khan, Bhiksha Raj, Khoa Luu

    Abstract: Multiple Object Tracking (MOT) aims to find bounding boxes and identities of targeted objects in consecutive video frames. While fully-supervised MOT methods have achieved high accuracy on existing datasets, they cannot generalize well on a newly obtained dataset or a new unseen domain. In this work, we first address the MOT problem from the cross-domain point of view, imitating the process of new…

    Submitted 16 June, 2023; originally announced June 2023.

  39. arXiv:2304.06036  [pdf, other]

    eess.SP cs.HC

    Upper Limb Movement Execution Classification using Electroencephalography for Brain Computer Interface

    Authors: Saadat Ullah Khan, Muhammad Majid, Syed Muhammad Anwar

    Abstract: Accurate classification of upper limb movements using electroencephalography (EEG) signals has gained significant importance in recent years due to the prevalence of brain-computer interfaces. The upper limbs in the human body are crucial since different skeletal segments combine to make a range of motion that helps us in our trivial daily tasks. Decoding EEG-based upper limb movements can be o…

    Submitted 1 April, 2023; originally announced April 2023.

  40. arXiv:2211.08350  [pdf, other]

    cs.HC cs.LG eess.SP q-bio.NC

    Motor imagery classification using EEG spectrograms

    Authors: Saadat Ullah Khan, Muhammad Majid, Syed Muhammad Anwar

    Abstract: The loss of limb motion arising from damage to the spinal cord is a disability that could affect people while performing their day-to-day activities. The restoration of limb movement would enable people with spinal cord injury to interact with their environment more naturally and this is where a brain-computer interface (BCI) system could be beneficial. The detection of limb movement imagination (…

    Submitted 15 November, 2022; originally announced November 2022.

    Comments: Submitted to ISBI 2023

  41. arXiv:2205.15948  [pdf, other]

    cs.CV cs.AI

    Two-Dimensional Quantum Material Identification via Self-Attention and Soft-labeling in Deep Learning

    Authors: Xuan Bac Nguyen, Apoorva Bisht, Ben Thompson, Hugh Churchill, Khoa Luu, Samee U. Khan

    Abstract: In the quantum machine field, detecting two-dimensional (2D) materials in silicon chips is one of the most critical problems. Instance segmentation can be considered a potential approach to solving this problem. However, like other deep learning methods, instance segmentation requires a large-scale training dataset and high-quality annotation in order to achieve considerable performance.…

    Submitted 18 September, 2023; v1 submitted 31 May, 2022; originally announced May 2022.

  42. arXiv:2104.12203  [pdf]

    cs.CV

    A novel segmentation dataset for signatures on bank checks

    Authors: Muhammad Saif Ullah Khan

    Abstract: The dataset presented provides high-resolution images of real, filled-out bank checks containing various complex backgrounds and handwritten text and signatures in the respective fields, along with both pixel-level and patch-level segmentation masks for the signatures on the checks. The images of bank checks were obtained from different sources, including other publicly available check datasets,…

    Submitted 28 April, 2021; v1 submitted 25 April, 2021; originally announced April 2021.

  43. arXiv:1909.12183  [pdf, other]

    cs.ET quant-ph

    K-Means Clustering on Noisy Intermediate Scale Quantum Computers

    Authors: Sumsam Ullah Khan, Ahsan Javed Awan, Gemma Vall-Llosera

    Abstract: Real-time clustering of big performance data generated by telecommunication networks requires domain-specific high-performance compute infrastructure to detect anomalies. In this paper, we evaluate noisy intermediate-scale quantum (NISQ) computers, characterized by low decoherence times, for K-means clustering and propose three strategies to generate shorter-depth quantum circuits needed to ove…

    Submitted 26 September, 2019; originally announced September 2019.

  44. arXiv:1904.01897  [pdf, other]

    cs.SI

    affinity: A System for Latent User Similarity Comparison on Texting Data

    Authors: Tobias Eichinger, Felix Beierle, Sumsam Ullah Khan, Robin Middelanis, Veeraraghavan Sekar, Sam Tabibzadeh

    Abstract: In the field of social networking services, finding similar users based on profile data is common practice. Smartphones harbor sensor and personal context data that can be used for user profiling. Yet one vast source of personal data, namely text messaging data, has hardly been studied for user profiling. We see three reasons for this: First, private text messaging data is not shared due to thei…

    Submitted 3 April, 2019; originally announced April 2019.

  45. arXiv:1701.08474  [pdf, ps, other]

    cs.DC

    IFCIoT: Integrated Fog Cloud IoT Architectural Paradigm for Future Internet of Things

    Authors: Arslan Munir, Prasanna Kansakar, Samee U. Khan

    Abstract: We propose a novel integrated fog cloud IoT (IFCIoT) architectural paradigm that promises increased performance, energy efficiency, reduced latency, quicker response time, scalability, and better localized accuracy for future IoT applications. The fog nodes (e.g., edge servers, smart routers, base stations) receive computation offloading requests and sensed data from various IoT devices. To enhanc…

    Submitted 29 January, 2017; originally announced January 2017.

    Comments: 9 pages, 3 figures, accepted for publication in IEEE Consumer Electronics Magazine, July 2017 issue

  46. arXiv:1412.8339  [pdf]

    cs.CY cs.DB cs.NI

    Big Data Privacy in the Internet of Things Era

    Authors: Charith Perera, Rajiv Ranjan, Lizhe Wang, Samee U. Khan, Albert Y. Zomaya

    Abstract: Over the last few years, we have seen a plethora of Internet of Things (IoT) solutions, products, and services making their way into the industry's marketplace. All such solutions will capture a large amount of data pertaining to the environment, as well as their users. The objective of the IoT is to learn more and to better serve the system users. Some of these solutions may store the data locall…

    Submitted 8 June, 2015; v1 submitted 29 December, 2014; originally announced December 2014.

    Comments: Accepted to be published in IEEE IT Professional Magazine: Special Issue Internet of Anything 2015

  47. arXiv:1312.6170  [pdf]

    cs.DC

    An Overview of the Commercial Cloud Monitoring Tools: Research Dimensions, Design Issues, and State-of-the-Art

    Authors: Khalid Alhamazani, Rajiv Ranjan, Karan Mitra, Fethi Rabhi, Samee Ullah Khan, Adnene Guabtni, Vasudha Bhatnagar

    Abstract: Cloud monitoring involves dynamically tracking the Quality of Service (QoS) parameters related to virtualized resources (e.g., VM, storage, network, appliances), the physical resources they share, the applications running on them, and the data hosted on them. Application and resource configuration in a cloud computing environment is quite challenging considering a large number of heterog…

    Submitted 20 December, 2013; originally announced December 2013.

  48. arXiv:1206.6207  [pdf]

    cs.DC

    An Optimal Fully Distributed Algorithm to Minimize the Resource Consumption of Cloud Applications

    Authors: Nikos Tziritas, Samee Ullah Khan, Cheng-Zhong Xu, Jue Hong

    Abstract: Under the pay-per-use model adopted in clouds, the more resources an application running in a cloud computing environment consumes, the more its owner is charged. Therefore, applying intelligent solutions to minimize resource consumption is of great importance. Because centralized solutions are deemed unsuitable for lar…

    Submitted 27 June, 2012; originally announced June 2012.