Showing 1–50 of 160 results for author: Khan, M

Searching in archive eess.
  1. arXiv:2507.07585  [pdf, ps, other]

    cs.CV eess.IV

    HOTA: Hierarchical Overlap-Tiling Aggregation for Large-Area 3D Flood Mapping

    Authors: Wenfeng Jia, Bin Liang, Yuxi Lu, Attavit Wilaiwongsakul, Muhammad Arif Khan, Lihong Zheng

    Abstract: Floods are among the most frequent natural hazards and cause significant social and economic damage. Timely, large-scale information on flood extent and depth is essential for disaster response; however, existing products often trade spatial detail for coverage or ignore flood depth altogether. To bridge this gap, this work presents HOTA: Hierarchical Overlap-Tiling Aggregation, a plug-and-play, m…

    Submitted 10 July, 2025; originally announced July 2025.

  2. arXiv:2506.23368  [pdf]

    eess.SP

    Optimizing Solar Energy Production in the USA: Time-Series Analysis Using AI for Smart Energy Management

    Authors: Istiaq Ahmed, Md Asif Ul Hoq Khan, MD Zahedul Islam, Md Sakibul Hasan, Tanaya Jakir, Arat Hossain, Joynal Abed, Muhammad Hasanuzzaman, Sadia Sharmeen Shatyi, Kazi Nehal Hasnain

    Abstract: As the US rapidly moves towards cleaner energy sources, solar energy is fast becoming the pillar of its renewable energy mix. Even while solar energy is increasingly being used, its variability is a key hindrance to grid stability, storage efficiency, and system stability overall. Solar energy has emerged as one of the fastest-growing renewable energy sources in the United States, adding noticeabl…

    Submitted 29 June, 2025; originally announced June 2025.

  3. arXiv:2506.07685  [pdf]

    eess.SP

    CommSense: A Rapid and Accurate ISAC Paradigm

    Authors: Sandip Jana, Amit Kumar Mishra, Mohammed Zafar Ali Khan

    Abstract: Future 6G networks are envisioned to blur the line between communication and sensing, leveraging ubiquitous OFDM waveforms for both high-throughput data and environmental awareness. In this work, we present a thorough analysis of the Communication based Sensing (CommSense) framework, which embeds lightweight, PCA-based detectors into standard OFDM receivers, enabling real-time, device-free detection of passive sc…

    Submitted 9 June, 2025; originally announced June 2025.

  4. arXiv:2505.20810  [pdf, other]

    eess.IV cs.CV

    The Role of AI in Early Detection of Life-Threatening Diseases: A Retinal Imaging Perspective

    Authors: Tariq M Khan, Toufique Ahmed Soomro, Imran Razzak

    Abstract: Retinal imaging has emerged as a powerful, non-invasive modality for detecting and quantifying biomarkers of systemic diseases, ranging from diabetes and hypertension to Alzheimer's disease and cardiovascular disorders, but current insights remain dispersed across platforms and specialties. Recent technological advances in optical coherence tomography (OCT/OCTA) and adaptive optics (AO) now deliver…

    Submitted 27 May, 2025; originally announced May 2025.

  5. arXiv:2505.14723  [pdf, other]

    eess.AS cs.AI cs.CL cs.LG cs.SD

    QUADS: QUAntized Distillation Framework for Efficient Speech Language Understanding

    Authors: Subrata Biswas, Mohammad Nur Hossain Khan, Bashima Islam

    Abstract: Spoken Language Understanding (SLU) systems must balance performance and efficiency, particularly in resource-constrained environments. Existing methods apply distillation and quantization separately, leading to suboptimal compression as distillation ignores quantization constraints. We propose QUADS, a unified framework that optimizes both through multi-stage training with a pre-tuned model, enha…

    Submitted 19 May, 2025; originally announced May 2025.

    Journal ref: INTERSPEECH, 2025

  6. arXiv:2505.04318  [pdf, other]

    cs.LG cs.AI eess.IV

    Detecting Concept Drift in Neural Networks Using Chi-squared Goodness of Fit Testing

    Authors: Jacob Glenn Ayers, Buvaneswari A. Ramanan, Manzoor A. Khan

    Abstract: As the adoption of deep learning models has grown beyond human capacity for verification, meta-algorithms are needed to ensure reliable model inference. Concept drift detection is a field dedicated to identifying statistical shifts that is underutilized in monitoring neural networks that may encounter inference data with distributional characteristics diverging from their training data. Given the…

    Submitted 7 May, 2025; originally announced May 2025.

    Comments: 8 pages, 6 figures, 1 table

  7. arXiv:2505.01831  [pdf, other]

    eess.IV cs.CV

    Multi-Scale Target-Aware Representation Learning for Fundus Image Enhancement

    Authors: Haofan Wu, Yin Huang, Yuqing Wu, Qiuyu Yang, Bingfang Wang, Li Zhang, Muhammad Fahadullah Khan, Ali Zia, M. Saleh Memon, Syed Sohail Bukhari, Abdul Fattah Memon, Daizong Ji, Ya Zhang, Ghulam Mustafa, Yin Fang

    Abstract: High-quality fundus images provide essential anatomical information for clinical screening and ophthalmic disease diagnosis. Yet, due to hardware limitations, operational variability, and patient compliance, fundus images often suffer from low resolution and signal-to-noise ratio. Recent years have witnessed promising progress in fundus image enhancement. However, existing works usually focus on r…

    Submitted 3 May, 2025; originally announced May 2025.

    Comments: Under review at Neural Networks

  8. arXiv:2505.00839  [pdf, other]

    cs.SD cs.SI eess.AS

    SMSAT: A Multimodal Acoustic Dataset and Deep Contrastive Learning Framework for Affective and Physiological Modeling of Spiritual Meditation

    Authors: Ahmad Suleman, Yazeed Alkhrijah, Misha Urooj Khan, Hareem Khan, Muhammad Abdullah Husnain Ali Faiz, Mohamad A. Alawad, Zeeshan Kaleem, Guan Gui

    Abstract: Understanding how auditory stimuli influence emotional and physiological states is fundamental to advancing affective computing and mental health technologies. In this paper, we present a multimodal evaluation of the affective and physiological impacts of three auditory conditions, that is, spiritual meditation (SM), music (M), and natural silence (NS), using a comprehensive suite of biometric sig…

    Submitted 1 May, 2025; originally announced May 2025.

  9. arXiv:2504.10695  [pdf, other]

    eess.SP

    Neyman-Pearson Detector for Ambient Backscatter Zero-Energy-Devices Beacons

    Authors: Shanglin Yang, Jean-Marie Gorce, Muhammad Jehangir Khan, Dinh-Thuy Phan-Huy, Guillaume Villemaud

    Abstract: Recently, a novel ultra-low power indoor wireless positioning system has been proposed. In this system, Zero-Energy-Devices (ZED) beacons are deployed in indoor environments, and located on a map with unique broadcast identifiers. They harvest ambient energy to power themselves and backscatter ambient waves from cellular networks to send their identifiers. This paper presents a novel detection met…

    Submitted 28 April, 2025; v1 submitted 14 April, 2025; originally announced April 2025.

    Comments: This paper is accepted by European Conference on Networks and Communications 2025

  10. arXiv:2503.17275  [pdf, other]

    eess.IV cs.CV eess.SP

    Vision Transformer Based Semantic Communications for Next Generation Wireless Networks

    Authors: Muhammad Ahmed Mohsin, Muhammad Jazib, Zeeshan Alam, Muhmmad Farhan Khan, Muhammad Saad, Muhammad Ali Jamshed

    Abstract: In the evolving landscape of 6G networks, semantic communications are poised to revolutionize data transmission by prioritizing the transmission of semantic meaning over raw data accuracy. This paper presents a Vision Transformer (ViT)-based semantic communication framework that has been deliberately designed to achieve high semantic similarity during image transmission while simultaneously minimi…

    Submitted 21 March, 2025; originally announced March 2025.

    Comments: Accepted @ ICC 2025

  11. arXiv:2503.01916  [pdf, other]

    quant-ph cs.CV cs.RO eess.IV

    QDCNN: Quantum Deep Learning for Enhancing Safety and Reliability in Autonomous Transportation Systems

    Authors: Ashtakala Meghanath, Subham Das, Bikash K. Behera, Muhammad Attique Khan, Saif Al-Kuwari, Ahmed Farouk

    Abstract: In transportation cyber-physical systems (CPS), ensuring safety and reliability in real-time decision-making is essential for successfully deploying autonomous vehicles and intelligent transportation networks. However, these systems face significant challenges, such as computational complexity and the ability to handle ambiguous inputs like shadows in complex environments. This paper introduces a…

    Submitted 1 March, 2025; originally announced March 2025.

    Comments: 11 Pages, 7 Figures, 4 Tables

  12. arXiv:2502.02782  [pdf, other]

    eess.SP

    A Comprehensive Survey on Feature Extraction Techniques Using I/Q Imbalance in RFFI

    Authors: Muhammad Aqib Khan, Muhammad Usman Siddiqui

    Abstract: The proliferation of Internet of Things (IoT) devices has increased the need for secure authentication. While traditional encryption-based solutions can be robust, they often impose high computational and energy overhead on resource-limited IoT nodes. As an alternative, radio frequency fingerprint identification (RFFI) leverages hardware-induced imperfections, such as In-phase/Quadrature (I/Q) imbal…

    Submitted 4 February, 2025; originally announced February 2025.

    Comments: 5 pages, 5 figures

  13. arXiv:2501.13690  [pdf]

    eess.IV cs.CV

    Variational U-Net with Local Alignment for Joint Tumor Extraction and Registration (VALOR-Net) of Breast MRI Data Acquired at Two Different Field Strengths

    Authors: Muhammad Shahkar Khan, Haider Ali, Laura Villazan Garcia, Noor Badshah, Siegfried Trattnig, Florian Schwarzhans, Ramona Woitek, Olgica Zaric

    Abstract: Background: Multiparametric breast MRI data might improve tumor diagnostics, characterization, and treatment planning. Accurate alignment and delineation of images acquired at different field strengths, such as 3T and 7T, remain challenging research tasks. Purpose: To address alignment challenges and enable consistent tumor segmentation across different MRI field strengths. Study type: Retrospectiv…

    Submitted 23 January, 2025; originally announced January 2025.

  14. arXiv:2501.06482  [pdf, other]

    eess.SP

    Deep Reinforcement Learning Optimized Intelligent Resource Allocation in Active RIS-Integrated TN-NTN Networks

    Authors: Muhammad Ahmed Mohsin, Hassan Rizwan, Muhammad Jazib, Muhammad Iqbal, Muhammad Bilal, Tabinda Ashraf, Muhammad Farhan Khan, Jen-Yi Pan

    Abstract: This work explores the deployment of active reconfigurable intelligent surfaces (A-RIS) in integrated terrestrial and non-terrestrial networks (TN-NTN) while utilizing coordinated multipoint non-orthogonal multiple access (CoMP-NOMA). Our system model incorporates a UAV-assisted RIS in coordination with a terrestrial RIS which aims for signal enhancement. We aim to maximize the sum rate for all us…

    Submitted 11 January, 2025; originally announced January 2025.

    Comments: Accepted to WCNC 2025

  15. Low-cost foil/paper based touch mode pressure sensing element as artificial skin module for prosthetic hand

    Authors: Rishabh B. Mishra, Sherjeel M. Khan, Sohail F. Shaikh, Aftab M. Hussain, Muhammad M. Hussain

    Abstract: Capacitive pressure sensors have several advantages in areas such as robotics, automation, aerospace, biomedical and consumer electronics. We present mathematical modelling, finite element analysis (FEA), fabrication and experimental characterization of ultra-low cost and paper-based, touch-mode, flexible capacitive pressure sensor element using Do-It-Yourself (DIY) technology. The pressure sensin…

    Submitted 18 December, 2024; originally announced December 2024.

  16. arXiv:2412.14538  [pdf, other]

    cs.NI cs.AI eess.SP

    Overview of AI and Communication for 6G Network: Fundamentals, Challenges, and Future Research Opportunities

    Authors: Qimei Cui, Xiaohu You, Ni Wei, Guoshun Nan, Xuefei Zhang, Jianhua Zhang, Xinchen Lyu, Ming Ai, Xiaofeng Tao, Zhiyong Feng, Ping Zhang, Qingqing Wu, Meixia Tao, Yongming Huang, Chongwen Huang, Guangyi Liu, Chenghui Peng, Zhiwen Pan, Tao Sun, Dusit Niyato, Tao Chen, Muhammad Khurram Khan, Abbas Jamalipour, Mohsen Guizani, Chau Yuen

    Abstract: With the growing demand for seamless connectivity and intelligent communication, the integration of artificial intelligence (AI) and sixth-generation (6G) communication networks has emerged as a transformative paradigm. By embedding AI capabilities across various network layers, this integration enables optimized resource allocation, improved efficiency, and enhanced system robust performance, par…

    Submitted 13 February, 2025; v1 submitted 19 December, 2024; originally announced December 2024.

    Journal ref: Sci China Inf Sci, 2025, 68(7): 171301

  17. arXiv:2412.07175  [pdf, other]

    eess.SP cs.CV cs.LG

    Robust Feature Engineering Techniques for Designing Efficient Motor Imagery-Based BCI-Systems

    Authors: Syed Saim Gardezi, Soyiba Jawed, Mahnoor Khan, Muneeba Bukhari, Rizwan Ahmed Khan

    Abstract: A multitude of individuals across the globe grapple with motor disabilities. Neural prosthetics utilizing Brain-Computer Interface (BCI) technology exhibit promise for improving motor rehabilitation outcomes. The intricate nature of EEG data poses a significant hurdle for current BCI systems. Recently, a qualitative repository of EEG signals tied to both upper and lower limb execution of motor and…

    Submitted 20 February, 2025; v1 submitted 9 December, 2024; originally announced December 2024.

    Comments: 26 pages

  18. arXiv:2412.05968  [pdf, other]

    eess.IV cs.AI cs.CV

    LVS-Net: A Lightweight Vessels Segmentation Network for Retinal Image Analysis

    Authors: Mehwish Mehmood, Shahzaib Iqbal, Tariq Mahmood Khan, Ivor Spence, Muhammad Fahim

    Abstract: The analysis of retinal images for the diagnosis of various diseases is one of the emerging areas of research. Recently, the research direction has been inclined towards investigating several changes in retinal blood vessels in subjects with many neurological disorders, including dementia. This research focuses on detecting diseases early by improving the performance of models for segmentation of…

    Submitted 8 December, 2024; originally announced December 2024.

  19. arXiv:2412.01456  [pdf, other]

    cs.CV eess.IV

    Phaseformer: Phase-based Attention Mechanism for Underwater Image Restoration and Beyond

    Authors: MD Raqib Khan, Anshul Negi, Ashutosh Kulkarni, Shruti S. Phutke, Santosh Kumar Vipparthi, Subrahmanyam Murala

    Abstract: Quality degradation is observed in underwater images due to the effects of light refraction and absorption by water, leading to issues like color cast, haziness, and limited visibility. This degradation negatively affects the performance of autonomous underwater vehicles used in marine applications. To address these challenges, we propose a lightweight phase-based transformer network with 1.77M pa…

    Submitted 2 December, 2024; originally announced December 2024.

    Comments: 8 pages, 8 figures, conference

  20. arXiv:2411.17556  [pdf, other]

    eess.IV cs.CV

    TAFM-Net: A Novel Approach to Skin Lesion Segmentation Using Transformer Attention and Focal Modulation

    Authors: Tariq M Khan, Dawn Lin, Shahzaib Iqbal, Erik Meijering

    Abstract: Incorporating modern computer vision techniques into clinical protocols shows promise in improving skin lesion segmentation. The U-Net architecture has been a key model in this area, iteratively improved to address challenges arising from the heterogeneity of dermatologic images due to varying clinical settings, lighting, patient attributes, and hair density. To further improve skin lesion segment…

    Submitted 26 November, 2024; originally announced November 2024.

  21. arXiv:2411.15656  [pdf]

    eess.IV cs.CV cs.LG

    Machine-agnostic Automated Lumbar MRI Segmentation using a Cascaded Model Based on Generative Neurons

    Authors: Promit Basak, Rusab Sarmun, Saidul Kabir, Israa Al-Hashimi, Enamul Hoque Bhuiyan, Anwarul Hasan, Muhammad Salman Khan, Muhammad E. H. Chowdhury

    Abstract: Automated lumbar spine segmentation is crucial for modern diagnosis systems. In this study, we introduce a novel machine-agnostic approach for segmenting lumbar vertebrae and intervertebral discs from MRI images, employing a cascaded model that synergizes an ROI detection and a Self-organized Operational Neural Network (Self-ONN)-based encoder-decoder network for segmentation. Addressing the…

    Submitted 23 November, 2024; originally announced November 2024.

    Comments: 19 Pages, 11 Figures, Expert Systems with Applications, 2024

    ACM Class: I.4.6

  22. arXiv:2411.15596  [pdf, other]

    eess.IV cs.CV

    Comparative Analysis of Resource-Efficient CNN Architectures for Brain Tumor Classification

    Authors: Md Ashik Khan, Rafath Bin Zafar Auvee

    Abstract: Accurate brain tumor classification in MRI images is critical for timely diagnosis and treatment planning. While deep learning models like ResNet-18 and VGG-16 have shown high accuracy, they often come with increased complexity and computational demands. This study presents a comparative analysis of an effective yet simple Convolutional Neural Network (CNN) architecture and pre-trained ResNet18, and VGG…

    Submitted 23 December, 2024; v1 submitted 23 November, 2024; originally announced November 2024.

    Comments: A revised and extended version of this paper has been accepted at the 27th International Conference on Computer and Information Technology (ICCIT 2024). It spans 8 pages and includes 6 figures

    MSC Class: I.2.10 Vision and Scene Understanding; I.4.8 Scene Analysis; 92C55 Biomedical imaging and signal processing

  23. arXiv:2411.02614  [pdf, other]

    eess.IV cs.CV

    Divergent Domains, Convergent Grading: Enhancing Generalization in Diabetic Retinopathy Grading

    Authors: Sharon Chokuwa, Muhammad Haris Khan

    Abstract: Diabetic Retinopathy (DR) constitutes 5% of global blindness cases. While numerous deep learning approaches have sought to enhance traditional DR grading methods, they often falter when confronted with new out-of-distribution data, thereby impeding their widespread application. In this study, we introduce a novel deep learning method for achieving domain generalization (DG) in DR grading and make t…

    Submitted 4 November, 2024; originally announced November 2024.

    Comments: Accepted at WACV 2025

  24. arXiv:2411.02449  [pdf]

    eess.IV cs.CV

    Chronic Obstructive Pulmonary Disease Prediction Using Deep Convolutional Network

    Authors: Shahran Rahman Alve, Muhammad Zawad Mahmud, Samiha Islam, Mohammad Monirujjaman Khan

    Abstract: AI and deep learning are two recent innovations that have made a big difference in helping to solve problems in the clinical space. Using clinical imaging and sound examination, they also work on improving their vision so that they can spot diseases early and correctly. Because there aren't enough trained HR, clinical professionals are asking for help with innovation because it helps them adapt to…

    Submitted 22 December, 2024; v1 submitted 3 November, 2024; originally announced November 2024.

    Comments: 16 Pages, 11 Figures

  25. arXiv:2410.21276  [pdf, other]

    cs.CL cs.AI cs.CV cs.CY cs.LG cs.SD eess.AS

    GPT-4o System Card

    Authors: OpenAI, Aaron Hurst, Adam Lerer, Adam P. Goucher, Adam Perelman, Aditya Ramesh, Aidan Clark, AJ Ostrow, Akila Welihinda, Alan Hayes, Alec Radford, Aleksander Mądry, Alex Baker-Whitcomb, Alex Beutel, Alex Borzunov, Alex Carney, Alex Chow, Alex Kirillov, Alex Nichol, Alex Paino, Alex Renzin, Alex Tachard Passos, Alexander Kirillov, Alexi Christakis, et al. (395 additional authors not shown)

    Abstract: GPT-4o is an autoregressive omni model that accepts as input any combination of text, audio, image, and video, and generates any combination of text, audio, and image outputs. It's trained end-to-end across text, vision, and audio, meaning all inputs and outputs are processed by the same neural network. GPT-4o can respond to audio inputs in as little as 232 milliseconds, with an average of 320 mil…

    Submitted 25 October, 2024; originally announced October 2024.

  26. arXiv:2410.20395  [pdf, other]

    cs.CV eess.IV

    Depth Attention for Robust RGB Tracking

    Authors: Yu Liu, Arif Mahmood, Muhammad Haris Khan

    Abstract: RGB video object tracking is a fundamental task in computer vision. Its effectiveness can be improved using depth information, particularly for handling motion-blurred targets. However, depth information is often missing in commonly used tracking benchmarks. In this work, we propose a new framework that leverages monocular depth estimation to counter the challenges of tracking targets that are out…

    Submitted 27 October, 2024; originally announced October 2024.

    Comments: Oral Acceptance at the Asian Conference on Computer Vision (ACCV) 2024, Hanoi, Vietnam

  27. arXiv:2410.19772  [pdf, other]

    eess.SP eess.AS

    A Novel Numerical Method for Relaxing the Minimal Configurations of TOA-Based Joint Sensors and Sources Localization

    Authors: Faxian Cao, Yongqiang Cheng, Adil Mehmood Khan, Zhijing Yang, Yingxiu Chang

    Abstract: This work introduces a novel numerical method that relaxes the minimal configuration requirements for joint sensors and sources localization (JSSL) in 3D space using time of arrival (TOA) measurements. Traditionally, the principle requires that the number of valid equations (TOA measurements) must be equal to or greater than the number of unknown variables (sensor and source locations). State-of-t…

    Submitted 13 October, 2024; originally announced October 2024.

    Comments: 13 pages, 6 figures

  28. arXiv:2410.18366  [pdf]

    eess.IV

    Cochlear Implantation of Slim Pre-curved Arrays using Automatic Pre-operative Insertion Plans

    Authors: Kareem O. Tawfik, Mohammad M. R. Khan, Ankita Patro, Miriam R. Smetak, David Haynes, Robert F. Labadie, René H. Gifford, Jack H. Noble

    Abstract: Hypothesis: Pre-operative cochlear implant (CI) electrode array (EL) insertion plans created by automated image analysis methods can improve positioning of slim pre-curved EL. Background: This study represents the first evaluation of a system for patient-customized EL insertion planning for a slim pre-curved EL. Methods: Twenty-one temporal bone specimens were divided into experimental and con…

    Submitted 23 October, 2024; originally announced October 2024.

    Comments: First two listed authors are co-first authors

  29. Multi-modal Medical Image Fusion For Non-Small Cell Lung Cancer Classification

    Authors: Salma Hassan, Hamad Al Hammadi, Ibrahim Mohammed, Muhammad Haris Khan

    Abstract: The early detection and nuanced subtype classification of non-small cell lung cancer (NSCLC), a predominant cause of cancer mortality worldwide, is a critical and complex issue. In this paper, we introduce an innovative integration of multi-modal data, synthesizing fused medical imaging (CT and PET scans) with clinical health records and genomic data. This unique fusion methodology leverages advan…

    Submitted 27 September, 2024; originally announced September 2024.

  30. arXiv:2409.06742  [pdf, other]

    eess.IV

    Stain Normalization of Hematology Slides using Neural Color Transfer

    Authors: M. Muneeb Arshad, Hasan Sajid, M. Jawad Khan

    Abstract: Deep learning is popularly used for analyzing pathology images, but variations in image properties can limit the effectiveness of the models. The study aims to develop a method that transfers the variability present in the training set to unseen images, improving the model's ability to make accurate inferences. YOLOv5 was trained on peripheral blood and bone marrow sample images and Neural Color T…

    Submitted 10 September, 2024; originally announced September 2024.

  31. arXiv:2409.06018  [pdf, other]

    eess.IV cs.CV

    Pioneering Precision in Lumbar Spine MRI Segmentation with Advanced Deep Learning and Data Enhancement

    Authors: Istiak Ahmed, Md. Tanzim Hossain, Md. Zahirul Islam Nahid, Kazi Shahriar Sanjid, Md. Shakib Shahariar Junayed, M. Monir Uddin, Mohammad Monirujjaman Khan

    Abstract: This study presents an advanced approach to lumbar spine segmentation using deep learning techniques, focusing on addressing key challenges such as class imbalance and data preprocessing. Magnetic resonance imaging (MRI) scans of patients with low back pain are meticulously preprocessed to accurately represent three critical classes: vertebrae, spinal canal, and intervertebral discs (IVDs). By rec…

    Submitted 9 September, 2024; originally announced September 2024.

  32. arXiv:2409.03367  [pdf, other]

    eess.IV cs.CV

    TBConvL-Net: A Hybrid Deep Learning Architecture for Robust Medical Image Segmentation

    Authors: Shahzaib Iqbal, Tariq M. Khan, Syed S. Naqvi, Asim Naveed, Erik Meijering

    Abstract: Deep learning has shown great potential for automated medical image segmentation to improve the precision and speed of disease diagnostics. However, the task presents significant difficulties due to variations in the scale, shape, texture, and contrast of the pathologies. Traditional convolutional neural network (CNN) models have certain limitations when it comes to effectively modelling multiscal…

    Submitted 5 September, 2024; originally announced September 2024.

  33. arXiv:2408.12323  [pdf, other]

    eess.IV cs.CV

    EUIS-Net: A Convolutional Neural Network for Efficient Ultrasound Image Segmentation

    Authors: Shahzaib Iqbal, Hasnat Ahmed, Muhammad Sharif, Madiha Hena, Tariq M. Khan, Imran Razzak

    Abstract: Segmenting ultrasound images is critical for various medical applications, but it poses significant challenges due to ultrasound images' inherent noise and unpredictability. To address these challenges, we propose EUIS-Net, a CNN network designed to segment ultrasound images efficiently and precisely. The proposed EUIS-Net utilises four encoder-decoder blocks, resulting in a notable decrease in…

    Submitted 22 August, 2024; originally announced August 2024.

  34. arXiv:2408.09687  [pdf, other]

    eess.IV cs.CV

    TESL-Net: A Transformer-Enhanced CNN for Accurate Skin Lesion Segmentation

    Authors: Shahzaib Iqbal, Muhammad Zeeshan, Mehwish Mehmood, Tariq M. Khan, Imran Razzak

    Abstract: Early detection of skin cancer relies on precise segmentation of dermoscopic images of skin lesions. However, this task is challenging due to the irregular shape of the lesion, the lack of sharp borders, and the presence of artefacts such as marker colours and hair follicles. Recent methods for melanoma segmentation are U-Nets and fully connected networks (FCNs). As the depth of these neural netwo…

    Submitted 18 August, 2024; originally announced August 2024.

  35. arXiv:2408.07925  [pdf]

    cs.LG eess.SP

    A Single Channel-Based Neonatal Sleep-Wake Classification using Hjorth Parameters and Improved Gradient Boosting

    Authors: Muhammad Arslan, Muhammad Mubeen, Saadullah Farooq Abbasi, Muhammad Shahbaz Khan, Wadii Boulila, Jawad Ahmad

    Abstract: Sleep plays a crucial role in neonatal development. Monitoring the sleep patterns in neonates in a Neonatal Intensive Care Unit (NICU) is imperative for understanding the maturation process. While polysomnography (PSG) is considered the best practice for sleep classification, its expense and reliance on human annotation pose challenges. Existing research often relies on multichannel EEG signals; h…

    Submitted 15 August, 2024; originally announced August 2024.

    Comments: 8 pages, 5 figures, 3 tables, International Polydisciplinary Conference on Artificial Intelligence and New Technologies

  36. arXiv:2408.04773  [pdf, other]

    cs.SD eess.AS

    Exploiting Consistency-Preserving Loss and Perceptual Contrast Stretching to Boost SSL-based Speech Enhancement

    Authors: Muhammad Salman Khan, Moreno La Quatra, Kuo-Hsuan Hung, Szu-Wei Fu, Sabato Marco Siniscalchi, Yu Tsao

    Abstract: Self-supervised representation learning (SSL) has attained SOTA results on several downstream speech tasks, but SSL-based speech enhancement (SE) solutions still lag behind. To address this issue, we exploit three main ideas: (i) Transformer-based masking generation, (ii) consistency-preserving loss, and (iii) perceptual contrast stretching (PCS). In detail, conformer layers, leveraging an attenti…

    Submitted 8 August, 2024; originally announced August 2024.

  37. arXiv:2408.02359  [pdf, other]

    eess.SP

    Blind User Activity Detection for Grant-Free Random Access in Cell-Free mMIMO Networks

    Authors: Muhammad Usman Khan, Enrico Testi, Marco Chiani, Enrico Paolini

    Abstract: Cell-free massive MIMO (CF-mMIMO) networks have recently emerged as a promising solution to tackle the challenges arising from next-generation massive machine-type communications. In this paper, a fully grant-free deep learning (DL)-based method for user activity detection in CF-mMIMO networks is proposed. Initially, the known non-orthogonal pilot sequences are used to estimate the channel coeffic…

    Submitted 5 August, 2024; originally announced August 2024.

    Comments: Accepted for publication at IEEE RTSI 2024, Lecco, Italy

  38. Spatial and Spatial-Spectral Morphological Mamba for Hyperspectral Image Classification

    Authors: Muhammad Ahmad, Muhammad Hassaan Farooq Butt, Adil Mehmood Khan, Manuel Mazzara, Salvatore Distefano, Muhammad Usama, Swalpa Kumar Roy, Jocelyn Chanussot, Danfeng Hong

    Abstract: Recent advancements in transformers, specifically self-attention mechanisms, have significantly improved hyperspectral image (HSI) classification. However, these models often suffer from inefficiencies, as their computational complexity scales quadratically with sequence length. To address these challenges, we propose the morphological spatial mamba (SMM) and morphological spatial-spectral Mamba (…

    Submitted 30 November, 2024; v1 submitted 2 August, 2024; originally announced August 2024.

  39. arXiv:2407.19570  [pdf, other]

    eess.SY

    A Baseline Approach for Modeling and Characterization of Commercial Off-The-Shelf (COTS) Droop Controlled Converter

    Authors: Muhammad Anees, Lisa Qi, Mehnaz Khan, Srdjan Lukic

    Abstract: Due to advancements in power electronics, new converter topologies are introduced day by day. It's hard to get an equivalent model from any manufacturer of any Commercial Off-The-Shelf (COTS) power electronics converters because of intellectual property (IP) and safety concerns. Most COTS products don't reveal the exact topology of the converter as well as the control architecture and correspondin…

    Submitted 17 September, 2024; v1 submitted 28 July, 2024; originally announced July 2024.

  40. arXiv:2407.08813  [pdf, other]

    eess.IV cs.AI cs.CV

    FairDomain: Achieving Fairness in Cross-Domain Medical Image Segmentation and Classification

    Authors: Yu Tian, Congcong Wen, Min Shi, Muhammad Muneeb Afzal, Hao Huang, Muhammad Osama Khan, Yan Luo, Yi Fang, Mengyu Wang

    Abstract: Addressing fairness in artificial intelligence (AI), particularly in medical AI, is crucial for ensuring equitable healthcare outcomes. Recent efforts to enhance fairness have introduced new methodologies and datasets in medical AI. However, the fairness issue under the setting of domain transfer is almost unexplored, while it is common that clinics rely on different imaging technologies (e.g., di…

    Submitted 18 July, 2024; v1 submitted 11 July, 2024; originally announced July 2024.

    Comments: ECCV 2024; Codes and datasets are available at https://github.com/Harvard-Ophthalmology-AI-Lab/FairDomain

  41. arXiv:2407.02871  [pdf, other]

    eess.IV cs.CV

    LMBF-Net: A Lightweight Multipath Bidirectional Focal Attention Network for Multifeatures Segmentation

    Authors: Tariq M Khan, Shahzaib Iqbal, Syed S. Naqvi, Imran Razzak, Erik Meijering

    Abstract: Retinal diseases can cause irreversible vision loss in both eyes if not diagnosed and treated early. Since retinal diseases are so complicated, retinal imaging is likely to show two or more abnormalities. Current deep learning techniques for segmenting retinal images with many labels and attributes have poor detection accuracy and generalisability. This paper presents a multipath convolutional neu…

    Submitted 3 July, 2024; originally announced July 2024.

  42. arXiv:2406.17190  [pdf, other]

    cs.SD cs.LG eess.AS

    Sound Tagging in Infant-centric Home Soundscapes

    Authors: Mohammad Nur Hossain Khan, Jialu Li, Nancy L. McElwain, Mark Hasegawa-Johnson, Bashima Islam

    Abstract: Certain environmental noises have been associated with negative developmental outcomes for infants and young children. Though classifying or tagging sound events in a domestic environment is an active research area, previous studies focused on data collected from a non-stationary microphone placed in the environment or from the perspective of adults. Further, many of these works ignore infants or…

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: Accepted in IEEE/ACM CHASE 2024

  43. arXiv:2405.17520  [pdf, other]

    eess.IV cs.CV

    Advancing Medical Image Segmentation with Mini-Net: A Lightweight Solution Tailored for Efficient Segmentation of Medical Images

    Authors: Syed Javed, Tariq M. Khan, Abdul Qayyum, Hamid Alinejad-Rokny, Arcot Sowmya, Imran Razzak

    Abstract: Accurate segmentation of anatomical structures and abnormalities in medical images is crucial for computer-aided diagnosis and analysis. While deep learning techniques excel at this task, their computational demands pose challenges. Additionally, some cutting-edge segmentation methods, though effective for general object segmentation, may not be optimised for medical images. To address these issue…

    Submitted 20 September, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

  44. arXiv:2404.15337  [pdf, other]

    eess.SP cs.LG cs.NI

    RSSI Estimation for Constrained Indoor Wireless Networks using ANN

    Authors: Samrah Arif, M. Arif Khan, Sabih Ur Rehman

    Abstract: In the expanding field of the Internet of Things (IoT), wireless channel estimation is a significant challenge. This is specifically true for low-power IoT (LP-IoT) communication, where efficiency and accuracy are extremely important. This research establishes two distinct LP-IoT wireless channel estimation models using Artificial Neural Networks (ANN): a Feature-based ANN model and a Sequence-bas…

    Submitted 9 April, 2024; originally announced April 2024.

  45. arXiv:2404.11771  [pdf]

    eess.SY

    IoT-Driven Cloud-based Energy and Environment Monitoring System for Manufacturing Industry

    Authors: Nitol Saha, Md Masruk Aulia, Md. Mostafizur Rahman, Mohammed Shafiul Alam Khan

    Abstract: This research focused on the development of a cost-effective IoT solution for energy and environment monitoring geared towards manufacturing industries. The proposed system is developed using open-source software that can be easily deployed in any manufacturing environment. The system collects real-time temperature, humidity, and energy data from different devices running on different communicatio…

    Submitted 17 April, 2024; originally announced April 2024.

  46. arXiv:2404.09342  [pdf, other]

    cs.CV cs.SD eess.AS

    Face-voice Association in Multilingual Environments (FAME) Challenge 2024 Evaluation Plan

    Authors: Muhammad Saad Saeed, Shah Nawaz, Muhammad Salman Tahir, Rohan Kumar Das, Muhammad Zaigham Zaheer, Marta Moscati, Markus Schedl, Muhammad Haris Khan, Karthik Nandakumar, Muhammad Haroon Yousaf

    Abstract: The advancements of technology have led to the use of multimodal systems in various real-world applications. Among them, the audio-visual systems are one of the widely used multimodal systems. In the recent years, associating face and voice of a person has gained attention due to presence of unique correlation between them. The Face-voice Association in Multilingual Environments (FAME) Challenge 2…

    Submitted 22 July, 2024; v1 submitted 14 April, 2024; originally announced April 2024.

    Comments: ACM Multimedia Conference - Grand Challenge

  47. arXiv:2403.14120  [pdf, other]

    cs.LG cs.AI eess.SP

    Advancing IIoT with Over-the-Air Federated Learning: The Role of Iterative Magnitude Pruning

    Authors: Fazal Muhammad Ali Khan, Hatem Abou-Zeid, Aryan Kaushik, Syed Ali Hassan

    Abstract: The industrial Internet of Things (IIoT) under Industry 4.0 heralds an era of interconnected smart devices where data-driven insights and machine learning (ML) fuse to revolutionize manufacturing. A noteworthy development in IIoT is the integration of federated learning (FL), which addresses data privacy and security among devices. FL enables edge sensors, also known as peripheral intelligence uni…

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 6 pages, 6 figures

  48. arXiv:2403.08099  [pdf, other]

    eess.SY eess.SP

    Application of Distributed Arithmetic to Adaptive Filtering Algorithms: Trends, Challenges and Future

    Authors: Mohd. Tasleem Khan

    Abstract: The utilization of distributed arithmetic (DA) in AF algorithms has gained significant attention in recent years due to its potential to enhance computational efficiency and reduce resource requirements. This paper presents an exploration of the application of DA to adaptive filtering (AF) algorithms, analyzing trends, discussing challenges, and outlining future prospects. It begins by providing a…

    Submitted 17 March, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

  49. arXiv:2403.05415  [pdf]

    eess.SY

    An Overview of Automated Vehicle Longitudinal Platoon Formation Strategies

    Authors: M Sabbir Salek, Mugdha Basu Thakur, Pardha Sai Krishna Ala, Mashrur Chowdhury, Matthias Schmid, Pamela Murray-Tuite, Sakib Mahmud Khan, Venkat Krovi

    Abstract: Automated vehicle (AV) platooning has the potential to improve the safety, operational, and energy efficiency of surface transportation systems by limiting or eliminating human involvement in the driving tasks. The theoretical validity of the AV platooning strategies has been established and practical applications are being tested under real-world conditions. The emergence of sensors, communicatio…

    Submitted 16 May, 2025; v1 submitted 8 March, 2024; originally announced March 2024.

  50. Taking Second-life Batteries from Exhausted to Empowered using Experiments, Data Analysis, and Health Estimation

    Authors: Xiaofan Cui, Muhammad Aadil Khan, Gabriele Pozzato, Surinder Singh, Ratnesh Sharma, Simona Onori

    Abstract: The reuse of retired electric vehicle batteries in grid energy storage offers environmental and economic benefits. This study concentrates on health monitoring algorithms for retired batteries deployed in grid storage. Over 15 months of testing, we collect, analyze, and publicize a dataset of second-life batteries, implementing a cycling protocol simulating grid energy storage load profiles within…

    Submitted 8 June, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

    Comments: 16 pages, 8 figures