Skip to main content

Showing 1–6 of 6 results for author: Khan, M H

Searching in archive eess. Search in all archives.
.
  1. arXiv:2411.02614  [pdf, other

    eess.IV cs.CV

    Divergent Domains, Convergent Grading: Enhancing Generalization in Diabetic Retinopathy Grading

    Authors: Sharon Chokuwa, Muhammad Haris Khan

    Abstract: Diabetic Retinopathy (DR) constitutes 5% of global blindness cases. While numerous deep learning approaches have sought to enhance traditional DR grading methods, they often falter when confronted with new out-of-distribution data thereby impeding their widespread application. In this study, we introduce a novel deep learning method for achieving domain generalization (DG) in DR grading and make t… ▽ More

    Submitted 4 November, 2024; originally announced November 2024.

    Comments: Accepted at WACV 2025

  2. arXiv:2410.20395  [pdf, other

    cs.CV eess.IV

    Depth Attention for Robust RGB Tracking

    Authors: Yu Liu, Arif Mahmood, Muhammad Haris Khan

    Abstract: RGB video object tracking is a fundamental task in computer vision. Its effectiveness can be improved using depth information, particularly for handling motion-blurred target. However, depth information is often missing in commonly used tracking benchmarks. In this work, we propose a new framework that leverages monocular depth estimation to counter the challenges of tracking targets that are out… ▽ More

    Submitted 27 October, 2024; originally announced October 2024.

    Comments: Oral Acceptance at the Asian Conference on Computer Vision (ACCV) 2024, Hanoi, Vietnam

  3. Multi-modal Medical Image Fusion For Non-Small Cell Lung Cancer Classification

    Authors: Salma Hassan, Hamad Al Hammadi, Ibrahim Mohammed, Muhammad Haris Khan

    Abstract: The early detection and nuanced subtype classification of non-small cell lung cancer (NSCLC), a predominant cause of cancer mortality worldwide, is a critical and complex issue. In this paper, we introduce an innovative integration of multi-modal data, synthesizing fused medical imaging (CT and PET scans) with clinical health records and genomic data. This unique fusion methodology leverages advan… ▽ More

    Submitted 27 September, 2024; originally announced September 2024.

  4. arXiv:2404.09342  [pdf, other

    cs.CV cs.SD eess.AS

    Face-voice Association in Multilingual Environments (FAME) Challenge 2024 Evaluation Plan

    Authors: Muhammad Saad Saeed, Shah Nawaz, Muhammad Salman Tahir, Rohan Kumar Das, Muhammad Zaigham Zaheer, Marta Moscati, Markus Schedl, Muhammad Haris Khan, Karthik Nandakumar, Muhammad Haroon Yousaf

    Abstract: The advancements of technology have led to the use of multimodal systems in various real-world applications. Among them, the audio-visual systems are one of the widely used multimodal systems. In the recent years, associating face and voice of a person has gained attention due to presence of unique correlation between them. The Face-voice Association in Multilingual Environments (FAME) Challenge 2… ▽ More

    Submitted 22 July, 2024; v1 submitted 14 April, 2024; originally announced April 2024.

    Comments: ACM Multimedia Conference - Grand Challenge

  5. arXiv:2201.09873  [pdf, other

    eess.IV cs.CV

    Transformers in Medical Imaging: A Survey

    Authors: Fahad Shamshad, Salman Khan, Syed Waqas Zamir, Muhammad Haris Khan, Munawar Hayat, Fahad Shahbaz Khan, Huazhu Fu

    Abstract: Following unprecedented success on the natural language tasks, Transformers have been successfully applied to several computer vision problems, achieving state-of-the-art results and prompting researchers to reconsider the supremacy of convolutional neural networks (CNNs) as {de facto} operators. Capitalizing on these advances in computer vision, the medical imaging field has also witnessed growin… ▽ More

    Submitted 24 January, 2022; originally announced January 2022.

    Comments: 41 pages, \url{https://github.com/fahadshamshad/awesome-transformers-in-medical-imaging}

  6. arXiv:1911.03711  [pdf, other

    eess.IV cs.CV

    Unsupervised adulterated red-chili pepper content transformation for hyperspectral classification

    Authors: Muhammad Hussain Khan, Zainab Saleem, Muhammad Ahmad, Ahmed Sohaib, Hamail Ayaz

    Abstract: Preserving red-chili quality is of utmost importance in which the authorities demand the quality techniques to detect, classify and prevent it from the impurities. For example, salt, wheat flour, wheat bran, and rice bran contamination in grounded red chili, which typically a food, are a serious threat to people who are allergic to such items. This work presents the feasibility of utilizing visibl… ▽ More

    Submitted 9 November, 2019; originally announced November 2019.

    Comments: 10 pages,