
Showing 1–28 of 28 results for author: Sultani, W

  1. arXiv:2504.02602  [pdf, other]

    cs.CV

    Leveraging Sparse Annotations for Leukemia Diagnosis on the Large Leukemia Dataset

    Authors: Abdul Rehman, Talha Meraj, Aiman Mahmood Minhas, Ayisha Imran, Mohsen Ali, Waqas Sultani, Mubarak Shah

    Abstract: Leukemia is the 10th most frequently diagnosed cancer and one of the leading causes of cancer-related deaths worldwide. Realistic analysis of leukemia requires White Blood Cell (WBC) localization, classification, and morphological assessment. Despite deep learning advances in medical imaging, leukemia analysis lacks a large, diverse multi-task dataset, while existing small datasets lack domain divers…

    Submitted 3 April, 2025; originally announced April 2025.

    Comments: Under Review

  2. arXiv:2503.03370  [pdf, other]

    cs.CV

    MIAdapt: Source-free Few-shot Domain Adaptive Object Detection for Microscopic Images

    Authors: Nimra Dilawar, Sara Nadeem, Javed Iqbal, Waqas Sultani, Mohsen Ali

    Abstract: Existing generic unsupervised domain adaptation approaches require access to both a large labeled source dataset and a sufficient unlabeled target dataset during adaptation. However, collecting a large dataset, even if unlabeled, is a challenging and expensive endeavor, especially in medical imaging. In addition, constraints such as privacy issues can result in cases where source data is unavailab…

    Submitted 6 March, 2025; v1 submitted 5 March, 2025; originally announced March 2025.

    Comments: 6 pages, 5 figures

  3. arXiv:2408.04224  [pdf, other]

    cs.CV

    Cross-View Meets Diffusion: Aerial Image Synthesis with Geometry and Text Guidance

    Authors: Ahmad Arrabi, Xiaohan Zhang, Waqas Sultani, Chen Chen, Safwan Wshah

    Abstract: Aerial imagery analysis is critical for many research fields. However, obtaining frequent, high-quality aerial images is not always feasible because of the high effort and cost involved. One solution is to use the Ground-to-Aerial (G2A) technique to synthesize aerial images from easily collectible ground images. However, G2A is rarely studied, because of its challenges, including but not limited…

    Submitted 20 August, 2024; v1 submitted 8 August, 2024; originally announced August 2024.

  4. arXiv:2407.07633  [pdf, other]

    eess.IV cs.CV

    Few-Shot Domain Adaptive Object Detection for Microscopic Images

    Authors: Sumayya Inayat, Nimra Dilawar, Waqas Sultani, Mohsen Ali

    Abstract: In recent years, numerous domain adaptive strategies have been proposed to help deep learning models overcome the challenges posed by domain shift. However, even unsupervised domain adaptive strategies still require a large amount of target data. Medical imaging datasets are often characterized by class imbalance and scarcity of labeled and unlabeled data. Few-shot domain adaptive object detection…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: Accepted to MICCAI 2024 main conference

  5. Joint Stream: Malignant Region Learning for Breast Cancer Diagnosis

    Authors: Abdul Rehman, Sarfaraz Hussein, Waqas Sultani

    Abstract: Early diagnosis of breast cancer (BC) significantly contributes to reducing the mortality rate worldwide. The detection of different factors and biomarkers such as Estrogen receptor (ER), Progesterone receptor (PR), Human epidermal growth factor receptor 2 (HER2) gene, Histological grade (HG), Axillary lymph node (ALN) status, and Molecular subtype (MS) can play a significant role in improved BC…

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: Under Review (Biomedical Signal Processing and Control)

    Journal ref: Volume 99, January 2025, 106899

  6. arXiv:2405.10803  [pdf, other]

    eess.IV cs.CV

    A Large-scale Multi Domain Leukemia Dataset for the White Blood Cells Detection with Morphological Attributes for Explainability

    Authors: Abdul Rehman, Talha Meraj, Aiman Mahmood Minhas, Ayisha Imran, Mohsen Ali, Waqas Sultani

    Abstract: Earlier diagnosis of Leukemia can save thousands of lives annually. The prognosis of leukemia is challenging without the morphological information of White Blood Cells (WBC) and relies on the accessibility of expensive microscopes and the availability of hematologists to analyze Peripheral Blood Samples (PBS). Deep Learning based methods can be employed to assist hematologists. However, these algo…

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: Early Accept

  7. arXiv:2309.15625  [pdf, other]

    cs.CV eess.IV

    Leveraging Topology for Domain Adaptive Road Segmentation in Satellite and Aerial Imagery

    Authors: Javed Iqbal, Aliza Masood, Waqas Sultani, Mohsen Ali

    Abstract: Extracting precise aspects of roads through segmentation of remote sensing imagery is useful for many real-world applications such as autonomous vehicles, urban development and planning, and achieving sustainable development goals. Roads are only a small part of the image, and their appearance, type, width, elevation, directions, etc. exhibit large variations across geographical areas. Furthermore,…

    Submitted 27 September, 2023; originally announced September 2023.

  8. arXiv:2308.09624  [pdf, other]

    cs.CV

    GeoDTR+: Toward generic cross-view geolocalization via geometric disentanglement

    Authors: Xiaohan Zhang, Xingyu Li, Waqas Sultani, Chen Chen, Safwan Wshah

    Abstract: Cross-View Geo-Localization (CVGL) estimates the location of a ground image by matching it to a geo-tagged aerial image in a database. Recent works achieve outstanding progress on CVGL benchmarks. However, existing methods still suffer from poor performance in cross-area evaluation, in which the training and testing data are captured from completely distinct areas. We attribute this deficiency to…

    Submitted 13 August, 2024; v1 submitted 18 August, 2023; originally announced August 2023.

    Comments: arXiv admin note: text overlap with arXiv:2212.04074

  9. arXiv:2308.06393  [pdf, other]

    cs.CV

    R2S100K: Road-Region Segmentation Dataset For Semi-Supervised Autonomous Driving in the Wild

    Authors: Muhammad Atif Butt, Hassan Ali, Adnan Qayyum, Waqas Sultani, Ala Al-Fuqaha, Junaid Qadir

    Abstract: Semantic understanding of roadways is a key enabling factor for safe autonomous driving. However, existing autonomous driving datasets provide well-structured urban roads while ignoring unstructured roadways containing distress, potholes, water puddles, and various kinds of road patches, e.g., earthen, gravel, etc. To this end, we introduce the Road Region Segmentation dataset (R2S100K) -- a large-scale…

    Submitted 11 August, 2023; originally announced August 2023.

  10. arXiv:2212.04074  [pdf, other]

    cs.CV

    Cross-view Geo-localization via Learning Disentangled Geometric Layout Correspondence

    Authors: Xiaohan Zhang, Xingyu Li, Waqas Sultani, Yi Zhou, Safwan Wshah

    Abstract: Cross-view geo-localization aims to estimate the location of a query ground image by matching it to a reference database of geo-tagged aerial images. As an extremely challenging task, its difficulties are rooted in the drastic view changes and different capture times between the two views. Despite these difficulties, recent works achieve outstanding progress on cross-view geo-localization benchmarks. However,…

    Submitted 16 June, 2023; v1 submitted 7 December, 2022; originally announced December 2022.

  11. arXiv:2210.14295  [pdf, other]

    cs.CV

    Cross-View Image Sequence Geo-localization

    Authors: Xiaohan Zhang, Waqas Sultani, Safwan Wshah

    Abstract: Cross-view geo-localization aims to estimate the GPS location of a query ground-view image by matching it to images from a reference database of geo-tagged aerial images. To address this challenging problem, recent approaches use panoramic ground-view images to increase the range of visibility. Although appealing, panoramic images are not readily available compared to the videos of limited Field-O…

    Submitted 2 November, 2022; v1 submitted 25 October, 2022; originally announced October 2022.

  12. arXiv:2210.08423  [pdf, other]

    cs.CV cs.RO

    TransVisDrone: Spatio-Temporal Transformer for Vision-based Drone-to-Drone Detection in Aerial Videos

    Authors: Tushar Sangam, Ishan Rajendrakumar Dave, Waqas Sultani, Mubarak Shah

    Abstract: Drone-to-drone detection using visual feed has crucial applications, such as detecting drone collisions, detecting drone attacks, or coordinating flight with other drones. However, existing methods are computationally costly, follow non-end-to-end optimization, and have complex multi-stage pipelines, making them less suitable for real-time deployment on edge devices. In this work, we propose a sim…

    Submitted 25 August, 2023; v1 submitted 15 October, 2022; originally announced October 2022.

    Comments: ICRA 2023

  13. Mapping Temporary Slums from Satellite Imagery using a Semi-Supervised Approach

    Authors: M. Fasi ur Rehman, Izza Ali, Waqas Sultani, Mohsen Ali

    Abstract: One billion people worldwide are estimated to be living in slums, and documenting and analyzing these regions is a challenging task. Compared to regular slums, the small, scattered, and temporary nature of temporary slums makes data collection and labeling tedious and time-consuming. To tackle this challenging problem of temporary slum detection, we present a semi-supervised deep learning segme…

    Submitted 9 April, 2022; originally announced April 2022.

  14. arXiv:2112.15202  [pdf, other]

    cs.CV

    Visual and Object Geo-localization: A Comprehensive Survey

    Authors: Daniel Wilson, Xiaohan Zhang, Waqas Sultani, Safwan Wshah

    Abstract: The concept of geo-localization refers to the process of determining where on earth some 'entity' is located, typically using Global Positioning System (GPS) coordinates. The entity of interest may be an image, sequence of images, a video, satellite image, or even objects visible within the image. As massive datasets of GPS tagged media have rapidly become available due to smartphones and the inte…

    Submitted 11 October, 2023; v1 submitted 30 December, 2021; originally announced December 2021.

  15. arXiv:2111.13656  [pdf, other]

    cs.CV

    Towards Low-Cost and Efficient Malaria Detection

    Authors: Waqas Sultani, Wajahat Nawaz, Syed Javed, Muhammad Sohail Danish, Asma Saadia, Mohsen Ali

    Abstract: Malaria, a fatal but curable disease, claims hundreds of thousands of lives every year. Early and correct diagnosis is vital to avoid health complications; however, it depends upon the availability of costly microscopes and trained experts to analyze blood-smear slides. Deep learning-based methods have the potential to not only decrease the burden of experts but also improve diagnostic accuracy on l…

    Submitted 16 April, 2022; v1 submitted 26 November, 2021; originally announced November 2021.

  16. arXiv:2111.01505  [pdf, other]

    eess.IV cs.CV

    Out of distribution detection for skin and malaria images

    Authors: Muhammad Zaida, Shafaqat Ali, Mohsen Ali, Sarfaraz Hussein, Asma Saadia, Waqas Sultani

    Abstract: Deep neural networks have shown promising results in disease detection and classification using medical image data. However, they still struggle with real-world scenarios, especially with reliably detecting out-of-distribution (OoD) samples. We propose an approach to robustly classify OoD samples in skin and malaria images without the need to access labeled OoD samples during tr…

    Submitted 2 November, 2021; originally announced November 2021.

  17. arXiv:2110.13558  [pdf, other]

    cs.CV

    Cross-Region Building Counting in Satellite Imagery using Counting Consistency

    Authors: Muaaz Zakria, Hamza Rawal, Waqas Sultani, Mohsen Ali

    Abstract: Estimating the number of buildings in any geographical region is a vital component of urban analysis, disaster management, and public policy decisions. Deep learning methods for building localization and counting in satellite imagery can serve as a viable and cheap alternative. However, these algorithms suffer performance degradation when applied to regions on which they have not been trained.…

    Submitted 13 August, 2023; v1 submitted 26 October, 2021; originally announced October 2021.

  18. Estimation of BMI from Facial Images using Semantic Segmentation based Region-Aware Pooling

    Authors: Nadeem Yousaf, Sarfaraz Hussein, Waqas Sultani

    Abstract: Body-Mass-Index (BMI) conveys important information about one's life such as health and socio-economic conditions. Large-scale automatic estimation of BMIs can help predict several societal behaviors such as health, job opportunities, friendships, and popularity. Recent works have either employed hand-crafted geometrical face features or face-level deep convolutional neural network features fo…

    Submitted 10 April, 2021; originally announced April 2021.

    Comments: Accepted for publication in Computers in Biology and Medicine

    ACM Class: I.4

    Journal ref: Computers in Biology and Medicine Volume 133, June 2021, Pages 104392

  19. arXiv:2103.17242  [pdf, other]

    cs.CV

    Dogfight: Detecting Drones from Drones Videos

    Authors: Muhammad Waseem Ashraf, Waqas Sultani, Mubarak Shah

    Abstract: As airborne vehicles are becoming more autonomous and ubiquitous, it has become vital to develop the capability to detect the objects in their surroundings. This paper attempts to address the problem of detecting drones from other flying drones. The erratic movement of the source and target drones, small size, arbitrary shape, large intensity variations, and occlusion make this problem quite chall…

    Submitted 9 April, 2021; v1 submitted 31 March, 2021; originally announced March 2021.

    Comments: 10 pages, 10 figures, Accepted for CVPR 2021

  20. arXiv:2102.08708  [pdf, other]

    eess.IV cs.CV

    A Dataset and Benchmark for Malaria Life-Cycle Classification in Thin Blood Smear Images

    Authors: Qazi Ammar Arshad, Mohsen Ali, Saeed-ul Hassan, Chen Chen, Ayisha Imran, Ghulam Rasul, Waqas Sultani

    Abstract: Malaria microscopy, the microscopic examination of stained blood slides to detect the parasite Plasmodium, is considered the gold standard for detecting the life-threatening disease malaria. Detecting the Plasmodium parasite requires a skilled examiner and may take 10 to 15 minutes to go through the whole slide. Due to a lack of skilled medical professionals in the underdeveloped or resou…

    Submitted 17 February, 2021; originally announced February 2021.

  21. arXiv:2101.00676  [pdf, other]

    cs.CV cs.CR

    Fake Visual Content Detection Using Two-Stream Convolutional Neural Networks

    Authors: Bilal Yousaf, Muhammad Usama, Waqas Sultani, Arif Mahmood, Junaid Qadir

    Abstract: Rapid progress in adversarial learning has enabled the generation of realistic-looking fake visual content. To distinguish between fake and real visual content, several detection techniques have been proposed. The performance of most of these techniques, however, drops off significantly if the test and the training data are sampled from different distributions. This motivates efforts towards improvi…

    Submitted 3 January, 2021; originally announced January 2021.

  22. arXiv:2004.10774  [pdf, other]

    cs.CV eess.IV

    Action recognition in real-world videos

    Authors: Waqas Sultani, Qazi Ammar Arshad, Chen Chen

    Abstract: The goal of human action recognition is to temporally or spatially localize the human action of interest in video sequences. Temporal localization (i.e. indicating the start and end frames of the action in a video) is referred to as frame-level detection. Spatial localization, which is more challenging, means to identify the pixels within each action frame that correspond to the action. This setti…

    Submitted 22 April, 2020; originally announced April 2020.

  23. arXiv:2004.00222  [pdf, ps, other]

    cs.CV

    Video Anomaly Detection for Smart Surveillance

    Authors: Sijie Zhu, Chen Chen, Waqas Sultani

    Abstract: In modern intelligent video surveillance systems, automatic anomaly detection through computer vision analytics plays a pivotal role which not only significantly increases monitoring efficiency but also reduces the burden on live monitoring. Anomalies in videos are broadly defined as events or activities that are unusual and signify irregular behavior. The goal of anomaly detection is to temporall…

    Submitted 11 April, 2020; v1 submitted 1 April, 2020; originally announced April 2020.

  24. arXiv:1910.10027  [pdf, other]

    cs.CV cs.LG cs.RO

    Human Action Recognition in Drone Videos using a Few Aerial Training Examples

    Authors: Waqas Sultani, Mubarak Shah

    Abstract: Drones are enabling new forms of human action surveillance due to their low cost and fast mobility. However, using deep neural networks for automatic aerial action recognition is difficult due to the need for a large number of training aerial human action videos. Collecting a large number of human action aerial videos is costly, time-consuming, and difficult. In this paper, we explore two alterna…

    Submitted 2 April, 2021; v1 submitted 22 October, 2019; originally announced October 2019.

    Comments: CVIU, 2021

  25. arXiv:1904.00674  [pdf, other]

    cs.CV

    Deep Built-Structure Counting in Satellite Imagery Using Attention Based Re-Weighting

    Authors: Anza Shakeel, Waqas Sultani, Mohsen Ali

    Abstract: In this paper, we attempt to address the challenging problem of counting built-structures in satellite imagery. Building density is a more accurate estimate of the population density, urban area expansion and its impact on the environment, than the built-up area segmentation. However, building shape variances, overlapping boundaries, and varying densities make this a complex task. To tackle th…

    Submitted 1 April, 2019; originally announced April 2019.

  26. arXiv:1801.04264  [pdf, other]

    cs.CV

    Real-world Anomaly Detection in Surveillance Videos

    Authors: Waqas Sultani, Chen Chen, Mubarak Shah

    Abstract: Surveillance videos are able to capture a variety of realistic anomalies. In this paper, we propose to learn anomalies by exploiting both normal and anomalous videos. To avoid annotating the anomalous segments or clips in training videos, which is very time consuming, we propose to learn anomaly through the deep multiple instance ranking framework by leveraging weakly labeled training videos, i.e.…

    Submitted 14 February, 2019; v1 submitted 12 January, 2018; originally announced January 2018.

  27. arXiv:1704.00758  [pdf, other]

    cs.CV

    Unsupervised Action Proposal Ranking through Proposal Recombination

    Authors: Waqas Sultani, Dong Zhang, Mubarak Shah

    Abstract: Recently, action proposal methods have played an important role in action recognition tasks, as they reduce the search space dramatically. Most unsupervised action proposal methods tend to generate hundreds of action proposals which include many noisy, inconsistent, and unranked action proposals, while supervised action proposal methods take advantage of predefined object detectors (e.g., human de…

    Submitted 3 April, 2017; originally announced April 2017.

  28. arXiv:1605.08125  [pdf, other]

    cs.CV

    Automatic Action Annotation in Weakly Labeled Videos

    Authors: Waqas Sultani, Mubarak Shah

    Abstract: Manual spatio-temporal annotation of human action in videos is laborious, requires several annotators and contains human biases. In this paper, we present a weakly supervised approach to automatically obtain spatio-temporal annotations of an actor in action videos. We first obtain a large number of action proposals in each video. To capture a few most representative action proposals in each video…

    Submitted 25 May, 2016; originally announced May 2016.