Showing 1–6 of 6 results for author: Danish, M S

  1. arXiv:2506.06281  [pdf, other]

    cs.CV

    TerraFM: A Scalable Foundation Model for Unified Multisensor Earth Observation

    Authors: Muhammad Sohail Danish, Muhammad Akhtar Munir, Syed Roshaan Ali Shah, Muhammad Haris Khan, Rao Muhammad Anwer, Jorma Laaksonen, Fahad Shahbaz Khan, Salman Khan

    Abstract: Modern Earth observation (EO) increasingly leverages deep learning to harness the scale and diversity of satellite imagery across sensors and regions. While recent foundation models have demonstrated promising generalization across EO tasks, many remain limited by the scale, geographical coverage, and spectral diversity of their training data, factors critical for learning globally transferable re…

    Submitted 6 June, 2025; originally announced June 2025.

  2. arXiv:2412.15190  [pdf, other]

    cs.CV

    EarthDial: Turning Multi-sensory Earth Observations to Interactive Dialogues

    Authors: Sagar Soni, Akshay Dudhane, Hiyam Debary, Mustansar Fiaz, Muhammad Akhtar Munir, Muhammad Sohail Danish, Paolo Fraccaro, Campbell D Watson, Levente J Klein, Fahad Shahbaz Khan, Salman Khan

    Abstract: Automated analysis of vast Earth observation data via interactive Vision-Language Models (VLMs) can unlock new opportunities for environmental monitoring, disaster response, and resource management. Existing generic VLMs do not perform well on Remote Sensing data, while the recent Geo-spatial VLMs remain restricted to a fixed resolution and few sensor modalities. In this paper, we introduce Eart…

    Submitted 7 April, 2025; v1 submitted 19 December, 2024; originally announced December 2024.

  3. arXiv:2411.19325  [pdf, other]

    cs.CV

    GEOBench-VLM: Benchmarking Vision-Language Models for Geospatial Tasks

    Authors: Muhammad Sohail Danish, Muhammad Akhtar Munir, Syed Roshaan Ali Shah, Kartik Kuckreja, Fahad Shahbaz Khan, Paolo Fraccaro, Alexandre Lacoste, Salman Khan

    Abstract: While numerous recent benchmarks focus on evaluating generic Vision-Language Models (VLMs), they do not effectively address the specific challenges of geospatial applications. Generic VLM benchmarks are not designed to handle the complexities of geospatial data, an essential component for applications such as environmental monitoring, urban planning, and disaster management. Key challenges in the…

    Submitted 12 March, 2025; v1 submitted 28 November, 2024; originally announced November 2024.

    Comments: This updated version includes revisions and additional analysis

  4. arXiv:2405.14497  [pdf, other]

    cs.CV

    Improving Single Domain-Generalized Object Detection: A Focus on Diversification and Alignment

    Authors: Muhammad Sohail Danish, Muhammad Haris Khan, Muhammad Akhtar Munir, M. Saquib Sarfraz, Mohsen Ali

    Abstract: In this work, we tackle the problem of domain generalization for object detection, specifically focusing on the scenario where only a single source domain is available. We propose an effective approach that involves two key steps: diversifying the source domain and aligning detections based on class prediction confidence and localization. Firstly, we demonstrate that by carefully selecting a set o…

    Submitted 23 May, 2024; originally announced May 2024.

  5. arXiv:2311.15826  [pdf, other]

    cs.CV cs.AI

    GeoChat: Grounded Large Vision-Language Model for Remote Sensing

    Authors: Kartik Kuckreja, Muhammad Sohail Danish, Muzammal Naseer, Abhijit Das, Salman Khan, Fahad Shahbaz Khan

    Abstract: Recent advancements in Large Vision-Language Models (VLMs) have shown great promise in natural image domains, allowing users to hold a dialogue about given visual content. However, such general-domain VLMs perform poorly for Remote Sensing (RS) scenarios, leading to inaccurate or fabricated information when presented with RS domain-specific queries. Such a behavior emerges due to the unique challe…

    Submitted 24 November, 2023; originally announced November 2023.

    Comments: 10 pages, 4 figures

  6. arXiv:2111.13656  [pdf, other]

    cs.CV

    Towards Low-Cost and Efficient Malaria Detection

    Authors: Waqas Sultani, Wajahat Nawaz, Syed Javed, Muhammad Sohail Danish, Asma Saadia, Mohsen Ali

    Abstract: Malaria, a fatal but curable disease, claims hundreds of thousands of lives every year. Early and correct diagnosis is vital to avoid health complexities; however, it depends upon the availability of costly microscopes and trained experts to analyze blood-smear slides. Deep learning-based methods have the potential to not only decrease the burden of experts but also improve diagnostic accuracy on l…

    Submitted 16 April, 2022; v1 submitted 26 November, 2021; originally announced November 2021.