ThinkGeo: Evaluating Tool-Augmented Agents for Remote Sensing Tasks

Shabbir, Akashah; Munir, Muhammad Akhtar; Dudhane, Akshay; Sheikh, Muhammad Umer; Khan, Muhammad Haris; Fraccaro, Paolo; Moreno, Juan Bernabe; Khan, Fahad Shahbaz; Khan, Salman

Computer Science > Computer Vision and Pattern Recognition

arXiv:2505.23752 (cs)

[Submitted on 29 May 2025]

Title:ThinkGeo: Evaluating Tool-Augmented Agents for Remote Sensing Tasks

Authors:Akashah Shabbir, Muhammad Akhtar Munir, Akshay Dudhane, Muhammad Umer Sheikh, Muhammad Haris Khan, Paolo Fraccaro, Juan Bernabe Moreno, Fahad Shahbaz Khan, Salman Khan

View PDF HTML (experimental)

Abstract:Recent progress in large language models (LLMs) has enabled tool-augmented agents capable of solving complex real-world tasks through step-by-step reasoning. However, existing evaluations often focus on general-purpose or multimodal scenarios, leaving a gap in domain-specific benchmarks that assess tool-use capabilities in complex remote sensing use cases. We present ThinkGeo, an agentic benchmark designed to evaluate LLM-driven agents on remote sensing tasks via structured tool use and multi-step planning. Inspired by tool-interaction paradigms, ThinkGeo includes human-curated queries spanning a wide range of real-world applications such as urban planning, disaster assessment and change analysis, environmental monitoring, transportation analysis, aviation monitoring, recreational infrastructure, and industrial site analysis. Each query is grounded in satellite or aerial imagery and requires agents to reason through a diverse toolset. We implement a ReAct-style interaction loop and evaluate both open and closed-source LLMs (e.g., GPT-4o, Qwen2.5) on 436 structured agentic tasks. The benchmark reports both step-wise execution metrics and final answer correctness. Our analysis reveals notable disparities in tool accuracy and planning consistency across models. ThinkGeo provides the first extensive testbed for evaluating how tool-enabled LLMs handle spatial reasoning in remote sensing. Our code and dataset are publicly available

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2505.23752 [cs.CV]
	(or arXiv:2505.23752v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2505.23752

Submission history

From: Akashah Shabbir [view email]
[v1] Thu, 29 May 2025 17:59:38 UTC (5,006 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:ThinkGeo: Evaluating Tool-Augmented Agents for Remote Sensing Tasks

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:ThinkGeo: Evaluating Tool-Augmented Agents for Remote Sensing Tasks

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators