Earth-Agent: Unlocking the Full Landscape of Earth Observation with Agents

Feng, Peilin; Lv, Zhutao; Ye, Junyan; Wang, Xiaolei; Huo, Xinjie; Yu, Jinhua; Xu, Wanghan; Zhang, Wenlong; Bai, Lei; He, Conghui; Li, Weijia

Computer Science > Computer Vision and Pattern Recognition

arXiv:2509.23141 (cs)

[Submitted on 27 Sep 2025 (v1), last revised 16 Oct 2025 (this version, v2)]

Title:Earth-Agent: Unlocking the Full Landscape of Earth Observation with Agents

Authors:Peilin Feng, Zhutao Lv, Junyan Ye, Xiaolei Wang, Xinjie Huo, Jinhua Yu, Wanghan Xu, Wenlong Zhang, Lei Bai, Conghui He, Weijia Li

View PDF

Abstract:Earth observation (EO) is essential for understanding the evolving states of the Earth system. Although recent MLLMs have advanced EO research, they still lack the capability to tackle complex tasks that require multi-step reasoning and the use of domain-specific tools. Agent-based methods offer a promising direction, but current attempts remain in their infancy, confined to RGB perception, shallow reasoning, and lacking systematic evaluation protocols. To overcome these limitations, we introduce Earth-Agent, the first agentic framework that unifies RGB and spectral EO data within an MCP-based tool ecosystem, enabling cross-modal, multi-step, and quantitative spatiotemporal reasoning beyond pretrained MLLMs. Earth-Agent supports complex scientific tasks such as geophysical parameter retrieval and quantitative spatiotemporal analysis by dynamically invoking expert tools and models across modalities. To support comprehensive evaluation, we further propose Earth-Bench, a benchmark of 248 expert-curated tasks with 13,729 images, spanning spectrum, products and RGB modalities, and equipped with a dual-level evaluation protocol that assesses both reasoning trajectories and final outcomes. We conduct comprehensive experiments varying different LLM backbones, comparisons with general agent frameworks, and comparisons with MLLMs on remote sensing benchmarks, demonstrating both the effectiveness and potential of Earth-Agent. Earth-Agent establishes a new paradigm for EO analysis, moving the field toward scientifically grounded, next-generation applications of LLMs in Earth observation.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2509.23141 [cs.CV]
	(or arXiv:2509.23141v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2509.23141

Submission history

From: Weijia Li [view email]
[v1] Sat, 27 Sep 2025 06:04:28 UTC (13,797 KB)
[v2] Thu, 16 Oct 2025 07:27:45 UTC (13,098 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Earth-Agent: Unlocking the Full Landscape of Earth Observation with Agents

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Earth-Agent: Unlocking the Full Landscape of Earth Observation with Agents

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators