FREE: The Foundational Semantic Recognition for Modeling Environmental Ecosystems

Luo, Shiyuan; Ni, Juntong; Chen, Shengyu; Yu, Runlong; Xie, Yiqun; Liu, Licheng; Jin, Zhenong; Yao, Huaxiu; Jia, Xiaowei

Computer Science > Machine Learning

arXiv:2311.10255 (cs)

[Submitted on 17 Nov 2023 (v1), last revised 22 Feb 2025 (this version, v4)]

Title:FREE: The Foundational Semantic Recognition for Modeling Environmental Ecosystems

Authors:Shiyuan Luo, Juntong Ni, Shengyu Chen, Runlong Yu, Yiqun Xie, Licheng Liu, Zhenong Jin, Huaxiu Yao, Xiaowei Jia

View PDF HTML (experimental)

Abstract:Modeling environmental ecosystems is critical for the sustainability of our planet, but is extremely challenging due to the complex underlying processes driven by interactions amongst a large number of physical variables. As many variables are difficult to measure at large scales, existing works often utilize a combination of observable features and locally available measurements or modeled values as input to build models for a specific study region and time period. This raises a fundamental question in advancing the modeling of environmental ecosystems: how to build a general framework for modeling the complex relationships amongst various environmental data over space and time? In this paper, we introduce a framework, FREE, which maps available environmental data into a text space and then converts the traditional predictive modeling task in environmental science to a semantic recognition problem. The proposed framework leverages recent advances in Large Language Models (LLMs) to supplement the original input features with natural language descriptions. This framework facilitates capturing the data semantics and allows harnessing the irregularities of input features. When used for long-term prediction, FREE has the flexibility to incorporate newly collected observations to enhance future prediction. The efficacy of FREE is evaluated in the context of two societally important real-world applications, predicting stream water temperature in the Delaware River Basin and predicting annual corn yield in Illinois and Iowa. Beyond the superior predictive performance over multiple baselines, FREE is shown to be more data- and computation-efficient as it can be pre-trained on simulated data generated by physics-based models.

Subjects:	Machine Learning (cs.LG); Populations and Evolution (q-bio.PE)
Cite as:	arXiv:2311.10255 [cs.LG]
	(or arXiv:2311.10255v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2311.10255

Submission history

From: Shiyuan Luo [view email]
[v1] Fri, 17 Nov 2023 00:53:09 UTC (14,969 KB)
[v2] Sat, 20 Apr 2024 00:15:04 UTC (15,906 KB)
[v3] Wed, 19 Feb 2025 20:24:23 UTC (16,923 KB)
[v4] Sat, 22 Feb 2025 16:12:49 UTC (16,923 KB)

Computer Science > Machine Learning

Title:FREE: The Foundational Semantic Recognition for Modeling Environmental Ecosystems

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:FREE: The Foundational Semantic Recognition for Modeling Environmental Ecosystems

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators