Long-Term Mapping of the Douro River Plume with Multi-Agent Reinforcement Learning

Nicolò Dal Fabbro, Milad Mesbahi, Renato Mendes, João Borges de Sousa, George J. Pappas

arXiv:2510.03534 (cs)

[Submitted on 3 Oct 2025]

Abstract: We study the problem of long-term (multi-day) mapping of a river plume using multiple autonomous underwater vehicles (AUVs), focusing on the Douro River as a representative use case. We propose an energy- and communication-efficient multi-agent reinforcement learning approach in which a central coordinator intermittently communicates with the AUVs, collecting measurements and issuing commands. Our approach integrates spatiotemporal Gaussian process regression (GPR) with a multi-head Q-network controller that regulates direction and speed for each AUV. Simulations using the Delft3D ocean model demonstrate that our method consistently outperforms both single- and multi-agent benchmarks, and that scaling the number of agents improves both mean squared error (MSE) and operational endurance. In some instances, doubling the number of AUVs more than doubles endurance while maintaining or improving accuracy, underscoring the benefits of multi-agent coordination. Our learned policies generalize across unseen seasonal regimes over different months and years, demonstrating promise for future development of data-driven long-term monitoring of dynamic plume environments.
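As an illustration of the two components named in the abstract, the following is a minimal sketch and not the authors' implementation. It assumes a separable space-time RBF kernel for the GPR and assumes that a multi-head Q-network exposes one vector of Q-values per head (direction and speed), with greedy selection taken independently per head. The kernel form, hyperparameters, synthetic field, and the function names `st_rbf_kernel`, `gpr_predict`, and `multi_head_greedy` are all hypothetical.

```python
import numpy as np


def st_rbf_kernel(A, B, ls_space=1.0, ls_time=1.0, var=1.0):
    """Separable space-time RBF kernel (assumed form); rows are (x, y, t)."""
    d_space = ((A[:, None, :2] - B[None, :, :2]) ** 2).sum(-1)
    d_time = (A[:, None, 2] - B[None, :, 2]) ** 2
    return (var
            * np.exp(-0.5 * d_space / ls_space**2)
            * np.exp(-0.5 * d_time / ls_time**2))


def gpr_predict(X_train, y_train, X_query, noise=1e-2, **kern):
    """Zero-mean GP posterior mean and variance at the query points."""
    K = st_rbf_kernel(X_train, X_train, **kern) + noise * np.eye(len(X_train))
    K_s = st_rbf_kernel(X_query, X_train, **kern)
    L = np.linalg.cholesky(K)  # K is positive definite thanks to the noise term
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, y_train))
    mean = K_s @ alpha
    v = np.linalg.solve(L, K_s.T)
    variance = kern.get("var", 1.0) - (v**2).sum(0)
    return mean, variance


def multi_head_greedy(q_direction, q_speed):
    """Pick the greedy action of each head independently (assumed scheme)."""
    return int(np.argmax(q_direction)), int(np.argmax(q_speed))


# Synthetic stand-in for plume measurements; not data from the paper.
rng = np.random.default_rng(0)


def field(X):
    return np.sin(X[:, 0]) * np.cos(X[:, 1]) + 0.1 * X[:, 2]


X_train = rng.uniform(0.0, 5.0, size=(40, 3))
X_query = rng.uniform(0.0, 5.0, size=(10, 3))
mean, variance = gpr_predict(X_train, field(X_train), X_query,
                             ls_space=1.5, ls_time=2.0)
mse = float(np.mean((mean - field(X_query)) ** 2))
print(f"MSE on held-out points: {mse:.4f}")

# Hypothetical Q-values for 3 headings and 2 speed levels.
print(multi_head_greedy(np.array([0.1, 0.9, 0.3]), np.array([0.5, 0.2])))
```

In this sketch the posterior variance is what a coordinator could use to judge where the map is still uncertain, and the per-head argmax keeps the action space additive (headings plus speeds) rather than multiplicative (headings times speeds). Whether the paper uses either idea in this form is not stated in the abstract.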

Subjects:	Multiagent Systems (cs.MA); Machine Learning (cs.LG); Systems and Control (eess.SY); Machine Learning (stat.ML)
Cite as:	arXiv:2510.03534 [cs.MA]
	(or arXiv:2510.03534v1 [cs.MA] for this version)
	https://doi.org/10.48550/arXiv.2510.03534

Submission history

From: Nicolò Dal Fabbro
[v1] Fri, 3 Oct 2025 22:08:08 UTC (4,237 KB)
