Dual Policy Reinforcement Learning for Real-time Rebalancing in Bike-sharing Systems

Liang, Jiaqi; Liu, Defeng; Jena, Sanjay Dominik; Lodi, Andrea; Vidal, Thibaut

Abstract:Bike-sharing systems play a crucial role in easing traffic congestion and promoting healthier lifestyles. However, ensuring their reliability and user acceptance requires effective strategies for rebalancing bikes. This study introduces a novel approach to address the real-time rebalancing problem with a fleet of vehicles. It employs a dual policy reinforcement learning algorithm that decouples inventory and routing decisions, enhancing realism and efficiency compared to previous methods where both decisions were made simultaneously. We first formulate the inventory and routing subproblems as a multi-agent Markov Decision Process within a continuous time framework. Subsequently, we propose a DQN-based dual policy framework to jointly estimate the value functions, minimizing the lost demand. To facilitate learning, a comprehensive simulator is applied to operate under a first-arrive-first-serve rule, which enables the computation of immediate rewards across diverse demand scenarios. We conduct extensive experiments on various datasets generated from historical real-world data, affected by both temporal and weather factors. Our proposed algorithm demonstrates significant performance improvements over previous baseline methods. It offers valuable practical insights for operators and further explores the incorporation of reinforcement learning into real-world dynamic programming problems, paving the way for more intelligent and robust urban mobility solutions.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2406.00868 [cs.LG]
	(or arXiv:2406.00868v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2406.00868

Computer Science > Machine Learning

Title:Dual Policy Reinforcement Learning for Real-time Rebalancing in Bike-sharing Systems

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators