Showing 1–4 of 4 results for author: Enders, T

Searching in archive cs.
  1. arXiv:2404.06975  [pdf, ps, other]

    eess.SY cs.LG cs.MA

    Multi-Agent Soft Actor-Critic with Coordinated Loss for Autonomous Mobility-on-Demand Fleet Control

    Authors: Zeno Woywood, Jasper I. Wiltfang, Julius Luy, Tobias Enders, Maximilian Schiffer

    Abstract: We study a sequential decision-making problem for a profit-maximizing operator of an autonomous mobility-on-demand system. Optimizing a central operator's vehicle-to-request dispatching policy requires efficient and effective fleet control strategies. To this end, we employ a multi-agent Soft Actor-Critic algorithm combined with weighted bipartite matching. We propose a novel vehicle-based algorit…

    Submitted 22 June, 2025; v1 submitted 10 April, 2024; originally announced April 2024.

  2. arXiv:2402.09992  [pdf, other]

    cs.LG eess.SY

    Risk-Sensitive Soft Actor-Critic for Robust Deep Reinforcement Learning under Distribution Shifts

    Authors: Tobias Enders, James Harrison, Maximilian Schiffer

    Abstract: We study the robustness of deep reinforcement learning algorithms against distribution shifts within contextual multi-stage stochastic combinatorial optimization problems from the operations research domain. In this context, risk-sensitive algorithms promise to learn robust policies. While this field is of general interest to the reinforcement learning community, most studies to date focus on t…

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: 11 pages, 8 figures

  3. arXiv:2312.08884  [pdf, other]

    cs.LG cs.MA eess.SY

    Global Rewards in Multi-Agent Deep Reinforcement Learning for Autonomous Mobility on Demand Systems

    Authors: Heiko Hoppe, Tobias Enders, Quentin Cappart, Maximilian Schiffer

    Abstract: We study vehicle dispatching in autonomous mobility on demand (AMoD) systems, where a central operator assigns vehicles to customer requests or rejects these with the aim of maximizing its total profit. Recent approaches use multi-agent deep reinforcement learning (MADRL) to realize scalable yet performant algorithms, but train agents based on local rewards, which distorts the reward signal with r…

    Submitted 19 May, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: 22 pages, 6 figures, extended version of paper accepted at the 6th Learning for Dynamics & Control Conference (L4DC 2024)

  4. arXiv:2212.07313  [pdf, other]

    cs.LG cs.MA eess.SY

    Hybrid Multi-agent Deep Reinforcement Learning for Autonomous Mobility on Demand Systems

    Authors: Tobias Enders, James Harrison, Marco Pavone, Maximilian Schiffer

    Abstract: We consider the sequential decision-making problem of making proactive request assignment and rejection decisions for a profit-maximizing operator of an autonomous mobility on demand system. We formalize this problem as a Markov decision process and propose a novel combination of multi-agent Soft Actor-Critic and weighted bipartite matching to obtain an anticipative control policy. Thereby, we fac…

    Submitted 10 May, 2023; v1 submitted 14 December, 2022; originally announced December 2022.

    Comments: 20 pages, 7 figures, extended version of paper accepted at the 5th Learning for Dynamics & Control Conference (L4DC 2023)