Search | arXiv e-print repository

Online Dynamic Pricing for Electric Vehicle Charging Stations with Reservations

Authors: Jan Mrkos, Antonín Komenda, David Fiedler, Jiří Vokřínek

Abstract: This paper introduces a novel model for online dynamic pricing of electric vehicle charging services that integrates reservation, parking, and charging into a comprehensive bundle priced as a whole. Our approach focuses on the individual high-demand, fast-charging location, employing a Poisson process as a model of charging reservation arrivals, and develops an online dynamic pricing strategy opti… ▽ More This paper introduces a novel model for online dynamic pricing of electric vehicle charging services that integrates reservation, parking, and charging into a comprehensive bundle priced as a whole. Our approach focuses on the individual high-demand, fast-charging location, employing a Poisson process as a model of charging reservation arrivals, and develops an online dynamic pricing strategy optimized through a Markov Decision Process (MDP). A key contribution is the novel analysis of discretization error introduced when incorporating the continuous-time Poisson process into the discrete MDP framework. The MDP model's feasibility is demonstrated with a heuristic dynamic pricing method based on Monte-Carlo tree search, offering a viable path for real-world applications. △ Less

Submitted 28 April, 2025; v1 submitted 7 October, 2024; originally announced October 2024.

Comments: 45 pages, 11 figure, accepted to IEEE Transactions on Intelligent Transportation Systems (T-ITS)

arXiv:2310.19463 [pdf, ps, other]

Optimize Planning Heuristics to Rank, not to Estimate Cost-to-Goal

Authors: Leah Chrestien, Tomás Pevný, Stefan Edelkamp, Antonín Komenda

Abstract: In imitation learning for planning, parameters of heuristic functions are optimized against a set of solved problem instances. This work revisits the necessary and sufficient conditions of strictly optimally efficient heuristics for forward search algorithms, mainly A* and greedy best-first search, which expand only states on the returned optimal path. It then proposes a family of loss functions b… ▽ More In imitation learning for planning, parameters of heuristic functions are optimized against a set of solved problem instances. This work revisits the necessary and sufficient conditions of strictly optimally efficient heuristics for forward search algorithms, mainly A* and greedy best-first search, which expand only states on the returned optimal path. It then proposes a family of loss functions based on ranking tailored for a given variant of the forward search algorithm. Furthermore, from a learning theory point of view, it discusses why optimizing cost-to-goal \hstar\ is unnecessarily difficult. The experimental comparison on a diverse set of problems unequivocally supports the derived theory. △ Less

Submitted 30 October, 2023; originally announced October 2023.

Comments: 10 pages

arXiv:2209.05206 [pdf, other]

A Differentiable Loss Function for Learning Heuristics in A*

Authors: Leah Chrestien, Tomas Pevny, Antonin Komenda, Stefan Edelkamp

Abstract: Optimization of heuristic functions for the A* algorithm, realized by deep neural networks, is usually done by minimizing square root loss of estimate of the cost to goal values. This paper argues that this does not necessarily lead to a faster search of A* algorithm since its execution relies on relative values instead of absolute ones. As a mitigation, we propose a L* loss, which upper-bounds th… ▽ More Optimization of heuristic functions for the A* algorithm, realized by deep neural networks, is usually done by minimizing square root loss of estimate of the cost to goal values. This paper argues that this does not necessarily lead to a faster search of A* algorithm since its execution relies on relative values instead of absolute ones. As a mitigation, we propose a L* loss, which upper-bounds the number of excessively expanded states inside the A* search. The L* loss, when used in the optimization of state-of-the-art deep neural networks for automated planning in maze domains like Sokoban and maze with teleports, significantly improves the fraction of solved problems, the quality of founded plans, and reduces the number of expanded states to approximately 50% △ Less

Submitted 12 September, 2022; originally announced September 2022.

Comments: 10 pages

arXiv:2112.01918 [pdf, other]

Heuristic Search Planning with Deep Neural Networks using Imitation, Attention and Curriculum Learning

Authors: Leah Chrestien, Tomas Pevny, Antonin Komenda, Stefan Edelkamp

Abstract: Learning a well-informed heuristic function for hard task planning domains is an elusive problem. Although there are known neural network architectures to represent such heuristic knowledge, it is not obvious what concrete information is learned and whether techniques aimed at understanding the structure help in improving the quality of the heuristics. This paper presents a network model to learn… ▽ More Learning a well-informed heuristic function for hard task planning domains is an elusive problem. Although there are known neural network architectures to represent such heuristic knowledge, it is not obvious what concrete information is learned and whether techniques aimed at understanding the structure help in improving the quality of the heuristics. This paper presents a network model to learn a heuristic capable of relating distant parts of the state space via optimal plan imitation using the attention mechanism, which drastically improves the learning of a good heuristic function. To counter the limitation of the method in the creation of problems of increasing difficulty, we demonstrate the use of curriculum learning, where newly solved problem instances are added to the training set, which, in turn, helps to solve problems of higher complexities and far exceeds the performances of all existing baselines including classical planning heuristics. We demonstrate its effectiveness for grid-type PDDL domains. △ Less

Submitted 3 December, 2021; originally announced December 2021.

Comments: 8 pages plus references

arXiv:1711.09057 [pdf, other]

doi 10.1145/3128584

Cooperative Multi-Agent Planning: A Survey

Authors: Alejandro Torreño, Eva Onaindia, Antonín Komenda, Michal Štolba

Abstract: Cooperative multi-agent planning (MAP) is a relatively recent research field that combines technologies, algorithms and techniques developed by the Artificial Intelligence Planning and Multi-Agent Systems communities. While planning has been generally treated as a single-agent task, MAP generalizes this concept by considering multiple intelligent agents that work cooperatively to develop a course… ▽ More Cooperative multi-agent planning (MAP) is a relatively recent research field that combines technologies, algorithms and techniques developed by the Artificial Intelligence Planning and Multi-Agent Systems communities. While planning has been generally treated as a single-agent task, MAP generalizes this concept by considering multiple intelligent agents that work cooperatively to develop a course of action that satisfies the goals of the group. This paper reviews the most relevant approaches to MAP, putting the focus on the solvers that took part in the 2015 Competition of Distributed and Multi-Agent Planning, and classifies them according to their key features and relative performance. △ Less

Submitted 24 November, 2017; originally announced November 2017.

Comments: 34 pages, 4 figures, 4 tables

MSC Class: 68-42; 68-20; 68-35

Journal ref: ACM Computing Surveys, Volume 50, Number 6, Article 84. Publication date: November 2017

arXiv:1202.2773 [pdf, other]

Decentralized Multi-agent Plan Repair in Dynamic Environments

Authors: Antonín Komenda, Peter Novák, Michal Pěchouček

Abstract: Achieving joint objectives by teams of cooperative planning agents requires significant coordination and communication efforts. For a single-agent system facing a plan failure in a dynamic environment, arguably, attempts to repair the failed plan in general do not straightforwardly bring any benefit in terms of time complexity. However, in multi-agent settings the communication complexity might be… ▽ More Achieving joint objectives by teams of cooperative planning agents requires significant coordination and communication efforts. For a single-agent system facing a plan failure in a dynamic environment, arguably, attempts to repair the failed plan in general do not straightforwardly bring any benefit in terms of time complexity. However, in multi-agent settings the communication complexity might be of a much higher importance, possibly a high communication overhead might be even prohibitive in certain domains. We hypothesize that in decentralized systems, where coordination is enforced to achieve joint objectives, attempts to repair failed multi-agent plans should lead to lower communication overhead than replanning from scratch. The contribution of the presented paper is threefold. Firstly, we formally introduce the multi-agent plan repair problem and formally present the core hypothesis underlying our work. Secondly, we propose three algorithms for multi-agent plan repair reducing the problem to specialized instances of the multi-agent planning problem. Finally, we present results of experimental validation confirming the core hypothesis of the paper. △ Less

Submitted 13 February, 2012; originally announced February 2012.

Comments: 21 pages, 5 algorithms, 3 figures. This is the full version of an extended abstract published in Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2012), Conitzer, Winikoff, Padgham, and van der Hoek (eds.), June, 4--8, 2012, Valencia, Spain

ACM Class: I.2.11; I.2.8

Showing 1–6 of 6 results for author: Komenda, A