Showing 1–5 of 5 results for author: Pasztor, B

  1. arXiv:2406.16745  [pdf, other]

    cs.LG cs.AI cs.GT stat.ML

    Bandits with Preference Feedback: A Stackelberg Game Perspective

    Authors: Barna Pásztor, Parnian Kassraie, Andreas Krause

    Abstract: Bandits with preference feedback present a powerful tool for optimizing unknown target functions when only pairwise comparisons are allowed instead of direct value queries. This model allows for incorporating human feedback into online inference and optimization and has been employed in systems for fine-tuning large language models. The problem is well understood in simplified settings with linear…

    Submitted 30 October, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

    Comments: In Proceedings of the 38th Conference on Neural Information Processing Systems (NeurIPS), 30 pages, 8 figures

  2. arXiv:2406.01575  [pdf, other]

    math.OC cs.AI cs.LG stat.ML

    Contextual Bilevel Reinforcement Learning for Incentive Alignment

    Authors: Vinzenz Thoma, Barna Pasztor, Andreas Krause, Giorgia Ramponi, Yifan Hu

    Abstract: The optimal policy in various real-world strategic decision-making problems depends both on the environmental configuration and exogenous events. For these settings, we introduce Contextual Bilevel Reinforcement Learning (CB-RL), a stochastic bilevel decision-making model, where the lower level consists of solving a contextual Markov Decision Process (CMDP). CB-RL can be viewed as a Stackelberg Ga…

    Submitted 8 December, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: 60 pages, 21 figures

  3. arXiv:2306.17052  [pdf, other]

    cs.LG cs.AI cs.MA stat.ML

    Safe Model-Based Multi-Agent Mean-Field Reinforcement Learning

    Authors: Matej Jusup, Barna Pásztor, Tadeusz Janik, Kenan Zhang, Francesco Corman, Andreas Krause, Ilija Bogunovic

    Abstract: Many applications, e.g., in shared mobility, require coordinating a large number of agents. Mean-field reinforcement learning addresses the resulting scalability challenge by optimizing the policy of a representative agent interacting with the infinite population of identical agents instead of considering individual pairwise interactions. In this paper, we address an important generalization where…

    Submitted 27 December, 2023; v1 submitted 29 June, 2023; originally announced June 2023.

    Comments: 23 pages, 26 figures, 6 tables

  4. arXiv:2107.04050  [pdf, other]

    stat.ML cs.LG cs.MA

    Efficient Model-Based Multi-Agent Mean-Field Reinforcement Learning

    Authors: Barna Pásztor, Ilija Bogunovic, Andreas Krause

    Abstract: Learning in multi-agent systems is highly challenging due to several factors including the non-stationarity introduced by agents' interactions and the combinatorial nature of their state and action spaces. In particular, we consider the Mean-Field Control (MFC) problem which assumes an asymptotically infinite population of identical agents that aim to collaboratively maximize the collective reward…

    Submitted 9 May, 2023; v1 submitted 8 July, 2021; originally announced July 2021.

    Journal ref: Pásztor, B., Krause, A., & Bogunovic, I. (2023). Efficient Model-Based Multi-Agent Mean-Field Reinforcement Learning. Transactions on Machine Learning Research

  5. arXiv:2008.10376  [pdf, other]

    cs.CG cs.SI stat.ML

    Stochastic Gradient Descent Works Really Well for Stress Minimization

    Authors: Katharina Börsig, Ulrik Brandes, Barna Pasztor

    Abstract: Stress minimization is among the best studied force-directed graph layout methods because it reliably yields high-quality layouts. It thus comes as a surprise that a novel approach based on stochastic gradient descent (Zheng, Pawar and Goodman, TVCG 2019) is claimed to improve on state-of-the-art approaches based on majorization. We present experimental evidence that the new approach does not actu…

    Submitted 24 August, 2020; originally announced August 2020.

    Comments: Appears in the Proceedings of the 28th International Symposium on Graph Drawing and Network Visualization (GD 2020)