Skip to main content

Showing 1–7 of 7 results for author: Tampubolon, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2104.08010  [pdf, other

    eess.SY cs.GT cs.MA cs.NI

    Welfare Measure for Resource Allocation with Algorithmic Implementation: Beyond Average and Max-Min

    Authors: Ezra Tampubolon, Holger Boche

    Abstract: In this work, we propose an axiomatic approach for measuring the performance/welfare of a system consisting of concurrent agents in a resource-driven system. Our approach provides a unifying view on popular system optimality principles, such as the maximal average/total utilities and the max-min fairness. Moreover, it gives rise to other system optimality notions that have not been fully exploited… ▽ More

    Submitted 16 April, 2021; originally announced April 2021.

  2. arXiv:2010.10901  [pdf, ps, other

    cs.LG cs.GT cs.MA econ.TH eess.SY

    On Information Asymmetry in Competitive Multi-Agent Reinforcement Learning: Convergence and Optimality

    Authors: Ezra Tampubolon, Haris Ceribasic, Holger Boche

    Abstract: In this work, we study the system of interacting non-cooperative two Q-learning agents, where one agent has the privilege of observing the other's actions. We show that this information asymmetry can lead to a stable outcome of population learning, which generally does not occur in an environment of general independent learners. The resulting post-learning policies are almost optimal in the underl… ▽ More

    Submitted 22 January, 2021; v1 submitted 21 October, 2020; originally announced October 2020.

    Comments: Preprint

  3. arXiv:2010.10878  [pdf, other

    math.OC cs.GT cs.LG cs.MA eess.SY

    Coordinated Online Learning for Multi-Agent Systems with Coupled Constraints and Perturbed Utility Observations

    Authors: Ezra Tampubolon, Holger Boche

    Abstract: Competitive non-cooperative online decision-making agents whose actions increase congestion of scarce resources constitute a model for widespread modern large-scale applications. To ensure sustainable resource behavior, we introduce a novel method to steer the agents toward a stable population state, fulfilling the given coupled resource constraints. The proposed method is a decentralized resource… ▽ More

    Submitted 21 October, 2020; originally announced October 2020.

    Comments: Preprint: To appear in IEEE Transaction on Automatic Control

  4. arXiv:2002.06080  [pdf, other

    eess.SY cs.GT cs.MA

    Resource-Aware Control via Dynamic Pricing for Congestion Game with Finite-Time Guarantees

    Authors: Ezra Tampubolon, Haris Ceribasic, Holger Boche

    Abstract: Congestion game is a widely used model for modern networked applications. A central issue in such applications is that the selfish behavior of the participants may result in resource overloading and negative externalities for the system participants. In this work, we propose a pricing mechanism that guarantees the sub-linear increase of the time-cumulative violation of the resource load constraint… ▽ More

    Submitted 14 February, 2020; originally announced February 2020.

  5. arXiv:1910.09314  [pdf, ps, other

    cs.LG cs.GT cs.MA econ.TH eess.SY

    Pricing Mechanism for Resource Sustainability in Competitive Online Learning Multi-Agent Systems

    Authors: Ezra Tampubolon, Holger Boche

    Abstract: In this paper, we consider the problem of resource congestion control for competing online learning agents. On the basis of non-cooperative game as the model for the interaction between the agents, and the noisy online mirror ascent as the model for rational behavior of the agents, we propose a novel pricing mechanism which gives the agents incentives for sustainable use of the resources. Our mech… ▽ More

    Submitted 21 October, 2019; originally announced October 2019.

  6. arXiv:1910.09282  [pdf, other

    math.OC cs.LG

    Robust Online Learning for Resource Allocation -- Beyond Euclidean Projection and Dynamic Fit

    Authors: Ezra Tampubolon, Holger Boche

    Abstract: Online-learning literature has focused on designing algorithms that ensure sub-linear growth of the cumulative long-term constraint violations. The drawback of this guarantee is that strictly feasible actions may cancel out constraint violations on other time slots. For this reason, we introduce a new performance measure called $\hCFit$, whose particular instance is the cumulative positive part of… ▽ More

    Submitted 21 October, 2019; originally announced October 2019.

  7. arXiv:1910.09276  [pdf, ps, other

    cs.GT cs.LG eess.SY math.OC

    Semi-Decentralized Coordinated Online Learning for Continuous Games with Coupled Constraints via Augmented Lagrangian

    Authors: Ezra Tampubolon, Holger Boche

    Abstract: We consider a class of concave continuous games in which the corresponding admissible strategy profile of each player underlies affine coupling constraints. We propose a novel algorithm that leads the relevant population dynamic toward Nash equilibrium. This algorithm is based on a mirror ascent algorithm, which suits with the framework of no-regret online learning, and on the augmented Lagrangian… ▽ More

    Submitted 21 October, 2019; originally announced October 2019.