Showing 1–1 of 1 results for author: Toyoshima, K

  1. arXiv:2208.09855  [pdf, other]

    cs.GT cs.LG

    Last-Iterate Convergence with Full and Noisy Feedback in Two-Player Zero-Sum Games

    Authors: Kenshi Abe, Kaito Ariu, Mitsuki Sakamoto, Kentaro Toyoshima, Atsushi Iwasaki

    Abstract: This paper proposes Mutation-Driven Multiplicative Weights Update (M2WU) for learning an equilibrium in two-player zero-sum normal-form games and proves that it exhibits the last-iterate convergence property in both full and noisy feedback settings. In the former, players observe their exact gradient vectors of the utility functions. In the latter, they only observe the noisy gradient vectors. Eve…

    Submitted 26 May, 2023; v1 submitted 21 August, 2022; originally announced August 2022.

    Comments: Accepted in AISTATS 2023