Showing 1–2 of 2 results for author: Kubíček, O

  1. arXiv:2312.15220  [pdf, other]

    cs.GT

    Look-ahead Search on Top of Policy Networks in Imperfect Information Games

    Authors: Ondrej Kubicek, Neil Burch, Viliam Lisy

    Abstract: Search in test time is often used to improve the performance of reinforcement learning algorithms. Performing theoretically sound search in fully adversarial two-player games with imperfect information is notoriously difficult and requires a complicated training process. We present a method for adding test-time search to an arbitrary policy-gradient algorithm that learns from sampled trajectories.…

    Submitted 29 January, 2025; v1 submitted 23 December, 2023; originally announced December 2023.

  2. arXiv:2112.12594  [pdf, other]

    cs.GT

    Continual Depth-limited Responses for Computing Counter-strategies in Sequential Games

    Authors: David Milec, Ondřej Kubíček, Viliam Lisý

    Abstract: In zero-sum games, the optimal strategy is well-defined by the Nash equilibrium. However, it is overly conservative when playing against suboptimal opponents and it can not exploit their weaknesses. Limited look-ahead game solving in imperfect-information games allows defeating human experts in massive real-world games such as Poker, Liar's Dice, and Scotland Yard. However, since they approximate…

    Submitted 3 April, 2024; v1 submitted 23 December, 2021; originally announced December 2021.

    Comments: 16 pages, 15 figures