Skip to main content

Showing 1–1 of 1 results for author: Vadocz, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2210.13455  [pdf, other

    cs.LG cs.AI

    Epistemic Monte Carlo Tree Search

    Authors: Yaniv Oren, Villiam Vadocz, Matthijs T. J. Spaan, Wendelin Böhmer

    Abstract: The AlphaZero/MuZero (A/MZ) family of algorithms has achieved remarkable success across various challenging domains by integrating Monte Carlo Tree Search (MCTS) with learned models. Learned models introduce epistemic uncertainty, which is caused by learning from limited data and is useful for exploration in sparse reward environments. MCTS does not account for the propagation of this uncertainty… ▽ More

    Submitted 2 April, 2025; v1 submitted 21 October, 2022; originally announced October 2022.