Skip to main content

Showing 1–2 of 2 results for author: Raveh, O

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.16168  [pdf, other

    cs.LG stat.ML

    Multi-Player Approaches for Dueling Bandits

    Authors: Or Raveh, Junya Honda, Masashi Sugiyama

    Abstract: Various approaches have emerged for multi-armed bandits in distributed systems. The multiplayer dueling bandit problem, common in scenarios with only preference-based information like human feedback, introduces challenges related to controlling collaborative exploration of non-informative arm pairs, but has received little attention. To fill this gap, we demonstrate that the direct use of a Follow… ▽ More

    Submitted 23 April, 2025; v1 submitted 25 May, 2024; originally announced May 2024.

  2. arXiv:1905.09951  [pdf, other

    cs.LG stat.ML

    PAC Guarantees for Cooperative Multi-Agent Reinforcement Learning with Restricted Communication

    Authors: Or Raveh, Ron Meir

    Abstract: We develop model free PAC performance guarantees for multiple concurrent MDPs, extending recent works where a single learner interacts with multiple non-interacting agents in a noise free environment. Our framework allows noisy and resource limited communication between agents, and develops novel PAC guarantees in this extended setting. By allowing communication between the agents themselves, we s… ▽ More

    Submitted 10 October, 2019; v1 submitted 23 May, 2019; originally announced May 2019.