
Showing 1–2 of 2 results for author: Malenfant, D

  1. arXiv:2505.20579  [pdf, ps, other]

    cs.LG cs.AI cs.MA

    The challenge of hidden gifts in multi-agent reinforcement learning

    Authors: Dane Malenfant, Blake A. Richards

    Abstract: Sometimes we benefit from actions that others have taken even when we are unaware that they took those actions. For example, if your neighbor chooses not to take a parking spot in front of your house when you are not there, you can benefit, even without being aware that they took this action. These "hidden gifts" represent an interesting challenge for multi-agent reinforcement learning (MARL), sin…

    Submitted 29 May, 2025; v1 submitted 26 May, 2025; originally announced May 2025.

  2. arXiv:2210.05845  [pdf, other]

    cs.LG cs.AI

    Contrastive Retrospection: honing in on critical steps for rapid learning and generalization in RL

    Authors: Chen Sun, Wannan Yang, Thomas Jiralerspong, Dane Malenfant, Benjamin Alsbury-Nealy, Yoshua Bengio, Blake Richards

    Abstract: In real life, success is often contingent upon multiple critical steps that are distant in time from each other and from the final reward. These critical steps are challenging to identify with traditional reinforcement learning (RL) methods that rely on the Bellman equation for credit assignment. Here, we present a new RL algorithm that uses offline contrastive learning to hone in on these critica…

    Submitted 27 October, 2023; v1 submitted 11 October, 2022; originally announced October 2022.