Showing 1–2 of 2 results for author: Beeson, A

  1. arXiv:2411.11088  [pdf, other]

    stat.ML cs.LG

    An Investigation of Offline Reinforcement Learning in Factorisable Action Spaces

    Authors: Alex Beeson, David Ireland, Giovanni Montana

    Abstract: Expanding reinforcement learning (RL) to offline domains generates promising prospects, particularly in sectors where data collection poses substantial challenges or risks. Pivotal to the success of transferring RL offline is mitigating overestimation bias in value estimates for state-action pairs absent from data. Whilst numerous approaches have been proposed in recent years, these tend to focus…

    Submitted 17 November, 2024; originally announced November 2024.

    Comments: Published in Transactions on Machine Learning Research (11/2024)

  2. arXiv:2211.11802  [pdf, other]

    cs.LG cs.AI stat.ML

    Improving TD3-BC: Relaxed Policy Constraint for Offline Learning and Stable Online Fine-Tuning

    Authors: Alex Beeson, Giovanni Montana

    Abstract: The ability to discover optimal behaviour from fixed data sets has the potential to transfer the successes of reinforcement learning (RL) to domains where data collection is acutely problematic. In this offline setting, a key challenge is overcoming overestimation bias for actions not present in data which, without the ability to correct for via interaction with the environment, can propagate and…

    Submitted 21 November, 2022; originally announced November 2022.

    Comments: 3rd Offline Reinforcement Learning Workshop at Neural Information Processing Systems, 2022