Skip to main content

Showing 1–7 of 7 results for author: Hepburn, A

Searching in archive stat. Search in all archives.
.
  1. arXiv:2405.14374  [pdf, other

    stat.ML cs.AI cs.LG

    State-Constrained Offline Reinforcement Learning

    Authors: Charles A. Hepburn, Yue Jin, Giovanni Montana

    Abstract: Traditional offline reinforcement learning methods predominantly operate in a batch-constrained setting. This confines the algorithms to a specific state-action distribution present in the dataset, reducing the effects of distributional shift but restricting the algorithm greatly. In this paper, we alleviate this limitation by introducing a novel framework named \emph{state-constrained} offline re… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  2. arXiv:2212.04280  [pdf, other

    stat.ML cs.LG

    Model-based trajectory stitching for improved behavioural cloning and its applications

    Authors: Charles A. Hepburn, Giovanni Montana

    Abstract: Behavioural cloning (BC) is a commonly used imitation learning method to infer a sequential decision-making policy from expert demonstrations. However, when the quality of the data is not optimal, the resulting behavioural policy also performs sub-optimally once deployed. Recently, there has been a surge in offline reinforcement learning methods that hold the promise to extract high-quality polici… ▽ More

    Submitted 8 December, 2022; originally announced December 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2211.11603

  3. arXiv:2211.11603  [pdf, other

    cs.LG stat.ML

    Model-based Trajectory Stitching for Improved Offline Reinforcement Learning

    Authors: Charles A. Hepburn, Giovanni Montana

    Abstract: In many real-world applications, collecting large and high-quality datasets may be too costly or impractical. Offline reinforcement learning (RL) aims to infer an optimal decision-making policy from a fixed set of data. Getting the most information from historical data is then vital for good performance once the policy is deployed. We propose a model-based data augmentation strategy, Trajectory St… ▽ More

    Submitted 21 November, 2022; originally announced November 2022.

    Comments: Offline RL Workshop at Neural Information Processing Systems, 2022

  4. arXiv:2102.10951  [pdf, other

    cs.CV stat.ML

    Explainers in the Wild: Making Surrogate Explainers Robust to Distortions through Perception

    Authors: Alexander Hepburn, Raul Santos-Rodriguez

    Abstract: Explaining the decisions of models is becoming pervasive in the image processing domain, whether it is by using post-hoc methods or by creating inherently interpretable models. While the widespread use of surrogate explainers is a welcome addition to inspect and understand black-box models, assessing the robustness and reliability of the explanations is key for their success. Additionally, whilst… ▽ More

    Submitted 16 June, 2021; v1 submitted 22 February, 2021; originally announced February 2021.

    Journal ref: 2021 IEEE International Conference on Image Processing (ICIP), Anchorage, Alaska, USA

  5. arXiv:1910.13016  [pdf, other

    cs.LG stat.ML

    bLIMEy: Surrogate Prediction Explanations Beyond LIME

    Authors: Kacper Sokol, Alexander Hepburn, Raul Santos-Rodriguez, Peter Flach

    Abstract: Surrogate explainers of black-box machine learning predictions are of paramount importance in the field of eXplainable Artificial Intelligence since they can be applied to any type of data (images, text and tabular), are model-agnostic and are post-hoc (i.e., can be retrofitted). The Local Interpretable Model-agnostic Explanations (LIME) algorithm is often mistakenly unified with a more general fr… ▽ More

    Submitted 28 October, 2019; originally announced October 2019.

    Comments: 2019 Workshop on Human-Centric Machine Learning (HCML 2019); 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada

  6. arXiv:1910.12548  [pdf, other

    cs.LG eess.IV stat.ML

    PerceptNet: A Human Visual System Inspired Neural Network for Estimating Perceptual Distance

    Authors: Alexander Hepburn, Valero Laparra, Jesús Malo, Ryan McConville, Raul Santos-Rodriguez

    Abstract: Traditionally, the vision community has devised algorithms to estimate the distance between an original image and images that have been subject to perturbations. Inspiration was usually taken from the human visual perceptual system and how the system processes different perturbations in order to replicate to what extent it determines our ability to judge image quality. While recent works have pres… ▽ More

    Submitted 17 November, 2020; v1 submitted 28 October, 2019; originally announced October 2019.

    Journal ref: 2020 IEEE International Conference on Image Processing (ICIP), Abu Dhabi, United Arab Emirates, 2020, pp. 121-125

  7. arXiv:1908.04347  [pdf, other

    cs.CV cs.LG eess.IV stat.ML

    Enforcing Perceptual Consistency on Generative Adversarial Networks by Using the Normalised Laplacian Pyramid Distance

    Authors: Alexander Hepburn, Valero Laparra, Ryan McConville, Raul Santos-Rodriguez

    Abstract: In recent years there has been a growing interest in image generation through deep learning. While an important part of the evaluation of the generated images usually involves visual inspection, the inclusion of human perception as a factor in the training process is often overlooked. In this paper we propose an alternative perceptual regulariser for image-to-image translation using conditional ge… ▽ More

    Submitted 17 November, 2020; v1 submitted 9 August, 2019; originally announced August 2019.

    Journal ref: Proceedings of the Northern Lights Deep Learning Workshop. Vol. 1. 2020