Showing 1–14 of 14 results for author: Hafez, M B

  1. Continual Deep Reinforcement Learning with Task-Agnostic Policy Distillation

    Authors: Muhammad Burhan Hafez, Kerim Erekmen

    Abstract: Central to the development of universal learning systems is the ability to solve multiple tasks without retraining from scratch when new data arrives. This is crucial because each task requires significant training time. Addressing the problem of continual learning necessitates various methods due to the complexity of the problem space. This problem space includes: (1) addressing catastrophic forg…

    Submitted 25 November, 2024; originally announced November 2024.

    Comments: Accepted for publication in Scientific Reports

  2. arXiv:2407.18841  [pdf, other]

    cs.LG

    QT-TDM: Planning With Transformer Dynamics Model and Autoregressive Q-Learning

    Authors: Mostafa Kotb, Cornelius Weber, Muhammad Burhan Hafez, Stefan Wermter

    Abstract: Inspired by the success of the Transformer architecture in natural language processing and computer vision, we investigate the use of Transformers in Reinforcement Learning (RL), specifically in modeling the environment's dynamics using Transformer Dynamics Models (TDMs). We evaluate the capabilities of TDMs for continuous control in real-time planning scenarios with Model Predictive Control (MPC)…

    Submitted 16 November, 2024; v1 submitted 26 July, 2024; originally announced July 2024.

    Comments: Accepted by IEEE Robotics and Automation Letters (RA-L)

  3. Continual Robot Learning using Self-Supervised Task Inference

    Authors: Muhammad Burhan Hafez, Stefan Wermter

    Abstract: Endowing robots with the human ability to learn a growing set of skills over the course of a lifetime as opposed to mastering single tasks is an open problem in robot learning. While multi-task learning approaches have been proposed to address this problem, they pay little attention to task inference. In order to continually learn new tasks, the robot first needs to infer the task at hand without…

    Submitted 10 September, 2023; originally announced September 2023.

    Comments: Accepted for publication in IEEE Transactions on Cognitive and Developmental Systems

  4. arXiv:2305.02054  [pdf]

    cs.LG cs.AI cs.RO

    Map-based Experience Replay: A Memory-Efficient Solution to Catastrophic Forgetting in Reinforcement Learning

    Authors: Muhammad Burhan Hafez, Tilman Immisch, Tom Weber, Stefan Wermter

    Abstract: Deep Reinforcement Learning agents often suffer from catastrophic forgetting, forgetting previously found solutions in parts of the input space when training on new data. Replay Memories are a common solution to the problem, decorrelating and shuffling old and new training samples. They naively store state transitions as they come in, without regard for redundancy. We introduce a novel cognitive-i…

    Submitted 28 August, 2023; v1 submitted 3 May, 2023; originally announced May 2023.

    Journal ref: Frontiers in Neurorobotics 17:1127642 (2023)

  5. arXiv:2304.07219  [pdf, other]

    cs.LG cs.AI

    Model Predictive Control with Self-supervised Representation Learning

    Authors: Jonas Matthies, Muhammad Burhan Hafez, Mostafa Kotb, Stefan Wermter

    Abstract: Over the last few years, we have not seen any major developments in model-free or model-based learning methods that would make one obsolete relative to the other. In most cases, the used technique is heavily dependent on the use case scenario or other attributes, e.g. the environment. Both approaches have their own advantages, for example, sample efficiency or computational efficiency. However, wh…

    Submitted 14 April, 2023; originally announced April 2023.

  6. arXiv:2303.08268  [pdf, other]

    cs.RO cs.AI cs.CL cs.LG cs.SD eess.AS

    Chat with the Environment: Interactive Multimodal Perception Using Large Language Models

    Authors: Xufeng Zhao, Mengdi Li, Cornelius Weber, Muhammad Burhan Hafez, Stefan Wermter

    Abstract: Programming robot behavior in a complex world faces challenges on multiple levels, from dextrous low-level skills to high-level planning and reasoning. Recent pre-trained Large Language Models (LLMs) have shown remarkable reasoning ability in few-shot robotic planning. However, it remains challenging to ground LLMs in multimodal sensory input and continuous action output, while enabling a robot to…

    Submitted 11 October, 2023; v1 submitted 14 March, 2023; originally announced March 2023.

    Comments: IROS2023, Detroit. See the project website at https://matcha-agent.github.io

  7. arXiv:2301.03353  [pdf, other]

    cs.CL cs.AI cs.NE cs.RO

    Learning Bidirectional Action-Language Translation with Limited Supervision and Incongruent Input

    Authors: Ozan Özdemir, Matthias Kerzel, Cornelius Weber, Jae Hee Lee, Muhammad Burhan Hafez, Patrick Bruns, Stefan Wermter

    Abstract: Human infant learning happens during exploration of the environment, by interaction with objects, and by listening to and repeating utterances casually, which is analogous to unsupervised learning. Only occasionally, a learning infant would receive a matching verbal description of an action it is committing, which is similar to supervised learning. Such a learning mechanism can be mimicked with de…

    Submitted 22 February, 2023; v1 submitted 9 January, 2023; originally announced January 2023.

    Comments: Published in: Applied Artificial Intelligence, 37:1, 2179167

    Journal ref: Applied Artificial Intelligence Volume 37, 2023 - Issue 1

  8. arXiv:2208.02680  [pdf, other]

    cs.RO cs.LG cs.SD eess.AS

    Impact Makes a Sound and Sound Makes an Impact: Sound Guides Representations and Explorations

    Authors: Xufeng Zhao, Cornelius Weber, Muhammad Burhan Hafez, Stefan Wermter

    Abstract: Sound is one of the most informative and abundant modalities in the real world while being robust to sense without contacts by small and cheap sensors that can be placed on mobile devices. Although deep learning is capable of extracting information from multiple sensory inputs, there has been little use of sound for the control and learning of robotic actions. For unsupervised reinforcement learni…

    Submitted 4 August, 2022; originally announced August 2022.

    Comments: Accepted at IROS 2022

  9. Behavior Self-Organization Supports Task Inference for Continual Robot Learning

    Authors: Muhammad Burhan Hafez, Stefan Wermter

    Abstract: Recent advances in robot learning have enabled robots to become increasingly better at mastering a predefined set of tasks. On the other hand, as humans, we have the ability to learn a growing set of tasks over our lifetime. Continual robot learning is an emerging research direction with the goal of endowing robots with this ability. In order to learn new tasks over time, the robot first needs to…

    Submitted 9 July, 2021; originally announced July 2021.

    Comments: Accepted at IROS 2021

  10. Improving Model-Based Reinforcement Learning with Internal State Representations through Self-Supervision

    Authors: Julien Scholz, Cornelius Weber, Muhammad Burhan Hafez, Stefan Wermter

    Abstract: Using a model of the environment, reinforcement learning agents can plan their future moves and achieve superhuman performance in board games like Chess, Shogi, and Go, while remaining relatively sample-efficient. As demonstrated by the MuZero Algorithm, the environment model can even be learned dynamically, generalizing the agent to many more tasks while at the same time achieving state-of-the-ar…

    Submitted 10 February, 2021; originally announced February 2021.

    Journal ref: Proc. Intl. Joint Conf. Neural Networks (IJCNN), 2021, forthcoming

  11. arXiv:2004.08830  [pdf]

    cs.LG cs.AI cs.RO stat.ML

    Improving Robot Dual-System Motor Learning with Intrinsically Motivated Meta-Control and Latent-Space Experience Imagination

    Authors: Muhammad Burhan Hafez, Cornelius Weber, Matthias Kerzel, Stefan Wermter

    Abstract: Combining model-based and model-free learning systems has been shown to improve the sample efficiency of learning to perform complex robotic tasks. However, dual-system approaches fail to consider the reliability of the learned model when it is applied to make multiple-step predictions, resulting in a compounding of prediction errors and performance degradation. In this paper, we present a novel d…

    Submitted 1 November, 2020; v1 submitted 19 April, 2020; originally announced April 2020.

    Journal ref: Robotics and Autonomous Systems 133 (2020) 103630

  12. arXiv:1910.04729  [pdf]

    cs.LG cs.AI cs.RO stat.ML

    Efficient Intrinsically Motivated Robotic Grasping with Learning-Adaptive Imagination in Latent Space

    Authors: Muhammad Burhan Hafez, Cornelius Weber, Matthias Kerzel, Stefan Wermter

    Abstract: Combining model-based and model-free deep reinforcement learning has shown great promise for improving sample efficiency on complex control tasks while still retaining high performance. Incorporating imagination is a recent effort in this direction inspired by human mental simulation of motor behavior. We propose a learning-adaptive imagination approach which, unlike previous approaches, takes int…

    Submitted 10 October, 2019; originally announced October 2019.

    Comments: In: Proceedings of the Joint IEEE International Conference on Development and Learning and on Epigenetic Robotics (ICDL-EpiRob), Oslo, Norway, Aug. 19-22, 2019

  13. arXiv:1905.01718  [pdf, ps, other]

    cs.LG cs.AI cs.RO stat.ML

    Curious Meta-Controller: Adaptive Alternation between Model-Based and Model-Free Control in Deep Reinforcement Learning

    Authors: Muhammad Burhan Hafez, Cornelius Weber, Matthias Kerzel, Stefan Wermter

    Abstract: Recent success in deep reinforcement learning for continuous control has been dominated by model-free approaches which, unlike model-based approaches, do not suffer from representational limitations in making assumptions about the world dynamics and model errors inevitable in complex domains. However, they require a lot of experiences compared to model-based approaches that are typically more samp…

    Submitted 5 May, 2019; originally announced May 2019.

    Comments: Accepted at IJCNN 2019

  14. arXiv:1810.11388  [pdf]

    cs.LG cs.AI cs.RO

    Deep Intrinsically Motivated Continuous Actor-Critic for Efficient Robotic Visuomotor Skill Learning

    Authors: Muhammad Burhan Hafez, Cornelius Weber, Matthias Kerzel, Stefan Wermter

    Abstract: In this paper, we present a new intrinsically motivated actor-critic algorithm for learning continuous motor skills directly from raw visual input. Our neural architecture is composed of a critic and an actor network. Both networks receive the hidden representation of a deep convolutional autoencoder which is trained to reconstruct the visual input, while the centre-most hidden representation is a…

    Submitted 18 February, 2019; v1 submitted 26 October, 2018; originally announced October 2018.

    Journal ref: Paladyn, Journal of Behavioral Robotics, Volume 10, Issue 1, Pages 14-29, 2019