Skip to main content

Showing 1–4 of 4 results for author: Chatzaroulas, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.11289  [pdf, other

    cs.AI cs.LG

    Meta-World+: An Improved, Standardized, RL Benchmark

    Authors: Reginald McLean, Evangelos Chatzaroulas, Luc McCutcheon, Frank Röder, Tianhe Yu, Zhanpeng He, K. R. Zentner, Ryan Julian, J K Terry, Isaac Woungang, Nariman Farsad, Pablo Samuel Castro

    Abstract: Meta-World is widely used for evaluating multi-task and meta-reinforcement learning agents, which are challenged to master diverse skills simultaneously. Since its introduction however, there have been numerous undocumented changes which inhibit a fair comparison of algorithms. This work strives to disambiguate these results from the literature, while also leveraging the past versions of Meta-Worl… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

  2. arXiv:2503.05126  [pdf, other

    cs.LG cs.AI

    Multi-Task Reinforcement Learning Enables Parameter Scaling

    Authors: Reginald McLean, Evangelos Chatzaroulas, Jordan Terry, Isaac Woungang, Nariman Farsad, Pablo Samuel Castro

    Abstract: Multi-task reinforcement learning (MTRL) aims to endow a single agent with the ability to perform well on multiple tasks. Recent works have focused on developing novel sophisticated architectures to improve performance, often resulting in larger models; it is unclear, however, whether the performance gains are a consequence of the architecture design itself or the extra parameters. We argue that g… ▽ More

    Submitted 12 March, 2025; v1 submitted 6 March, 2025; originally announced March 2025.

  3. arXiv:2409.15867  [pdf, other

    cs.AI

    In-Context Ensemble Learning from Pseudo Labels Improves Video-Language Models for Low-Level Workflow Understanding

    Authors: Moucheng Xu, Evangelos Chatzaroulas, Luc McCutcheon, Abdul Ahad, Hamzah Azeem, Janusz Marecki, Ammar Anwar

    Abstract: A Standard Operating Procedure (SOP) defines a low-level, step-by-step written guide for a business software workflow. SOP generation is a crucial step towards automating end-to-end software workflows. Manually creating SOPs can be time-consuming. Recent advancements in large video-language models offer the potential for automating SOP generation by analyzing recordings of human demonstrations. Ho… ▽ More

    Submitted 20 October, 2024; v1 submitted 24 September, 2024; originally announced September 2024.

    Comments: To appear in NeurIPS Workshop on Video-Language Models 2024

  4. arXiv:2210.12229  [pdf, other

    cs.LG cs.AI eess.SY math.OC

    Deep Reinforcement Learning for Stabilization of Large-scale Probabilistic Boolean Networks

    Authors: Sotiris Moschoyiannis, Evangelos Chatzaroulas, Vytenis Sliogeris, Yuhu Wu

    Abstract: The ability to direct a Probabilistic Boolean Network (PBN) to a desired state is important to applications such as targeted therapeutics in cancer biology. Reinforcement Learning (RL) has been proposed as a framework that solves a discrete-time optimal control problem cast as a Markov Decision Process. We focus on an integrative framework powered by a model-free deep RL method that can address di… ▽ More

    Submitted 25 October, 2022; v1 submitted 21 October, 2022; originally announced October 2022.