Skip to main content

Showing 1–3 of 3 results for author: Sufiyan, Z

Searching in archive cs. Search in all archives.
.
  1. arXiv:2502.15214  [pdf, other

    cs.LG cs.AI cs.CL

    The Evolving Landscape of LLM- and VLM-Integrated Reinforcement Learning

    Authors: Sheila Schoepp, Masoud Jafaripour, Yingyue Cao, Tianpei Yang, Fatemeh Abdollahi, Shadan Golestan, Zahin Sufiyan, Osmar R. Zaiane, Matthew E. Taylor

    Abstract: Reinforcement learning (RL) has shown impressive results in sequential decision-making tasks. Meanwhile, Large Language Models (LLMs) and Vision-Language Models (VLMs) have emerged, exhibiting impressive capabilities in multimodal understanding and reasoning. These advances have led to a surge of research integrating LLMs and VLMs into RL. In this survey, we review representative works in which LL… ▽ More

    Submitted 21 February, 2025; originally announced February 2025.

    Comments: 9 pages, 4 figures

  2. arXiv:2501.14271  [pdf, other

    cs.LG

    TLXML: Task-Level Explanation of Meta-Learning via Influence Functions

    Authors: Yoshihiro Mitsuka, Shadan Golestan, Zahin Sufiyan, Sheila Schoepp, Shotaro Miwa, Osmar R. Zaiane

    Abstract: The scheme of adaptation via meta-learning is seen as an ingredient for solving the problem of data shortage or distribution shift in real-world applications, but it also brings the new risk of inappropriate updates of the model in the user environment, which increases the demand for explainability. Among the various types of XAI methods, establishing a method of explanation based on past experien… ▽ More

    Submitted 7 February, 2025; v1 submitted 24 January, 2025; originally announced January 2025.

    Comments: 22 pages; v2: modification in metadata

  3. arXiv:2501.03405  [pdf, other

    cs.RO

    A Study of the Efficacy of Generative Flow Networks for Robotics and Machine Fault-Adaptation

    Authors: Zahin Sufiyan, Shadan Golestan, Shotaro Miwa, Yoshihiro Mitsuka, Osmar Zaiane

    Abstract: Advancements in robotics have opened possibilities to automate tasks in various fields such as manufacturing, emergency response and healthcare. However, a significant challenge that prevents robots from operating in real-world environments effectively is out-of-distribution (OOD) situations, wherein robots encounter unforseen situations. One major OOD situations is when robots encounter faults, m… ▽ More

    Submitted 6 January, 2025; originally announced January 2025.