
Showing 1–11 of 11 results for author: Fan, F X

  1. arXiv:2504.11511  [pdf, other]

    cs.LG cs.AI

    Position Paper: Rethinking Privacy in RL for Sequential Decision-making in the Age of LLMs

    Authors: Flint Xiaofeng Fan, Cheston Tan, Roger Wattenhofer, Yew-Soon Ong

    Abstract: The rise of reinforcement learning (RL) in critical real-world applications demands a fundamental rethinking of privacy in AI systems. Traditional privacy frameworks, designed to protect isolated data points, fall short for sequential decision-making systems where sensitive information emerges from temporal patterns, behavioral strategies, and collaborative dynamics. Modern RL paradigms, such as f…

    Submitted 15 April, 2025; originally announced April 2025.

    Comments: Accepted to IJCNN 2025 Position Paper Track

  2. arXiv:2502.00870  [pdf, other]

    cs.LG cs.AI cs.MA

    FedHPD: Heterogeneous Federated Reinforcement Learning via Policy Distillation

    Authors: Wenzheng Jiang, Ji Wang, Xiongtao Zhang, Weidong Bao, Cheston Tan, Flint Xiaofeng Fan

    Abstract: Federated Reinforcement Learning (FedRL) improves sample efficiency while preserving privacy; however, most existing studies assume homogeneous agents, limiting its applicability in real-world scenarios. This paper investigates FedRL in black-box settings with heterogeneous agents, where each agent employs distinct policy networks and training configurations without disclosing their internal detai…

    Submitted 2 February, 2025; originally announced February 2025.

    Comments: This preprint presents the full version of the Extended Abstract accepted by AAMAS 2025, including all the proofs and experiments

    ACM Class: I.2.11

  3. arXiv:2412.15538  [pdf, other]

    cs.LG cs.AI cs.CR

    FedRLHF: A Convergence-Guaranteed Federated Framework for Privacy-Preserving and Personalized RLHF

    Authors: Flint Xiaofeng Fan, Cheston Tan, Yew-Soon Ong, Roger Wattenhofer, Wei-Tsang Ooi

    Abstract: In the era of increasing privacy concerns and demand for personalized experiences, traditional Reinforcement Learning with Human Feedback (RLHF) frameworks face significant challenges due to their reliance on centralized data. We introduce Federated Reinforcement Learning with Human Feedback (FedRLHF), a novel framework that decentralizes the RLHF process. FedRLHF enables collaborative policy lear…

    Submitted 7 February, 2025; v1 submitted 19 December, 2024; originally announced December 2024.

    Comments: Updated for AAMAS 2025 camera-ready. This preprint represents the full version of the paper, including all proofs, experimental details, and additional discussions

    ACM Class: I.2.11

  4. arXiv:2410.06151  [pdf, other]

    cs.LG cs.AI

    Quality Diversity Imitation Learning

    Authors: Zhenglin Wan, Xingrui Yu, David Mark Bossens, Yueming Lyu, Qing Guo, Flint Xiaofeng Fan, Ivor Tsang

    Abstract: Imitation learning (IL) has shown great potential in various applications, such as robot control. However, traditional IL methods are usually designed to learn only one specific type of behavior since demonstrations typically correspond to a single expert. In this work, we introduce the first generic framework for Quality Diversity Imitation Learning (QD-IL), which enables the agent to learn a bro…

    Submitted 8 October, 2024; originally announced October 2024.

    Comments: 22 pages, conference paper

  5. arXiv:2406.01631  [pdf, other]

    cs.IR cs.LG

    SUBER: An RL Environment with Simulated Human Behavior for Recommender Systems

    Authors: Nathan Corecco, Giorgio Piatti, Luca A. Lanzendörfer, Flint Xiaofeng Fan, Roger Wattenhofer

    Abstract: Reinforcement learning (RL) has gained popularity in the realm of recommender systems due to its ability to optimize long-term rewards and guide users in discovering relevant content. However, the successful implementation of RL in recommender systems is challenging because of several factors, including the limited availability of online data for training on-policy methods. This scarcity requires…

    Submitted 20 August, 2024; v1 submitted 1 June, 2024; originally announced June 2024.

  6. arXiv:2403.20156  [pdf, other]

    cs.LG cs.AI

    CAESAR: Enhancing Federated RL in Heterogeneous MDPs through Convergence-Aware Sampling with Screening

    Authors: Hei Yi Mak, Flint Xiaofeng Fan, Luca A. Lanzendörfer, Cheston Tan, Wei Tsang Ooi, Roger Wattenhofer

    Abstract: In this study, we delve into Federated Reinforcement Learning (FedRL) in the context of value-based agents operating across diverse Markov Decision Processes (MDPs). Existing FedRL methods typically aggregate agents' learning by averaging the value functions across them to improve their performance. However, this aggregation strategy is suboptimal in heterogeneous environments where agents converg…

    Submitted 16 April, 2024; v1 submitted 29 March, 2024; originally announced March 2024.

  7. arXiv:2401.03489  [pdf, other]

    cs.LG cs.AI cs.DC cs.MA

    Decentralized Federated Policy Gradient with Byzantine Fault-Tolerance and Provably Fast Convergence

    Authors: Philip Jordan, Florian Grötschla, Flint Xiaofeng Fan, Roger Wattenhofer

    Abstract: In Federated Reinforcement Learning (FRL), agents aim to collaboratively learn a common task, while each agent is acting in its local environment without exchanging raw trajectories. Existing approaches for FRL either (a) do not provide any fault-tolerance guarantees (against misbehaving agents), or (b) rely on a trusted central agent (a single point of failure) for aggregating updates. We provide…

    Submitted 7 January, 2024; originally announced January 2024.

    Comments: Accepted at AAMAS'24

  8. arXiv:2306.15968  [pdf, other]

    cs.RO cs.LG

    Action and Trajectory Planning for Urban Autonomous Driving with Hierarchical Reinforcement Learning

    Authors: Xinyang Lu, Flint Xiaofeng Fan, Tianying Wang

    Abstract: Reinforcement Learning (RL) has made promising progress in planning and decision-making for Autonomous Vehicles (AVs) in simple driving scenarios. However, existing RL algorithms for AVs fail to learn critical driving skills in complex urban scenarios. First, urban driving scenarios require AVs to handle multiple driving tasks of which conventional RL algorithms are incapable. Second, the presence…

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: ICML Workshop on New Frontiers in Learning, Control, and Dynamical Systems

  9. arXiv:2301.11135  [pdf, other]

    cs.LG cs.DC

    FedHQL: Federated Heterogeneous Q-Learning

    Authors: Flint Xiaofeng Fan, Yining Ma, Zhongxiang Dai, Cheston Tan, Bryan Kian Hsiang Low, Roger Wattenhofer

    Abstract: Federated Reinforcement Learning (FedRL) encourages distributed agents to learn collectively from each other's experience to improve their performance without exchanging their raw trajectories. The existing work on FedRL assumes that all participating agents are homogeneous, which requires all agents to share the same policy parameterization (e.g., network architectures and training configurations…

    Submitted 26 January, 2023; originally announced January 2023.

    Comments: Preprint. Under review

  10. arXiv:2205.14309  [pdf, other]

    cs.LG cs.AI

    Federated Neural Bandits

    Authors: Zhongxiang Dai, Yao Shu, Arun Verma, Flint Xiaofeng Fan, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: Recent works on neural contextual bandits have achieved compelling performances due to their ability to leverage the strong representation power of neural networks (NNs) for reward prediction. Many applications of contextual bandits involve multiple agents who collaborate without sharing raw observations, thus giving rise to the setting of federated contextual bandits. Existing works on federated…

    Submitted 28 February, 2023; v1 submitted 27 May, 2022; originally announced May 2022.

    Comments: ICLR 2023. Code: https://github.com/daizhongxiang/Federated-Neural-Bandits

  11. arXiv:2110.14074  [pdf, other]

    cs.LG cs.AI

    Fault-Tolerant Federated Reinforcement Learning with Theoretical Guarantee

    Authors: Flint Xiaofeng Fan, Yining Ma, Zhongxiang Dai, Wei Jing, Cheston Tan, Bryan Kian Hsiang Low

    Abstract: The growing literature of Federated Learning (FL) has recently inspired Federated Reinforcement Learning (FRL) to encourage multiple agents to federatively build a better decision-making policy without sharing raw trajectories. Despite its promising applications, existing works on FRL fail to I) provide theoretical analysis on its convergence, and II) account for random system failures and adversa…

    Submitted 3 November, 2022; v1 submitted 26 October, 2021; originally announced October 2021.

    Comments: Published at NeurIPS 2021. Extended version with proofs and additional experimental details and results. New version changes: reduced file size of figures; added a diagram illustrating the problem setting; added link to code on GitHub; modified proof for Theorem 6 (highlighted in red)