Showing 1–1 of 1 results for author: Neufang, S

  1. arXiv:2505.02579  [pdf, ps, other]

    cs.CL cs.AI cs.LG

    EMORL: Ensemble Multi-Objective Reinforcement Learning for Efficient and Flexible LLM Fine-Tuning

    Authors: Lingxiao Kong, Cong Yang, Susanne Neufang, Oya Deniz Beyan, Zeyd Boukhers

    Abstract: Recent advances in reinforcement learning (RL) for large language model (LLM) fine-tuning show promise in addressing multi-objective tasks but still face significant challenges, including competing objective balancing, low training efficiency, poor scalability, and limited explainability. Leveraging ensemble learning principles, we introduce an Ensemble Multi-Objective RL (EMORL) framework that fi…

    Submitted 9 July, 2025; v1 submitted 5 May, 2025; originally announced May 2025.

    Comments: 14 pages, 9 figures, accepted by the SIGDIAL 2025 conference