Skip to main content

Showing 1–3 of 3 results for author: Milenovic, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.00410  [pdf, other

    cs.LG

    UCB-driven Utility Function Search for Multi-objective Reinforcement Learning

    Authors: Yucheng Shi, Alexandros Agapitos, David Lynch, Giorgio Cruciata, Cengis Hasan, Hao Wang, Yayu Yao, Aleksandar Milenovic

    Abstract: In Multi-objective Reinforcement Learning (MORL) agents are tasked with optimising decision-making behaviours that trade-off between multiple, possibly conflicting, objectives. MORL based on decomposition is a family of solution methods that employ a number of utility functions to decompose the multi-objective problem into individual single-objective problems solved simultaneously in order to appr… ▽ More

    Submitted 16 May, 2024; v1 submitted 1 May, 2024; originally announced May 2024.

  2. arXiv:2404.19462  [pdf, other

    cs.LG

    Continual Model-based Reinforcement Learning for Data Efficient Wireless Network Optimisation

    Authors: Cengis Hasan, Alexandros Agapitos, David Lynch, Alberto Castagna, Giorgio Cruciata, Hao Wang, Aleksandar Milenovic

    Abstract: We present a method that addresses the pain point of long lead-time required to deploy cell-level parameter optimisation policies to new wireless network sites. Given a sequence of action spaces represented by overlapping subsets of cell-level configuration parameters provided by domain experts, we formulate throughput optimisation as Continual Reinforcement Learning of control policies. Simulatio… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: Published at ECML 2023

  3. arXiv:2111.08587  [pdf, other

    cs.AI

    Offline Contextual Bandits for Wireless Network Optimization

    Authors: Miguel Suau, Alexandros Agapitos, David Lynch, Derek Farrell, Mingqi Zhou, Aleksandar Milenovic

    Abstract: The explosion in mobile data traffic together with the ever-increasing expectations for higher quality of service call for the development of AI algorithms for wireless network optimization. In this paper, we investigate how to learn policies that can automatically adjust the configuration parameters of every cell in the network in response to the changes in the user demand. Our solution combines… ▽ More

    Submitted 11 November, 2021; originally announced November 2021.