Learning-based Two-tiered Online Optimization of Region-wide Datacenter Resource Allocation
Authors:
Chang-Lin Chen,
Hanhan Zhou,
Jiayu Chen,
Mohammad Pedramfar,
Tian Lan,
Zheqing Zhu,
Chi Zhou,
Pol Mauri Ruiz,
Neeraj Kumar,
Hongbo Dong,
Vaneet Aggarwal
Abstract:
Online optimization of resource management for large-scale data centers and infrastructures to meet dynamic capacity reservation demands and various practical constraints (e.g., feasibility and robustness) is a very challenging problem. Mixed Integer Programming (MIP) approaches suffer from recognized limitations in such a dynamic environment, while learning-based approaches may face prohibitively large state/action spaces. To this end, this paper presents a novel two-tiered online optimization approach to enable a learning-based Resource Allowance System (RAS). To solve optimal server-to-reservation assignment in RAS in an online fashion, the proposed solution leverages a reinforcement learning (RL) agent to make high-level decisions, e.g., how much resource to select from the Main Switch Boards (MSBs), and then a low-level Mixed Integer Linear Programming (MILP) solver to generate the local server-to-reservation mapping, conditioned on the RL decisions. We take into account fault tolerance, server movement minimization, and network affinity requirements and apply the proposed solution to large-scale RAS problems. To provide interpretability, we further train a decision tree model to explain the learned policies and to prune unreasonable corner cases at the low-level MILP solver, resulting in further performance improvement. Extensive evaluations show that our two-tiered solution outperforms baselines such as a pure MIP solver by over $15\%$ while delivering a $100\times$ speedup in computation.
Submitted 17 October, 2024; v1 submitted 29 June, 2023;
originally announced June 2023.
From MANET to people-centric networking: milestones and open research challenges
Authors:
Marco Conti,
Chiara Boldrini,
Salil S. Kanhere,
Enzo Mingozzi,
Elena Pagani,
Pedro M. Ruiz,
Mohamed Younis
Abstract:
In this paper, we discuss the state of the art of (mobile) multi-hop ad hoc networking with the aim of presenting the current status of research activities and identifying the consolidated research areas, which offer limited research opportunities, and the hot and emerging research areas for which further research is required. We start by briefly discussing the MANET paradigm and why research on MANET protocols is now a cold research topic. Then we analyze the active research areas. Specifically, after discussing the wireless-network technologies, we analyze four successful ad hoc networking paradigms (mesh, opportunistic, vehicular, and sensor networks) that emerged from the MANET world. We also present the new research directions in the multi-hop ad hoc networking field: people-centric networking, triggered by the increasing penetration of smartphones in everyday life, which is generating a people-centric revolution in computing and communications.
Submitted 21 September, 2021;
originally announced September 2021.