Skip to main content

Showing 1–2 of 2 results for author: Shpigelman, Y

Searching in archive cs. Search in all archives.
.
  1. Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs

    Authors: Benjamin Fuhrer, Yuval Shpigelman, Chen Tessler, Shie Mannor, Gal Chechik, Eitan Zahavi, Gal Dalal

    Abstract: As communication protocols evolve, datacenter network utilization increases. As a result, congestion is more frequent, causing higher latency and packet loss. Combined with the increasing complexity of workloads, manual design of congestion control (CC) algorithms becomes extremely difficult. This calls for the development of AI approaches to replace the human effort. Unfortunately, it is currentl… ▽ More

    Submitted 1 June, 2024; v1 submitted 5 July, 2022; originally announced July 2022.

  2. arXiv:2102.09337  [pdf, other

    cs.LG cs.AI cs.NI

    Reinforcement Learning for Datacenter Congestion Control

    Authors: Chen Tessler, Yuval Shpigelman, Gal Dalal, Amit Mandelbaum, Doron Haritan Kazakov, Benjamin Fuhrer, Gal Chechik, Shie Mannor

    Abstract: We approach the task of network congestion control in datacenters using Reinforcement Learning (RL). Successful congestion control algorithms can dramatically improve latency and overall network throughput. Until today, no such learning-based algorithms have shown practical potential in this domain. Evidently, the most popular recent deployments rely on rule-based heuristics that are tested on a p… ▽ More

    Submitted 29 June, 2022; v1 submitted 18 February, 2021; originally announced February 2021.

    Comments: Presented at IAAI 2022