-
PSI-PFL: Population Stability Index for Client Selection in non-IID Personalized Federated Learning
Authors:
Daniel-M. Jimenez-Gutierrez,
David Solans,
Mohammed Elbamby,
Nicolas Kourtellis
Abstract:
Federated Learning (FL) enables decentralized machine learning (ML) model training while preserving data privacy by keeping data localized across clients. However, non-independent and identically distributed (non-IID) data across clients poses a significant challenge, leading to skewed model updates and performance degradation. Addressing this, we propose PSI-PFL, a novel client selection framework for Personalized Federated Learning (PFL) that leverages the Population Stability Index (PSI) to quantify and mitigate data heterogeneity (so-called non-IIDness). Our approach selects more homogeneous clients based on PSI, reducing the impact of label skew, one of the most detrimental factors in FL performance. Experimental results over multiple data modalities (tabular, image, text) demonstrate that PSI-PFL significantly improves global model accuracy, outperforming state-of-the-art baselines by up to 10% under non-IID scenarios while ensuring fairer local performance. PSI-PFL enhances FL performance and offers practical benefits in applications where data privacy and heterogeneity are critical.
Submitted 31 May, 2025;
originally announced June 2025.
-
RAG-Enabled Intent Reasoning for Application-Network Interaction
Authors:
Salwa Mostafa,
Mohamed K. Abdel-Aziz,
Mohammed S. Elbamby,
Mehdi Bennis
Abstract:
Intent-based networking (IBN) is a promising solution to automate network operation and management. IBN aims to offer human-tailored network interaction, allowing the network to communicate in a way that aligns with the network users' language, rather than requiring the network users to understand the technical language of the network/devices. Nowadays, different applications interact with the network, each with its own specialized needs and domain language. Creating semantic languages (i.e., ontology-based languages) and associating them with each application to facilitate intent translation requires technical expertise and is neither practical nor scalable. To tackle this problem, we propose a context-aware AI framework that utilizes machine reasoning (MR), retrieval augmented generation (RAG), and generative AI technologies to interpret intents from different applications and generate structured network intents. The proposed framework allows for generalized/domain-specific intent expression and overcomes the drawbacks of large language models (LLMs) and the vanilla-RAG framework. The experimental results show that our proposed intent-RAG framework outperforms the LLM and vanilla-RAG frameworks in intent translation.
Submitted 14 May, 2025;
originally announced May 2025.
-
Intent Profiling and Translation Through Emergent Communication
Authors:
Salwa Mostafa,
Mohammed S. Elbamby,
Mohamed K. Abdel-Aziz,
Mehdi Bennis
Abstract:
To effectively express and satisfy network application requirements, intent-based network management has emerged as a promising solution. In intent-based methods, users and applications express their intent in a high-level abstract language to the network. Although this abstraction simplifies network operation, it induces many challenges to efficiently express applications' intents and map them to different network capabilities. Therefore, in this work, we propose an AI-based framework for intent profiling and translation. We consider a scenario where applications interacting with the network express their needs for network services in their domain language. The machine-to-machine communication (i.e., between applications and the network) is complex since it requires networks to learn how to understand the domain languages of each application, which is neither practical nor scalable. Instead, a framework based on emergent communication is proposed for intent profiling, in which applications express their abstract quality-of-experience (QoE) intents to the network through emergent communication messages. Subsequently, the network learns how to interpret these communication messages and map them to network capabilities (i.e., slices) to guarantee the requested Quality-of-Service (QoS). Simulation results show that the proposed method outperforms self-learning slicing and other baselines, and achieves a performance close to the perfect knowledge baseline.
Submitted 5 February, 2024;
originally announced February 2024.
-
Cooperative Multi-Agent Learning for Navigation via Structured State Abstraction
Authors:
Mohamed K. Abdelaziz,
Mohammed S. Elbamby,
Sumudu Samarakoon,
Mehdi Bennis
Abstract:
Cooperative multi-agent reinforcement learning (MARL) for navigation enables agents to cooperate to achieve their navigation goals. Using emergent communication, agents learn a communication protocol to coordinate and share information that is needed to achieve their navigation tasks. In emergent communication, symbols with no pre-specified usage rules are exchanged, and their meaning and syntax emerge through training. Learning a navigation policy along with a communication protocol in a MARL environment is highly complex due to the huge state space to be explored. To cope with this complexity, this work proposes a novel neural network architecture for jointly learning an adaptive state space abstraction and a communication protocol among agents participating in navigation tasks. The goal is to come up with an adaptive abstractor that significantly reduces the size of the state space to be explored, without degradation in the policy performance. Simulation results show that the proposed method reaches a better policy, in terms of achievable rewards, resulting in fewer training iterations compared to the case where raw states or fixed state abstraction are used. Moreover, it is shown that a communication protocol emerges during training which enables the agents to learn better policies within fewer training iterations.
Submitted 12 February, 2024; v1 submitted 20 June, 2023;
originally announced June 2023.
-
Proactive Scheduling and Caching for Wireless VR Viewport Streaming
Authors:
Mostafa Abdelrahman,
Mohammed Elbamby,
Vilho Räisänen
Abstract:
Virtual Reality (VR) applications require a high data rate for a high-quality immersive experience, in addition to low latency to avoid dizziness and motion sickness. One of the key wireless VR challenges is providing seamless connectivity and meeting the stringent latency and bandwidth requirements. This work proposes a proactive wireless VR system that utilizes information about the user's future orientation for proactive scheduling and caching. This is achieved by leveraging deep neural networks, trained on a real dataset, to predict users' orientation. The 360° scene is then partitioned using an overlapping viewports scheme so that only portions of the scene covered by the users' perceptive field-of-view are streamed. Furthermore, to minimize the backhaul latency, popular viewports are cached at the edge cloud based on spatial popularity profiles. Through extensive simulations, we show that the proposed system provides significant latency and throughput performance improvement, especially in fluctuating channels and heavy load conditions. The proactive scheduling enabled by the combination of machine learning prediction and the proposed viewport scheme reduces the mean latency by more than 80% while achieving a successful delivery rate close to 100%.
Submitted 6 October, 2021;
originally announced October 2021.
-
Cross Layer Optimization and Distributed Reinforcement Learning for Wireless 360° Video Streaming
Authors:
Anis Elgabli,
Mohammed S. Elbamby,
Cristina Perfecto,
Mounssif Krouka,
Mehdi Bennis,
Vaneet Aggarwal
Abstract:
Wirelessly streaming high quality 360 degree videos is still a challenging problem. When there are many users watching different 360 degree videos and competing for the computing and communication resources, the streaming algorithm at hand should maximize the average quality of experience (QoE) while guaranteeing a minimum rate for each user. In this paper, we propose a cross layer optimization approach that maximizes the available rate to each user and efficiently uses it to maximize users' QoE. Particularly, we consider tile based 360 degree video streaming, and we optimize a QoE metric that balances the tradeoff between maximizing each user's QoE and ensuring fairness among users. We show that the problem can be decoupled into two interrelated subproblems: (i) a physical layer subproblem whose objective is to find the download rate for each user, and (ii) an application layer subproblem whose objective is to use that rate to find a quality decision per tile such that the user's QoE is maximized. We prove that the physical layer subproblem can be solved optimally with low complexity, and an actor-critic deep reinforcement learning (DRL) approach is proposed to leverage the parallel training of multiple independent agents and solve the application layer subproblem. Extensive experiments reveal the robustness of our scheme and demonstrate its significant performance improvement compared to several baseline algorithms.
Submitted 24 September, 2024; v1 submitted 12 November, 2020;
originally announced November 2020.
-
Cross-stakeholder service orchestration for B5G through capability provisioning
Authors:
Vilho Raisanen,
Mohammed Elbamby,
Dmitry Petrov
Abstract:
Cross-stakeholder service orchestration is a generalization of 5G network slices which has potential to increase business agility in Beyond 5G (B5G). An architectural framework is proposed which enables domain operators to expose their functionalities towards E2E services as capabilities. Capability orchestration is proposed as a mechanism for exposure. The use of intent-based management for communicating domain owner's business goals to capability orchestration is analyzed. The combination of business goal input and capability orchestration provides a basis for agile monetization of domain resources for domain owners, and a building block for rich end-to-end B5G services.
Submitted 17 August, 2020;
originally announced August 2020.
-
Wireless Edge Computing with Latency and Reliability Guarantees
Authors:
Mohammed S. Elbamby,
Cristina Perfecto,
Chen-Feng Liu,
Jihong Park,
Sumudu Samarakoon,
Xianfu Chen,
Mehdi Bennis
Abstract:
Edge computing is an emerging concept based on distributing computing, storage, and control services closer to end network nodes. Edge computing lies at the heart of the fifth generation (5G) wireless systems and beyond. While current state-of-the-art networks communicate, compute, and process data in a centralized manner (at the cloud), for latency and compute-centric applications, both radio access and computational resources must be brought closer to the edge, harnessing the availability of computing and storage-enabled small cell base stations in proximity to the end devices. Furthermore, the network infrastructure must enable a distributed edge decision-making service that learns to adapt to the network dynamics with minimal latency and optimize network deployment and operation accordingly. This article will provide a fresh look at the concept of edge computing by first discussing the applications that the network edge must provide, with a special emphasis on the ensuing challenges in enabling ultra-reliable and low-latency edge computing services for mission-critical applications such as virtual reality (VR), vehicle-to-everything (V2X), edge artificial intelligence (AI), and so forth. Furthermore, several case studies where the edge is key are explored, followed by insights and prospects for future work.
Submitted 13 May, 2019;
originally announced May 2019.
-
Taming the latency in multi-user VR 360°: A QoE-aware deep learning-aided multicast framework
Authors:
Cristina Perfecto,
Mohammed S. Elbamby,
Javier Del Ser,
Mehdi Bennis
Abstract:
Immersive virtual reality (VR) applications require ultra-high data rate and low latency for smooth operation. Hence in this paper, aiming to improve VR experience in multi-user VR wireless video streaming, a deep-learning aided scheme for maximizing the quality of the delivered video chunks with low latency is proposed. Therein, the correlations in the predicted field of view (FoV) and locations of viewers watching 360° HD VR videos are capitalized on to realize a proactive FoV-centric millimeter wave (mmWave) physical-layer multicast transmission. The problem is cast as a frame quality maximization problem subject to tight latency constraints and network stability. The problem is then decoupled into HD frame request admission and scheduling subproblems, and a matching theory game is formulated to solve the scheduling subproblem by associating requests from clusters of users to mmWave small cell base stations (SBSs) for their unicast/multicast transmission. Furthermore, for realistic modeling and simulation purposes, a real VR head-tracking dataset and a deep recurrent neural network (DRNN) based on gated recurrent units (GRUs) are leveraged. Extensive simulation results show how the content-reuse for clusters of users with highly overlapping FoVs brought in by multicasting reduces the VR frame delay by 12%. This reduction is further boosted by proactiveness that cuts by half the average delays of both reactive unicast and multicast baselines while preserving HD delivery rates above 98%. Finally, enforcing tight latency bounds shortens the delay-tail as evinced by 13% lower delays in the 99th percentile.
Submitted 10 January, 2020; v1 submitted 18 November, 2018;
originally announced November 2018.
-
Edge Computing Meets Millimeter-wave Enabled VR: Paving the Way to Cutting the Cord
Authors:
Mohammed S. Elbamby,
Cristina Perfecto,
Mehdi Bennis,
Klaus Doppler
Abstract:
In this paper, a novel proactive computing and mmWave communication approach for ultra-reliable and low latency wireless virtual reality (VR) is proposed. By leveraging information about users' poses, proactive computing and caching are used to pre-compute and store users' HD video frames to minimize the computing latency. Furthermore, multi-connectivity is exploited to ensure reliable mmWave links to deliver users' requested HD frames. The performance of the proposed approach is validated on a VR network serving an interactive gaming arcade, where dynamic and real-time rendering of HD video frames is needed and impulse actions of different players impact the content to be shown. Simulation results show significant gains of up to 30% reduction in end-to-end delay and 50% in the 90th percentile communication delay.
Submitted 9 February, 2018; v1 submitted 23 January, 2018;
originally announced January 2018.
-
Towards Low-Latency and Ultra-Reliable Virtual Reality
Authors:
Mohammed S. Elbamby,
Cristina Perfecto,
Mehdi Bennis,
Klaus Doppler
Abstract:
Virtual Reality (VR) is expected to be one of the killer-applications in 5G networks. However, many technical bottlenecks and challenges need to be overcome to facilitate its wide adoption. In particular, VR requirements in terms of high-throughput, low-latency and reliable communication call for innovative solutions and fundamental research cutting across several disciplines. In view of this, this article discusses the challenges and enablers for ultra-reliable and low-latency VR. Furthermore, in an interactive VR gaming arcade case study, we show that a smart network design that leverages the use of mmWave communication, edge computing and proactive caching can achieve the future vision of VR over wireless.
Submitted 23 January, 2018;
originally announced January 2018.
-
Resource Optimization and Power Allocation in In-band Full Duplex (IBFD)-Enabled Non-Orthogonal Multiple Access Networks
Authors:
Mohammed S. Elbamby,
Mehdi Bennis,
Walid Saad,
Mérouane Debbah,
Matti Latva-aho
Abstract:
In this paper, the problem of uplink (UL) and downlink (DL) resource optimization, mode selection and power allocation is studied for wireless cellular networks under the assumption of in-band full duplex (IBFD) base stations, non-orthogonal multiple access (NOMA) operation, and queue stability constraints. The problem is formulated as a network utility maximization problem for which a Lyapunov framework is used to decompose it into two disjoint subproblems of auxiliary variable selection and rate maximization. The latter is further decoupled into a user association and mode selection (UAMS) problem and a UL/DL power optimization (UDPO) problem that are solved concurrently. The UAMS problem is modeled as a many-to-one matching problem to associate users to small cell base stations (SBSs) and select transmission mode (half/full-duplex and orthogonal/non-orthogonal multiple access), and an algorithm is proposed to solve the problem converging to a pairwise stable matching. Subsequently, the UDPO problem is formulated as a sequence of convex problems and is solved using the concave-convex procedure. Simulation results demonstrate the effectiveness of the proposed scheme to allocate UL and DL power levels after dynamically selecting the operating mode and the served users, under different traffic intensity conditions, network density, and self-interference cancellation capability. The proposed scheme is shown to achieve up to 63% and 73% of gains in UL and DL packet throughput, and 21% and 17% in UL and DL cell edge throughput, respectively, compared to existing baseline schemes.
Submitted 17 June, 2017;
originally announced June 2017.
-
Proactive Edge Computing in Latency-Constrained Fog Networks
Authors:
Mohammed S. Elbamby,
Mehdi Bennis,
Walid Saad
Abstract:
In this paper, the fundamental problem of distribution and proactive caching of computing tasks in fog networks is studied under latency and reliability constraints. In the proposed scenario, computing can be executed either locally at the user device or offloaded to an edge cloudlet. Moreover, cloudlets exploit both their computing and storage capabilities by proactively caching popular task computation results to minimize computing latency. To this end, a clustering method to group spatially proximate user devices with mutual task popularity interests and their serving cloudlets is proposed. Then, cloudlets can proactively cache the popular tasks' computations of their cluster members to minimize computing latency. Additionally, the problem of distributing tasks to cloudlets is formulated as a matching game in which a cost function of computing delay is minimized under latency and reliability constraints. Simulation results show that the proposed scheme guarantees reliable computations with bounded latency and achieves up to 91% decrease in computing latency as compared to baseline schemes.
Submitted 26 April, 2017; v1 submitted 22 April, 2017;
originally announced April 2017.
-
Content-Aware User Clustering and Caching in Wireless Small Cell Networks
Authors:
Mohammed S. ElBamby,
Mehdi Bennis,
Walid Saad,
Matti Latva-aho
Abstract:
In this paper, the problem of content-aware user clustering and content caching in wireless small cell networks is studied. In particular, a service delay minimization problem is formulated, aiming at optimally caching contents at the small cell base stations (SCBSs). To solve the optimization problem, we decouple it into two interrelated subproblems. First, a clustering algorithm is proposed grouping users with similar content popularity to associate similar users to the same SCBS, when possible. Second, a reinforcement learning algorithm is proposed to enable each SCBS to learn the popularity distribution of contents requested by its group of users and optimize its caching strategy accordingly. Simulation results show that by correlating the different popularity patterns of different users, the proposed scheme is able to minimize the service delay by 42% and 27%, while achieving a higher offloading gain of up to 280% and 90%, respectively, compared to random caching and unclustered learning schemes.
Submitted 11 September, 2014;
originally announced September 2014.
-
Dynamic Uplink-Downlink Optimization in TDD-based Small Cell Networks
Authors:
Mohammed S. ElBamby,
Mehdi Bennis,
Walid Saad,
Matti Latva-aho
Abstract:
Dynamic time-division duplex (TDD) can provide efficient and flexible splitting of the common wireless cellular resources between uplink (UL) and downlink (DL) users. In this paper, the UL/DL optimization problem is formulated as a noncooperative game among the small cell base stations (SCBSs) in which each base station aims at minimizing its total UL and DL flow delays. To solve this game, a self-organizing UL/DL resource configuration scheme for TDD-based small cell networks is proposed. Using the proposed scheme, an SCBS is able to estimate and learn the UL and DL loads autonomously while optimizing its UL/DL configuration accordingly. Simulation results show that the proposed algorithm achieves significant gains in terms of packet throughput in case of asymmetric UL and DL traffic loads. This gain increases as the traffic asymmetry increases, reaching up to 97% and 200% gains relative to random and fixed duplexing schemes, respectively. Our results also show that the proposed algorithm is well-adapted to dynamic traffic conditions and different network sizes, and operates efficiently in case of severe cross-link interference in which neighboring cells transmit in opposite directions.
Submitted 11 September, 2014; v1 submitted 28 February, 2014;
originally announced February 2014.