Search | arXiv e-print repository

Towards Decentralized and Sustainable Foundation Model Training with the Edge

Authors: Leyang Xue, Meghana Madhyastha, Randal Burns, Myungjin Lee, Mahesh K. Marina

Abstract: Foundation models are at the forefront of AI research, appealing for their ability to learn from vast datasets and cater to diverse tasks. Yet, their significant computational demands raise issues of environmental impact and the risk of centralized control in their development. We put forward a vision towards decentralized and sustainable foundation model training that leverages the collective com… ▽ More Foundation models are at the forefront of AI research, appealing for their ability to learn from vast datasets and cater to diverse tasks. Yet, their significant computational demands raise issues of environmental impact and the risk of centralized control in their development. We put forward a vision towards decentralized and sustainable foundation model training that leverages the collective compute of sparingly used connected edge AI devices. We present the rationale behind our vision, particularly in support of its sustainability benefit. We further outline a set of challenges that need to be addressed to turn this vision into reality. △ Less

Submitted 2 July, 2025; originally announced July 2025.

arXiv:2506.23740 [pdf, ps, other]

Campus5G: A Campus Scale Private 5G Open RAN Testbed

Authors: Andrew E. Ferguson, Ujjwal Pawar, Tianxin Wang, Mahesh K. Marina

Abstract: Mobile networks are embracing disaggregation, reflected by the industry trend towards Open RAN. Private 5G networks are viewed as particularly suitable contenders as early adopters of Open RAN, owing to their setting, high degree of control, and opportunity for innovation they present. Motivated by this, we have recently deployed Campus5G, the first of its kind campus-wide, O-RAN-compliant private… ▽ More Mobile networks are embracing disaggregation, reflected by the industry trend towards Open RAN. Private 5G networks are viewed as particularly suitable contenders as early adopters of Open RAN, owing to their setting, high degree of control, and opportunity for innovation they present. Motivated by this, we have recently deployed Campus5G, the first of its kind campus-wide, O-RAN-compliant private 5G testbed across the central campus of the University of Edinburgh. We present in detail our process developing the testbed, from planning, to architecting, to deployment, and measuring the testbed performance. We then discuss the lessons learned from building the testbed, and highlight some research opportunities that emerged from our deployment experience. △ Less

Submitted 30 June, 2025; originally announced June 2025.

ACM Class: C.2.1

arXiv:2505.21115 [pdf, other]

Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification to Improve Trustworthy QA

Authors: Sergey Pletenev, Maria Marina, Nikolay Ivanov, Daria Galimzianova, Nikita Krayko, Mikhail Salnikov, Vasily Konovalov, Alexander Panchenko, Viktor Moskvoretskii

Abstract: Large Language Models (LLMs) often hallucinate in question answering (QA) tasks. A key yet underexplored factor contributing to this is the temporality of questions -- whether they are evergreen (answers remain stable over time) or mutable (answers change). In this work, we introduce EverGreenQA, the first multilingual QA dataset with evergreen labels, supporting both evaluation and training. Usin… ▽ More Large Language Models (LLMs) often hallucinate in question answering (QA) tasks. A key yet underexplored factor contributing to this is the temporality of questions -- whether they are evergreen (answers remain stable over time) or mutable (answers change). In this work, we introduce EverGreenQA, the first multilingual QA dataset with evergreen labels, supporting both evaluation and training. Using EverGreenQA, we benchmark 12 modern LLMs to assess whether they encode question temporality explicitly (via verbalized judgments) or implicitly (via uncertainty signals). We also train EG-E5, a lightweight multilingual classifier that achieves SoTA performance on this task. Finally, we demonstrate the practical utility of evergreen classification across three applications: improving self-knowledge estimation, filtering QA datasets, and explaining GPT-4o retrieval behavior. △ Less

Submitted 27 May, 2025; originally announced May 2025.

arXiv:2505.12566 [pdf, ps, other]

HybridServe: Efficient Serving of Large AI Models with Confidence-Based Cascade Routing

Authors: Leyang Xue, Yao Fu, Luo Mai, Mahesh K. Marina

Abstract: Giant Deep Neural Networks (DNNs), have become indispensable for accurate and robust support of large-scale cloud based AI services. However, serving giant DNNs is prohibitively expensive from an energy consumption viewpoint easily exceeding that of training, due to the enormous scale of GPU clusters needed to hold giant DNN model partitions and replicas. Existing approaches can either optimize en… ▽ More Giant Deep Neural Networks (DNNs), have become indispensable for accurate and robust support of large-scale cloud based AI services. However, serving giant DNNs is prohibitively expensive from an energy consumption viewpoint easily exceeding that of training, due to the enormous scale of GPU clusters needed to hold giant DNN model partitions and replicas. Existing approaches can either optimize energy efficiency or inference accuracy but not both. To overcome this status quo, we propose HybridServe, a novel hybrid DNN model serving system that leverages multiple sized versions (small to giant) of the model to be served in tandem. Through a confidence based hybrid model serving dataflow, HybridServe prefers to serve inference requests with energy-efficient smaller models so long as accuracy is not compromised, thereby reducing the number of replicas needed for giant DNNs. HybridServe also features a dataflow planner for efficient partitioning and replication of candidate models to maximize serving system throughput. Experimental results using a prototype implementation of HybridServe show that it reduces energy footprint by up to 19.8x compared to the state-of-the-art DNN model serving systems while matching the accuracy of serving solely with giant DNNs. △ Less

Submitted 18 May, 2025; originally announced May 2025.

arXiv:2505.04253 [pdf, other]

LLM-Independent Adaptive RAG: Let the Question Speak for Itself

Authors: Maria Marina, Nikolay Ivanov, Sergey Pletenev, Mikhail Salnikov, Daria Galimzianova, Nikita Krayko, Vasily Konovalov, Alexander Panchenko, Viktor Moskvoretskii

Abstract: Large Language Models~(LLMs) are prone to hallucinations, and Retrieval-Augmented Generation (RAG) helps mitigate this, but at a high computational cost while risking misinformation. Adaptive retrieval aims to retrieve only when necessary, but existing approaches rely on LLM-based uncertainty estimation, which remain inefficient and impractical. In this study, we introduce lightweight LLM-independ… ▽ More Large Language Models~(LLMs) are prone to hallucinations, and Retrieval-Augmented Generation (RAG) helps mitigate this, but at a high computational cost while risking misinformation. Adaptive retrieval aims to retrieve only when necessary, but existing approaches rely on LLM-based uncertainty estimation, which remain inefficient and impractical. In this study, we introduce lightweight LLM-independent adaptive retrieval methods based on external information. We investigated 27 features, organized into 7 groups, and their hybrid combinations. We evaluated these methods on 6 QA datasets, assessing the QA performance and efficiency. The results show that our approach matches the performance of complex LLM-based methods while achieving significant efficiency gains, demonstrating the potential of external information for adaptive retrieval. △ Less

Submitted 7 May, 2025; originally announced May 2025.

Comments: 11 pages, 5 figures, 2 tables

arXiv:2502.14502 [pdf, other]

How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?

Authors: Sergey Pletenev, Maria Marina, Daniil Moskovskiy, Vasily Konovalov, Pavel Braslavski, Alexander Panchenko, Mikhail Salnikov

Abstract: The performance of Large Language Models (LLMs) on many tasks is greatly limited by the knowledge learned during pre-training and stored in the model's parameters. Low-rank adaptation (LoRA) is a popular and efficient training technique for updating or domain-specific adaptation of LLMs. In this study, we investigate how new facts can be incorporated into the LLM using LoRA without compromising th… ▽ More The performance of Large Language Models (LLMs) on many tasks is greatly limited by the knowledge learned during pre-training and stored in the model's parameters. Low-rank adaptation (LoRA) is a popular and efficient training technique for updating or domain-specific adaptation of LLMs. In this study, we investigate how new facts can be incorporated into the LLM using LoRA without compromising the previously learned knowledge. We fine-tuned Llama-3.1-8B-instruct using LoRA with varying amounts of new knowledge. Our experiments have shown that the best results are obtained when the training data contains a mixture of known and new facts. However, this approach is still potentially harmful because the model's performance on external question-answering benchmarks declines after such fine-tuning. When the training data is biased towards certain entities, the model tends to regress to few overrepresented answers. In addition, we found that the model becomes more confident and refuses to provide an answer in only few cases. These findings highlight the potential pitfalls of LoRA-based LLM updates and underscore the importance of training data composition and tuning parameters to balance new knowledge integration and general model capabilities. △ Less

Submitted 24 March, 2025; v1 submitted 20 February, 2025; originally announced February 2025.

arXiv:2411.10108 [pdf, other]

Identifying Key Drivers of Heatwaves: A Novel Spatio-Temporal Framework for Extreme Event Detection

Authors: J. Pérez-Aracil, C. Peláez-Rodríguez, Ronan McAdam, Antonello Squintu, Cosmin M. Marina, Eugenio Lorente-Ramos, Niklas Luther, Veronica Torralba, Enrico Scoccimarro, Leone Cavicchia, Matteo Giuliani, Eduardo Zorita, Felicitas Hansen, David Barriopedro, Ricardo Garcia-Herrera, Pedro A. Gutiérrez, Jürg Luterbacher, Elena Xoplaki, Andrea Castelletti, S. Salcedo-Sanz

Abstract: Heatwaves (HWs) are extreme atmospheric events that produce significant societal and environmental impacts. Predicting these extreme events remains challenging, as their complex interactions with large-scale atmospheric and climatic variables are difficult to capture with traditional statistical and dynamical models. This work presents a general method for driver identification in extreme climate… ▽ More Heatwaves (HWs) are extreme atmospheric events that produce significant societal and environmental impacts. Predicting these extreme events remains challenging, as their complex interactions with large-scale atmospheric and climatic variables are difficult to capture with traditional statistical and dynamical models. This work presents a general method for driver identification in extreme climate events. A novel framework (STCO-FS) is proposed to identify key immediate (short-term) HW drivers by combining clustering algorithms with an ensemble evolutionary algorithm. The framework analyzes spatio-temporal data, reduces dimensionality by grouping similar geographical nodes for each variable, and develops driver selection in spatial and temporal domains, identifying the best time lags between predictive variables and HW occurrences. The proposed method has been applied to analyze HWs in the Adda river basin in Italy. The approach effectively identifies significant variables influencing HWs in this region. This research can potentially enhance our understanding of HW drivers and predictability. △ Less

Submitted 15 November, 2024; originally announced November 2024.

Comments: 28 pages, 10 figures, 4 tables

arXiv:2410.05468 [pdf, other]

PH-Dropout: Practical Epistemic Uncertainty Quantification for View Synthesis

Authors: Chuanhao Sun, Thanos Triantafyllou, Anthos Makris, Maja Drmač, Kai Xu, Luo Mai, Mahesh K. Marina

Abstract: View synthesis using Neural Radiance Fields (NeRF) and Gaussian Splatting (GS) has demonstrated impressive fidelity in rendering real-world scenarios. However, practical methods for accurate and efficient epistemic Uncertainty Quantification (UQ) in view synthesis are lacking. Existing approaches for NeRF either introduce significant computational overhead (e.g., ``10x increase in training time" o… ▽ More View synthesis using Neural Radiance Fields (NeRF) and Gaussian Splatting (GS) has demonstrated impressive fidelity in rendering real-world scenarios. However, practical methods for accurate and efficient epistemic Uncertainty Quantification (UQ) in view synthesis are lacking. Existing approaches for NeRF either introduce significant computational overhead (e.g., ``10x increase in training time" or ``10x repeated training") or are limited to specific uncertainty conditions or models. Notably, GS models lack any systematic approach for comprehensive epistemic UQ. This capability is crucial for improving the robustness and scalability of neural view synthesis, enabling active model updates, error estimation, and scalable ensemble modeling based on uncertainty. In this paper, we revisit NeRF and GS-based methods from a function approximation perspective, identifying key differences and connections in 3D representation learning. Building on these insights, we introduce PH-Dropout (Post hoc Dropout), the first real-time and accurate method for epistemic uncertainty estimation that operates directly on pre-trained NeRF and GS models. Extensive evaluations validate our theoretical findings and demonstrate the effectiveness of PH-Dropout. △ Less

Submitted 11 October, 2024; v1 submitted 7 October, 2024; originally announced October 2024.

Comments: 21 pages, in submision

arXiv:2407.09370 [pdf, other]

Learning High-Frequency Functions Made Easy with Sinusoidal Positional Encoding

Authors: Chuanhao Sun, Zhihang Yuan, Kai Xu, Luo Mai, N. Siddharth, Shuo Chen, Mahesh K. Marina

Abstract: Fourier features based positional encoding (PE) is commonly used in machine learning tasks that involve learning high-frequency features from low-dimensional inputs, such as 3D view synthesis and time series regression with neural tangent kernels. Despite their effectiveness, existing PEs require manual, empirical adjustment of crucial hyperparameters, specifically the Fourier features, tailored t… ▽ More Fourier features based positional encoding (PE) is commonly used in machine learning tasks that involve learning high-frequency features from low-dimensional inputs, such as 3D view synthesis and time series regression with neural tangent kernels. Despite their effectiveness, existing PEs require manual, empirical adjustment of crucial hyperparameters, specifically the Fourier features, tailored to each unique task. Further, PEs face challenges in efficiently learning high-frequency functions, particularly in tasks with limited data. In this paper, we introduce sinusoidal PE (SPE), designed to efficiently learn adaptive frequency features closely aligned with the true underlying function. Our experiments demonstrate that SPE, without hyperparameter tuning, consistently achieves enhanced fidelity and faster training across various tasks, including 3D view synthesis, Text-to-Speech generation, and 1D regression. SPE is implemented as a direct replacement for existing PEs. Its plug-and-play nature lets numerous tasks easily adopt and benefit from SPE. △ Less

Submitted 17 July, 2024; v1 submitted 12 July, 2024; originally announced July 2024.

Comments: 16 pages, Conference, Accepted by ICML 2024

arXiv:2401.14361 [pdf, other]

MoE-Infinity: Efficient MoE Inference on Personal Machines with Sparsity-Aware Expert Cache

Authors: Leyang Xue, Yao Fu, Zhan Lu, Luo Mai, Mahesh Marina

Abstract: This paper presents MoE-Infinity, an efficient MoE inference system designed for personal machines with limited GPU memory capacity. The key idea for MoE-Infinity is that on personal machines, which are often single-user environments, MoE-based LLMs typically operate with a batch size of one. In this setting, MoE models exhibit a high degree of activation sparsity, meaning a small number of expert… ▽ More This paper presents MoE-Infinity, an efficient MoE inference system designed for personal machines with limited GPU memory capacity. The key idea for MoE-Infinity is that on personal machines, which are often single-user environments, MoE-based LLMs typically operate with a batch size of one. In this setting, MoE models exhibit a high degree of activation sparsity, meaning a small number of experts are frequently reused in generating tokens during the decode phase. Leveraging this idea, we design a sparsity-aware expert cache, which can trace the sparse activation of experts during inference and carefully select the trace that represents the sparsity pattern. By analyzing these selected traces, MoE-Infinity guides the replacement and prefetching of the expert cache, providing 3.1-16.7x per-token latency improvements over numerous state-of-the-art systems, including vLLM, Ollama, DeepSpeed and BrainStorm across various MoE models (DeepSeek and Mixtral) when handling different LLM tasks. MoE-Infinity's source code is publicly available at https://github.com/EfficientMoE/MoE-Infinity △ Less

Submitted 12 March, 2025; v1 submitted 25 January, 2024; originally announced January 2024.

arXiv:2310.09242 [pdf, other]

doi 10.1145/3589334.3645328

A Multifaceted Look at Starlink Performance

Authors: Nitinder Mohan, Andrew Ferguson, Hendrik Cech, Prakita Rayyan Renatin, Rohan Bose, Mahesh Marina, Jörg Ott

Abstract: In recent years, Low-Earth Orbit (LEO) mega-constellations have emerged as a promising network technology and have ushered in a new era for democratizing Internet access. The Starlink network from SpaceX stands out as the only consumer-facing LEO network with over 2M+ customers and more than 4000 operational satellites. In this paper, we conduct the first-of-its-kind extensive multi-faceted analys… ▽ More In recent years, Low-Earth Orbit (LEO) mega-constellations have emerged as a promising network technology and have ushered in a new era for democratizing Internet access. The Starlink network from SpaceX stands out as the only consumer-facing LEO network with over 2M+ customers and more than 4000 operational satellites. In this paper, we conduct the first-of-its-kind extensive multi-faceted analysis of Starlink network performance leveraging several measurement sources. First, based on 19.2M crowdsourced M-Lab speed test measurements from 34 countries since 2021, we analyze Starlink global performance relative to terrestrial cellular networks. Second, we examine Starlink's ability to support real-time web-based latency and bandwidth-critical applications by analyzing the performance of (i) Zoom video conferencing, and (ii) Luna cloud gaming, comparing it to 5G and terrestrial fiber. Third, we orchestrate targeted measurements from Starlink-enabled RIPE Atlas probes to shed light on the last-mile Starlink access and other factors affecting its performance globally. Finally, we conduct controlled experiments from Starlink dishes in two countries and analyze the impact of globally synchronized "15-second reconfiguration intervals" of the links that cause substantial latency and throughput variations. Our unique analysis provides revealing insights on global Starlink functionality and paints the most comprehensive picture of the LEO network's operation to date. △ Less

Submitted 22 February, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

Comments: Accepted in ACM Web Conference 2024 (WWW 24)

Journal ref: In Proceedings of ACM Web Conference 2024 (WWW 24)

arXiv:2212.00742 [pdf, other]

New Probabilistic-Dynamic Multi-Method Ensembles for Optimization based on the CRO-SL

Authors: Jorge Pérez-Aracil, Carlos Camacho-Gómez, Eugenio Lorente-Ramos, Cosmin M. Marina, Sancho Salcedo-Sanz

Abstract: In this paper we propose new probabilistic and dynamic (adaptive) strategies to create multi-method ensembles based on the Coral Reefs Optimization with Substrate Layers (CRO-SL) algorithm. The CRO-SL is an evolutionary-based ensemble approach, able to combine different search procedures within a single population. In this work we discuss two different probabilistic strategies to improve the algor… ▽ More In this paper we propose new probabilistic and dynamic (adaptive) strategies to create multi-method ensembles based on the Coral Reefs Optimization with Substrate Layers (CRO-SL) algorithm. The CRO-SL is an evolutionary-based ensemble approach, able to combine different search procedures within a single population. In this work we discuss two different probabilistic strategies to improve the algorithm. First, we defined the Probabilistic CRO-SL (PCRO-SL), which substitutes the substrates in the CRO-SL population by {\em tags} associated with each individual. Each tag represents a different operator which will modify the individual in the reproduction phase. In each generation of the algorithm, the tags are randomly assigned to the individuals with a similar probability, obtaining this way an ensemble with a more intense change in the application of different operators to a given individual than the original CRO-SL. The second strategy discussed in this paper is the Dynamical Probabilistic CRO-SL (DPCRO-SL), in which the probability of tag assignment is modified during the evolution of the algorithm, depending on the quality of the solutions generated in each substrate. Thus, the best substrates in the search process will be assigned with a higher probability that those which showed a worse performance during the search. We test the performance of the proposed probabilistic and dynamic ensembles in different optimization problems, including benchmark functions and a real application of wind turbines layout optimization, comparing the results obtained with that of existing algorithms in the literature. △ Less

Submitted 2 December, 2022; v1 submitted 30 November, 2022; originally announced December 2022.

Comments: 18 pages, 6 figures, 5 tables

MSC Class: 68T01 and 68T20

arXiv:2009.02473 [pdf, other]

Examining Machine Learning for 5G and Beyond through an Adversarial Lens

Authors: Muhammad Usama, Rupendra Nath Mitra, Inaam Ilahi, Junaid Qadir, Mahesh K. Marina

Abstract: Spurred by the recent advances in deep learning to harness rich information hidden in large volumes of data and to tackle problems that are hard to model/solve (e.g., resource allocation problems), there is currently tremendous excitement in the mobile networks domain around the transformative potential of data-driven AI/ML based network automation, control and analytics for 5G and beyond. In this… ▽ More Spurred by the recent advances in deep learning to harness rich information hidden in large volumes of data and to tackle problems that are hard to model/solve (e.g., resource allocation problems), there is currently tremendous excitement in the mobile networks domain around the transformative potential of data-driven AI/ML based network automation, control and analytics for 5G and beyond. In this article, we present a cautionary perspective on the use of AI/ML in the 5G context by highlighting the adversarial dimension spanning multiple types of ML (supervised/unsupervised/RL) and support this through three case studies. We also discuss approaches to mitigate this adversarial ML risk, offer guidelines for evaluating the robustness of ML models, and call attention to issues surrounding ML oriented research in 5G more generally. △ Less

Submitted 5 September, 2020; originally announced September 2020.

arXiv:2007.11472 [pdf, other]

Characterization and Identification of Cloudified Mobile Network Performance Bottlenecks

Authors: G. Patounas, X. Foukas, A. Elmokashfi, M. K. Marina

Abstract: This study is a first attempt to experimentally explore the range of performance bottlenecks that 5G mobile networks can experience. To this end, we leverage a wide range of measurements obtained with a prototype testbed that captures the key aspects of a cloudified mobile network. We investigate the relevance of the metrics and a number of approaches to accurately and efficiently identify bottlen… ▽ More This study is a first attempt to experimentally explore the range of performance bottlenecks that 5G mobile networks can experience. To this end, we leverage a wide range of measurements obtained with a prototype testbed that captures the key aspects of a cloudified mobile network. We investigate the relevance of the metrics and a number of approaches to accurately and efficiently identify bottlenecks across the different locations of the network and layers of the system architecture. Our findings validate the complexity of this task in the multi-layered architecture and highlight the need for novel monitoring approaches that intelligently fuse metrics across network layers and functions. In particular, we find that distributed analytics performs reasonably well both in terms of bottleneck identification accuracy and incurred computational and communication overhead. △ Less

Submitted 23 July, 2020; v1 submitted 22 July, 2020; originally announced July 2020.

Comments: 17 pages, 16 figures, documentclass[journal,comsoc]{IEEEtran}, corrected title

arXiv:1903.00433 [pdf, other]

doi 10.1145/3308558.3313628

Urban Vibes and Rural Charms: Analysis of Geographic Diversity in Mobile Service Usage at National Scale

Authors: Rajkarn Singh, Marco Fiore, Mahesh K. Marina, Alessandro Nordio, Alberto Tarable

Abstract: We investigate spatial patterns in mobile service consumption that emerge at national scale. Our investigation focuses on a representative case study, i.e., France, where we find that: (i) the demand for popular mobile services is fairly uniform across the whole country, and only a reduced set of peculiar services (mainly operating system updates and long-lived video streaming) yields geographic d… ▽ More We investigate spatial patterns in mobile service consumption that emerge at national scale. Our investigation focuses on a representative case study, i.e., France, where we find that: (i) the demand for popular mobile services is fairly uniform across the whole country, and only a reduced set of peculiar services (mainly operating system updates and long-lived video streaming) yields geographic diversity; (ii) even for such distinguishing services, the spatial heterogeneity of demands is limited, and a small set of consumption behaviors is sufficient to characterize most of the mobile service usage across the country; (iii) the spatial distribution of these behaviors correlates well with the urbanization level, ultimately suggesting that the adoption of geographically-diverse mobile applications is linked to a dichotomy of cities and rural areas. We derive our results through the analysis of substantial measurement data collected by a major mobile network operator, leveraging an approach rooted in information theory that can be readily applied to other scenarios. △ Less

Submitted 1 March, 2019; originally announced March 2019.

Comments: to be published in Proceedings of the 2019 World Wide Web Conference (WWW'19), May 13-17 2019, San Francisco, CA, USA. 11 pages, 11 figures, 2 tables

arXiv:1812.06183 [pdf, other]

doi 10.1109/JSAC.2019.2927067

Iris: Deep Reinforcement Learning Driven Shared Spectrum Access Architecture for Indoor Neutral-Host Small Cells

Authors: Xenofon Foukas, Mahesh K. Marina, Kimon Kontovasilis

Abstract: We consider indoor mobile access, a vital use case for current and future mobile networks. For this key use case, we outline a vision that combines a neutral-host based shared small-cell infrastructure with a common pool of spectrum for dynamic sharing as a way forward to proliferate indoor small-cell deployments and open up the mobile operator ecosystem. Towards this vision, we focus on the chall… ▽ More We consider indoor mobile access, a vital use case for current and future mobile networks. For this key use case, we outline a vision that combines a neutral-host based shared small-cell infrastructure with a common pool of spectrum for dynamic sharing as a way forward to proliferate indoor small-cell deployments and open up the mobile operator ecosystem. Towards this vision, we focus on the challenges pertaining to managing access to shared spectrum (e.g., 3.5GHz US CBRS spectrum). We propose Iris, a practical shared spectrum access architecture for indoor neutral-host small-cells. At the core of Iris is a deep reinforcement learning based dynamic pricing mechanism that efficiently mediates access to shared spectrum for diverse operators in a way that provides incentives for operators and the neutral-host alike. We then present the Iris system architecture that embeds this dynamic pricing mechanism alongside cloud-RAN and RAN slicing design principles in a practical neutral-host design tailored for the indoor small-cell environment. Using a prototype implementation of the Iris system, we present extensive experimental evaluation results that not only offer insight into the Iris dynamic pricing process and its superiority over alternative approaches but also demonstrate its deployment feasibility. △ Less

Submitted 24 July, 2019; v1 submitted 14 December, 2018; originally announced December 2018.

arXiv:1608.05069 [pdf, other]

doi 10.1145/2988287.2989143

Holistic Small Cell Traffic Balancing across Licensed and Unlicensed Bands

Authors: Ursula Challita, Mahesh K. Marina

Abstract: Due to the dramatic growth in mobile data traffic on one hand and the scarcity of the licensed spectrum on the other hand, mobile operators are considering the use of unlicensed bands (especially those in 5 GHz) as complementary spectrum for providing higher system capacity and better user experience. This approach is currently being standardized by 3GPP under the name of LTE Licensed-Assisted Acc… ▽ More Due to the dramatic growth in mobile data traffic on one hand and the scarcity of the licensed spectrum on the other hand, mobile operators are considering the use of unlicensed bands (especially those in 5 GHz) as complementary spectrum for providing higher system capacity and better user experience. This approach is currently being standardized by 3GPP under the name of LTE Licensed-Assisted Access (LTE-LAA). In this paper, we take a holistic approach for LTE-LAA small cell traffic balancing by jointly optimizing the use of the licensed and unlicensed bands. We pose this traffic balancing as an optimization problem that seeks proportional fair coexistence of WiFi, small cell and macro cell users by adapting the transmission probability of the LTE-LAA small cell in the licensed and unlicensed bands. The motivation for this formulation is for the LTE-LAA small cell to switch between or aggregate licensed and unlicensed bands depending on the interference/traffic level and the number of active users in each band. We derive a closed form solution for this optimization problem and additionally propose a transmission mechanism for the operation of the LTE-LAA small cell on both bands. Through numerical and simulation results, we show that our proposed traffic balancing scheme, besides enabling better LTE-WiFi coexistence and efficient utilization of the radio resources relative to the existing traffic balancing scheme, also provides a better tradeoff between maximizing the total network throughput and achieving fairness among all network flows compared to alternative approaches. △ Less

Submitted 13 September, 2016; v1 submitted 17 August, 2016; originally announced August 2016.

Comments: Accepted for publication at MSWiM 2016

arXiv:1307.0962 [pdf, ps, other]

Auctioning based Coordinated TV White Space Spectrum Sharing for Home Networks

Authors: Saravana Manickam, Mahesh K. Marina, Sofia Pediaditaki, Maziar Nekovee

Abstract: The idea of having the geolocation database monitor the secondary use of TV white space (TVWS) spectrum and assist in coordinating the secondary usage is gaining ground. Considering the home networking use case, we leverage the geolocation database for interference-aware coordinated TVWS sharing among secondary users (home networks) using {\em short-term auctions}, thereby realize a dynamic second… ▽ More The idea of having the geolocation database monitor the secondary use of TV white space (TVWS) spectrum and assist in coordinating the secondary usage is gaining ground. Considering the home networking use case, we leverage the geolocation database for interference-aware coordinated TVWS sharing among secondary users (home networks) using {\em short-term auctions}, thereby realize a dynamic secondary market. To enable this auctioning based coordinated TVWS sharing framework, we propose an enhanced {\em market-driven TVWS spectrum access model}. For the short-term auctions, we propose an online multi-unit, iterative truthful mechanism called VERUM that takes into consideration spatially heterogeneous spectrum availability, an inherent characteristic in the TVWS context. We prove that VERUM is truthful (i.e., the best strategy for every bidder is to bid based on its true valuation) and is also efficient in that it allocates spectrum to users who value it the most. Evaluation results from scenarios with real home distributions in urban and dense-urban environments and using realistic TVWS spectrum availability maps show that VERUM performs close to optimal allocation in terms of revenue for the coordinating spectrum manager. Comparison with two existing efficient and truthful multi-unit spectrum auction schemes, VERITAS and SATYA, shows that VERUM fares better in terms of revenue, spectrum utilisation and percentage of winning bidders in diverse conditions. Taking all of the above together, VERUM can be seen to offer incentives to subscribed users encouraging them to use TVWS spectrum through greater spectrum availability (as measured by percentage of winning bidders) as well as to the coordinating spectrum manager through revenue generation. △ Less

Submitted 5 December, 2013; v1 submitted 3 July, 2013; originally announced July 2013.

Showing 1–18 of 18 results for author: Marina, M