-
LLMs are everywhere: Ubiquitous Utilization of AI Models through Air Computing
Authors:
Baris Yamansavascilar,
Atay Ozgovde,
Cem Ersoy
Abstract:
We are witnessing a new era where problem-solving and cognitive tasks are being increasingly delegated to Large Language Models (LLMs) across diverse domains, ranging from code generation to holiday planning. This trend also creates a demand for the ubiquitous execution of LLM-powered applications in a wide variety of environments in which traditional terrestrial 2D networking infrastructures may prove insufficient. A promising solution in this context is to extend edge computing into a 3D setting to include aerial platforms organized in multiple layers, a paradigm we refer to as air computing, to augment local devices for running LLM and Generative AI (GenAI) applications. This approach alleviates the strain on existing infrastructure while enhancing service efficiency by offloading computational tasks to the corresponding air units such as UAVs. Furthermore, the coordinated deployment of various air units can significantly improve the Quality of Experience (QoE) by ensuring seamless, adaptive, and resilient task execution. In this study, we investigate the synergy between LLM-based applications and air computing, exploring their potential across various use cases. Additionally, we present a disaster response case study demonstrating how the collaborative utilization of LLMs and air computing can significantly improve outcomes in critical situations.
Submitted 2 March, 2025;
originally announced March 2025.
-
AirCompSim: A Discrete Event Simulator for Air Computing
Authors:
Baris Yamansavascilar,
Atay Ozgovde,
Cem Ersoy
Abstract:
Air components, including UAVs, planes, balloons, and satellites, have been widely utilized since the fixed capacity of ground infrastructure cannot meet the dynamic load of the users. However, since those air components should be coordinated in order to achieve the desired quality of service, several next-generation paradigms have been defined, including air computing. Nevertheless, even though many studies and open research issues exist for air computing, the few available test environments cannot satisfy the performance evaluation requirements of the dynamic environment. Therefore, in this study, we introduce our discrete event simulator, AirCompSim, which provides an air computing environment that considers dynamically changing requirements, loads, and capacities through its modular structure. To show its capabilities, a dynamic capacity enhancement scenario is used to investigate the effect of the number of users, the number of UAVs, and the requirements of different application types on the average task success rate, service time, and server utilization. The results demonstrate that AirCompSim can be used for experiments in air computing.
Submitted 1 September, 2024;
originally announced September 2024.
-
DeepAir: A Multi-Agent Deep Reinforcement Learning Based Scheme for an Unknown User Location Problem
Authors:
Baris Yamansavascilar,
Atay Ozgovde,
Cem Ersoy
Abstract:
The deployment of unmanned aerial vehicles (UAVs) in many different settings has provided various solutions and strategies for networking paradigms. It thereby reduces the complexity of solving existing problems that would otherwise require more sophisticated approaches. One of those problems is unknown user locations in an infrastructure-less environment in which users cannot connect to any communication device or computation-providing server, even though such connectivity is essential to task offloading in order to achieve the required quality of service (QoS). Therefore, in this study, we investigate this problem thoroughly and propose a novel deep reinforcement learning (DRL) based scheme, DeepAir. DeepAir considers all of the necessary steps, including sensing, localization, resource allocation, and multi-access edge computing (MEC), to achieve QoS requirements for the offloaded tasks without violating the maximum tolerable delay. To this end, we use two types of UAVs: detector UAVs and serving UAVs. We utilize detector UAVs as DRL agents that perform sensing, localization, and resource allocation, whereas serving UAVs provide MEC features. Our experiments show that DeepAir provides a high task success rate while deploying fewer detector UAVs than benchmark methods in environments with different numbers of users and user attraction points.
Submitted 11 August, 2024;
originally announced August 2024.
-
Dynamic Capacity Enhancement using Air Computing: An Earthquake Case
Authors:
Baris Yamansavascilar,
Atay Ozgovde,
Cem Ersoy
Abstract:
Earthquakes are among the most destructive natural disasters, harming both life and the infrastructure of cities. After an earthquake, functioning communication and computational capacity are crucial for rescue teams and for the healthcare of victims. Therefore, an earthquake can be investigated as a case for dynamic capacity enhancement, in which additional resources are deployed because the surviving portion of the infrastructure may not meet the demand of the users. In this study, we propose a new computation paradigm, air computing, which is air-vehicle-assisted next-generation edge computing through different air platforms, in order to enhance the capacity of the areas affected by an earthquake. To this end, we put forward a novel paradigm that presents a dynamic, responsive, and high-resolution computation environment, and we explain its components, air layers, and essential advantages. Moreover, we focus on the unmanned aerial vehicle (UAV) deployment problem and apply three different methods, namely the emergency method, the load balancing method, and the location selection index (LSI) method, in which we take the delay requirements of applications into account. To test and compare their performance in terms of the task success rate, we developed an earthquake scenario in which three towns are affected with different severity. The experimental results showed that each method can be beneficial depending on the circumstances and the goal of the rescue.
Submitted 13 July, 2023;
originally announced July 2023.
-
Air Computing: A Survey on a New Generation Computation Paradigm in 6G Wireless Networks
Authors:
Baris Yamansavascilar,
Atay Ozgovde,
Cem Ersoy
Abstract:
There is an ever-growing race between what novel applications demand from the infrastructure and what continuous technological breakthroughs deliver. Especially after the proliferation of smart devices and diverse IoT requirements, we observe the dominance of cutting-edge applications with ever-increasing user expectations in terms of mobility, pervasiveness, and real-time response. Over the years, to meet the requirements of those applications, cloud computing has provided the necessary capacity for computation, while edge computing has ensured low latency. However, these two essential solutions will be insufficient for next-generation applications since computational and communication bottlenecks are inevitable due to the highly dynamic load. Therefore, a 3D networking structure that uses different air layers, including Low Altitude Platforms, High Altitude Platforms, and Low Earth Orbits, in a harmonized manner for both urban and rural areas should be applied to satisfy the requirements of the dynamic environment. From this perspective, we put forward a novel, next-generation paradigm called Air Computing that presents a dynamic, responsive, and high-resolution computation and communication environment for the full spectrum of applications, using 6G Wireless Networks as the fundamental communication system. In this survey, we define the components of air computing, investigate its architecture in detail, and discuss its essential use cases and the advantages it brings for next-generation application scenarios. We provide a detailed technical overview of the benefits and challenges of air computing as a novel paradigm and identify important future research directions.
Submitted 10 September, 2022;
originally announced September 2022.
-
An Indoor Localization Dataset and Data Collection Framework with High Precision Position Annotation
Authors:
F. Serhan Daniş,
A. Teoman Naskali,
A. Taylan Cemgil,
Cem Ersoy
Abstract:
We introduce a novel technique and an associated high-resolution dataset that aim to precisely evaluate wireless-signal-based indoor positioning algorithms. The technique implements an augmented reality (AR) based positioning system that is used to annotate the wireless signal parameter data samples with high-precision position data. We track the position of a practical, low-cost, navigable setup of cameras and a Bluetooth Low Energy (BLE) beacon in an area decorated with AR markers. We maximize the performance of the AR-based localization by using a redundant number of markers. Video streams captured by the cameras are subjected to a series of marker recognition, subset selection, and filtering operations to yield highly precise pose estimations. Our results show that we can reduce the positional error of the AR localization system to under 0.05 meters. The position data are then used to annotate the BLE data that are captured simultaneously by the sensors stationed in the environment, thereby constructing a wireless signal dataset with ground truth, which allows a wireless-signal-based localization system to be evaluated accurately.
Submitted 6 September, 2022;
originally announced September 2022.
-
DeepEdge: A Deep Reinforcement Learning based Task Orchestrator for Edge Computing
Authors:
Baris Yamansavascilar,
Ahmet Cihat Baktir,
Cagatay Sonmez,
Atay Ozgovde,
Cem Ersoy
Abstract:
Improvements in edge computing technology pave the way for diversified applications that demand real-time interaction. However, due to the mobility of end users and the dynamic edge environment, it becomes challenging to handle task offloading with high performance. Moreover, since each application on mobile devices has different characteristics, a task orchestrator must be adaptive and able to learn the dynamics of the environment. For this purpose, we develop a deep reinforcement learning based task orchestrator, DeepEdge, which learns to meet different task requirements without human interaction, even under heavily loaded stochastic network conditions in terms of mobile users and applications. Given the dynamic offloading requests and time-varying communication conditions, we model the problem as a Markov process and then apply the Double Deep Q-Network (DDQN) algorithm to implement DeepEdge. To evaluate the robustness of DeepEdge, we experiment with four different applications, namely image rendering, infotainment, pervasive health, and augmented reality, in the network under various loads. Furthermore, we compare the performance of our agent with four different task offloading approaches from the literature. Our results show that DeepEdge outperforms its competitors in terms of the percentage of satisfactorily completed tasks.
Submitted 31 March, 2022; v1 submitted 5 October, 2021;
originally announced October 2021.
-
Reinforcement Learning Based Dynamic Function Splitting in Disaggregated Green Open RANs
Authors:
Turgay Pamuklu,
Melike Erol-Kantarci,
Cem Ersoy
Abstract:
With the growing momentum around Open RAN (O-RAN) initiatives, performing dynamic Function Splitting (FS) efficiently in disaggregated and virtualized Radio Access Networks (vRANs) is becoming highly important. An equally important efficiency demand is emerging from the energy consumption of the RAN hardware and software. Supplying the RAN with Renewable Energy Sources (RESs) promises to boost energy efficiency. Yet FS in such a dynamic setting calls for intelligent mechanisms that can adapt to the varying conditions of the RES supply and the traffic load on the mobile network. In this paper, we propose a reinforcement learning (RL)-based dynamic function splitting (RLDFS) technique that decides on the function splits in an O-RAN to make the best use of the RES supply and minimize operator costs. We also formulate an operational expenditure minimization problem. We evaluate the performance of the proposed approach on a real data set of solar irradiation and traffic rate variations. Our results show that the proposed RLDFS method makes effective use of RESs and reduces the cost of a mobile network operator (MNO). We also investigate the impact of the size of solar panels and batteries, which may guide MNOs in deciding on proper RES and battery sizing for their networks.
Submitted 14 February, 2021; v1 submitted 6 December, 2020;
originally announced December 2020.
-
Renewable Energy Assisted Function Splitting in Cloud Radio Access Networks
Authors:
Turgay Pamuklu,
Cicek Cavdar,
Cem Ersoy
Abstract:
Cloud-Radio Access Network (C-RAN) is a promising network architecture for reducing energy consumption and the growing cost of base station deployments in mobile networks. However, the enormous fronthaul bandwidth required between a remote radio head and a baseband unit (BBU) calls for novel solutions. One solution introduces an edge-cloud layer in addition to the centralized cloud (CC) to keep resources closer to the radio units (RUs), and then splits the BBU functions between the CC and the edge clouds (ECs) to reduce the fronthaul bandwidth requirement and to relax the stringent end-to-end delay requirements. This paper expands this architecture by combining it with renewable energy sources in the CC and ECs. We explain this novel system and formulate a mixed-integer linear programming (MILP) problem that aims to reduce the operational expenditure of this system. Because this problem is NP-hard, we solve smaller instances using a MILP solver and provide the results in this paper. Moreover, we propose a faster online heuristic to find solutions for high user densities. The results show that making splitting decisions by considering renewable energy provides more cost-effective solutions to mobile network operators (MNOs). Lastly, we provide an economic feasibility study for renewable energy sources in a C-RAN architecture, which will encourage MNOs to use these sources in this architecture.
Submitted 15 June, 2020;
originally announced June 2020.
-
Reducing the total cost of ownership in radio access networks by using renewable energy resources
Authors:
Turgay Pamuklu,
Cem Ersoy
Abstract:
Increasing electricity prices motivate mobile network operators to find new energy-efficient solutions for radio access networks (RANs). In this study, we focus on a specific type of RAN in which stand-alone solar panels are used as an alternative energy source to the electrical grid. First, we describe this hybrid energy based radio access network (HEBRAN) and formulate an optimization problem that aims to reduce the total cost of ownership of this network. Then, we propose a framework that provides a cost-efficient algorithm for choosing the proper size of the solar panels and batteries of a HEBRAN, along with two novel switch on/off algorithms that regulate the consumption of grid electricity during the operation of the network. In addition, we create a reduced model of the HEBRAN optimization problem to solve it with a mixed integer linear programming (MILP) solver. The results show that our algorithms outperform the MILP solution and classical switch on/off methods. Moreover, our findings show that migrating to a HEBRAN system is feasible and has cost benefits for mobile network operators.
Submitted 15 June, 2020;
originally announced June 2020.
-
GROVE: A Cost-Efficient Green Radio over Ethernet Architecture for Next Generation Radio Access Network
Authors:
Turgay Pamuklu,
Cem Ersoy
Abstract:
Centralized/Cloud Radio Access Network (C-RAN) has come into prominence as a way to reduce the rising energy consumption and maintenance difficulties of next-generation networks. However, C-RAN has strict delay requirements and needs large fronthaul bandwidth. Function splitting and Radio over Ethernet are two promising approaches to reduce these drawbacks of the C-RAN architecture. Meanwhile, the usage of renewable energy sources (RESs) in a C-RAN boosts the energy-efficiency potential of this network. In this paper, we propose a novel model, called Green Radio OVer Ethernet (GROVE), that merges these three approaches to maximize the benefits of C-RAN while maintaining the economic feasibility of this architecture. We briefly explain this model and formulate an operational expenditure minimization problem that considers several restrictions due to the network design and the service provisioning. Then we linearize the quadratic routing decision constraints in the problem to solve it with a mixed-integer linear programming (MILP) solver. Results show that it is cost-effective to make routing, function splitting, and RES decisions together. Our solution surpasses classical disjoint approaches in all studied cases. In addition, we provide a network scalability analysis to determine the MILP solver's limits for larger network topologies.
Submitted 5 December, 2020; v1 submitted 18 May, 2020;
originally announced May 2020.
-
Fault Tolerance in SDN Data Plane Considering Network and Application Based Metrics
Authors:
Baris Yamansavascilar,
Ahmet Cihat Baktir,
Atay Ozgovde,
Cem Ersoy
Abstract:
Failures in networks result in service disruptions, which may degrade the Quality of Service (QoS) for end users. Since SDN is becoming the mainstream paradigm for networks, implementing a robust fault tolerance scheme for SDN-based networks is crucial. Existing SDN data plane fault tolerance approaches can be classified as reactive and proactive, which may or may not rely on the controller, respectively. However, none of them qualifies as a complete solution; each provides only a partial remedy. In this work, we propose Dynamic Protection with Quality of Alternative Paths (DPQoAP), which considers not only the existing faults within the network but also the quality of alternative paths. As a result, we can sustain the QoS throughout the network after the recovery. We also investigate how application-based parameters are affected by link failures. To this end, we explore the change in Quality of Experience (QoE) caused by link failures under different cases using Dynamic Adaptive Streaming over HTTP (DASH) for video streaming. Even though DASH was proposed as a solution to improve the QoE affected by the dynamic conditions of networks, it remains insufficient to handle congested links that show the symptoms of a link failure. Thus, we apply the data plane fault tolerance approach in SDN to improve the QoE of DASH clients in the case of congestion as well as failure. The performance of the proposed solutions is evaluated through various experiments considering the QoS and QoE parameters. We observe that DPQoAP enhances the efficiency of the networking operations and the adaptability of the applications.
Submitted 26 December, 2019;
originally announced December 2019.
-
Is Your Smartband Smart Enough to Know Who You Are: Continuous Physiological Authentication in The Wild
Authors:
Deniz Ekiz,
Yekta Said Can,
Yagmur Ceren Dardagan,
Cem Ersoy
Abstract:
The use of cloud services that process privacy-sensitive information, such as digital banking, pervasive healthcare, and smart home applications, requires an implicit continuous authentication solution that makes these systems less vulnerable to spoofing attacks. Physiological signals can be used for continuous authentication due to their personal uniqueness. Ubiquitous wrist-worn wearable devices are equipped with photoplethysmogram sensors, which enable the extraction of heart rate variability (HRV) features. In this study, we show that these devices can be used for continuous physiological authentication to enhance the security of cloud and edge services and IoT devices. A system suitable for the smartband framework comes with new challenges, such as relatively low signal quality and artifacts due to placement, which were not encountered in full-lead electrocardiogram systems. After artifact removal, the cleaned physiological signals are fed to the machine learning algorithms. In order to train our machine learning models, we collected physiological data using off-the-shelf smartbands and smartwatches in a real-life event. Performance evaluation of selected machine learning algorithms shows that HRV is a strong candidate for continuous, unobtrusive, implicit physiological authentication.
Submitted 15 January, 2020; v1 submitted 10 December, 2019;
originally announced December 2019.
-
Long Short-Term Network Based Unobtrusive Perceived Workload Monitoring with Consumer Grade Smartwatches in the Wild
Authors:
Deniz Ekiz,
Yekta Said Can,
Cem Ersoy
Abstract:
Continuous high perceived workload has a negative impact on an individual's well-being. Prior works focused on detecting workload with medical-grade wearable systems in restricted settings, and the effect of applying deep learning techniques to perceived workload detection in the wild has not been investigated. We present an unobtrusive, comfortable, pervasive, and affordable Long Short-Term Memory Network based continuous workload monitoring system, built on a smartwatch application, that monitors the perceived workload of individuals in the wild. We make use of modern consumer-grade smartwatches. We recorded physiological data from daily life, together with perceived workload questionnaires, from subjects in their real-life environments over a month. The model was trained and evaluated with daily-life physiological data coming from different days, which makes it robust to daily changes in heart rate variability, which we use together with accelerometer features to assess low and high workload. Our system is capable of removing motion-related artifacts and detecting perceived workload using traditional and deep classifiers. We discuss the problems related to in-the-wild applications with consumer-grade smartwatches. We show that the Long Short-Term Memory Network outperforms traditional classifiers in discriminating low and high workload with smartwatches in the wild.
Submitted 30 November, 2019;
originally announced December 2019.
-
Addressing the Challenges in Federating Edge Resources
Authors:
Cihat Baktir,
Cagatay Sonmez,
Cem Ersoy,
Atay Ozgovde,
Blesson Varghese
Abstract:
This book chapter considers how Edge deployments can be brought to bear in a global context by federating them across multiple geographic regions to create a global Edge-based fabric that decentralizes data center computation. This is currently impractical, not only because of technical challenges but also because of the social, legal, and geopolitical issues that surround it. In this chapter, we discuss two key challenges in federating Edge deployments: networking and management. Additionally, we consider resource and modeling challenges that will need to be addressed for a federated Edge.
Submitted 14 March, 2018;
originally announced March 2018.