Skip to main content

Showing 1–22 of 22 results for author: Ilager, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.07755  [pdf, ps, other

    cs.DC cs.AI

    Benchmarking of CPU-intensive Stream Data Processing in The Edge Computing Systems

    Authors: Tomasz Szydlo, Viacheslaw Horbanow, Dev Nandan Jha, Shashikant Ilager, Aleksander Slominski, Rajiv Ranjan

    Abstract: Edge computing has emerged as a pivotal technology, offering significant advantages such as low latency, enhanced data security, and reduced reliance on centralized cloud infrastructure. These benefits are crucial for applications requiring real-time data processing or strict security measures. Despite these advantages, edge devices operating within edge clusters are often underutilized. This inef… ▽ More

    Submitted 12 May, 2025; originally announced May 2025.

  2. arXiv:2501.15829  [pdf, other

    cs.DC

    Aging-aware CPU Core Management for Embodied Carbon Amortization in Cloud LLM Inference

    Authors: Tharindu B. Hewage, Shashikant Ilager, Maria Rodriguez Read, Rajkumar Buyya

    Abstract: Broad adoption of Large Language Models (LLM) demands rapid expansions of cloud LLM inference clusters, leading to accumulation of embodied carbon$-$the emissions from manufacturing and supplying IT assets$-$that mostly concentrate on inference server CPU. This paper delves into the challenges of sustainable growth of cloud LLM inference, emphasizing extended amortization of CPU embodied over an i… ▽ More

    Submitted 27 January, 2025; originally announced January 2025.

  3. arXiv:2501.11006  [pdf, other

    cs.DC cs.AI cs.PF cs.SE

    GREEN-CODE: Learning to Optimize Energy Efficiency in LLM-based Code Generation

    Authors: Shashikant Ilager, Lukas Florian Briem, Ivona Brandic

    Abstract: Large Language Models (LLMs) are becoming integral to daily life, showcasing their vast potential across various Natural Language Processing (NLP) tasks. Beyond NLP, LLMs are increasingly used in software development tasks, such as code completion, modification, bug fixing, and code translation. Software engineers widely use tools like GitHub Copilot and Amazon Q, streamlining workflows and automa… ▽ More

    Submitted 21 March, 2025; v1 submitted 19 January, 2025; originally announced January 2025.

    Comments: Under submission in ACM/IEEE conference, 11 pages

    ACM Class: C.4; D.0; E.4; I.7

  4. arXiv:2501.08219  [pdf, ps, other

    cs.LG

    Investigating Energy Efficiency and Performance Trade-offs in LLM Inference Across Tasks and DVFS Settings

    Authors: Paul Joe Maliakel, Shashikant Ilager, Ivona Brandic

    Abstract: Large Language Models (LLMs) have demonstrated remarkable performance across a wide range of natural language processing (NLP) tasks, leading to widespread adoption in both research and industry. However, their inference workloads are computationally and energy intensive, raising concerns about sustainability and environmental impact. As LLMs continue to scale, it becomes essential to identify and… ▽ More

    Submitted 2 June, 2025; v1 submitted 14 January, 2025; originally announced January 2025.

  5. arXiv:2411.07628  [pdf, other

    cs.DC

    A Framework for Carbon-aware Real-Time Workload Management in Clouds using Renewables-driven Cores

    Authors: Tharindu B. Hewage, Shashikant Ilager, Maria A. Rodriguez, Rajkumar Buyya

    Abstract: Cloud platforms commonly exploit workload temporal flexibility to reduce their carbon emissions. They suspend/resume workload execution for when and where the energy is greenest. However, increasingly prevalent delay-intolerant real-time workloads challenge this approach. To this end, we present a framework to harvest green renewable energy for real-time workloads in cloud systems. We use renewabl… ▽ More

    Submitted 12 November, 2024; originally announced November 2024.

  6. arXiv:2410.23881  [pdf, other

    cs.DC cs.LG cs.SE

    DynaSplit: A Hardware-Software Co-Design Framework for Energy-Aware Inference on Edge

    Authors: Daniel May, Alessandro Tundo, Shashikant Ilager, Ivona Brandic

    Abstract: The deployment of ML models on edge devices is challenged by limited computational resources and energy availability. While split computing enables the decomposition of large neural networks (NNs) and allows partial computation on both edge and cloud devices, identifying the most suitable split layer and hardware configurations is a non-trivial task. This process is in fact hindered by the large c… ▽ More

    Submitted 31 October, 2024; originally announced October 2024.

  7. arXiv:2410.10285  [pdf, other

    cs.LG cs.AI

    ABBA-VSM: Time Series Classification using Symbolic Representation on the Edge

    Authors: Meerzhan Kanatbekova, Shashikant Ilager, Ivona Brandic

    Abstract: In recent years, Edge AI has become more prevalent with applications across various industries, from environmental monitoring to smart city management. Edge AI facilitates the processing of Internet of Things (IoT) data and provides privacy-enabled and latency-sensitive services to application users using Machine Learning (ML) algorithms, e.g., Time Series Classification (TSC). However, existing T… ▽ More

    Submitted 5 November, 2024; v1 submitted 14 October, 2024; originally announced October 2024.

    Comments: 15 pages with references, 5 figures

  8. arXiv:2410.06715  [pdf, other

    cs.DC

    FRESCO: Fast and Reliable Edge Offloading with Reputation-based Hybrid Smart Contracts

    Authors: Josip Zilic, Vincenzo de Maio, Shashikant Ilager, Ivona Brandic

    Abstract: Mobile devices offload latency-sensitive application tasks to edge servers to satisfy applications' Quality of Service (QoS) deadlines. Consequently, ensuring reliable offloading without QoS violations is challenging in distributed and unreliable edge environments. However, current edge offloading solutions are either centralized or do not adequately address challenges in distributed environments.… ▽ More

    Submitted 28 November, 2024; v1 submitted 9 October, 2024; originally announced October 2024.

    Comments: 14 pages, 12 figures

  9. arXiv:2409.08949  [pdf, other

    cs.DC cs.AR

    Generic and ML Workloads in an HPC Datacenter: Node Energy, Job Failures, and Node-Job Analysis

    Authors: Xiaoyu Chu, Daniel Hofstätter, Shashikant Ilager, Sacheendra Talluri, Duncan Kampert, Damian Podareanu, Dmitry Duplyakin, Ivona Brandic, Alexandru Iosup

    Abstract: HPC datacenters offer a backbone to the modern digital society. Increasingly, they run Machine Learning (ML) jobs next to generic, compute-intensive workloads, supporting science, business, and other decision-making processes. However, understanding how ML jobs impact the operation of HPC datacenters, relative to generic jobs, remains desirable but understudied. In this work, we leverage long-term… ▽ More

    Submitted 13 September, 2024; originally announced September 2024.

    Comments: 10 pages, 10 figures, 6 tables, ICPADS 2024

  10. arXiv:2405.07806  [pdf, other

    cs.DC cs.IR cs.NI eess.SY

    A Decentralized and Self-Adaptive Approach for Monitoring Volatile Edge Environments

    Authors: Shashikant Ilager, Jakob Fahringer, Alessandro Tundo, Ivona Brandić

    Abstract: Edge computing provides resources for IoT workloads at the network edge. Monitoring systems are vital for efficiently managing resources and application workloads by collecting, storing, and providing relevant information about the state of the resources. However, traditional monitoring systems have a centralized architecture for both data plane and control plane, which increases latency, creates… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: Submitted to ACM Transactions on Autonomous and Adaptive Systems

  11. arXiv:2403.16930  [pdf, other

    cs.LG

    FLIGAN: Enhancing Federated Learning with Incomplete Data using GAN

    Authors: Paul Joe Maliakel, Shashikant Ilager, Ivona Brandic

    Abstract: Federated Learning (FL) provides a privacy-preserving mechanism for distributed training of machine learning models on networked devices (e.g., mobile devices, IoT edge nodes). It enables Artificial Intelligence (AI) at the edge by creating models without sharing actual data across the network. Existing research typically focuses on generic aspects of non-IID data and heterogeneity in client's sys… ▽ More

    Submitted 2 April, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

  12. arXiv:2311.00974  [pdf, other

    cs.DC

    CloudSim Express: A Novel Framework for Rapid Low Code Simulation of Cloud Computing Environments

    Authors: Tharindu B. Hewage, Shashikant Ilager, Maria A. Rodriguez, Rajkumar Buyya

    Abstract: Cloud computing environment simulators enable cost-effective experimentation of novel infrastructure designs and management approaches by avoiding significant costs incurred from repetitive deployments in real Cloud platforms. However, widely used Cloud environment simulators compromise on usability due to complexities in design and configuration, along with the added overhead of programming langu… ▽ More

    Submitted 10 November, 2023; v1 submitted 1 November, 2023; originally announced November 2023.

  13. SymED: Adaptive and Online Symbolic Representation of Data on the Edge

    Authors: Daniel Hofstätter, Shashikant Ilager, Ivan Lujic, Ivona Brandic

    Abstract: The edge computing paradigm helps handle the Internet of Things (IoT) generated data in proximity to its source. Challenges occur in transferring, storing, and processing this rapidly growing amount of data on resource-constrained edge devices. Symbolic Representation (SR) algorithms are promising solutions to reduce the data size by converting actual raw data into symbols. Also, they allow data a… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

    Comments: 14 pages, 5 figures

    Journal ref: Euro-Par 2023: Parallel Processing pp 411-425. Springer Nature Switzerland, Cham (2023)

  14. arXiv:2309.00022  [pdf, other

    cs.SE cs.LG eess.SY

    An Energy-Aware Approach to Design Self-Adaptive AI-based Applications on the Edge

    Authors: Alessandro Tundo, Marco Mobilio, Shashikant Ilager, Ivona Brandić, Ezio Bartocci, Leonardo Mariani

    Abstract: The advent of edge devices dedicated to machine learning tasks enabled the execution of AI-based applications that efficiently process and classify the data acquired by the resource-constrained devices populating the Internet of Things. The proliferation of such applications (e.g., critical monitoring in smart cities) demands new strategies to make these systems also sustainable from an energetic… ▽ More

    Submitted 31 August, 2023; originally announced September 2023.

  15. arXiv:2303.10572  [pdf

    cs.DC

    Energy-Efficiency and Sustainability in New Generation Cloud Computing: A Vision and Directions for Integrated Management of Data Centre Resources and Workloads

    Authors: Rajkumar Buyya, Shashikant Ilager, Patricia Arroba

    Abstract: Cloud computing has become a critical infrastructure for modern society, like electric power grids and roads. As the backbone of the modern economy, it offers subscription-based computing services anytime, anywhere, on a pay-as-you-go basis. Its use is growing exponentially with the continued development of new classes of applications driven by a huge number of emerging networked devices. However,… ▽ More

    Submitted 20 July, 2023; v1 submitted 19 March, 2023; originally announced March 2023.

    Comments: 15 pages, 6 figures

    ACM Class: C.1.4

  16. arXiv:2107.02342  [pdf, other

    cs.DC cs.LG cs.NI

    Energy and Thermal-aware Resource Management of Cloud Data Centres: A Taxonomy and Future Directions

    Authors: Shashikant Ilager, Rajkumar Buyya

    Abstract: This paper investigates the existing resource management approaches in Cloud Data Centres for energy and thermal efficiency. It identifies the need for integrated computing and cooling systems management and learning-based solutions in resource management systems. A taxonomy on energy and thermal efficient resource management in data centres is proposed based on an in-depth analysis of the literat… ▽ More

    Submitted 5 July, 2021; originally announced July 2021.

    Comments: Submitted to ACM Computing Surveys

  17. arXiv:2011.03649  [pdf, other

    cs.DC cs.AI cs.LG

    Thermal Prediction for Efficient Energy Management of Clouds using Machine Learning

    Authors: Shashikant Ilager, Kotagiri Ramamohanarao, Rajkumar Buyya

    Abstract: Thermal management in the hyper-scale cloud data centers is a critical problem. Increased host temperature creates hotspots which significantly increases cooling cost and affects reliability. Accurate prediction of host temperature is crucial for managing the resources effectively. Temperature estimation is a non-trivial problem due to thermal variations in the data center. Existing solutions for… ▽ More

    Submitted 15 December, 2020; v1 submitted 6 November, 2020; originally announced November 2020.

    Comments: Under submission at IEEE Transactions on Parallel and Distributed Systems (TPDS)

  18. arXiv:2009.03598  [pdf

    cs.DC

    Green-aware Mobile Edge Computing for IoT: Challenges, Solutions and Future Directions

    Authors: Minxian Xu, Chengxi Gao, Shashikant Ilager, Huaming Wu, Chengzhong Xu, Rajkumar Buyya

    Abstract: The development of Internet of Things (IoT) technology enables the rapid growth of connected smart devices and mobile applications. However, due to the constrained resources and limited battery capacity, there are bottlenecks when utilizing the smart devices. Mobile edge computing (MEC) offers an attractive paradigm to handle this challenge. In this work, we concentrate on the MEC application for… ▽ More

    Submitted 8 September, 2020; originally announced September 2020.

    Comments: 19 pages, 2 figures, 2 tables, to be appeared in Mobile Edge Computing book published by Springer

  19. Dynamic Scheduling for Stochastic Edge-Cloud Computing Environments using A3C learning and Residual Recurrent Neural Networks

    Authors: Shreshth Tuli, Shashikant Ilager, Kotagiri Ramamohanarao, Rajkumar Buyya

    Abstract: The ubiquitous adoption of Internet-of-Things (IoT) based applications has resulted in the emergence of the Fog computing paradigm, which allows seamlessly harnessing both mobile-edge and cloud resources. Efficient scheduling of application tasks in such environments is challenging due to constrained resource capabilities, mobility factors in IoT, resource heterogeneity, network hierarchy, and sto… ▽ More

    Submitted 1 September, 2020; originally announced September 2020.

    Comments: Accepted in IEEE Transaction on Mobile Computing

  20. arXiv:2006.05075  [pdf, other

    cs.DC cs.LG

    Artificial Intelligence (AI)-Centric Management of Resources in Modern Distributed Computing Systems

    Authors: Shashikant Ilager, Rajeev Muralidhar, Rajkumar Buyya

    Abstract: Contemporary Distributed Computing Systems (DCS) such as Cloud Data Centres are large scale, complex, heterogeneous, and distributed across multiple networks and geographical boundaries. On the other hand, the Internet of Things (IoT)-driven applications are producing a huge amount of data that requires real-time processing and fast response. Managing these resources efficiently to provide reliabl… ▽ More

    Submitted 6 November, 2020; v1 submitted 9 June, 2020; originally announced June 2020.

    Comments: Presented at IEEE cloud summit, 2020

  21. arXiv:2004.08177  [pdf, other

    cs.DC

    A Data-Driven Frequency Scaling Approach for Deadline-aware Energy Efficient Scheduling on Graphics Processing Units (GPUs)

    Authors: Shashikant Ilager, Rajeev Muralidhar, Kotagiri Rammohanrao, Rajkumar Buyya

    Abstract: Modern computing paradigms, such as cloud computing, are increasingly adopting GPUs to boost their computing capabilities primarily due to the heterogeneous nature of AI/ML/deep learning workloads. However, the energy consumption of GPUs is a critical problem. Dynamic Voltage Frequency Scaling (DVFS) is a widely used technique to reduce the dynamic power of GPUs. Yet, configuring the optimal clock… ▽ More

    Submitted 27 April, 2020; v1 submitted 17 April, 2020; originally announced April 2020.

    Comments: In the Proceedings of the 20th IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGRID 2020)

  22. arXiv:1808.06332  [pdf

    cs.DC

    GPU PaaS Computation Model in Aneka Cloud Computing Environment

    Authors: Shashikant Ilager, Rajeev Wankar, Raghavendra Kune, Rajkumar Buyya

    Abstract: Due to the surge in the volume of data generated and rapid advancement in Artificial Intelligence (AI) techniques like machine learning and deep learning, the existing traditional computing models have become inadequate to process an enormous volume of data and the complex application logic for extracting intrinsic information. Computing accelerators such as Graphics processing units (GPUs) have b… ▽ More

    Submitted 20 August, 2018; originally announced August 2018.

    Comments: Submitted as book chapter, under processing, 32 pages