-
Higher-Order Graph Databases
Authors:
Maciej Besta,
Shriram Chandran,
Jakub Cudak,
Patrick Iff,
Marcin Copik,
Robert Gerstenberger,
Tomasz Szydlo,
Jürgen Müller,
Torsten Hoefler
Abstract:
Recent advances in graph databases (GDBs) have been driving interest in large-scale analytics, yet current systems fail to support higher-order (HO) interactions beyond first-order (one-hop) relations, which are crucial for tasks such as subgraph counting, polyadic modeling, and HO graph learning. We address this by introducing a new class of systems, higher-order graph databases (HO-GDBs), that use lifting and lowering paradigms to seamlessly extend traditional GDBs with HO support. We provide a theoretical analysis of OLTP and OLAP queries, ensuring correctness, scalability, and ACID compliance. We implement a lightweight, modular, and parallelizable HO-GDB prototype that offers native support for hypergraphs, node-tuples, subgraphs, and other HO structures under a unified API. The prototype scales to large HO OLTP & OLAP workloads and shows how HO improves analytical tasks, for example enhancing the accuracy of graph neural networks within a GDB by 44%. Our work ensures low latency and high query throughput, and generalizes to both ACID-compliant and eventually consistent systems.
Submitted 24 June, 2025;
originally announced June 2025.
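To make the lowering and lifting idea from the abstract above concrete, here is a minimal sketch of one possible encoding: each hyperedge becomes an extra node connected to its members (a bipartite incidence graph), which a plain graph store can hold, and lifting reverses the mapping. The encoding and the names `lower_hypergraph` and `lift_graph` are assumptions made for illustration; they are not the paper's API or necessarily its chosen representation.

```python
def lower_hypergraph(hyperedges):
    """Lower a hypergraph to a plain graph (illustrative encoding only).

    hyperedges: dict mapping a hyperedge name to an iterable of member nodes.
    Returns a set of ordinary edges ((\"he\", name), member), i.e. one
    auxiliary node per hyperedge linked to each of its members.
    """
    edges = set()
    for name, members in hyperedges.items():
        for node in members:
            edges.add((("he", name), node))
    return edges


def lift_graph(edges):
    """Lift the lowered graph back into a dict of hyperedges."""
    hyperedges = {}
    for (_kind, name), node in edges:
        hyperedges.setdefault(name, set()).add(node)
    return hyperedges


# Hypothetical example: one 3-node hyperedge and one 2-node hyperedge.
example = {"e1": {"a", "b", "c"}, "e2": {"c", "d"}}
lowered = lower_hypergraph(example)
restored = lift_graph(lowered)
```

In this sketch the round trip is lossless for non-empty hyperedges; an empty hyperedge produces no edges and would need a separate node record to survive lowering.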
-
Benchmarking of CPU-intensive Stream Data Processing in The Edge Computing Systems
Authors:
Tomasz Szydlo,
Viacheslaw Horbanow,
Dev Nandan Jha,
Shashikant Ilager,
Aleksander Slominski,
Rajiv Ranjan
Abstract:
Edge computing has emerged as a pivotal technology, offering significant advantages such as low latency, enhanced data security, and reduced reliance on centralized cloud infrastructure. These benefits are crucial for applications requiring real-time data processing or strict security measures. Despite these advantages, edge devices operating within edge clusters are often underutilized. This inefficiency is mainly due to the absence of a holistic performance profiling mechanism which can help dynamically adjust the desired system configuration for a given workload. Since edge computing environments involve a complex interplay between CPU frequency, power consumption, and application performance, a deeper understanding of these correlations is essential. By uncovering these relationships, it becomes possible to make informed decisions that enhance both computational efficiency and energy savings. To address this gap, this paper evaluates the power consumption and performance characteristics of a single processing node within an edge cluster using a synthetic microbenchmark by varying the workload size and CPU frequency. The results show how an optimal measure can lead to optimized usage of edge resources, given both performance and power consumption.
Submitted 12 May, 2025;
originally announced May 2025.
-
IoTSim-Osmosis-RES: Towards autonomic renewable energy-aware osmotic computing
Authors:
Tomasz Szydlo,
Amadeusz Szabala,
Nazar Kordiumov,
Konrad Siuzdak,
Lukasz Wolski,
Khaled Alwasel,
Fawzy Habeeb,
Rajiv Ranjan
Abstract:
Internet of Things systems exist in various areas of our everyday life. For example, data from sensors installed in smart cities and homes are processed in edge and cloud computing centres, providing several benefits that improve our lives. The place of data processing is related to the required system response times -- processing data closer to its source results in a shorter system response time. The Osmotic Computing concept enables flexible deployment of data processing services and their possible movement, just like particles in the osmosis phenomenon move between regions of different densities. At the same time, the impact of complex computer architecture on the environment is increasingly being compensated by the use of renewable and low-carbon energy sources. However, the uncertainty of the green energy supply makes the management of Osmotic Computing systems demanding, and therefore their autonomy is desirable. In the paper, we present a framework enabling osmotic computing simulation based on renewable energy sources and autonomic osmotic agents, allowing the analysis of distributed management algorithms. We discuss the challenges posed to the framework and analyze various management algorithms for cooperating osmotic agents. In the evaluation, we show that by changing the adaptation logic of the osmotic agents, it is possible to increase the self-consumption of renewable energy sources or increase the usage of low-emission ones.
Submitted 17 April, 2024;
originally announced April 2024.
-
Device management and network connectivity as missing elements in TinyML landscape
Authors:
Tomasz Szydlo,
Marcin Nagy
Abstract:
Deployment of solutions based on TinyML requires meeting several challenges. These include hardware heterogeneity, microcontroller (MCU) architectures, and resource availability constraints. Another challenge is the variety of operating systems for MCUs, limited memory management implementations, and limited software interoperability between devices. A number of these challenges are solved by dedicated programming libraries and the ability to compile code for specific devices. Nevertheless, the challenge discussed in the paper is the issue of network connectivity for such solutions. We point out that more emphasis should be placed on standard protocols, interoperability of solutions, and security. Finally, the paper discusses how the LwM2M protocol can solve the identified challenges related to network connectivity and interoperability.
Submitted 23 April, 2023;
originally announced April 2023.
-
Online Anomaly Detection Based On Reservoir Sampling and LOF for IoT devices
Authors:
Tomasz Szydlo
Abstract:
The growing number of IoT devices and their use to monitor the operation of machines and equipment increases interest in anomaly detection algorithms running on devices. However, the difficulty lies in the limited computational and memory resources available on the devices. In the case of microcontrollers (MCUs), these amount to a few megabytes of program memory and several hundred kilobytes of working memory. Consequently, algorithms must be appropriately matched to the capabilities of the devices. In the paper, we analyse the processing pipeline for anomaly detection and the implementation of the Local Outlier Factor (LOF) algorithm on an MCU. We also show that it is possible to train such an algorithm directly on the device, which gives great potential to use the solution in real devices.
Submitted 28 June, 2022;
originally announced June 2022.
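The title above names reservoir sampling as the way to keep a bounded training set on a memory-constrained device. As a minimal sketch of that general technique (the classic "Algorithm R", not the paper's actual implementation), the class below keeps a uniform random sample of fixed size from a stream of unknown length. The class name `ReservoirSampler`, its parameters, and the example capacity are assumptions chosen for illustration.

```python
import random


class ReservoirSampler:
    """Keep a uniform random sample of at most `capacity` stream items."""

    def __init__(self, capacity, seed=None):
        self.capacity = capacity
        self.samples = []          # the bounded reservoir
        self.seen = 0              # number of items observed so far
        self._rng = random.Random(seed)

    def add(self, item):
        self.seen += 1
        if len(self.samples) < self.capacity:
            # Fill phase: keep every item until the reservoir is full.
            self.samples.append(item)
            return
        # Replacement phase: the new item is kept with probability
        # capacity / seen, overwriting a uniformly chosen slot.
        j = self._rng.randrange(self.seen)
        if j < self.capacity:
            self.samples[j] = item


# Hypothetical usage: 1000 sensor readings, memory for only 32 of them.
sampler = ReservoirSampler(capacity=32, seed=0)
for reading in range(1000):
    sampler.add(reading)
```

Memory use stays fixed at `capacity` items regardless of stream length, which is why the technique suits devices with only a few hundred kilobytes of working memory; the retained samples could then serve as the reference set for a detector such as LOF.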
-
Machine Learning in the Internet of Things for Industry 4.0
Authors:
Tomasz Szydlo,
Joanna Sendorek,
Robert Brzoza-Woch,
Mateusz Windak
Abstract:
The number of IoT devices is constantly increasing, which results in greater complexity of computations and high data velocity. One approach to processing sensor data is dataflow programming. It enables the development of reactive software with short processing and rapid response times, especially when moved to the edge of the network. This is especially important in systems that utilize online machine learning algorithms to analyze ongoing processes such as those observed in Industry 4.0. In this paper, we show that the organization of such systems depends on the entire processing stack, from the hardware layer all the way to the software layer, as well as on the required response times of the IoT system. We propose a flow processing stack for such systems along with organizational machine learning architectural patterns that make it possible to spread learning and inference across the edge and the cloud. In the paper, we analyse what latency is introduced by communication technologies used in the IoT for cloud connectivity and how they influence the response times of the system. Finally, we provide recommendations on which machine learning patterns should be used in IoT systems depending on the application type.
Submitted 22 May, 2020;
originally announced May 2020.
-
Dataset for anomalies detection in 3D printing
Authors:
Joanna Sendorek,
Tomasz Szydlo,
Mateusz Windak,
Robert Brzoza-Woch
Abstract:
Nowadays, the Internet of Things plays a significant role in many domains. In particular, Industry 4.0 makes extensive use of concepts like smart sensors and big data analysis. IoT devices are commonly used to monitor industrial machines and detect anomalies in their operation. In this paper, we present and describe a set of data streams coming from a working 3D printer. Among others, it contains accelerometer data of the printer head, intrusion power, and temperatures of the printer elements. To collect the data, we induced several printing malfunctions applied to the 3D model. The resulting dataset can therefore be used for anomaly detection research.
Submitted 19 April, 2020;
originally announced April 2020.