-
Implementing the draft Graph Query Language Standard
Authors:
Malcolm Crowe,
Fritz Laux
Abstract:
The International Organization for Standardization (ISO) is developing a new standard for Graph Query Language, with a particular focus on graph patterns with repeating paths. The Linked Data Benchmark Council (LDBC) has developed benchmarks to test proposed implementations. Their Financial Benchmark includes a novel requirement for truncation of results. This paper presents an open-source implementation of the benchmark workloads and truncation.
Submitted 6 July, 2024;
originally announced July 2024.
-
Implementing the Typed Graph Data Model Using Relational Database Technology
Authors:
Malcolm Crowe,
Fritz Laux
Abstract:
Recent standardization work for database languages has reflected the growing use of typed graph models (TGM) in application development. Such data models are frequently used only early in the design process, and are not reflected directly in the underlying physical database. In previous work, we have added support to a relational database management system (RDBMS) with role-based structures to ensure that relevant data models are not separately declared in each application but are an important part of the database implementation. In this work, we implement this approach for the TGM: the resulting database implementation is novel in retaining the best features of the graph-based and relational database technologies.
Submitted 6 July, 2024;
originally announced July 2024.
-
Database Technology Evolution III: Knowledge Graphs and Linked Data
Authors:
Malcolm Crowe,
Fritz Laux
Abstract:
This paper reviews the changes for database technology represented by the current development of the draft international standard ISO 39075 (Database Languages - GQL), which seeks a unified specification for property graphs and knowledge graphs. This paper examines these current developments as part of our review of the evolution of database technology, and their relation to the longer-term goal of supporting the Semantic Web using relational technology.
Submitted 6 July, 2024;
originally announced July 2024.
-
O$|$R$|$P$|$E -- A Data Semantics Driven Concurrency Control
Authors:
Tim Lessner,
Fritz Laux,
Thomas M Connolly
Abstract:
This paper presents a concurrency control mechanism that does not follow a 'one concurrency control mechanism fits all needs' strategy. With the presented mechanism a transaction runs under several concurrency control mechanisms and the appropriate one is chosen based on the accessed data. For this purpose, the data is divided into four classes based on its access type and usage (semantics). Class $O$ (the optimistic class) implements a first-committer-wins strategy, class $R$ (the reconciliation class) implements a first-n-committers-win strategy, class $P$ (the pessimistic class) implements a first-reader-wins strategy, and class $E$ (the escrow class) implements a first-n-readers-win strategy. Accordingly, the model is called O$|$R$|$P$|$E. The selected concurrency control mechanism may be automatically adapted at run-time according to the current load or a known usage profile. This run-time adaptation allows O$|$R$|$P$|$E to balance the commit rate and the response time even under changing conditions. O$|$R$|$P$|$E outperforms the Snapshot Isolation concurrency control in terms of response time by a factor of approximately 4.5 under heavy transactional load (4000 concurrent transactions). As a consequence, the degree of concurrency is 3.2 times higher.
Submitted 17 August, 2023;
originally announced August 2023.
-
SQL Access Patterns for Optimistic Concurrency Control
Authors:
Fritz Laux,
Martti Laiho
Abstract:
Transaction processing is of growing importance for mobile and web applications. Booking tickets, flight reservation, e-Banking, e-Payment, and booking holiday arrangements are just a few examples. Because such applications are at times temporarily disconnected, synchronization and consistent transaction processing are key issues. To avoid difficulties with blocked transactions or communication loss, several authors and technology providers have recommended using Optimistic Concurrency Control (OCC). However, most vendors of Relational Database Management Systems (DBMS) have implemented only locking schemes for concurrency control, which prohibits the immediate use of OCC. We propose the Row Version Verifying (RVV) discipline to avoid lost updates and achieve a kind of OCC for those DBMS that do not provide adequate non-blocking concurrency control. Moreover, the different mechanisms are categorized as access patterns in order to provide programmers with a general guideline for SQL databases. The proposed SQL access patterns are relevant for all transactional applications with unreliable communication and low-conflict situations. We demonstrate the proposed solution using mainstream database systems such as Oracle, DB2, and SQL Server.
Submitted 17 August, 2023;
originally announced August 2023.
-
Graph Data Models and Relational Database Technology
Authors:
Malcolm Crowe,
Fritz Laux
Abstract:
Recent work on database application development platforms has sought to include a declarative formulation of a conceptual data model in the application code, using annotations or attributes. Some recent work has used metadata to include the details of such formulations in the physical database, and this approach brings significant advantages in that the model can be enforced across a range of applications for a single database. In previous work, we have discussed the advantages for enterprise integration of typed graph data models (TGM), which can play a similar role in graphical databases, leveraging the existing support for the Unified Modeling Language (UML). Ideally, the integration of systems designed with different models, for example, graphical and relational databases, should also be supported. In this work, we implement this approach, using metadata in a relational database management system (DBMS).
Submitted 22 March, 2023;
originally announced March 2023.
-
Database Technology Evolution
Authors:
Malcolm Crowe,
Fritz Laux
Abstract:
This paper reviews suggestions for changes to database technology coming from the work of many researchers, particularly those working with evolving big data. We discuss new approaches to remote data access and standards that better provide for durability and auditability in settings including business and scientific computing. We propose ways in which the language standards could evolve, with proof-of-concept implementations on GitHub.
Submitted 21 March, 2023;
originally announced March 2023.
-
Reconsidering Optimistic Algorithms for Relational DBMS
Authors:
Malcolm Crowe,
Fritz Laux
Abstract:
At DBKDA 2019, we demonstrated that StrongDBMS, with simple but rigorous optimistic algorithms, provides better performance in situations of high concurrency than major commercial database management systems (DBMS). The demonstration was convincing, but the reasons for its success were not fully analysed. There is a brief account of the results below. In this short contribution, we wish to discuss the reasons for the results. The analysis leads to a strong criticism of all DBMS algorithms based on locking, and based on these results, it is not fanciful to suggest that it is time to re-engineer existing DBMS.
Submitted 6 October, 2021;
originally announced October 2021.
-
Data Validation for Big Live Data
Authors:
Malcolm Crowe,
Carolyn Begg,
Fritz Laux,
Martti Laiho
Abstract:
Data Integration of heterogeneous data sources relies either on periodically transferring large amounts of data to a physical Data Warehouse or retrieving data from the sources on request only. The latter results in the creation of what is referred to as a virtual Data Warehouse, which is preferable when the use of the latest data is paramount. However, the downside is that it adds network traffic and suffers from performance degradation when the amount of data is high. In this paper, we propose the use of a readCheck validator to ensure the timeliness of the queried data and reduced data traffic. It is further shown that the readCheck allows transactions to update data in the data sources obeying full Atomicity, Consistency, Isolation, and Durability (ACID) properties.
Submitted 5 October, 2021;
originally announced October 2021.
-
The Typed Graph Model -- a Supermodel for Model Management and Data Integration
Authors:
Fritz Laux
Abstract:
In recent years, the Graph Model has become increasingly popular, especially in the application domain of social networks. The model has been semantically augmented with properties and labels attached to the graph elements. It is difficult to ensure data quality for the properties and the data structure because the model does not need a schema. In this paper, we propose a schema-bound Typed Graph Model with properties and labels. These enhancements improve not only data quality but also the quality of graph analysis. The power of this model comes from its hyper-nodes and hyper-edges, which allow data structures to be presented on different abstraction levels. We prove that the model is at least equivalent in expressive power to most popular data models. Therefore, it can be used as a supermodel for model management and data integration. We illustrate by example the superiority of this model over the property graph data model of Hidders and other prevalent data models, namely the relational, object-oriented, and XML models, and RDF Schema.
Submitted 6 December, 2021; v1 submitted 5 October, 2021;
originally announced October 2021.
-
The Typed Graph Model
Authors:
Fritz Laux
Abstract:
In recent years, the Graph Model has become increasingly popular, especially in the application domain of social networks. The model has been semantically augmented with properties and labels attached to the graph elements. It is difficult to ensure data quality for the properties and the data structure because the model does not need a schema. In this paper, we propose a schema-bound Typed Graph Model with properties and labels. These enhancements improve not only data quality but also the quality of graph analysis. The power of this model comes from its hyper-nodes and hyper-edges, which allow a data structure to be presented on different abstraction levels. We demonstrate by example the superiority of this model over the property graph data model of Hidders and other prevalent data models, namely the relational, object-oriented, and XML models.
Submitted 3 October, 2021;
originally announced October 2021.
-
Information Integration using the Typed Graph Model
Authors:
Fritz Laux,
Malcolm Crowe
Abstract:
Schema and data integration have been a challenge for more than 40 years. While data warehouse technologies are quite a success story, there is still a lack of information integration methods, especially if the data sources are based on different data models or do not have a schema. Enterprise Information Integration has to deal with heterogeneous data sources and requires up-to-date high-quality information to provide a reliable basis for analysis and decision making. The paper proposes virtual integration using the Typed Graph Model to support schema mediation. The integration process first converts the structure of each source into a typed graph schema, which is then matched to the mediated schema. Mapping rules define transformations between the schemata to reconcile semantics. The mapping can be visually validated by experts. It provides indicators and rules to achieve a consistent schema mapping, which leads to high data integrity and quality.
Submitted 20 July, 2021;
originally announced July 2021.
-
The Cryogenic Storage Ring CSR
Authors:
Robert von Hahn,
Arno Becker,
Felix Berg,
Klaus Blaum,
Christian Breitenfeldt,
Hisham Fadil,
Florian Fellenberger,
Michael Froese,
Sebastian George,
Jürgen Göck,
Manfred Grieser,
Florian Grussie,
Elisabeth A. Guerin,
Oded Heber,
Philipp Herwig,
Jonas Karthein,
Claude Krantz,
Holger Kreckel,
Michael Lange,
Felix Laux,
Svenja Lohmann,
Sebastian Menk,
Christian Meyer,
Preeti M. Mishra,
Oldřich Novotný, et al. (19 additional authors not shown)
Abstract:
An electrostatic cryogenic storage ring, CSR, for beams of anions and cations with up to 300 keV kinetic energy per unit charge has been designed, constructed and put into operation. With a circumference of 35 m, the ion-beam vacuum chambers and all beam optics are in a cryostat and cooled by a closed-cycle liquid helium system. At temperatures as low as (5.5 $\pm$ 1) K inside the ring, storage time constants of several minutes up to almost an hour were observed for atomic and molecular, anion and cation beams at an energy of 60 keV. The ion-beam intensity, energy-dependent closed-orbit shifts (dispersion) and the focusing properties of the machine were studied by a system of capacitive pickups. The Schottky-noise spectrum of the stored ions revealed a broadening of the momentum distribution on a time scale of 1000 s. Photodetachment of stored anions was used in the beam lifetime measurements. The detachment rate by anion collisions with residual-gas molecules was found to be extremely low. A residual-gas density below 140 cm$^{-3}$ is derived, equivalent to a room-temperature pressure below 10$^{-14}$ mbar. Fast atomic, molecular and cluster ion beams stored for long periods of time in a cryogenic environment will allow experiments on collision- and radiation-induced fragmentation processes of ions in known internal quantum states with merged and crossed photon and particle beams.
Submitted 5 June, 2016;
originally announced June 2016.