-
How to get Rid of SQL, Relational Algebra, the Relational Model, ERM, and ORMs in a Single Paper -- A Thought Experiment
Authors:
Jens Dittrich
Abstract:
Without any doubt, the relational paradigm has been a huge success. At the same time, we believe that the time is ripe to rethink what database systems could look like if we designed them from scratch. Would we really end up with the same abstractions and techniques that are prevalent today? This paper explores that space. We discuss the various issues with both the relational model (RM) and the entity-relationship model (ERM). We provide a unified data model: the relational map type model (RMTM), which can represent both RM and ERM as special cases and overcomes all of their problems. We proceed to identify seven rules that an RMTM query language (QL) must fulfill and provide a foundation of a language fulfilling all seven rules. Our QL operates on maps which may represent tuples, relations, databases, or sets of databases. In this way, we dramatically expand the existing operational abstractions found in SQL and relational algebra (RA), which only operate on relations/tables. In fact, RA is just a special case of our much more generic approach. This work has far-reaching consequences: we show a path toward a modern QL that solves almost all, if not all, problems of SQL. Our QL is much more expressive than SQL and integrates smoothly into existing programming languages (PL). We also show results of an initial experiment showcasing that just by switching to our data model, and without changing the underlying query processing algorithms, we can achieve speed-ups of up to a factor of 3. We conclude that, if we build a database system from scratch, we could and should do this without SQL, RA, RM, ERM, and ORMs.
Submitted 17 April, 2025;
originally announced April 2025.
-
Good Vibes! Towards Phone-to-User Authentication Through Wristwatch Vibrations
Authors:
Jakob Dittrich,
Rainhard Dieter Findling
Abstract:
While mobile devices frequently require users to authenticate to prevent unauthorized access, mobile devices typically do not authenticate to their users. This leaves room for users to unwittingly interact with different mobile devices. We present GoodVibes authentication, a variant of mobile device-to-user authentication, where the user's phone authenticates to the user through their wristwatch vibrating in their pre-selected authentication vibration pattern. We implement GoodVibes authentication as an Android prototype, evaluate different authentication scenarios with 30 participants, and find that users are well able to recognize their authentication vibration pattern and distinguish it from different patterns, from unrelated vibrations, and from the pattern being absent.
Submitted 3 June, 2024;
originally announced June 2024.
-
Dirac particle under dynamical confinement: Fermi acceleration, trembling motion and quantum force
Authors:
J. Dittrich,
S. Rakhmanov,
D. Matrasulov
Abstract:
The quantum dynamics of a Dirac particle in a 1D box with a moving wall is studied. The Dirac equation with a time-dependent boundary condition is mapped onto one with a static boundary condition but a time-dependent mass. An exact analytical solution of this modified Dirac equation is obtained for a massless particle. For a massive particle, the problem is solved numerically. The time dependence of the main characteristics of the dynamical confinement, such as the average kinetic energy and the quantum force, is analyzed. It is found that the average kinetic energy remains bounded when the interval length is bounded from below, in particular for a periodically oscillating wall.
Submitted 5 January, 2024;
originally announced January 2024.
-
What if an SQL Statement Returned a Database?
Authors:
Joris Nix,
Jens Dittrich
Abstract:
Every SQL statement is limited to returning a single, possibly denormalized, table. This design decision has far-reaching consequences: (1) for database users, in terms of slow query performance, long query result transfer times, and usability issues of SQL in web applications and object-relational mappers; and (2) for database architects designing query optimizers, in terms of logical (algebraic) join enumeration effort, memory consumption for intermediate result materialization, and physical operator selection effort. So basically, the entire query optimization stack is shaped by that design decision. In this paper, we argue that the single-table limitation should be dropped. We extend the SELECT clause of SQL by a keyword 'RESULTDB' to support returning a result database. Our approach has clear semantics, i.e., our extended SQL returns subsets of all tables with only those tuples that would be part of the traditional (single-table) query result set, however without performing any denormalization through joins. Our SQL extension is backward compatible. Moreover, we discuss the surprisingly long list of benefits of our approach. First, for database users: far simpler and more readable application code, better query performance, smaller query results, and better query result transfer times. Second, for database architects, we present how to leverage existing closed-source systems as well as change open-source database systems to support our feature. We propose a couple of algorithms to integrate our feature into both closed-source and open-source database systems. We present an initial experimental study with promising results.
Submitted 1 December, 2023;
originally announced December 2023.
-
A Critical Analysis of Recursive Model Indexes
Authors:
Marcel Maltry,
Jens Dittrich
Abstract:
The recursive model index (RMI) has recently been introduced as a machine-learned replacement for traditional indexes over sorted data, achieving remarkably fast lookups. Follow-up work focused on explaining RMI's performance and automatically configuring RMIs through enumeration. Unfortunately, configuring RMIs involves setting several hyperparameters, the enumeration of which is often too time-consuming in practice. Therefore, in this work, we conduct the first inventor-independent broad analysis of RMIs with the goal of understanding the impact of each hyperparameter on performance. In particular, we show that in addition to model types and layer size, error bounds and search algorithms must be considered to achieve the best possible performance. Based on our findings, we develop a simple-to-follow guideline for configuring RMIs. We evaluate our guideline by comparing the resulting RMIs with a number of state-of-the-art indexes, both learned and traditional. We show that our simple guideline is sufficient to achieve competitive performance with other learned indexes and RMIs whose configuration was determined using an expensive enumeration procedure. In addition, while carefully reimplementing RMIs, we are able to improve the build time by 2.5x to 6.3x.
Submitted 22 November, 2021; v1 submitted 30 June, 2021;
originally announced June 2021.
-
Fast Compilation and Execution of SQL Queries with WebAssembly
Authors:
Immanuel Haffner,
Jens Dittrich
Abstract:
Interpreted execution of queries, as in the vectorized model, suffers from interpretation overheads. By compiling queries, this interpretation overhead is eliminated at the cost of a compilation phase that delays execution, sacrificing latency for throughput. For short-lived queries, minimizing latency is important, while for long-running queries throughput outweighs latency. Because neither a purely interpretive model nor a purely compiling model can provide low latency and high throughput, adaptive solutions emerged. Adaptive systems seamlessly transition from interpreted to compiled execution, achieving low latency for short-lived queries and high throughput for long-running queries. However, these adaptive systems demand an immense development effort and require expert knowledge in both interpreter and compiler design.
In this work, we investigate query execution by compilation to WebAssembly. We are able to compile even complex queries in less than a millisecond to machine code with near-optimal performance. By delegating execution of WebAssembly to the V8 engine, we are able to seamlessly transition from rapidly compiled yet non-optimized code to thoroughly optimized code during execution. Our approach provides both low latency and high throughput, is adaptive out of the box, and is straightforward to implement. The drastically reduced compilation times even enable us to explore generative programming of library code that is fully inlined by construction. Our experimental evaluation confirms that our approach yields competitive and sometimes superior performance.
Submitted 3 May, 2021; v1 submitted 30 April, 2021;
originally announced April 2021.
-
Database (Lecture) Streams on the Cloud: An Experience Report on Teaching an Undergrad Database Lecture during a Pandemic
Authors:
Jens Dittrich,
Marcel Maltry
Abstract:
This is an experience report on teaching the undergrad lecture Big Data Engineering at Saarland University in summer term 2020 online. We describe our teaching philosophy, the tools used, what worked and what did not work. As we received extremely positive feedback from the students, in the future, we will continue to use the same teaching model for other lectures.
Submitted 9 October, 2020;
originally announced October 2020.
-
There is No Such Thing as an "Index"! or: The next 500 Indexing Papers
Authors:
Jens Dittrich,
Joris Nix,
Christian Schön
Abstract:
Index structures are a building block of query processing and computer science in general. Since the dawn of computer technology there have been index structures. And since then, a myriad of index structures have been invented and published each and every year. In this paper we argue that the very idea of "inventing an index" is a misleading concept in the first place. It is the analogue of "inventing a physical query plan". This paper is a paradigm shift in which we propose to drop the idea of handcrafting index structures (as done for everything from binary search trees and B-trees to any form of learned index) altogether. We present a new automatic index breeding framework coined Genetic Generic Generation of Index Structures (GENE). It is based on the observation that almost all index structures are assembled along three principal dimensions: (1) structural building blocks, e.g., a B-tree is assembled from two different structural node types (inner and leaf nodes), (2) a couple of invariants, e.g., for a B-tree all paths have the same length, and (3) decisions on the internal layout of nodes (row or column layout, etc.). We propose a generic indexing framework that can mimic many existing index structures along those dimensions. Based on that framework we propose a generic genetic index generation algorithm that, given a workload and an optimization goal, can automatically assemble and mutate, in other words 'breed', new index structure 'species'. In our experiments we follow multiple goals. We reexamine some good old wisdom from database technology. Given a specific workload, will GENE even breed an index that is equivalent to what our textbooks and papers currently recommend for such a workload? Or can we do even more? Our initial results strongly indicate that generated indexes are the next step in designing index structures.
Submitted 16 September, 2021; v1 submitted 22 September, 2020;
originally announced September 2020.
-
ChainifyDB: How to Blockchainify any Data Management System
Authors:
Felix Martin Schuhknecht,
Ankur Sharma,
Jens Dittrich,
Divya Agrawal
Abstract:
Today's permissioned blockchain systems come in a stand-alone fashion and require the users to integrate yet another full-fledged transaction processing system into their already complex data management landscape. This seems odd as blockchains and traditional DBMSs share large parts of their processing stack. Thus, rather than replacing the established data systems altogether, we advocate simply 'chainifying' them with a blockchain layer on top.
Unfortunately, this task is far more challenging than it sounds: As we want to build upon heterogeneous transaction processing systems, which potentially behave differently, we cannot rely on every organization to execute every transaction deterministically in the same way. Further, as these systems are already filled with data and being used by top-level applications, we also cannot rely on every organization being resilient against tampering with its local data.
Therefore, in this work, we drop these assumptions and introduce a powerful processing model that avoids them in the first place: The so-called Whatever-LedgerConsensus (WLC) model allows us to create a highly flexible permissioned blockchain layer coined ChainifyDB that (a) is centered around bullet-proof database technology, (b) makes even stronger guarantees than existing permissioned systems, (c) provides a sophisticated recovery mechanism, (d) achieves up to 6x higher throughput than the permissioned blockchain system Fabric, and (e) can easily be integrated into an existing heterogeneous database landscape.
Submitted 10 December, 2019;
originally announced December 2019.
-
Scattering of particles bounded to an infinite planar curve
Authors:
J. Dittrich
Abstract:
Non-relativistic quantum particles bound to a curve in R^2 by an attractive contact $δ$-interaction are considered. The interval between the energy of the transversal bound state and zero is shown to belong to the absolutely continuous spectrum, with possible embedded eigenvalues. The existence of the wave operators is proved for the mentioned energy interval, using the Hamiltonians with the interaction supported by straight lines as the free ones. Their completeness is not proved. The curve is assumed to be C^3-smooth, non-intersecting, unbounded, and asymptotically approaching two different half-lines (non-parallel or parallel but excluding the "U-case"). Physically, the system can be considered as a model of a long nanostructural channel.
Submitted 12 August, 2020; v1 submitted 9 December, 2019;
originally announced December 2019.
-
Make Thunderbolts Less Frightening -- Predicting Extreme Weather Using Deep Learning
Authors:
Christian Schön,
Jens Dittrich
Abstract:
Forecasting severe weather conditions is still a very challenging and computationally expensive task due to the enormous amount of data and the complexity of the underlying physics. Machine learning approaches, and especially deep learning, have however shown huge improvements in many research areas dealing with large datasets in recent years. In this work, we tackle one specific sub-problem of weather forecasting, namely the prediction of thunderstorms and lightning. We propose the use of a convolutional neural network architecture inspired by UNet++ and ResNet to predict thunderstorms as a binary classification problem based on satellite images and lightning recorded in the past. We achieve a probability of detection of more than 94% for lightning within the next 15 minutes while at the same time minimizing the false alarm ratio compared to previous approaches.
Submitted 6 December, 2019; v1 submitted 3 December, 2019;
originally announced December 2019.
-
The Case for Deep Query Optimisation
Authors:
Jens Dittrich,
Joris Nix
Abstract:
Query Optimisation (QO) is the most important optimisation problem in databases. The goal of QO is to compute the best physical plan under a given cost model. In that process, physical operators are used as building blocks for the planning and optimisation process. In this paper, we propose to deepen that process. We present Deep Query Optimisation (DQO). In DQO, we break up the abstraction of a 'physical' operator to consider more fine-granular subcomponents. These subcomponents are then used to enumerate (sub-)plans both offline and at query time. This idea triggers several exciting research directions: (1) How exactly can DQO help to compute better plans than (shallow) QO, and at what cost? (2) DQO can be used to precompute and synthesise database operators and any other database component as Materialised Algorithmic Views (MAVs). (3) We identify the Algorithmic View Selection Problem (AVSP), i.e., which MAVs should be materialised when?
This paper presents the high-level idea of DQO using an analogy inspired by biology. Then we proceed to question the terms 'physical' and 'physical operator'. We present experiments with a 'physical operator' formerly known as 'hash-based grouping'. We benchmark that operator both independently and in the context of DQO-enabled dynamic programming. We conclude by sketching a DQO research agenda.
Submitted 17 December, 2019; v1 submitted 22 August, 2019;
originally announced August 2019.
-
Absolute continuity of the spectrum in a twisted Dirichlet-Neumann waveguide
Authors:
Philippe Briet,
Jaroslav Dittrich,
David Krejcirik
Abstract:
A quantum waveguide with the shape of a planar infinite straight strip and combined Dirichlet and Neumann boundary conditions on the opposite half-lines of the boundary is considered. The absence of the point spectrum as well as of the singular continuous spectrum is proved.
Submitted 20 January, 2020; v1 submitted 13 June, 2019;
originally announced June 2019.
-
The Error is the Feature: how to Forecast Lightning using a Model Prediction Error
Authors:
Christian Schön,
Jens Dittrich,
Richard Müller
Abstract:
Despite the progress within the last decades, weather forecasting is still a challenging and computationally expensive task. Current satellite-based approaches to predict thunderstorms are usually based on the analysis of the observed brightness temperatures in different spectral channels and emit a warning if a critical threshold is reached. Recent progress in data science however demonstrates that machine learning can be successfully applied to many research fields in science, especially in areas dealing with large datasets. We therefore present a new approach to the problem of predicting thunderstorms based on machine learning. The core idea of our work is to use the error of two-dimensional optical flow algorithms applied to images of meteorological satellites as a feature for machine learning models. We interpret that optical flow error as an indication of convection potentially leading to thunderstorms and lightning. To factor in spatial proximity we use various manual convolution steps. We also consider effects such as the time of day or the geographic location. We train different tree classifier models as well as a neural network to predict lightning within the next few hours (called nowcasting in meteorology) based on these features. In our evaluation section we compare the predictive power of the different models and the impact of different features on the classification result. Our results show a high accuracy of 96% for predictions over the next 15 minutes which slightly decreases with increasing forecast period but still remains above 83% for forecasts of up to five hours. The high false positive rate of nearly 6% however needs further investigation to allow for an operational use of our approach.
Submitted 1 February, 2019; v1 submitted 23 November, 2018;
originally announced November 2018.
-
How to Databasify a Blockchain: the Case of Hyperledger Fabric
Authors:
Ankur Sharma,
Felix Martin Schuhknecht,
Divya Agrawal,
Jens Dittrich
Abstract:
Within the last few years, countless blockchain systems have emerged on the market, each one claiming to revolutionize distributed transaction processing in one way or another. Many blockchain features, such as byzantine fault tolerance (BFT), are indeed valuable additions in modern environments. However, despite all the hype around the technology, many of the challenges that blockchain systems have to face are fundamental transaction management problems. These are largely shared with traditional database systems, which have been around for decades already.
These similarities become especially visible for systems that blur the lines between blockchain systems and classical database systems. A great example of this is Hyperledger Fabric, an open-source permissioned blockchain system under development by IBM. By having a relaxed view on BFT, the transaction pipeline of Fabric highly resembles the workflow of classical distributed database systems.
This raises two questions: (1) Which conceptual similarities and differences actually exist between a system such as Fabric and a classical distributed database system? (2) Is it possible to improve on the performance of Fabric by transitioning technology from the database world to blockchains, thus blurring the lines between these two types of systems even further? To tackle these questions, we first explore Fabric from the perspective of database research, where we observe weaknesses in the transaction pipeline. We then solve these issues by transitioning well-understood database concepts to Fabric, namely transaction reordering as well as early transaction abort. Our experimental evaluation shows that our improved version Fabric++ significantly increases the throughput of successful transactions over the vanilla version by up to a factor of 3.
Submitted 31 October, 2018;
originally announced October 2018.
-
The Case for Automatic Database Administration using Deep Reinforcement Learning
Authors:
Ankur Sharma,
Felix Martin Schuhknecht,
Jens Dittrich
Abstract:
Like any large software system, a full-fledged DBMS offers an overwhelming number of configuration knobs. These range from static initialisation parameters like buffer sizes, degree of concurrency, or level of replication to complex runtime decisions like creating a secondary index on a particular column or reorganising the physical layout of the store. To simplify the configuration, industry-grade DBMSs are usually shipped with various advisory tools that provide recommendations for given workloads and machines. However, reality shows that the actual configuration, tuning, and maintenance is usually still done by a human administrator, relying on intuition and experience. Recent work on deep reinforcement learning has shown very promising results in solving problems that require such a sense of intuition. For instance, it has been applied very successfully in learning how to play complicated games with enormous search spaces. Motivated by these achievements, in this work we explore how deep reinforcement learning can be used to administer a DBMS. First, we describe how deep reinforcement learning can be used to automatically tune an arbitrary software system like a DBMS by defining a problem environment. Second, we showcase our concept of NoDBA using the concrete example of index selection and evaluate how well it recommends indexes for given workloads.
Submitted 17 January, 2018;
originally announced January 2018.
-
On the solution to the separated equation in the 3-particle Calogero-Moser problem
Authors:
N. G. Inozemtseva,
J. Dittrich,
V. I. Inozemtsev
Abstract:
We propose the exact solution of the equation in separated variable which appears in the process of constructing solutions to the quantum Calogero-Moser three-particle problem with elliptic two-particle potential $g(g-1)\wp(q)$. This solution is found for special values of coupling constants $g\in {\mathbb Z}, \, g>1$. It can be used for solving the three-particle CM problem under appropriate boundary conditions.
Submitted 24 October, 2017;
originally announced October 2017.
-
Accelerating Analytical Processing in MVCC using Fine-Granular High-Frequency Virtual Snapshotting
Authors:
Ankur Sharma,
Felix Martin Schuhknecht,
Jens Dittrich
Abstract:
Efficient transactional management is a delicate task. As systems face transactions of inherently different types, ranging from point updates to long-running analytical computations, it is hard to satisfy their individual requirements with a single processing component. Unfortunately, most systems nowadays rely on such a single component that implements its parallelism using multi-version concurrency control (MVCC). While MVCC parallelizes short-running OLTP transactions very well, it struggles in the presence of mixed workloads containing long-running scan-centric OLAP queries, as scans have to work their way through large amounts of versioned data. To overcome this problem, we propose a system that reintroduces the concept of heterogeneous transaction processing: OLAP transactions are outsourced to run on separate (virtual) snapshots while OLTP transactions run on the most recent representation of the database. Inside both components, MVCC ensures a high degree of concurrency. The biggest challenge of such a heterogeneous approach is to generate the snapshots at a high frequency. Previous approaches suffered heavily from the tremendous cost of snapshot creation. In our system, we overcome the restrictions of the OS by introducing a custom system call vm_snapshot that is hand-tailored to our precise needs: it allows fine-granular snapshot creation at very high frequencies, rendering the snapshot creation phase orders of magnitude faster than state-of-the-art approaches. Our experimental evaluation on a heterogeneous workload based on TPC-H transactions and handcrafted OLTP transactions shows that our system enables significantly higher analytical transaction throughputs on mixed workloads than homogeneous approaches. In this sense, we introduce a system that accelerates Analytical processing by introducing custom Kernel functionalities: AnKerDB.
Submitted 13 September, 2017;
originally announced September 2017.
-
On eigenvalue asymptotics for strong delta-interactions supported by surfaces with boundaries
Authors:
J. Dittrich,
P. Exner,
Ch. Kühn,
K. Pankrashkin
Abstract:
Let $S\subset\mathbb{R}^3$ be a $C^4$-smooth relatively compact orientable surface with a sufficiently regular boundary. For $β\in\mathbb{R}_+$, let $E_j(β)$ denote the $j$th negative eigenvalue of the operator associated with the quadratic form \[ H^1(\mathbb{R}^3)\ni u\mapsto \iiint_{\mathbb{R}^3} |\nabla u|^2dx -β\iint_S |u|^2dσ, \] where $σ$ is the two-dimensional Hausdorff measure on $S$. We show that for each fixed $j$ one has the asymptotic expansion \[ E_j(β)=-\dfrac{β^2}{4}+μ^D_j+ o(1) \;\text{ as }\; β\to+\infty\,, \] where $μ_j^D$ is the $j$th eigenvalue of the operator $-Δ_S+K-M^2$ on $L^2(S)$, in which $K$ and $M$ are the Gauss and mean curvatures, respectively, and $-Δ_S$ is the Laplace-Beltrami operator with the Dirichlet condition at the boundary of $S$. If, in addition, the boundary of $S$ is $C^2$-smooth, then the remainder estimate can be improved to ${\mathcal O}(β^{-1}\logβ)$.
Submitted 22 June, 2015;
originally announced June 2015.
-
Scattering through a straight quantum waveguide with combined boundary conditions
Authors:
Ph. Briet,
J. Dittrich,
E. Soccorsi
Abstract:
Scattering through a straight two-dimensional quantum waveguide $\mathbb{R}\times(0,d)$ with Dirichlet boundary conditions on $(-\infty,0)\times\{y=0\} \cup (0,\infty)\times\{y=d\}$ and Neumann boundary conditions on $(-\infty,0)\times\{y=d\} \cup (0,\infty)\times\{y=0\}$ is considered using stationary scattering theory. The existence of a solution of the matching conditions at $x=0$ is proved. The use of stationary scattering theory is justified by showing its relation to the motion of wave packets. As an illustration, the matching conditions are also solved numerically and the transition probabilities are shown.
Submitted 18 August, 2014;
originally announced August 2014.
-
Main Memory Adaptive Indexing for Multi-core Systems
Authors:
Victor Alvarez,
Felix Martin Schuhknecht,
Jens Dittrich,
Stefan Richter
Abstract:
Adaptive indexing is a concept that considers index creation in databases as a by-product of query processing, as opposed to traditional full index creation, where the indexing effort is performed up front before answering any queries. Adaptive indexing has received a considerable amount of attention, and several algorithms have been proposed over the past few years, including a recent experimental study comparing a large number of existing methods. Until now, however, most adaptive indexing algorithms have been designed for a single thread, yet with multi-core systems already well established, the idea of designing parallel algorithms for adaptive indexing is very natural. In this regard, only one parallel algorithm for adaptive indexing has recently appeared in the literature: the parallel version of standard cracking. In this paper we describe three alternative parallel algorithms for adaptive indexing, including a second variant of a parallel standard cracking algorithm. Additionally, we describe a hybrid parallel sorting algorithm, and a NUMA-aware method based on sorting. We then thoroughly compare all these algorithms experimentally, along with a variant of a recently published parallel version of radix sort. Parallel sorting algorithms serve as a realistic baseline for multi-threaded adaptive indexing techniques. In total we experimentally compare seven parallel algorithms. Additionally, we extensively profile all considered algorithms. The initial set of experiments considered in this paper indicates that our parallel algorithms significantly improve over previously known ones. Our results suggest that, although adaptive indexing algorithms are a good design choice in single-threaded environments, the rules change considerably in the parallel case. That is, in future highly-parallel environments, sorting algorithms could be serious alternatives to adaptive indexing.
Submitted 8 April, 2014;
originally announced April 2014.
-
Towards Zero-Overhead Adaptive Indexing in Hadoop
Authors:
Stefan Richter,
Jorge-Arnulfo Quiané-Ruiz,
Stefan Schuh,
Jens Dittrich
Abstract:
Several research works have focused on supporting index access in MapReduce systems. These works have allowed users to speed up selective MapReduce jobs by orders of magnitude. However, all these proposals require users to create indexes upfront, which might be a difficult task in certain applications (such as in scientific and social applications) where workloads are evolving or hard to predict. To overcome this problem, we propose LIAH (Lazy Indexing and Adaptivity in Hadoop), a parallel, adaptive approach for indexing at minimal costs for MapReduce systems. The main idea of LIAH is to automatically and incrementally adapt to users' workloads by creating clustered indexes on HDFS data blocks as a byproduct of executing MapReduce jobs. Besides distributing indexing efforts over multiple computing nodes, LIAH also parallelises indexing with both map tasks computation and disk I/O, all without any additional data copy in main memory and with minimal synchronisation. The beauty of LIAH is that it piggybacks index creation on map tasks, which read relevant data from disk to main memory anyway. Hence, LIAH does not introduce any additional read I/O costs and exploits free CPU cycles. As a result and in contrast to existing adaptive indexing works, LIAH has a very low (or invisible) indexing overhead, usually for the very first job. Still, LIAH can quickly converge to a complete index, i.e. all HDFS data blocks are indexed. In particular, LIAH can trade early job runtime improvements with fast complete index convergence. We compare LIAH with HAIL, a state-of-the-art indexing technique, as well as with standard Hadoop with respect to indexing overhead and workload performance.
Submitted 14 December, 2012;
originally announced December 2012.
-
Only Aggressive Elephants are Fast Elephants
Authors:
Jens Dittrich,
Jorge-Arnulfo Quiané-Ruiz,
Stefan Richter,
Stefan Schuh,
Alekh Jindal,
Jörg Schad
Abstract:
Yellow elephants are slow. A major reason is that they consume their inputs entirely before responding to an elephant rider's orders. Some clever riders have trained their yellow elephants to only consume parts of the inputs before responding. However, the teaching time to make an elephant do that is high. So high that the teaching lessons often do not pay off. We take a different approach. We make elephants aggressive; only this will make them very fast. We propose HAIL (Hadoop Aggressive Indexing Library), an enhancement of HDFS and Hadoop MapReduce that dramatically improves runtimes of several classes of MapReduce jobs. HAIL changes the upload pipeline of HDFS in order to create different clustered indexes on each data block replica. An interesting feature of HAIL is that we typically create a win-win situation: we improve both data upload to HDFS and the runtime of the actual Hadoop MapReduce job. In terms of data upload, HAIL improves over HDFS by up to 60% with the default replication factor of three. In terms of query execution, we demonstrate that HAIL runs up to 68x faster than Hadoop. In our experiments, we use six clusters including physical and EC2 clusters of up to 100 nodes. A series of scalability experiments also demonstrates the superiority of HAIL.
Submitted 1 August, 2012;
originally announced August 2012.
-
PaperBricks: An Alternative to Complete-Story Peer Reviewing
Authors:
Jens Dittrich
Abstract:
The peer review system as used in several computer science communities has several flaws, including long review times, overloaded reviewers, and the fostering of niche topics. These flaws decrease quality, lower impact, slow down the innovation process, and lead to frustration of authors, readers, and reviewers. In order to fix this, we propose a new peer review system termed paper bricks. Paper bricks has several advantages over the existing system, including shorter publications, better competition for new ideas, and an accelerated innovation process. Furthermore, paper bricks may be implemented with minimal change to the existing peer review systems.
Submitted 17 February, 2011;
originally announced February 2011.
-
A Uniqueness and Periodicity Result for Solutions of Elliptic Equations in Unbounded Domains
Authors:
Matthias Bergner,
Jens Dittrich
Abstract:
We prove a uniqueness and periodicity theorem for bounded solutions of uniformly elliptic equations in certain unbounded domains.
Submitted 20 November, 2007;
originally announced November 2007.
-
On Surfaces of Prescribed Weighted Mean Curvature
Authors:
Matthias Bergner,
Jens Dittrich
Abstract:
Utilizing a weight matrix we study surfaces of prescribed weighted mean curvature which yield a natural generalisation to critical points of anisotropic surface energies. We first derive a differential equation for the normal of immersions with prescribed weighted mean curvature, generalising a result of Clarenz and von der Mosel. Next we study graphs of prescribed weighted mean curvature, for which a quasilinear elliptic equation is proved. Using this equation, we can show height and boundary gradient estimates. Finally, we solve the Dirichlet problem for graphs of prescribed weighted mean curvature.
Submitted 15 November, 2007;
originally announced November 2007.
-
The Commutativity of Integrals of Motion for Quantum Spin Chains and Elliptic Functions Identities
Authors:
J. Dittrich,
V. I. Inozemtsev
Abstract:
We prove the commutativity of the first two nontrivial integrals of motion for quantum spin chains with elliptic form of the exchange interaction. We also show their linear independence for numbers of spins larger than 4. As a byproduct, we obtain several identities between elliptic Weierstrass functions of three and four arguments.
Submitted 13 November, 2007;
originally announced November 2007.
-
Analytic proof of the Sutherland conjecture
Authors:
J. Dittrich,
V. I. Inozemtsev
Abstract:
Using the integral representation of the inverse of the logarithmic derivative of the elliptic theta function, the spectrum of the Lax matrix for the 1D system of particles interacting via inverse sinh-squared potential is shown to be given by the asymptotic Bethe ansatz in the thermodynamic limit.
Submitted 23 December, 2003;
originally announced December 2003.
-
Nonsmoothness of the boundary and the relevant heat kernel coefficients
Authors:
V. V. Nesterenko,
I. G. Pirozhenko,
J. Dittrich
Abstract:
The contributions to the heat kernel coefficients generated by the corners of the boundary are studied. For this purpose the internal and external sectors of a wedge and a cone are considered. These sectors are obtained by introducing, inside the wedge, a cylindrical boundary. Transition to a cone is accomplished by identification of the wedge sides. The basic result of the paper is the calculation of the individual contributions to the heat kernel coefficients generated by the boundary singularities. In the course of this analysis, certain patterns followed by these contributions are revealed. The implications of the obtained results for calculations of the vacuum energy in regions with nonsmooth boundary are discussed. The rules for obtaining all the heat kernel coefficients for the minus Laplace operator defined on a polygon or in its cylindrical generalization are formulated.
Submitted 3 December, 2002; v1 submitted 4 July, 2002;
originally announced July 2002.
-
Massive Scalar Field in an One-Dimensional Oscillating Region
Authors:
J. Dittrich,
P. Duclos
Abstract:
The classical massive scalar field satisfying the Klein-Gordon equation in a finite one-dimensional space interval of periodically varying length with Dirichlet boundary conditions is studied. For sufficiently small mass, the energy can grow exponentially with time under the same conditions as for the massless case. The proofs are based on estimates of exactly given mass-induced corrections to the massless case.
Submitted 27 June, 2002;
originally announced June 2002.
-
Curved planar quantum wires with Dirichlet and Neumann boundary conditions
Authors:
Jaroslav Dittrich,
Jan Kriz
Abstract:
We investigate the discrete spectrum of the Hamiltonian describing a quantum particle living in the two-dimensional curved strip. We impose the Dirichlet and Neumann boundary conditions on opposite sides of the strip. The existence of the discrete eigenvalue below the essential spectrum threshold depends on the sign of the total bending angle for the asymptotically straight strips.
Submitted 17 April, 2002; v1 submitted 5 March, 2002;
originally announced March 2002.
-
Bound states in straight quantum waveguides with combined boundary conditions
Authors:
Jaroslav Dittrich,
Jan Kriz
Abstract:
We investigate the discrete spectrum of the Hamiltonian describing a quantum particle living in the two-dimensional straight strip. We impose the combined Dirichlet and Neumann boundary conditions on different parts of the boundary. Several statements on the existence or the absence of the discrete spectrum are proven for two models with combined boundary conditions. Examples of eigenfunctions and eigenvalues are computed numerically.
Submitted 11 December, 2001;
originally announced December 2001.
-
On the Ground State of Ferromagnetic Hamiltonians
Authors:
J. Dittrich,
V. I. Inozemtsev
Abstract:
It is generally believed that the ground state of the ferromagnetic Heisenberg-Dirac-Van Vleck Hamiltonians acting on s=1/2 spins of a lattice with N sites has the maximal possible value of the total spin S=N/2 and is N+1 times degenerate. We present a rigorous proof of this statement, independent of the lattice dimension and topology.
Submitted 28 December, 2000;
originally announced December 2000.
-
A model of interband radiative transitions
Authors:
J. Dittrich,
P. Exner,
M. Hirokawa
Abstract:
We consider a simple model which is a caricature of a crystal interacting with a radiation field. The model has two bands of continuous spectrum and the particle can pass from the upper one to the lower by radiating a photon, the coupling between the excited and deexcited states being of a Friedrichs type. Under suitable regularity and analyticity assumptions we find the continued resolvent and show that for weak enough coupling it has a curve-type singularity in the lower halfplane which is a deformation of the upper-band spectral cut. We then find a formula for the decay amplitude and show that for a fixed energy it is approximately exponential at intermediate times, while the tail has a power-like behaviour.
Submitted 7 November, 2000;
originally announced November 2000.
-
On the second-neighbor correlator in 1D XXX quantum antiferromagnetic spin chain
Authors:
J. Dittrich,
V. I. Inozemtsev
Abstract:
We have calculated the energy per site for the ground state of the antiferromagnetic quantum spin chain with variable range exchange $h(j-k)\propto \sinh^2 a \sinh^{-2}a(j-k)$ in the framework of the asymptotic Bethe ansatz. By expanding it in powers of $e^{-2a}$, we have confirmed the value of the second-neighbor correlator for the model with nearest-neighbor exchange obtained earlier in the atomic limit of the Hubbard chain.
Submitted 26 June, 1997;
originally announced June 1997.
-
On the two-magnon bound states for the quantum Heisenberg chain with variable range exchange
Authors:
J. Dittrich,
V. I. Inozemtsev
Abstract:
The spectrum of the finite-difference two-magnon operator is investigated for the quantum S=1/2 chain with variable range exchange of the form $h(j-k)\propto \sinh^{-2}a(j-k)$. It is found that the usual bound state appears for some values of the total pseudomomentum of two magnons, as for the Heisenberg chain with nearest-neighbor spin interaction. Besides this state, a new type of bound state with oscillating wave function appears at larger values of the total pseudomomentum.
Submitted 20 December, 1996;
originally announced December 1996.