FITing-Tree: A Data-aware Index Structure

Galakatos, Alex; Markovitch, Michael; Binnig, Carsten; Fonseca, Rodrigo; Kraska, Tim

doi:10.1145/3299869.3319860

Computer Science > Databases

arXiv:1801.10207 (cs)

[Submitted on 30 Jan 2018 (v1), last revised 25 Mar 2020 (this version, v2)]

Title:FITing-Tree: A Data-aware Index Structure

Authors:Alex Galakatos, Michael Markovitch, Carsten Binnig, Rodrigo Fonseca, Tim Kraska

View PDF

Abstract:Index structures are one of the most important tools that DBAs leverage to improve the performance of analytics and transactional workloads. However, building several indexes over large datasets can often become prohibitive and consume valuable system resources. In fact, a recent study showed that indexes created as part of the TPC-C benchmark can account for 55% of the total memory available in a modern DBMS. This overhead consumes valuable and expensive main memory, and limits the amount of space available to store new data or process existing data.
In this paper, we present FITing-Tree, a novel form of a learned index which uses piece-wise linear functions with a bounded error specified at construction time. This error knob provides a tunable parameter that allows a DBA to FIT an index to a dataset and workload by being able to balance lookup performance and space consumption. To navigate this tradeoff, we provide a cost model that helps determine an appropriate error parameter given either (1) a lookup latency requirement (e.g., 500ns) or (2) a storage budget (e.g., 100MB). Using a variety of real-world datasets, we show that our index is able to provide performance that is comparable to full index structures while reducing the storage footprint by orders of magnitude.

Comments:	18 pages
Subjects:	Databases (cs.DB)
Cite as:	arXiv:1801.10207 [cs.DB]
	(or arXiv:1801.10207v2 [cs.DB] for this version)
	https://doi.org/10.48550/arXiv.1801.10207
Journal reference:	SIGMOD (2019) 1189-1206
Related DOI:	https://doi.org/10.1145/3299869.3319860

Submission history

From: Alex Galakatos [view email]
[v1] Tue, 30 Jan 2018 20:22:53 UTC (372 KB)
[v2] Wed, 25 Mar 2020 14:24:15 UTC (4,491 KB)

Computer Science > Databases

Title:FITing-Tree: A Data-aware Index Structure

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Databases

Title:FITing-Tree: A Data-aware Index Structure

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators