Engineering a Distributed Full-Text Index

Fischer, Johannes; Kurpicz, Florian; Sanders, Peter

Computer Science > Data Structures and Algorithms

arXiv:1610.03332v1 (cs)

[Submitted on 11 Oct 2016 (this version), latest version 5 Dec 2016 (v2)]

Title:Engineering a Distributed Full-Text Index

Authors:Johannes Fischer, Florian Kurpicz, Peter Sanders

View PDF

Abstract:We present a distributed full-text index for big data applications in a distributed environment. The index can be used to answer different types of pattern matching queries (existential, counting and enumeration) and also be extended to answer document retrieval queries (counting, retrieve and top-$k$). We also show that succinct data structures are indeed useful for big data applications, as their low memory consumption allows us to build indices for larger slices of text in the main memory.

Subjects:	Data Structures and Algorithms (cs.DS)
Cite as:	arXiv:1610.03332 [cs.DS]
	(or arXiv:1610.03332v1 [cs.DS] for this version)
	https://doi.org/10.48550/arXiv.1610.03332

Submission history

From: Florian Kurpicz [view email]
[v1] Tue, 11 Oct 2016 13:46:54 UTC (219 KB)
[v2] Mon, 5 Dec 2016 21:39:22 UTC (146 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.DS

< prev | next >

new | recent | 2016-10

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Johannes Fischer
Florian Kurpicz
Peter Sanders

export BibTeX citation

Computer Science > Data Structures and Algorithms

Title:Engineering a Distributed Full-Text Index

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Data Structures and Algorithms

Title:Engineering a Distributed Full-Text Index

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators