Adaptive Indexing for Approximate Query Processing in Exploratory Data Analysis

Maroulis, Stavros; Bikakis, Nikos; Stamatopoulos, Vassilis; Papastefanatos, George

Computer Science > Databases

arXiv:2505.19872 (cs)

[Submitted on 26 May 2025]

Title:Adaptive Indexing for Approximate Query Processing in Exploratory Data Analysis

Authors:Stavros Maroulis, Nikos Bikakis, Vassilis Stamatopoulos, George Papastefanatos

View PDF HTML (experimental)

Abstract:Minimizing data-to-analysis time while enabling real-time interaction and efficient analytical computations on large datasets are fundamental objectives of contemporary exploratory systems. Although some of the recent adaptive indexing and on-the-fly processing approaches address most of these needs, there are cases, where they do not always guarantee reliable performance. Some examples of such cases include: exploring areas with a high density of objects; executing the first exploratory queries or exploring previously unseen areas (where the index has not yet adapted sufficiently); and working with very large data files on commodity hardware, such as low-specification laptops. In such demanding cases, approximate and incremental techniques can be exploited to ensure efficiency and scalability by allowing users to prioritize response time over result accuracy, acknowledging that exact results are not always necessary. Therefore, approximation mechanisms that enable smooth user interaction by defining the trade-off between accuracy and performance based on vital factors (e.g., task, preferences, available resources) are of great importance. Considering the aforementioned, in this work, we present an adaptive approximate query processing framework for interactive on-the-fly analysis (with out a preprocessing phase) over large raw data. The core component of the framework is a main-memory adaptive indexing scheme (VALINOR-A) that interoperates with user-driven sampling and incremental aggregation computations. Additionally, an effective error-bounded approximation strategy is designed and integrated in the query processing process. We conduct extensive experiments using both real and synthetic datasets, demonstrating the efficiency and effectiveness of the proposed framework.

Comments:	Keywords: Approximate aggregations, Incremental indexing, User-driven sampling, On-the-fly data analysis, Error-bounded queries, Data visualization, Visual analytics, Aggregation queries, Big data, Human-Data interaction
Subjects:	Databases (cs.DB)
MSC classes:	97R50, 68P05, 68P15
ACM classes:	H.3.1; H.2.4; E.1
Cite as:	arXiv:2505.19872 [cs.DB]
	(or arXiv:2505.19872v1 [cs.DB] for this version)
	https://doi.org/10.48550/arXiv.2505.19872

Submission history

From: Nikos Bikakis [view email]
[v1] Mon, 26 May 2025 11:57:47 UTC (1,921 KB)

Computer Science > Databases

Title:Adaptive Indexing for Approximate Query Processing in Exploratory Data Analysis

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Databases

Title:Adaptive Indexing for Approximate Query Processing in Exploratory Data Analysis

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators