Semantic and Influence aware k-Representative Queries over Social Streams

Wang, Yanhao; Li, Yuchen; Tan, Kian-Lee

doi:10.5441/002/edbt.2019.17

Computer Science > Social and Information Networks

arXiv:1901.10109 (cs)

[Submitted on 29 Jan 2019]

Title:Semantic and Influence aware k-Representative Queries over Social Streams

Authors:Yanhao Wang, Yuchen Li, Kian-Lee Tan

View PDF

Abstract:Massive volumes of data continuously generated on social platforms have become an important information source for users. A primary method to obtain fresh and valuable information from social streams is \emph{social search}. Although there have been extensive studies on social search, existing methods only focus on the \emph{relevance} of query results but ignore the \emph{representativeness}. In this paper, we propose a novel Semantic and Influence aware $k$-Representative ($k$-SIR) query for social streams based on topic modeling. Specifically, we consider that both user queries and elements are represented as vectors in the topic space. A $k$-SIR query retrieves a set of $k$ elements with the maximum \emph{representativeness} over the sliding window at query time w.r.t. the query vector. The representativeness of an element set comprises both semantic and influence scores computed by the topic model. Subsequently, we design two approximation algorithms, namely \textsc{Multi-Topic ThresholdStream} (MTTS) and \textsc{Multi-Topic ThresholdDescend} (MTTD), to process $k$-SIR queries in real-time. Both algorithms leverage the ranked lists maintained on each topic for $k$-SIR processing with theoretical guarantees. Extensive experiments on real-world datasets demonstrate the effectiveness of $k$-SIR query compared with existing methods as well as the efficiency and scalability of our proposed algorithms for $k$-SIR processing.

Comments:	27 pages, 14 figures, to appear in the 22nd International Conference on Extending Database Technology (EDBT 2019)
Subjects:	Social and Information Networks (cs.SI); Databases (cs.DB)
Cite as:	arXiv:1901.10109 [cs.SI]
	(or arXiv:1901.10109v1 [cs.SI] for this version)
	https://doi.org/10.48550/arXiv.1901.10109
Related DOI:	https://doi.org/10.5441/002/edbt.2019.17

Submission history

From: Yanhao Wang [view email]
[v1] Tue, 29 Jan 2019 05:25:33 UTC (841 KB)

Computer Science > Social and Information Networks

Title:Semantic and Influence aware k-Representative Queries over Social Streams

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Social and Information Networks

Title:Semantic and Influence aware k-Representative Queries over Social Streams

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators