Mining Non-Redundant Local Process Models From Sequence Databases

Tax, Niek; Dumas, Marlon

Computer Science > Data Structures and Algorithms

arXiv:1712.04159 (cs)

[Submitted on 12 Dec 2017 (v1), last revised 21 Sep 2018 (this version, v2)]

Title:Mining Non-Redundant Local Process Models From Sequence Databases

Authors:Niek Tax, Marlon Dumas

View PDF

Abstract:Sequential pattern mining techniques extract patterns corresponding to frequent subsequences from a sequence database. A practical limitation of these techniques is that they overload the user with too many patterns. Local Process Model (LPM) mining is an alternative approach coming from the field of process mining. While in traditional sequential pattern mining, a pattern describes one subsequence, an LPM captures a set of subsequences. Also, while traditional sequential patterns only match subsequences that are observed in the sequence database, an LPM may capture subsequences that are not explicitly observed, but that are related to observed subsequences. In other words, LPMs generalize the behavior observed in the sequence database. These properties make it possible for a set of LPMs to cover the behavior of a much larger set of sequential patterns. Yet, existing LPM mining techniques still suffer from the pattern explosion problem because they produce sets of redundant LPMs. In this paper, we propose several heuristics to mine a set of non-redundant LPMs either from a set of redundant LPMs or from a set of sequential patterns. We empirically compare the proposed heuristics between them and against existing (local) process mining techniques in terms of coverage, redundancy, and complexity of the produced sets of LPMs.

Subjects:	Data Structures and Algorithms (cs.DS); Artificial Intelligence (cs.AI); Databases (cs.DB)
Cite as:	arXiv:1712.04159 [cs.DS]
	(or arXiv:1712.04159v2 [cs.DS] for this version)
	https://doi.org/10.48550/arXiv.1712.04159

Submission history

From: Niek Tax [view email]
[v1] Tue, 12 Dec 2017 08:03:50 UTC (553 KB)
[v2] Fri, 21 Sep 2018 06:51:54 UTC (618 KB)

Computer Science > Data Structures and Algorithms

Title:Mining Non-Redundant Local Process Models From Sequence Databases

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Data Structures and Algorithms

Title:Mining Non-Redundant Local Process Models From Sequence Databases

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators