-
Validating Temporal Compliance Patterns: A Unified Approach with $MTL_f$ over various Data Models
Authors:
Nesma M. Zaki,
Iman M. A. Helal,
Ehab E. Hassanein,
Ahmed Awad
Abstract:
Process mining extracts valuable insights from event data to help organizations improve their business processes, which is essential for their growth and success. By leveraging process mining techniques, organizations gain a comprehensive understanding of their processes' execution, enabling the discovery of process models, detection of deviations, identification of bottlenecks, and assessment of…
▽ More
Process mining extracts valuable insights from event data to help organizations improve their business processes, which is essential for their growth and success. By leveraging process mining techniques, organizations gain a comprehensive understanding of their processes' execution, enabling the discovery of process models, detection of deviations, identification of bottlenecks, and assessment of performance. Compliance checking, a specific area within conformance checking, ensures that the organizational activities adhere to prescribed process models and regulations. Linear Temporal Logic over finite traces ($LTL_{f}$ ) is commonly used for conformance checking, but it may not capture all temporal aspects accurately. This paper proposes Metric Temporal Logic over finite traces ($MTL_{f}$ ) to define explicit time-related constraints effectively in addition to the implicit time-ordering covered by $LTL_f$. Therefore, it provides a universal formal approach to capture compliance rules. Moreover, we define a minimal set of generic $MTL_f$ formulas and show that they are capable of capturing all the common patterns for compliance rules. As compliance validation is largely driven by the data model used to represent the event logs, we provide a mapping from $MTL_f$ to the common data models we found in the literature to encode event logs, namely, the relational and the graph models. A comprehensive study comparing various data models and an empirical evaluation across real-life event logs demonstrates the effectiveness of the proposed approach.
△ Less
Submitted 12 June, 2024;
originally announced June 2024.
-
Efficient Checking of Timed Order Compliance Rules over Graph-encoded Event Logs
Authors:
Nesma M. Zaki,
Iman M. A. Helal,
Ahmed Awad,
Ehab E. Hassanein
Abstract:
Validation of compliance rules against process data is a fundamental functionality for business process management. Over the years, the problem has been addressed for different types of process data, i.e., process models, process event data at runtime, and event logs representing historical execution. Several approaches have been proposed to tackle compliance checking over process logs. These appr…
▽ More
Validation of compliance rules against process data is a fundamental functionality for business process management. Over the years, the problem has been addressed for different types of process data, i.e., process models, process event data at runtime, and event logs representing historical execution. Several approaches have been proposed to tackle compliance checking over process logs. These approaches have been based on different data models and storage technologies including relational databases, graph databases, and proprietary formats. Graph-based encoding of event logs is a promising direction that turns several process analytics tasks into queries on the underlying graph. Compliance checking is one class of such analysis tasks. In this paper, we argue that encoding log data as graphs alone is not enough to guarantee efficient processing of queries on this data. Efficiency is important due to the interactive nature of compliance checking. Thus, compliance checking would benefit from sub-linear scanning of the data. Moreover, as more data are added, e.g., new batches of logs arrive, the data size should grow sub-linearly to optimize both the space of storage and time for querying. We propose two encoding methods using graph representation, realized in Neo4J, and show the benefits of these encoding on a special class of queries, namely timed order compliance rules. Compared to a baseline encoding, our experiments show up to 5x speed up in the querying time as well as a 3x reduction in the graph size.
△ Less
Submitted 19 June, 2022;
originally announced June 2022.
-
Correlating Unlabeled Events at Runtime
Authors:
Iman M. A. Helal,
Ahmed Awad
Abstract:
Process mining is of great importance for both data-centric and process-centric systems. Process mining receives so-called process logs which are collections of partially-ordered events. An event has to possess at least three attributes, case ID, task ID and a timestamp for mining approaches to work. When a case ID is unknown, the event is called unlabeled. Traditionally, process mining is an offl…
▽ More
Process mining is of great importance for both data-centric and process-centric systems. Process mining receives so-called process logs which are collections of partially-ordered events. An event has to possess at least three attributes, case ID, task ID and a timestamp for mining approaches to work. When a case ID is unknown, the event is called unlabeled. Traditionally, process mining is an offline task, where events are collected from different sources are usually manually correlated. That is, events belonging to the same instance are assigned the same case ID. With today's high-volume/high-speed nature of, e.g., IoT applications, process mining shifts to be an online task. For this, event correlation has to be automated and has to occur as the data is generated. In this paper, we introduce an approach that correlates unlabeled events at runtime. Given a process model, a stream of unlabeled events and other information about task duration, our approach can induce a case identifier to a set of unlabeled events with a trust percentage. It can also check the conformance of the identified cases with the process model. A prototype of the proposed approach was implemented and evaluated against real-life and synthetic logs.
△ Less
Submitted 19 April, 2020;
originally announced April 2020.