Prefix Imputation of Orphan Events in Event Stream Processing

2021 ◽  
Vol 4 ◽  
Author(s):  
Rashid Zaman ◽  
Marwan Hassani ◽  
Boudewijn F. Van Dongen

In the context of process mining, event logs consist of process instances called cases. Conformance checking is a process mining task that inspects whether a log file conforms to an existing process model, and it additionally quantifies the conformance in an explainable manner. Online conformance checking processes streaming event logs to maintain precise insights into the running cases and to mitigate non-conformance, if any, in a timely manner. State-of-the-art online conformance checking approaches bound memory either by delimiting the storage of events per case or by limiting the number of cases to a specific window width. The former technique still requires unbounded memory because the number of cases to store is unlimited, while the latter forgets running, not yet concluded, cases in order to respect the limited window width. Consequently, the processing system may later encounter events, referred to as orphan events, that represent some intermediate activity of the process model but whose relevant case has already been forgotten. The naïve ways to cope with an orphan event are to either exclude its relevant case from conformance checking or to treat it as an altogether new case. However, this might result in misleading process insights, for instance overestimated non-conformance. In order to bound memory yet effectively incorporate orphan events into processing, we propose a missing-prefix imputation approach for such orphan events. Our approach utilizes the existing process model to impute the missing prefix. Furthermore, we leverage case storage management to increase the accuracy of the prefix prediction: we propose a systematic forgetting mechanism that distinguishes and forgets the cases that can be reliably regenerated as a prefix upon receipt of a future orphan event. We evaluate the efficacy of the proposed approach through multiple experiments with a synthetic and three real event logs in a simulated streaming setting. Our approach achieves considerably more realistic conformance statistics than the state of the art while requiring the same amount of storage.
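To make the idea concrete, the following minimal Python sketch illustrates prefix imputation for orphan events under simplifying assumptions: the process model is reduced to a directed graph of allowed activity transitions, and the imputed prefix is a shortest model path from a start activity to the orphaned activity. The names (CaseStore, impute_prefix) and the bounded-storage policy are illustrative, not the authors' implementation.

```python
# A minimal sketch of prefix imputation for orphan events, assuming the
# process model is given as a directed graph of allowed activity transitions.
# Names such as CaseStore and impute_prefix are illustrative, not the
# authors' implementation.
from collections import OrderedDict, deque

class CaseStore:
    """Bounded storage of running cases (case_id -> list of activities)."""
    def __init__(self, max_cases):
        self.max_cases = max_cases
        self.cases = OrderedDict()

    def add(self, case_id, activity):
        if case_id in self.cases:
            self.cases.move_to_end(case_id)
            self.cases[case_id].append(activity)
        else:
            if len(self.cases) >= self.max_cases:
                self.cases.popitem(last=False)   # forget the oldest case
            self.cases[case_id] = [activity]

def shortest_path(model, src, dst):
    # plain BFS over the transition graph {activity: {successor activities}}
    queue, seen = deque([[src]]), {src}
    while queue:
        path = queue.popleft()
        if path[-1] == dst:
            return path
        for nxt in model.get(path[-1], ()):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(path + [nxt])
    return None

def impute_prefix(model, start_activities, activity):
    """Shortest model path from a start activity to `activity`,
    used as an imputed prefix for an orphan event."""
    best = None
    for start in start_activities:
        path = shortest_path(model, start, activity)
        if path and (best is None or len(path) < len(best)):
            best = path
    return best[:-1] if best else []   # exclude the observed activity itself
```

In this sketch, when an event arrives for a case that is no longer stored and whose activity is not a start activity, the imputed prefix would be prepended before conformance checking continues as usual.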

2017 ◽  
Vol 01 (01) ◽  
pp. 1630004 ◽  
Author(s):  
Asef Pourmasoumi ◽  
Ebrahim Bagheri

One of the most valuable assets of an organization is its organizational data. The analysis and mining of this potential hidden treasure can provide much added value for the organization. Process mining is an emerging area that can help organizations understand the status quo, check for compliance, and plan for improving their processes. The aim of process mining is to extract knowledge from the event logs of today's organizational information systems. Process mining includes three main types: discovering process models from event logs, conformance checking, and organizational mining. In this paper, we briefly introduce process mining and review some of its most important techniques. We also investigate some applications of process mining in industry and present some of the most important challenges faced in this area.


Author(s):  
Bruna Brandão ◽  
Flávia Santoro ◽  
Leonardo Azevedo

In business process models, elements can be scattered (repeated) across different processes, making it difficult to handle changes, analyze processes for improvement, or check crosscutting impacts. These scattered elements are called aspects. Similar to the aspect-oriented paradigm in programming languages, aspect handling in BPM aims to modularize the crosscutting concerns spread across the models. This modularization facilitates the management of the process (reuse, maintenance, and understanding). Current approaches identify aspects manually, which results in subjectivity and a lack of systematization. This paper proposes a method to automatically identify aspects in a business process from its event logs. The method is based on mining techniques and aims to remove the subjectivity of identification performed by specialists. Initial results from a preliminary evaluation show evidence that the method correctly identified the aspects present in the process model.
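As a rough illustration of the kind of automation the paper argues for, the sketch below flags activities that recur across a large share of distinct trace variants as candidate aspects. The threshold and the variant-based counting are assumptions for illustration, not the method proposed in the paper.

```python
# Illustrative sketch: flag activities that recur across many distinct
# process variants as candidate aspects. Thresholds and names are
# assumptions, not the method described in the paper.
from collections import defaultdict

def candidate_aspects(event_log, min_share=0.6):
    """event_log: iterable of traces, each a list of activity names.
    Returns activities occurring in at least `min_share` of the
    distinct trace variants."""
    variants = {tuple(trace) for trace in event_log}
    counts = defaultdict(int)
    for variant in variants:
        for activity in set(variant):
            counts[activity] += 1
    return {a for a, c in counts.items() if c / len(variants) >= min_share}
```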


2021 ◽  
Vol 10 (9) ◽  
pp. 144-147
Author(s):  
Huiling LI ◽  
Xuan SU ◽  
Shuaipeng ZHANG

Massive amounts of business process event logs are collected and stored by modern information systems. Model discovery aims to discover a process model from such event logs; however, most existing approaches still suffer from low efficiency when facing large-scale event logs. Event log sampling techniques provide an effective scheme for improving the efficiency of process discovery, but existing techniques still cannot guarantee the quality of model mining. Therefore, a sampling approach based on a set coverage algorithm, named the set coverage sampling approach, is proposed. The proposed sampling approach has been implemented in the open-source process mining toolkit ProM. Furthermore, experiments on a real event log data set, covering conformance checking and time performance analysis, show that the proposed event log sampling approach can greatly improve the efficiency of log sampling while ensuring the quality of model mining.
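The following hedged sketch shows the general shape of set-coverage-based log sampling: trace variants are greedily selected until every directly-follows pair observed in the full log is covered. The paper's algorithm and its ProM implementation may differ in the coverage criterion and tie-breaking.

```python
# A hedged sketch of set-coverage-based trace sampling: greedily pick
# trace variants until every directly-follows pair observed in the full
# log is covered. This illustrates the idea only; the paper's algorithm
# and its ProM implementation may differ.
def df_pairs(trace):
    return {(a, b) for a, b in zip(trace, trace[1:])}

def set_cover_sample(event_log):
    variants = [list(t) for t in {tuple(t) for t in event_log}]
    universe = set().union(*(df_pairs(t) for t in variants)) if variants else set()
    sample, covered = [], set()
    while covered != universe:
        # pick the variant adding the most uncovered directly-follows pairs
        best = max(variants, key=lambda t: len(df_pairs(t) - covered))
        gain = df_pairs(best) - covered
        if not gain:
            break
        sample.append(best)
        covered |= gain
    return sample
```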


2018 ◽  
Vol 27 (02) ◽  
pp. 1850002
Author(s):  
Sung-Hyun Sim ◽  
Hyerim Bae ◽  
Yulim Choi ◽  
Ling Liu

In big data and IoT environments, process execution generates huge volumes of data, some of which is subsequently obtained from sensors. The main issue in such areas has been the necessity of analyzing these data in order to suggest enhancements to processes. In this regard, the evaluation of process model conformance to the execution log is of great importance. For this purpose, previous process mining approaches have advocated conformance checking by a fitness measure, a procedure that uses token replay and node-arc relations based on Petri nets. However, the fitness measure has so far not considered statistical significance; it offers only a numeric ratio. We herein propose a statistical verification method based on the Kolmogorov–Smirnov (K–S) test to judge whether two different log datasets follow the same process model. Our method can easily be extended to determine whether a process execution actually follows a process model, by playing out the model and generating event log data from it. Additionally, in order to address the trade-off between model abstraction and process conformance, we propose the new concepts of the Confidence Interval of Abstraction Value (CIAV) and the Maximum Confidence Abstraction Value (MCAV). We show that our method can be applied to any process mining algorithm (e.g. heuristic mining, fuzzy mining) that has parameters related to model abstraction. We expect that our method will be widely utilized in applications dealing with business process enhancement involving process-model and execution-log analyses.
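A minimal sketch of the statistical idea is shown below: two event logs are compared with a two-sample Kolmogorov–Smirnov test on a per-trace statistic. Here the statistic is simply trace length for illustration; the paper defines its test on fitness-related measures, and the significance level is an assumption.

```python
# A minimal sketch of the statistical idea: compare two event logs via a
# two-sample Kolmogorov-Smirnov test on a per-trace statistic (here simply
# trace length; the paper's test is defined on fitness-related measures).
# The choice of statistic and the significance level are assumptions.
from scipy.stats import ks_2samp

def same_process(log_a, log_b, alpha=0.05):
    """log_a, log_b: lists of traces (lists of activities)."""
    stat_a = [len(trace) for trace in log_a]
    stat_b = [len(trace) for trace in log_b]
    result = ks_2samp(stat_a, stat_b)
    # failing to reject H0 means no evidence the logs follow different models
    return result.pvalue >= alpha, result
```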


2021 ◽  
pp. 73-82
Author(s):  
Dorina Bano ◽  
Tom Lichtenstein ◽  
Finn Klessascheck ◽  
Mathias Weske

Process mining is widely adopted in organizations to gain deep insights into running business processes. This can be achieved by applying different process mining techniques such as discovery, conformance checking, and performance analysis. These techniques are applied to event logs, which need to be extracted from the organization's databases beforehand. This implies not only access to the databases but also detailed knowledge of the database schema, which is often not available. In many real-world scenarios, however, process execution data is available as redo logs. Such logs are used to bring a database into a consistent state in case of a system failure. This paper proposes a semi-automatic approach to extract an event log from redo logs alone. It requires neither access to the database nor knowledge of the database schema. The feasibility of the proposed approach is evaluated on two synthetic redo logs.
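The sketch below illustrates the general idea of deriving an event log from redo-log records: each logged operation becomes an event, with the affected row serving as the case identifier. The record layout, column names, and CSV input format are assumptions for illustration and do not reflect the paper's approach in detail.

```python
# An illustrative sketch of turning database redo-log records into an event
# log: each logged operation on a table becomes an event, with the affected
# row's id serving as the case identifier. The record layout, column names
# and CSV format are assumptions, not the paper's approach in detail.
import csv

def redo_to_event_log(redo_csv_path):
    """Expects rows with columns: timestamp, operation, table, row_id."""
    events = []
    with open(redo_csv_path, newline="") as f:
        for row in csv.DictReader(f):
            events.append({
                "case_id": f'{row["table"]}:{row["row_id"]}',
                "activity": f'{row["operation"]} {row["table"]}',
                "timestamp": row["timestamp"],
            })
    events.sort(key=lambda e: e["timestamp"])
    return events
```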


2021 ◽  
Author(s):  
Ashok Kumar Saini ◽  
Ruchi Kamra ◽  
Utpal Shrivastava

Conformance checking (CC) techniques enable us to quantify the deviation between modelled behavior and actual execution behavior. The majority of organizations have Process-Aware Information Systems that record insights about the system, and they maintain process models that show how a process should be executed. The key intention of process mining is to extract facts from the event log and use them for the analysis, ratification, improvement, and redesign of a process. Researchers have proposed various CC techniques for specific applications and process models. This paper presents a detailed study of the key concepts and contributions of process mining and of how it helps in achieving business goals. The current challenges and opportunities in process mining are also discussed. The survey covers CC techniques proposed by researchers, organized along key aspects such as quality parameters, perspectives, algorithm types, tools, and achievements.


2018 ◽  
Vol 7 (4) ◽  
pp. 2446
Author(s):  
Muktikanta Sahu ◽  
Rupjit Chakraborty ◽  
Gopal Krishna Nayak

Building process models from the data available in event logs is the primary objective of process discovery. The Alpha algorithm is one of the popular algorithms for deriving a process model from event logs in process mining. The steps involved in the Alpha algorithm are computationally intensive, and this problem is compounded by exponentially growing event log data. In this work, we exploit task parallelism in the Alpha algorithm for process discovery using the MPI programming model. The proposed work relies on the distributed-memory parallelism available in MPI to improve performance. Independent and computationally intensive steps in the Alpha algorithm are identified and their task parallelism is exploited. The execution times of the serial and parallel implementations of the Alpha algorithm are measured and used to calculate the speedup achieved. The maximum and minimum speedups obtained are 3.97x and 3.88x respectively, with an average speedup of 3.94x.
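A minimal mpi4py sketch of this kind of task parallelism is given below: the log is split across ranks, each rank computes directly-follows relations on its chunk, and rank 0 merges the partial results before deriving the Alpha causal relation. This is an illustrative reconstruction, not the authors' implementation (which may use a different MPI binding or decomposition); load_event_log is a hypothetical loader.

```python
# A minimal mpi4py sketch of the described task parallelism: the log is
# split across ranks, each rank computes directly-follows relations on its
# chunk, and rank 0 merges them before deriving the Alpha causal relation.
# Illustrative only; load_event_log() is a hypothetical loader.
from mpi4py import MPI

def df_relations(traces):
    rel = set()
    for trace in traces:
        rel.update(zip(trace, trace[1:]))
    return rel

comm = MPI.COMM_WORLD
rank, size = comm.Get_rank(), comm.Get_size()

if rank == 0:
    event_log = load_event_log()                    # hypothetical loader
    chunks = [event_log[i::size] for i in range(size)]
else:
    chunks = None

local = df_relations(comm.scatter(chunks, root=0))  # each rank works on its chunk
partial = comm.gather(local, root=0)

if rank == 0:
    directly_follows = set().union(*partial)
    # causal relation of the Alpha algorithm: a -> b iff a > b and not b > a
    causal = {(a, b) for (a, b) in directly_follows
              if (b, a) not in directly_follows}
```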


2014 ◽  
Vol 2014 ◽  
pp. 1-8
Author(s):  
Weidong Zhao ◽  
Xi Liu ◽  
Weihui Dai

Process mining is the automated acquisition of process models from event logs. Although many process mining techniques have been developed, most of them are based on control flow. Meanwhile, existing role-oriented process mining methods focus on the correctness and integrity of roles while ignoring the role complexity of the process model, which directly impacts its understandability and quality. To address these problems, we propose a genetic programming approach to mine simplified process models. Using a new metric of process complexity in terms of roles as the fitness function, we can find simpler process models. The new role complexity metric is derived from role cohesion and coupling and is applied to discover roles in process models. Moreover, the higher fitness derived from the role complexity metric also provides a guideline for redesigning process models. Finally, we conduct a case study and experiments showing that the proposed method is more effective at streamlining the process than related approaches.
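As a rough illustration of a role-based fitness, the sketch below rewards steps handled within a single role (cohesion) and penalizes handovers between roles (coupling). The exact metric, the weights, and the name role_fitness are assumptions for illustration, not the metric defined in the paper.

```python
# A hedged sketch of a role-complexity fitness of the kind the paper
# describes: cohesion rewards steps kept inside one role, coupling
# penalizes handovers between roles. Metric and weights are assumptions.
def role_fitness(traces, role_of, w_cohesion=1.0, w_coupling=1.0):
    """traces: lists of activities; role_of: activity -> role name."""
    within = across = 0
    for trace in traces:
        for a, b in zip(trace, trace[1:]):
            if role_of[a] == role_of[b]:
                within += 1          # cohesive step inside one role
            else:
                across += 1          # handover between roles (coupling)
    total = within + across or 1
    cohesion, coupling = within / total, across / total
    return w_cohesion * cohesion - w_coupling * coupling
```

In a genetic programming setting, a fitness of this kind would steer the search toward models whose role assignments keep related work together while minimizing handovers.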


Energies ◽  
2020 ◽  
Vol 13 (24) ◽  
pp. 6630
Author(s):  
Marcin Szpyrka ◽  
Edyta Brzychczy ◽  
Aneta Napieraj ◽  
Jacek Korski ◽  
Grzegorz J. Nalepa

Conformance checking is a process mining technique that compares a process model with an event log of the same process to check whether the current execution stored in the log conforms to the model and vice versa. This paper deals with the conformance checking of a longwall shearer process. The approach uses place-transition Petri nets with inhibitor arcs for modeling purposes. We use event log files collected from a few coal mines located in Poland by Famur S.A., one of the global suppliers of coal mining machines. One of the main advantages of the approach is the possibility for both offline and online analysis of the log data. The paper presents a detailed description of the longwall process, an original formal model we developed, selected elements of the approach’s implementation and the results of experiments.
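For illustration, the following sketch replays a trace on a place-transition net with inhibitor arcs: a transition is enabled when all of its input places hold a token and every place connected to it by an inhibitor arc is empty. The dictionary-based net encoding is an assumption, not the formal model developed in the paper.

```python
# A minimal sketch of replaying a trace on a place-transition net with
# inhibitor arcs: a transition is enabled when all input places hold a
# token and all places connected by inhibitor arcs are empty. The net
# encoding is an assumption for illustration, not the paper's model.
def enabled(marking, transition):
    return (all(marking.get(p, 0) >= 1 for p in transition["in"]) and
            all(marking.get(p, 0) == 0 for p in transition.get("inhibit", ())))

def replay(trace, net, initial_marking):
    """net: {activity: {"in": [...], "out": [...], "inhibit": [...]}}.
    Returns the number of conforming (replayable) events in the trace."""
    marking, fit = dict(initial_marking), 0
    for activity in trace:
        t = net.get(activity)
        if t and enabled(marking, t):
            for p in t["in"]:
                marking[p] -= 1
            for p in t["out"]:
                marking[p] = marking.get(p, 0) + 1
            fit += 1
    return fit
```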

