Discovering Process Horizontal Boundaries to Facilitate Process Comprehension

Automated discovery of a process model is a major task of Process Mining that means to produce a process model from an event log, without any a-priori information. However, when an event log contains a large number of distinct activities, process discovery can be real challenging. The goal of this article is to facilitate process discovery in such cases when a process is expected to contain a large set of unique activities. To this end, this article proposes a clustering approach that recommends horizontal boundaries for the process. The proposed approach ultimately partitions the event log in a way that human interpretation efforts are decomposed. In addition, it makes automated discovery more efficient as well as effective by simultaneously considering two quality criteria: informativeness and robustness of the derived groups of activities. The authors conducted several experiments to test the behavior of the algorithm under different settings, and to compare it against other techniques. Finally, they provide a set of recommendations that may help process analysts during the process discovery endeavor.

Download Full-text

Role of Stochastic Petri Net (SPN) in Process Discovery for Modelling and Analysis

Mathematical Problems in Engineering ◽

10.1155/2021/8699164 ◽

2021 ◽

Vol 2021 ◽

pp. 1-7

Author(s):

Shabnam Shahzadi ◽

Xianwen Fang ◽

David Anekeya Alilah

Keyword(s):

Petri Net ◽

Business Processes ◽

Time Perspective ◽

Process Mining ◽

Vital Role ◽

Process Discovery ◽

Stochastic Petri Net ◽

Event Log ◽

The Stability ◽

Generalized Stochastic Petri Net

For exploitation and extraction of an event’s data that has vital information which is related to the process from the event log, process mining is used. There are three main basic types of process mining as explained in relation to input and output. These are process discovery, conformance checking, and enhancement. Process discovery is one of the most challenging process mining activities based on the event log. Business processes or system performance plays a vital role in modelling, analysis, and prediction. Recently, a memoryless model such as exponential distribution of the stochastic Petri net SPN has gained much attention in research and industry. This paper uses time perspective for modelling and analysis and uses stochastic Petri net to check the performance, evolution, stability, and reliability of the model. To assess the effect of time delay in firing the transition, stochastic reward net SRN model is used. The model can also be used in checking the reliability of the model, whereas the generalized stochastic Petri net GSPN is used for evaluation and checking the performance of the model. SPN is used to analyze the probability of state transition and the stability from one state to another. However, in process mining, logs are used by linking log sequence with the state and, by this, modelling can be done, and its relation with stability of the model can be established.

Download Full-text

https://ijsea.com/archive/volume10/issue9/IJSEA10091005.pdf

International Journal of Science and Engineering Applications ◽

10.7753/ijsea1009.1006 ◽

2021 ◽

Vol 10 (9) ◽

pp. 144-147

Author(s):

Huiling LI ◽

Xuan SU ◽

Shuaipeng ZHANG

Keyword(s):

Process Model ◽

Large Scale ◽

Process Mining ◽

Data Set ◽

Systems Model ◽

Event Logs ◽

Event Log ◽

Low Efficiency ◽

Sampling Approach

Massive amounts of business process event logs are collected and stored by modern information systems. Model discovery aims to discover a process model from such event logs, however, most of the existing approaches still suffer from low efficiency when facing large-scale event logs. Event log sampling techniques provide an effective scheme to improve the efficiency of process discovery, but the existing techniques still cannot guarantee the quality of model mining. Therefore, a sampling approach based on set coverage algorithm named set coverage sampling approach is proposed. The proposed sampling approach has been implemented in the open-source process mining toolkit ProM. Furthermore, experiments using a real event log data set from conformance checking and time performance analysis show that the proposed event log sampling approach can greatly improve the efficiency of log sampling on the premise of ensuring the quality of model mining.

Download Full-text

The RALph miner for automated discovery and verification of resource-aware process models

Software & Systems Modeling ◽

10.1007/s10270-020-00820-7 ◽

2020 ◽

Vol 19 (6) ◽

pp. 1415-1441

Author(s):

Cristina Cabanillas ◽

Lars Ackermann ◽

Stefan Schönig ◽

Christian Sturm ◽

Jan Mendling

Keyword(s):

Process Model ◽

Model Verification ◽

Process Models ◽

Resource Assignment ◽

Process Discovery ◽

Event Logs ◽

Different Types ◽

Automated Discovery ◽

And Performance ◽

Resource Aware

Abstract Automated process discovery is a technique that extracts models of executed processes from event logs. Logs typically include information about the activities performed, their timestamps and the resources that were involved in their execution. Recent approaches to process discovery put a special emphasis on (human) resources, aiming at constructing resource-aware process models that contain the inferred resource assignment constraints. Such constraints can be complex and process discovery approaches so far have missed the opportunity to represent expressive resource assignments graphically together with process models. A subsequent verification of the extracted resource-aware process models is required in order to check the proper utilisation of resources according to the resource assignments. So far, research on discovering resource-aware process models has assumed that models can be put into operation without modification and checking. Integrating resource mining and resource-aware process model verification faces the challenge that different types of resource assignment languages are used for each task. In this paper, we present an integrated solution that comprises (i) a resource mining technique that builds upon a highly expressive graphical notation for defining resource assignments; and (ii) automated model-checking support to validate the discovered resource-aware process models. All the concepts reported in this paper have been implemented and evaluated in terms of feasibility and performance.

Download Full-text

Functional Integration with Process Mining and Process Analyzing for Structural and Behavioral Properness Validation of Processes Discovered from Event Log Datasets

Applied Sciences ◽

10.3390/app10041493 ◽

2020 ◽

Vol 10 (4) ◽

pp. 1493 ◽

Cited By ~ 1

Author(s):

Kwanghoon Pio Kim

Keyword(s):

Process Model ◽

Large Scale ◽

Process Mining ◽

Functional Integration ◽

Structural Complexity ◽

Integrated Approach ◽

Parallel Process ◽

Process Models ◽

Massively Parallel ◽

Event Log

In this paper, we propose an integrated approach for seamlessly and effectively providing the mining and the analyzing functionalities to redesigning work for very large-scale and massively parallel process models that are discovered from their enactment event logs. The integrated approach especially aims at analyzing not only their structural complexity and correctness but also their animation-based behavioral properness, and becomes concretized to a sophisticated analyzer. The core function of the analyzer is to discover a very large-scale and massively parallel process model from a process log dataset and to validate the structural complexity and the syntactical and behavioral properness of the discovered process model. Finally, this paper writes up the detailed description of the system architecture with its functional integration of process mining and process analyzing. More precisely, we excogitate a series of functional algorithms for extracting the structural constructs and for visualizing the behavioral properness of those discovered very large-scale and massively parallel process models. As experimental validation, we apply the proposed approach and analyzer to a couple of process enactment event log datasets available on the website of the 4TU.Centre for Research Data.

Download Full-text

Functional Integration with Process Mining and Process Analyzing for Structural and Behavioral Properness Validation of Discovered Processes from Event Log Datasets

10.20944/preprints202002.0122.v1 ◽

2020 ◽

Author(s):

Kwanghoon Kim

Keyword(s):

Process Model ◽

Large Scale ◽

Process Mining ◽

Functional Integration ◽

Structural Complexity ◽

Integrated Approach ◽

Parallel Process ◽

Process Models ◽

Massively Parallel ◽

Event Log

Process (or business process) management systems fulfill defining, executing, monitoring and managing process models deployed on process-aware enterprises. Accordingly, the functional formation of the systems is made up of three subsystems such as modeling subsystem, enacting subsystem and mining subsystem. In recent times, the mining subsystem has been becoming an essential subsystem. Many enterprises have successfully completed the introduction and application of the process automation technology through the modeling subsystem and the enacting subsystem. According as the time has come to the phase of redesigning and reengineering the deployed process models, from now on it is important for the mining subsystem to cooperate with the analyzing subsystem; the essential cooperation capability is to provide seamless integrations between the designing works with the modeling subsystem and the redesigning work with the mining subsystem. In other words, we need to seamlessly integrate the discovery functionality of the mining subsystem and the analyzing functionality of the modeling subsystem. This integrated approach might be suitable very well when those deployed process models discovered by the mining subsystem are complex and very large-scaled, in particular. In this paper, we propose an integrated approach for seamlessly as well as effectively providing the mining and the analyzing functionalities to the redesigning work on very large-scale and massively parallel process models that are discovered from their enactment event logs. The integrated approach especially aims at analyzing not only their structural complexity and correctness but also their animation-based behavioral properness, and becomes concretized to a sophisticated analyzer. The core function of the analyzer is to discover a very large-scale and massively parallel process model from a process log dataset and to validate the structural complexity and the syntactical and behavioral properness of the discovered process model. Finally, this paper writes up the detailed description of the system architecture with its functional integration of process mining and process analyzing. And more precisely, we excogitate a series of functional algorithms for extracting the structural constructs as well as for visualizing the behavioral properness on those discovered very large-scale and massively parallel process models. As experimental validation, we apply the proposed approach and analyzer to a couple of process enactment event log datasets available on the website of the 4TU.Centre for Research Data.

Download Full-text

Conformance Checking Techniques of Process Mining: A Survey

10.3233/apc210213 ◽

2021 ◽

Author(s):

Ashok Kumar Saini ◽

Ruchi Kamra ◽

Utpal Shrivastava

Keyword(s):

Information Systems ◽

Process Model ◽

Process Mining ◽

Quality Parameters ◽

Process Models ◽

Conformance Checking ◽

Business Goals ◽

Key Concepts ◽

Event Log ◽

Challenges And Opportunities

Conformance Checking (CC) techniques enable us to gives the deviation between modelled behavior and actual execution behavior. The majority of organizations have Process-Aware Information Systems for recording the insights of the system. They have the process model to show how the process will be executed. The key intention of Process Mining is to extracting facts from the event log and used them for analysis, ratification, improvement, and redesigning of a process. Researchers have proposed various CC techniques for specific applications and process models. This paper has a detailed study of key concepts and contributions of Process Mining. It also helps in achieving business goals. The current challenges and opportunities in Process Mining are also discussed. The survey is based on CC techniques proposed by researchers with key objectives like quality parameters, perspective, algorithm types, tools, and achievements.

Download Full-text

An Optimization Approach for Mining of Process Models with Infrequent Behaviors Integrating Data Flow and Control Flow

Scientific Programming ◽

10.1155/2021/8874316 ◽

2021 ◽

Vol 2021 ◽

pp. 1-17

Author(s):

Li-li Wang ◽

Xian-wen Fang ◽

Esther Asare ◽

Fang Huan

Keyword(s):

Process Model ◽

Data Flow ◽

Process Mining ◽

Control Flow ◽

Process Models ◽

Frequent Pattern ◽

Optimization Approach ◽

Event Log ◽

Flow Information ◽

And Control

Infrequent behaviors of business process refer to behaviors that occur in very exceptional cases, and their occurrence frequency is low as their required conditions are rarely fulfilled. Hence, a strong coupling relationship between infrequent behavior and data flow exists. Furthermore, some infrequent behaviors may reveal very important information about the process. Thus, not all infrequent behaviors should be disregarded as noise, and identifying infrequent but correct behaviors in the event log is vital to process mining from the perspective of data flow. Existing process mining approaches construct a process model from frequent behaviors in the event log, mostly concentrating on control flow only, without considering infrequent behavior and data flow information. In this paper, we focus on data flow to extract infrequent but correct behaviors from logs. For an infrequent trace, frequent patterns and interactive behavior profiles are combined to find out which part of the behavior in the trace occurs in low frequency. And, conditional dependency probability is used to analyze the influence strength of the data flow information on infrequent behavior. An approach for identifying effective infrequent behaviors based on the frequent pattern under data awareness is proposed correspondingly. Subsequently, an optimization approach for mining of process models with infrequent behaviors integrating data flow and control flow is also presented. The experiments on synthetic and real-life event logs show that the proposed approach can distinguish effective infrequent behaviors from noise compared with others. The proposed approaches greatly improve the fitness of the mined process model without significantly decreasing its precision.

Download Full-text

Process Mining: Measuring Key Performance Indicator Container Dwell Time

Indonesian Journal of Electrical Engineering and Computer Science ◽

10.11591/ijeecs.v16.i1.pp401-411 ◽

2019 ◽

Vol 16 (1) ◽

pp. 401

Author(s):

Bambang Jokonowo ◽

Riyanarto Sarno ◽

Siti Rochimah ◽

Bagus Priambodo

Keyword(s):

Dwell Time ◽

Process Model ◽

Performance Indicator ◽

Process Mining ◽

Key Performance Indicator ◽

Life Cycle Model ◽

Cycle Model ◽

Knowledge Process ◽

Event Log ◽

Process Behavior

<span>The issues measures duration of stay the container logistic processes at ports in developing countries is often a major problem. Therefore, a knowledge process discovery, i.e., Heuristics Miner and Fuzzy Miner, can be used to discover the insight of process by creating a process model. The container import dwell time (DT) processes can be modeled based on the event log data sources are extracted from the terminal operating system (TOS). The <em>L</em>* life-cycle model is used to perform the process behavior analysis steps. The results of analysis and verification show that the container import DT processes have a median duration of 5.5 days and a mean duration of 6.07 days.</span>

Download Full-text

A task-level parallelism approach for process discovery

International Journal of Engineering & Technology ◽

10.14419/ijet.v7i4.14748 ◽

2018 ◽

Vol 7 (4) ◽

pp. 2446

Author(s):

Muktikanta Sahu ◽

Rupjit Chakraborty ◽

Gopal Krishna Nayak

Keyword(s):

Process Model ◽

Process Mining ◽

Programming Model ◽

Parallel Implementation ◽

Primary Objective ◽

Process Models ◽

Task Parallelism ◽

Process Discovery ◽

Event Logs ◽

Computationally Intensive

Building process models from the available data in the event logs is the primary objective of Process discovery. Alpha algorithm is one of the popular algorithms accessible for ascertaining a process model from the event logs in process mining. The steps involved in the Alpha algorithm are computationally rigorous and this problem further manifolds with the exponentially increasing event log data. In this work, we have exploited task parallelism in the Alpha algorithm for process discovery by using MPI programming model. The proposed work is based on distributed memory parallelism available in MPI programming for performance improvement. Independent and computationally intensive steps in the Alpha algorithm are identified and task parallelism is exploited. The execution time of serial as well as parallel implementation of Alpha algorithm are measured and used for calculating the extent of speedup achieved. The maximum and minimum speedups obtained are 3.97x and 3.88x respectively with an average speedup of 3.94x.

Download Full-text

Recovering Truncated Streaming Event Log Using Coupled Hidden Markov Model

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s0218001420590120 ◽

2019 ◽

Vol 34 (04) ◽

pp. 2059012

Author(s):

Riyanarto Sarno ◽

Kelly Rossa Sungkono

Keyword(s):

Information Systems ◽

Markov Model ◽

Hidden Markov Model ◽

Process Model ◽

Hidden Markov ◽

Transition Probability ◽

Process Discovery ◽

Model Based ◽

Event Logs ◽

Event Log

Process discovery is a technique for obtaining process model based on traces recorded in the event log. Nowadays, information systems produce streaming event logs to record their huge processes. The truncated streaming event log is a big issue in process discovery because it inflicts incomplete traces that make process discovery depict wrong processes in a process model. Earlier research suggested several methods for recovering the truncated streaming event log and none of them utilized Coupled Hidden Markov Model. This research proposes a method that combines Coupled Hidden Markov Model with Double States and the Modification of Viterbi–Backward method for recovering the truncated streaming event log. The first layer of states contains the transition probability of activities. The second layer of states uses patterns for detecting traces which have a low appearance in the event log. The experiment results showed that the proposed method recovered appropriately the truncated streaming event log. These results also have proven that the accuracies of recovered traces obtained by the proposed method are higher than those obtained by the Hidden Markov Model and the Coupled Hidden Markov Model.

Download Full-text