https://ijsea.com/archive/volume10/issue9/IJSEA10091005.pdf

Massive amounts of business process event logs are collected and stored by modern information systems. Model discovery aims to discover a process model from such event logs, however, most of the existing approaches still suffer from low efficiency when facing large-scale event logs. Event log sampling techniques provide an effective scheme to improve the efficiency of process discovery, but the existing techniques still cannot guarantee the quality of model mining. Therefore, a sampling approach based on set coverage algorithm named set coverage sampling approach is proposed. The proposed sampling approach has been implemented in the open-source process mining toolkit ProM. Furthermore, experiments using a real event log data set from conformance checking and time performance analysis show that the proposed event log sampling approach can greatly improve the efficiency of log sampling on the premise of ensuring the quality of model mining.

Download Full-text

Functional Integration with Process Mining and Process Analyzing for Structural and Behavioral Properness Validation of Processes Discovered from Event Log Datasets

Applied Sciences ◽

10.3390/app10041493 ◽

2020 ◽

Vol 10 (4) ◽

pp. 1493 ◽

Cited By ~ 1

Author(s):

Kwanghoon Pio Kim

Keyword(s):

Process Model ◽

Large Scale ◽

Process Mining ◽

Functional Integration ◽

Structural Complexity ◽

Integrated Approach ◽

Parallel Process ◽

Process Models ◽

Massively Parallel ◽

Event Log

In this paper, we propose an integrated approach for seamlessly and effectively providing the mining and the analyzing functionalities to redesigning work for very large-scale and massively parallel process models that are discovered from their enactment event logs. The integrated approach especially aims at analyzing not only their structural complexity and correctness but also their animation-based behavioral properness, and becomes concretized to a sophisticated analyzer. The core function of the analyzer is to discover a very large-scale and massively parallel process model from a process log dataset and to validate the structural complexity and the syntactical and behavioral properness of the discovered process model. Finally, this paper writes up the detailed description of the system architecture with its functional integration of process mining and process analyzing. More precisely, we excogitate a series of functional algorithms for extracting the structural constructs and for visualizing the behavioral properness of those discovered very large-scale and massively parallel process models. As experimental validation, we apply the proposed approach and analyzer to a couple of process enactment event log datasets available on the website of the 4TU.Centre for Research Data.

Download Full-text

Functional Integration with Process Mining and Process Analyzing for Structural and Behavioral Properness Validation of Discovered Processes from Event Log Datasets

10.20944/preprints202002.0122.v1 ◽

2020 ◽

Author(s):

Kwanghoon Kim

Keyword(s):

Process Model ◽

Large Scale ◽

Process Mining ◽

Functional Integration ◽

Structural Complexity ◽

Integrated Approach ◽

Parallel Process ◽

Process Models ◽

Massively Parallel ◽

Event Log

Process (or business process) management systems fulfill defining, executing, monitoring and managing process models deployed on process-aware enterprises. Accordingly, the functional formation of the systems is made up of three subsystems such as modeling subsystem, enacting subsystem and mining subsystem. In recent times, the mining subsystem has been becoming an essential subsystem. Many enterprises have successfully completed the introduction and application of the process automation technology through the modeling subsystem and the enacting subsystem. According as the time has come to the phase of redesigning and reengineering the deployed process models, from now on it is important for the mining subsystem to cooperate with the analyzing subsystem; the essential cooperation capability is to provide seamless integrations between the designing works with the modeling subsystem and the redesigning work with the mining subsystem. In other words, we need to seamlessly integrate the discovery functionality of the mining subsystem and the analyzing functionality of the modeling subsystem. This integrated approach might be suitable very well when those deployed process models discovered by the mining subsystem are complex and very large-scaled, in particular. In this paper, we propose an integrated approach for seamlessly as well as effectively providing the mining and the analyzing functionalities to the redesigning work on very large-scale and massively parallel process models that are discovered from their enactment event logs. The integrated approach especially aims at analyzing not only their structural complexity and correctness but also their animation-based behavioral properness, and becomes concretized to a sophisticated analyzer. The core function of the analyzer is to discover a very large-scale and massively parallel process model from a process log dataset and to validate the structural complexity and the syntactical and behavioral properness of the discovered process model. Finally, this paper writes up the detailed description of the system architecture with its functional integration of process mining and process analyzing. And more precisely, we excogitate a series of functional algorithms for extracting the structural constructs as well as for visualizing the behavioral properness on those discovered very large-scale and massively parallel process models. As experimental validation, we apply the proposed approach and analyzer to a couple of process enactment event log datasets available on the website of the 4TU.Centre for Research Data.

Download Full-text

Simplified Process Model Discovery Based on Role-Oriented Genetic Mining

The Scientific World JOURNAL ◽

10.1155/2014/298592 ◽

2014 ◽

Vol 2014 ◽

pp. 1-8

Author(s):

Weidong Zhao ◽

Xi Liu ◽

Weihui Dai

Keyword(s):

Process Model ◽

Process Mining ◽

Fitness Function ◽

Control Flow ◽

Process Models ◽

Programming Approach ◽

Event Logs ◽

Process Complexity

Process mining is automated acquisition of process models from event logs. Although many process mining techniques have been developed, most of them are based on control flow. Meanwhile, the existing role-oriented process mining methods focus on correctness and integrity of roles while ignoring role complexity of the process model, which directly impacts understandability and quality of the model. To address these problems, we propose a genetic programming approach to mine the simplified process model. Using a new metric of process complexity in terms of roles as the fitness function, we can find simpler process models. The new role complexity metric of process models is designed from role cohesion and coupling, and applied to discover roles in process models. Moreover, the higher fitness derived from role complexity metric also provides a guideline for redesigning process models. Finally, we conduct case study and experiments to show that the proposed method is more effective for streamlining the process by comparing with related studies.

Download Full-text

Case notion discovery and recommendation: automated event log building on databases

Knowledge and Information Systems ◽

10.1007/s10115-019-01430-6 ◽

2019 ◽

Vol 62 (7) ◽

pp. 2539-2575

Author(s):

E. González López de Murillas ◽

H. A. Reijers ◽

W. M. P. van der Aalst

Keyword(s):

Process Mining ◽

Learning To Rank ◽

Computational Cost ◽

Manual Task ◽

Event Logs ◽

Event Log ◽

Regression Techniques ◽

Process Views ◽

Interesting Event

AbstractProcess mining techniques use event logs as input. When analyzing complex databases, these event logs can be built in many ways. Events need to be grouped into traces corresponding to a case. Different groupings provide different views on the data. Building event logs is usually a time-consuming, manual task. This paper provides a precise view on the case notion on databases, which enables the automatic computation of event logs. Also, it provides a way to assess event log quality, used to rank event logs with respect to their interestingness. The computational cost of building an event log can be avoided by predicting the interestingness of a case notion, before the corresponding event log is computed. This makes it possible to give recommendations to users, so they can focus on the analysis of the most promising process views. Finally, the accuracy of the predictions and the quality of the rankings generated by our unsupervised technique are evaluated in comparison to the existing regression techniques as well as to state-of-the-art learning to rank algorithms from the information retrieval field. The results show that our prediction technique succeeds at discovering interesting event logs and provides valuable recommendations to users about the perspectives on which to focus the efforts during the analysis.

Download Full-text

Event Log Preprocessing for Process Mining: A Review

Applied Sciences ◽

10.3390/app112210556 ◽

2021 ◽

Vol 11 (22) ◽

pp. 10556

Author(s):

Heidy M. Marin-Castro ◽

Edgar Tello-Leal

Keyword(s):

Business Process ◽

Process Model ◽

Process Mining ◽

Data Cleaning ◽

Data Preprocessing ◽

Process Models ◽

Event Data ◽

Event Logs ◽

Event Log ◽

Preprocessing Technique

Process Mining allows organizations to obtain actual business process models from event logs (discovery), to compare the event log or the resulting process model in the discovery task with the existing reference model of the same process (conformance), and to detect issues in the executed process to improve (enhancement). An essential element in the three tasks of process mining (discovery, conformance, and enhancement) is data cleaning, used to reduce the complexity inherent to real-world event data, to be easily interpreted, manipulated, and processed in process mining tasks. Thus, new techniques and algorithms for event data preprocessing have been of interest in the research community in business process. In this paper, we conduct a systematic literature review and provide, for the first time, a survey of relevant approaches of event data preprocessing for business process mining tasks. The aim of this work is to construct a categorization of techniques or methods related to event data preprocessing and to identify relevant challenges around these techniques. We present a quantitative and qualitative analysis of the most popular techniques for event log preprocessing. We also study and present findings about how a preprocessing technique can improve a process mining task. We also discuss the emerging future challenges in the domain of data preprocessing, in the context of process mining. The results of this study reveal that the preprocessing techniques in process mining have demonstrated a high impact on the performance of the process mining tasks. The data cleaning requirements are dependent on the characteristics of the event logs (voluminous, a high variability in the set of traces size, changes in the duration of the activities. In this scenario, most of the surveyed works use more than a single preprocessing technique to improve the quality of the event log. Trace-clustering and trace/event level filtering resulted in being the most commonly used preprocessing techniques due to easy of implementation, and they adequately manage noise and incompleteness in the event logs.

Download Full-text

Practical Aspects of Event Logs Creation for Industrial Process Modelling

Multidisciplinary Aspects of Production Engineering ◽

10.2478/mape-2018-0011 ◽

2018 ◽

Vol 1 (1) ◽

pp. 77-83 ◽

Cited By ~ 1

Author(s):

Agnieszka Trzcionkowska ◽

Edyta Brzychczy

Keyword(s):

Process Model ◽

Process Mining ◽

Underground Mining ◽

Industrial Process ◽

Sensor Data ◽

Mining Operations ◽

Event Logs ◽

Event Log ◽

Clustering And Classification ◽

Exploratory Data

Abstract In the paper we address the challenge of applying process mining techniques for discovering models of underground mining operations based on a sensor data. The paper presents practical approach of creation an event log based on industrial sensors data gathered in an underground mine monitoring systems. The proposed approach enables to generate event logs at different generalization levels based on several numbers of discovered stages of devices performance. For discovering process stages data mining techniques such as exploratory data analysis, clustering and classification have been applied. Created event log has been used in one of the process mining tasks - process model discovery.

Download Full-text

Prefix Imputation of Orphan Events in Event Stream Processing

Frontiers in Big Data ◽

10.3389/fdata.2021.705243 ◽

2021 ◽

Vol 4 ◽

Author(s):

Rashid Zaman ◽

Marwan Hassani ◽

Boudewijn F. Van Dongen

Keyword(s):

Process Model ◽

Process Mining ◽

State Of The Art ◽

Processing System ◽

Conformance Checking ◽

Event Stream ◽

Window Width ◽

Event Logs ◽

Relevant Case ◽

Log File

In the context of process mining, event logs consist of process instances called cases. Conformance checking is a process mining task that inspects whether a log file is conformant with an existing process model. This inspection is additionally quantifying the conformance in an explainable manner. Online conformance checking processes streaming event logs by having precise insights into the running cases and timely mitigating non-conformance, if any. State-of-the-art online conformance checking approaches bound the memory by either delimiting storage of the events per case or limiting the number of cases to a specific window width. The former technique still requires unbounded memory as the number of cases to store is unlimited, while the latter technique forgets running, not yet concluded, cases to conform to the limited window width. Consequently, the processing system may later encounter events that represent some intermediate activity as per the process model and for which the relevant case has been forgotten, to be referred to as orphan events. The naïve approach to cope with an orphan event is to either neglect its relevant case for conformance checking or treat it as an altogether new case. However, this might result in misleading process insights, for instance, overestimated non-conformance. In order to bound memory yet effectively incorporate the orphan events into processing, we propose an imputation of missing-prefix approach for such orphan events. Our approach utilizes the existing process model for imputing the missing prefix. Furthermore, we leverage the case storage management to increase the accuracy of the prefix prediction. We propose a systematic forgetting mechanism that distinguishes and forgets the cases that can be reliably regenerated as prefix upon receipt of their future orphan event. We evaluate the efficacy of our proposed approach through multiple experiments with synthetic and three real event logs while simulating a streaming setting. Our approach achieves considerably higher realistic conformance statistics than the state of the art while requiring the same storage.

Download Full-text

Towards Aspects Identification in Business Process Through Process Mining

10.5753/sbsi.2015.5883 ◽

2015 ◽

Cited By ~ 1

Author(s):

Bruna Brandão ◽

Flávia Santoro ◽

Leonardo Azevedo

Keyword(s):

Programming Languages ◽

Business Process ◽

Process Model ◽

Process Mining ◽

Process Models ◽

Preliminary Evaluation ◽

Crosscutting Concerns ◽

Business Process Models ◽

Event Logs ◽

Initial Results

In business process models, elements can be scattered (repeated) within different processes, making it difficult to handle changes, analyze process for improvements, or check crosscutting impacts. These scattered elements are named as Aspects. Similar to the aspect-oriented paradigm in programming languages, in BPM, aspect handling has the goal to modularize the crosscutting concerns spread across the models. This process modularization facilitates the management of the process (reuse, maintenance and understanding). The current approaches for aspect identification are made manually; thus, resulting in the problem of subjectivity and lack of systematization. This paper proposes a method to automatically identify aspects in business process from its event logs. The method is based on mining techniques and it aims to solve the problem of the subjectivity identification made by specialists. The initial results from a preliminary evaluation showed evidences that the method identified correctly the aspects present in the process model.

Download Full-text

Multi-Robot SLAM in Dynamic Environments with Parallel Maps

International Journal of Humanoid Robotics ◽

10.1142/s0219843621500110 ◽

2021 ◽

pp. 2150011

Author(s):

Sajad Badalkhani ◽

Ramazan Havangi ◽

Mohsen Farshad

Keyword(s):

Large Scale ◽

Dynamic Environment ◽

Dynamic Environments ◽

Extensive Literature ◽

Real World Data ◽

Data Set ◽

Cooperative Approach ◽

Localization And Mapping ◽

Multi Robot

There is an extensive literature regarding multi-robot simultaneous localization and mapping (MRSLAM). In most part of the research, the environment is assumed to be static, while the dynamic parts of the environment degrade the estimation quality of SLAM algorithms and lead to inherently fragile systems. To enhance the performance and robustness of the SLAM in dynamic environments (SLAMIDE), a novel cooperative approach named parallel-map (p-map) SLAM is introduced in this paper. The objective of the proposed method is to deal with the dynamics of the environment, by detecting dynamic parts and preventing the inclusion of them in SLAM estimations. In this approach, each robot builds a limited map in its own vicinity, while the global map is built through a hybrid centralized MRSLAM. The restricted size of the local maps, bounds computational complexity and resources needed to handle a large scale dynamic environment. Using a probabilistic index, the proposed method differentiates between stationary and moving landmarks, based on their relative positions with other parts of the environment. Stationary landmarks are then used to refine a consistent map. The proposed method is evaluated with different levels of dynamism and for each level, the performance is measured in terms of accuracy, robustness, and hardware resources needed to be implemented. The method is also evaluated with a publicly available real-world data-set. Experimental validation along with simulations indicate that the proposed method is able to perform consistent SLAM in a dynamic environment, suggesting its feasibility for MRSLAM applications.

Download Full-text

Mining E-Mail Messages

Methodological Advancements in Intelligent Information Technologies - Advances in Intelligent Information Technologies ◽

10.4018/978-1-60566-970-0.ch011 ◽

2010 ◽

pp. 212-234

Author(s):

Wil M.P. van der Aalst ◽

Andriy Nikolov

Keyword(s):

Process Mining ◽

Workflow Management ◽

Social Structures ◽

Interaction Patterns ◽

Work Processes ◽

Event Logs ◽

Event Log ◽

Patterns And Processes ◽

E Mail ◽

Mining Tools

Increasingly information systems log historic information in a systematic way. Workflow management systems, but also ERP, CRM, SCM, and B2B systems often provide a so-called “event log’’, i.e., a log recording the execution of activities. Thus far, process mining has been mainly focusing on structured event logs resulting in powerful analysis techniques and tools for discovering process, control, data, organizational, and social structures from event logs. Unfortunately, many work processes are not supported by systems providing structured logs. Instead very basic tools such as text editors, spreadsheets, and e-mail are used. This paper explores the application of process mining to e-mail, i.e., unstructured or semi-structured e-mail messages are converted into event logs suitable for application of process mining tools. This paper presents the tool EMailAnalyzer, embedded in the ProM process mining framework, which analyzes and transforms e-mail messages to a format that allows for analysis using our process mining techniques. The main innovative aspect of this work is that, unlike most other work in this area, our analysis is not restricted to social network analysis. Based on e-mail logs we can also discover interaction patterns and processes.

Download Full-text