Multi-Relational Sequential Pattern Mining Based on Iceberg Concept Lattice

AbstractDiscovering sequential patterns is a rather well-studied area in data mining and has been found many diverse applications, such as basket analysis, telecommunications, etc. In this article, we propose an efficient algorithm that incorporates constraints and promotion-based marketing scenarios for the mining of valuable sequential patterns. Incorporating specific constraints into the sequential mining process has enabled the discovery of more user-centered patterns. We move one step ahead and integrate three significant marketing scenarios for mining promotion-oriented sequential patterns. The promotion-based market scenarios considered in the proposed research are 1) product Downturn, 2) product Revision and 3) product Launch (DRL). Each of these scenarios is characterized by distinct item and adjacency constraints. We have developed a novel DRL-PrefixSpan algorithm (tailored form of the PrefixSpan) for mining all length DRL patterns. The proposed algorithm has been validated on synthetic sequential databases. The experimental results demonstrate the effectiveness of incorporating the promotion-based marketing scenarios in the sequential pattern mining process.

Download Full-text

Applications of Pattern Discovery Using Sequential Data Mining

Pattern Discovery Using Sequence Data Mining ◽

10.4018/978-1-61350-056-9.ch001 ◽

2012 ◽

pp. 1-23 ◽

Cited By ~ 8

Author(s):

Manish Gupta ◽

Jiawei Han

Keyword(s):

Data Mining ◽

Text Mining ◽

Intrusion Detection ◽

Pattern Mining ◽

Pattern Discovery ◽

Sequential Pattern Mining ◽

Web Usage Mining ◽

Sequential Pattern ◽

Sequential Data ◽

Mining Methods

Sequential pattern mining methods have been found to be applicable in a large number of domains. Sequential data is omnipresent. Sequential pattern mining methods have been used to analyze this data and identify patterns. Such patterns have been used to implement efficient systems that can recommend based on previously observed patterns, help in making predictions, improve usability of systems, detect events, and in general help in making strategic product decisions. In this chapter, we discuss the applications of sequential data mining in a variety of domains like healthcare, education, Web usage mining, text mining, bioinformatics, telecommunications, intrusion detection, et cetera. We conclude with a summary of the work.

Download Full-text

Applications of Pattern Discovery Using Sequential Data Mining

Data Mining ◽

10.4018/978-1-4666-2455-9.ch048 ◽

2013 ◽

pp. 947-969

Author(s):

Manish Gupta ◽

Jiawei Han

Keyword(s):

Data Mining ◽

Text Mining ◽

Intrusion Detection ◽

Pattern Mining ◽

Pattern Discovery ◽

Sequential Pattern Mining ◽

Web Usage Mining ◽

Sequential Pattern ◽

Sequential Data ◽

Mining Methods

Sequential pattern mining methods have been found to be applicable in a large number of domains. Sequential data is omnipresent. Sequential pattern mining methods have been used to analyze this data and identify patterns. Such patterns have been used to implement efficient systems that can recommend based on previously observed patterns, help in making predictions, improve usability of systems, detect events, and in general help in making strategic product decisions. In this chapter, we discuss the applications of sequential data mining in a variety of domains like healthcare, education, Web usage mining, text mining, bioinformatics, telecommunications, intrusion detection, et cetera. We conclude with a summary of the work.

Download Full-text

A study on sequential pattern mining on chemical information

International Journal of Engineering & Technology ◽

10.14419/ijet.v7i2.33.14828 ◽

2018 ◽

Vol 7 (3.3) ◽

pp. 532

Author(s):

S Sathya ◽

N Rajendran

Keyword(s):

Data Mining ◽

Chemical Bonding ◽

Sequential Analysis ◽

Pattern Mining ◽

Fundamental Problem ◽

Research Work ◽

Sequential Pattern Mining ◽

Sequential Pattern ◽

Graph Representation ◽

Chemical Information

Data mining (DM) is used for extracting the useful and non-trivial information from the large amount of data to collect in many and diverse fields. Data mining determines explanation through clustering visualization, association and sequential analysis. Chemical compounds are well-defined structures compressed by a graph representation. Chemical bonding is the association of atoms into molecules, ions, crystals and other stable species which frame the common substances in chemical information. However, large-scale sequential data is a fundamental problem like higher classification time and bonding time in data mining with many applications. In this work, chemical structured index bonding is used for sequential pattern mining. Our research work helps to evaluate the structural patterns of chemical bonding in chemical information data sets.

Download Full-text

Knowledge Discovery from Healthcare Electronic Records for Sustainable Environment

Sustainability ◽

10.3390/su13168900 ◽

2021 ◽

Vol 13 (16) ◽

pp. 8900

Author(s):

Naeem Ahmed Mahoto ◽

Asadullah Shaikh ◽

Mana Saleh Al Reshan ◽

Muhammad Ali Memon ◽

Adel Sulaiman

Keyword(s):

Data Mining ◽

Knowledge Discovery ◽

Association Analysis ◽

Pattern Mining ◽

Large Data ◽

Sequential Pattern Mining ◽

Sequential Pattern ◽

Electronic Records ◽

Data Mining Techniques ◽

Healthcare Data

The medical history of a patient is an essential piece of information in healthcare agencies, which keep records of patients. Due to the fact that each person may have different medical complications, healthcare data remain sparse, high-dimensional and possibly inconsistent. The knowledge discovery from such data is not easily manageable for patient behaviors. It becomes a challenge for both physicians and healthcare agencies to discover knowledge from many healthcare electronic records. Data mining, as evidenced from the existing published literature, has proven its effectiveness in transforming large data collections into meaningful information and knowledge. This paper proposes an overview of the data mining techniques used for knowledge discovery in medical records. Furthermore, based on real healthcare data, this paper also demonstrates a case study of discovering knowledge with the help of three data mining techniques: (1) association analysis; (2) sequential pattern mining; (3) clustering. Particularly, association analysis is used to extract frequent correlations among examinations done by patients with a specific disease, sequential pattern mining allows extracting frequent patterns of medical events and clustering is used to find groups of similar patients. The discovered knowledge may enrich healthcare guidelines, improve their processes and detect anomalous patients’ behavior with respect to the medical guidelines.

Download Full-text

Approaches for Pattern Discovery Using Sequential Data Mining

Data Mining ◽

10.4018/978-1-4666-2455-9.ch095 ◽

2013 ◽

pp. 1835-1851

Author(s):

Manish Gupta ◽

Jiawei Han

Keyword(s):

Data Mining ◽

Pattern Mining ◽

Sequence Data ◽

Sequential Pattern Mining ◽

Sequential Pattern ◽

Sequential Data ◽

Stream Data ◽

Dual Representation ◽

Advantages And Disadvantages ◽

Growth Methods

In this chapter we first introduce sequence data. We then discuss different approaches for mining of patterns from sequence data, studied in literature. Apriori based methods and the pattern growth methods are the earliest and the most influential methods for sequential pattern mining. There is also a vertical format based method which works on a dual representation of the sequence database. Work has also been done for mining patterns with constraints, mining closed patterns, mining patterns from multi-dimensional databases, mining closed repetitive gapped subsequences, and other forms of sequential pattern mining. Some works also focus on mining incremental patterns and mining from stream data. We present at least one method of each of these types and discuss their advantages and disadvantages. We conclude with a summary of the work.

Download Full-text

The Sequential Pattern Mining Algorithm MHSP Based on MH

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.63-64.425 ◽

2011 ◽

Vol 63-64 ◽

pp. 425-430

Author(s):

Jun Wang ◽

Ya Qiong Jiang

Keyword(s):

Pattern Mining ◽

Sequential Pattern Mining ◽

Experimental Results ◽

Sequential Pattern ◽

Sequential Patterns ◽

Important Method ◽

The Real ◽

Mining Algorithm ◽

Large Projection ◽

Growth Approach

Pattern growth approach is an important method in sequential pattern mining. Projection database based on the method is introduced in PrefixSpan, and the PrefixSpan algorithm can solve the problem of mining sequential patterns. But relative to large projection database, the performance of PrefixSpan is affected. Inspired by the prefix-divide method and MH structure, this paper proposed a new algorithm MHSP for sequential pattern mining. Based on the real datasets, experimental results show that the performance of MHSP algorithm is more than twice as fast as PrefixSpan.

Download Full-text

Experimental results on a constraint based sequential pattern mining for telecommunication alarm data

Proceedings of the Second International Conference on Web Information Systems Engineering WISE-01 ◽

10.1109/wise.2001.996754 ◽

2002 ◽

Cited By ~ 2

Author(s):

Jain-Zhi Ouh ◽

Pei-Hsin Wu ◽

Ming-Syan Chen

Keyword(s):

Pattern Mining ◽

Sequential Pattern Mining ◽

Experimental Results ◽

Sequential Pattern

Download Full-text

Benchmarking the effectiveness of sequential pattern mining methods

Data & Knowledge Engineering ◽

10.1016/j.datak.2006.01.004 ◽

2007 ◽

Vol 60 (1) ◽

pp. 30-50 ◽

Cited By ~ 17

Author(s):

Hye-Chung Kum ◽

Joong Hyuk Chang ◽

Wei Wang

Keyword(s):

Pattern Mining ◽

Sequential Pattern Mining ◽

Sequential Pattern ◽

Mining Methods

Download Full-text

HIGH UTILITY ITEM INTERVAL SEQUENTIAL PATTERN MINING ALGORITHM

Journal of Computer Science and Cybernetics ◽

10.15625/1813-9663/1/1/14398 ◽

2020 ◽

Vol 36 (1) ◽

pp. 1-15

Author(s):

Tran Huy Duong ◽

Nguyen Truong Thang ◽

Vu Duc Thi ◽

Tran The Anh

Keyword(s):

Data Mining ◽

Pattern Mining ◽

Sequential Pattern Mining ◽

Sequential Pattern ◽

Sequential Patterns ◽

Sequence Database ◽

Mining Algorithm ◽

Pattern Growth ◽

High Utility ◽

Growth Approach

High utility sequential pattern mining is a popular topic in data mining with the main purpose is to extract sequential patterns with high utility in the sequence database. Many recent works have proposed methods to solve this problem. However, most of them does not consider item intervals of sequential patterns which can lead to the extraction of sequential patterns with too long item interval, thus making little sense. In this paper, we propose a High Utility Item Interval Sequential Pattern (HUISP) algorithm to solve this problem. Our algorithm uses pattern growth approach and some techniques to increase algorithm's performance.

Download Full-text