SEQUENTIAL PATTERN MINING AND NONMONOTONIC REASONING FOR INTELLIGENT INFORMATION AGENTS

Sequential pattern mining methods have been found to be applicable in a large number of domains. Sequential data is omnipresent. Sequential pattern mining methods have been used to analyze this data and identify patterns. Such patterns have been used to implement efficient systems that can recommend based on previously observed patterns, help in making predictions, improve usability of systems, detect events, and in general help in making strategic product decisions. In this chapter, we discuss the applications of sequential data mining in a variety of domains like healthcare, education, Web usage mining, text mining, bioinformatics, telecommunications, intrusion detection, et cetera. We conclude with a summary of the work.

Download Full-text

Approaches for Pattern Discovery Using Sequential Data Mining

Data Mining ◽

10.4018/978-1-4666-2455-9.ch095 ◽

2013 ◽

pp. 1835-1851

Author(s):

Manish Gupta ◽

Jiawei Han

Keyword(s):

Data Mining ◽

Pattern Mining ◽

Sequence Data ◽

Sequential Pattern Mining ◽

Sequential Pattern ◽

Sequential Data ◽

Stream Data ◽

Dual Representation ◽

Advantages And Disadvantages ◽

Growth Methods

In this chapter we first introduce sequence data. We then discuss different approaches for mining of patterns from sequence data, studied in literature. Apriori based methods and the pattern growth methods are the earliest and the most influential methods for sequential pattern mining. There is also a vertical format based method which works on a dual representation of the sequence database. Work has also been done for mining patterns with constraints, mining closed patterns, mining patterns from multi-dimensional databases, mining closed repetitive gapped subsequences, and other forms of sequential pattern mining. Some works also focus on mining incremental patterns and mining from stream data. We present at least one method of each of these types and discuss their advantages and disadvantages. We conclude with a summary of the work.

Download Full-text

Approaches for Pattern Discovery Using Sequential Data Mining

Pattern Discovery Using Sequence Data Mining ◽

10.4018/978-1-61350-056-9.ch008 ◽

2012 ◽

pp. 137-154 ◽

Cited By ~ 4

Author(s):

Manish Gupta ◽

Jiawei Han

Keyword(s):

Data Mining ◽

Pattern Mining ◽

Sequence Data ◽

Sequential Pattern Mining ◽

Sequential Pattern ◽

Sequential Data ◽

Stream Data ◽

Dual Representation ◽

Advantages And Disadvantages ◽

Growth Methods

In this chapter we first introduce sequence data. We then discuss different approaches for mining of patterns from sequence data, studied in literature. Apriori based methods and the pattern growth methods are the earliest and the most influential methods for sequential pattern mining. There is also a vertical format based method which works on a dual representation of the sequence database. Work has also been done for mining patterns with constraints, mining closed patterns, mining patterns from multi-dimensional databases, mining closed repetitive gapped subsequences, and other forms of sequential pattern mining. Some works also focus on mining incremental patterns and mining from stream data. We present at least one method of each of these types and discuss their advantages and disadvantages. We conclude with a summary of the work.

Download Full-text

Applications of Pattern Discovery Using Sequential Data Mining

Data Mining ◽

10.4018/978-1-4666-2455-9.ch048 ◽

2013 ◽

pp. 947-969

Author(s):

Manish Gupta ◽

Jiawei Han

Keyword(s):

Data Mining ◽

Text Mining ◽

Intrusion Detection ◽

Pattern Mining ◽

Pattern Discovery ◽

Sequential Pattern Mining ◽

Web Usage Mining ◽

Sequential Pattern ◽

Sequential Data ◽

Mining Methods

Sequential pattern mining methods have been found to be applicable in a large number of domains. Sequential data is omnipresent. Sequential pattern mining methods have been used to analyze this data and identify patterns. Such patterns have been used to implement efficient systems that can recommend based on previously observed patterns, help in making predictions, improve usability of systems, detect events, and in general help in making strategic product decisions. In this chapter, we discuss the applications of sequential data mining in a variety of domains like healthcare, education, Web usage mining, text mining, bioinformatics, telecommunications, intrusion detection, et cetera. We conclude with a summary of the work.

Download Full-text

Sequential pattern mining and belief revision for adaptive information retrieval

Proceedings of the 2005 International Conference on Active Media Technology, 2005. (AMT 2005). ◽

10.1109/amt.2005.1505427 ◽

2005 ◽

Author(s):

R.Y.K. Lau ◽

Yuefeng Li

Keyword(s):

Information Retrieval ◽

Belief Revision ◽

Pattern Mining ◽

Sequential Pattern Mining ◽

Sequential Pattern ◽

Adaptive Information Retrieval

Download Full-text

A study on sequential pattern mining on chemical information

International Journal of Engineering & Technology ◽

10.14419/ijet.v7i2.33.14828 ◽

2018 ◽

Vol 7 (3.3) ◽

pp. 532

Author(s):

S Sathya ◽

N Rajendran

Keyword(s):

Data Mining ◽

Chemical Bonding ◽

Sequential Analysis ◽

Pattern Mining ◽

Fundamental Problem ◽

Research Work ◽

Sequential Pattern Mining ◽

Sequential Pattern ◽

Graph Representation ◽

Chemical Information

Data mining (DM) is used for extracting the useful and non-trivial information from the large amount of data to collect in many and diverse fields. Data mining determines explanation through clustering visualization, association and sequential analysis. Chemical compounds are well-defined structures compressed by a graph representation. Chemical bonding is the association of atoms into molecules, ions, crystals and other stable species which frame the common substances in chemical information. However, large-scale sequential data is a fundamental problem like higher classification time and bonding time in data mining with many applications. In this work, chemical structured index bonding is used for sequential pattern mining. Our research work helps to evaluate the structural patterns of chemical bonding in chemical information data sets.

Download Full-text

Multi-Relational Sequential Pattern Mining Based on Iceberg Concept Lattice

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.109.729 ◽

2011 ◽

Vol 109 ◽

pp. 729-733

Author(s):

Jiang Yin ◽

Yun Li ◽

Cen Cheng Shen ◽

Bo Liu

Keyword(s):

Data Mining ◽

Time Complexity ◽

Pattern Mining ◽

Concept Lattice ◽

Optimization Methods ◽

Sequential Pattern Mining ◽

Experimental Results ◽

Sequential Pattern ◽

Sequential Mining ◽

Mining Methods

Multi-Relational Sequential mining is one of the areas of data mining that rapidly developed in recent years. However, the performance issues of traditional mining methods are not ideal. To effectively mining the pattern, we proposed an algorithm based on Iceberg concept lattice, adopting optimization methods of partition and merger to just mining the frequent sequences. Experimental results show this algorithm effectively reduced the time complexity of multi-relational sequential pattern mining.

Download Full-text

Mining of Sequential Patterns using Directed Graphs

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.k2242.0981119 ◽

2019 ◽

Vol 8 (11) ◽

pp. 4002-4007

Keyword(s):

Pattern Mining ◽

Directed Graphs ◽

Real Life ◽

Sequential Pattern Mining ◽

Sequential Pattern ◽

Sequential Patterns ◽

Sequential Data ◽

Sequence Database ◽

Directed Paths ◽

Digraph Model

Sequential pattern mining is one of the important functionalities of data mining. It is used for analyzing sequential database and discovers sequential patterns. It is focused for extracting interesting subsequences from a set of sequences. Various factors such as rate of occurrence, length, and profit are used to define the interestingness of subsequence derived from the sequence database. Sequential pattern mining has abundant real-life applications since sequential data is logically programmed as sequences of cipher in many fields such as bioinformatics, e-learning, market basket analysis, texts, and webpage click-stream analysis. A large diversity of competent algorithms such as Prefixspan, GSP and Freespan have been proposed during the past few years. In this paper we propose a data model for organizing the sequential database, which consists of a directed graph DGS (cycles and several edges are allowed) and an organization of directed paths in DGS to represent a sequential data for discovering sequential pattern3 from a sequence database. Competent algorithms for constructing the digraph model (DGS) for extracting all sequential patterns and mining association rules are proposed. A number of theoretical parameters of digraph model are also introduced, which lead to more understanding of the problem.

Download Full-text

Knowledge Discovery from Healthcare Electronic Records for Sustainable Environment

Sustainability ◽

10.3390/su13168900 ◽

2021 ◽

Vol 13 (16) ◽

pp. 8900

Author(s):

Naeem Ahmed Mahoto ◽

Asadullah Shaikh ◽

Mana Saleh Al Reshan ◽

Muhammad Ali Memon ◽

Adel Sulaiman

Keyword(s):

Data Mining ◽

Knowledge Discovery ◽

Association Analysis ◽

Pattern Mining ◽

Large Data ◽

Sequential Pattern Mining ◽

Sequential Pattern ◽

Electronic Records ◽

Data Mining Techniques ◽

Healthcare Data

The medical history of a patient is an essential piece of information in healthcare agencies, which keep records of patients. Due to the fact that each person may have different medical complications, healthcare data remain sparse, high-dimensional and possibly inconsistent. The knowledge discovery from such data is not easily manageable for patient behaviors. It becomes a challenge for both physicians and healthcare agencies to discover knowledge from many healthcare electronic records. Data mining, as evidenced from the existing published literature, has proven its effectiveness in transforming large data collections into meaningful information and knowledge. This paper proposes an overview of the data mining techniques used for knowledge discovery in medical records. Furthermore, based on real healthcare data, this paper also demonstrates a case study of discovering knowledge with the help of three data mining techniques: (1) association analysis; (2) sequential pattern mining; (3) clustering. Particularly, association analysis is used to extract frequent correlations among examinations done by patients with a specific disease, sequential pattern mining allows extracting frequent patterns of medical events and clustering is used to find groups of similar patients. The discovered knowledge may enrich healthcare guidelines, improve their processes and detect anomalous patients’ behavior with respect to the medical guidelines.

Download Full-text

HIGH UTILITY ITEM INTERVAL SEQUENTIAL PATTERN MINING ALGORITHM

Journal of Computer Science and Cybernetics ◽

10.15625/1813-9663/1/1/14398 ◽

2020 ◽

Vol 36 (1) ◽

pp. 1-15

Author(s):

Tran Huy Duong ◽

Nguyen Truong Thang ◽

Vu Duc Thi ◽

Tran The Anh

Keyword(s):

Data Mining ◽

Pattern Mining ◽

Sequential Pattern Mining ◽

Sequential Pattern ◽

Sequential Patterns ◽

Sequence Database ◽

Mining Algorithm ◽

Pattern Growth ◽

High Utility ◽

Growth Approach

High utility sequential pattern mining is a popular topic in data mining with the main purpose is to extract sequential patterns with high utility in the sequence database. Many recent works have proposed methods to solve this problem. However, most of them does not consider item intervals of sequential patterns which can lead to the extraction of sequential patterns with too long item interval, thus making little sense. In this paper, we propose a High Utility Item Interval Sequential Pattern (HUISP) algorithm to solve this problem. Our algorithm uses pattern growth approach and some techniques to increase algorithm's performance.

Download Full-text