Maintaining the discovered sequential patterns for sequence insertion in dynamic databases

Mining useful information or knowledge from a very large database to aid managers or decision makers to make appropriate decisions is a critical issue in recent years. Sequential patterns can be used to discover the purchased behaviors of customers or the usage behaviors of users from Web log data. Most approaches process a static database to discover sequential patterns in a batch way. In real-world applications, transactions or sequences in databases are frequently changed. In the past, a fast updated sequential pattern (FUSP)-tree was proposed to handle dynamic databases whether for sequence insertion, deletion or modification based on FUP concepts. Original database is required to be re-scanned if it is necessary to maintain the small sequences which was not kept in the FUSP tree. In this paper, the prelarge concept was adopted to maintain and update the built prelarge FUSP tree for sequence modification. A prelarge FUSP tree is modified from FUSP tree for preserving not only the frequent 1-sequences but also the prelarge 1-sequences in the tree structure. The PRELARGE-FUSP-TREE-MOD maintenance algorithm is proposed to reduce the rescans of the original database due to the pruning properties of prelarge concept. When the number of modified sequences is smaller than the safety bound of the prelarge concept, better results can be obtained by the proposed PRELARGE-FUSP-TREE-MOD maintenance algorithm for sequence modification in dynamic databases.

Download Full-text

An Efficient Approach for Mining Weighted Sequential Patterns in Dynamic Databases

Advances in Data Mining. Applications and Theoretical Aspects - Lecture Notes in Computer Science ◽

10.1007/978-3-319-95786-9_16 ◽

2018 ◽

pp. 215-229

Author(s):

Sabrina Zaman Ishita ◽

Faria Noor ◽

Chowdhury Farhan Ahmed

Keyword(s):

Sequential Patterns ◽

Efficient Approach ◽

Dynamic Databases

Download Full-text

Efficiently Updating the Discovered Sequential Patterns for Sequence Modification

International Journal of Software Engineering and Knowledge Engineering ◽

10.1142/s0218194016500455 ◽

2016 ◽

Vol 26 (08) ◽

pp. 1285-1313 ◽

Cited By ~ 1

Author(s):

Jerry Chun-Wei Lin ◽

Wensheng Gan ◽

Philippe Fournier-Viger ◽

Tzung-Pei Hong

Keyword(s):

Pattern Mining ◽

State Of The Art ◽

Poor Performance ◽

Sequential Patterns ◽

Batch Mode ◽

Dynamic Databases ◽

The Cost ◽

Mining Algorithms ◽

Sequence Modification ◽

Over Time

Mining sequential patterns (SPs) is a popular data mining task, which consists in finding interesting, unexpected, and useful patterns in sequence databases. It has several applications in many domains. However, most sequential pattern mining algorithms assume that databases are static, i.e. that they do not change over time. But in real-word applications, sequences are often modified. Thus, it is an important issue to design algorithms for updating SPs in a dynamic database environment. Although some algorithms have been proposed to maintain SPs in dynamic databases, these algorithms may have poor performance, especially when databases contain long sequences or a large number of sequences. This paper addresses this issue by proposing a novel dynamic mining approach named PreFUSP-TREE-MOD to address the problem of maintaining and updating discovered SPs when sequences in a database are modified. The proposed approach adopts the previously proposed pre-large concept using two support thresholds, to avoid scanning the database when possible, for updating the set of discovered patterns. Due to the pruning properties of the pre-large concept, the PreFUSP-TREE-MOD maintenance algorithm can effectively reduce the cost of database scans to maintain and update the built FUSP-tree for sequence modification. When the number of modified sequences is less than the safety bound of the pre-large concept, the proposed maintenance algorithm outperforms traditional SPM algorithms in batch mode, and the state-of-the-art maintenance algorithm in terms of execution time and number of tree nodes.

Download Full-text

Mining Closed Sequential Patterns in Progressive Databases

Journal of Information & Knowledge Management ◽

10.1142/s021964921350024x ◽

2013 ◽

Vol 12 (03) ◽

pp. 1350024

Author(s):

R. B. V. Subramanyam ◽

A. Suresh Rao ◽

Ramesh Karnati ◽

Somaraju Suvvari ◽

D. V. L. N. Somayajulu

Keyword(s):

Pattern Mining ◽

Synthetic Data ◽

Window Size ◽

Search Space ◽

Sequential Patterns ◽

Data Sets ◽

Time Stamp ◽

Algorithmic Approach ◽

Dynamic Databases ◽

Search Space Pruning

Previous studies of Mining Closed Sequential Patterns suggested several heuristics and proposed some computationally effective techniques. Like, Bidirectional Extension with closure checking schemas, Back scan search space pruning, and scan skip optimization used in BIDE (BI-Directional Extension) algorithm. Many researchers were inspired with the efficiency of BIDE, have tried to apply the technique implied by BIDE to various kinds of databases; we toofelt that it can be applied over progressive databases. Without tailoring BIDE, it cannot be applied to dynamic databases. The concept of progressive databases explores the nature of incremental databases by defining the parameters like, Period of Interest (POI), user defined minimum support. An algorithm PISA (Progressive mIning Sequential pAttern mining) was proposed by Huang et al. for finding all sequential patterns over progressive databases. The structure of PISA helps in space utilization by limiting the height of the tree, to the length of POI and this issue is also a motivation for further improvement in this work. In this paper, a tree structure LCT (Label, Customer-id, and Time stamp) is proposed, and an approach formining closed sequential patterns using closure checking schemas across the progressive databases concept. The significance of LCT structure is, confining its height to a maximum of two levels. The algorithmic approach describes that the window size can be increased by one unit of time. The complexity of the proposed algorithmic approach is also analysed. The approach is validated using synthetic data sets available in Internet and shows a better performance in comparison to the existing methods.

Download Full-text

Mining Regular High Utility Sequential Patterns in Static and Dynamic Databases

Advances in Intelligent Systems and Computing - Proceedings of the 13th International Conference on Ubiquitous Information Management and Communication (IMCOM) 2019 ◽

10.1007/978-3-030-19063-7_71 ◽

2019 ◽

pp. 897-916 ◽

Cited By ~ 1

Author(s):

Sabrina Zaman Ishita ◽

Chowdhury Farhan Ahmed ◽

Carson K. Leung ◽

Calvin H. S. Hoi

Keyword(s):

Sequential Patterns ◽

Dynamic Databases ◽

High Utility

Download Full-text

Impact of long-term trials on crop production research and education

Acta Agronomica Hungarica ◽

10.1556/aagr.58.2010.suppl.1.1 ◽

2010 ◽

Vol 58 (Supplement 1) ◽

pp. 1-5 ◽

Cited By ~ 1

Author(s):

M. Jolánkai ◽

F. Nyárai ◽

K. Kassai

Keyword(s):

Crop Production ◽

Life Sciences ◽

Ecological Models ◽

Dynamic Databases ◽

Production Research ◽

Research And Education

Long-term trials have a twofold role in life sciences, acting as both live laboratories and public collections. Long-term trials are not simply scientific curios or the honoured relics of a museum, but highly valuable live ecological models that can never be replaced or restarted if once terminated or suspended. These trials provide valuable and dynamic databases for solving scientific problems. The present paper is intended to give a brief summary of the crop production aspects of long-term trials.

Download Full-text

The method of configuring dynamic databases

Information and Communication Technology for Education ◽

10.2495/icte130201 ◽

2014 ◽

Cited By ~ 6

Author(s):

Yuri Rogozov ◽

Alexander Sviridov ◽

Sergey Kucherov

Keyword(s):

Dynamic Databases

Download Full-text

Toward to Better Structure and Constraint to Mine Negative Sequential Patterns

IEEE Transactions on Neural Networks and Learning Systems ◽

10.1109/tnnls.2020.3041732 ◽

2020 ◽

pp. 1-15

Author(s):

Xinming Gao ◽

Yongshun Gong ◽

Tiantian Xu ◽

Jinhu Lu ◽

Yuhai Zhao ◽

...

Keyword(s):

Sequential Patterns

Download Full-text

BiLSTM regression model for face sketch synthesis using sequential patterns

Neural Computing and Applications ◽

10.1007/s00521-021-05916-9 ◽

2021 ◽

Author(s):

Abduljalil Radman ◽

Shahrel Azmin Suandi

Keyword(s):

Regression Model ◽

Sequential Patterns ◽

Sketch Synthesis ◽

Face Sketch Synthesis

Download Full-text

Dynamic maintenance model for high average-utility pattern mining with deletion operation

Applied Intelligence ◽

10.1007/s10489-021-02539-4 ◽

2021 ◽

Author(s):

Jimmy Ming-Tai Wu ◽

Qian Teng ◽

Shahab Tayeb ◽

Jerry Chun-Wei Lin

Keyword(s):

Pattern Mining ◽

Computational Cost ◽

Practical Applications ◽

Itemset Mining ◽

Dynamic Databases ◽

Speed Up ◽

Dynamic Maintenance ◽

Average Utility ◽

High Utility ◽

Maintenance Model

AbstractThe high average-utility itemset mining (HAUIM) was established to provide a fair measure instead of genetic high-utility itemset mining (HUIM) for revealing the satisfied and interesting patterns. In practical applications, the database is dynamically changed when insertion/deletion operations are performed on databases. Several works were designed to handle the insertion process but fewer studies focused on processing the deletion process for knowledge maintenance. In this paper, we then develop a PRE-HAUI-DEL algorithm that utilizes the pre-large concept on HAUIM for handling transaction deletion in the dynamic databases. The pre-large concept is served as the buffer on HAUIM that reduces the number of database scans while the database is updated particularly in transaction deletion. Two upper-bound values are also established here to reduce the unpromising candidates early which can speed up the computational cost. From the experimental results, the designed PRE-HAUI-DEL algorithm is well performed compared to the Apriori-like model in terms of runtime, memory, and scalability in dynamic databases.

Download Full-text

Maintaining the discovered sequential patterns for sequence insertion in dynamic databases

Updating the Built Prelarge Fast Updated Sequential Pattern Trees with Sequence Modification

An Efficient Approach for Mining Weighted Sequential Patterns in Dynamic Databases

Efficiently Updating the Discovered Sequential Patterns for Sequence Modification

Mining Closed Sequential Patterns in Progressive Databases

Mining Regular High Utility Sequential Patterns in Static and Dynamic Databases

Impact of long-term trials on crop production research and education

The method of configuring dynamic databases

Toward to Better Structure and Constraint to Mine Negative Sequential Patterns

BiLSTM regression model for face sketch synthesis using sequential patterns

Dynamic maintenance model for high average-utility pattern mining with deletion operation

Export Citation Format