scholarly journals An Efficient Algorithm for Mining Maximal Frequent Sequential Patterns in Large Databases

Author(s):  
Qiu-bin SU ◽  
Lu LU ◽  
Bin CHENG
2017 ◽  
Vol 26 (1) ◽  
pp. 69-85
Author(s):  
Mohammed M. Fouad ◽  
Mostafa G.M. Mostafa ◽  
Abdulfattah S. Mashat ◽  
Tarek F. Gharib

AbstractAssociation rules provide important knowledge that can be extracted from transactional databases. Owing to the massive exchange of information nowadays, databases become dynamic and change rapidly and periodically: new transactions are added to the database and/or old transactions are updated or removed from the database. Incremental mining was introduced to overcome the problem of maintaining previously generated association rules in dynamic databases. In this paper, we propose an efficient algorithm (IMIDB) for incremental itemset mining in large databases. The algorithm utilizes the trie data structure for indexing dynamic database transactions. Performance comparison of the proposed algorithm to recently cited algorithms shows that a significant improvement of about two orders of magnitude is achieved by our algorithm. Also, the proposed algorithm exhibits linear scalability with respect to database size.


2018 ◽  
Vol 95 ◽  
pp. 77-92 ◽  
Author(s):  
Bac Le ◽  
Duy-Tai Dinh ◽  
Van-Nam Huynh ◽  
Quang-Minh Nguyen ◽  
Philippe Fournier-Viger

Author(s):  
José Kadir Febrer-Hernández ◽  
José Hernández-Palancar ◽  
Raudel Hernández-León ◽  
Claudia Feregrino-Uribe

2001 ◽  
Vol 16 (4) ◽  
pp. 359-370
Author(s):  
Ning Chen ◽  
An Chen ◽  
Longxiang Zhou ◽  
Lu Liu

2009 ◽  
Vol 68 (1) ◽  
pp. 68-106 ◽  
Author(s):  
Lei Chang ◽  
Tengjiao Wang ◽  
Dongqing Yang ◽  
Hua Luan ◽  
Shiwei Tang

2010 ◽  
Vol 09 (06) ◽  
pp. 873-888 ◽  
Author(s):  
TZUNG-PEI HONG ◽  
CHING-YAO WANG ◽  
CHUN-WEI LIN

Mining knowledge from large databases has become a critical task for organizations. Managers commonly use the obtained sequential patterns to make decisions. In the past, databases were usually assumed to be static. In real-world applications, however, transactions may be updated. In this paper, a maintenance algorithm for rapidly updating sequential patterns for real-time decision making is proposed. The proposed algorithm utilizes previously discovered large sequences in the maintenance process, thus greatly reducing the number of database rescans and improving performance. Experimental results verify the performance of the proposed approach. The proposed algorithm provides real-time knowledge that can be used for decision making.


Author(s):  
MEHDI Haj Ali ◽  
Qun-Xiong Zhu ◽  
Yan-Lin He

<p><em>Sequential pattern mining, it  is not just important in data mining field , but  it is the basis of many applications .However, running applications cost time and memory, especially when dealing with dense of the dataset. Setting the proper minimum support threshold is one of the factors that consume more memory and time. However ,  it is difficult for users to get the appropriate patterns, it may present too many sequential patterns  and makes it difficult for users to comprehend the results. The problem becomes worse and worse when dealing with long click stream sequences or huge dataset. As a solution, we developed an efficient algorithm, called TopK (Top-K click stream sequence pattern mining), which employs the output as top-k patterns , K is the most important and relevant frequencies (with a high support) . However ,our algorithm based on pseudo-projection to avoid consuming more time and memory, and uses several efficient search space pruning methods together with BI-Directional Extension. Our extensive study and experiments on real click stream datasets show TopK significantly outperforms the previous algorithms.</em></p>


Sign in / Sign up

Export Citation Format

Share Document