An Efficient Algorithm to Mine High Average-Utility Sequential Patterns

Author(s):  
Tiantian Xu
2018 ◽  
Vol 95 ◽  
pp. 77-92 ◽  
Author(s):  
Bac Le ◽  
Duy-Tai Dinh ◽  
Van-Nam Huynh ◽  
Quang-Minh Nguyen ◽  
Philippe Fournier-Viger

Author(s):  
José Kadir Febrer-Hernández ◽  
José Hernández-Palancar ◽  
Raudel Hernández-León ◽  
Claudia Feregrino-Uribe

2016 ◽  
Vol 30 (2) ◽  
pp. 233-243 ◽  
Author(s):  
Jerry Chun-Wei Lin ◽  
Ting Li ◽  
Philippe Fournier-Viger ◽  
Tzung-Pei Hong ◽  
Justin Zhan ◽  
...  

2020 ◽  
Vol 515 ◽  
pp. 302-323
Author(s):  
Tin Truong ◽  
Hai Duong ◽  
Bac Le ◽  
Philippe Fournier-Viger

Author(s):  
MEHDI Haj Ali ◽  
Qun-Xiong Zhu ◽  
Yan-Lin He

<p><em>Sequential pattern mining, it  is not just important in data mining field , but  it is the basis of many applications .However, running applications cost time and memory, especially when dealing with dense of the dataset. Setting the proper minimum support threshold is one of the factors that consume more memory and time. However ,  it is difficult for users to get the appropriate patterns, it may present too many sequential patterns  and makes it difficult for users to comprehend the results. The problem becomes worse and worse when dealing with long click stream sequences or huge dataset. As a solution, we developed an efficient algorithm, called TopK (Top-K click stream sequence pattern mining), which employs the output as top-k patterns , K is the most important and relevant frequencies (with a high support) . However ,our algorithm based on pseudo-projection to avoid consuming more time and memory, and uses several efficient search space pruning methods together with BI-Directional Extension. Our extensive study and experiments on real click stream datasets show TopK significantly outperforms the previous algorithms.</em></p>


Author(s):  
Yongshun Gong ◽  
Tiantian Xu ◽  
Xiangjun Dong ◽  
Guohua Lv

Negative sequential patterns (NSPs), which focus on nonoccurring but interesting behaviors (e.g. missing consumption records), provide a special perspective of analyzing sequential patterns. So far, very few methods have been proposed to solve for NSP mining problem, and these methods only mine NSP from positive sequential patterns (PSPs). However, as many useful negative association rules are mined from infrequent itemsets, many meaningful NSPs can also be found from infrequent positive sequences (IPSs). The challenge of mining NSP from IPS is how to constrain which IPS could be available used during NSP process because, if without constraints, the number of IPS would be too large to be handled. So in this study, we first propose a strategy to constrain which IPS could be available and utilized for mining NSP. Then we give a storage optimization method to hold this IPS information. Finally, an efficient algorithm called Efficient mining Negative Sequential Pattern from both Frequent and Infrequent positive sequential patterns (e-NSPFI) is proposed for mining NSP. The experimental results show that e-NSPFI can efficiently find much more interesting negative patterns than e-NSP.


Sign in / Sign up

Export Citation Format

Share Document