Parallel Mining of Top-k High Utility Itemsets in Spark In-Memory Computing Architecture

Author(s):  
Chun-Han Lin ◽  
Cheng-Wei Wu ◽  
JianTao Huang ◽  
Vincent S. Tseng
Author(s):  
Nguyen Manh Hung ◽  
Dau Hai Phong

Mining high utility itemsets in transaction database is an important task in data mining and widely applied in many areas. Recently, many algorithms have been proposed, but most algorithms for identifying high utility itemsets need to generate candidate sets by overestimating their utility and then calculating their exact utility value. Therefore, the number of candidate itemsets is much larger than the actual number of high utility itemsets. In this paper, we introduce the Retail Transaction-Weighted Utility (RTWU) structure and propose two algorithms: EAHUIMiner algorithm and PEAHUI-Miner parallel algorithm. They have been experimented and compared to the two most efficient algorithms: EFIM and FHM. Results show that our algorithm is better with sparse datasets. DOI: 10.32913/rd-ict.vol3.no14.519


2016 ◽  
Vol 45 (1) ◽  
pp. 44-74 ◽  
Author(s):  
Jayakrushna Sahoo ◽  
Ashok Kumar Das ◽  
A. Goswami

2021 ◽  
Vol 186 ◽  
pp. 115741
Author(s):  
Trinh D.D. Nguyen ◽  
Loan T.T. Nguyen ◽  
Lung Vu ◽  
Bay Vo ◽  
Witold Pedrycz

2021 ◽  
Author(s):  
Nguyen Van Le ◽  
Nguyen Thi Thanh Thuy ◽  
Manh Thien Ly

2017 ◽  
Vol 90 ◽  
pp. 168-183 ◽  
Author(s):  
Srikumar Krishnamoorthy

Author(s):  
Amel Hidouri ◽  
Said Jabbour ◽  
Badran Raddaoui ◽  
Boutheina Ben Yaghlane

Sign in / Sign up

Export Citation Format

Share Document