high utility Latest Research Papers

High-utility sequential pattern mining (HUSPM) has been an active research topic in recent decades because it combines sequential and utility properties to reveal more information than traditional frequent itemset mining or sequential pattern mining. Several HUSPM algorithms have been presented, but most of them keep the data in main memory to speed up mining. This assumption is unrealistic in large-scale environments: in industry, the collected data is often too large to fit into the main memory of a single machine. In this article, we first develop a parallel and distributed three-stage MapReduce model for mining high-utility sequential patterns from large-scale databases. Two properties are then developed to guarantee the correctness and completeness of the patterns discovered by the framework. In addition, two data structures, called sidset and utility-linked list, are used in the framework to accelerate the mining computation. The results show that the designed model performs well on large-scale datasets in terms of runtime, memory, efficiency with respect to the number of distributed nodes, and scalability, compared to the serial HUSP-Span approach.

Mining High Utility Itemsets with Hill Climbing and Simulated Annealing

ACM Transactions on Management Information Systems ◽

10.1145/3462636 ◽

2022 ◽

Vol 13 (1) ◽

pp. 1-22

Author(s):

M. Saqib Nawaz ◽

Philippe Fournier-Viger ◽

Unil Yun ◽

Youxi Wu ◽

Wei Song

Keyword(s):

Simulated Annealing ◽

Heuristic Algorithms ◽

Real Life ◽

Search Space ◽

Population Diversity ◽

Hill Climbing ◽

Target Values ◽

High Utility ◽

High Utility Itemsets ◽

Search Space Pruning

High utility itemset mining (HUIM) is the task of finding all sets of items purchased together that generate a high profit in a transaction database. Several algorithms have been developed to mine high utility itemsets (HUIs). However, most of them cannot properly handle the exponential search space as the size of the database and the total number of items increase. Recently, evolutionary and heuristic algorithms were designed to mine HUIs, and they provided considerable performance improvements. However, they can still have a long runtime, and some may miss many HUIs. To address this problem, this article proposes two algorithms for HUIM based on Hill Climbing (HUIM-HC) and Simulated Annealing (HUIM-SA). Both algorithms transform the input database into a bitmap for efficient utility computation and search space pruning. To improve population diversity, HUIs discovered by evolution are used as target values for the next population instead of keeping the current optimal values in the next population. Experiments on real-life datasets show that the proposed algorithms are faster than state-of-the-art heuristic and evolutionary HUIM algorithms, that HUIM-SA discovers similar HUIs, and that HUIM-SA evolves linearly with the number of iterations.
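The abstract gives no implementation details, so the following is only a minimal sketch of the two ideas it names: a bitmap over transactions for utility computation, and a hill-climbing search over itemsets. The toy database, the unit profits, and the function names `build_bitmap`, `itemset_utility`, and `hill_climb` are all assumptions made for illustration; this is not the HUIM-HC algorithm itself, which also uses pruning and a population of candidates.

```python
from typing import Dict, FrozenSet, List, Tuple

# Hypothetical toy data: each transaction maps item -> purchased quantity.
TRANSACTIONS: List[Dict[str, int]] = [
    {"a": 1, "b": 2, "c": 1},
    {"a": 2, "c": 3},
    {"b": 1, "c": 1, "d": 4},
    {"a": 1, "b": 1, "d": 1},
]
# Hypothetical unit profit of each item.
UNIT_PROFIT: Dict[str, int] = {"a": 5, "b": 2, "c": 1, "d": 3}


def build_bitmap(db: List[Dict[str, int]]) -> Dict[str, int]:
    """Map each item to an integer whose bit `tid` is set if transaction `tid` contains it."""
    bitmap: Dict[str, int] = {}
    for tid, transaction in enumerate(db):
        for item in transaction:
            bitmap[item] = bitmap.get(item, 0) | (1 << tid)
    return bitmap


def itemset_utility(itemset: FrozenSet[str], db: List[Dict[str, int]],
                    profit: Dict[str, int], bitmap: Dict[str, int]) -> int:
    """Sum quantity * unit profit over every transaction that contains the whole itemset."""
    if not itemset:
        return 0
    cover = (1 << len(db)) - 1          # start with all transactions
    for item in itemset:
        cover &= bitmap.get(item, 0)    # bitwise AND keeps common transactions only
    total, tid = 0, 0
    while cover:
        if cover & 1:
            total += sum(db[tid][i] * profit[i] for i in itemset)
        cover >>= 1
        tid += 1
    return total


def hill_climb(start: FrozenSet[str], db: List[Dict[str, int]],
               profit: Dict[str, int]) -> Tuple[FrozenSet[str], int]:
    """Greedy hill climbing: flip one item at a time while utility strictly improves."""
    bitmap = build_bitmap(db)
    current, best = start, itemset_utility(start, db, profit, bitmap)
    improved = True
    while improved:
        improved = False
        candidate, candidate_utility = current, best
        for item in sorted(profit):
            neighbor = current ^ {item}  # add the item if absent, drop it if present
            u = itemset_utility(neighbor, db, profit, bitmap)
            if u > candidate_utility:
                candidate, candidate_utility = neighbor, u
        if candidate_utility > best:
            current, best, improved = candidate, candidate_utility, True
    return current, best


if __name__ == "__main__":
    bm = build_bitmap(TRANSACTIONS)
    print(itemset_utility(frozenset({"a", "c"}), TRANSACTIONS, UNIT_PROFIT, bm))  # 19
    print(hill_climb(frozenset({"c"}), TRANSACTIONS, UNIT_PROFIT))
```

In this sketch an itemset counts as a HUI when its utility reaches a user-chosen minimum threshold. Plain hill climbing stops at the first local optimum, which is one reason a simulated-annealing variant that sometimes accepts worse neighbors can be attractive.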

HUFTI-SPM: high-utility and frequent time-interval sequential pattern mining from transactional databases

International Journal of Data Science and Analytics ◽

10.1007/s41060-021-00297-7 ◽

2022 ◽

Author(s):

Ritika ◽

Sunil Kumar Gupta

Keyword(s):

Pattern Mining ◽

Sequential Pattern Mining ◽

Sequential Pattern ◽

Time Interval ◽

Transactional Databases ◽

High Utility

OHUQI: Mining on-shelf high-utility quantitative itemsets

The Journal of Supercomputing ◽

10.1007/s11227-021-04218-0 ◽

2022 ◽

Author(s):

Lili Chen ◽

Wensheng Gan ◽

Qi Lin ◽

Shuqiang Huang ◽

Chien-Ming Chen

Keyword(s):

High Utility

An efficient spatial high-utility occupancy frequent item mining algorithm for mission system integration architecture design using the MBSE method

Aerospace Systems ◽

10.1007/s42401-021-00126-6 ◽

2022 ◽

Author(s):

Xiaoxu Dong ◽

Miao Wang ◽

Yongqi Liu ◽

Gang Xiao ◽

Dan Huang ◽

...

Keyword(s):

System Integration ◽

Architecture Design ◽

Frequent Item ◽

Mining Algorithm ◽

Integration Architecture ◽

Mission System ◽

High Utility

Parallel Algorithm to Efficiently Mine High Utility Itemset

ICT Analysis and Applications - Lecture Notes in Networks and Systems ◽

10.1007/978-981-16-5655-2_16 ◽

2022 ◽

pp. 167-178

Author(s):

Eduardus Hardika Sandy Atmaja ◽

Kavita Sonawane

Keyword(s):

Parallel Algorithm ◽

High Utility

High Utility Co-location Patterns

Big Data Management - Preference-based Spatial Co-location Pattern Mining ◽

10.1007/978-981-16-7566-9_8 ◽

2022 ◽

pp. 201-222

Author(s):

Lizhen Wang ◽

Yuan Fang ◽

Lihua Zhou

Keyword(s):

Location Patterns ◽

High Utility

Ordering policy estimation for high utility item-sets considering negative item values in large databases

International Journal of Decision Support System Technology ◽

10.4018/ijdsst.286682 ◽

2022 ◽

Vol 14 (1)

Keyword(s):

Data Mining ◽

Real World ◽

Defective Items ◽

Utility Mining ◽

Mining Algorithm ◽

Negative Item ◽

Ordering Policy ◽

Large Databases ◽

Real World Applications ◽

High Utility

Utility mining with negative item values has recently received interest in the data mining field because of its practical relevance. Previously, the utility values of item-sets were assumed to be positive. However, in real-world applications an item-set may involve negative item values. This paper presents a method for redesigning the ordering policy by including high utility item-sets with negative items. First, a utility mining algorithm is used to find high utility item-sets. Then, the ordering policy is estimated for high utility items, considering defective and non-defective items. A numerical example is given to validate the results.

Mining high average-utility sequential rules to identify high-utility gene expression sequences in longitudinal human studies

Expert Systems with Applications ◽

10.1016/j.eswa.2021.116411 ◽

2022 ◽

pp. 116411

Author(s):

Alberto Segura-Delgado ◽

Augusto Anguita-Ruiz ◽

Rafael Alcalá ◽

Jesús Alcalá-Fdez

Keyword(s):

Gene Expression ◽

Human Studies ◽

Average Utility ◽

High Utility

high utility
Recently Published Documents

EAHUIM: Enhanced Absolute High Utility Itemset Miner for Big Data

Scalable Mining of High-Utility Sequential Patterns With Three-Tier MapReduce Model

Mining High Utility Itemsets with Hill Climbing and Simulated Annealing

HUFTI-SPM: high-utility and frequent time-interval sequential pattern mining from transactional databases

OHUQI: Mining on-shelf high-utility quantitative itemsets

An efficient spatial high-utility occupancy frequent item mining algorithm for mission system integration architecture design using the MBSE method

Parallel Algorithm to Efficiently Mine High Utility Itemset

High Utility Co-location Patterns

Ordering policy estimation for high utility item-sets considering negative item values in large databases

Mining high average-utility sequential rules to identify high-utility gene expression sequences in longitudinal human studies
