A two-phase approach to mine short-period high-utility itemsets in transactional databases

The goal of the high-utility itemset mining task is to discover combinations of items that yield high profits from transactional databases. HUIM is a useful tool for retail stores to analyze customer behaviors. However, in the real world, items are found with both positive and negative utility values. To address this issue, we propose an algorithm named Modified Efficient High‐utility Itemsets mining with Negative utility (MEHIN) to find all HUIs with negative utility. This algorithm is an improved version of the EHIN algorithm. MEHIN utilizes 2 new upper bounds for pruning, named revised subtree and revised local utility. To reduce dataset scans, the proposed algorithm uses transaction merging and dataset projection techniques. An array‐based utility‐counting technique is also utilized to calculate upper‐bound efficiently. The MEHIN employs a novel structure called P-set to reduce the number of transaction scans and to speed up the mining process. Experimental results show that the proposed algorithms considerably outperform the state-of-the-art HUI-mining algorithms on negative utility in retail databases in terms of runtime.

Download Full-text

A TIFF-Tree Based High Utility Itemset Mining Algorithm

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.760-762.1713 ◽

2013 ◽

Vol 760-762 ◽

pp. 1713-1717

Author(s):

Yi Pan ◽

Bo Zhang

Keyword(s):

Two Phase ◽

Column Operation ◽

Itemset Mining ◽

Mining Algorithm ◽

Utility Information ◽

High Utility ◽

High Utility Itemsets ◽

Result Analysis ◽

Better Than

Owing to their major contribution to the total transaction's sales profits, increasingly importance has been attached to high utility itemsets mining. This paper has proposed a TIFF-tree based algorithm, which takes two-pass database scan to obtain the transaction utility information, the conditional matrix of potential high utility is adopted, through the row-column operation, the calculation of transaction utility can be simplified. The experiment result analysis shows that as the decreasing of user-defined threshold, the performance of TIFP-Growth algorithm is much better than the two-phase algorithm.

Download Full-text

Efficient Algorithms for Mining High Utility Itemsets from Transactional Databases

IEEE Transactions on Knowledge and Data Engineering ◽

10.1109/tkde.2012.59 ◽

2013 ◽

Vol 25 (8) ◽

pp. 1772-1786 ◽

Cited By ~ 262

Author(s):

Vincent S. Tseng ◽

Bai-En Shie ◽

Cheng-Wei Wu ◽

Philip S. Yu

Keyword(s):

Efficient Algorithms ◽

Transactional Databases ◽

High Utility ◽

High Utility Itemsets

Download Full-text

Discovering Relative High Utility Itemsets in Very Large Transactional Databases Using Null-Invariant Measure

10.1109/bigdata52589.2021.9672064 ◽

2021 ◽

Author(s):

R. Uday Kiran ◽

Pradeep Pallikila ◽

J. M. Luna ◽

Philippe Fournier-Viger ◽

Masashi Toyoda ◽

...

Keyword(s):

Invariant Measure ◽

Transactional Databases ◽

High Utility ◽

High Utility Itemsets

Download Full-text

A Dynamic Itemset Counting Based Two-Phase Algorithm for Mining High Utility Itemsets

2018 15th IEEE India Council International Conference (INDICON) ◽

10.1109/indicon45594.2018.8987024 ◽

2018 ◽

Author(s):

B Anup Bhat ◽

S V Harish ◽

M Geetha

Keyword(s):

Two Phase ◽

High Utility ◽

High Utility Itemsets

Download Full-text

Mining Top-k Regular High-Utility Itemsets in Transactional Databases

International Journal of Data Warehousing and Mining ◽

10.4018/ijdwm.2019010104 ◽

2019 ◽

Vol 15 (1) ◽

pp. 58-79 ◽

Cited By ~ 1

Author(s):

P. Lalitha Kumari ◽

S. G. Sanjeevi ◽

T.V. Madhusudhana Rao

Keyword(s):

High Efficiency ◽

Threshold Value ◽

Search Space ◽

List Structure ◽

High Profit ◽

Transactional Databases ◽

High Utility ◽

High Utility Itemsets ◽

Pruning Techniques ◽

Novel Algorithm

Mining high-utility itemsets is an important task in the area of data mining. It involves exponential mining space and returns a very large number of high-utility itemsets. In a real-time scenario, it is often sufficient to mine a small number of high-utility itemsets based on user-specified interestingness. Recently, the temporal regularity of an itemset is considered as an important interesting criterion for many applications. Methods for finding the regular high utility itemsets suffers from setting the threshold value. To address this problem, a novel algorithm called as TKRHU (Top k Regular High Utility Itemset) Miner is proposed to mine top-k high utility itemsets that appears regularly where k represents the desired number of regular high itemsets. A novel list structure RUL and efficient pruning techniques are developed to discover the top-k regular itemsets with high profit. Efficient pruning techniques are designed for reducing search space. Experimental results show that proposed algorithm using novel list structure achieves high efficiency in terms of runtime and space.

Download Full-text

A Survey on Mining High Utility Itemsets from Transactional Databases

International Journal of Science and Research (IJSR) ◽

10.21275/art20164639 ◽

2017 ◽

Vol 6 (1) ◽

pp. 1975-1978

Keyword(s):

Transactional Databases ◽

High Utility ◽

High Utility Itemsets

Download Full-text

HIGH UTILITY ITEMSETS MINING

International Journal of Information Technology & Decision Making ◽

10.1142/s0219622010004159 ◽

2010 ◽

Vol 09 (06) ◽

pp. 905-934 ◽

Cited By ~ 9

Author(s):

YING LIU ◽

JIANWEI LI ◽

WEI-KENG LIAO ◽

ALOK CHOUDHARY ◽

YONG SHI

Keyword(s):

Decision Making ◽

Real World ◽

Two Phase ◽

Business Decision ◽

Memory Space ◽

Real World Applications ◽

The Difference ◽

High Utility ◽

High Utility Itemsets ◽

The Impact

High utility itemsets mining identifies itemsets whose utility satisfies a given threshold. It allows users to quantify the usefulness or preferences of items using different values. Thus, it reflects the impact of different items. High utility itemsets mining is useful in decision-making process of many applications, such as retail marketing and Web service, since items are actually different in many aspects in real applications. However, due to the lack of "downward closure property", the cost of candidate generation of high utility itemsets mining is intolerable in terms of time and memory space. This paper presents a Two-Phase algorithm which can efficiently prune down the number of candidates and precisely obtain the complete set of high utility itemsets. The performance of our algorithm is evaluated by applying it to synthetic databases and two real-world applications. It performs very efficiently in terms of speed and memory cost on large databases composed of short transactions, which are difficult for existing high utility itemsets mining algorithms to handle. Experiments on real-world applications demonstrate the significance of high utility itemsets in business decision-making, as well as the difference between frequent itemsets and high utility itemsets.

Download Full-text

Mining High Utility Itemsets with Negative Utility Values in Transactional Databases

International Journal of Computer Applications ◽

10.5120/ijca2016908009 ◽

2016 ◽

Vol 134 (5) ◽

pp. 39-42

Author(s):

Priyanka D. ◽

Abhijit Patil

Keyword(s):

Utility Values ◽

Transactional Databases ◽

High Utility ◽

High Utility Itemsets

Download Full-text

A two-phase approach to mine short-period high-utility itemsets in transactional databases

A Two-Phase Algorithm for Fast Discovery of High Utility Itemsets

MINING OF HIGH-UTILITY ITEMSETS WITH NEGATIVE UTILITY

A TIFF-Tree Based High Utility Itemset Mining Algorithm

Efficient Algorithms for Mining High Utility Itemsets from Transactional Databases

Discovering Relative High Utility Itemsets in Very Large Transactional Databases Using Null-Invariant Measure

A Dynamic Itemset Counting Based Two-Phase Algorithm for Mining High Utility Itemsets

Mining Top-k Regular High-Utility Itemsets in Transactional Databases

A Survey on Mining High Utility Itemsets from Transactional Databases

HIGH UTILITY ITEMSETS MINING

Mining High Utility Itemsets with Negative Utility Values in Transactional Databases

Export Citation Format