average utility Latest Research Papers

AbstractAssociation rule mining is a technique that is widely used in data mining. This technique is used to identify interesting relationships between sets of items in a dataset and predict associative behavior for new data. Before the rule is formed, it must be determined in advance which items will be involved or called the frequent itemset. In this step, a threshold is used to eliminate items excluded in the frequent itemset which is also known as the minimum support. Furthermore, the threshold provides an important role in determining the number of rules generated. However, setting the wrong threshold leads to the failure of the association rule mining to obtain rules. Currently, user determines the minimum support value randomly. This leads to a challenge that becomes worse for a user that is ignorant of the dataset characteristics. It causes a lot of memory and time consumption. This is because the rule formation process is repeated until it finds the desired number of rules. The value of minimum support in the adaptive support model is determined based on the average and total number of items in each transaction, as well as their support values. Furthermore, the proposed method also uses certain criteria as thresholds, therefore, the resulting rules are in accordance with user needs. The minimum support value in the proposed method is obtained from the average utility value divided by the total existing transactions. Experiments were carried out on 8 specific datasets to determine the association rules using different dataset characteristics. The trial of the proposed adaptive support method uses 2 basic algorithms in the association rule, namely Apriori and Fpgrowth. The test is carried out repeatedly to determine the highest and lowest minimum support values. The result showed that 6 out of 8 datasets produced minimum and maximum support values for the apriori and fpgrowth algorithms. This means that the value of the proposed adaptive support has the ability to generate a rule when viewed from the quality as adaptive support produces at a lift ratio value of > 1. The dataset characteristics obtained from the experimental results can be used as a factor to determine the minimum threshold value.

Download Full-text

Mining interesting sequences with low average cost and high average utility

Applied Intelligence ◽

10.1007/s10489-021-02505-0 ◽

2021 ◽

Author(s):

Tin Truong ◽

Hai Duong ◽

Bac Le ◽

Philippe Fournier-Viger ◽

Unil Yun

Keyword(s):

Average Cost ◽

Average Utility

Download Full-text

Analytics of high average-utility patterns in the industrial internet of things

Applied Intelligence ◽

10.1007/s10489-021-02751-2 ◽

2021 ◽

Author(s):

Jimmy Ming-Tai Wu ◽

Zhongcui Li ◽

Gautam Srivastava ◽

Unil Yun ◽

Jerry Chun-Wei Lin

Keyword(s):

High Performance ◽

Pattern Mining ◽

Search Space ◽

Research Field ◽

Industrial Internet Of Things ◽

Utility Measure ◽

Itemset Mining ◽

Industrial Internet ◽

Average Utility ◽

High Utility

AbstractRecently, revealing more valuable information except for quantity value for a database is an essential research field. High utility itemset mining (HAUIM) was suggested to reveal useful patterns by average-utility measure for pattern analytics and evaluations. HAUIM provides a more fair assessment than generic high utility itemset mining and ignores the influence of the length of itemsets. There are several high-performance HAUIM algorithms proposed to gain knowledge from a disorganized database. However, most existing works do not concern the uncertainty factor, which is one of the characteristics of data gathered from IoT equipment. In this work, an efficient algorithm for HAUIM to handle the uncertainty databases in IoTs is presented. Two upper-bound values are estimated to early diminish the search space for discovering meaningful patterns that greatly solve the limitations of pattern mining in IoTs. Experimental results showed several evaluations of the proposed approach compared to the existing algorithms, and the results are acceptable to state that the designed approach efficiently reveals high average utility itemsets from an uncertain situation.

Download Full-text

Entrepreneurial Migration

10.31235/osf.io/yd3v2 ◽

2021 ◽

Author(s):

Kevin Bryan ◽

Jorge Guzman

Keyword(s):

Silicon Valley ◽

Revealed Preference ◽

High Growth ◽

Growth Potential ◽

Industrial Specialization ◽

Average Utility ◽

Relative Utility ◽

Revealed Preference Approach ◽

High Growth Potential

We use cross-state business registrations to track the geographic movement of startups with high growth potential. In their first five years, 6.6% percent of these startups move across state borders. Though startup births are concentrated geographically, hubs like Silicon Valley and Boston on net lose startups to entrepreneurial migration. A revealed preference approach nonparametrically identifies the average utility of cities to migrant founders. University towns and startup hubs have low relative utility. This pattern is due neither to vertical sorting nor industrial specialization. The higher-quality startups move to lower-tax, business-friendly cities, while less growth-oriented startups move to low-tax, high-amenity cities.

Download Full-text

Frequent high minimum average utility sequence mining with constraints in dynamic databases using efficient pruning strategies

Applied Intelligence ◽

10.1007/s10489-021-02520-1 ◽

2021 ◽

Author(s):

Tin Truong ◽

Hai Duong ◽

Bac Le ◽

Philippe Fournier-Viger ◽

Unil Yun

Keyword(s):

Sequence Mining ◽

Dynamic Databases ◽

Average Utility

Download Full-text

HANP-Miner: High average utility nonoverlapping sequential pattern mining

Knowledge-Based Systems ◽

10.1016/j.knosys.2021.107361 ◽

2021 ◽

pp. 107361

Author(s):

Youxi Wu ◽

Meng Geng ◽

Yan Li ◽

Lei Guo ◽

Zhao Li ◽

...

Keyword(s):

Pattern Mining ◽

Sequential Pattern Mining ◽

Sequential Pattern ◽

Average Utility

Download Full-text

High average-utility itemsets mining: a survey

Applied Intelligence ◽

10.1007/s10489-021-02611-z ◽

2021 ◽

Author(s):

Kuldeep Singh ◽

Rajiv Kumar ◽

Bhaskar Biswas

Keyword(s):

Average Utility

Download Full-text

Dynamic maintenance model for high average-utility pattern mining with deletion operation

Applied Intelligence ◽

10.1007/s10489-021-02539-4 ◽

2021 ◽

Author(s):

Jimmy Ming-Tai Wu ◽

Qian Teng ◽

Shahab Tayeb ◽

Jerry Chun-Wei Lin

Keyword(s):

Pattern Mining ◽

Computational Cost ◽

Practical Applications ◽

Itemset Mining ◽

Dynamic Databases ◽

Speed Up ◽

Dynamic Maintenance ◽

Average Utility ◽

High Utility ◽

Maintenance Model

AbstractThe high average-utility itemset mining (HAUIM) was established to provide a fair measure instead of genetic high-utility itemset mining (HUIM) for revealing the satisfied and interesting patterns. In practical applications, the database is dynamically changed when insertion/deletion operations are performed on databases. Several works were designed to handle the insertion process but fewer studies focused on processing the deletion process for knowledge maintenance. In this paper, we then develop a PRE-HAUI-DEL algorithm that utilizes the pre-large concept on HAUIM for handling transaction deletion in the dynamic databases. The pre-large concept is served as the buffer on HAUIM that reduces the number of database scans while the database is updated particularly in transaction deletion. Two upper-bound values are also established here to reduce the unpromising candidates early which can speed up the computational cost. From the experimental results, the designed PRE-HAUI-DEL algorithm is well performed compared to the Apriori-like model in terms of runtime, memory, and scalability in dynamic databases.

Download Full-text

average utility
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Mining high average-utility sequential rules to identify high-utility gene expression sequences in longitudinal human studies

HIDING SENSITIVE HIGH AVERAGE-UTILITY ITEMSET

Minimum threshold determination method based on dataset characteristics in association rule mining

Mining interesting sequences with low average cost and high average utility

Analytics of high average-utility patterns in the industrial internet of things

Entrepreneurial Migration

Frequent high minimum average utility sequence mining with constraints in dynamic databases using efficient pruning strategies

HANP-Miner: High average utility nonoverlapping sequential pattern mining

High average-utility itemsets mining: a survey

Dynamic maintenance model for high average-utility pattern mining with deletion operation

Export Citation Format

average utilityRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Mining high average-utility sequential rules to identify high-utility gene expression sequences in longitudinal human studies

HIDING SENSITIVE HIGH AVERAGE-UTILITY ITEMSET

Minimum threshold determination method based on dataset characteristics in association rule mining

Mining interesting sequences with low average cost and high average utility

Analytics of high average-utility patterns in the industrial internet of things

Entrepreneurial Migration

Frequent high minimum average utility sequence mining with constraints in dynamic databases using efficient pruning strategies

HANP-Miner: High average utility nonoverlapping sequential pattern mining

High average-utility itemsets mining: a survey

Dynamic maintenance model for high average-utility pattern mining with deletion operation

average utility
Recently Published Documents