average utility
Recently Published Documents


TOTAL DOCUMENTS

101
(FIVE YEARS 49)

H-INDEX

15
(FIVE YEARS 2)

2022 ◽  
pp. 116411
Author(s):  
Alberto Segura-Delgado ◽  
Augusto Anguita-Ruiz ◽  
Rafael Alcalá ◽  
Jesús Alcalá-Fdez

2021 ◽  
Author(s):  
Huynh Trieu Vy ◽  
Le Quoc Hai ◽  
Truong Ngoc Chau ◽  
Le Quoc Hieu
Keyword(s):  

2021 ◽  
Vol 8 (1) ◽  
Author(s):  
Erna Hikmawati ◽  
Nur Ulfa Maulidevi ◽  
Kridanto Surendro

AbstractAssociation rule mining is a technique that is widely used in data mining. This technique is used to identify interesting relationships between sets of items in a dataset and predict associative behavior for new data. Before the rule is formed, it must be determined in advance which items will be involved or called the frequent itemset. In this step, a threshold is used to eliminate items excluded in the frequent itemset which is also known as the minimum support. Furthermore, the threshold provides an important role in determining the number of rules generated. However, setting the wrong threshold leads to the failure of the association rule mining to obtain rules. Currently, user determines the minimum support value randomly. This leads to a challenge that becomes worse for a user that is ignorant of the dataset characteristics. It causes a lot of memory and time consumption. This is because the rule formation process is repeated until it finds the desired number of rules. The value of minimum support in the adaptive support model is determined based on the average and total number of items in each transaction, as well as their support values. Furthermore, the proposed method also uses certain criteria as thresholds, therefore, the resulting rules are in accordance with user needs. The minimum support value in the proposed method is obtained from the average utility value divided by the total existing transactions. Experiments were carried out on 8 specific datasets to determine the association rules using different dataset characteristics. The trial of the proposed adaptive support method uses 2 basic algorithms in the association rule, namely Apriori and Fpgrowth. The test is carried out repeatedly to determine the highest and lowest minimum support values. The result showed that 6 out of 8 datasets produced minimum and maximum support values for the apriori and fpgrowth algorithms. This means that the value of the proposed adaptive support has the ability to generate a rule when viewed from the quality as adaptive support produces at a lift ratio value of > 1. The dataset characteristics obtained from the experimental results can be used as a factor to determine the minimum threshold value.


Author(s):  
Tin Truong ◽  
Hai Duong ◽  
Bac Le ◽  
Philippe Fournier-Viger ◽  
Unil Yun
Keyword(s):  

Author(s):  
Jimmy Ming-Tai Wu ◽  
Zhongcui Li ◽  
Gautam Srivastava ◽  
Unil Yun ◽  
Jerry Chun-Wei Lin

AbstractRecently, revealing more valuable information except for quantity value for a database is an essential research field. High utility itemset mining (HAUIM) was suggested to reveal useful patterns by average-utility measure for pattern analytics and evaluations. HAUIM provides a more fair assessment than generic high utility itemset mining and ignores the influence of the length of itemsets. There are several high-performance HAUIM algorithms proposed to gain knowledge from a disorganized database. However, most existing works do not concern the uncertainty factor, which is one of the characteristics of data gathered from IoT equipment. In this work, an efficient algorithm for HAUIM to handle the uncertainty databases in IoTs is presented. Two upper-bound values are estimated to early diminish the search space for discovering meaningful patterns that greatly solve the limitations of pattern mining in IoTs. Experimental results showed several evaluations of the proposed approach compared to the existing algorithms, and the results are acceptable to state that the designed approach efficiently reveals high average utility itemsets from an uncertain situation.


2021 ◽  
Author(s):  
Kevin Bryan ◽  
Jorge Guzman

We use cross-state business registrations to track the geographic movement of startups with high growth potential. In their first five years, 6.6% percent of these startups move across state borders. Though startup births are concentrated geographically, hubs like Silicon Valley and Boston on net lose startups to entrepreneurial migration. A revealed preference approach nonparametrically identifies the average utility of cities to migrant founders. University towns and startup hubs have low relative utility. This pattern is due neither to vertical sorting nor industrial specialization. The higher-quality startups move to lower-tax, business-friendly cities, while less growth-oriented startups move to low-tax, high-amenity cities.


2021 ◽  
pp. 107361
Author(s):  
Youxi Wu ◽  
Meng Geng ◽  
Yan Li ◽  
Lei Guo ◽  
Zhao Li ◽  
...  

Author(s):  
Kuldeep Singh ◽  
Rajiv Kumar ◽  
Bhaskar Biswas
Keyword(s):  

Author(s):  
Jimmy Ming-Tai Wu ◽  
Qian Teng ◽  
Shahab Tayeb ◽  
Jerry Chun-Wei Lin

AbstractThe high average-utility itemset mining (HAUIM) was established to provide a fair measure instead of genetic high-utility itemset mining (HUIM) for revealing the satisfied and interesting patterns. In practical applications, the database is dynamically changed when insertion/deletion operations are performed on databases. Several works were designed to handle the insertion process but fewer studies focused on processing the deletion process for knowledge maintenance. In this paper, we then develop a PRE-HAUI-DEL algorithm that utilizes the pre-large concept on HAUIM for handling transaction deletion in the dynamic databases. The pre-large concept is served as the buffer on HAUIM that reduces the number of database scans while the database is updated particularly in transaction deletion. Two upper-bound values are also established here to reduce the unpromising candidates early which can speed up the computational cost. From the experimental results, the designed PRE-HAUI-DEL algorithm is well performed compared to the Apriori-like model in terms of runtime, memory, and scalability in dynamic databases.


Sign in / Sign up

Export Citation Format

Share Document