scholarly journals The water supply association analysis method in Shenzhen based on kmeans clustering discretization and apriori algorithm

PLoS ONE ◽  
2021 ◽  
Vol 16 (8) ◽  
pp. e0255684
Author(s):  
Xin Liu ◽  
Xuefeng Sang ◽  
Jiaxuan Chang ◽  
Yang Zheng ◽  
Yuping Han

Since water supply association analysis plays an important role in attribution analysis of water supply fluctuation, how to carry out effective association analysis has become a critical problem. However, the current techniques and methods used for association analysis are not very effective because they are based on continuous data. In general, there is different degrees of monotone relationship between continuous data, which makes the analysis results easily affected by monotone relationship. The multicollinearity between continuous data distorts these analytical methods and may generate incorrect results. Meanwhile, we cannot know the association rules and value interval between features and water supply. Therefore, the lack of an effective analysis method hinders the water supply association analysis. Association rules and value interval of features obtained from association analysis are helpful to grasp cause of water supply fluctuation and know the fluctuation interval of water supply, so as to provide better support for water supply dispatching. But the association rules and value interval between features and water supply are not fully understood. In this study, a data mining method coupling kmeans clustering discretization and apriori algorithm was proposed. The kmeans was used for data discretization to obtain the one-hot encoding that can be recognized by apriori, and the discretization can also avoid the influence of monotone relationship and multicollinearity on analysis results. All the rules eventually need to be validated in order to filter out spurious rules. The results show that the method in this study is an effective association analysis method. The method can not only obtain the valid strong association rules between features and water supply, but also understand whether the association relationship between features and water supply is direct or indirect. Meanwhile, the method can also obtain value interval of features, the association degree between features and confidence probability of rules.

2022 ◽  
Vol 355 ◽  
pp. 02033
Author(s):  
Tongqiang Jiang ◽  
Xin Chen ◽  
Huan Jiang

At present, China exists a problem that the cost of food sampling inspection is too high. This paper attempts to reduce the number of sampling inspection items in the same food category, reduce the cost of food sampling inspection, and improve the work efficiency through the association analysis of national sampling inspection data. And this paper applies Apriori algorithm to analyse the association rules, which is based on the unqualified pastry sampling inspection data in the 2019 national food sampling inspection database. Finally, we obtain 10 strong association rules through experiments. The results show that this association analysis can reduce the workload of food sampling inspection effectively.


2014 ◽  
Vol 1079-1080 ◽  
pp. 737-742
Author(s):  
Yi Yong Ye

For large amounts of data generated by the e-commerceplatform, combining with the actual needs of e-commerce recommendation system,make research on a common technique of association rules which orientede-commerce Web mining association analysis, introduces the association rules ofApriori mining algorithm, and the specific application of Apriori algorithm isanalyzed through a practical example, Finally, point out the shortcomings ofclassical Apriori algorithm, and gives directions for improvement.


Energies ◽  
2021 ◽  
Vol 14 (14) ◽  
pp. 4228
Author(s):  
Yan Xu ◽  
Mingyu Wang ◽  
Wen Fan

The fault data of the secondary system of smart substations hide some information that the association analysis algorithm can mine. The convergence speed of the Apriori algorithm and FP-growth algorithm is slow, and there is a lack of indicators to evaluate the correlation of association rules and the method to determine the parameter threshold. In this paper, the H-mine algorithm is used to realize the fast mining of fault data. The algorithm can traverse data faster by using the data structure of the H-struct. This paper also sets the lift and CF value to screen the association rules with good correlation. When setting the three key parameters of association analysis, namely, support threshold, confidence threshold, and lift threshold, an objective function composed of weighted average lift, CF value, and data coverage rate was selected, and the adaptive fireworks algorithm was used to optimize the parameters in the association analysis. In particular, the rule screening strategy is introduced in fault cause analysis in this paper. By eliminating rules with high similarity, derived signals in association rules are eliminated to the greatest extent to improve the readability of rules and ensure easy understanding of results.


2014 ◽  
Vol 989-994 ◽  
pp. 1586-1589
Author(s):  
Chi Zhang ◽  
Yi Liu ◽  
Fang Shuai Sun ◽  
Li Yao

First, we describe crossing-selling and association rules. And then with the study of a correlation clothing store sales data, it shows the Apriori algorithm applies in specific association analysis. We can propose a model which is suitable for crossing-selling. Through commercial test, the algorithm can significantly increase sales of the relevant product.


2012 ◽  
Vol 490-495 ◽  
pp. 2017-2021
Author(s):  
Wei Peng Zhang

To apply Apriori algorithm to analysis on association of stroke and hemorheology, and obtain the meaningful medical information. A large number of hemorheology data of patients with stroke were collected, including whole blood viscosity low cut, whole blood viscosity medium cut, whole blood viscosity high cut, blood sedimentation, hematocrit, plasma viscosity, thrombus, age, sex. Minimum support was 0.2 and minimum confidence was 0.8 as experience for analysis of association rules with apriori algorithm. Four strong association rules were screened by the objective and subjective interestingness, which contained the relation between the stroke and age, sex, whole blood viscosity, plasma viscosity. The results show that Apriori algorithm can be used to study the the diagnosis and prevention of stroke.


2014 ◽  
Vol 556-562 ◽  
pp. 1510-1514
Author(s):  
Li Qiang Lin ◽  
Hong Wen Yan

For the low efficiency in generating candidate item sets of apriori algorithm, this paper presents a method based on property division to improve generating candidate item sets. Comparing the improved apriori algorithm with the other algorithm and the improved algorithm is applied to the power system accident cases in extreme climate. The experiment results show that the improved algorithm significantly improves the time efficiency of generating candidate item sets. And it can find the association rules among time, space, disasters and fault facilities in the power system accident cases in extreme climate. That is very useful in power system fault analysis.


2019 ◽  
Vol 2 (1) ◽  
pp. 31-36
Author(s):  
Arfianto Darmawan ◽  
Titin Kristiana

The Anakku Foundation Cooperative is a multi-business cooperative consisting of shop businesses, savings and loans, and student shuttle services. Every sale of stuff services will be inputted data directly to each business unit. The Anakku Foundation Cooperative still has problems, including store transactions that cannot yet answer what items are often sold, when stock items are still difficult to determine the items that are still available or almost running out. Data mining techniques have been mostly used to overcome existing problems, one of which is the application of the Apriori algorithm to obtain information about the associations between products from a transaction database. Transaction data on school equipment sales at Cooperative Employees of Anakku Foundation can be reprocessed using Data mining applications so as to produce strong association rules between itemset sales of school supplies so that they can provide recommendations for item alignment and simplify the arrangement or strong item placement related to interdependence. The results are found that the highest value of support and confidence is if buying MUSLIM L1.5P1, so it would buy AL-IZHAR II LOGO with a value of 14.5% support and 79.5% confidence


2018 ◽  
Vol 1 (2) ◽  
pp. 292
Author(s):  
RYANTY DERWENTYANA NAZHAR

Al-Irsyad mosque at Kota Baru Parahyangan is a religious building that has the power of narrative and full of meaning. The elements of interior and exterior are designed by the architect to deliver messages to users. One element of the interior which is the orientation for the user is the Mihrab. Mihrab of this mosque has a unique concept of shape and philosophy. The concept can contain messages that can even up on the ideology that is intended for users who use the facility, and when elements of the space is interpreted (signified), then it becomes a sign (sign). Meaning analysis method that is used is combined with analysis of the typology of signs and sign structure, that will produce a more powerful meaning of the sign. Mihrab Al-Irsyad mosque is one part of a whole section of the mosque, but its role is very important for conveying message, especially message that regarding religious ideology, which is expressed in symbols. Mihrab Al-Irsyad Kota Baru Parahyangan try to use elements of the space as a mark to describe the expression "Quranic", so that each element is applied within the mihrab has a meaning derived from the meaning that comes from the verses of the Qur'an as a source of knowledge for the Islam follower. Expression of space that is designed solely to strengthen the religious beliefs of the congregation of the mosque on infinity, the one, and the greatness of God. Keywords: mihrab; mosque; semiotic; sign


Sign in / Sign up

Export Citation Format

Share Document