K-means clustering algorithm for data distribution in cloud computing environment

2021 ◽  
Vol 12 (3) ◽  
pp. 322
Author(s):  
Hailan Pan ◽  
Yongmei Lei ◽  
Shi Yin
2020 ◽  
Vol 2020 ◽  
pp. 1-11
Author(s):  
Jing Yu ◽  
Hang Li ◽  
Desheng Liu

Medical data have the characteristics of particularity and complexity. Big data clustering plays a significant role in the area of medicine. The traditional clustering algorithms are easily falling into local extreme value. It will generate clustering deviation, and the clustering effect is poor. Therefore, we propose a new medical big data clustering algorithm based on the modified immune evolutionary method under cloud computing environment to overcome the above disadvantages in this paper. Firstly, we analyze the big data structure model under cloud computing environment. Secondly, we give the detailed modified immune evolutionary method to cluster medical data including encoding, constructing fitness function, and selecting genetic operators. Finally, the experiments show that this new approach can improve the accuracy of data classification, reduce the error rate, and improve the performance of data mining and feature extraction for medical data clustering.


2016 ◽  
Vol 29 (1) ◽  
pp. 279-293 ◽  
Author(s):  
Shafi’i Muhammad Abdulhamid ◽  
Muhammad Shafie Abd Latiff ◽  
Syed Hamid Hussain Madni ◽  
Mohammed Abdullahi

2021 ◽  
Vol 2021 ◽  
pp. 1-11
Author(s):  
Kai Zhang ◽  
Wei Guo ◽  
Jian Feng ◽  
Mei Liu

For the problems of low accuracy and low efficiency of most load forecasting methods, a load forecasting method based on improved deep learning in cloud computing environment is proposed. Firstly, the preprocessed data set is divided into several data partitions with relatively balanced data volume through spatial grid, so as to better detect abnormal data. Then, the density peak clustering algorithm based on spark is used to detect abnormal data in each partition, and the local clusters and abnormal points are merged. The parallel processing of data is realized by using spark cluster computing platform. Finally, the deep belief network is used for load classification, and the classification results are input into the empirical mode decomposition-gating recurrent unit network model, and the load prediction results are obtained through learning. Based on the load data of a power grid, the experimental results demonstrate that the mean prediction error of the proposed method is basically controlled within 3% in the short term and 0.023 MW, 19.75%, and 2.76% in the long term, which are better than other comparison methods, and the parallel performance is good, which has a certain feasibility.


Sign in / Sign up

Export Citation Format

Share Document