Research and application on algorithms of data mining for EMU malfunction’s data under a cloud computing environment

This paper studies the design of an efficient data mining model for large databases in a cloud computing environment. To address the problem of mining large databases efficiently, a data mining model based on an improved manifold learning algorithm is proposed. Nonlinear manifold learning is used to reduce the dimensionality of the data feature vectors in the cloud computing environment. After a feature extraction module preprocesses the data, an improved version of a classical manifold learning algorithm increases the distances between samples in densely populated regions and shortens the distances between samples in sparsely populated regions. This makes the overall distribution of the sample database more uniform and so allows the target data to be mined accurately and efficiently. The experimental results show that the proposed method can accurately mine target data in cloud computing environments, with high efficiency and precision.
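
The distance adjustment described in this abstract can be illustrated with a minimal sketch. The abstract does not give the formula, so the one used here is an assumption: each pairwise distance is divided by the geometric mean of the two points' local scales, where a point's local scale is the mean distance to its `k` nearest neighbours. The function names and the sample data are hypothetical, and the sketch covers only the rescaling step, not the embedding that a full manifold learning algorithm would compute afterwards.

```python
import math


def pairwise_distances(points):
    """Euclidean distance between every pair of points."""
    n = len(points)
    return [[math.dist(points[i], points[j]) for j in range(n)] for i in range(n)]


def local_scales(dist, k):
    """Mean distance from each point to its k nearest neighbours (itself excluded)."""
    scales = []
    for i, row in enumerate(dist):
        others = sorted(d for j, d in enumerate(row) if j != i)
        scales.append(sum(others[:k]) / k)
    return scales


def density_adjusted_distances(points, k=2):
    """Assumed rescaling: d'(i, j) = d(i, j) / sqrt(scale_i * scale_j).

    A small local scale (dense region) stretches distances; a large one
    (sparse region) shrinks them. Assumes no point has k exact duplicates,
    which would make its local scale zero.
    """
    dist = pairwise_distances(points)
    scales = local_scales(dist, k)
    n = len(points)
    return [
        [dist[i][j] / math.sqrt(scales[i] * scales[j]) for j in range(n)]
        for i in range(n)
    ]


# Hypothetical sample: one tight cluster and one loose cluster of the same shape.
dense = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1)]
sparse = [(10.0, 10.0), (12.0, 10.0), (10.0, 12.0)]
points = dense + sparse

raw = pairwise_distances(points)
adjusted = density_adjusted_distances(points, k=2)
```

In this sample the adjusted distance between the first two dense points is larger than the raw one, and the adjusted distance between the first two sparse points is smaller than the raw one, so the two clusters end up on a comparable scale.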

Data mining using hierarchical virtual k-means approach integrating data fragments in cloud computing environment

2011 IEEE International Conference on Cloud Computing and Intelligence Systems, 2011. DOI: 10.1109/ccis.2011.6045065. Cited By ~ 6

Author(s): T. R. Gopalakrishnan Nair, K. Lakshmi Madhuri

Keyword(s): Data Mining, Cloud Computing, Computing Environment, Cloud Computing Environment

Construction of multi tier distributed computing data mining system in cloud computing environment

Proceedings of the 2017 2nd International Conference on Materials Science, Machinery and Energy Engineering (MSMEE 2017), 2017. DOI: 10.2991/msmee-17.2017.301

Author(s): Wendong Xia, Yuanfeng Liu, Deli Chen

Keyword(s): Data Mining, Cloud Computing, Distributed Computing, Computing Environment, Mining System, Cloud Computing Environment, Data Mining System

Web data mining algorithm based on cloud computing environment

International Journal of Grid and Utility Computing, 2021, Vol 12 (4), pp. 359. DOI: 10.1504/ijguc.2021.10043188

Author(s): Yunpeng Liu, Xiaolong Gu, Jie Zhang

Keyword(s): Data Mining, Cloud Computing, Data Mining Algorithm, Computing Environment, Web Data, Web Data Mining, Cloud Computing Environment, Mining Algorithm

Research on the Parallel Frequent Data Mining Strategy under the Cloud Computing Environment

Applied Mechanics and Materials, 2015, Vol 719-720, pp. 924-928. DOI: 10.4028/www.scientific.net/amm.719-720.924. Cited By ~ 1

Author(s): Xiao Chun Sheng, Xiao Feng Xue, Yan Ping Cheng

Keyword(s): Data Mining, Cloud Computing, Large Data, Efficient Solutions, Data Repository, Computing Environment, Cloud Computing Environment, Item Data, Important Basis, Data Mining Strategy

Cloud computing distributes computing tasks across a pool of resources made up of a large number of networked computers, providing users with cheap and efficient computing power, storage capacity, and service capabilities. Data mining finds useful information in large data repositories. Quickly and accurately finding frequent items in large data flows provides an important basis for forecasting and decision making. A parallel frequent item data mining strategy under the cloud computing environment, which offers an efficient solution for storing and analyzing vast amounts of data, therefore has important theoretical significance and application value.
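
The general shape of parallel frequent item mining can be sketched as a map step that counts itemsets within each data partition and a reduce step that merges the counts and applies a minimum support threshold. This is an illustration under stated assumptions, not the strategy from the paper: the function names, the transactions, and the thresholds are hypothetical, a thread pool stands in for the worker nodes of a cloud platform, and every itemset up to `max_size` is enumerated directly rather than pruned as Apriori-style algorithms would do.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor
from itertools import combinations


def count_itemsets(transactions, max_size):
    """Map step: count every itemset of up to max_size items in one partition."""
    counts = Counter()
    for transaction in transactions:
        items = sorted(set(transaction))  # canonical order, duplicates dropped
        for size in range(1, max_size + 1):
            counts.update(combinations(items, size))
    return counts


def frequent_itemsets(transactions, min_support, max_size=2, workers=2):
    """Count itemsets per partition in parallel, then merge and filter."""
    # One partition per worker (round-robin split).
    partitions = [transactions[i::workers] for i in range(workers)]
    total = Counter()
    with ThreadPoolExecutor(max_workers=workers) as pool:
        partial_counts = pool.map(count_itemsets, partitions, [max_size] * workers)
        # Reduce step: merge the per-partition counts.
        for partial in partial_counts:
            total.update(partial)
    return {itemset: n for itemset, n in total.items() if n >= min_support}


# Hypothetical transaction data.
transactions = [
    ["bread", "milk"],
    ["bread", "butter", "milk"],
    ["butter", "milk"],
    ["bread", "butter"],
]
frequent = frequent_itemsets(transactions, min_support=3)
```

Because itemset counts are additive across partitions, the merged totals are the same regardless of how the transactions are split among workers.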

Design of data mining model based on improved manifold learning algorithm in cloud computing environment

Proceedings of the 2017 5th International Conference on Frontiers of Manufacturing Science and Measuring Technology (FMSMT 2017), 2017. DOI: 10.2991/fmsmt-17.2017.277

Author(s): Zhan-kun Zhao

Keyword(s): Data Mining, Cloud Computing, Manifold Learning, Learning Algorithm, Computing Environment, Cloud Computing Environment, Model Based, Mining Model

A Data Mining Method Using Deep Learning for Anomaly Detection in Cloud Computing Environment

Mathematical Problems in Engineering, 2020, Vol 2020, pp. 1-11. DOI: 10.1155/2020/6343705

Author(s): Jin Gao, Jiaquan Liu, Sihua Guo, Qi Zhang, Xinyang Wang

Keyword(s): Data Mining, Cloud Computing, Anomaly Detection, Dimensionality Reduction, Mining Method, Computing Environment, Data Set, Cloud Computing Environment, Reduction Model, Data Mining Method

To address the slow training speed, poor prediction performance, and unstable detection results of traditional anomaly detection algorithms, a data mining method for anomaly detection based on a deep variational dimensionality reduction model and MapReduce (DMAD-DVDMR) in a cloud computing environment is proposed. First, the data are preprocessed by a dimensionality reduction model based on deep variational learning, which reduces the dimensionality of the data, and with it the computational load, while preserving as much of the data's information as possible. Second, the data set stored on the Hadoop Distributed File System (HDFS) is logically divided into several data blocks, and the blocks are processed in parallel with MapReduce, so that the k-distance and LOF value of each data point are calculated only within its own block. Third, based on stochastic gradient descent, the concept of k-neighboring distance is redefined, which avoids the case where k or more duplicate points in the data set lead to an infinite local density. Finally, compared with the CNN, DeepAnt, and SVM-IDS algorithms, the accuracy of the scheme is increased by 10.3%, 18.0%, and 17.2%, respectively. Experiments on the data set verify the effectiveness and scalability of the proposed DMAD-DVDMR algorithm.
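
The block-wise LOF step can be sketched with the standard local outlier factor definition applied separately to each block. This is not the DMAD-DVDMR algorithm itself. Several parts are assumptions made for illustration: the function names and sample points are hypothetical, a sequential loop stands in for the parallel MapReduce tasks, exactly `k` neighbours are used even when distances tie, and a small `eps` floor on the mean reachability distance stands in for the paper's redefined k-neighboring distance, whose details the abstract does not give.

```python
import math


def lof_scores(block, k, eps=1e-12):
    """Standard LOF for the points of one block (block needs at least k + 1 points)."""
    n = len(block)
    # k nearest neighbours of each point as (distance, index), nearest first.
    neighbours = []
    for i in range(n):
        dists = sorted(
            (math.dist(block[i], block[j]), j) for j in range(n) if j != i
        )
        neighbours.append(dists[:k])
    k_distance = [nbrs[-1][0] for nbrs in neighbours]

    # Local reachability density: inverse of the mean reachability distance.
    lrd = []
    for i in range(n):
        reach = [max(k_distance[j], d) for d, j in neighbours[i]]
        # Assumed safeguard: the eps floor keeps the density finite when a
        # point has k or more exact duplicates.
        lrd.append(1.0 / max(sum(reach) / k, eps))

    # LOF: mean density of the neighbours divided by the point's own density.
    return [sum(lrd[j] for _, j in neighbours[i]) / (k * lrd[i]) for i in range(n)]


def lof_by_block(points, block_size, k):
    """Score each block independently; neighbours in other blocks are ignored."""
    scores = []
    for start in range(0, len(points), block_size):
        scores.extend(lof_scores(points[start:start + block_size], k))
    return scores


# Hypothetical data: block 1 is a unit square plus one far point,
# block 2 holds four exact duplicates plus one nearby point.
points = [
    (0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (1.0, 1.0), (5.0, 5.0),
    (2.0, 2.0), (2.0, 2.0), (2.0, 2.0), (2.0, 2.0), (3.0, 3.0),
]
scores = lof_by_block(points, block_size=5, k=2)
```

In the first block the four corner points score about 1 (inliers) and the far point scores well above 1. In the second block every score stays finite despite the duplicates, although the point next to the duplicates receives a very large score because of the `eps` floor.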
