scholarly journals Data Preprocessing: The Techniques for Preparing Clean and Quality Data for Data Analytics Process

2021 ◽  
Vol 13 (0203) ◽  
pp. 78-81
Author(s):  
Ashish P. Joshi ◽  
Biraj V. Patel

The model and pattern for real time data mining have an important role for decision making. The meaningful real time data mining is basically depends on the quality of data while row or rough data available at warehouse. The data available at warehouse can be in any format, it may huge or it may unstructured. These kinds of data require some process to enhance the efficiency of data analysis. The process to make it ready to use is called data preprocessing. There can be many activities for data preprocessing such as data transformation, data cleaning, data integration, data optimization and data conversion which are use to converting the rough data to quality data. The data preprocessing techniques are the vital step for the data mining. The analyzed result will be good as far as data quality is good. This paper is about the different data preprocessing techniques which can be use for preparing the quality data for the data analysis for the available rough data.

2018 ◽  
Vol 19 (S18) ◽  
Author(s):  
Ahmed Sanaullah ◽  
Chen Yang ◽  
Yuri Alexeev ◽  
Kazutomo Yoshii ◽  
Martin C. Herbordt

2014 ◽  
Vol 599-601 ◽  
pp. 1487-1490 ◽  
Author(s):  
Li Kun Zheng ◽  
Kun Feng ◽  
Xiao Qing Xiao ◽  
Wei Qiao Song

This paper mainly discusses the application of the mass real-time data mining technology in equipment safety state evaluation in the power plant and the realization of the equipment comprehensive quantitative assessment and early warning of potential failure by mining analysis and modeling massive amounts of real-time data the power equipment. In addition to the foundational technology introduced in this paper, the technology is also verified by the application case in the power supply side remote diagnosis center of Guangdong electric institute.


2015 ◽  
Vol 2015 ◽  
pp. 1-14 ◽  
Author(s):  
Woochul Kang ◽  
Jaeyong Chung

With ubiquitous deployment of sensors and network connectivity, amounts of real-time data for embedded systems are increasing rapidly and database capability is required for many embedded systems for systematic management of real-time data. In such embedded systems, supporting the timeliness of tasks accessing databases is an important problem. However, recent multicore-based embedded architectures pose a significant challenge for such data-intensive real-time tasks since the response time of accessing data can be significantly affected by potential intercore interferences. In this paper, we propose a novel feedback control scheme that supports the timeliness of data-intensive tasks against unpredictable intercore interferences. In particular, we use multiple inputs/multiple outputs (MIMO) control method that exploits multiple control knobs, for example, CPU frequency and the Quality-of-Data (QoD) to handle highly unpredictable workloads in multicore systems. Experimental results, using actual implementation, show that the proposed approach achieves the target Quality-of-Service (QoS) goals, such as task timeliness and Quality-of-Data (QoD) while consuming less energy compared to baseline approaches.


2020 ◽  
Vol 17 (11) ◽  
pp. 5162-5166
Author(s):  
Puninder Kaur ◽  
Amandeep Kaur ◽  
Rajwinder Kaur

In the IT world, predicting the academic performance of the huge student population poses a big challenge. Educational data mining techniques significantly contribute in providing solution to this problem. There are several prediction methods available for data classification and clustering, to extract information and provide accurate results. In this paper, different prediction methodologies are highlighted for the prediction of real-time data analysis of dynamic academic behavior of the students. The main focus is to provide brief knowledge about all data mining techniques and highlight dissimilarities among various methods in order to provide the best results for the students.


Sign in / Sign up

Export Citation Format

Share Document