Data preprocessing

Author(s):  
Khalid K. Al-jabery ◽  
Tayo Obafemi-Ajayi ◽  
Gayla R. Olbricht ◽  
Donald C. Wunsch II
Keyword(s):  
2009 ◽  
Vol 147-149 ◽  
pp. 588-593 ◽  
Author(s):  
Marcin Derlatka ◽  
Jolanta Pauk

In the paper the procedure of processing biomechanical data has been proposed. It consists of selecting proper noiseless data, preprocessing data by means of model’s identification and Kernel Principal Component Analysis and next classification using decision tree. The obtained results of classification into groups (normal and two selected pathology of gait: Spina Bifida and Cerebral Palsy) were very good.


2005 ◽  
Vol 38 (14) ◽  
pp. 2475-2492 ◽  
Author(s):  
Fan Gong ◽  
Bo‐Tang Wang ◽  
Foo‐Tim Chau ◽  
Yi‐Zeng Liang

2020 ◽  
Vol 10 (1) ◽  
Author(s):  
Rosa Alba Sola Martínez ◽  
José María Pastor Hernández ◽  
Gema Lozano Terol ◽  
Julia Gallego-Jara ◽  
Luis García-Marcos ◽  
...  

AbstractThe noninvasive diagnosis and monitoring of high prevalence diseases such as cardiovascular diseases, cancers and chronic respiratory diseases are currently priority objectives in the area of health. In this regard, the analysis of volatile organic compounds (VOCs) has been identified as a potential noninvasive tool for the diagnosis and surveillance of several diseases. Despite the advantages of this strategy, it is not yet a routine clinical tool. The lack of reproducible protocols for each step of the biomarker discovery phase is an obstacle of the current state. Specifically, this issue is present at the data preprocessing step. Thus, an open source workflow for preprocessing the data obtained by the analysis of exhaled breath samples using gas chromatography coupled with single quadrupole mass spectrometry (GC/MS) is presented in this paper. This workflow is based on the connection of two approaches to transform raw data into a useful matrix for statistical analysis. Moreover, this workflow includes matching compounds from breath samples with a spectral library. Three free packages (xcms, cliqueMS and eRah) written in the language R are used for this purpose. Furthermore, this paper presents a suitable protocol for exhaled breath sample collection from infants under 2 years of age for GC/MS.


2014 ◽  
Vol 687-691 ◽  
pp. 1592-1595
Author(s):  
Yun Peng Duan ◽  
Chun Xi Zhao ◽  
Ying Shi

With the widely application of the WWW and the emergence of Web technology, make the research of data mining has entered a new stage. Web log mining is based on the idea of data mining to analyze the server log processing. Paper aimed at the early stage of the data mining is put forward based on log data preprocessing methods, the purpose is to divide server logs into multiple unique user access sequence at a time, and to give a good algorithm.


Sign in / Sign up

Export Citation Format

Share Document