SEGMENT CLASSIFICATION OF ECG DATA AND CONSTRUCTION OF SCATTER PLOTS USING PRINCIPAL COMPONENT ANALYSIS

2008 ◽  
Vol 08 (03) ◽  
pp. 421-458 ◽  
Author(s):  
M. P. S. CHAWLA

In many medical applications, feature selection is obvious; but in medical domains, selecting features and creating a feature vector may require more effort. The wavelet transform (WT) technique is used to identify the characteristic points of an electrocardiogram (ECG) signal with fairly good accuracy, even in the presence of severe high-frequency and low-frequency noise. Principal component analysis (PCA) is a suitable technique for ECG data analysis, feature extraction, and image processing — an important technique that is not based upon a probability model. The aim of the paper is to derive better diagnostic parameters for reducing the size of ECG data while preserving morphology, which can be done by PCA. In this analysis, PCA is used for decorrelation of ECG signals, noise, and artifacts from various raw ECG data sets. The aim of this paper is twofold: first, to describe an elegant algorithm that uses WT alone to identify the characteristic points of an ECG signal; and second, to use a composite WT-based PCA method for redundant data reduction and better feature extraction. PCA scatter plots can be observed as a good basis for feature selection to account for cardiac abnormalities. The study is analyzed with higher-order statistics, in contrast to the conventional methods that use only geometric characteristics of feature waves and lower-order statistics. A new algorithm — viz. PCA variance estimator — is developed for this analysis, and the results are also obtained for different combinations of leads to find correlations for feature classification and useful diagnostic information. PCA scatter plots of various chest and augmented ECG leads are obtained to examine the varying orientations of the ECG data in different quadrants, indicating the cardiac events and abnormalities. The efficacy of the PCA algorithm is tested on different leads of 12-channel ECG data; file no. 01 of the Common Standards for Electrocardiography (CSE) database is used for this study. Better feature extraction is obtained for some specific combinations of leads, and significant improvement in signal quality is achieved by identifying the noise and artifact components. The quadrant analysis discussed in this paper highlights the filtering requirements for further ECG processing after performing PCA, as a primary step for decorrelation and dimensionality reduction. The values of the parameters obtained from the results of PCA are also compared with those of wavelet methods.

2020 ◽  
Vol 9 (2) ◽  
pp. 72-79
Author(s):  
Sari Ayu Wulandari ◽  
Sutikno Madnasri ◽  
Ratih Pramitasari ◽  
Susilo Susilo

The need for aroma recognition devices or often known as enose (electronic nose), is increasing. In the health field, enose can detect early diabetes mellitus (DM) type 2 from the aroma of urine. Enose is an aroma recognition tool that uses a pattern recognition algorithm to recognize the urine aroma of diabetics based on input signals from an array of gas sensors. The need for portable enose devices is increasing due to the increasing need for real-time needs. Enose devices have an enormous impact on the choice of the gas sensor Array in the enose. This article discusses the effect of the number of sensor arrays used on the recognition results. Enose uses a maximum of 4 sensors, with a maximum feature matrix. After that, the feature matrix enters the PCA (Principal Component Analysis) feature extraction and clustering using the FCM (Fuzzy C Means) method. The number of sensors indicates the number of features. Enose using method for feature selection, it’s a variation from 4 sensors, where experiment 1 uses 4 sensors, experiment 2 uses a variation of 3 sensors and experiment 3 uses a variation of 2 sensors. Especially for sensors 3 and 4 using feature extraction method, PCA (Principal Component Analysis), to reduce features to only 2 best features. As for the variation of 2 sensors use primer feature matrix. After feature selection, the number of features is 2 out of 11 variations. Next, do the grouping using the FCM (Fuzzy C Means) method. The results show that using two sensors has a high accuracy rate of 92.5%.


Polymers ◽  
2021 ◽  
Vol 13 (23) ◽  
pp. 4117
Author(s):  
Y-h. Taguchi ◽  
Turki Turki

The development of the medical applications for substances or materials that contact cells is important. Hence, it is necessary to elucidate how substances that surround cells affect gene expression during incubation. In the current study, we compared the gene expression profiles of cell lines that were in contact with collagen–glycosaminoglycan mesh and control cells. Principal component analysis-based unsupervised feature extraction was applied to identify genes with altered expression during incubation in the treated cell lines but not in the controls. The identified genes were enriched in various biological terms. Our method also outperformed a conventional methodology, namely, gene selection based on linear regression with time course.


Sign in / Sign up

Export Citation Format

Share Document