Multiclass Decision Forest—A Novel Pattern Recognition Method for Multiclass Classification in Microarray Data Analysis

2004 ◽  
Vol 23 (10) ◽  
pp. 685-694 ◽  
Author(s):  
Huixiao Hong ◽  
Weida Tong ◽  
Roger Perkins ◽  
Hong Fang ◽  
Qian Xie ◽  
...  
2017 ◽  
Vol 2017 ◽  
pp. 1-16 ◽  
Author(s):  
Jingzong Yang ◽  
Xiaodong Wang ◽  
Zao Feng ◽  
Guoyong Huang

Aiming at the nonstationary and nonlinear characteristics of acoustic impulse response signal in pipeline blockage and the difficulty in identifying the different degrees of blockage, this paper proposed a pattern recognition method based on local mean decomposition (LMD), information entropy theory, and extreme learning machine (ELM). Firstly, the impulse response signals of pipeline extracted in different operating conditions were decomposed with LMD method into a series of product functions (PFs). Secondly, based on the information entropy theory, the appropriate energy entropy, singular spectrum entropy, power spectrum entropy, and Hilbert spectrum entropy were extracted as the input feature vectors. Finally, ELM was introduced for classification of pipeline blockage. Through the analysis of acoustic impulse response signal collected under the condition of health and different degrees of blockages in pipeline, the results show that the proposed method can well characterize the state information. Also, it has a great advantage in terms of accuracy and it is time consuming when compared with the support vector machine (SVM) and BP (backpropagation) model.


2008 ◽  
Vol 06 (02) ◽  
pp. 261-282 ◽  
Author(s):  
AO YUAN ◽  
WENQING HE

Clustering is a major tool for microarray gene expression data analysis. The existing clustering methods fall mainly into two categories: parametric and nonparametric. The parametric methods generally assume a mixture of parametric subdistributions. When the mixture distribution approximately fits the true data generating mechanism, the parametric methods perform well, but not so when there is nonnegligible deviation between them. On the other hand, the nonparametric methods, which usually do not make distributional assumptions, are robust but pay the price for efficiency loss. In an attempt to utilize the known mixture form to increase efficiency, and to free assumptions about the unknown subdistributions to enhance robustness, we propose a semiparametric method for clustering. The proposed approach possesses the form of parametric mixture, with no assumptions to the subdistributions. The subdistributions are estimated nonparametrically, with constraints just being imposed on the modes. An expectation-maximization (EM) algorithm along with a classification step is invoked to cluster the data, and a modified Bayesian information criterion (BIC) is employed to guide the determination of the optimal number of clusters. Simulation studies are conducted to assess the performance and the robustness of the proposed method. The results show that the proposed method yields reasonable partition of the data. As an illustration, the proposed method is applied to a real microarray data set to cluster genes.


2003 ◽  
Vol 01 (03) ◽  
pp. 541-586 ◽  
Author(s):  
Tero Aittokallio ◽  
Markus Kurki ◽  
Olli Nevalainen ◽  
Tuomas Nikula ◽  
Anne West ◽  
...  

Microarray analysis has become a widely used method for generating gene expression data on a genomic scale. Microarrays have been enthusiastically applied in many fields of biological research, even though several open questions remain about the analysis of such data. A wide range of approaches are available for computational analysis, but no general consensus exists as to standard for microarray data analysis protocol. Consequently, the choice of data analysis technique is a crucial element depending both on the data and on the goals of the experiment. Therefore, basic understanding of bioinformatics is required for optimal experimental design and meaningful interpretation of the results. This review summarizes some of the common themes in DNA microarray data analysis, including data normalization and detection of differential expression. Algorithms are demonstrated by analyzing cDNA microarray data from an experiment monitoring gene expression in T helper cells. Several computational biology strategies, along with their relative merits, are overviewed and potential areas for additional research discussed. The goal of the review is to provide a computational framework for applying and evaluating such bioinformatics strategies. Solid knowledge of microarray informatics contributes to the implementation of more efficient computational protocols for the given data obtained through microarray experiments.


Sign in / Sign up

Export Citation Format

Share Document