Sample Size for Identifying Differentially Expressed Genes in Microarray Experiments

2004 ◽  
Vol 11 (4) ◽  
pp. 714-726 ◽  
Author(s):  
Sue-Jane Wang ◽  
James J. Chen
2008 ◽  
Vol 2 ◽  
pp. BBI.S473 ◽  
Author(s):  
Akihiro Hirakawa ◽  
Yasunori Sato ◽  
Chikuma Hamada ◽  
Isao Yoshimura

Choosing an appropriate statistic and precisely evaluating the false discovery rate (FDR) are both essential for devising an effective method for identifying differentially expressed genes in microarray data. The t-type score proposed by Pan et al. (2003) succeeded in suppressing false positives by controlling the underestimation of variance but left the overestimation uncontrolled. For controlling the overestimation, we devised a new test statistic (variance stabilized t-type score) by placing shrunken sample variances of the James-Stein type in the denominator of the t-type score. Since the relative superiority of the mean and median FDRs was unclear in the widely adopted Significance Analysis of Microarrays (SAM), we conducted simulation studies to examine the performance of the variance stabilized t-type score and the characteristics of the two FDRs. The variance stabilized t-type score was generally better than or at least as good as the t-type score, irrespective of the sample size and proportion of differentially expressed genes. In terms of accuracy, the median FDR was superior to the mean FDR when the proportion of differentially expressed genes was large. The variance stabilized t-type score with the median FDR was applied to actual colorectal cancer data and yielded a reasonable result.


2010 ◽  
Vol 44-47 ◽  
pp. 905-909
Author(s):  
Yuan Tian ◽  
Gui Xia Liu ◽  
Chun Guang Zhou

One of the main purposes in analysis of microarray experiments is to identify differentially expressed genes under two experimental conditions. The Meta-analysis method, rank product meta-analysis approach, considered a powerful tool for identification of differentially expressed genes. However, rank product meta-analysis approach used the each dataset in the computation of the fold changes, which leaded to less computational efficiency. Here we modified the rank product meta-analysis approach to obtain an improved model for identifying different gene expression. The new model, grouping rank product approach, adds competitive classification of samples to group datasets before the computation of the fold changes. We used the grouping rank product approach on two simulated datasets and two breast datasets and showed that the grouping rank product approach is not only as accurate as the rank product meta-analysis approach, but also more computational efficient in identifying differentially expressed genes.


2003 ◽  
Vol 19 (6) ◽  
pp. 694-703 ◽  
Author(s):  
T. Park ◽  
S.-G. Yi ◽  
S. Lee ◽  
S. Y. Lee ◽  
D.-H. Yoo ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document