Integrated analysis of multiple microarray studies to identify novel gene signatures in preeclampsia

Placenta ◽  
2021 ◽  
Vol 105 ◽  
pp. 104-118
Author(s):  
Qingling Kang ◽  
Wei Li ◽  
Juan Xiao ◽  
Nan Yu ◽  
Lei Fan ◽  
...  
2013 ◽  
Vol 27 (S1) ◽  
Author(s):  
Rana Smalling ◽  
Don Delker ◽  
Yuxia Zhang ◽  
Michael McGuiness ◽  
Shuanghu Liu ◽  
...  

2014 ◽  
Vol 134 (4) ◽  
pp. 848-855 ◽  
Author(s):  
Lianghua Bin ◽  
Michael G. Edwards ◽  
Ryan Heiser ◽  
Joanne E. Streib ◽  
Brittany Richers ◽  
...  

2020 ◽  
Vol 24 (17) ◽  
pp. 9972-9984
Author(s):  
Mingyang Bao ◽  
Lihua Zhang ◽  
Yueqing Hu

2010 ◽  
Vol 9 ◽  
pp. CIN.S2892 ◽  
Author(s):  
Yarong Yang ◽  
Eric J. Kort ◽  
Nader Ebrahimi ◽  
Zhongfa Zhang ◽  
Bin T. Teh

Background Gene set enrichment analysis (GSEA) is an analytic approach which simultaneously reduces the dimensionality of microarray data and enables ready inference of the biological meaning of observed gene expression patterns. Here we invert the GSEA process to identify class-specific gene signatures. Because our approach uses the Kolmogorov-Smirnov approach both to define class specific signatures and to classify samples using those signatures, we have termed this methodology “Dual-KS” (DKS). Results The optimum gene signature identified by the DKS algorithm was smaller than other methods to which it was compared in 5 out of 10 datasets. The estimated error rate of DKS using the optimum gene signature was smaller than the estimated error rate of the random forest method in 4 out of the 10 datasets, and was equivalent in two additional datasets. DKS performance relative to other benchmarked algorithms was similar to its performance relative to random forests. Conclusions DKS is an efficient analytic methodology that can identify highly parsimonious gene signatures useful for classification in the context of microarray studies. The algorithm is available as the dualKS package for R as part of the bioconductor project.


Author(s):  
Nuojin Cheng ◽  
Ashley J Schulte ◽  
Fadil Santosa ◽  
Jong Hyuk Kim

Abstract Angiosarcomas are soft-tissue sarcomas that form malignant vascular tissues. Angiosarcomas are very rare, and due to their aggressive behavior and high metastatic propensity, they have poor clinical outcomes. Hemangiosarcomas commonly occur in domestic dogs, and share pathological and clinical features with human angiosarcomas. Typical pathognomonic features of this tumor are irregular vascular channels that are filled with blood and are lined by a mixture of malignant and nonmalignant endothelial cells. The current gold standard is the histological diagnosis of angiosarcoma; however, microscopic evaluation may be complicated, particularly when tumor cells are undetectable due to the presence of excessive amounts of nontumor cells or when tissue specimens have insufficient tumor content. In this study, we implemented machine learning applications from next-generation transcriptomic data of canine hemangiosarcoma tumor samples (n = 76) and nonmalignant tissues (n = 10) to evaluate their training performance for diagnostic utility. The 10-fold cross-validation test and multiple feature selection methods were applied. We found that extra trees and random forest learning models were the best classifiers for hemangiosarcoma in our testing datasets. We also identified novel gene signatures using the mutual information and Monte Carlo feature selection method. The extra trees model revealed high classification accuracy for hemangiosarcoma in validation sets. We demonstrate that high-throughput sequencing data of canine hemangiosarcoma are trainable for machine learning applications. Furthermore, our approach enables us to identify novel gene signatures as reliable determinants of hemangiosarcoma, providing significant insights into the development of potential applications for this vascular malignancy.


Sign in / Sign up

Export Citation Format

Share Document