scholarly journals Intertwining Threshold Settings, Biological Data and Database Knowledge to Optimize the Selection of Differentially Expressed Genes from Microarray

PLoS ONE ◽  
2010 ◽  
Vol 5 (10) ◽  
pp. e13518 ◽  
Author(s):  
Paul Chuchana ◽  
Philippe Holzmuller ◽  
Frederic Vezilier ◽  
David Berthier ◽  
Isabelle Chantal ◽  
...  
2020 ◽  
Vol 14 ◽  
pp. 117793222090616
Author(s):  
Badreddine Nouadi ◽  
Yousra Sbaoui ◽  
Mariame El Messal ◽  
Faiza Bennis ◽  
Fatima Chegdani

Nowadays, the integration of biological data is a major challenge for bioinformatics. Many studies have examined gene expression in the epithelial tissue in the intestines of infants born to term and breastfed, generating a large amount of data. The integration of these data is important to understand the biological processes involved during bacterial colonization of the newborns intestine, particularly through breast milk. This work aims to exploit the bioinformatics approaches, to provide a new representation and interpretation of the interactions between differentially expressed genes in the host intestine induced by the microbiota.


2002 ◽  
Vol 8 (9) ◽  
pp. 559-567 ◽  
Author(s):  
Takaharu Nagasaka ◽  
Gwénola Boulday ◽  
Christopher C. Fraser ◽  
Stéphanie Coupel ◽  
Flora Coulon ◽  
...  

2003 ◽  
Vol 2 (4) ◽  
pp. 383-391 ◽  
Author(s):  
Sunil Singhal ◽  
Chris G. Kyvernitis ◽  
Steven W. Johnson ◽  
Larry R. Kaiser ◽  
Michael N. Liebman ◽  
...  

2005 ◽  
Vol 03 (03) ◽  
pp. 627-643 ◽  
Author(s):  
SACH MUKHERJEE ◽  
STEPHEN J. ROBERTS

A great deal of recent research has focused on the challenging task of selecting differentially expressed genes from microarray data ("gene selection"). Numerous gene selection algorithms have been proposed in the literature, but it is often unclear exactly how these algorithms respond to conditions like small sample sizes or differing variances. Choosing an appropriate algorithm can therefore be difficult in many cases. In this paper we propose a theoretical analysis of gene selection, in which the probability of successfully selecting differentially expressed genes, using a given ranking function, is explicitly calculated in terms of population parameters. The theory developed is applicable to any ranking function which has a known sampling distribution, or one which can be approximated analytically. In contrast to methods based on simulation, the approach presented here is computationally efficient and can be used to examine the behavior of gene selection algorithms under a wide variety of conditions, even when the number of genes involved runs into the tens of thousands. The utility of our approach is illustrated by comparing three widely-used gene selection methods.


Author(s):  
Silver A Wolf ◽  
Lennard Epping ◽  
Sandro Andreotti ◽  
Knut Reinert ◽  
Torsten Semmler

Abstract Summary RNA-sequencing (RNA-Seq) is the current method of choice for studying bacterial transcriptomes. To date, many computational pipelines have been developed to predict differentially expressed genes from RNA-Seq data, but no gold-standard has been widely accepted. We present the Snakemake-based tool Smart Consensus Of RNA Expression (SCORE) which uses a consensus approach founded on a selection of well-established tools for differential gene expression analysis. This allows SCORE to increase the overall prediction accuracy and to merge varying results into a single, human-readable output. SCORE performs all steps for the analysis of bacterial RNA-Seq data, from read preprocessing to the overrepresentation analysis of significantly associated ontologies. Development of consensus approaches like SCORE will help to streamline future RNA-Seq workflows and will fundamentally contribute to the creation of new gold-standards for the analysis of these types of data. Availability and implementation https://github.com/SiWolf/SCORE. Supplementary information Supplementary data are available at Bioinformatics online.


Sign in / Sign up

Export Citation Format

Share Document