Weighted pooling high-throughput gene expression data sets to maximize the functional coherence of the top rank genes

Author(s):  
Xiaodong Zhou ◽  
E. O. George
2015 ◽  
Vol 11 (11) ◽  
pp. 3137-3148
Author(s):  
Nazanin Hosseinkhan ◽  
Peyman Zarrineh ◽  
Hassan Rokni-Zadeh ◽  
Mohammad Reza Ashouri ◽  
Ali Masoudi-Nejad

Gene co-expression analysis is one of the main aspects of systems biology that uses high-throughput gene expression data.


Author(s):  
Soumya Raychaudhuri

The most interesting and challenging gene expression data sets to analyze are large multidimensional data sets that contain expression values for many genes across multiple conditions. In these data sets the use of scientific text can be particularly useful, since there are a myriad of genes examined under vastly different conditions, each of which may induce or repress expression of the same gene for different reasons. There is an enormous complexity to the data that we are examining—each gene is associated with dozens if not hundreds of expression values as well as multiple documents built up from vocabularies consisting of thousands of words. In Section 2.4 we reviewed common gene expression strategies, most of which revolve around defining groups of genes based on common profiles. A limitation of many gene expression analytic approaches is that they do not incorporate comprehensive background knowledge about the genes into the analysis. We present computational methods that leverage the peer-reviewed literature in the automatic analysis of gene expression data sets. Including the literature in gene expression data analysis offers an opportunity to incorporate background functional information about the genes when defining expression clusters. In Chapter 5 we saw how literature- based approaches could help in the analysis of single condition experiments. Here we will apply the strategies introduced in Chapter 6 to assess the coherence of groups of genes to enhance gene expression analysis approaches. The methods proposed here could, in fact, be applied to any multivariate genomics data type. The key concepts discussed in this chapter are listed in the frame box. We begin with a discussion of gene groups and their role in expression analysis; we briefly discuss strategies to assign keywords to groups and strategies to assess their functional coherence. We apply functional coherence measures to gene expression analysis; for examples we focus on a yeast expression data set. We first demonstrate how functional coherence can be used to focus in on the key biologically relevant gene groups derived by clustering methods such as self-organizing maps and k-means clustering.


2012 ◽  
Vol 39 (12) ◽  
pp. 3046-3061 ◽  
Author(s):  
Harun Pirim ◽  
Burak Ekşioğlu ◽  
Andy D. Perkins ◽  
Çetin Yüceer

Sign in / Sign up

Export Citation Format

Share Document