Faculty Opinions recommendation of Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project.

Author(s):  
Charles Coutelle
2005 ◽  
Vol 102 (13) ◽  
pp. 4795-4800 ◽  
Author(s):  
E. H. Margulies ◽  
J. P. Vinson ◽  
W. Miller ◽  
D. B. Jaffe ◽  
K. Lindblad-Toh ◽  
...  

2019 ◽  
Author(s):  
Christian D. Huber ◽  
Bernard Y. Kim ◽  
Kirk E. Lohmueller

AbstractComparative genomic approaches have been used to identify sites where mutations are under purifying selection and of functional consequence by searching for sequences that are conserved across distantly related species. However, the performance of these approaches has not been rigorously evaluated under population genetic models. Further, short-lived functional elements may not leave a footprint of sequence conservation across many species. Here, we use simulations to study how one measure of conservation, the GERP score, relates to the strength of selection (Nes). We show that the GERP score is related to the strength of purifying selection. However, changes in selection coefficients or functional elements over time (i.e. functional turnover) can strongly affect the GERP distribution, leading to unexpected relationships between GERP and Nes. Further, we show that for functional elements that have a high turnover rate, the optimal tree size is not necessarily the largest possible tree, and more turnover reduces the optimal tree size. Finally, we use the distribution of GERP scores across the human genome to compare models with and without turnover of sites where mutations under purifying selection. We show that mutations in 4.51% of the noncoding human genome are under purifying selection and that most of this sequence has likely experienced changes in selection coefficients throughout mammalian evolution.


2003 ◽  
Vol 68 (0) ◽  
pp. 317-322
Author(s):  
Z. LIAN ◽  
G. EUSKIRCHEN ◽  
J. RINN ◽  
R. MARTONE ◽  
P. BERTONE ◽  
...  

2017 ◽  
Author(s):  
Yi Li ◽  
Daniel Quang ◽  
Xiaohui Xie

AbstractMotivationComparing the human genome to the genomes of closely related mammalian species has been a powerful tool for discovering functional elements in the human genome. Millions of conserved elements have been discovered. However, understanding the functional roles of these elements still remain a challenge, especially in noncoding regions. In particular, it is still unclear why these elements are evolutionarily conserved and what kind of functional elements are encoded within these sequences.ResultsWe present a deep learning framework, called DeepCons, to uncover potential functional elements within conserved sequences. DeepCons is a convolutional neural net (CNN) that receives a short segment of DNA sequence as input and outputs the probability of the sequence of being evolutionary conserved. DeepCons utilizes hundreds of convolution kernels to detect features within DNA sequences, and automatically learns these kernels after training the CNN model using 887,577 conserved elements and a similar number of nonconserved elements in the human genome. On a balanced test dataset, DeepCons can achieve an accuracy of 75% in determining whether a sequence element is conserved or not, and the area under the ROC curve of 0.83, based on information from the human genome alone. We further investigate the properties of the learned kernels. Some kernels are directly related to well-known regulatory motifs corresponding to transcription factors. Many kernels show positional biases relative to transcriptional start sites or transcription end sites. But most of discovered kernels do not correspond to any known functional element, suggesting that they might represent unknown categories of functional elements. We also utilize DeepCons to annotate how changes at each individual nucleotide might impact the conservation properties of the surrounding sequences.AvailabilityThe source code of DeepCons and all the learned convolution kernels in motif format is publicly available online athttps://github.com/uci-cbcl/[email protected]


Sign in / Sign up

Export Citation Format

Share Document