conserved noncoding sequences
Recently Published Documents


TOTAL DOCUMENTS

59
(FIVE YEARS 5)

H-INDEX

20
(FIVE YEARS 0)

2021 ◽  
Author(s):  
Nilmini Hettiarachchi

Conserved non coding Sequences (CNSs) are extensively studied for their regulatory properties and functional importance to organisms. Many features such as location, proximity to the likely target gene, lineage specificity, functionality of likely target genes, and nucleotide composition of these sequences have been investigated, thus have provided very meaningful insight to signify underlying evolutionary importance of these elements. Also thorough investigation around how to assign function to non-coding regions of eukaryote genomes is another area that is studied. On one hand evolutionary analyses, including signatures of selection or conservation which can indicate the presence of constraint, suggesting that sequences that are evolving non-neutrally are candidates for functionality. On the other hand evidence that is based on experimental profiling of transcription, methylation, histone modifications and chromatin state. While these types of data are very important and are associated with function in most cases, this is not always the case. Evolutionary conservation though highly conservative which mostly considers elements identifiable in more than one species, is still being used as the initial guideline in investigating function via experiments. If we had an understanding of the experimental profiles of conserved non-coding regions as there may be patterns that are often associated these potentially functional elements it may help to construed functionality of conserved non coding regions easily. In an effort to try integrate experimental profile data, we investigated evidence of expression of conserved noncoding sequences (CNSs). For CNSs from ten primates, we assessed transcription, histone modifications, level of evolutionary constraint or accelerated evolution, and assessed possible target genes, tissue expression profiles of likely target genes (as some CNSs may be enhancers, and may be ncRNAs that interact directly with mRNA) and clustering patterns of CNSs. In total we found 153475 CNSs conserved across all ten primates. Of these 59,870 were overlapping non coding regions of ncRNA genes. H3K4Me1 marks (often associated with active enhancers) were highly correlated with CNSs whereas H4K20Me1 (linked to, e.g. DNA damage repair) had high correlation with conserved ncRNA regions (ncRNA-gene-CEs). Both CNSs and conserved ncRNA showed evidence of being under purifying selection. The CNSs in our dataset overall exhibited lower allele frequencies, consistent with higher levels of evolutionary constraint. We also found that CNSs and ncRNA-gene-CEs produce mutually exclusive groups. The analyses also suggest that both types of conserved elements have undergone waves of accelerated evolution, which we speculate may indicate changes in regulatory requirements following divergence events. Finally, we find that likely target genes for hominoidae, primate and mammalian-specific CNSs and ncRNA-gene-CEs are predominantly associated with brain-related function in humans. The deep conserved primate CNSs and ncRNA gene-CEs signify functional importance suggesting ongoing recruitment of these elements into brain-related functions, consistent with King and Wilsons hypothesis that regulatory changes may account for rapid changes in phenotype among primates.



2021 ◽  
Author(s):  
Wendell J Pereira ◽  
Sara A Knaack ◽  
Daniel Conde ◽  
Sanhita Chakraborty ◽  
Ryan A Folk ◽  
...  

Nitrogen is one of the most inaccessible plant nutrients, but certain species have overcome this limitation by establishing symbiotic interactions with nitrogen-fixing bacteria in the root nodule. This root nodule symbiosis (RNS) is restricted to species within a single clade of angiosperms, suggesting a critical evolutionary event at the base of this clade, which has not yet been determined. While genes implicated in the RNS are present in most plant species (nodulating or not), gene sequence conservation alone does not imply functional conservation - developmental or phenotypic differences can arise from variation in the regulation of transcription. To identify putative regulatory sequences implicated in the evolution of RNS, we aligned the genomes of 25 species capable of nodulation. We detected 3,091 conserved noncoding sequences (CNS) in the nitrogen-fixing clade that are absent from outgroup species. Functional analysis revealed that chromatin accessibility of 452 CNS significantly correlates with the differential regulation of genes responding to lipo-chitooligosaccharides in Medicago truncatula. These included 38 CNS in proximity to 19 known genes involved in RNS. Five such regions are upstream of MtCRE1, Cytokinin Response Element 1, required to activate a suite of downstream transcription factors necessary for nodulation in M. truncatula. Genetic complementation of a Mtcre1 mutant showed a significant association between nodulation and the presence of these CNS, when they are driving the expression of a functional copy of MtCRE1. Conserved noncoding sequences, therefore, may be required for the regulation of genes controlling the root nodule symbiosis in M. truncatula.



2021 ◽  
pp. gr.266528.120
Author(s):  
Baoxing Song ◽  
Edward S. Buckler ◽  
Hai Wang ◽  
Yaoyao Wu ◽  
Evan Rees ◽  
...  


Author(s):  
Alan E Yocca ◽  
Zefu Lu ◽  
Robert J Schmitz ◽  
Michael Freeling ◽  
Patrick P Edger

Abstract Recent pangenome studies have revealed a large fraction of the gene content within a species exhibits presence-absence variation (PAV). However, coding regions alone provide an incomplete assessment of functional genomic sequence variation at the species level. Little to no attention has been paid to noncoding regulatory regions in pangenome studies, though these sequences directly modulate gene expression and phenotype. To uncover regulatory genetic variation, we generated chromosome-scale genome assemblies for thirty Arabidopsis thaliana accessions from multiple distinct habitats and characterized species level variation in Conserved Noncoding Sequences (CNS). Our analyses uncovered not only PAV and positional variation (PosV) but that diversity in CNS is non-random, with variants shared across different accessions. Using evolutionary analyses and chromatin accessibility data, we provide further evidence supporting roles for conserved and variable CNS in gene regulation. Additionally, our data suggests transposable elements contribute to CNS variation. Characterizing species-level diversity in all functional genomic sequences may later uncover previously unknown mechanistic links between genotype and phenotype.



Author(s):  
Jun Inoue ◽  
Naruya Saitou

Abstract We developed dbCNS (http://yamasati.nig.ac.jp/dbcns), a new database for conserved noncoding sequences (CNSs). CNSs exist in many eukaryotes and are assumed to be involved in protein expression control. Version 1 of dbCNS, introduced here, includes a powerful and precise CNS identification pipeline for multiple vertebrate genomes. Mutations in CNSs may induce morphological changes and cause genetic diseases. For this reason, many vertebrate CNSs have been identified, with special reference to primate genomes. We integrated ∼6.9 million CNSs from many vertebrate genomes into dbCNS, which allows users to extract CNSs near genes of interest using keyword searches. In addition to CNSs, dbCNS contains published genome sequences of 161 species. With purposeful taxonomic sampling of genomes, users can employ CNSs as queries to reconstruct CNS alignments and phylogenetic trees, to evaluate CNS modifications, acquisitions, and losses, and to roughly identify species with CNSs having accelerated substitution rates. dbCNS also produces links to dbSNP for searching pathogenic single-nucleotide polymorphisms in human CNSs. Thus, dbCNS connects morphological changes with genetic diseases. A test analysis using 38 gnathostome genomes was accomplished within 30 s. dbCNS results can evaluate CNSs identified by other stand-alone programs using genome-scale data.



2020 ◽  
Vol 225 ◽  
pp. 33-43 ◽  
Author(s):  
Misuzu Harada ◽  
Vo Trong Nghia ◽  
Ayaka Nakao ◽  
Riho Tanigaki ◽  
Natsuki Fukuoka ◽  
...  


2019 ◽  
Author(s):  
Alan E. Yocca ◽  
Zefu Lu ◽  
Robert J. Schmitz ◽  
Michael Freeling ◽  
Patrick P. Edger

AbstractRecent pangenome studies have revealed a large fraction of the gene content within a species exhibits presence-absence variation (PAV). However, coding regions alone provide an incomplete assessment of functional genomic sequence variation at the species level. Little to no attention has been paid to noncoding regulatory regions in pangenome studies, though these sequences directly modulate gene expression and phenotype. To uncover regulatory genetic variation, we generated chromosome-scale genome assemblies for thirty Arabidopsis thaliana accessions from multiple distinct habitats and characterized species level variation in Conserved Noncoding Sequences (CNS). Our analyses uncovered not only evidence for PAV and positional variation (PosV) but that diversity in CNS is non-random, with variants shared across different accessions. Using evolutionary analyses and chromatin accessibility data, we provide further evidence supporting roles for conserved and variable CNS in gene regulation. Characterizing species-level diversity in all functional genomic sequences may later uncover previously unknown mechanistic links between genotype and phenotype.



2018 ◽  
Vol 8 (1) ◽  
Author(s):  
Ramesh P. Thylur ◽  
Sung Yong Ahn ◽  
Eunhea Jung ◽  
Chang-Duk Jun ◽  
Young-Min Hyun


Heredity ◽  
2018 ◽  
Vol 120 (5) ◽  
pp. 437-451 ◽  
Author(s):  
Jianbo Xie ◽  
Kecheng Qian ◽  
Jingna Si ◽  
Liang Xiao ◽  
Dong Ci ◽  
...  


Sign in / Sign up

Export Citation Format

Share Document