GC-AG introns features in long non-coding and protein-coding genes suggest their role in gene expression regulation

Author(s):  
Monah Abou Alezz
2020 ◽  
Vol 11 ◽  
Author(s):  
Monah Abou Alezz ◽  
Ludovica Celli ◽  
Giulia Belotti ◽  
Antonella Lisa ◽  
Silvia Bione

2021 ◽  
Author(s):  
Alex RJ Lima ◽  
Saloe B Poubel ◽  
Juliana N Roson ◽  
Loyze PO de Lima ◽  
Hellida M Costa-Silva ◽  
...  

Background: Genomic organization and gene expression regulation in trypanosomes are remarkable because protein-coding genes are organized into codirectional gene clusters with unrelated functions. Moreover, there is no dedicated promoter for each gene, resulting in polycistronic gene transcription, with posttranscriptional control playing a major role. Nonetheless, these parasites harbor epigenetic modifications at critical regulatory genome features that dynamically change among parasite stages, which are not fully understood. Results: Here, we investigated the impact of chromatin changes in a scenario commanded by posttranscriptional control exploring the parasite Trypanosoma cruzi and its differentiation program using genome-wide approaches supported by transmission electron microscopy. The integration of FAIRE and MNase-seq data, two complementary epigenomic approaches, enabled us to identify differences in T. cruzi genome compartments, putative transcriptional start regions and virulence factors. In addition, we also detected developmental chromatin regulation at tRNA loci (tDNA), which seems to be linked to the translation regulatory mechanism required for parasite differentiation. Strikingly, a positive correlation was observed between active chromatin and steady-state transcription levels. Conclusion: Taken together, our results indicate that chromatin changes reflect the unusual gene expression regulation of trypanosomes and the differences among parasite developmental stages, even in the context of a lack of canonical transcriptional control of protein-coding genes.


2021 ◽  
Author(s):  
Alex RJ Lima ◽  
Saloe B Poubel ◽  
Juliana N Rosón ◽  
Loyze PO de Lima ◽  
Hellida M Costa-Silva ◽  
...  

Abstract Background: Genomic organization and gene expression regulation in trypanosomes are remarkable because protein-coding genes are organized into codirectional gene clusters with unrelated functions. Moreover, there is no dedicated promoter for each gene, resulting in polycistronic gene transcription, with posttranscriptional control playing a major role. Nonetheless, these parasites harbor epigenetic modifications at critical regulatory genome features that dynamically change among parasite stages, which are not fully understood. Results: Here, we investigated the impact of chromatin changes in a scenario commanded by posttranscriptional control exploring the parasite Trypanosoma cruzi and its differentiation program using genome-wide approaches supported by transmission electron microscopy. The integration of FAIRE and MNase-seq data, two complementary epigenomic approaches, enabled us to identify differences in T. cruzi genome compartments, putative transcriptional start regions and virulence factors. In addition, we also detected developmental chromatin regulation at tRNA loci (tDNA), which seems to be linked to the translation regulatory mechanism required for parasite differentiation. Strikingly, a positive correlation was observed between active chromatin and steady-state transcription levels. Conclusion: Taken together, our results indicate that chromatin changes reflect the unusual gene expression regulation of trypanosomes and the differences among parasite developmental stages, even in the context of a lack of canonical transcriptional control of protein-coding genes.


2019 ◽  
Author(s):  
Monah Abou Alezz ◽  
Ludovica Celli ◽  
Giulia Belotti ◽  
Antonella Lisa ◽  
Silvia Bione

ABSTRACTLong non-coding RNAs (lncRNAs) are recognized as an important class of regulatory molecules involved in a variety of biological functions. However, the regulatory mechanisms of long non-coding genes expression are still poorly understood. The characterization of the genomic features of lncRNAs is crucial to get insight into their function. In this study, we exploited recent annotations by GENCODE to characterize the genomic and splicing features of long non-coding genes in comparison with protein-coding ones, both in human and mouse. Our analysis highlighted differences between the two classes of genes in terms of their gene architecture. Significant differences in the splice sites usage were observed between long non-coding and protein-coding genes (PCG). While the frequency of non-canonical GC-AG splice junctions represents about 0.8% of total splice sites in PCGs, we identified a significant enrichment of the GC-AG splice sites in long non-coding genes, both in human (3.0%) and mouse (1.9%). In addition, we found a positional bias of GC-AG splice sites being enriched in the first intron in both classes of genes. Moreover, a significant shorter length and weaker donor and acceptor sites were found comparing GC-AG introns to GT-AG introns. Genes containing at least one GC-AG intron were found conserved in many species, more prone to alternative splicing and a functional analysis pointed toward their enrichment in specific biological processes such as DNA repair. Our study shows for the first time that GC-AG introns are mainly associated with lncRNAs and are preferentially located in the first intron. Additionally, we discovered their regulatory potential indicating the existence of a new mechanism of non-coding and PCGs expression regulation.


2021 ◽  
Vol 16 ◽  
Author(s):  
Min Yao ◽  
Caiyun Jiang ◽  
Chenglong Li ◽  
Yongxia Li ◽  
Shan Jiang ◽  
...  

Background: Mammalian genes are regulated at the transcriptional and post-transcriptional levels. These mechanisms may involve the direct promotion or inhibition of transcription via a regulator or post-transcriptional regulation through factors such as micro (mi)RNAs. Objective: This study aimed to construct gene regulation relationships modulated by causality inference-based miRNA-(transition factor)-(target gene) networks and analyze gene expression data to identify gene expression regulators. Methods: Mouse gene expression regulation relationships were manually curated from literature using a text mining method which was then employed to generate miRNA-(transition factor)-(target gene) networks. An algorithm was then introduced to identify gene expression regulators from transcriptome profiling data by applying enrichment analysis to these networks. Results: A total of 22,271 mouse gene expression regulation relationships were curated for 4,018 genes and 242 miRNAs. GEREA software was developed to perform the integrated analyses. We applied the algorithm to transcriptome data for synthetic miR-155 oligo-treated mouse CD4+ T-cells and confirmed that miR-155 is an important network regulator. The software was also tested on publicly available transcriptional profiling data for Salmonella infection, resulting in the identification of miR-125b as an important regulator. Conclusion: The causality inference-based miRNA-(transition factor)-(target gene) networks serve as a novel resource for gene expression regulation research, and GEREA is an effective and useful adjunct to the currently available methods. The regulatory networks and the algorithm implemented in the GEREA software package are available under a free academic license at website : http://www.thua45.cn/gerea.


Sign in / Sign up

Export Citation Format

Share Document