Transcriptomic profiling identifies novel transcripts, isomorphs, and noncoding RNAs in Paracoccidioides brasiliensis

2020 ◽  
Author(s):  
Fabiano B Menegidio ◽  
David Aciole Barbosa ◽  
Valquíria C Alencar ◽  
Renata O Vilas Boas ◽  
Regina Costa de Oliveira ◽  
...  

Abstract This paper describes a transcriptomic profiling of Paracoccidioides brasiliensis (Pb) performed with the aid of an RNA-seq-based approach, aimed at characterizing the general transcriptome in this human pathogenic fungus, responsible for paracoccidioidomycosis (PCM). Results confirm that ∼75% of the genes currently annotated in the P. brasiliensis genome are, in fact, transcribed in vivo and that ∼19% of them may display alternative isomorphs. Moreover, we identified 627 transcripts that do not match any gene currently mapped in the genome, represented by 114 coding transcripts (probably derived from previously unmapped protein-coding genes) and 513 noncoding RNAs (ncRNAs), including 203 long-noncoding RNAs (lncRNAs).

Blood ◽  
2016 ◽  
Vol 128 (22) ◽  
pp. 2880-2880
Author(s):  
Susann Ihme ◽  
Hartmut Döhner ◽  
Konstanze Döhner ◽  
Michaela Feuring-Buske ◽  
Christian Buske ◽  
...  

Abstract Acute Myeloid Leukemia (AML) is a hematological malignancy with heterogeneous genetics and clinical course. Studies have shown that there is a strong correlation between xenograft in vivo engraftment potential and clinical outcome. Furthermore, such studies have conducted gene expression profiling to identify gene signatures associated with engraftment potential, leukemic stem cell (LSC) property, and prognosis. Most of these analyses were limited to protein coding genes. More recently, advances in next generation sequencing (NGS) have created a paradigm shift on our perception of transcription that, contrary to previous consensus, the human genome is largely transcribed and most of these transcripts do not code for proteins (ncRNA). In the current work, we conducted a comprehensive functional genomics study to identify long ncRNA (lncRNA) that significantly predict in vivo engraftment potential and LSC properties of AML and evaluate their functional and prognostic relevance. We first sorted normal granulocyte-macrophage progenitor (GMP), lymphoid-primed multi-potential progenitor (LMPP) and CD34-subpopulations from leukemic bulk of 15 AML patient samples. Sorted samples were then transplanted into the non-obese diabetic/severe combined immunodeficient Gamma (NSG) mouse model. Only the subpopulations showing robust leukemic engraftment (> 1%) were classified as LSC populations. In parallel, we performed total RNA based RNA-seq using Ovation® Single Cell RNA-Seq System (NuGEN Technologies Inc.) and evaluated both protein coding genes and lncRNAs. In line with previous studies, we observed that the engraftment potential primarily resides within the CD34+ subpopulations (GMPs and LMPPs). LMPPs/GMPs from 8 AML patients engrafted (defined as LSC populations), while subpopulations from the other 7 patients failed to engraft (non-LSC fraction). Using our RNA-seq data, we first conducted a global principal component analysis (PCA) based between group analysis (BGA) based on lncRNA expression levels. Interestingly, we noted that the LMPPs and GMPs primarily cluster based on their engraftment potential rather than cell type. In two subsequent independent analyses, we employed weighted correlation network analysis (WGCNA) and top scoring pairs (tsp) to identify lncRNAs that show significant correlation to engraftment percentages and could also categorically classify samples based on their engraftment status. Results of the WGCNA analysis showed that 830 lncRNAs are statistically significantly correlated to engraftment percentages of subpopulations (p < 0.05). In the tsp analysis, by taking all lncRNAs in the dataset in a pairwise fashion, we identified two lncRNAs that could statistically significantly classify samples into 'LSC' (engrafters) and 'non-LSCs' (non-engrafters) (mean classification error ~ 0.19 using bootstrap analysis; statistically significant with p-value < 0.0001 using Monte Carlo Simulation). Of interest here, the two lncRNAs were also in the list of significantly correlated lncRNAs based on the separately conducted WGCNA. Combining the two analyses and using a 3d plot, we looked at the expression levels of two lncRNAs and engraftment percentages of the subpopulations and were able to see that the GMP/LMPP LSCs formed a distinct cluster. The non-LSCs with GMP or LMPP phenotype were distantly clustered from their engrafting counterparts while all the CD34-negative non-LSCs clustered with no overlap to the other groups. Furthermore, a guilt-by-association analysis was carried out to identify potential functional link of the candidate lncRNAs by assessing their correlation to functionally annotated genes. 'JAK-STAT', 'Hematopoietic lineage', and 'Toll-like receptor signaling' were some of the significant pathways (p < 0.05; FDR < 0.25). We also tested some published AML LSC and engraftment associated signatures. The lncRNAs showed negative correlation to the Eppert et al. (2011; Nat. Med.) LSC signature and set of lineage affiliated genes (Goardon et al. 2011; Cancer Cell) including FLT3, NOTCH1, and RUNX1. Taken together, our approach shows that the two lncRNAs we identified sufficiently recapitulate the underlying engraftment potential of AMLs and predict LSC property with significant accuracy. We conclude that these findings highlight the necessity to focus on lncRNAs as key players in clinical and functional studies of AML. Disclosures Buske: Celltrion, Inc.: Consultancy, Honoraria. Mulaw:NuGEN: Honoraria.


2015 ◽  
Vol 2015 ◽  
pp. 1-14 ◽  
Author(s):  
Yun Xiao ◽  
Yanling Lv ◽  
Hongying Zhao ◽  
Yonghui Gong ◽  
Jing Hu ◽  
...  

Long noncoding RNAs (lncRNAs) have been shown to play key roles in various biological processes. However, functions of most lncRNAs are poorly characterized. Here, we represent a framework to predict functions of lncRNAs through construction of a regulatory network between lncRNAs and protein-coding genes. Using RNA-seq data, the transcript profiles of lncRNAs and protein-coding genes are constructed. Using the Bayesian network method, a regulatory network, which implies dependency relations between lncRNAs and protein-coding genes, was built. In combining protein interaction network, highly connected coding genes linked by a given lncRNA were subsequently used to predict functions of the lncRNA through functional enrichment. Application of our method to prostate RNA-seq data showed that 762 lncRNAs in the constructed regulatory network were assigned functions. We found that lncRNAs are involved in diverse biological processes, such as tissue development or embryo development (e.g., nervous system development and mesoderm development). By comparison with functions inferred using the neighboring gene-based method and functions determined using lncRNA knockdown experiments, our method can provide comparable predicted functions of lncRNAs. Overall, our method can be applied to emerging RNA-seq data, which will help researchers identify complex relations between lncRNAs and coding genes and reveal important functions of lncRNAs.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
John W Davey ◽  
Carolina M C Catta-Preta ◽  
Sally James ◽  
Sarah Forrester ◽  
Maria Cristina M Motta ◽  
...  

Abstract Angomonas deanei is an endosymbiont-bearing trypanosomatid with several highly fragmented genome assemblies and unknown chromosome number. We present an assembly of the A. deanei nuclear genome based on Oxford Nanopore sequence that resolves into 29 complete or close-to-complete chromosomes. The assembly has several previously unknown special features; it has a supernumerary chromosome, a chromosome with a 340-kb inversion, and there is a translocation between two chromosomes. We also present an updated annotation of the chromosomal genome with 10,365 protein-coding genes, 59 transfer RNAs, 26 ribosomal RNAs, and 62 noncoding RNAs.


2012 ◽  
Vol 111 (suppl_1) ◽  
Author(s):  
Emma L Robinson ◽  
Syed Haider ◽  
Hillary Hei ◽  
Richard T Lee ◽  
Roger S Foo

Heart failure comprises of clinically distinct inciting causes but a consistent pattern of change in myocardial gene expression supports the hypothesis that unifying biochemical mechanisms underlie disease progression. The recent RNA-seq revolution has enabled whole transcriptome profiling, using deep-sequencing technologies. Up to 70% of the genome is now known to be transcribed into RNA, a significant proportion of which is long non-coding RNAs (lncRNAs), defined as polyribonucleotides of ≥200 nucleotides. This project aims to discover whether the myocardium expression of lncRNAs changes in the failing heart. Paired end RNA-seq from a 300-400bp library of ‘stretched’ mouse myocyte total RNA was carried out to generate 76-mer sequence reads. Mechanically stretching myocytes with equibiaxial stretch apparatus mimics pathological hypertrophy in the heart. Transcripts were assembled and aligned to reference genome mm9 (UCSC), abundance determined and differential expression of novel transcripts and alternative splice variants were compared with that of control (non-stretched) mouse myocytes. Five novel transcripts have been identified in our RNA-seq that are differentially expressed in stretched myocytes compared with non-stretched. These are regions of the genome that are currently unannotated and potentially are transcribed into non-coding RNAs. Roles of known lncRNAs include control of gene expression, either by direct interaction with complementary regions of the genome or association with chromatin remodelling complexes which act on the epigenome.Changes in expression of genes which contribute to the deterioration of the failing heart could be due to the actions of these novel lncRNAs, immediately suggesting a target for new pharmaceuticals. Changes in the expression of these novel transcripts will be validated in a larger sample size of stretched myocytes vs non-stretched myocytes as well as in the hearts of transverse aortic constriction (TAC) mice vs Sham (surgical procedure without the aortic banding). In vivo investigations will then be carried out, using siLNA antisense technology to silence novel lncRNAs in mice.


2019 ◽  
Vol 30 (17) ◽  
pp. 2240-2253 ◽  
Author(s):  
Catherine M. Mageeney ◽  
Vassie C. Ware

The functional significance of ribosome heterogeneity in development and differentiation is relatively unexplored. We present the first in vivo evidence of ribosome heterogeneity playing a role in specific mRNA translation in a multicellular eukaryote. Eukaryotic-specific ribosomal protein paralogues eRpL22 and eRpL22-like are essential in development and required for sperm maturation and fertility in Drosophila. eRpL22 and eRpL22-like roles in spermatogenesis are not completely interchangeable. Flies depleted of eRpL22 and rescued by eRpL22-like overexpression have reduced fertility, confirming that eRpL22-like cannot substitute fully for eRpL22 function, and that paralogues have functionally distinct roles, not yet defined. We investigated the hypothesis that specific RNAs differentially associate with eRpL22 or eRpL22-like ribosomes, thereby establishing distinct ribosomal roles. RNA-seq identified 12,051 transcripts (mRNAs/noncoding RNAs) with 50% being enriched on specific polysome types. Analysis of ∼10% of the most abundant mRNAs suggests ribosome specialization for translating groups of mRNAs expressed at specific stages of spermatogenesis. Further, we show enrichment of “model” eRpL22-like polysome-associated testis mRNAs can occur outside the germline within S2 cells transfected with eRpL22-like, indicating that germline-specific factors are not required for selective translation. This study reveals specialized roles in translation for eRpL22 and eRpL22-like ribosomes in germline differentiation.


2019 ◽  
Vol 9 (1) ◽  
Author(s):  
Mikhail Pomaznoy ◽  
Ashu Sethi ◽  
Jason Greenbaum ◽  
Bjoern Peters

Abstract RNA-seq methods are widely utilized for transcriptomic profiling of biological samples. However, there are known caveats of this technology which can skew the gene expression estimates. Specifically, if the library preparation protocol does not retain RNA strand information then some genes can be erroneously quantitated. Although strand-specific protocols have been established, a significant portion of RNA-seq data is generated in non-strand-specific manner. We used a comprehensive stranded RNA-seq dataset of 15 blood cell types to identify genes for which expression would be erroneously estimated if strand information was not available. We found that about 10% of all genes and 2.5% of protein coding genes have a two-fold or higher difference in estimated expression when strand information of the reads was ignored. We used parameters of read alignments of these genes to construct a machine learning model that can identify which genes in an unstranded dataset might have incorrect expression estimates and which ones do not. We also show that differential expression analysis of genes with biased expression estimates in unstranded read data can be recovered by limiting the reads considered to those which span exonic boundaries. The resulting approach is implemented as a package available at https://github.com/mikpom/uslcount.


2019 ◽  
Vol 7 (10) ◽  
pp. 425 ◽  
Author(s):  
Jiao Jian ◽  
Xu Liang

The pathogenic fungus Fusarium graminearum (F. graminearum), causing Fusarium head blight (FHB) or scab, is one of the most important cereal killers worldwide, exerting great economic and agronomic losses on global grain production. To repress pathogen invasion, plants have evolved a sophisticated innate immunity system for pathogen recognition and defense activation. Simultaneously, pathogens continue to evolve more effective means of invasion to conquer plant resistance systems. In the process of co-evolution of plants and pathogens, several small RNAs (sRNAs) have been proved in regulating plant immune response and plant-microbial interaction. In this study, we report that a F. graminearum sRNA (Fg-sRNA1) can suppress wheat defense response by targeting and silencing a resistance-related gene, which codes a Chitin Elicitor Binding Protein (TaCEBiP). Transcriptional level evidence indicates that Fg-sRNA1 can target TaCEBiP mRNA and trigger silencing of TaCEBiP in vivo, and in Nicotiana benthamiana (N. benthamiana) plants, Western blotting experiments and YFP Fluorescence observation proofs show that Fg-sRNA1 can suppress the accumulation of protein coding by TaCEBiP gene in vitro. F. graminearum PH-1 strain displays a weakening ability to invasion when Barley stripe mosaic virus (BSMV) vector induces effective silencing Fg-sRNA1 in PH-1 infected wheat plants. Taken together, our results suggest that a small RNA from F. graminearum can target and silence the wheat TaCEBiP gene to enhance invasion of F. graminearum.


2002 ◽  
Vol 40 (1) ◽  
pp. 45-51 ◽  
Author(s):  
E. J. Venancio ◽  
C. M. Kyaw ◽  
C. V. Mello ◽  
S. P. Silva ◽  
C. M. A. Soares ◽  
...  

2019 ◽  
Vol 120 (8) ◽  
pp. 13441-13452 ◽  
Author(s):  
Nan Cheng ◽  
Yuanbin Wu ◽  
Huajun Zhang ◽  
Yi Guo ◽  
Huimin Cui ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document