New Putative Long Non-Coding RNAs (lncRNA) Revealed by Pan-Transcriptome of the Emerging Human Pathogenic Fungus Talaromyces marneffei
Abstract
Previous genomic and transcriptomic analyses of Talaromyces marneffei (TM) uncovered relevant pathogenicity-related elements, as well as chromosomal regions potentially involved in the production of non-coding RNAs (ncRNAs), which have rarely been reported in fungi. This manuscript describes a comprehensive pan-transcriptome assembly for TM that identifies a series of previously undetected genetic elements in this emerging pathogenic fungus. Our results confirm that ~58.28% of the 9,480 genes currently annotated in the TM genome are, in fact, transcribed in vivo and that ~23.6% of them may display alternative isoforms. Moreover, we identified 585 transcripts that do not match any gene currently mapped in the genome, represented by 90 coding transcripts and 140 ncRNAs, including 48 long non-coding RNAs (lncRNAs). Overall, we expect that the novel elements described herein may help improve the currently available Talaromyces databases and foster studies aimed at characterizing lncRNA-mediated control of gene expression in fungi.