Building a reference transcriptome for Juniperus squamata (Cupressaceae) based on single-molecule real-time sequencing

2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Yufei Wang ◽  
Siyu Xie ◽  
Jialiang Li ◽  
Jieshi Tang ◽  
Tsam Ju ◽  
...  

Abstract Objectives: Cupressaceae is the second largest family of coniferous trees (Coniferopsida) and has important economic and ecological value. However, like other conifers, members of Cupressaceae have extremely large genomes (> 8 gigabases), which has limited research on these taxa. A high-quality transcriptome is an important resource for gene discovery and annotation in non-model organisms. Data description: Juniperus squamata, a tetraploid species widely distributed in Asian mountains, represents Juniperus, the largest genus in Cupressaceae. Single-molecule real-time sequencing was used to obtain the full-length transcriptome of Juniperus squamata. The full-length transcriptome was corrected with Illumina RNA-seq data from the same individual. A total of 47,860 non-redundant full-length transcripts, with an N50 of 2,839, were obtained. A total of 57,393 simple sequence repeats were identified and 268,854 open reading frames were predicted for Juniperus squamata. A BLAST alignment against the non-redundant protein database was conducted, and 10,818 sequences were annotated in the Gene Ontology database. InterPro analysis shows that 30,403 sequences have been functionally characterized against its member databases. These data present the first comprehensive transcriptome characterization of a Juniperus species and provide an important reference for future research on the genomics and evolutionary history of Cupressaceae plants and conifers.

2021 ◽  
Author(s):  
Yufei Wang ◽  
Siyu Xie ◽  
Jialiang Li ◽  
Jieshi Tang ◽  
Tsam Ju ◽  
...  

Abstract Objectives: Cupressaceae is the second largest family of coniferous trees (Coniferopsida) and has important economic and ecological value. However, like other conifers, members of Cupressaceae have extremely large genomes (> 8 gigabases), which has limited research on these taxa. A high-quality transcriptome is an important resource for gene discovery and annotation in non-model organisms. Data description: Juniperus squamata, a tetraploid species widely distributed in Asian mountains, represents Juniperus, the largest genus in Cupressaceae. Single-molecule real-time sequencing was used to obtain the full-length transcriptome of Juniperus squamata. The full-length transcriptome was corrected with Illumina RNA-seq data from the same individual. A total of 47,860 non-redundant full-length transcripts, with an N50 of 2,839, were obtained. Simple sequence repeats for Juniperus squamata were also identified. These data present the first comprehensive transcriptome characterization of a Cupressaceae species and provide an important reference for future research on the genomic evolutionary history of Cupressaceae plants and even conifers.


2021 ◽  
Vol 12 ◽  
Author(s):  
Fiza Liaquat ◽  
Muhammad Farooq Hussain Munis ◽  
Samiah Arif ◽  
Urooj Haroon ◽  
Jianxin Shi ◽  
...  

Schima superba (Theaceae) is a subtropical evergreen tree that is widely used for forest firebreaks and gardening. It tolerates salt and typically accumulates elevated amounts of manganese in its leaves. This species has a large ecological amplitude and grows quickly. Owing to its substantial biomass, it has great potential for soil remediation. To characterize its mRNA comprehensively, we employed PacBio sequencing technology for the first time to generate the S. superba transcriptome. In this analysis, 511,759 full-length non-chimeric reads were acquired, and 163,834 high-quality full-length reads were obtained. Overall, 93,362 open reading frames were obtained, of which 78,255 were complete. In gene annotation analyses, 91,082, 71,839, 38,914, and 38,376 transcripts were assigned to the Kyoto Encyclopedia of Genes and Genomes (KEGG), Clusters of Orthologous Genes (COG), Gene Ontology (GO), and Non-Redundant (Nr) databases, respectively. To identify long non-coding RNAs (lncRNAs), we used four computational methods, protein families (Pfam), Coding Potential Calculator (CPC), Coding-Potential Assessment Tool (CPAT), and Coding-Non-Coding Index (CNCI), and observed 8,551, 9,174, 20,720, and 18,669 lncRNAs, respectively. Moreover, nine genes were randomly selected for expression analysis; Gene 6 (Na_Ca_ex gene) showed the highest expression, and expression of CAX (CAX-interacting protein 4) was higher in the manganese (Mn)-treated group. This work provides a significant number of full-length transcripts and refines the annotation of the reference genome, which will facilitate advanced genetic analyses of S. superba.


2019 ◽  
Author(s):  
Anne Deslattes Mays ◽  
Marcel O. Schmidt ◽  
Garrett T. Graham ◽  
Elizabeth Tseng ◽  
Primo Baybayan ◽  
...  

Abstract Hematopoietic cells are continuously replenished from progenitor cells that reside in the bone marrow. To evaluate molecular changes during this process, we analyzed the transcriptomes of freshly harvested human bone marrow progenitor (lineage-negative) and differentiated (lineage-positive) cells by single-molecule, real-time (SMRT) full-length RNA sequencing. This analysis revealed a ∼5-fold higher number of transcript isoforms than previously detected and showed a distinct composition of individual transcript isoforms characteristic of bone marrow subpopulations. A detailed analysis of mRNA isoforms transcribed from the ANXA1 and EEF1A1 loci confirmed their distinct composition. The expression of proteins predicted from the transcriptome analysis was validated by mass spectrometry, which also confirmed previously unknown protein isoforms predicted, e.g., for EEF1A1. These protein isoforms distinguished the lineage-negative cell population from the lineage-positive cell population. Finally, transcript isoforms expressed from paralogous gene loci (e.g., CFD, GATA2, HLA-A, B & C) also distinguished cell subpopulations but were only detectable by full-length RNA sequencing. Thus, qualitatively distinct transcript isoforms from individual genomic loci separate bone marrow cell subpopulations, indicating complex transcriptional regulation and protein isoform generation during hematopoiesis.


2018 ◽  
Vol 19 (2) ◽  
pp. 136-146 ◽  
Author(s):  
Takahiro Mimori ◽  
Jun Yasuda ◽  
Yoko Kuroki ◽  
Tomoko F. Shibata ◽  
Fumiki Katsuoka ◽  
...  

DNA Research ◽  
2019 ◽  
Vol 26 (4) ◽  
pp. 301-311 ◽  
Author(s):  
Yue Zhang ◽  
Tonny Maraga Nyong'A ◽  
Tao Shi ◽  
Pingfang Yang

Abstract Alternative splicing (AS) plays a critical role in regulating different physiological and developmental processes in eukaryotes by dramatically increasing the diversity of the transcriptome and the proteome. However, the saturation and complexity of AS remain unclear in lotus because full-length multiply spliced isoforms are rarely obtained. In this study, we apply a hybrid assembly strategy that combines single-molecule real-time sequencing and Illumina RNA-seq to gain comprehensive insight into the lotus transcriptomic landscape. We identified 211,802 high-quality full-length non-chimeric reads, with 192,690 non-redundant isoforms, and updated the lotus reference gene model. Moreover, our analysis identified a total of 104,288 AS events from 16,543 genes, with alternative 3ʹ splice-site being the predominant type, followed by intron retention. By exploring tissue datasets, 370 tissue-specific AS events were identified among 12 tissues. Both the tissue-specific genes and isoforms might play important roles in tissue or organ development, and partly fit the ‘ABCE’ model in floral tissues. The large number of AS events and isoform variants identified in our study enhances the understanding of transcriptional diversity in lotus and provides a valuable resource for further functional genomic studies.


PLoS ONE ◽  
2020 ◽  
Vol 15 (9) ◽  
pp. e0238942
Author(s):  
Cuiping Pan ◽  
Yongqing Wang ◽  
Lian Tao ◽  
Hui Zhang ◽  
Qunxian Deng ◽  
...  
