transcriptome sequences
Recently Published Documents



2021 ◽  
Author(s):  
Yvain Desplat ◽  
Jacob F Warner ◽  
Jose V Lopez

Abstract Marine sponge transcriptomes are underrepresented in current databases, and only two sponge genomes are available for comparative studies. Here we present the assembled and annotated holo-transcriptome of the common Florida reef sponge Cinachyrella alloclada. After Illumina high-throughput sequencing, the data assembled with Trinity v2.5 confirmed a highly symbiotic organism with the complexity typical of high microbial abundance (HMA) sponges. The dataset is enriched in poly-A-selected eukaryotic transcripts rather than microbial transcripts. Overall, 39,813 transcripts with verified sponge sequence homology coded for 8,496 unique proteins. The average sequence length was 946 bp, with an N50 of 1290 bp. The GC content of the sponge assembly was 51.04%, which is within the range of GC content seen in eukaryotic transcriptomes. BUSCO analysis indicated a completeness of 60.3% and 60.1% based on the Eukaryota and Metazoa databases, respectively. This study supports the overarching goal of developing Cinachyrella alloclada as a useful new experimental model organism.
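The N50 and GC content quoted above are standard assembly summary statistics. The following is a minimal sketch of how they are commonly computed, not the authors' pipeline; the function names `n50` and `gc_percent` and the toy sequences are hypothetical and chosen only for illustration.

```python
def n50(lengths):
    """Return the N50: the length L such that sequences of length >= L
    together cover at least half of the total assembled bases."""
    ordered = sorted(lengths, reverse=True)
    half = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half:
            return length
    return 0  # empty input


def gc_percent(sequences):
    """Return GC content as a percentage of all A/C/G/T bases."""
    gc = at = 0
    for seq in sequences:
        s = seq.upper()
        gc += s.count("G") + s.count("C")
        at += s.count("A") + s.count("T")
    total = gc + at
    return 100.0 * gc / total if total else 0.0


# Toy transcripts, made up for illustration (not data from the study).
toy = ["ATGC" * 5, "GGCC" * 3, "ATAT" * 2, "GCGCAT"]
print(n50([len(s) for s in toy]), round(gc_percent(toy), 2))
```

Because N50 weights sequences by length, it is usually larger than the mean length, which is consistent with the 946 bp average and 1290 bp N50 reported in the abstract.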


2021 ◽  
Vol 12 ◽  
Author(s):  
Hanjing Liu ◽  
Yuli Zhang ◽  
Zhen Wang ◽  
Yingjuan Su ◽  
Ting Wang

Cephalotaxus oliveri is a conifer endemic to China with medicinal and ornamental value. However, the limited molecular markers and genetic information available are insufficient for further genetic studies of this species. In this study, we developed and characterized EST-SSRs from transcriptome sequences for the first time. A total of 5089 SSRs were identified from 36446 unigenes, with a density of one SSR per 11.1 kb. Excluding mononucleotide repeats, trinucleotide repeats were the most common type, followed by dinucleotide repeats. AAG/CTT and AT/AT were the most frequent motifs among the trinucleotide and dinucleotide repeats, respectively. Of the identified SSRs, 671, 1125, and 1958 were located in the CDS, 3′UTR, and 5′UTR, respectively. Functional annotation showed that the SSR-containing unigenes were involved in growth and development with various biological functions. Among the successfully designed primer pairs, 238 were randomly selected for amplification and validation of EST-SSR markers, and 47 were identified as polymorphic. Finally, 28 highly polymorphic primers were used for genetic analysis and revealed a moderate level of genetic diversity. Seven natural C. oliveri sampling sites were divided into two genetic groups. Furthermore, the 28 EST-SSRs showed transferability rates of 96.43, 71.43, and 78.57% in Cephalotaxus fortunei, Amentotaxus argotaenia, and Pseudotaxus chienii, respectively. The markers developed in this study lay the foundation for further genetic and adaptive evolution studies in C. oliveri and related species.
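To make the idea of SSR identification concrete, here is a small sketch that scans a sequence for perfect di- to hexanucleotide repeats with a regular expression. It is an illustration under stated assumptions, not the method used in the study: the minimum repeat thresholds in `MIN_REPEATS`, the helper names, and the example sequence are all hypothetical.

```python
import re

# Minimum repeat counts per motif length. These are illustrative
# thresholds only, not the criteria used in the study.
MIN_REPEATS = {2: 6, 3: 5, 4: 4, 5: 4, 6: 4}


def is_primitive(motif):
    """True if the motif is not itself a repeat of a shorter unit
    (e.g. 'AT' is primitive, 'ATAT' and 'AA' are not)."""
    return motif not in (motif + motif)[1:-1]


def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return sorted (start, motif, repeat_count) tuples for perfect SSRs."""
    seq = seq.upper()
    hits = []
    for k, n in sorted(min_repeats.items()):
        # A k-mer followed by at least n-1 further copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, n - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            if is_primitive(motif):
                hits.append((m.start(), motif, len(m.group(0)) // k))
    return sorted(hits)


# Made-up sequence containing one trinucleotide and one dinucleotide SSR.
example = "CC" + "AAG" * 6 + "CCGG" + "AT" * 7 + "C"
print(find_ssrs(example))
```

The primitive-motif check keeps a run such as ATATATAT from being reported again as a tetranucleotide repeat, and it also skips mononucleotide runs, in line with the abstract's exclusion of mononucleotide repeats from the ranking.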


Pathogens ◽  
2020 ◽  
Vol 9 (4) ◽  
pp. 281 ◽  
Author(s):  
Hina Durrani ◽  
Marshall Hampton ◽  
Jon N. Rumbley ◽  
Sara L. Zimmer

In kinetoplastids, the first seven steps of glycolysis are compartmentalized into a glycosome along with parts of other metabolic pathways. This organelle shares a common ancestor with the better-understood eukaryotic peroxisome. Much of our understanding of the emergence, evolution, and maintenance of glycosomes, including the enzymatic contents of the organelle, is limited to explorations of the dixenous parasites. Our objective was to determine the extent to which we could leverage existing studies in model kinetoplastids to determine the composition of glycosomes in species lacking evidence of experimental localization. These include diverse monoxenous species and dixenous species with very different hosts. For many of these, genome or transcriptome sequences are available. Our approach began with a meta-analysis of existing studies to generate a subset of enzymes with the strongest evidence of glycosome localization. From this dataset we extracted the best possible glycosome signal peptide identification scheme for in silico identification of glycosomal proteins from any kinetoplastid species. Validation suggested that a high glycosome localization score from our algorithm would be indicative of a glycosomal protein. We found that while metabolic pathways were consistently represented across kinetoplastids, individual proteins within those pathways may not universally exhibit evidence of glycosome localization.


BMC Genomics ◽  
2019 ◽  
Vol 20 (1) ◽  
Author(s):  
Shenghui Zhou ◽  
Jinpeng Zhang ◽  
Haiming Han ◽  
Jing Zhang ◽  
Huihui Ma ◽  
...  

Abstract Background: Agropyron cristatum (L.) Gaertn. (2n = 4x = 28; genomes PPPP) is a wild relative of common wheat (Triticum aestivum L.) and provides many desirable genetic resources for wheat improvement. However, there is still a lack of reference genome and transcriptome information for A. cristatum, which severely impedes functional and molecular breeding studies. Results: Single-molecule long-read sequencing technology from Pacific Biosciences (PacBio) was used to sequence full-length cDNA from a mixture of leaves, roots, stems and caryopses and to construct the first full-length transcriptome dataset of A. cristatum, which comprised 44,372 transcripts. As expected, the PacBio transcripts were generally longer and more complete than the transcripts assembled via the Illumina sequencing platform in previous studies. By analyzing RNA-Seq data, we identified tissue-enriched transcripts and assessed their GO term enrichment; the results indicated that tissue-enriched transcripts were enriched for particular molecular functions that varied by tissue. We identified 3398 novel and 1352 A. cristatum-specific transcripts compared with the wheat gene model set. To better apply this A. cristatum transcriptome, the A. cristatum transcripts were integrated with the wheat genome as a reference sequence to try to identify candidate A. cristatum transcripts associated with thousand-grain weight in a wheat-A. cristatum translocation line, Pubing 3035. Conclusions: Full-length transcriptome sequences were used in our study. The present study not only provides comprehensive transcriptomic insights and information for A. cristatum but also proposes a new method for exploring the functional genes of wheat relatives under a wheat genetic background. The sequence data have been deposited in the NCBI under BioProject accession number PRJNA534411.


2019 ◽  
Vol 12 (1) ◽  
Author(s):  
Arseny Dubin ◽  
Tor Erik Jørgensen ◽  
Lars Martin Jakt ◽  
Steinar Daae Johansen

Abstract Objective: Analyze key features of the anglerfish Lophius piscatorius mitochondrial transcriptome based on high-throughput total RNA sequencing. Results: We determined the complete mitochondrial DNA and corresponding transcriptome sequences of L. piscatorius. Key features include highly abundant mitochondrial ribosomal RNAs (10–100 times that of mRNAs), and that cytochrome oxidase mRNAs appeared > 5 times more abundant than both NADH dehydrogenase and ATPase mRNAs. Unusual for a vertebrate mitochondrial mRNA, the polyadenylated COI mRNA was found to harbor a 75 nucleotide 3′ untranslated region. The mitochondrial genome expressed several non-canonical genes, including the long noncoding RNAs lncCR-H, lncCR-L and lncCOI. Whereas lncCR-H and lncCR-L mapped to opposite strands in a non-overlapping organization within the control region, lncCOI appeared novel among vertebrates. We found lncCOI to be a highly abundant mitochondrial RNA in antisense to the COI mRNA. Finally, we present the coding potential of a humanin-like peptide within the large subunit ribosomal RNA.


2019 ◽  
Author(s):  
Shenghui Zhou (Former Corresponding Author) ◽  
Jinpeng Zhang ◽  
Haiming Han ◽  
Jing Zhang ◽  
Ma Huihui ◽  
...  

Abstract Agropyron cristatum (L.) Gaertn. (2n = 4x = 28; genomes PPPP) is a wild relative of common wheat (Triticum aestivum L.) and provides many desirable genetic resources for wheat improvement. However, there is still a lack of reference genome and transcriptome information for A. cristatum, which severely impedes functional and molecular breeding studies. Results: Single-molecule long-read sequencing technology from Pacific Biosciences (PacBio) was used to sequence full-length cDNA from a mixture of leaves, roots, stems and caryopses and to construct the first full-length transcriptome dataset of A. cristatum, which comprised 44,372 transcripts. As expected, the PacBio transcripts were generally longer and more complete than the transcripts assembled via the Illumina sequencing platform in previous studies. By analyzing RNA-Seq data, we identified tissue-enriched transcripts and assessed their GO term enrichment; the results indicated that tissue-enriched transcripts were enriched for particular molecular functions that varied by tissue. We identified 3,398 novel and 1,352 A. cristatum-specific transcripts compared with the wheat gene model set. To better apply this A. cristatum transcriptome, the A. cristatum transcripts were integrated with the wheat genome as a reference sequence to try to identify candidate A. cristatum transcripts associated with thousand-grain weight in a wheat-A. cristatum translocation line, Pubing 3035. Conclusions: Full-length transcriptome sequences were used in our study. The present study not only provides comprehensive transcriptomic insights and information for A. cristatum but also proposes a new method for exploring the functional genes of wheat relatives under a wheat genetic background. The sequence data have been deposited in the NCBI under BioProject accession number PRJNA534411.

