Full-length transcriptome sequences of Agropyron cristatum facilitate the prediction of putative genes for thousand-grain weight in a wheat-A. cristatum translocation line
Abstract Background Agropyron cristatum (L.) Gaertn. (2n = 4x = 28; genomes PPPP) is a wild relative of common wheat (Triticum aestivum L.) and provides many desirable genetic resources of wheat improvement. However, there is still a lack of reference genome and transcriptome information for A. cristatum, which severely impedes functional and molecular breeding studies. Results Single-molecule long-read sequencing technology from Pacific Biosciences (PacBio) was used to sequence full-length cDNA from a mixture of leaves, roots, stems and caryopses and constructed the first full-length transcriptome dataset of A. cristatum, which comprised 44,372 transcripts. As expected, the PacBio transcripts were generally longer than the transcripts assembled via the Illumina sequencing platform in a previous study. By mapping the full-length transcripts in this study, we identified tissue-enriched expressed transcripts and enriched GO terms, indicating that tissue-enriched transcripts were enriched for particular molecular functions that varies with tissue. We identified 3,398 novel and 1,352 A. cristatum-specific transcripts compared with the wheat gene model set. In order to better apply this transcriptome of A. cristatum, the A. cristatum transcripts was integrated with wheat genome as a reference sequences to try to identify candidate A. cristatum transcripts associated with thousand-grain weight in a wheat-A. cristatum translocation line Pubing 3035. Conclusions Full-length transcriptome sequences were used in our study. The present study not only provides comprehensive transcriptomic insights and information for A. cristatum but also proposes a new method for exploring the functional genes of wheat relatives under a wheat genetic background. The sequence data have been deposited in the NCBI under BioProject accession number PRJNA534411.