scholarly journals De novo assembly and transcriptome characterization of an endemic species of Vietnam, Panax vietnamensis Ha et Grushv., including the development of EST-SSR markers for population genetics

2020 ◽  
Author(s):  
Duy Dinh Vu ◽  
Syed Noor Muhammad Shah ◽  
Mai Phuong Pham ◽  
Van Thang Bui ◽  
Minh Tam Nguyen ◽  
...  

Abstract Background: Understanding the genetic diversity in threatened species that occur in forest remnants is necessary to establish efficient strategies for the species conservation, restoration and management. Panax vietnamensis Ha et Grushv. is medicinally important, endemic and endangered species of Vietnam. However, genetic diversity and structure of population is unknown due to lack of efficient molecular markers. Results: In this study, we employed Illumina HiSeq TM 4000 sequencing to analyze the transcriptomes of P. vietnamensis (roots, leaves and stems). A total of 23,741,783 raw reads were obtained and assembled, from which, 89,271 unigenes with an average length of 598.3191 nt were generated. During functional annotation, 31,686 unigenes were annotated in Gene Ontology categories, Kyoto Encyclopedia of Genes and Genomes pathways, Swiss-Prot database, and Nucleotide Collection (NR/NT) database. In addition, 11,343 expressed sequence tag-simple sequence repeat (EST-SSRs) were detected. From 7,774 primer pairs, 101 were selected for polymorphism validation, in which, 20 primer pairs were successfully amplified to DNA fragments and significant amounts of polymorphism was observed within population. The nine polymorphic microsatellite loci were used to analyze genetic diversity and structure of the natural populations. The obtained results revealed that the shows high levels of genetic diversity in populations, the average observed and expected heterozygosity were H O = 0.422 and H E = 0.479. During the Bottleneck analysis using TPM and SMM models (p < 0.01) shows that targeted population is significantly heterozygote deficient. This suggests sign of bottleneck in all populations. Genetic differentiation among populations was moderate (F ST = 0.133) and indicating limited gene flow (Nm = 1.63). Analysis of molecular variance (AMOVA) showed 63.17% of variation within individuals and 12.45% among populations. These results showed a moderate genetic structure of P. vietnamensis. STRUCTURE analysis and the unweighted pair-group method with arithmetic means (UPGMA) tree revealed strong genetic structure and two genetic clusters related to geographical distances, as well. Conclusion: Our study will assist conservators in future conservation management, breeding, production and habitats restoration of the species.

2020 ◽  
Author(s):  
Duy Dinh Vu ◽  
Syed Noor Muhammad Shah ◽  
Mai Phuong Pham ◽  
Van Thang Bui ◽  
Minh Tam Nguyen ◽  
...  

Abstract Background: Understanding the genetic diversity in threatened species that occur in forest remnants is necessary to establish efficient strategies for the species conservation, restoration and management. Panax vietnamensis Ha et Grushv. is medicinally important, endemic and endangered species of Vietnam. However, genetic diversity and structure of population is unknown due to lack of efficient molecular markers.Results: In this study, we employed Illumina HiSeq TM 4000 sequencing to analyze the transcriptomes of P. vietnamensis (roots, leaves and stems). A total of 23,741,783 raw reads were obtained and assembled, from which, 89,271 unigenes with an average length of 598.3191 nt were generated. During functional annotation, 31,686 unigenes were annotated in Gene Ontology categories, Kyoto Encyclopedia of Genes and Genomes pathways, Swiss-Prot database, and Nucleotide Collection (NR/NT) database. In addition, 11,343 expressed sequence tag-simple sequence repeat (EST-SSRs) were detected. From 7,774 primer pairs, 101 were selected for polymorphism validation, in which, 20 primer pairs were successfully amplified to DNA fragments and significant amounts of polymorphism was observed within population. The nine polymorphic microsatellite loci were used to analyze genetic diversity and structure of the natural populations. The obtained results revealed that the shows high levels of genetic diversity in populations, the average observed and expected heterozygosity were H O = 0.422 and H E = 0.479. During the Bottleneck analysis using TPM and SMM models (p < 0.01) shows that targeted population is significantly heterozygote deficient. This suggests sign of bottleneck in all populations. Genetic differentiation among populations was moderate (F ST = 0.133) and indicating limited gene flow (Nm = 1.63). Analysis of molecular variance (AMOVA) showed 63.17% of variation within individuals and 12.45% among populations. These results showed a moderate genetic structure of P. vietnamensis. STRUCTURE analysis and the unweighted pair-group method with arithmetic means (UPGMA) tree revealed strong genetic structure and two genetic clusters related to geographical distances, as well. Conclusion: Our study will assist conservators in future conservation management, breeding, production and habitats restoration of the species.


2019 ◽  
Author(s):  
Duy Dinh Vu ◽  
Syed Noor Muhammad Shah ◽  
Mai Phuong Pham ◽  
Van Thang Bui ◽  
Minh Tam Nguyen ◽  
...  

Abstract Background: Understanding the genetic diversity in threatened species that occur in forest remnants is necessary to establish efficient strategies for the species conservation, restoration and management. Panax vietnamensis Ha et Grushv. is medicinally important, endemic and endangered species of Vietnam. However, genetic diversity and structure of population is unknown due to lack of efficient molecular markers. Results: In this study, we employed Illumina HiSeq TM 4000 sequencing to analyze the transcriptomes of P. vietnamensis (roots, leaves and stems). A total of 23,741,783 raw reads were obtained and assembled, from which, 89,271 unigenes with an average length of 598.3191 nt were generated. During functional annotation, 31,686 unigenes were annotated in Gene Ontology categories, Kyoto Encyclopedia of Genes and Genomes pathways, Swiss-Prot database, and Nucleotide Collection (NR/NT) database. In addition, 11,343 expressed sequence tag-simple sequence repeat (EST-SSRs) were detected. From 7,774 primer pairs, 101 were selected for polymorphism validation, in which, 20 primer pairs were successfully amplified to DNA fragments and significant amounts of polymorphism was observed within population. The nine polymorphic microsatellite loci were used to analyze genetic diversity and structure of the natural populations. The obtained results revealed that the shows high levels of genetic diversity in populations, the average observed and expected heterozygosity were H O = 0.422 and H E = 0.479. During the Bottleneck analysis using TPM and SMM models (p < 0.01) shows that targeted population is significantly heterozygote deficient. This suggests sign of bottleneck in all populations. Genetic differentiation among populations was moderate (F ST = 0.133) and indicating limited gene flow (Nm = 1.63). Analysis of molecular variance (AMOVA) showed 63.17% of variation within individuals and 12.45% among populations. These results showed a moderate genetic structure of P. vietnamensis. STRUCTURE analysis and the unweighted pair-group method with arithmetic means (UPGMA) tree revealed strong genetic structure and two genetic clusters related to geographical distances, as well. Conclusion: Our study will assist conservators in future conservation management, breeding, production and habitats restoration of the species. Keywords: Conservation, EST-SSRs; Transcriptome; Panax vietnamensis ; Population genetics


2020 ◽  
Author(s):  
Duy Dinh Vu ◽  
Syed Noor Muhammad Shah ◽  
Mai Phuong Pham ◽  
Van Thang Bui ◽  
Minh Tam Nguyen ◽  
...  

Abstract Background: Understanding the genetic diversity in endangered species that occur in forest remnants is necessary to establish efficient strategies for the species conservation, restoration and management. Panax vietnamensis Ha et Grushv. is medicinally important, endemic and endangered species of Vietnam. However, genetic diversity and structure of population are unknown due to lack of efficient molecular markers. Results: In this study, we employed Illumina HiSeqTM 4000 sequencing to analyze the transcriptomes of P. vietnamensis (roots, leaves and stems). Raw reads total of 23,741,783 was obtained and then assembled, from which the generated unigenes were 89,271 (average length = 598.3191 nt). The 31,686 unigenes were annotated in different databases i.e. Gene Ontology, Kyoto Encyclopedia of Genes and Genomes, Nucleotide Collection (NR/NT) and Swiss-Prot for functional annotation. Further, 11,343 EST-SSRs were detected. From 7,774 primer pairs, 101 were selected for polymorphism validation, in which; 20 primer pairs were successfully amplified to DNA fragments and significant amounts of polymorphism was observed within population. The nine polymorphic microsatellite loci were used for population structure and diversity analyses. The obtained results revealed high levels of genetic diversity in populations, the average observed and expected heterozygosity were HO = 0.422 and HE = 0.479, respectively. During the Bottleneck analysis using TPM and SMM models (p < 0.01) shows that targeted population is significantly heterozygote deficient. This suggests sign of the bottleneck in all populations. Genetic differentiation between populations was moderate (FST = 0.133) and indicating slightly high level of gene flow (Nm = 1.63). Analysis of molecular variance (AMOVA) showed 63.17% of variation within individuals and 12.45% among populations. Our results shows two genetic clusters related to geographical distances. Conclusion: Our study will assist conservators in future conservation management, breeding, production and habitats restoration of the species.


Forests ◽  
2019 ◽  
Vol 10 (11) ◽  
pp. 1019 ◽  
Author(s):  
Zhong ◽  
Yang ◽  
Li ◽  
Zhang ◽  
Liu ◽  
...  

Cinnamomum camphora is a valuable broad-leaf tree indigenous to South China and East Asia and has been widely cultivated and utilized by humans since ancient times. However, owing to its overutilization for essential oil extraction, the Transplanting Big Trees into Cities Program, and over deforestation to make furniture, its wild populations have been detrimentally affected and are declining rapidly. In the present study, the genetic diversity and population structure of 180 trees sampled from 41 populations in South China were investigated with 22 expressed sequence tag-simple sequence repeat (EST-SSR) markers. In total, 61 alleles were harbored across 180 individuals, and medium genetic diversity level was inferred from the observed heterozygosity (Ho), expected heterozygosity (He), and Nei’ gene diversity (GD), which were 0.45, 0.44, and 0.44, respectively. Among the 41 wild populations, C. camphora had an average of 44 alleles, 2.02 effective alleles, and He ranging from 0.30 (SC) to 0.61 (HK). Analysis of molecular variance (AMOVA) showed that 17% of the variation among populations and the average pairwise genetic differentiation coefficient (FST) between populations was 0.162, indicating relatively low genetic population differentiations. Structure analysis suggested two groups for the 180 individuals, which was consistent with the principal coordinate analysis (PCoA) and unweighted pair-group method with arithmetic means (UPGMA). Populations grouped to cluster I were nearly all distributed in Jiangxi Province (except population XS in Zhejiang Province), and cluster II mainly comprised populations from other regions, indicating a significant geographical distribution. Moreover, the Mantel test showed that this geographical distance was significantly correlated with genetic distance. The findings of this research will assist in future C. camphora conservation management and breeding programs.


2018 ◽  
Vol 54 (No. 1) ◽  
pp. 17-25 ◽  
Author(s):  
D.-D. Vu ◽  
T.T.-X. Bui ◽  
T.H.-N. Nguyen ◽  
S.N.M. Shah ◽  
N.-H. Vu ◽  
...  

A total 20 074 230 sequencing reads were generated by Illumina HiSeq<sup>™ </sup>2500 from three different Toxicodendron vernicifluum tissue samples. In total, 48 693 unigenes with an average length of 703.34 bp were obtained by de novo assembly. 3392 potential EST-SSRs (expressed sequence tag-simple sequence repeat) were identified as potential molecular markers from unigenes with lengths exceeding 1 kb. A total of 80 pairs of PCR primers were randomly selected to validate the assembly quality and develop EST-SSR markers from genomic DNA. Of these primer pairs, 14 primer pairs successfully amplified DNA fragments and detected significant amounts of polymorphism within the lacquer tree population in Langao, Shaanxi province, China. There were high genetic diversities (number of alleles per locus (A) = 2.93, polymorphic information content (PIC) = 0.53, observed heterozygosity (Ho) = 0.62 and expected heterozygosity (He) = 0.85) in the lacquer tree natural population. The four loci were significantly deviated from Hardy-Weinberg equilibrium. These results suggested high homozygosity in the population and low or deficiency in heterozygosity (inbreeding coefficient (Fis) = 0.27). These polymorphic EST-SSR markers will provide the base for further studies of genetic structure and breeding in T. vernicifluum.


Genome ◽  
2005 ◽  
Vol 48 (4) ◽  
pp. 731-737 ◽  
Author(s):  
N A Barkley ◽  
M L Newman ◽  
M L Wang ◽  
M W Hotchkiss ◽  
G A Pederson

Polymorphic expressed sequence tag - simple sequence repeat (EST-SSR) markers derived from major cereal crops were used to assess the genetic diversity of the USDA temperate bamboo collection consisting of 92 accessions classified in 11 separate genera and 44 species. A total of 211 bands were detected with a mean number of alleles per locus of 8.440. Phylogenetic relationships were determined by calculating genetic distances between all pairwise combinations and assessing differences in character data. The resulting dendrograms (unweighted pair group method with arithmetic means (UPGMA) and parsimony) clustered the accessions into 2 main clades, which corresponded to accessions characterized morphologically as either clumping (sympodial) or running (monopodial) bamboos. The majority of the accessions clustered according to their current taxonomic classification. These markers were also beneficial in identifying contaminated and (or) misidentified plots. Overall, these transferred markers were informative in differentiating the various bamboo accessions and determining the level of genetic variation within and among species and genera.Key words: bamboo germplasm, genetic diversity, phylogeny.


2020 ◽  
Vol 69 (1) ◽  
pp. 116-122
Author(s):  
Tsam Ju ◽  
Perla Farhat ◽  
Wenjing Tao ◽  
Jibin Miao ◽  
Jialiang Li ◽  
...  

AbstractJuniperus squamata, an endemic conifer of Asia, is an important shrub ecologically and economically. Yet little is known about its genetic diversity and population structure due to lacking of highly polymorphic molecular markers. In this study, expressed sequence tag microsatellite markers (EST-SSR) were developed for Juniperus squamata. Illumina HiSeq data were used to reconstruct the transcriptome of this species by de novo assembly. Based on this transcriptome, 18 SSR markers were designed and successfully amplified. Just one locus was eliminated due to its detection of null alleles and the remaining 17 loci were polymorphic, generating five to 14 alleles per locus in J. squamata. Markers cross-amplification tests were successful in two closely related species of J. squamata. These markers will serve as a basis for further studies to assess the genetic diversity and population structure of J. squamata. As well, they could be useful in promoting sustainable forest management strategies for this species in the face of global climate change.


Forests ◽  
2019 ◽  
Vol 10 (5) ◽  
pp. 411 ◽  
Author(s):  
Yu Ge ◽  
Lin Tan ◽  
Bin Wu ◽  
Tao Wang ◽  
Teng Zhang ◽  
...  

Avocado (Persea americana Mill.) could be considered as an important tropical and subtropical woody oil crop with high economic and nutritional value. Despite the importance of this species, genomic information is currently unavailable for avocado and closely related congeners. In this study, we generated more than 216 million clean reads from different avocado ecotypes using Illumina HiSeq high-throughput sequencing technology. The high-quality reads were assembled into 154,310 unigenes with an average length of 922 bp. A total of 55,558 simple sequence repeat (SSR) loci detected among the 43,270 SSR-containing unigene sequences were used to develop 74,580 expressed sequence tag (EST)-SSR markers. From these markers, a subset of 100 EST-SSR markers was randomly chosen to identify polymorphic EST-SSR markers in 28 avocado accessions. Sixteen EST-SSR markers with moderate to high polymorphism levels were detected, with polymorphism information contents ranging from 0.33 to 0.84 and averaging 0.63. These 16 polymorphic EST-SSRs could clearly and effectively distinguish the 28 avocado accessions. In summary, our study is the first presentation of transcriptome data of different avocado ecotypes and comprehensive study on the development and analysis of a set of EST-SSR markers in avocado. The application of next-generation sequencing techniques for SSR development is a potentially powerful tool for genetic studies.


PeerJ ◽  
2019 ◽  
Vol 7 ◽  
pp. e8038
Author(s):  
Yanli Xiong ◽  
Wenhui Liu ◽  
Yi Xiong ◽  
Qingqing Yu ◽  
Xiao Ma ◽  
...  

Hosting unique and important plant germplasms, the Qinghai-Tibet Plateau (QTP), as the third pole of the world, and Xinjiang, located in the centre of the Eurasian continent, are major distribution areas of perennial Triticeae grasses, especially the widespread Elymus species. Elymus excelsus Turcz. ex Griseb, a perennial forage grass with strong tolerance to environmental stresses, such as drought, cold and soil impoverishment, can be appropriately used for grassland establishment due to its high seed production. To provide basic information for collection, breeding strategies and utilization of E. excelsus germplasm, microsatellite markers (SSR) were employed in the present study to determine the genetic variation and population structure of 25 wild accessions of E. excelsus from Xinjiang (XJC) and the QTP, including Sichuan (SCC) and Gansu (GSC) of western China. Based on the 159 polymorphic bands amplified by 35 primer pairs developed from three related species, the average values of the polymorphic information content (PIC), marker index (MI), resolving power (Rp), Nei’s genetic diversity (H) and Shannon’s diversity index (I) of each pair of primers were 0.289, 1.348, 1.897, 0.301 and 0.459, respectively, validating that these SSR markers can also be used for the evaluation of genetic diversity of E. excelsus germplasms, and demonstrating the superior versatility of EST-SSR vs. G-SSR. We found a relatively moderate differentiation (Fst = 0.151) among the XJC, SCC and GSC geo-groups, and it is worth noting that, the intra-group genetic diversity of the SCC group (He = 0.197) was greater than that of the GSC (He = 0.176) and XJC (He = 0.148) groups. Both the Unweighted Pair Group Method with Arithmetic (UPGMA) clustering and principal coordinates analysis (PCoA) divided the 25 accessions into three groups, whereas the Bayesian STRUCTURE analysis suggested that E. excelsus accessions fell into four main clusters. Besides, this study suggested that geographical distance and environmental variables (annual mean precipitation and average precipitation in growing seasons), especially for QTP accessions, should be combined to explain the population genetic differentiation among the divergent geographical regions. These data provided comprehensive information about these valuable E. excelsus germplasm resources for the protection and collection of germplasms and for breeding strategies in areas of Xinjiang and QTP in western China.


PLoS ONE ◽  
2021 ◽  
Vol 16 (3) ◽  
pp. e0247586
Author(s):  
Christine E. Edwards ◽  
Brooke C. Tessier ◽  
Joel F. Swift ◽  
Burgund Bassüner ◽  
Alexander G. Linan ◽  
...  

Understanding genetic diversity and structure in a rare species is critical for prioritizing both in situ and ex situ conservation efforts. One such rare species is Physaria filiformis (Brassicaceae), a threatened, winter annual plant species. The species has a naturally fragmented distribution, occupying three different soil types spread across four disjunct geographical locations in Missouri and Arkansas. The goals of this study were to understand: (1) whether factors associated with fragmentation and small population size (i.e., inbreeding, genetic drift or genetic bottlenecks) have reduced levels of genetic diversity, (2) how genetic variation is structured and which factors have influenced genetic structure, and (3) how much extant genetic variation of P. filiformis is currently publicly protected and the implications for the development of conservation strategies to protect its genetic diversity. Using 16 microsatellite markers, we genotyped individuals from 20 populations of P. filiformis from across its geographical range and one population of Physaria gracilis for comparison and analyzed genetic diversity and structure. Populations of P. filiformis showed comparable levels of genetic diversity to its congener, except a single population in northwest Arkansas showed evidence of a genetic bottleneck and two populations in the Ouachita Mountains of Arkansas showed lower genetic variation, consistent with genetic drift. Populations showed isolation by distance, indicating that migration is geographically limited, and analyses of genetic structure grouped individuals into seven geographically structured genetic clusters, with geographic location/spatial separation showing a strong influence on genetic structure. At least one population is protected for all genetic clusters except one in north-central Arkansas, which should therefore be prioritized for protection. Populations in the Ouachita Mountains were genetically divergent from the rest of P. filiformis; future morphological analyses are needed to identify whether it merits recognition as a new, extremely rare species.


Sign in / Sign up

Export Citation Format

Share Document