scholarly journals An in silico approach for evaluation of rbcL and matK loci for DNA barcoding of Cucurbitaceae family

2020 ◽  
Vol 21 (8) ◽  
Author(s):  
Viet The Ho ◽  
MINH PHUONG NGUYEN

Abstract. Ho VT, Nguyen MP. 2020. An in silico approach for evaluation of rbcL and matK loci for DNA barcoding of Cucurbitaceae family. Biodiversitas 21: 3879-3885. DNA barcodes have been used intensively to discriminate different species in Cucurbitaceae family. The main of this study is to evaluate the effectiveness of rbcL and matK loci for 16 species of Cucurbitaceae family by using in silico approach. For analysis, sequences were firstly retrieved from NCBI and then calculated for sequence parameters. Sequences were then aligned and constructed phylogenetic try and examined for species resolution ability. The obtained data show the variability of resolving capacity among species. rbcL region is suitable for distinguishing five species namely S. edule, M. cochinchinensis, L. aegyptiaca, C. melo, and C. pepo, whereas matK locus is more proper for different five species consisting of M. balsamina, M. cochinchinensis, M. charantia, S. edule, and C. sativus. The resolving power is improved sharply by analyzing the rbcL + matK combination with up to nine species consisting of C. lanatus, B. hispida, C. melo, C. sativus, C. pepo, C. agryrosperma, L. aegyptiaca, S. edule, and M. cochinchinensis. Therefore, the integration of rbcL and matK loci may improve the competence of assessing genetic relatedness at species level of members in Cucurbitaceae family. The obtained information could be important for choosing proper DNA barcode loci for phylogenetic study of this crop family.

Author(s):  
Takeru Nakazato

DNA barcoding technology has become employed widely for biodiversity and molecular biology researchers to identify species and analyze their phylogeny. Recently, DNA metabarcoding and environmental DNA (eDNA) technology have developed by expanding the concept of DNA barcoding. These techniques analyze the diversity and quantity of organisms within an environment by detecting biogenic DNA in water and soil. It is particularly popular for monitoring fish species living in rivers and lakes (Takahara et al. 2012). BOLD Systems (Barcode of Life Database systems, Ratnasingham and Hebert 2007) is a database for DNA barcoding, archiving 8.5 million of barcodes (as of August 2020) along with the voucher specimen, from which the DNA barcode sequence is derived, including taxonomy, collected country, and museum vouchered as metadata (e.g. https://www.boldsystems.org/index.php/Public_RecordView?processid=TRIBS054-16). Also, many barcoding data are submitted to GenBank (Sayers et al. 2020), which is a database for DNA sequences managed by NCBI (National Center for Biotechnology Information, US). The number of the records of DNA barcodes, i.e. COI (cytochrome c oxidase I) gene for animal, has grown significantly (Porter and Hajibabaei 2018). BOLD imports DNA barcoding data from GenBank, and lots of DNA barcoding data in GenBank are also assigned BOLD IDs. However, we have to refer to both BOLD and GenBank data when performing DNA barcoding. I have previously investigated the registration of DNA barcoding data in GenBank, especially the association with BOLD, using insects and flowering plants as examples (Nakazato 2019). Here, I surveyed the number of species covered by BOLD and GenBank. I used fish data as an example because eDNA research is particularly focused on fish. I downloaded all GenBank files for vertebrates from NCBI FTP (File Transfer Protocol) sites (as of November 2019). Of the GenBank fish entries, 86,958 (7.3%) were assigned BOLD identifiers (IDs). The NCBI taxonomy database has registrations for 39,127 species of fish, and 20,987 scientific names at the species level (i.e., excluding names that included sp., cf. or aff.). GenBank entries with BOLD IDs covered 11,784 species (30.1%) and 8,665 species-level names (41.3%). I also obtained whole "specimens and sequences combined data" for fish from BOLD systems (as of November 2019). In the BOLD, there are 273,426 entries that are registered as fish. Of these entries, 211,589 BOLD entries were assigned GenBank IDs, i.e. with values in “genbank_accession” column, and 121,748 entries were imported from GenBank, i.e. with "Mined from GenBank, NCBI" description in "institution_storing" column. The BOLD data covered 18,952 fish species and 15,063 species-level names, but 35,500 entries were assigned no species-level names and 22,123 entries were not even filled with family-level names. At the species level, 8,067 names co-occurred in GenBank and BOLD, with 6,997 BOLD-specific names and 599 GenBank-specific names. GenBank has 425,732 fish entries with voucher IDs, of which 340,386 were not assigned a BOLD ID. Of these 340,386 entries, 43,872 entries are registrations for COI genes, which could be candidates for DNA barcodes. These candidates include 4,201 species that are not included in BOLD, thus adding these data will enable us to identify 19,863 fish to the species level. For researchers, it would be very useful if both BOLD and GenBank DNA barcoding data could be searched in one place. For this purpose, it is necessary to integrate data from the two databases. A lot of biodiversity data are recorded based on the Darwin Core standard while DNA sequencing data are sometimes integrated or cross-linked by RDF (Resource Description Framework). It may not be technically difficult to integrate these data, but the species data referenced differ from the EoL (The Encyclopedia of Life) for BOLD and the NCBI taxonomy for GenBank, and the differences in taxonomic systems make it difficult to match by scientific name description. GenBank has fields for the latitude and longitude of the specimens sampled, and Porter and Hajibabaei 2018 argue that this information should be enhanced. However, this information may be better described in the specimen and occurrence databases. The integration of barcoding data with the specimen and occurrence data will solve these problems. Most importantly, it will save the researcher from having to register the same information in multiple databases. In the field of biodiversity, only DNA barcode sequences may have been focused on and used as gene sequences. The museomics community regards museum-preserved specimens as rich resources for DNA studies because their biodiversity information can accompany the extraction and analysis of their DNA (Nakazato 2018). GenBank is useful for biodiversity studies due to its low rate of mislabelling (Leray et al. 2019). In the future, we will be working with a variety of DNA, including genomes from museum specimens as well as DNA barcoding. This will require more integrated use of biodiversity information and DNA sequence data. This integration is also of interest to molecular biologists and bioinformaticians.


2020 ◽  
Vol 10 (1) ◽  
Author(s):  
Kannika Thongkhao ◽  
Veerachai Pongkittiphan ◽  
Thatree Phadungcharoen ◽  
Chayapol Tungphatthong ◽  
Santhosh Kumar J. Urumarudappa ◽  
...  

Abstract Cyanthillium cinereum (L.) H.Rob. is one of the most popular herbal smoking cessation aids currently used in Thailand, and its adulteration with Emilia sonchifolia (L.) DC. is often found in the herbal market. Therefore, the quality of the raw material must be considered. This work aimed to integrate macro- and microscopic, chemical and genetic authentication strategies to differentiate C. cinereum raw material from its adulterant. Different morphological features between C. cinereum and E. sonchifolia were simply recognized at the leaf base. For microscopic characteristics, trichome and pappus features were different between the two plants. HPTLC profiles showed a distinct band that could be used to unambiguously differentiate C. cinereum from E. sonchifolia. Four triterpenoid compounds, β-amyrin, taraxasterol, lupeol, and betulin, were identified from the distinct HPTLC band of C. cinereum. The use of core DNA barcode regions; rbcL, matK, ITS and psbA-trnH provided species-level resolution to differentiate the two plants. Taken together, the integration of macroscopic and microscopic characterization, phytochemical analysis by HPTLC and DNA barcoding distinguished C. cinereum from E. sonchifolia. The signatures of C. cinereum obtained here can help manufacturers to increase the quality control of C. cinereum raw material in commercialized smoking cessation products.


2015 ◽  
Vol 15 (4) ◽  
pp. 286-295 ◽  
Author(s):  
Sebastin Raveendar ◽  
Jung-Ro Lee ◽  
Donghwan Shim ◽  
Gi-An Lee ◽  
Young-Ah Jeon ◽  
...  

AbstractThe genus Vicia L., one of the earliest domesticated plant genera, is a member of the legume tribe Fabeae of the subfamily Papilionoideae (Fabaceae). The taxonomic history of this genus is extensive and controversial, which has hindered the development of taxonomic procedures and made it difficult to identify and share these economically important crop resources. Species identification through DNA barcoding is a valuable taxonomic classification tool. In this study, four DNA barcodes (ITS2, matK, rbcL and psbA-trnH) were evaluated on 110 samples that represented 34 taxonomically best-known species in the Vicia genus. Topologies of the phylogenetic trees based on an individual locus were similar. Individual locus-based analyses could not discriminate closely related Vicia species. We proposed a concatenated data approach to increase the resolving power of ITS2. The DNA barcodes matK, psbA-trnH and rbcL were used as an additional tool for phylogenetic analysis. Among the four barcodes, three-barcode combinations that included psbA-trnH with any two of the other barcodes (ITS2, matK or rbcL) provided the best discrimination among Vicia species. Species discrimination was assessed with bootstrap values and considered successful only when all the conspecific individuals formed a single clade. Through sequencing of these barcodes from additional Vicia accessions, 17 of the 34 known Vicia species could be identified with varying levels of confidence. From our analyses, the combined barcoding markers are useful in the early diagnosis of targeted Vicia species and can provide essential baseline data for conservation strategies, as well as guidance in assembling germplasm collections.


2021 ◽  
Vol 9 ◽  
Author(s):  
Parvin Aghayeva ◽  
Salvatore Cozzolino ◽  
Donata Cafasso ◽  
Valida Ali-zade ◽  
Silvia Fineschi ◽  
...  

DNA barcoding has rapidly become a useful complementary tool in floristic investigations particularly for identifying specimens that lack diagnostic characters. Here, we assess the capability of three DNA barcode markers (chloroplast rpoB, accD and nuclear ITS) for correct species assignment in a floristic survey on the Caucasus. We focused on two herbal groups with potential for ornamental applications, namely orchids and asterids. On these two plant groups, we tested whether our selection of barcode markers allows identification of the “barcoding gap” in sequence identity and to distinguish between monophyletic species when employing distance-based methods. All markers successfully amplified most specimens, but we found that the rate of species-level resolution amongst selected markers largely varied in the two plant groups. Overall, for both lineages, plastid markers had a species-level assignment success rate lower than the nuclear ITS marker. The latter confirmed, in orchids, both the existence of a barcoding gap and that all accessions of the same species clustered together in monophyletic groups. Further, it also allowed the detection of a phylogeographic signal.The ITS marker resulted in its being the best performing barcode for asterids; however, none of the three tested markers showed high discriminatory ability. Even if ITS were revealed as the most promising plant barcode marker, we argue that the ability of this barcode for species assignment is strongly dependent on the evolutionary history of the investigated plant lineage.


2022 ◽  
Vol 2022 ◽  
pp. 1-19
Author(s):  
Sarfraz Ahmed ◽  
Muhammad Ibrahim ◽  
Chanin Nantasenamat ◽  
Muhammad Farrukh Nisar ◽  
Aijaz Ahmad Malik ◽  
...  

DNA barcodes are regarded as hereditary succession codes that serve as a recognition marker to address several queries relating to the identification, classification, community ecology, and evolution of certain functional traits in organisms. The mitochondrial cytochrome c oxidase 1 (CO1) gene as a DNA barcode is highly efficient for discriminating vertebrate and invertebrate animal species. Similarly, different specific markers are used for other organisms, including ribulose bisphosphate carboxylase (rbcL), maturase kinase (matK), transfer RNA-H and photosystem II D1-ApbsArabidopsis thaliana (trnH-psbA), and internal transcribed spacer (ITS) for plant species; 16S ribosomal RNA (16S rRNA), elongation factor Tu gene (Tuf gene), and chaperonin for bacterial strains; and nuclear ITS for fungal strains. Nevertheless, the taxon coverage of reference sequences is far from complete for genus or species-level identification. Applying the next-generation sequencing approach to the parallel acquisition of DNA barcode sequences could greatly expand the potential for library preparation or accurate identification in biodiversity research. Overall, this review articulates on the DNA barcoding technology as applied to different organisms, its universality, applicability, and innovative approach to handling DNA-based species identification.


Parasitology ◽  
2020 ◽  
Vol 147 (13) ◽  
pp. 1499-1508
Author(s):  
Susanne Reier ◽  
Helmut Sattmann ◽  
Thomas Schwaha ◽  
Hans-Peter Fuehrer ◽  
Elisabeth Haring

AbstractAcanthocephalans are obligate parasites of vertebrates, mostly of fish. There is limited knowledge about the diversity of fish-parasitizing Acanthocephala in Austria. Seven determined species and an undetermined species are recorded for Austrian waters. Morphological identification of acanthocephalans remains challenging due to their sparse morphological characters and their high intraspecific variations. DNA barcoding is an effective tool for taxonomic assignment at the species level. In this study, we provide new DNA barcoding data for three genera of Acanthocephala (Pomphorhynchus Monticelli, 1905, Echinorhynchus Zoega in Müller, 1776 and Acanthocephalus Koelreuter, 1771) obtained from different fish species in Austria and provide an important contribution to acanthocephalan taxonomy and distribution in Austrian fish. Nevertheless, the taxonomic assignment of one species must remain open. We found indications for cryptic species within Echinorhynchus cinctulus Porta, 1905. Our study underlines the difficulties in processing reliable DNA barcodes and highlights the importance of the establishment of such DNA barcodes to overcome these. To achieve this goal, it is necessary to collect and compare material across Europe allowing a comprehensive revision of the phylum in Europe.


Botany ◽  
2019 ◽  
Vol 97 (9) ◽  
pp. 503-512 ◽  
Author(s):  
Deniz Aygoren Uluer ◽  
Rahma Alshamrani

Aesculus L. is a small genus of horticulturally important trees and shrubs, comprising 13–19 species. Frequent hybridization among species, particularly in cultivation, has contributed to taxonomic confusion and difficulties in the identification of plants. In this study, we evaluated three widely employed plant DNA barcode loci, matK, and the entire ITS region (ITS1+5.8S+ITS2) as well as subunit ITS2 for 50 individuals representing 13 species of Aesculus, excluding only A. wangii (=A. assamica). In contrast to the plastid matK region, both the ITS and ITS2 loci displayed low levels of species discrimination, especially in our “first hit” BLASTn searches. We also presented the phylogeny of Aesculus based on matK and the entire ITS region, with additional matK and ITS sequences from GenBank. Our results show that Aesculus chinensis, A. flava, A. glabra, A. pavia, and A. sylvatica are probably not monophyletic. Furthermore, with the widest taxon coverage until now, the current study highlights the importance of sampling multiple individuals, not only for DNA barcoding, but also for phylogenetic studies.


Genome ◽  
2016 ◽  
Vol 59 (9) ◽  
pp. 641-660 ◽  
Author(s):  
Daniel H. Janzen ◽  
Winnie Hallwachs

The 37-year ongoing inventory of the estimated 15 000 species of Lepidoptera living in the 125 000 terrestrial hectares of Area de Conservacion Guanacaste, northwestern Costa Rica, has DNA barcode documented 11 000+ species, and the simultaneous inventory of at least 6000+ species of wild-caught caterpillars, plus 2700+ species of parasitoids. The inventory began with Victorian methodologies and species-level perceptions, but it was transformed in 2004 by the full application of DNA barcoding for specimen identification and species discovery. This tropical inventory of an extraordinarily species-rich and complex multidimensional trophic web has relied upon the sequencing services provided by the Canadian Centre for DNA Barcoding, and the informatics support from BOLD, the Barcode of Life Data Systems, major tools developed by the Centre for Biodiversity Genomics at the Biodiversity Institute of Ontario, and available to all through couriers and the internet. As biodiversity information flows from these many thousands of undescribed and often look-alike species through their transformations to usable product, we see that DNA barcoding, firmly married to our centuries-old morphology-, ecology-, microgeography-, and behavior-based ways of taxonomizing the wild world, has made possible what was impossible before 2004. We can now work with all the species that we find, as recognizable species-level units of biology. In this essay, we touch on some of the details of the mechanics of actually using DNA barcoding in an inventory.


2016 ◽  
Vol 371 (1702) ◽  
pp. 20160025 ◽  
Author(s):  
Xin Zhou ◽  
Paul B. Frandsen ◽  
Ralph W. Holzenthal ◽  
Clare R. Beet ◽  
Kristi R. Bennett ◽  
...  

DNA barcoding was intended as a means to provide species-level identifications through associating DNA sequences from unknown specimens to those from curated reference specimens. Although barcodes were not designed for phylogenetics, they can be beneficial to the completion of the Tree of Life. The barcode database for Trichoptera is relatively comprehensive, with data from every family, approximately two-thirds of the genera, and one-third of the described species. Most Trichoptera, as with most of life's species, have never been subjected to any formal phylogenetic analysis. Here, we present a phylogeny with over 16 000 unique haplotypes as a working hypothesis that can be updated as our estimates improve. We suggest a strategy of implementing constrained tree searches, which allow larger datasets to dictate the backbone phylogeny, while the barcode data fill out the tips of the tree. We also discuss how this phylogeny could be used to focus taxonomic attention on ambiguous species boundaries and hidden biodiversity. We suggest that systematists continue to differentiate between ‘Barcode Index Numbers’ (BINs) and ‘species’ that have been formally described. Each has utility, but they are not synonyms. We highlight examples of integrative taxonomy, using both barcodes and morphology for species description. This article is part of the themed issue ‘From DNA barcodes to biomes’.


2021 ◽  
Vol 38 ◽  
pp. 00087
Author(s):  
Elena Nikitina ◽  
Abdurashid Rakhmatov

The species level diversity is the reference unit for biodiversity accounting, should be systematized and include full information about the species. Reliable identification of any species is critical for a large-scale biodiversity monitoring and conservation. A DNA barcode is a DNA sequence that identifies a species by comparing the sequence of an unknown species with barcodes of a known species sequence database. Accurate identification of important plants is essential for their conservation, inventory. The species diversity assessing exampled on the subtribe Nepetinae (Lamiaceae) representatives, growing in Uzbekistan is given, using DNA barcoding method. The study was aimed to identify indigenous important plants with the nuclear (ITS) and plastid (matK, rbcL, trnL-F) genomes. This work demonstrates the phylogenetic relationships of some genera within the subtribe Nepetinae Coss. & Germ. (Lamiaceae), based on ITS locus gene. All results indicate that the DNA barcoding tool can be successfully used to reliably identify important plants, to inventory the botanical resources of Uzbekistan and to create a reference library of DNA barcodes. So, the combination of three-four locus gene is a good candidate for this approach.


Sign in / Sign up

Export Citation Format

Share Document