scholarly journals A low-cost platform suitable for sequencing-based recovery of natural variation in understudied plants

BioTechniques ◽  
2020 ◽  
Author(s):  
Rachel Howard-Till ◽  
Claudia E Osorio ◽  
Bradley J Till

Genetic characterization of wild and cultivated plants provides valuable knowledge for conservation and agriculture. DNA sequencing technologies are improving, and costs are dropping. Yet analysis of many species is hindered because they grow in regions that lack infrastructure for advanced molecular biology. The authors developed and adapted low-cost methods that address these issues. Tissue was collected and stored in silica gel, avoiding the need for liquid nitrogen and freezers. The authors optimized low-cost, homemade DNA extraction to increase yields, reduce costs and produce DNA suitable for next-generation sequencing. The authors describe how to build a gel documentation system for DNA quantification. As a proof of principle, the authors used these methods to evaluate wild Berberis darwinii, native to Southern Chile.

2020 ◽  
Author(s):  
Rachel Howard-Till ◽  
Claudia E. Osorio ◽  
Bradley J. Till

AbstractGenetic characterization of wild and cultivated plants provides valuable knowledge for conservation and agriculture. DNA sequencing technologies are improving and costs are dropping. Yet, analysis of many species is hindered because they grow in regions that lack infrastructure for advanced molecular biology. We developed and adapted low-cost methods that address these issues. Tissue is collected and stored in silica-gel, avoiding the need for liquid nitrogen and freezers. We have optimized low-cost home-made DNA extraction to increase yields, reduce costs, and produce DNA suitable for next generation sequencing. We also describe how to build a gel documentation system for DNA quantification. As a proof of principle, we use these methods to evaluate wild Berberis darwinii, native to Southern Chile.Method summaryWe describe a suite of low-cost do-it-yourself methods for field collection of plant tissues, extraction of genomic DNA suitable for next generation sequencing, and home-made agarose gel documentation suitable for DNA quantification. These methods enable the collection and preparation of samples for genomic analysis in regions with limited infrastructure.


2018 ◽  
Vol 6 (13) ◽  
Author(s):  
My V. T. Phan ◽  
Claudia M. E. Schapendonk ◽  
Bas B. Oude Munnink ◽  
Marion P. G. Koopmans ◽  
Rik L. de Swart ◽  
...  

ABSTRACT Genetic characterization of wild-type measles virus (MV) strains is a critical component of measles surveillance and molecular epidemiology. We have obtained complete genome sequences of six MV strains belonging to different genotypes, using random-primed next generation sequencing.


2020 ◽  
Vol 7 ◽  
Author(s):  
Örjan Johansson ◽  
Karin Ullman ◽  
Purevjav Lkhagvajav ◽  
Marc Wiseman ◽  
Jonas Malmsten ◽  
...  

Author(s):  
Oliver Schwengers ◽  
Patrick Barth ◽  
Linda Falgenhauer ◽  
Torsten Hain ◽  
Trinad Chakraborty ◽  
...  

ABSTRACTPlasmids are extrachromosomal genetic elements replicating independently of the chromosome which play a vital role in the environmental adaptation of bacteria. Due to potential mobilization or conjugation capabilities, plasmids are important genetic vehicles for antimicrobial resistance genes and virulence factors with huge and increasing clinical implications. They are therefore subject to large genomic studies within the scientific community worldwide. As a result of rapidly improving next generation sequencing methods, the amount of sequenced bacterial genomes is constantly increasing, in turn raising the need for specialized tools to (i) extract plasmid sequences from draft assemblies, (ii) derive their origin and distribution, and (iii) further investigate their genetic repertoire. Recently, several bioinformatic methods and tools have emerged to tackle this issue; however, a combination of both high sensitivity and specificity in plasmid sequence identification is rarely achieved in a taxon-independent manner. In addition, many software tools are not appropriate for large high-throughput analyses or cannot be included into existing software pipelines due to their technical design or software implementation. In this study, we investigated differences in the replicon distributions of protein-coding genes on a large scale as a new approach to distinguish plasmid-borne from chromosome-borne contigs. We defined and computed statistical discrimination thresholds for a new metric: the replicon distribution score (RDS) which achieved an accuracy of 96.6%. The final performance was further improved by the combination of the RDS metric with heuristics exploiting several plasmid specific higher-level contig characterizations. We implemented this workflow in a new high-throughput taxon-independent bioinformatics software tool called Platon for the recruitment and characterization of plasmid-borne contigs from short-read draft assemblies. Compared to PlasFlow, Platon achieved a higher accuracy (97.5%) and more balanced predictions (F1=82.6%) tested on a broad range of bacterial taxa and better or equal performance against the targeted tools PlasmidFinder and PlaScope on sequenced E. coli isolates. Platon is available at: platon.computational.bioData SummaryPlaton was developed as a Python 3 command line application for Linux.The complete source code and documentation is available on GitHub under a GPL3 license: https://github.com/oschwengers/platon and platon.computational.bio.All database versions are hosted at Zenodo: DOI 10.5281/zenodo.3349651.Platon is available via bioconda package platonPlaton is available via PyPI package cb-platonBacterial representative sequences for UniProt’s UniRef90 protein clusters, complete bacterial genome sequences from the NCBI RefSeq database, complete plasmid sequences from the NCBI genomes plasmid section, created artificial contigs, RDS threshold metrics and raw protein replicon hit counts used to create and evaluate the marker protein sequence database are hosted at Zenodo: DOI 10.5281/zenodo.375916924 Escherichia coli isolates sequenced with short read (Illumina MiSeq) and long read sequencing technologies (Oxford Nanopore Technology GridION platform) used for real data benchmarks are available under the following NCBI BioProjects: PRJNA505407, PRJNA387731Impact StatementPlasmids play a vital role in the spread of antibiotic resistance and pathogenicity genes. The increasing numbers of clinical outbreaks involving resistant pathogens worldwide pushed the scientific community to increase their efforts to comprehensively investigate bacterial genomes. Due to the maturation of next-generation sequencing technologies, nowadays entire bacterial genomes including plasmids are sequenced in huge scale. To analyze draft assemblies, a mandatory first step is to separate plasmid from chromosome contigs. Recently, many bioinformatic tools have emerged to tackle this issue. Unfortunately, several tools are implemented only as interactive or web-based tools disabling them for necessary high-throughput analysis of large data sets. Other tools providing such a high-throughput implementation however often come with certain drawbacks, e.g. providing taxon-specific databases only, not providing actionable, i.e. true binary classification or achieving biased classification performances towards either sensitivity or specificity.Here, we introduce the tool Platon implementing a new replicon distribution-based approach combined with higher-level contig characterizations to address the aforementioned issues. In addition to the plasmid detection within draft assemblies, Platon provides the user with valuable information on certain higher-level contig characterizations. We show that Platon provides a balanced classification performance as well as a scalable implementation for high-throughput analyses. We therefore consider Platon to be a powerful, species-independent and flexible tool to scan large amounts of bacterial whole-genome sequencing data for their plasmid content.


Retrovirology ◽  
2021 ◽  
Vol 18 (1) ◽  
Author(s):  
Imogen A. Wright ◽  
Michael J. Bale ◽  
Wei Shao ◽  
Wei-Shau Hu ◽  
John M. Coffin ◽  
...  

AbstractThe characterisation of the HIV-1 reservoir, which consists of replication-competent integrated proviruses that persist on antiretroviral therapy (ART), is made difficult by the rarity of intact proviruses relative to those that are defective. While the only conclusive test for the replication-competence of HIV-1 proviruses is carried out in cell culture, genetic characterization of genomes by near full-length (NFL) PCR and sequencing can be used to determine whether particular proviruses have insertions, deletions, or substitutions that render them defective. Proviruses that are not excluded by having such defects can be classified as genetically intact and, possibly, replication competent. Identifying and quantifying proviruses that are potentially replication-competent is important for the development of strategies towards a functional cure. However, to date, there are no programs that can be incorporated into deep-sequencing pipelines for the automated characterization and annotation of HIV genomes. Existing programs that perform this work require manual intervention, cannot be widely installed, and do not have easily adjustable settings. Here, we present HIVIntact, a python-based software tool that characterises genomic defects in NFL HIV-1 sequences, allowing putative intact genomes to be identified in-silico. Unlike other applications that assess the genetic intactness of HIV genomes, this tool can be incorporated into existing sequence-analysis pipelines and applied to large next-generation sequencing datasets.


2016 ◽  
Vol 26 (1) ◽  
pp. 105-121
Author(s):  
Tasnim Rahman ◽  
Hasnain Heickal ◽  
Shamira Tabrejee ◽  
Md Miraj Kobad Chowdhury ◽  
Sheikh Muhammad Sarwar ◽  
...  

With the availability of recent next generation sequencing technologies and their low cost, genomes of different organisms are being sequenced frequently. Therefore, quick assembly of genome, transcriptome, and target contigs from the raw data generated through the sequencing technologies has become necessary for better understanding of different biological systems. This article proposes an algorithm, namely SeqDev (Sequence Developer) for constructing contigs from raw reads using reference sequences. For this, we considered a weighted frequency?based consensus mechanism named BlastAssemb for primary construction of a sequence with gaps. Then, we adopted suffix array and proposed a gap filling search (GFS) algorithm for searching the missing sequences in the primary construct. For evaluating our algorithm, we have chosen Pokkali (rice) raw genome and Japonica (rice) as our reference data. Experimental results demonstrated that our proposed algorithm accurately constructs promoter sequences of Pokkali from its raw genome data. These constructed promoter sequences were 93 ? 100% identical with the reference and also aligned with 96 ? 100% of corresponding reference sequences with eValue ranging from 0.0 ? 2e-14. All these results indicated that our proposed method could be a potential algorithm to construct target contigs from raw sequences with the help of reference sequences. Further wet lab validation with specific Pokkali promoter sequence will boost this method as a robust algorithm for target contig assembly.Plant Tissue Cult. & Biotech. 26(1): 105-121, 2016 (June)


2010 ◽  
Vol 28 (1) ◽  
pp. E6 ◽  
Author(s):  
Paul A. Northcott ◽  
James T. Rutka ◽  
Michael D. Taylor

Advances in the field of genomics have recently enabled the unprecedented characterization of the cancer genome, providing novel insight into the molecular mechanisms underlying malignancies in humans. The application of high-resolution microarray platforms to the study of medulloblastoma has revealed new oncogenes and tumor suppressors and has implicated changes in DNA copy number, gene expression, and methylation state in its etiology. Additionally, the integration of medulloblastoma genomics with patient clinical data has confirmed molecular markers of prognostic significance and highlighted the potential utility of molecular disease stratification. The advent of next-generation sequencing technologies promises to greatly transform our understanding of medulloblastoma pathogenesis in the next few years, permitting comprehensive analyses of all aspects of the genome and increasing the likelihood that genomic medicine will become part of the routine diagnosis and treatment of medulloblastoma.


2019 ◽  
Vol 56 (5) ◽  
pp. 515-523 ◽  
Author(s):  
Hamza Dallali ◽  
Serena Pezzilli ◽  
Meriem Hechmi ◽  
Om Kalthoum Sallem ◽  
Sahar Elouej ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document