Accurate profiling of forensic autosomal STRs using the Oxford Nanopore Technologies MinION device

Nanopore Sequencing of a Forensic STR Multiplex Reveals Loci Suitable for Single-Contributor STR Profiling

Genes ◽

10.3390/genes11040381 ◽

2020 ◽

Vol 11 (4) ◽

pp. 381 ◽

Cited By ~ 2

Author(s):

Olivier Tytgat ◽

Yannick Gansemans ◽

Jana Weymaere ◽

Kaat Rubben ◽

Dieter Deforce ◽

...

Keyword(s):

Tandem Repeats ◽

Massively Parallel Sequencing ◽

Flow Cell ◽

Nanopore Sequencing ◽

Flow Cells ◽

Pattern Complexity ◽

Str Loci ◽

Oxford Nanopore ◽

Oxford Nanopore Technologies ◽

Short Tandem

Nanopore sequencing for forensic short tandem repeats (STR) genotyping comes with the advantages associated with massively parallel sequencing (MPS) without the need for a high up-front device cost, but genotyping is inaccurate, partially due to the occurrence of homopolymers in STR loci. The goal of this study was to apply the latest progress in nanopore sequencing by Oxford Nanopore Technologies in the field of STR genotyping. The experiments were performed using the state of the art R9.4 flow cell and the most recent R10 flow cell, which was specifically designed to improve consensus accuracy of homopolymers. Two single-contributor samples and one mixture sample were genotyped using Illumina sequencing, Nanopore R9.4 sequencing, and Nanopore R10 sequencing. The accuracy of genotyping was comparable for both types of flow cells, although the R10 flow cell provided improved data quality for loci characterized by the presence of homopolymers. We identify locus-dependent characteristics hindering accurate STR genotyping, providing insights for the design of a panel of STR loci suited for nanopore sequencing. Repeat number, the number of different reference alleles for the locus, repeat pattern complexity, flanking region complexity, and the presence of homopolymers are identified as unfavorable locus characteristics. For single-contributor samples and for a limited set of the commonly used STR loci, nanopore sequencing could be applied. However, the technology is not mature enough yet for implementation in routine forensic workflows.

Get full-text (via PubEx)

Updated Genome Sequence for the Probiotic Bacterium Bifidobacterium animalis subsp. lactis BB-12

Microbiology Resource Announcements ◽

10.1128/mra.00078-21 ◽

2021 ◽

Vol 10 (27) ◽

Author(s):

Kristian Jensen ◽

Kosai Al-Nakeeb ◽

Anna Koza ◽

Ahmad A. Zeidan

Keyword(s):

Genome Sequence ◽

Probiotic Bacterium ◽

Content Type ◽

Short Read Sequencing ◽

Bifidobacterium Animalis ◽

Oxford Nanopore ◽

Hybrid Genome ◽

Long Read ◽

Sequencing Platforms ◽

Oxford Nanopore Technologies

The genome of Bifidobacterium animalis subsp. lactis BB-12 was sequenced using Oxford Nanopore Technologies long-read and Illumina short-read sequencing platforms. A hybrid genome assembly approach was used to construct an updated complete genome sequence for BB-12 containing 1,944,152 bp, with a G+C content of 60.5% and 1,615 genes.

Get full-text (via PubEx)

Methplotlib: analysis of modified nucleotides from nanopore sequencing

10.1101/826107 ◽

2019 ◽

Author(s):

Wouter De Coster ◽

Mojca Strazisar

Keyword(s):

Gene Expression Regulation ◽

Command Line ◽

Nanopore Sequencing ◽

Modified Nucleotides ◽

Oxford Nanopore ◽

Command Line Tool ◽

Within Subjects ◽

Allele Specific ◽

Sequencing Platforms ◽

Oxford Nanopore Technologies

AbstractSummaryModified nucleotides play a crucial role in gene expression regulation. Here we describe methplotlib, a tool developed for the visualization of modified nucleotides detected from Oxford Nanopore Technologies sequencing platforms, together with additional scripts for statistical analysis of allele specific modification within subjects and differential modification frequency across subjects.Availability and implementationThe methplotlib command-line tool is written in Python3, is compatible with Linux, Mac OS and the MS Windows 10 Subsystem for Linux and released under the MIT license. The source code can be found at https://github.com/wdecoster/methplotlib and can be installed from PyPI and bioconda. Our repository includes test data and the tool is continuously tested at [email protected]

Get full-text (via PubEx)

Robust long-read native DNA sequencing using the ONT CsgG Nanopore system

Wellcome Open Research ◽

10.12688/wellcomeopenres.11246.1 ◽

2017 ◽

Vol 2 ◽

pp. 23 ◽

Cited By ~ 12

Author(s):

Jean-Michel Carter ◽

Shobbir Hussain

Keyword(s):

Dna Sequencing ◽

Cancer Cell Line ◽

Read Length ◽

Nanopore Sequencing ◽

Computational Tools ◽

Practical Applications ◽

Oxford Nanopore ◽

Sequencing Method ◽

Long Read ◽

Oxford Nanopore Technologies

Background: The ability to obtain long read lengths during DNA sequencing has several potentially important practical applications. Especially long read lengths have been reported using the Nanopore sequencing method, currently commercially available from Oxford Nanopore Technologies (ONT). However, early reports have demonstrated only limited levels of combined throughput and sequence accuracy. Recently, ONT released a new CsgG pore sequencing system as well as a 250b/s translocation chemistry with potential for improvements. Methods: We made use of such components on ONTs miniature ‘MinION’ device and sequenced native genomic DNA obtained from the near haploid cancer cell line HAP1. Analysis of our data was performed utilising recently described computational tools tailored for nanopore/long-read sequencing outputs, and here we present our key findings. Results: From a single sequencing run, we obtained ~240,000 high-quality mapped reads, comprising a total of ~2.3 billion bases. A mean read length of 9.6kb and an N50 of ~17kb was achieved, while sequences mapped to reference with a mean identity of 85%. Notably, we obtained ~68X coverage of the mitochondrial genome and were able to achieve a mean consensus identity of 99.8% for sequenced mtDNA reads. Conclusions: With improved sequencing chemistries already released and higher-throughput instruments in the pipeline, this early study suggests that ONT CsgG-based sequencing may be a useful option for potential practical long-read applications.

Get full-text (via PubEx)

Benchmarking of long-read assemblers for prokaryote whole genome sequencing

F1000Research ◽

10.12688/f1000research.21782.1 ◽

2019 ◽

Vol 8 ◽

pp. 2138 ◽

Cited By ~ 15

Author(s):

Ryan R. Wick ◽

Kathryn E. Holt

Keyword(s):

Data Sets ◽

Computationally Efficient ◽

Short Read Sequencing ◽

Oxford Nanopore ◽

Long Read ◽

Sequencing Platforms ◽

Computational Resources ◽

Assembly Algorithms ◽

Oxford Nanopore Technologies ◽

Multiple Assembly

Background: Data sets from long-read sequencing platforms (Oxford Nanopore Technologies and Pacific Biosciences) allow for most prokaryote genomes to be completely assembled – one contig per chromosome or plasmid. However, the high per-read error rate of long-read sequencing necessitates different approaches to assembly than those used for short-read sequencing. Multiple assembly tools (assemblers) exist, which use a variety of algorithms for long-read assembly. Methods: We used 500 simulated read sets and 120 real read sets to assess the performance of six long-read assemblers (Canu, Flye, Miniasm/Minipolish, Raven, Redbean and Shasta) across a wide variety of genomes and read parameters. Assemblies were assessed on their structural accuracy/completeness, sequence identity, contig circularisation and computational resources used. Results: Canu v1.9 produced moderately reliable assemblies but had the longest runtimes of all assemblers tested. Flye v2.6 was more reliable and did particularly well with plasmid assembly. Miniasm/Minipolish v0.3 was the only assembler which consistently produced clean contig circularisation. Raven v0.0.5 was the most reliable for chromosome assembly, though it did not perform well on small plasmids and had circularisation issues. Redbean v2.5 and Shasta v0.3.0 were computationally efficient but more likely to produce incomplete assemblies. Conclusions: Of the assemblers tested, Flye, Miniasm/Minipolish and Raven performed best overall. However, no single tool performed well on all metrics, highlighting the need for continued development on long-read assembly algorithms.

Get full-text (via PubEx)

Benchmarking of long-read assemblers for prokaryote whole genome sequencing

F1000Research ◽

10.12688/f1000research.21782.3 ◽

2020 ◽

Vol 8 ◽

pp. 2138 ◽

Cited By ~ 2

Author(s):

Ryan R. Wick ◽

Kathryn E. Holt

Keyword(s):

Data Sets ◽

Computationally Efficient ◽

Oxford Nanopore ◽

Long Read ◽

Sequencing Platforms ◽

Computational Resources ◽

Assembly Algorithms ◽

Oxford Nanopore Technologies ◽

Sequence Errors ◽

Multiple Assembly

Background: Data sets from long-read sequencing platforms (Oxford Nanopore Technologies and Pacific Biosciences) allow for most prokaryote genomes to be completely assembled – one contig per chromosome or plasmid. However, the high per-read error rate of long-read sequencing necessitates different approaches to assembly than those used for short-read sequencing. Multiple assembly tools (assemblers) exist, which use a variety of algorithms for long-read assembly. Methods: We used 500 simulated read sets and 120 real read sets to assess the performance of eight long-read assemblers (Canu, Flye, Miniasm/Minipolish, NECAT, NextDenovo/NextPolish, Raven, Redbean and Shasta) across a wide variety of genomes and read parameters. Assemblies were assessed on their structural accuracy/completeness, sequence identity, contig circularisation and computational resources used. Results: Canu v2.0 produced reliable assemblies and was good with plasmids, but it performed poorly with circularisation and had the longest runtimes of all assemblers tested. Flye v2.8 was also reliable and made the smallest sequence errors, though it used the most RAM. Miniasm/Minipolish v0.3/v0.1.3 was the most likely to produce clean contig circularisation. NECAT v20200119 was reliable and good at circularisation but tended to make larger sequence errors. NextDenovo/NextPolish v2.3.0/v1.2.4 was reliable with chromosome assembly but bad with plasmid assembly. Raven v1.1.10 was the most reliable for chromosome assembly, though it did not perform well on small plasmids and had circularisation issues. Redbean v2.5 and Shasta v0.5.1 were computationally efficient but more likely to produce incomplete assemblies. Conclusions: Of the assemblers tested, Flye, Miniasm/Minipolish and Raven performed best overall. However, no single tool performed well on all metrics, highlighting the need for continued development on long-read assembly algorithms.

Get full-text (via PubEx)

Benchmarking of long-read assemblers for prokaryote whole genome sequencing

F1000Research ◽

10.12688/f1000research.21782.2 ◽

2020 ◽

Vol 8 ◽

pp. 2138 ◽

Cited By ~ 4

Author(s):

Ryan R. Wick ◽

Kathryn E. Holt

Keyword(s):

Data Sets ◽

Computationally Efficient ◽

Short Read Sequencing ◽

Oxford Nanopore ◽

Long Read ◽

Sequencing Platforms ◽

Computational Resources ◽

Assembly Algorithms ◽

Oxford Nanopore Technologies ◽

Multiple Assembly

Background: Data sets from long-read sequencing platforms (Oxford Nanopore Technologies and Pacific Biosciences) allow for most prokaryote genomes to be completely assembled – one contig per chromosome or plasmid. However, the high per-read error rate of long-read sequencing necessitates different approaches to assembly than those used for short-read sequencing. Multiple assembly tools (assemblers) exist, which use a variety of algorithms for long-read assembly. Methods: We used 500 simulated read sets and 120 real read sets to assess the performance of seven long-read assemblers (Canu, Flye, Miniasm/Minipolish, NECAT, Raven, Redbean and Shasta) across a wide variety of genomes and read parameters. Assemblies were assessed on their structural accuracy/completeness, sequence identity, contig circularisation and computational resources used. Results: Canu v1.9 produced moderately reliable assemblies but had the longest runtimes of all assemblers tested. Flye v2.7 was more reliable and did particularly well with plasmid assembly. Miniasm/Minipolish v0.3 and NECAT v20200119 were the most likely to produce clean contig circularisation. Raven v0.0.8 was the most reliable for chromosome assembly, though it did not perform well on small plasmids and had circularisation issues. Redbean v2.5 and Shasta v0.4.0 were computationally efficient but more likely to produce incomplete assemblies. Conclusions: Of the assemblers tested, Flye, Miniasm/Minipolish and Raven performed best overall. However, no single tool performed well on all metrics, highlighting the need for continued development on long-read assembly algorithms.

Get full-text (via PubEx)

Robust long-read native DNA sequencing using the ONT CsgG Nanopore system

Wellcome Open Research ◽

10.12688/wellcomeopenres.11246.3 ◽

2018 ◽

Vol 2 ◽

pp. 23 ◽

Cited By ~ 5

Author(s):

Jean-Michel Carter ◽

Shobbir Hussain

Keyword(s):

Dna Sequencing ◽

Cancer Cell Line ◽

Read Length ◽

Nanopore Sequencing ◽

Computational Tools ◽

Practical Applications ◽

Oxford Nanopore ◽

Sequencing Method ◽

Long Read ◽

Oxford Nanopore Technologies

Background: The ability to obtain long read lengths during DNA sequencing has several potentially important practical applications. Especially long read lengths have been reported using the Nanopore sequencing method, currently commercially available from Oxford Nanopore Technologies (ONT). However, early reports have demonstrated only limited levels of combined throughput and sequence accuracy. Recently, ONT released a new CsgG pore sequencing system as well as a 250b/s translocation chemistry with potential for improvements. Methods: We made use of such components on ONTs miniature ‘MinION’ device and sequenced native genomic DNA obtained from the near haploid cancer cell line HAP1. Analysis of our data was performed utilising recently described computational tools tailored for nanopore/long-read sequencing outputs, and here we present our key findings. Results: From a single sequencing run, we obtained ~240,000 high-quality mapped reads, comprising a total of ~2.3 billion bases. A mean read length of 9.6kb and an N50 of ~17kb was achieved, while sequences mapped to reference with a mean identity of 85%. Notably, we obtained ~68X coverage of the mitochondrial genome and were able to achieve a mean consensus identity of 99.8% for sequenced mtDNA reads. Conclusions: With improved sequencing chemistries already released and higher-throughput instruments in the pipeline, this early study suggests that ONT CsgG-based sequencing may be a useful option for potential practical long-read applications with relevance to complex genomes.

Get full-text (via PubEx)

Benchmarking of long-read assemblers for prokaryote whole genome sequencing

F1000Research ◽

10.12688/f1000research.21782.4 ◽

2021 ◽

Vol 8 ◽

pp. 2138

Author(s):

Ryan R. Wick ◽

Kathryn E. Holt

Keyword(s):

Data Sets ◽

Computationally Efficient ◽

Oxford Nanopore ◽

Long Read ◽

Sequencing Platforms ◽

Computational Resources ◽

Assembly Algorithms ◽

Oxford Nanopore Technologies ◽

Sequence Errors ◽

Multiple Assembly

Background: Data sets from long-read sequencing platforms (Oxford Nanopore Technologies and Pacific Biosciences) allow for most prokaryote genomes to be completely assembled – one contig per chromosome or plasmid. However, the high per-read error rate of long-read sequencing necessitates different approaches to assembly than those used for short-read sequencing. Multiple assembly tools (assemblers) exist, which use a variety of algorithms for long-read assembly. Methods: We used 500 simulated read sets and 120 real read sets to assess the performance of eight long-read assemblers (Canu, Flye, Miniasm/Minipolish, NECAT, NextDenovo/NextPolish, Raven, Redbean and Shasta) across a wide variety of genomes and read parameters. Assemblies were assessed on their structural accuracy/completeness, sequence identity, contig circularisation and computational resources used. Results: Canu v2.1 produced reliable assemblies and was good with plasmids, but it performed poorly with circularisation and had the longest runtimes of all assemblers tested. Flye v2.8 was also reliable and made the smallest sequence errors, though it used the most RAM. Miniasm/Minipolish v0.3/v0.1.3 was the most likely to produce clean contig circularisation. NECAT v20200803 was reliable and good at circularisation but tended to make larger sequence errors. NextDenovo/NextPolish v2.3.1/v1.3.1 was reliable with chromosome assembly but bad with plasmid assembly. Raven v1.3.0 was reliable for chromosome assembly, though it did not perform well on small plasmids and had circularisation issues. Redbean v2.5 and Shasta v0.7.0 were computationally efficient but more likely to produce incomplete assemblies. Conclusions: Of the assemblers tested, Flye, Miniasm/Minipolish, NextDenovo/NextPolish and Raven performed best overall. However, no single tool performed well on all metrics, highlighting the need for continued development on long-read assembly algorithms.

Get full-text (via PubEx)

Robust long-read native DNA sequencing using the ONT CsgG Nanopore system

Wellcome Open Research ◽

10.12688/wellcomeopenres.11246.2 ◽

2017 ◽

Vol 2 ◽

pp. 23 ◽

Cited By ~ 5

Author(s):

Jean-Michel Carter ◽

Shobbir Hussain

Keyword(s):

Dna Sequencing ◽

Cancer Cell Line ◽

Read Length ◽

Nanopore Sequencing ◽

Computational Tools ◽

Practical Applications ◽

Oxford Nanopore ◽

Sequencing Method ◽

Long Read ◽

Oxford Nanopore Technologies

Background: The ability to obtain long read lengths during DNA sequencing has several potentially important practical applications. Especially long read lengths have been reported using the Nanopore sequencing method, currently commercially available from Oxford Nanopore Technologies (ONT). However, early reports have demonstrated only limited levels of combined throughput and sequence accuracy. Recently, ONT released a new CsgG pore sequencing system as well as a 250b/s translocation chemistry with potential for improvements. Methods: We made use of such components on ONTs miniature ‘MinION’ device and sequenced native genomic DNA obtained from the near haploid cancer cell line HAP1. Analysis of our data was performed utilising recently described computational tools tailored for nanopore/long-read sequencing outputs, and here we present our key findings. Results: From a single sequencing run, we obtained ~240,000 high-quality mapped reads, comprising a total of ~2.3 billion bases. A mean read length of 9.6kb and an N50 of ~17kb was achieved, while sequences mapped to reference with a mean identity of 85%. Notably, we obtained ~68X coverage of the mitochondrial genome and were able to achieve a mean consensus identity of 99.8% for sequenced mtDNA reads. Conclusions: With improved sequencing chemistries already released and higher-throughput instruments in the pipeline, this early study suggests that ONT CsgG-based sequencing may be a useful option for potential practical long-read applications with relevance to complex genomes.

Get full-text (via PubEx)