scholarly journals Next-Generation Sequencing of HIV-1 RNA Genomes: Determination of Error Rates and Minimizing Artificial Recombination

PLoS ONE ◽  
2013 ◽  
Vol 8 (9) ◽  
pp. e74249 ◽  
Author(s):  
Francesca Di Giallonardo ◽  
Osvaldo Zagordi ◽  
Yannick Duport ◽  
Christine Leemann ◽  
Beda Joos ◽  
...  
2012 ◽  
Vol 50 (12) ◽  
pp. 3838-3844 ◽  
Author(s):  
A. Gall ◽  
B. Ferns ◽  
C. Morris ◽  
S. Watson ◽  
M. Cotten ◽  
...  

2016 ◽  
Author(s):  
Peizhou Liao ◽  
Glen A. Satten ◽  
Yi-juan Hu

ABSTRACTA fundamental challenge in analyzing next-generation sequencing data is to determine an individual’s genotype correctly as the accuracy of the inferred genotype is essential to downstream analyses. Some genotype callers, such as GATK and SAMtools, directly calculate the base-calling error rates from phred scores or recalibrated base quality scores. Others, such as SeqEM, estimate error rates from the read data without using any quality scores. It is also a common quality control procedure to filter out reads with low phred scores. However, choosing an appropriate phred score threshold is problematic as a too-high threshold may lose data while a too-low threshold may introduce errors. We propose a new likelihood-based genotype-calling approach that exploits all reads and estimates the per-base error rates by incorporating phred scores through a logistic regression model. The algorithm, which we call PhredEM, uses the Expectation-Maximization (EM) algorithm to obtain consistent estimates of genotype frequencies and logistic regression parameters. We also develop a simple, computationally efficient screening algorithm to identify loci that are estimated to be monomorphic, so that only loci estimated to be non-monomorphic require application of the EM algorithm. We evaluate the performance of PhredEM using both simulated data and real sequencing data from the UK10K project. The results demonstrate that PhredEM is an improved, robust and widely applicable genotype-calling approach for next-generation sequencing studies. The relevant software is freely available.


Author(s):  
Chatzinikolaou Panagiotis ◽  
Makris Christos ◽  
Dimitrios Vlachakis ◽  
Sophia Kossida

In language of genetics and biochemistry, sequencing is the determination of an unbranched biopolymer's primary structure. A sequence is a symbolic linear depiction, result of sequencing. This sequence is a succinct summary of the most of the sequenced molecule's atomic-level structure. (Most known is DNA-sequencing, RNA-sequencing, Protein-sequencing and Next-Generation-sequencing)


2019 ◽  
Vol 121 ◽  
pp. 104207 ◽  
Author(s):  
Enagnon Kazali Alidjinou ◽  
Pauline Coulon ◽  
Christophe Hallaert ◽  
Olivier Robineau ◽  
Agnès Meybeck ◽  
...  

2018 ◽  
Vol 101 ◽  
pp. 63-65 ◽  
Author(s):  
David T. Dunn ◽  
Wolfgang Stöhr ◽  
Alejandro Arenas-Pinto ◽  
Anna Tostevin ◽  
Jean L. Mbisa ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document