scholarly journals Single-strand specific nuclease enhances accuracy of error-corrected sequencing and improves rare mutation-detection sensitivity

Author(s):  
Yuki Otsubo ◽  
Shoji Matsumura ◽  
Naohiro Ikeda ◽  
Masayuki Yamane

AbstractError-corrected sequences (ECSs) that utilize double-stranded DNA sequences are useful in detecting mutagen-induced mutations. However, relatively higher frequencies of G:C > T:A (1 × 10−7 bp) and G:C > C:G (2 × 10−7 bp) errors decrease the accuracy of detection of rare G:C mutations (approximately 10−7 bp). Oxidized guanines in single-strand (SS) overhangs generated after shearing could serve as the source of these errors. To remove these errors, we first computationally discarded up to 20 read bases corresponding to the ends of the DNA fragments. Error frequencies decreased proportionately with trimming length; however, the results indicated that they were not sufficiently removed. To efficiently remove SS overhangs, we evaluated three mechanistically distinct SS-specific nucleases (S1 Nuclease, mung bean nuclease, and RecJf exonuclease) and found that they were more efficient than computational trimming. Consequently, we established Jade-Seq™, an ECS protocol with S1 Nuclease treatment, which reduced G:C > T:A and G:C > C:G errors to 0.50 × 10−7 bp and 0.12 × 10−7 bp, respectively. This was probably because S1 Nuclease removed SS regions, such as gaps and nicks, depending on its wide substrate specificity. Subsequently, we evaluated the mutation-detection sensitivity of Jade-Seq™ using DNA samples from TA100 cells exposed to 3-methylcholanthrene and 7,12-dimethylbenz[a]anthracene, which contained the rare G:C > T:A mutation (i.e., 2 × 10−7 bp). Fold changes of G:C > T:A compared to the vehicle control were 1.2- and 1.3-times higher than those of samples without S1 Nuclease treatment, respectively. These findings indicate the potential of Jade-Seq™ for detecting rare mutations and determining the mutagenicity of environmental mutagens.

2019 ◽  
Vol 20 (1) ◽  
pp. 199 ◽  
Author(s):  
Eun Hyun Ahn ◽  
Seung Hyuk Lee

We present a genome-wide comparative and comprehensive analysis of three different sequencing methods (conventional next generation sequencing (NGS), tag-based single strand sequencing (e.g., SSCS), and Duplex Sequencing for investigating mitochondrial mutations in human breast epithelial cells. Duplex Sequencing produces a single strand consensus sequence (SSCS) and a duplex consensus sequence (DCS) analysis, respectively. Our study validates that although high-frequency mutations are detectable by all the three sequencing methods with the similar accuracy and reproducibility, rare (low-frequency) mutations are not accurately detectable by NGS and SSCS. Even with conservative bioinformatical modification to overcome the high error rate of NGS, the NGS frequency of rare mutations is 7.0 × 10−4. The frequency is reduced to 1.3 × 10−4 with SSCS and is further reduced to 1.0 × 10−5 using DCS. Rare mutation context spectra obtained from NGS significantly vary across independent experiments, and it is not possible to identify a dominant mutation context. In contrast, rare mutation context spectra are consistently similar in all independent DCS experiments. We have systematically identified heat-induced artifactual variants and corrected the artifacts using Duplex Sequencing. Specific sequence contexts were analyzed to examine the effects of neighboring bases on the accumulation of heat-induced artifactual variants. All of these artifacts are stochastically occurring rare mutations. C > A/G > T, a signature of oxidative damage, is the most increased (170-fold) heat-induced artifactual mutation type. Our results strongly support the claim that Duplex Sequencing accurately detects low-frequency mutations and identifies and corrects artifactual mutations introduced by heating during DNA preparation.


2018 ◽  
Author(s):  
Eun Hyun Ahn ◽  
Seung Hyuk Lee

AbstractWe present a genome-wide comparative and comprehensive analysis of three different sequencing methods (conventional next generation sequencing (NGS), tag-based single strand sequencing (eg. SSCS), and Duplex Sequencing for investigating mitochondrial mutations in human breast epithelial cells. Duplex Sequencing produces a single strand consensus sequence (SSCS) and a duplex consensus sequence (DCS) analysis, respectively. Our study validates that although high-frequency mutations are detectable by all the three sequencing methods with the similar accuracy and reproducibility, rare (low-frequency) mutations are not accurately detectable by NGS and SSCS. Even with conservative bioinformatical modification to overcome the high error rate of NGS, the NGS frequency of rare mutations is 7.0×10−4. The frequency is reduced to 1.3×10−4 with SSCS and is further reduced to 1.0×10−5 using DCS. Rare mutation context spectra obtained from NGS significantly vary across independent experiments, and it is not possible to identify a dominant mutation context. In contrast, rare mutation context spectra are consistently similar in all independent DCS experiments. We have systematically identified heat-induced artifactual mutations and corrected the artifacts using Duplex Sequencing. All of these artifacts are stochastically occurring rare mutations. C>A/G>T, a signature of oxidative damage, is the most increased (170-fold) heat-induced artifactual mutation type. Our results strongly support the claim that Duplex Sequencing accurately detects low-frequency mutations and identifies and corrects artifactual mutations introduced by heating during DNA preparation.


2000 ◽  
Vol 74 (11) ◽  
pp. 5161-5167 ◽  
Author(s):  
Hiroaki Okamoto ◽  
Masato Ukita ◽  
Tsutomu Nishizawa ◽  
Junichi Kishimoto ◽  
Yuji Hoshi ◽  
...  

ABSTRACT TT virus (TTV) is an unenveloped, circular, and single-stranded DNA virus commonly infecting human beings worldwide. TTV DNAs in paired serum and liver tissues from three viremic individuals were separated by gel electrophoresis and characterized biophysically. TTV DNAs in sera migrated in sizes ranging from 2.0 to 2.5 kb. TTV DNAs in liver tissues, however, migrated at 2.0 to 2.5 kb as well as at 3.5 to 6.1 kb. Both faster- and slower-migrating forms of TTV DNAs in the liver were found to be circular and of the full genomic length of 3.8 kb. TTV DNAs migrating at 2.0 to 2.5 kb, from either serum or liver tissues, were sensitive to S1 nuclease but resistant to restriction endonucleases, and therefore, they were single-stranded. By contrast, TTV DNAs in liver tissues that migrated at 3.5 to 6.1 kb were resistant to S1 nuclease. They migrated at 3.7 to 4.0 kb after digestion with EcoRI, which suggests that they represent circular, double-stranded replicative intermediates of TTV. When TTV DNAs were subjected to strand-specific primer extension and then amplified by PCR with internal primers, those in serum were found to be minus-stranded DNAs while those in liver tissues were found to be a mixture of plus- and minus-stranded DNAs. These results suggest that TTV replicates in the liver via a circular double-stranded DNA.


1996 ◽  
Vol 16 (6) ◽  
pp. 2656-2669 ◽  
Author(s):  
G A Michelotti ◽  
E F Michelotti ◽  
A Pullner ◽  
R C Duncan ◽  
D Eick ◽  
...  

Transcription activation and repression of eukaryotic genes are associated with conformational and topological changes of the DNA and chromatin, altering the spectrum of proteins associated with an active gene. Segments of the human c-myc gene possessing non-B structure in vivo located with enzymatic and chemical probes. Sites hypertensive to cleavage with single-strand-specific S1 nuclease or the single-strand-selective agent potassium permanganate included the major promoters P1 and P2 as well as the far upstream sequence element (FUSE) and CT elements, which bind, respectively, the single-strand-specific factors FUSE-binding protein and heterogeneous nuclear ribonucleoprotein K in vitro. Active and inactive c-myc genes yielded different patterns of S1 nuclease and permanganate sensitivity, indicating alternative chromatin configurations of active and silent genes. The melting of specific cis elements of active c-myc genes in vivo suggested that transcriptionally associated torsional strain might assist strand separation and facilitate factor binding. Therefore, the interaction of FUSE-binding protein and heterogeneous nuclear ribonucleoprotein K with supercoiled DNA was studied. Remarkably, both proteins recognize their respective elements torsionally strained but not as liner duplexes. Single-strand- or supercoil-dependent gene regulatory proteins may directly link alterations in DNA conformation and topology with changes in gene expression.


1987 ◽  
Vol 7 (8) ◽  
pp. 2933-2940
Author(s):  
H Honkawa ◽  
W Masahashi ◽  
S Hashimoto ◽  
T Hashimoto-Gotoh

A number of deletion mutants were isolated, including 5', 3', and internal deletions in the 5'-flanking region of the human cellular oncogene related to the Harvey sarcoma virus (c-H-ras), and their transforming activities were examined in NIH 3T3 cells. DNA sequences which could not be detected without losing transforming activity were localized to a relatively short stretch upstream of the region which showed homology to the 5'-flanking region of v-H-ras oncogene. S1 nuclease analysis indicated that there were two clusters of mRNA start sites at positions that were about 1,371 and 1,298 base pairs upstream of the first coding ATG. The minimum region required for promoter function was estimated to be a 51-base-pair-long (or less) DNA segment. The promoter was GC rich (78%) and did not contain the consensus sequences that are usually observed in PolII-directed promoters but contained a GC box within which one of the mRNA start sites was included. In addition, two sets of positive and negative elements seemed to be located between the promoter and the protein-coding region, which appeared to influence positively and negatively, respectively, the efficiency of transformation with the c-H-ras oncogene.


1987 ◽  
Vol 7 (4) ◽  
pp. 1545-1548
Author(s):  
M R Kelley ◽  
S Kidd ◽  
R L Berg ◽  
M W Young

P elements move about the Drosophila melanogaster genome in a nonrandom fashion, preferring some chromosomal targets for insertion over others (J. C. J. Eeken, F. H. Sobels, V. Hyland, and A. P. Schalet, Mutat. Res. 150:261-275, 1985; W. R. Engels, Annu. Rev. Genet. 17:315-344, 1983; M. D. Golubovsky, Y. N. Ivanov, and M. M. Green, Proc. Natl. Acad. Sci. USA 74:2973-2975, 1977; M. J. Simmons and J. K. Lim, Proc. Natl. Acad. Sci. USA 77:6042-6046, 1980). Some of this specificity may be due to recognition of a particular DNA sequence in the target DNA; derivatives of an 8-base-pair consensus sequence are occupied by these transposable elements at many different chromosomal locations (K. O'Hare and G. M. Rubin, Cell 34:25-36, 1983). An additional level of specificity of P-element insertions is described in this paper. Of 14 mutations induced in the complex locus Notch by hybrid dysgenesis, 13 involved P-element insertions at or near the transcription start site of the gene. This clustering was not seen in other transposable element-induced mutations of Notch. DNA sequences homologous to the previously described consensus target for P-element insertion are not preferentially located in this region of the locus. The choice of a chromosomal site for integration appears to be based on more subtle variations in chromosome structure that are probably associated with activation or expression of the target gene.


Genetics ◽  
1985 ◽  
Vol 111 (2) ◽  
pp. 233-241
Author(s):  
Joachim F Ernst ◽  
D Michael Hampsey ◽  
Fred Sherman

ABSTRACT ICR-170-induced mutations in the CYC1 gene of the yeast Saccharomyces cerevisiae were investigated by genetic and DNA sequence analyses. Genetic analysis of 33 cyc1 mutations induced by ICR-170 and sequence analysis of eight representatives demonstrated that over one-third were frameshift mutations that occurred at one site corresponding to amino acid positions 29-30, whereas the remaining mutations were distributed more-or-less randomly, and a few of these were not frameshift mutations. The sequence results indicate that ICR-170 primarily induces G·C additions at sites containing monotonous runs of three G·C base pairs. However, some (see PDF) sites within the CYC1 gene were not mutated by ICR-170. Thus, ICR-170 is a relatively specific mutagen that preferentially acts on certain sites with monotonous runs of G·C base pairs.


1981 ◽  
Vol 1 (2) ◽  
pp. 136-143
Author(s):  
M Pellegrini ◽  
W E Timberlake ◽  
R B Goldberg

Electron microscopic analysis of reassociated deoxyribonucleic acid (DNA) from the aquatic fungus Achlya bisexualis revealed details of the sequence arrangement of the inverted repeats and both the highly and moderately repetitive sequence clusters. We used the gene 32 protein-ethidium bromide technique for visualizing the DNA molecules, a procedure which provides excellent contrast between single- and double-stranded DNA regions. Long (greater than 6-kilobase) DNA fragments were isolated after reannealing to two different repetitive C0t values, and the renatured structures were then visualized in an electron microscope. Our results showed that the inverted repeat sequences were short (0.5 kilobase, number-average) and separated by nonhomologous DNA of various lengths. These pairs of sequences were not clustered within the genome. Both highly repetitive and moderately repetitive DNA sequences were organized as tandem arrays of precisely paired, regularly repeating units. No permuted clusters of repeating sequences were observed, nor was there evidence of interspersion of repetitive with single-copy DNA sequences in the Achlya genome.


Sign in / Sign up

Export Citation Format

Share Document