scholarly journals Conversion of Genomic DNA to Proxy Constructs Suitable for Accurate Nanopore Sequencing

2015 ◽  
Author(s):  
Dimitra Tsavachidou

AbstractNanopore sequencing at single-base resolution is challenging. There are developing technologies to convert DNA molecules to expanded constructs. Such constructs can be sequenced by nanopores in place of the original DNA molecules. We present a novel method for converting genomic DNA to expanded constructs (“proxies”) with 99.67% accuracy. Our method “reads” each base in each DNA fragment and appends an oligonucleotide to the DNA fragment after each base “reading”. Each appended oligonucleotide represents a specific base type, so that the proxy construct consisting of all the appended oligonucleotides faithfully represents the original DNA sequence. We generated proxies for genomic DNA and confirmed the identities of both the proxies and their corresponding original DNA sequences by performing sequencing using Ion Torrent sequencer.Conversion to proxies had only 0.33% raw error rate. Errors were: 93.96% deletions, 5.29% insertions, and 0.74% substitutions. The longest sequenced proxy was 170 bases, corresponding to a 17-base original DNA sequence. The short length of the detected proxies reflected restrictions imposed by Ion Torrent’s short reads and was not caused by limitations of our method. The consensus sequence built by using proxies alone (average length: 120 bases; corresponding to original sequences with average length 12 bases) covered 55% of the reference genome with 100% accuracy, and outperformed the Ion Torrent sequencing of the corresponding original DNA fragments in terms of accuracy, coverage and number of aligned sequences. Data and other materials can be found at http://www.vastogen.com/data.html. This proof-of-concept experiment demonstrates highly accurate proxy construction at the whole genome level. To our knowledge, this is the first demonstrated construction of expanded versions of DNA at the whole genome level.

Open Biology ◽  
2019 ◽  
Vol 9 (8) ◽  
pp. 190116 ◽  
Author(s):  
Shoko Sato ◽  
Yasuhiro Arimura ◽  
Tomoya Kujirai ◽  
Akihito Harada ◽  
Kazumitsu Maehara ◽  
...  

Tn5 transposase is a bacterial enzyme that integrates a DNA fragment into genomic DNA, and is used as a tool for detecting nucleosome-free regions of genomic DNA in eukaryotes. However, in chromatin, the DNA targeting by Tn5 transposase has remained unclear. In the present study, we reconstituted well-positioned 601 dinucleosomes, in which two nucleosomes are connected with a linker DNA, and studied the DNA integration sites in the dinucleosomes by Tn5 transposase in vitro . We found that Tn5 transposase preferentially targets near the entry–exit DNA regions within the nucleosome. Tn5 transposase minimally cleaved the dinucleosome without a linker DNA, indicating that the linker DNA between two nucleosomes is important for the Tn5 transposase activity. In the presence of a 30 base-pair linker DNA, Tn5 transposase targets the middle of the linker DNA, in addition to the entry–exit sites of the nucleosome. Intriguingly, this Tn5-targeting characteristic is conserved in a dinucleosome substrate with a different DNA sequence from the 601 sequence. Therefore, the Tn5-targeting preference in the nucleosomal templates reported here provides important information for the interpretation of Tn5 transposase-based genomics methods, such as ATAC-seq.


2000 ◽  
Vol 46 (12) ◽  
pp. 1149-1152 ◽  
Author(s):  
Kuo-Chieh Ho ◽  
Gan-Nan Chang

A DNA fragment carrying flagellar genes was cloned from Salmonella choleraesuis. Compared to the corresponding DNA fragment of Salmonella muenchen, this fragment contained three ORFs instead of four shown in S. muenchen. The DNA sequence data showed that there was an insertion of nucleotide C in the ORF of the S. choleraesuis fliU gene, which resulted in the disappearance of a termination codon downstream. The recombinant plasmid pFU11 containing the coding region of the fliU gene made by PCR on S. choleraesuis genomic DNA was constructed and expressed in Escherichia coli in the presence of IPTG. As expected, a 45 kDa protein band was observed on a SDS-PAGE gel, in contrast to two with each having about a half of the molecular weight. These results demonstrated that the DNA sequence encoding one protein (FliU) in S. choleraesuis corresponded to the DNA sequence encoding two proteins (FliU and FliV) in S. muenchen. The protein encoded by this single ORF might carry out the functions of two separated proteins by folding in such a way that its conformation could function like two interdependent protein subunits.Key words: Salmonella choleraesuis, Salmonella muenchen, fliU gene, gene expression.


Genome ◽  
1993 ◽  
Vol 36 (3) ◽  
pp. 427-432 ◽  
Author(s):  
X. Xia ◽  
L. Erickson

A DNA fragment of ~750 bp was cloned from EcoRI-digested nuclear DNA of alfalfa (Medicago sativa). Southern blot and sequence analysis showed that the cloned DNA fragment represents a tetramer of a highly tandemly repeated DNA sequence of 185–188 bp (E180). The consensus sequence deduced from the four repeating units is 189 bp in length with an AT content of 67%. The copy number of the satellite DNA was estimated to be ~1.8 × 105 per genome and constitutes about 1% of the alfalfa genome. Sequence comparison revealed no identity to any repetitive DNA sequences that have been published to date. Digestion with HpaII and MspI indicated a high degree of methylation at the internal C of the restriction site CCGG within E180.Key words: alfalfa, Medicago, repetitive DNA sequence, satellite DNA.


2005 ◽  
Vol 03 (03) ◽  
pp. 677-696 ◽  
Author(s):  
YINHE CAO ◽  
WEN-WEN TUNG ◽  
J. B. GAO ◽  
YAN QI

With the completion of the human and a few model organisms' genomes, and with the genomes of many other organisms waiting to be sequenced, it has become increasingly important to develop faster computational tools which are capable of easily identifying the structures and extracting features from DNA sequences. One of the more important structures in a DNA sequence is repeat-related. Often they have to be masked before protein coding regions along a DNA sequence are to be identified or redundant expressed sequence tags (ESTs) are to be sequenced. Here we report a novel recurrence time-based method for sequence analysis. The method can conveniently study all kinds of periodicity and exhaustively find all repeat-related features from a genomic DNA sequence. An efficient codon index is also derived from the recurrence time statistics, which has the salient features of being largely species-independent and working well on very short sequences. Efficient codon indices are key elements of successful gene finding algorithms, and are particularly useful for determining whether a suspected EST belongs to a coding or non-coding region. We illustrate the power of the method by studying the genomes of E. coli, the yeast S. cervisivae, the nematode worm C. elegans, and the human, Homo sapiens. Our method requires approximately 6 · N byte memory and a computational time of N log N to extract all the repeat-related and periodic or quasi-periodic features from a sequence of length N without any prior knowledge on the consensus sequence of those features, hence enables us to carry out sequence analysis on the whole genomic scale by a PC.


Author(s):  
Amnon Koren ◽  
Dashiell J Massey ◽  
Alexa N Bracci

Abstract Motivation Genomic DNA replicates according to a reproducible spatiotemporal program, with some loci replicating early in S phase while others replicate late. Despite being a central cellular process, DNA replication timing studies have been limited in scale due to technical challenges. Results We present TIGER (Timing Inferred from Genome Replication), a computational approach for extracting DNA replication timing information from whole genome sequence data obtained from proliferating cell samples. The presence of replicating cells in a biological specimen leads to non-uniform representation of genomic DNA that depends on the timing of replication of different genomic loci. Replication dynamics can hence be observed in genome sequence data by analyzing DNA copy number along chromosomes while accounting for other sources of sequence coverage variation. TIGER is applicable to any species with a contiguous genome assembly and rivals the quality of experimental measurements of DNA replication timing. It provides a straightforward approach for measuring replication timing and can readily be applied at scale. Availability and Implementation TIGER is available at https://github.com/TheKorenLab/TIGER. Supplementary information Supplementary data are available at Bioinformatics online


Author(s):  
Hugo López-Fernández ◽  
Cristina P. Vieira ◽  
Pedro Ferreira ◽  
Paula Gouveia ◽  
Florentino Fdez-Riverola ◽  
...  
Keyword(s):  

Genetics ◽  
2001 ◽  
Vol 157 (1) ◽  
pp. 133-140
Author(s):  
Kazumi Inada ◽  
Yoshinori Morimoto ◽  
Toshihide Arima ◽  
Yukio Murata ◽  
Takashi Kamada

Abstract Sexual development in the mushroom Coprinus cinereus is under the control of the A and B mating-type loci, both of which must be different for a compatible, dikaryotic mycelium to form between two parents. The A genes, encoding proteins with homeodomain motifs, regulate conjugate division of the two nuclei from each mating partner and promote the formation of clamp connections. The latter are hyphal configurations required for the maintenance of the nuclear status in the dikaryotic phase of basidiomycetes. The B genes encode pheromones and pheromone receptors. They regulate the cellular fusions that complete clamp connections during growth, as well as the nuclear migration required for dikaryosis. The AmutBmut strain (326) of C. cinereus, in which both A- and B-regulated pathways are constitutively activated by mutations, produces, without mating, dikaryon-like, fertile hyphae with clamp connections. In this study we isolated and characterized clampless1-1 (clp1-1), a mutation that blocks clamp formation, an essential step in A-regulated sexual development, in the AmutBmut background. A genomic DNA fragment that rescues the clp1-1 mutation was identified by transformations. Sequencing of the genomic DNA, together with RACE experiments, identified an ORF interrupted by one intron, encoding a novel protein of 365 amino acids. The clp1-1 mutant allele carries a deletion of four nucleotides, which is predicted to cause elimination of codon 128 and frameshifts thereafter. The clp1 transcript was normally detected only in the presence of the A protein heterodimer formed when homokaryons with compatible A genes were mated. Forced expression of clp1 by promoter replacements induced clamp development without the need for a compatible A gene combination. These results indicate that expression of clp1 is necessary and sufficient for induction of the A-regulated pathway that leads to clamp development.


Viruses ◽  
2021 ◽  
Vol 13 (5) ◽  
pp. 847
Author(s):  
Kyungmin Park ◽  
Seung-Ho Lee ◽  
Jongwoo Kim ◽  
Jingyeong Lee ◽  
Geum-Young Lee ◽  
...  

Whole-genome sequencing of infectious agents enables the identification and characterization of emerging viruses. The MinION device is a portable sequencer that allows real-time sequencing in fields or hospitals. Hantaan orthohantavirus (Hantaan virus, HTNV), harbored by Apodemus agrarius, causes hemorrhagic fever with renal syndrome (HFRS) and poses a critical public health threat worldwide. In this study, we aimed to evaluate the feasibility of using nanopore sequencing for whole-genome sequencing of HTNV from samples having different viral copy numbers. Amplicon-based next-generation sequencing was performed in A. agrarius lung tissues collected from the Republic of Korea. Genomic sequences of HTNV were analyzed based on the viral RNA copy numbers. Amplicon-based nanopore sequencing provided nearly full-length genomic sequences of HTNV and showed sufficient read depth for phylogenetic analysis after 8 h of sequencing. The average identity of the HTNV genome sequences for the nanopore sequencer compared to those of generated from Illumina MiSeq revealed 99.8% (L and M segments) and 99.7% (S segment) identities, respectively. This study highlights the potential of the portable nanopore sequencer for rapid generation of accurate genomic sequences of HTNV for quicker decision making in point-of-care testing of HFRS patients during a hantavirus outbreak.


1990 ◽  
Vol 18 (7) ◽  
pp. 1789-1795 ◽  
Author(s):  
Rajendra P. Kandpal ◽  
David C. Ward ◽  
Sherman M. Weissman

Genome ◽  
1998 ◽  
Vol 41 (2) ◽  
pp. 148-153 ◽  
Author(s):  
Monique Abadon ◽  
Eric Grenier ◽  
Christian Laumond ◽  
Pierre Abad

An AluI satellite DNA family has been cloned from the entomopathogenic nematode Heterorhabditis indicus. This repeated sequence appears to be an unusually abundant satellite DNA, since it constitutes about 45% of the H. indicus genome. The consensus sequence is 174 nucleotides long and has an A + T content of 56%, with the presence of direct and inverted repeat clusters. DNA sequence data reveal that monomers are quite homogeneous. Such homogeneity suggests that some mechanism is acting to maintain the homogeneity of this satellite DNA, despite its abundance, or that this repeated sequence could have appeared recently in the genome of H. indicus. Hybridization analysis of genomic DNAs from different Heterorhabditis species shows that this satellite DNA sequence is specific to the H. indicus genome. Considering the species specificity and the high copy number of this AluI satellite DNA sequence, it could provide a rapid and powerful tool for identifying H. indicus strains.Key words: AluI repeated DNA, tandem repeats, species-specific sequence, nucleotide sequence analysis.


Sign in / Sign up

Export Citation Format

Share Document