A DNA G-quadruplex/i-motif hybrid

10.1101/737296 ◽

2019 ◽

Author(s):

Betty Chu ◽

Daoning Zhang ◽

Paul J. Paukstelis

Keyword(s):

Tandem Repeats ◽

Double Helix ◽

Structural Diversity ◽

Hybrid Structure ◽

Structural Motif ◽

Base Pairs ◽

Independent Sequence ◽

Solution Studies ◽

G Quadruplex

AbstractDNA can form many structures beyond the canonical Watson-Crick double helix. It is now clear that noncanonical structures are present in genomic DNA and have biological functions. G-rich G-quadruplexes and C-rich i-motifs are the most well-characterized noncanonical DNA motifs that have been detected in vivo with either proscribed or postulated biological roles. Because of their independent sequence requirements, these structures have largely been considered distinct types of quadruplexes. Here, we describe the crystal structure of the DNA oligonucleotide, d(CCAGGCTGCAA), that self-associates to form a quadruplex structure containing two central antiparallel G-tetrads and six i-motif C-C+ base pairs. Solution studies suggest a robust structural motif capable of assembling as a tetramer of individual strands or as a dimer when composed of tandem repeats. This hybrid structure highlights the growing structural diversity of DNA and suggests that biological systems may harbor many functionally important non-duplex structures.

Download Full-text

Human homologs of TU transposon sequences: polypurine/polypyrimidine sequence elements that can alter DNA conformation in vitro and in vivo.

Molecular and Cellular Biology ◽

10.1128/mcb.6.11.3632 ◽

1986 ◽

Vol 6 (11) ◽

pp. 3632-3642 ◽

Cited By ~ 40

Author(s):

B Hoffman-Liebermann ◽

D Liebermann ◽

A Troutt ◽

L H Kedes ◽

S N Cohen

Keyword(s):

Tandem Repeats ◽

Base Pairs ◽

S1 Nuclease ◽

Human Dna ◽

Outer Domain ◽

Sequence Elements ◽

Dna Strands ◽

Species Specific

We previously have shown that homologs of the outer domain segment of the inverted repeat termini (IVR-OD) of the sea urchin TU transposons are conserved among multiple eucaryotic species, including humans. We report here that two cloned human DNA IVR-OD homologs, Hut2 and Hut17, consist of a series of tandem repeats of the trimer AGG/TCC, forming segments (313 and 221 base pairs in length, respectively) of polypurine/polypyrimidine (pPu/pPy or "Puppy") asymmetry in the two DNA strands; these are punctuated at certain sites with variant trimers, which are different for the two clones. Sequences homologous to the Hut2 pPu/pPy tract exist at multiple sites in the DNA of a wide variety of eucaryotes. Hybridization of human DNA with a Hut2 probe or with a previously described chicken DNA pPu/pPy sequence indicates that pPu/pPy sequences can be grouped into families distinguishable by the extent of their homology with each probe at different hybridization stringencies. Moreover, particular pPu/pPy tracts show species-specific differences in their distribution. Both the Hut2 and Hut17 pPu/pPy tracts are cleaved by S1 nuclease when tested on supercoiled plasmids. Most if not all of the 313-base-pair Hut2 pPu/pPy tract is also sensitive to S1 in its native location in HeLa cell chromatin, indicating that the sequence contains conformational information that can be expressed in vivo. This view is supported by evidence that exogenously derived Hut2 pPu/pPy tracts introduced into mouse L cells and integrated in chromatin can assume an S1-sensitive conformation.

Download Full-text

DeepG4: A deep learning approach to predict cell-type specific active G-quadruplex regions

PLoS Computational Biology ◽

10.1371/journal.pcbi.1009308 ◽

2021 ◽

Vol 17 (8) ◽

pp. e1009308

Author(s):

Vincent Rocher ◽

Matthieu Genais ◽

Elissar Nassereddine ◽

Raphael Mourad

Keyword(s):

Deep Learning ◽

Double Helix ◽

Complex Molecule ◽

Cell Types ◽

Sequence Motif ◽

Cell Type ◽

G Quadruplex ◽

Cell Type Specific

DNA is a complex molecule carrying the instructions an organism needs to develop, live and reproduce. In 1953, Watson and Crick discovered that DNA is composed of two chains forming a double-helix. Later on, other structures of DNA were discovered and shown to play important roles in the cell, in particular G-quadruplex (G4). Following genome sequencing, several bioinformatic algorithms were developed to map G4s in vitro based on a canonical sequence motif, G-richness and G-skewness or alternatively sequence features including k-mers, and more recently machine/deep learning. Recently, new sequencing techniques were developed to map G4s in vitro (G4-seq) and G4s in vivo (G4 ChIP-seq) at few hundred base resolution. Here, we propose a novel convolutional neural network (DeepG4) to map cell-type specific active G4 regions (e.g. regions within which G4s form both in vitro and in vivo). DeepG4 is very accurate to predict active G4 regions in different cell types. Moreover, DeepG4 identifies key DNA motifs that are predictive of G4 region activity. We found that such motifs do not follow a very flexible sequence pattern as current algorithms seek for. Instead, active G4 regions are determined by numerous specific motifs. Moreover, among those motifs, we identified known transcription factors (TFs) which could play important roles in G4 activity by contributing either directly to G4 structures themselves or indirectly by participating in G4 formation in the vicinity. In addition, we used DeepG4 to predict active G4 regions in a large number of tissues and cancers, thereby providing a comprehensive resource for researchers. Availability: https://github.com/morphos30/DeepG4.

Download Full-text

DeepG4 : A deep learning approach to predict active G-quadruplexes from DNA

10.1101/2020.07.22.215699 ◽

2020 ◽

Author(s):

Vincent Rocher ◽

Matthieu Genais ◽

Elissar Nassereddine ◽

Raphael Mourad

Keyword(s):

Deep Learning ◽

Double Helix ◽

Complex Molecule ◽

Genome Project ◽

Sequence Motif ◽

Cell Type ◽

G Quadruplex ◽

The Impact

AbstractDNA is a complex molecule carrying the instructions an organism needs to develop, live and reproduce. In 1953, Watson and Crick discovered that DNA is composed of two chains forming a double-helix. Later on, other structures of DNA were discovered and shown to play important roles in the cell, in particular G-quadruplex (G4). Following genome sequencing, several bioinformatic algorithms were developed to map G4s in vitro based on a canonical sequence motif, G-richness and G-skewness or alternatively sequence features including k-mers, and more recently machine/deep learning. Here, we propose a novel convolutional neural network (DeepG4) to map active G4s (forming both in vitro and in vivo). DeepG4 is very accurate to predict active G4s, while most state-of-the-art algorithms fail. Moreover, DeepG4 identifies key DNA motifs that are predictive of G4 activity. We found that active G4 motifs do not follow a very flexible sequence pattern as current algorithms seek for. Instead, active G4s are determined by numerous specific motifs. Moreover, among those motifs, we identified known transcription factors (TFs) which could play important roles in G4 activity by contributing either directly to G4 structures themselves or indirectly by participating in G4 formation in the vicinity. Moreover, we showed that specific TFs might explain G4 activity depending on cell type. Lastly, variant analysis suggests that SNPs altering predicted G4 activity could affect transcription and chromatin, e.g. gene expression, H3K4me3 mark and DNA methylation. Thus, DeepG4 paves the way for future studies assessing the impact of known disease-associated variants on DNA secondary structure by providing a mechanistic interpretation of SNP impact on transcription and chromatin.Availability: https://github.com/morphos30/DeepG4.Author summaryDNA is a molecule carrying genetic information and found in all living cells. In 1953, Watson and Crick found that DNA has a double helix structure. However, other DNA structures were later identified, and most notably, G-quadruplex (G4). In 2000, the Human Genome Project revealed the widespread presence of G4s in the genome using algorithms. To date, all G4 mapping algorithms were developed to map G4s on naked DNA, without knowing if they could be formed in the cell. Here, we designed a novel artificial intelligence algorithm that could map G4s active in the cell from the DNA sequence. We showed its better accuracy compared to existing algorithms. Moreover, we identified key transcriptional factor motifs that could explain G4 activity depending on cell type. Lastly, we demonstrated the existence of mutations that could alter G4 activity and therefore impact molecular processes, such as transcription, in the cell. Such results could provide a novel mechanistic interpretation of known disease-associated mutations.

Download Full-text

Drug—nucleic acid interaction: X-ray crystallographic determination of an ethidium—dinucleoside monophosphate crystalline complex, ethidium: 5-iodouridylyl(3'-5')adenosine

Philosophical Transactions of the Royal Society of London Series B Biological Sciences ◽

10.1098/rstb.1975.0076 ◽

1975 ◽

Vol 272 (915) ◽

pp. 137-146 ◽

Cited By ~ 14

Keyword(s):

Nucleic Acid ◽

Double Helix ◽

Monoclinic Space Group ◽

Sequence Specificity ◽

Acid Interaction ◽

Base Pairs ◽

Crystalline Complex ◽

Structural Explanation ◽

Solution Studies ◽

Model Drug

The intercalative trypanosomal drug, ethidium bromide, forms a crystalline complex with the dinucleoside monophosphate, 5-iodouridylyl(3'-5')adenosine (iodoUpA). These crystals are monoclinic, space group C2, with unit cell dimensions a =2.845nm, b = 1.354 nm, c = 3.413nm, β =98.6°. The structure has been solved to atomic resolution by Patterson and Fourier methods, and refined by full matrix least squares to a residual of 0.20 on 2017 observed reflexions. The asymmetric unit contains two ethidium molecules, two iodoUpA molecules, twenty water molecules and four methanol molecules, a total of 156 atoms excluding hydrogens. The two iodoUpA molecules are held together by adenine-uracil Watson-Crick base-pairing. Adjacent base-pairs within this paired iodoUpA structure and between neighbouring iodoUpA molecules in adjoining unit cells are separated by 0.68 nm. This separation results from intercalative binding by one ethidium molecule and stacking by the other ethidium molecule above and below the base-pairs. Non-crystallographic twofold symmetry is utilized in this model drug-nucleic acid interaction, the intercalative ethidium molecule being oriented such that its phenyl and ethyl groups lie in the narrow groove of the miniature nucleic acid double helix. Solution studies have indicated a marked sequence specificity for ethidium-dinucleotide interactions and a probable structural explanation for this has been provided by this study.

Download Full-text

The biofilm matrix scaffold of Pseudomonas aeruginosa contains G-quadruplex extracellular DNA structures

npj Biofilms and Microbiomes ◽

10.1038/s41522-021-00197-5 ◽

2021 ◽

Vol 7 (1) ◽

Author(s):

Thomas Seviour ◽

Fernaldo Richtia Winnerdy ◽

Lan Li Wong ◽

Xiangyan Shi ◽

Sudarsan Mugunthan ◽

...

Keyword(s):

Pseudomonas Aeruginosa ◽

Extracellular Dna ◽

Chromosomal Dna ◽

Base Pairs ◽

Dna Structures ◽

G Quadruplex ◽

The Matrix ◽

Nucleic Acid Conformation ◽

Conformation Transition

AbstractExtracellular DNA, or eDNA, is recognised as a critical biofilm component; however, it is not understood how it forms networked matrix structures. Here, we isolate eDNA from static-culture Pseudomonas aeruginosa biofilms using ionic liquids to preserve its biophysical signatures of fluid viscoelasticity and the temperature dependency of DNA transitions. We describe a loss of eDNA network structure as resulting from a change in nucleic acid conformation, and propose that its ability to form viscoelastic structures is key to its role in building biofilm matrices. Solid-state analysis of isolated eDNA, as a proxy for eDNA structure in biofilms, reveals non-canonical Hoogsteen base pairs, triads or tetrads involving thymine or uracil, and guanine, suggesting that the eDNA forms G-quadruplex structures. These are less abundant in chromosomal DNA and disappear when eDNA undergoes conformation transition. We verify the occurrence of G-quadruplex structures in the extracellular matrix of intact static and flow-cell biofilms of P. aeruginosa, as displayed by the matrix to G-quadruplex-specific antibody binding, and validate the loss of G-quadruplex structures in vivo to occur coincident with the disappearance of eDNA fibres. Given their stability, understanding how extracellular G-quadruplex structures form will elucidate how P. aeruginosa eDNA builds viscoelastic networks, which are a foundational biofilm property.

Download Full-text

Human homologs of TU transposon sequences: polypurine/polypyrimidine sequence elements that can alter DNA conformation in vitro and in vivo

Molecular and Cellular Biology ◽

10.1128/mcb.6.11.3632-3642.1986 ◽

1986 ◽

Vol 6 (11) ◽

pp. 3632-3642

Author(s):

B Hoffman-Liebermann ◽

D Liebermann ◽

A Troutt ◽

L H Kedes ◽

S N Cohen

Keyword(s):

Tandem Repeats ◽

Base Pairs ◽

S1 Nuclease ◽

Human Dna ◽

Outer Domain ◽

Sequence Elements ◽

Dna Strands ◽

Species Specific

We previously have shown that homologs of the outer domain segment of the inverted repeat termini (IVR-OD) of the sea urchin TU transposons are conserved among multiple eucaryotic species, including humans. We report here that two cloned human DNA IVR-OD homologs, Hut2 and Hut17, consist of a series of tandem repeats of the trimer AGG/TCC, forming segments (313 and 221 base pairs in length, respectively) of polypurine/polypyrimidine (pPu/pPy or "Puppy") asymmetry in the two DNA strands; these are punctuated at certain sites with variant trimers, which are different for the two clones. Sequences homologous to the Hut2 pPu/pPy tract exist at multiple sites in the DNA of a wide variety of eucaryotes. Hybridization of human DNA with a Hut2 probe or with a previously described chicken DNA pPu/pPy sequence indicates that pPu/pPy sequences can be grouped into families distinguishable by the extent of their homology with each probe at different hybridization stringencies. Moreover, particular pPu/pPy tracts show species-specific differences in their distribution. Both the Hut2 and Hut17 pPu/pPy tracts are cleaved by S1 nuclease when tested on supercoiled plasmids. Most if not all of the 313-base-pair Hut2 pPu/pPy tract is also sensitive to S1 in its native location in HeLa cell chromatin, indicating that the sequence contains conformational information that can be expressed in vivo. This view is supported by evidence that exogenously derived Hut2 pPu/pPy tracts introduced into mouse L cells and integrated in chromatin can assume an S1-sensitive conformation.

Download Full-text

ATP-Dependent Minor Groove Recognition of TA Base Pairs Is Required for Template Melting by the E1 Initiator Protein

Journal of Virology ◽

10.1128/jvi.02432-06 ◽

2007 ◽

Vol 81 (7) ◽

pp. 3293-3302 ◽

Cited By ~ 14

Author(s):

Stephen Schuck ◽

Arne Stenlund

Keyword(s):

Dna Replication ◽

Double Helix ◽

Minor Groove ◽

Bovine Papillomavirus ◽

Base Pairs ◽

Initiator Protein ◽

Initiation Of Dna Replication ◽

Multiple Interactions

ABSTRACT Template melting is an essential step in the initiation of DNA replication, but the mechanism of template melting is unknown for any replicon. Here we demonstrate that melting of the bovine papillomavirus type 1 ori is a sequence-dependent process which relies on specific recognition of TA base pairs in the minor groove by the E1 initiator. We show that correct template melting is a prerequisite for the formation of a stable double hexamer with helicase activity and that ori mutants that fail to melt correctly are defective for ori unwinding and DNA replication in vivo. Our results also indicate that melting of the DNA is achieved by destabilization of the double helix along its length through multiple interactions with E1, each of which is responsible for melting of a few base pairs, resulting in the extensive melting that is required for initiation of DNA replication.

Download Full-text

Novel concatameric heparin-binding peptides reverse heparin and low-molecular-weight heparin anticoagulant activities in patient plasma in vitro and in rats in vivo

Blood ◽

10.1182/blood-2003-07-2334 ◽

2004 ◽

Vol 103 (4) ◽

pp. 1356-1363 ◽

Cited By ~ 30

Author(s):

Barbara P. Schick ◽

David Maslow ◽

Adrianna Moshinski ◽

James D. San Antonio

Keyword(s):

Molecular Weight ◽

Low Molecular Weight Heparin ◽

Tandem Repeats ◽

Factor Xa ◽

Low Molecular Weight ◽

Heparin Binding ◽

High Affinity ◽

Binding Peptides

Abstract Patients given unfractionated heparin (UFH) or low-molecular-weight heparin (LMWH) for prophylaxis or treatment of thrombosis sometimes suffer serious bleeding. We showed previously that peptides containing 3 or more tandem repeats of heparin-binding consensus sequences have high affinity for LMWH and neutralize LMWH (enoxaparin) in vivo in rats and in vitro in citrate. We have now modified the (ARKKAAKA)n tandem repeat peptides by cyclization or by inclusion of hydrophobic tails or cysteines to promote multimerization. These peptides exhibit high-affinity binding to LMWH (dissociation constant [Kd], ≈ 50 nM), similar potencies in neutralizing anti–Factor Xa activity of UFH and enoxaparin added to normal plasma in vitro, and efficacy equivalent to or greater than protamine. Peptide (ARKKAAKA)3VLVLVLVL was most effective in all plasmas from enoxaparin-treated patients, and was 4- to 20-fold more effective than protamine. Several other peptide structures were effective in some patients' plasmas. All high-affinity peptides reversed inhibition of thrombin-induced clot formation by UFH. These peptides (1 mg/300 g rat) neutralized 1 U/mL anti–Factor Xa activity of enoxaparin in rats within 1 to 2 minutes. Direct blood pressure and heart rate measurements showed little or no hemodynamic effect. These heparin-binding peptides, singly or in combination, are potential candidates for clinical reversal of UFH and LMWH in humans.

Download Full-text

Time delay during the proton tunneling in the base pairs of the DNA double helix

Progress in Biophysics and Molecular Biology ◽

10.1016/j.pbiomolbio.2021.06.001 ◽

2021 ◽

Author(s):

Gizem Çelebi ◽

Elif Özçelik ◽

Emre Vardar ◽

Durmuş Demir

Keyword(s):

Time Delay ◽

Double Helix ◽

Base Pairs ◽

Proton Tunneling ◽

Dna Double Helix

Download Full-text