Chapter 2: Identification of Genes and their Regulatory Regions Based on Multiple Physical and Structural Properties of a DNA Sequence

Author(s):  
Xi Yang ◽  
Nancy Yu Song ◽  
Hong Yan
2020 ◽  
Vol 21 (1) ◽  
Author(s):  
Nathaniel P. Delos Santos ◽  
Lorane Texari ◽  
Christopher Benner

Abstract Background Motif enrichment analysis (MEA) identifies over-represented transcription factor binding (TF) motifs in the DNA sequence of regulatory regions, enabling researchers to infer which transcription factors can regulate transcriptional response to a stimulus, or identify sequence features found near a target protein in a ChIP-seq experiment. Score-based MEA determines motifs enriched in regions exhibiting extreme differences in regulatory activity, but existing methods do not control for biases in GC content or dinucleotide composition. This lack of control for sequence bias, such as those often found in CpG islands, can obscure the enrichment of biologically relevant motifs. Results We developed Motif Enrichment In Ranked Lists of Peaks (MEIRLOP), a novel MEA method that determines enrichment of TF binding motifs in a list of scored regulatory regions, while controlling for sequence bias. In this study, we compare MEIRLOP against other MEA methods in identifying binding motifs found enriched in differentially active regulatory regions after interferon-beta stimulus, finding that using logistic regression and covariates improves the ability to call enrichment of ISGF3 binding motifs from differential acetylation ChIP-seq data compared to other methods. Our method achieves similar or better performance compared to other methods when quantifying the enrichment of TF binding motifs from ENCODE TF ChIP-seq datasets. We also demonstrate how MEIRLOP is broadly applicable to the analysis of numerous types of NGS assays and experimental designs. Conclusions Our results demonstrate the importance of controlling for sequence bias when accurately identifying enriched DNA sequence motifs using score-based MEA. MEIRLOP is available for download from https://github.com/npdeloss/meirlop under the MIT license.


Gene ◽  
2008 ◽  
Vol 410 (1) ◽  
pp. 165-176 ◽  
Author(s):  
Pelin Akan ◽  
Panos Deloukas

2000 ◽  
Vol 107 (4) ◽  
pp. 362-365 ◽  
Author(s):  
Michael Krawczak ◽  
Nadia Chuzhanova ◽  
Peter Stenson ◽  
Bent Johansen ◽  
Edward Ball ◽  
...  

1984 ◽  
Vol 12 (4) ◽  
pp. 1779-1790 ◽  
Author(s):  
N. Ramakrishna ◽  
E. Dubnau ◽  
I. Smith

Author(s):  
Barbara Trask ◽  
Susan Allen ◽  
Anne Bergmann ◽  
Mari Christensen ◽  
Anne Fertitta ◽  
...  

Using fluorescence in situ hybridization (FISH), the positions of DNA sequences can be discretely marked with a fluorescent spot. The efficiency of marking DNA sequences of the size cloned in cosmids is 90-95%, and the fluorescent spots produced after FISH are ≈0.3 μm in diameter. Sites of two sequences can be distinguished using two-color FISH. Different reporter molecules, such as biotin or digoxigenin, are incorporated into DNA sequence probes by nick translation. These reporter molecules are labeled after hybridization with different fluorochromes, e.g., FITC and Texas Red. The development of dual band pass filters (Chromatechnology) allows these fluorochromes to be photographed simultaneously without registration shift.


2012 ◽  
pp. n/a-n/a
Author(s):  
Qian-Quan Li ◽  
Min-Hui Li ◽  
Qing-Jun Yuan ◽  
Zhan-Hu Cui ◽  
Lu-Qi Huang ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document