Ranked k-spectrum kernel for comparative and evolutionary comparison of exons, introns, and CpG islands

Author(s):  
Sangseon Lee ◽  
Taeheon Lee ◽  
Yung-Kyun Noh ◽  
Sun Kim
2018 ◽  
Author(s):  
Florian Massipa ◽  
Marc Laurent ◽  
Caroline Brossas ◽  
José Miguel Fernández-Justel ◽  
María Gómez ◽  
...  

BackgroundThe replication programme of vertebrate genomes is driven by the chro-mosomal distribution and timing of activation of tens of thousands of replication origins. Genome-wide studies have shown the frequent association of origins with promoters and CpG islands, and their enrichment in G-quadruplex sequence motifs (G4). However, the genetic determinants driving their activity remain poorly understood. To gain insight on the functional constraints operating on replication origins and their spatial distribution, we conducted the first evolutionary comparison of genome-wide origins maps across vertebrates.ResultsWe generated a high resolution genome-wide map of chicken replication origins (the first of a bird genome), and performed an extensive comparison with human and mouse maps. The analysis of intra-species polymorphism revealed a strong depletion of genetic diversity on an ~ 40 bp region centred on the replication initiation loci. Surprisingly, this depletion in genetic diversity was not linked to the presence of G4 motifs, nor to the association with promoters or CpG islands. In contrast, we also showed that origins experienced a rapid turnover during vertebrates evolution, since pairwise comparisons of origin maps revealed that only 4 to 24% of them were conserved between any two species.ConclusionsThis study unravels the existence of a novel genetic determinant of replication origins, the precise functional role of which remains to be determined. Despite the importance of replication initiation activity for the fitness of organisms, the distribution of replication origins along vertebrate chromosomes is highly flexible.


2020 ◽  
Vol 15 ◽  
Author(s):  
Dicle Yalcin ◽  
Hasan H. Otu

Background: Epigenetic repression mechanisms play an important role in gene regulation, specifically in cancer development. In many cases, a CpG island’s (CGI) susceptibility or resistance to methylation are shown to be contributed by local DNA sequence features. Objective: To develop unbiased machine learning models–individually and combined for different biological features–that predict the methylation propensity of a CGI. Methods: We developed our model consisting of CGI sequence features on a dataset of 75 sequences (28 prone, 47 resistant) representing a genome-wide methylation structure. We tested our model on two independent datasets that are chromosome (132 sequences) and disease (70 sequences) specific. Results: We provided improvements in prediction accuracy over previous models. Our results indicate that combined features better predict the methylation propensity of a CGI (area under the curve (AUC) ~0.81). Our global methylation classifier performs well on independent datasets reaching an AUC of ~0.82 for the complete model and an AUC of ~0.88 for the model using select sequences that better represent their classes in the training set. We report certain de novo motifs and transcription factor binding site (TFBS) motifs that are consistently better in separating prone and resistant CGIs. Conclusion: Predictive models for the methylation propensity of CGIs lead to a better understanding of disease mechanisms and can be used to classify genes based on their tendency to contain methylation prone CGIs, which may lead to preventative treatment strategies. MATLAB and Python™ scripts used for model building, prediction, and downstream analyses are available at https://github.com/dicleyalcin/methylProp_predictor.


2017 ◽  
Vol 12 (2) ◽  
pp. 181-184 ◽  
Author(s):  
Zhenxin Fan ◽  
Bisong Yue ◽  
Xiuyue Zhang ◽  
Lianming Du ◽  
Zuoyi Jian
Keyword(s):  

Author(s):  
R. Jamuna

CpG islands (CGIs) play a vital role in genome analysis as genomic markers.  Identification of the CpG pair has contributed not only to the prediction of promoters but also to the understanding of the epigenetic causes of cancer. In the human genome [1] wherever the dinucleotides CG occurs the C nucleotide (cytosine) undergoes chemical modifications. There is a relatively high probability of this modification that mutates C into a T. For biologically important reasons the mutation modification process is suppressed in short stretches of the genome, such as ‘start’ regions. In these regions [2] predominant CpG dinucleotides are found than elsewhere. Such regions are called CpG islands. DNA methylation is an effective means by which gene expression is silenced. In normal cells, DNA methylation functions to prevent the expression of imprinted and inactive X chromosome genes. In cancerous cells, DNA methylation inactivates tumor-suppressor genes, as well as DNA repair genes, can disrupt cell-cycle regulation. The most current methods for identifying CGIs suffered from various limitations and involved a lot of human interventions. This paper gives an easy searching technique with data mining of Markov Chain in genes. Markov chain model has been applied to study the probability of occurrence of C-G pair in the given   gene sequence. Maximum Likelihood estimators for the transition probabilities for each model and analgously for the  model has been developed and log odds ratio that is calculated estimates the presence or absence of CpG is lands in the given gene which brings in many  facts for the cancer detection in human genome.


Cancers ◽  
2021 ◽  
Vol 13 (10) ◽  
pp. 2392
Author(s):  
Giovanna Spatari ◽  
Alessandro Allegra ◽  
Mariella Carrieri ◽  
Giovanni Pioggia ◽  
Sebastiano Gangemi

Benzene carcinogenic ability has been reported, and chronic exposure to benzene can be one of the risk elements for solid cancers and hematological neoplasms. Benzene is acknowledged as a myelotoxin, and it is able to augment the risk for the onset of acute myeloid leukemia, myelodysplastic syndromes, aplastic anemia, and lymphomas. Possible mechanisms of benzene initiation of hematological tumors have been identified, as a genotoxic effect, an action on oxidative stress and inflammation and the provocation of immunosuppression. However, it is becoming evident that genetic alterations and the other causes are insufficient to fully justify several phenomena that influence the onset of hematologic malignancies. Acquired epigenetic alterations may participate with benzene leukemogenesis, as benzene may affect nuclear receptors, and provoke post-translational alterations at the protein level, thereby touching the function of regulatory proteins, comprising oncoproteins and tumor suppressor proteins. DNA hypomethylation correlates with stimulation of oncogenes, while the hypermethylation of CpG islands in promoter regions of specific tumor suppressor genes inhibits their transcription and stimulates the onset of tumors. The discovery of the systems of epigenetic induction of benzene-caused hematological tumors has allowed the possibility to operate with pharmacological interventions able of stopping or overturning the negative effects of benzene.


Genetics ◽  
2001 ◽  
Vol 159 (3) ◽  
pp. 1089-1102
Author(s):  
James C Badciong ◽  
Jeffery M Otto ◽  
Gail L Waring

Abstract The Drosophila dec-1 gene encodes multiple proteins that are required for female fertility and proper eggshell morphogenesis. Genetic and immunolocalization data suggest that the different DEC-1 proteins are functionally distinct. To identify regions within the proteins with potential biological significance, we cloned and sequenced the D. yakuba and D. virilis dec-1 homologs. Interspecies comparisons of the predicted translation products revealed rapidly evolving sequences punctuated by blocks of conserved amino acids. Despite extensive amino acid variability, the proteins produced by the different dec-1 homologs were functionally interchangeable. The introduction of transgenes containing either the D. yakuba or the D. virilis dec-1 open reading frames into a D. melanogaster DEC-1 protein null mutant was sufficient to restore female fertility and wild-type eggshell morphology. Normal expression and extracellular processing of the DEC-1 proteins was correlated with the phenotypic rescue. The nature of the conserved features highlighted by the evolutionary comparison and the molecular resemblance of some of these features to those found in other extracellular proteins suggests functional correlates for some of the multiple DEC-1 derivatives.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Rupalatha Maddala ◽  
Junyuan Gao ◽  
Richard T. Mathias ◽  
Tylor R. Lewis ◽  
Vadim Y. Arshavsky ◽  
...  

AbstractS100A4, a member of the S100 family of multifunctional calcium-binding proteins, participates in several physiological and pathological processes. In this study, we demonstrate that S100A4 expression is robustly induced in differentiating fiber cells of the ocular lens and that S100A4(−/−) knockout mice develop late-onset cortical cataracts. Transcriptome profiling of lenses from S100A4(−/−) mice revealed a robust increase in the expression of multiple photoreceptor- and Müller glia-specific genes, as well as the olfactory sensory neuron-specific gene, S100A5. This aberrant transcriptional profile is characterized by corresponding increases in the levels of proteins encoded by the aberrantly upregulated genes. Ingenuity pathway network and curated pathway analyses of differentially expressed genes in S100A4(−/−) lenses identified Crx and Nrl transcription factors as the most significant upstream regulators, and revealed that many of the upregulated genes possess promoters containing a high-density of CpG islands bearing trimethylation marks at histone H3K27 and/or H3K4, respectively. In support of this finding, we further documented that S100A4(−/−) knockout lenses have altered levels of trimethylated H3K27 and H3K4. Taken together, our findings suggest that S100A4 suppresses the expression of retinal genes during lens differentiation plausibly via a mechanism involving changes in histone methylation.


2021 ◽  
Author(s):  
Daniel N. Weinberg ◽  
Phillip Rosenbaum ◽  
Xiao Chen ◽  
Douglas Barrows ◽  
Cynthia Horth ◽  
...  
Keyword(s):  
De Novo ◽  

Sign in / Sign up

Export Citation Format

Share Document