scholarly journals GenomegaMap: within-species genome-wide dN/dS estimation from over 10,000 genomes

2019 ◽  
Author(s):  
Daniel J. Wilson ◽  

ABSTRACTThe dN/dS ratio provides evidence of adaptation or functional constraint in protein-coding genes by quantifying the relative excess or deficit of amino acid-replacing versus silent nucleotide variation. Inexpensive sequencing promises a better understanding of parameters such as dN/dS, but analysing very large datasets poses a major statistical challenge. Here I introduce genomegaMap for estimating within-species genome-wide variation in dN/dS, and I apply it to 3,979 genes across 10,209 tuberculosis genomes to characterize the selection pressures shaping this global pathogen. GenomegaMap is a phylogeny-free method that addresses two major problems with existing approaches: (i) it is fast no matter how large the sample size and (ii) it is robust to recombination, which causes phylogenetic methods to report artefactual signals of adaptation. GenomegaMap uses population genetics theory to approximate the distribution of allele frequencies under general, parent-dependent mutation models. Coalescent simulations show that substitution parameters are well-estimated even when genomegaMap’s simplifying assumption of independence among sites is violated. I demonstrate the ability of genomegaMap to detect genuine signatures of selection at antimicrobial resistance-conferring substitutions in M. tuberculosis and describe a novel signature of selection in the cold-shock DEAD-box protein A gene deaD/csdA. The genomegaMap approach helps accelerate the exploitation of big data for gaining new insights into evolution within species.

2020 ◽  
Vol 37 (8) ◽  
pp. 2450-2460 ◽  
Author(s):  
Daniel J Wilson ◽  
Derrick W Crook ◽  
Timothy E A Peto ◽  
A Sarah Walker ◽  
Sarah J Hoosdally ◽  
...  

Abstract The dN/dS ratio provides evidence of adaptation or functional constraint in protein-coding genes by quantifying the relative excess or deficit of amino acid-replacing versus silent nucleotide variation. Inexpensive sequencing promises a better understanding of parameters, such as dN/dS, but analyzing very large data sets poses a major statistical challenge. Here, I introduce genomegaMap for estimating within-species genome-wide variation in dN/dS, and I apply it to 3,979 genes across 10,209 tuberculosis genomes to characterize the selection pressures shaping this global pathogen. GenomegaMap is a phylogeny-free method that addresses two major problems with existing approaches: 1) It is fast no matter how large the sample size and 2) it is robust to recombination, which causes phylogenetic methods to report artefactual signals of adaptation. GenomegaMap uses population genetics theory to approximate the distribution of allele frequencies under general, parent-dependent mutation models. Coalescent simulations show that substitution parameters are well estimated even when genomegaMap’s simplifying assumption of independence among sites is violated. I demonstrate the ability of genomegaMap to detect genuine signatures of selection at antimicrobial resistance-conferring substitutions in Mycobacterium tuberculosis and describe a novel signature of selection in the cold-shock DEAD-box protein A gene deaD/csdA. The genomegaMap approach helps accelerate the exploitation of big data for gaining new insights into evolution within species.


2013 ◽  
Vol 94 (3) ◽  
pp. 406-412 ◽  
Author(s):  
Huanan Wang ◽  
Ting Zhu ◽  
Shenye Yu ◽  
Huifang Liu ◽  
Xiumei Wang ◽  
...  

2020 ◽  
Vol 36 (9) ◽  
pp. 2936-2937 ◽  
Author(s):  
Gareth Peat ◽  
William Jones ◽  
Michael Nuhn ◽  
José Carlos Marugán ◽  
William Newell ◽  
...  

Abstract Motivation Genome-wide association studies (GWAS) are a powerful method to detect even weak associations between variants and phenotypes; however, many of the identified associated variants are in non-coding regions, and presumably influence gene expression regulation. Identifying potential drug targets, i.e. causal protein-coding genes, therefore, requires crossing the genetics results with functional data. Results We present a novel data integration pipeline that analyses GWAS results in the light of experimental epigenetic and cis-regulatory datasets, such as ChIP-Seq, Promoter-Capture Hi-C or eQTL, and presents them in a single report, which can be used for inferring likely causal genes. This pipeline was then fed into an interactive data resource. Availability and implementation The analysis code is available at www.github.com/Ensembl/postgap and the interactive data browser at postgwas.opentargets.io.


2021 ◽  
Vol 12 (8) ◽  
Author(s):  
Guo-dong Zhu ◽  
Jing Yu ◽  
Zheng-yu Sun ◽  
Yan Chen ◽  
Hong-mei Zheng ◽  
...  

AbstractGlioblastomas (GBM) is the most common primary malignant brain tumor, and radiotherapy plays a critical role in its therapeutic management. Unfortunately, the development of radioresistance is universal. Here, we identified calcium-regulated heat-stable protein 1 (CARHSP1) as a critical driver for radioresistance utilizing genome-wide CRISPR activation screening. This is a protein with a cold-shock domain (CSD)-containing that is highly similar to cold-shock proteins. CARHSP1 mRNA level was upregulated in irradiation-resistant GBM cells and knockdown of CARHSP1 sensitized GBM cells to radiotherapy. The high expression of CARHSP1 upon radiation might mediate radioresistance by activating the inflammatory signaling pathway. More importantly, patients with high levels of CARHSP1 had poorer survival when treated with radiotherapy. Collectively, our findings suggested that targeting the CARHSP1/TNF-α inflammatory signaling activation induced by radiotherapy might directly affect radioresistance and present an attractive therapeutic target for GBM, particularly for patients with high levels of CARHSP1.


2021 ◽  
Vol 22 (11) ◽  
pp. 6091
Author(s):  
Kristina Daniunaite ◽  
Arnas Bakavicius ◽  
Kristina Zukauskaite ◽  
Ieva Rauluseviciute ◽  
Juozas Rimantas Lazutka ◽  
...  

The molecular diversity of prostate cancer (PCa) has been demonstrated by recent genome-wide studies, proposing a significant number of different molecular markers. However, only a few of them have been transferred into clinical practice so far. The present study aimed to identify and validate novel DNA methylation biomarkers for PCa diagnosis and prognosis. Microarray-based methylome data of well-characterized cancerous and noncancerous prostate tissue (NPT) pairs was used for the initial screening. Ten protein-coding genes were selected for validation in a set of 151 PCa, 51 NPT, as well as 17 benign prostatic hyperplasia samples. The Prostate Cancer Dataset (PRAD) of The Cancer Genome Atlas (TCGA) was utilized for independent validation of our findings. Methylation frequencies of ADAMTS12, CCDC181, FILIP1L, NAALAD2, PRKCB, and ZMIZ1 were up to 91% in our study. PCa specific methylation of ADAMTS12, CCDC181, NAALAD2, and PRKCB was demonstrated by qualitative and quantitative means (all p < 0.05). In agreement with PRAD, promoter methylation of these four genes was associated with the transcript down-regulation in the Lithuanian cohort (all p < 0.05). Methylation of ADAMTS12, NAALAD2, and PRKCB was independently predictive for biochemical disease recurrence, while NAALAD2 and PRKCB increased the prognostic power of multivariate models (all p < 0.01). The present study identified methylation of ADAMTS12, NAALAD2, and PRKCB as novel diagnostic and prognostic PCa biomarkers that might guide treatment decisions in clinical practice.


2019 ◽  
Vol 20 (13) ◽  
pp. 3315 ◽  
Author(s):  
Simona Cantarella ◽  
Davide Carnevali ◽  
Marco Morselli ◽  
Anastasia Conti ◽  
Matteo Pellegrini ◽  
...  

Alu retroelements, whose retrotransposition requires prior transcription by RNA polymerase III to generate Alu RNAs, represent the most numerous non-coding RNA (ncRNA) gene family in the human genome. Alu transcription is generally kept to extremely low levels by tight epigenetic silencing, but it has been reported to increase under different types of cell perturbation, such as viral infection and cancer. Alu RNAs, being able to act as gene expression modulators, may be directly involved in the mechanisms determining cellular behavior in such perturbed states. To directly address the regulatory potential of Alu RNAs, we generated IMR90 fibroblasts and HeLa cell lines stably overexpressing two slightly different Alu RNAs, and analyzed genome-wide the expression changes of protein-coding genes through RNA-sequencing. Among the genes that were upregulated or downregulated in response to Alu overexpression in IMR90, but not in HeLa cells, we found a highly significant enrichment of pathways involved in cell cycle progression and mitotic entry. Accordingly, Alu overexpression was found to promote transition from G1 to S phase, as revealed by flow cytometry. Therefore, increased Alu RNA may contribute to sustained cell proliferation, which is an important factor of cancer development and progression.


Genes ◽  
2021 ◽  
Vol 12 (5) ◽  
pp. 643
Author(s):  
Thibaud Kuca ◽  
Brandy M. Marron ◽  
Joana G. P. Jacinto ◽  
Julia M. Paris ◽  
Christian Gerspach ◽  
...  

Genodermatosis such as hair disorders mostly follow a monogenic mode of inheritance. Congenital hypotrichosis (HY) belong to this group of disorders and is characterized by abnormally reduced hair since birth. The purpose of this study was to characterize the clinical phenotype of a breed-specific non-syndromic form of HY in Belted Galloway cattle and to identify the causative genetic variant for this recessive disorder. An affected calf born in Switzerland presented with multiple small to large areas of alopecia on the limbs and on the dorsal part of the head, neck, and back. A genome-wide association study using Swiss and US Belted Galloway cattle encompassing 12 cases and 61 controls revealed an association signal on chromosome 29. Homozygosity mapping in a subset of cases refined the HY locus to a 1.5 Mb critical interval and subsequent Sanger sequencing of protein-coding exons of positional candidate genes revealed a stop gain variant in the HEPHL1 gene that encodes a multi-copper ferroxidase protein so-called hephaestin like 1 (c.1684A>T; p.Lys562*). A perfect concordance between the homozygous presence of this most likely pathogenic loss-of-function variant and the HY phenotype was found. Genotyping of more than 700 purebred Swiss and US Belted Galloway cattle showed the global spread of the mutation. This study provides a molecular test that will permit the avoidance of risk matings by systematic genotyping of relevant breeding animals. This rare recessive HEPHL1-related form of hypotrichosis provides a novel large animal model for similar human conditions. The results have been incorporated in the Online Mendelian Inheritance in Animals (OMIA) database (OMIA 002230-9913).


Sign in / Sign up

Export Citation Format

Share Document