causal genes
Recently Published Documents


TOTAL DOCUMENTS

353
(FIVE YEARS 233)

H-INDEX

22
(FIVE YEARS 8)

2022 ◽  
Author(s):  
Loic Yengo ◽  
Sailaja Vedantam ◽  
Eirini Marouli ◽  
Julia Sidorenko ◽  
Eric Bartell ◽  
...  

Common SNPs are predicted to collectively explain 40-50% of phenotypic variation in human height, but identifying the specific variants and associated regions requires huge sample sizes. Here we show, using GWAS data from 5.4 million individuals of diverse ancestries, that 12,111 independent SNPs that are significantly associated with height account for nearly all of the common SNP-based heritability. These SNPs are clustered within 7,209 non-overlapping genomic segments with a median size of ~90 kb, covering ~21% of the genome. The density of independent associations varies across the genome and the regions of elevated density are enriched for biologically relevant genes. In out-of-sample estimation and prediction, the 12,111 SNPs account for 40% of phenotypic variance in European ancestry populations but only ~10%-20% in other ancestries. Effect sizes, associated regions, and gene prioritization are similar across ancestries, indicating that reduced prediction accuracy is likely explained by linkage disequilibrium and allele frequency differences within associated regions. Finally, we show that the relevant biological pathways are detectable with smaller sample sizes than needed to implicate causal genes and variants. Overall, this study, the largest GWAS to date, provides an unprecedented saturated map of specific genomic regions containing the vast majority of common height-associated variants.


Author(s):  
Ke Hao ◽  
Raili Ermel ◽  
Katyayani Sukhavasi ◽  
Haoxiang Cheng ◽  
Lijiang Ma ◽  
...  

Background: Hundreds of candidate genes have been associated with coronary artery disease (CAD) through genome-wide association studies. However, a systematic way to understand the causal mechanism(s) of these genes, and a means to prioritize them for further study, has been lacking. This represents a major roadblock for developing novel disease- and gene-specific therapies for patients with CAD. Recently, powerful integrative genomics analyses pipelines have emerged to identify and prioritize candidate causal genes by integrating tissue/cell-specific gene expression data with genome-wide association studies data sets. Methods: We aimed to develop a comprehensive integrative genomics analyses pipeline for CAD and to provide a prioritized list of causal CAD genes. To this end, we leveraged several complimentary informatics approaches to integrate summary statistics from CAD genome-wide association studies (from UK Biobank and CARDIoGRAMplusC4D) with transcriptomic and expression quantitative trait loci data from 9 cardiometabolic tissue/cell types in the STARNET study (Stockholm-Tartu Atherosclerosis Reverse Network Engineering Task). Results: We identified 162 unique candidate causal CAD genes, which exerted their effect from between one and up to 7 disease-relevant tissues/cell types, including the arterial wall, blood, liver, skeletal muscle, adipose, foam cells, and macrophages. When their causal effect was ranked, the top candidate causal CAD genes were CDKN2B (associated with the 9p21.3 risk locus) and PHACTR1 ; both exerting their causal effect in the arterial wall. A majority of candidate causal genes were represented in cross-tissue gene regulatory co-expression networks that are involved with CAD, with 22/162 being key drivers in those networks. Conclusions: We identified and prioritized candidate causal CAD genes, also localizing their tissue(s) of causal effect. These results should serve as a resource and facilitate targeted studies to identify the functional impact of top causal CAD genes.


2021 ◽  
Author(s):  
Stavroula Kanoni ◽  
Sarah E Graham ◽  
Yuxuan Wang ◽  
Ida Surakka ◽  
Shweta Ramdas ◽  
...  

Genetic variants within nearly 1,000 loci are known to contribute to modulation of blood lipid levels. However, the biological pathways underlying these associations are frequently unknown, limiting understanding of these findings and hindering downstream translational efforts such as drug target discovery. To expand our understanding of the underlying biological pathways and mechanisms controlling blood lipid levels, we leverage a large multi-ancestry meta-analysis (N=1,654,960) of blood lipids to prioritize putative causal genes for 2,286 lipid associations by combining six gene prediction methods and assigning a confidence score. We assign, most confidently, 118 candidate causal genes and identify potential drug targets including bona-fide (PCSK9) and putative (PNLIP and ARF6) genes. Using phenome-wide association (PheWAS) scans, we identify relationships of genetically-predicted lipid levels to other diseases and conditions. We confirm known pleiotropic associations with cardiovascular phenotypes and determine novel associations, notably with cholelithiasis risk. We perform sex-stratified GWAS meta-analysis of lipid levels and show that 3-5% of autosomal lipid-associated loci demonstrate sex-biased effects. Finally, we report 21 novel lipid loci identified on the X chromosome. Many of the sex-biased autosomal and X chromosome lipid loci show pleiotropic associations with sex hormones, emphasizing the role of hormone regulation in lipid metabolism. Taken together, our findings provide insights into the mechanisms through which associated variants lead to altered lipid levels and potentially cardiovascular disease risk.


2021 ◽  
Author(s):  
Zichen Zhang ◽  
Ye Eun Bae ◽  
Jonathan R. Bradley ◽  
Lang Wu ◽  
Chong Wu

AbstractGenes with moderate to low expression heritability may explain a large proportion of complex trait heritability, but these genes are insufficiently captured in transcriptome-wide association studies (TWAS) partly due to the relatively small available reference datasets for developing expression genetic prediction models to capture the moderate to low genetically regulated components of gene expression. Here, we introduce a new method, Summary-level Unified Method for Modeling Integrated Transcriptome (SUMMIT), to improve the expression prediction model accuracy and the power of TWAS by using a large expression quantitative trait loci (eQTL) summary-level dataset. We applied SUMMIT to the eQTL summary-level data provided by the eQTLGen consortium, which involve 31,684 blood samples from 37 cohorts. Through simulation studies and analyses of GWAS summary statistics for 24 complex traits, we show that SUMMIT substantially improves the accuracy of expression prediction in blood, successfully builds expression prediction models for genes with low expression heritability, and achieves higher statistical power than several benchmark methods. In the end, we conducted a case study of COVID-19 severity with SUMMIT and identified 11 likely causal genes associated with COVID-19 severity.


2021 ◽  
Vol 22 (24) ◽  
pp. 13294
Author(s):  
Luke Mansard ◽  
David Baux ◽  
Christel Vaché ◽  
Catherine Blanchet ◽  
Isabelle Meunier ◽  
...  

Usher syndrome is an autosomal recessive disorder characterized by congenital hearing loss combined with retinitis pigmentosa, and in some cases, vestibular areflexia. Three clinical subtypes are distinguished, and MYO7A and USH2A represent the two major causal genes involved in Usher type I, the most severe form, and type II, the most frequent form, respectively. Massively parallel sequencing was performed on a cohort of patients in the context of a molecular diagnosis to confirm clinical suspicion of Usher syndrome. We report here 231 pathogenic MYO7A and USH2A genotypes identified in 73 Usher type I and 158 Usher type II patients. Furthermore, we present the ACMG classification of the variants, which comprise all types. Among them, 68 have not been previously reported in the literature, including 12 missense and 16 splice variants. We also report a new deep intronic variant in USH2A. Despite the important number of molecular studies published on these two genes, we show that during the course of routine genetic diagnosis, undescribed variants continue to be identified at a high rate. This is particularly pertinent in the current era, where therapeutic strategies based on DNA or RNA technologies are being developed.


2021 ◽  
Vol 12 ◽  
Author(s):  
Xiongjie Zheng ◽  
Yu Yang ◽  
Salim Al-Babili

In plants, carotenoids are subjected to enzyme-catalyzed oxidative cleavage reactions as well as to non-enzymatic degradation processes, which produce various carbonyl products called apocarotenoids. These conversions control carotenoid content in different tissues and give rise to apocarotenoid hormones and signaling molecules, which play important roles in plant growth and development, response to environmental stimuli, and in interactions with surrounding organisms. In addition, carotenoid cleavage gives rise to apocarotenoid pigments and volatiles that contribute to the color and flavor of many flowers and several fruits. Some apocarotenoid pigments, such as crocins and bixin, are widely utilized as colorants and additives in food and cosmetic industry and also have health-promoting properties. Considering the importance of this class of metabolites, investigation of apocarotenoid diversity and regulation has increasingly attracted the attention of plant biologists. Here, we provide an update on the plant apocarotenoid biosynthetic pathway, especially highlighting the diversity of the enzyme carotenoid cleavage dioxygenase 4 (CCD4) from different plant species with respect to substrate specificity and regioselectivity, which contribute to the formation of diverse apocarotenoid volatiles and pigments. In addition, we summarize the regulation of apocarotenoid metabolic pathway at transcriptional, post-translational, and epigenetic levels. Finally, we describe inter- and intraspecies variation in apocarotenoid production observed in many important horticulture crops and depict recent progress in elucidating the genetic basis of the natural variation in the composition and amount of apocarotenoids. We propose that the illustration of biochemical, genetic, and evolutionary background of apocarotenoid diversity would not only accelerate the discovery of unknown biosynthetic and regulatory genes of bioactive apocarotenoids but also enable the identification of genetic variation of causal genes for marker-assisted improvement of aroma and color of fruits and vegetables and CRISPR-based next-generation metabolic engineering of high-value apocarotenoids.


2021 ◽  
Vol 53 (1) ◽  
Author(s):  
Leticia P. Sanglard ◽  
Yijian Huang ◽  
Kent A. Gray ◽  
Daniel C. L. Linhares ◽  
Jack C. M. Dekkers ◽  
...  

Abstract Background The possibility of using antibody response (S/P ratio) to PRRSV vaccination measured in crossbred commercial gilts as a genetic indicator for reproductive performance in vaccinated crossbred sows has motivated further studies of the genomic basis of this trait. In this study, we investigated the association of haplotypes and runs of homozygosity (ROH) and heterozygosity (ROHet) with S/P ratio and their impact on reproductive performance. Results There was no association (P-value ≥ 0.18) of S/P ratio with the percentage of ROH or ROHet, or with the percentage of heterozygosity across the whole genome or in the major histocompatibility complex (MHC) region. However, specific ROH and ROHet regions were significantly associated (P-value ≤ 0.01) with S/P ratio on chromosomes 1, 4, 5, 7, 10, 11, 13, and 17 but not (P-value ≥ 0.10) with reproductive performance. With the haplotype-based genome-wide association study (GWAS), additional genomic regions associated with S/P ratio were identified on chromosomes 4, 7, and 9. These regions harbor immune-related genes, such as SLA-DOB, TAP2, TAPBP, TMIGD3, and ADORA. Four haplotypes at the identified region on chromosome 7 were also associated with multiple reproductive traits. A haplotype significantly associated with S/P ratio that is located in the MHC region may be in stronger linkage disequilibrium (LD) with the quantitative trait loci (QTL) than the previously identified single nucleotide polymorphism (SNP) (H3GA0020505) given the larger estimate of genetic variance explained by the haplotype than by the SNP. Conclusions Specific ROH and ROHet regions were significantly associated with S/P ratio. The haplotype-based GWAS identified novel QTL for S/P ratio on chromosomes 4, 7, and 9 and confirmed the presence of at least one QTL in the MHC region. The chromosome 7 region was also associated with reproductive performance. These results narrow the search for causal genes in this region and suggest SLA-DOB and TAP2 as potential candidate genes associated with S/P ratio on chromosome 7. These results provide additional opportunities for marker-assisted selection and genomic selection for S/P ratio as genetic indicator for litter size in commercial pig populations.


2021 ◽  
Vol 12 ◽  
Author(s):  
Martina Rauner ◽  
Ines Foessl ◽  
Melissa M. Formosa ◽  
Erika Kague ◽  
Vid Prijatelj ◽  
...  

The availability of large human datasets for genome-wide association studies (GWAS) and the advancement of sequencing technologies have boosted the identification of genetic variants in complex and rare diseases in the skeletal field. Yet, interpreting results from human association studies remains a challenge. To bridge the gap between genetic association and causality, a systematic functional investigation is necessary. Multiple unknowns exist for putative causal genes, including cellular localization of the molecular function. Intermediate traits (“endophenotypes”), e.g. molecular quantitative trait loci (molQTLs), are needed to identify mechanisms of underlying associations. Furthermore, index variants often reside in non-coding regions of the genome, therefore challenging for interpretation. Knowledge of non-coding variance (e.g. ncRNAs), repetitive sequences, and regulatory interactions between enhancers and their target genes is central for understanding causal genes in skeletal conditions. Animal models with deep skeletal phenotyping and cell culture models have already facilitated fine mapping of some association signals, elucidated gene mechanisms, and revealed disease-relevant biology. However, to accelerate research towards bridging the current gap between association and causality in skeletal diseases, alternative in vivo platforms need to be used and developed in parallel with the current -omics and traditional in vivo resources. Therefore, we argue that as a field we need to establish resource-sharing standards to collectively address complex research questions. These standards will promote data integration from various -omics technologies and functional dissection of human complex traits. In this mission statement, we review the current available resources and as a group propose a consensus to facilitate resource sharing using existing and future resources. Such coordination efforts will maximize the acquisition of knowledge from different approaches and thus reduce redundancy and duplication of resources. These measures will help to understand the pathogenesis of osteoporosis and other skeletal diseases towards defining new and more efficient therapeutic targets.


2021 ◽  
Author(s):  
Young Keun Lee ◽  
Jisoo Kim ◽  
Sung Wook Seo

Abstract BackgroundThe recent explosion of cancer genomics provides extensive information about mutations and gene expression changes in cancer. However, most of the identified gene mutations are not clinically utilized. It remains uncertain whether the presence of a certain genetic alteration will affect treatment response. Conventional statistics have limitations for causal inferences and are hard to gain sufficient power in genomic datasets. Here, we developed and evaluated an algorithm for searching the causal genes that maximize the effect of the treatment.MethodsThe algorithm was developed based on the potential outcome framework and Bayesian posterior update. The precision of the algorithm was validated using a simulation dataset. The algorithm was implemented to a cBioPortal dataset. The genes discovered by the algorithm were externally validated within CancerSCAN screening data from Samsung Medical Center.ResultsSimulation data analysis showed that the C-search algorithm was able to identify nine causal genes out of ten. The C-search algorithm shows the discovery rate rapidly increasing until the 1500 number of data. Meanwhile, the log-rank test shows a slower increase in performance. The C-search algorithm was able to suggest nine causal genes from the cBioPortal Metabric dataset. Treating the patients with the causal genes are associated with better survival outcome in both the cBioPortal dataset and the CancerSCAN dataset which is used for external validation.ConclusionsOur C-search algorithm demonstrated better performance to identify causal effects of the genes than multiple rog-rank test analysis especially within a limited number of data. The result suggests that the C-search can discover the causal genes from various genetic datasets, where the number of samples is limited compared to the number of variables.


2021 ◽  
Author(s):  
Cheng He ◽  
Jacob D Washburn ◽  
Yangfan Hao ◽  
Zhiwu Zhang ◽  
Jinliang Yang ◽  
...  

Genome-wide association study (GWAS) with single nucleotide polymorphisms (SNPs) has been widely used to explore genetic controls of phenotypic traits. Here we employed an GWAS approach using k-mers, short substrings from sequencing reads. Using maize cob and kernel color traits, we demonstrated that k-mer GWAS can effectively identify associated k-mers. Co-expression analysis of kernel color k-mers and pathway genes directly found k-mers from causal genes. Analyzing complex traits of kernel oil and leaf angle resulted in k-mers from both known and candidate genes. Evolution analysis revealed most k-mers positively correlated with kernel oil were strongly selected against in maize populations, while most k-mers for upright leaf angle were positively selected. In addition, phenotypic prediction of kernel oil, leaf angle, and flowering time using k-mer data showed at least a similarly high prediction accuracy to the standard SNP-based method. Collectively, our results demonstrated the bridging role of k-mers for data integration and functional gene discovery.


Sign in / Sign up

Export Citation Format

Share Document