scholarly journals iGWAS: Integrative Genome-Wide Association Studies of Genetic and Genomic Data for Disease Susceptibility Using Mediation Analysis

2015 ◽  
Vol 39 (5) ◽  
pp. 347-356 ◽  
Author(s):  
Yen-Tsung Huang ◽  
Liming Liang ◽  
Miriam F. Moffatt ◽  
William O. C. M. Cookson ◽  
Xihong Lin
Author(s):  
Ting-Hao Chen ◽  
Chen-Cheng Yang ◽  
Kuei-Hau Luo ◽  
Chia-Yen Dai ◽  
Yao-Chung Chuang ◽  
...  

Aluminum (Al) toxicity is related to renal failure and the failure of other systems. Although there were some genome-wide association studies (GWAS) in Australia and England, there were no GWAS about Han Chinese to our knowledge. Thus, this research focused on using whole genomic genotypes from the Taiwan Biobank for exploring the association between Al concentrations in plasma and renal function. Participants, who underwent questionnaire interviews, biomarkers, and genotyping, were from the Taiwan Biobank database. Then, we measured their plasma Al concentrations with ICP-MS in the laboratory at Kaohsiung Medical University. We used this data to link genome-wide association (GWA) tests while looking for candidate genes and associated plasma Al concentration to renal function. Furthermore, we examined the path relationship between Single Nucleotide Polymorphisms (SNPs), Al concentrations, and estimated glomerular filtration rates (eGFR) through the mediation analysis with 3000 replication bootstraps. Following the principles of GWAS, we focused on three SNPs within the dipeptidyl peptidase-like protein 6 (DPP6) gene in chromosome 7, rs10224371, rs2316242, and rs10268004, respectively. The results of the mediation analysis showed that all of the selected SNPs have indirectly affected eGFR through a mediation of Al concentrations. Our analysis revealed the association between DPP6 SNPs, plasma Al concentrations, and eGFR. However, further longitudinal studies and research on mechanism are in need. Our analysis was still be the first study that explored the association between the DPP6, SNPs, and Al in plasma affecting eGFR.


2019 ◽  
Vol 9 (1) ◽  
Author(s):  
Meng-tse Gabriel Lee ◽  
Tzu-Chun Hsu ◽  
Shyr-Chyr Chen ◽  
Ya-Chin Lee ◽  
Po-Hsiu Kuo ◽  
...  

2020 ◽  
Vol 46 (1) ◽  
pp. 86-97 ◽  
Author(s):  
Timothy Reynolds ◽  
Emma C. Johnson ◽  
Spencer B. Huggett ◽  
Jason A. Bubier ◽  
Rohan H. C. Palmer ◽  
...  

AbstractGenome-wide association studies and other discovery genetics methods provide a means to identify previously unknown biological mechanisms underlying behavioral disorders that may point to new therapeutic avenues, augment diagnostic tools, and yield a deeper understanding of the biology of psychiatric conditions. Recent advances in psychiatric genetics have been made possible through large-scale collaborative efforts. These studies have begun to unearth many novel genetic variants associated with psychiatric disorders and behavioral traits in human populations. Significant challenges remain in characterizing the resulting disease-associated genetic variants and prioritizing functional follow-up to make them useful for mechanistic understanding and development of therapeutics. Model organism research has generated extensive genomic data that can provide insight into the neurobiological mechanisms of variant action, but a cohesive effort must be made to establish which aspects of the biological modulation of behavioral traits are evolutionarily conserved across species. Scalable computing, new data integration strategies, and advanced analysis methods outlined in this review provide a framework to efficiently harness model organism data in support of clinically relevant psychiatric phenotypes.


GigaScience ◽  
2020 ◽  
Vol 9 (8) ◽  
Author(s):  
Arash Bayat ◽  
Piotr Szul ◽  
Aidan R O’Brien ◽  
Robert Dunne ◽  
Brendan Hosking ◽  
...  

Abstract Background Many traits and diseases are thought to be driven by >1 gene (polygenic). Polygenic risk scores (PRS) hence expand on genome-wide association studies by taking multiple genes into account when risk models are built. However, PRS only considers the additive effect of individual genes but not epistatic interactions or the combination of individual and interacting drivers. While evidence of epistatic interactions ais found in small datasets, large datasets have not been processed yet owing to the high computational complexity of the search for epistatic interactions. Findings We have developed VariantSpark, a distributed machine learning framework able to perform association analysis for complex phenotypes that are polygenic and potentially involve a large number of epistatic interactions. Efficient multi-layer parallelization allows VariantSpark to scale to the whole genome of population-scale datasets with 100,000,000 genomic variants and 100,000 samples. Conclusions Compared with traditional monogenic genome-wide association studies, VariantSpark better identifies genomic variants associated with complex phenotypes. VariantSpark is 3.6 times faster than ReForeSt and the only method able to scale to ultra-high-dimensional genomic data in a manageable time.


2021 ◽  
Vol 80 (Suppl 1) ◽  
pp. 405-405
Author(s):  
L. Papageorgiou ◽  
H. Alkenaris ◽  
M. Zervou ◽  
D. Vlachakis ◽  
G. Goulielmos ◽  
...  

Background:Genome wide association studies (GWAS) have successfully identified novel autoimmune disease-associated loci, with many of them shared by multiple disease-associated pathways but much of the genetics and pathophysiological mechanisms remain still obscure [1-3]. SLE is a chronic, highly heterogeneous autoimmune disease, characterized by differences in autoantibody profile, serum cytokines, and a multi-system involvement [4]. Epione Application is an integrated bioinformatics web-tool designed to assist medical experts and researchers in the process of diagnosing SLE [5].Objectives:To identify the most credible gene variants and single nucleotide polymorphisms (SNPs), causing SLE using the genomic data provided for the patient and aid the medical expert in SLE diagnosis [5].Methods:In the present study, we have analyzed more than 70.000 SLE-related publications using data mining and semantic techniques towards extracting the SLE -related genes and SNPs [6]. The extracted knowledge has been filtered, evaluated, annotated, classified, and stored in the Epione Application Database (EAD) (Figure 1). Moreover, an updated gene regulatory network with the genes implements in SLE has been estimated [7]. This was followed by the design and development of the Epione application, in which the generated datasets and results were included. The application has been tested and presented here with WES data from several related patients with SLE [8].Results:SLE-related SNPs and variants identified in genome-wide association studies (GWAS), whole-genome (WGS), whole-exome (WES), or targeted sequencing information are classified, annotated, and analyzed in an integrated patient profile with clinical significance information. Probable genes associated with the patient’s genomic profile are visualized with several graphs, including chromosome ideograms, statistic bars, and regulatory networks through data mining studies with relative publications, to obtain a representative number of the most credible candidate genes and biological pathways associated with the SLE. An evaluation study was performed on 7 patients from a three-generation family with SLE [9]. All the recognized gene variants that were previously considered SLE-associated were properly identified in the output profile per patient, and by comparing the results, new findings have emerged.Conclusion:The Epione application was designed to assist medical doctor diagnosis from the early stages by using the patients’ genomic data [5, 8, 10]. Its diagnosis-oriented output presents the patient profile through which the user is provided with a structured set of results in various categories, which are generated based on the list of the most predictable candidate gene variants related to SLE. This novel and accessible webserver tool of SLE to assist medical experts in the clinical genomics and precision medicine procedure is available at http://143.233.188.162/epione/.References:[1]Molineros JE et al. (2017). Hum Mol Genet 26:1205-1216.[2]Sciascia S et al. (2018). F1000 Res, 2018:1-17.[3]Gonzalez-Serna D et al. (2020). Sci Rep 10:1862.[4]Harley JB et al. (2006). Springer Semin Immun 28:119–130.[5]Goulielmos GN et al. (2018). Gene 668:59-72.[6]Zhao Y et al. (2020). Front Genet 11:400.[7]Song YL and Chen S (2009). Interdiscip Sci 1:179-186.[8]Koile D et al. (2018). BMC Bioinformatics 19:25.[9]Albertsen HM et al. (2019). Mol Med Rep 19:1716-1720.[10]Ebrahimiyan H et al. (2018). Meta Gene 16:241-247.Figure 1.The Epione application database (EAD) for SLE.Disclosure of Interests:None declared


2020 ◽  
Vol 36 (10) ◽  
pp. 3004-3010
Author(s):  
Huang Xu ◽  
Xiang Li ◽  
Yaning Yang ◽  
Yi Li ◽  
Jose Pinheiro ◽  
...  

Abstract Motivation With the emerging of high-dimensional genomic data, genetic analysis such as genome-wide association studies (GWAS) have played an important role in identifying disease-related genetic variants and novel treatments. Complex longitudinal phenotypes are commonly collected in medical studies. However, since limited analytical approaches are available for longitudinal traits, these data are often underutilized. In this article, we develop a high-throughput machine learning approach for multilocus GWAS using longitudinal traits by coupling Empirical Bayesian Estimates from mixed-effects modeling with a novel ℓ0-norm algorithm. Results Extensive simulations demonstrated that the proposed approach not only provided accurate selection of single nucleotide polymorphisms (SNPs) with comparable or higher power but also robust control of false positives. More importantly, this novel approach is highly scalable and could be approximately >1000 times faster than recently published approaches, making genome-wide multilocus analysis of longitudinal traits possible. In addition, our proposed approach can simultaneously analyze millions of SNPs if the computer memory allows, thereby potentially allowing a true multilocus analysis for high-dimensional genomic data. With application to the data from Alzheimer's Disease Neuroimaging Initiative, we confirmed that our approach can identify well-known SNPs associated with AD and were much faster than recently published approaches (≥6000 times). Availability and implementation The source code and the testing datasets are available at https://github.com/Myuan2019/EBE_APML0. Supplementary information Supplementary data are available at Bioinformatics online.


Sign in / Sign up

Export Citation Format

Share Document