Functional Genetic Biomarkers of Alzheimer’s Disease and Gene Expression from Peripheral Blood

Mapping Intimacies ◽

10.1101/2021.01.15.426891 ◽

2021 ◽

Author(s):

Andrew Ni ◽

Amish Sethi ◽

Keyword(s):

Gene Expression ◽

Alzheimer's Disease ◽

Peripheral Blood ◽

Genetic Variants ◽

Cell Activation ◽

Association Studies ◽

Gene Set Enrichment Analysis ◽

Machine Learning Techniques ◽

Genome Wide Association Studies

Abstract: Detecting Alzheimer’s Disease (AD) at the earliest possible stage is key to advancing AD prevention and treatment, but it is complicated by normal aging processes and by other confounding neurodegenerative diseases. Recent genome-wide association studies (GWAS) have identified associated alleles, but it has been difficult to move from non-coding genetic variants to the underlying mechanisms of AD. Here, we sought to reveal functional genetic variants and diagnostic biomarkers underlying AD using machine learning techniques. We first developed a Random Forest (RF) classifier using microarray gene expression data sampled from the peripheral blood of 744 participants in the Alzheimer’s Disease Neuroimaging Initiative (ADNI) cohort. After initial feature selection, 5-fold cross-validation of the 100-gene RF classifier achieved an accuracy of 99.04%. The high accuracy of the RF classifier supports the possibility of a powerful and minimally invasive tool for AD screening. Next, unsupervised clustering was used to validate and identify relationships among the differentially expressed genes (DEGs) selected by the RF, revealing 3 distinct AD clusters. Results suggest downregulation of global sulfatase and oxidoreductase activities in AD through mutations in SUMF1 and SMOX, respectively. Then, we used Greedy Fast Causal Inference (GFCI) to find potential causes of AD within the DEGs. In the causal graph, HLA-DPB1 and CYP4A11 emerge as hub genes, furthering the discussion of the immune system’s role in AD. Finally, we used Gene Set Enrichment Analysis (GSEA) to determine the biological pathways and processes underlying the DEGs that were highly correlated with AD. Cell activation in the immune system, glycosaminoglycan (GAG) binding, vascular dysfunction, oxidative stress, and the neuronal apoptotic process were revealed to be significantly enriched in AD. This study further advances the possibility of low-cost and noninvasive genetic screening for AD while also providing potential gene targets for further experimentation.
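
The 5-fold cross-validation figure quoted above can be read as the mean held-out accuracy over five disjoint test folds. The following is a minimal sketch of that procedure only, under loud assumptions: the data are synthetic, the function names are hypothetical, and a nearest-centroid rule stands in for the Random Forest so that the example needs nothing beyond the Python standard library. It is not the authors' pipeline.

```python
import random

def k_fold_indices(n_samples, k, seed=0):
    """Shuffle sample indices and deal them into k nearly equal folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    return [indices[i::k] for i in range(k)]

def centroid(rows):
    """Mean expression vector of a list of samples."""
    return [sum(col) / len(rows) for col in zip(*rows)]

def nearest_centroid_predict(train_X, train_y, x):
    """Stand-in classifier (NOT a Random Forest): choose the closest class mean."""
    best_label, best_dist = None, float("inf")
    for label in sorted(set(train_y)):
        c = centroid([r for r, y in zip(train_X, train_y) if y == label])
        dist = sum((a - b) ** 2 for a, b in zip(x, c))
        if dist < best_dist:
            best_label, best_dist = label, dist
    return best_label

def cross_val_accuracy(X, y, k=5, seed=0):
    """Average accuracy on each held-out fold after training on the other k-1."""
    scores = []
    for held_out in k_fold_indices(len(X), k, seed):
        held = set(held_out)
        train_idx = [i for i in range(len(X)) if i not in held]
        train_X = [X[i] for i in train_idx]
        train_y = [y[i] for i in train_idx]
        correct = sum(
            nearest_centroid_predict(train_X, train_y, X[i]) == y[i]
            for i in held_out
        )
        scores.append(correct / len(held_out))
    return sum(scores) / k

# Synthetic toy data (an assumption for illustration, not ADNI data):
# 50 "control" and 50 "case" samples, 4 made-up expression features each.
rng = random.Random(42)
controls = [[rng.gauss(0.0, 1.0) for _ in range(4)] for _ in range(50)]
cases = [[rng.gauss(3.0, 1.0) for _ in range(4)] for _ in range(50)]
X = controls + cases
y = [0] * 50 + [1] * 50

accuracy = cross_val_accuracy(X, y, k=5)
print(f"mean 5-fold accuracy: {accuracy:.3f}")
```

One design point worth noting: any feature selection should be repeated inside each training fold, since selecting features on the full dataset before splitting can inflate the cross-validated accuracy.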

Integrating Transcriptomics, Genomics, and Imaging in Alzheimer's Disease: A Federated Model

10.1101/2021.09.14.460367 ◽

2021 ◽

Author(s):

Jianfeng Wu ◽

Yanxi Chen ◽

Panwen Wang ◽

Richard J Caselli ◽

Paul M Thompson ◽

...

Keyword(s):

Gene Expression ◽

Alzheimer's Disease ◽

Imaging Modality ◽

Association Studies ◽

Imaging Genetics ◽

Health Concern ◽

Genome Wide Association Studies ◽

Imaging Data ◽

Stable Performance

Alzheimer's disease (AD) affects more than 1 in 9 people aged 65 and older and is becoming an urgent public health concern as the global population ages. In clinical practice, structural magnetic resonance imaging (sMRI) is the most accessible and widely used diagnostic imaging modality. Additionally, genome-wide association studies (GWAS) and transcriptomics, the study of gene expression, also play an important role in understanding AD etiology and progression. Sophisticated imaging genetics systems have been developed to discover genetic factors that consistently affect brain function and structure. However, most studies to date have focused on the relationships between brain sMRI and GWAS or between brain sMRI and transcriptomics. To our knowledge, few methods have been developed to discover and infer multimodal relationships among sMRI, GWAS, and transcriptomics. To address this, we propose a novel federated model, Genotype-Expression-Imaging Data Integration (GEIDI), to identify genetic and transcriptomic influences on brain sMRI measures. The relationships between brain imaging measures and gene expression are allowed to depend on a person's genotype at the single-nucleotide polymorphism (SNP) level, making the inferences adaptive and personalized. We performed extensive experiments on the publicly available Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset. Experimental results demonstrated that our proposed method outperformed state-of-the-art expression quantitative trait loci (eQTL) methods for detecting genetic and transcriptomic factors related to AD and has stable performance when data are integrated from multiple sites. Our GEIDI approach may offer novel insights into the relationships among imaging biomarkers, genotypes, and gene expression and help discover novel genetic targets for potential AD drug treatments.

Genetic variants influencing human aging from late-onset Alzheimer's disease (LOAD) genome-wide association studies (GWAS)

Neurobiology of Aging ◽

10.1016/j.neurobiolaging.2012.02.014 ◽

2012 ◽

Vol 33 (8) ◽

pp. 1849.e5-1849.e18 ◽

Cited By ~ 23

Author(s):

Hui Shi ◽

Olivia Belbin ◽

Christopher Medway ◽

Kristelle Brown ◽

Noor Kalsheker ◽

...

Keyword(s):

Alzheimer's Disease ◽

Genetic Variants ◽

Late Onset ◽

Association Studies ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Human Aging ◽

Genome Wide

Predicting late-onset Alzheimer’s disease from genomic data using deep neural networks

10.1101/629402 ◽

2019 ◽

Author(s):

Javier de Velasco Oriol ◽

Edgar E. Vallejo ◽

Karol Estrada ◽

Keyword(s):

Neural Networks ◽

Alzheimer's Disease ◽

Genetic Variants ◽

Deep Neural Networks ◽

Late Onset ◽

Association Studies ◽

Genome Wide Association Studies ◽

Clinical Markers ◽

Genome Wide

Abstract: Alzheimer’s disease (AD) is the leading form of dementia. Over 25 million cases have been estimated worldwide, and this number is predicted to double every 20 years. Even though a variety of clinical markers are available for the diagnosis of AD, accurate and timely diagnosis of this disease remains elusive. Recently, over a dozen genetic variants predisposing to the disease have been identified by genome-wide association studies. However, these genetic variants explain only a small fraction of the estimated genetic component of the disease. Therefore, useful predictions of AD from genetic data cannot rely on these markers exclusively, as they are not sufficiently informative predictors. In this study, we propose the use of deep neural networks for the prediction of late-onset Alzheimer’s disease from a large number of genetic variants. Experimental results indicate that the proposed model holds promise for producing useful predictions for the clinical diagnosis of AD.

Lipid associated polygenic enrichment in Alzheimer’s disease

10.1101/383844 ◽

2018 ◽

Author(s):

Iris J. Broce ◽

Chin Hong Tan ◽

Chun Chieh Fan ◽

Aree Witoelar ◽

Natalie Wen ◽

...

Keyword(s):

Alzheimer's Disease ◽

Genetic Variants ◽

Plasma Lipids ◽

Association Studies ◽

Density Lipoprotein ◽

Genome Wide Association Studies ◽

Nucleotide Polymorphisms ◽

Genetic Pleiotropy ◽

Common Genetic Variants

Abstract: Cardiovascular (CV) and lifestyle associated risk factors (RFs) are increasingly recognized as important for Alzheimer’s disease (AD) pathogenesis. Beyond the ε4 allele of apolipoprotein E (APOE), comparatively little is known about whether CV associated genes also increase risk for AD (genetic pleiotropy). Using large genome-wide association studies (GWASs) (total n > 500,000 cases and controls) and validated tools to quantify genetic pleiotropy, we systematically identified single nucleotide polymorphisms (SNPs) jointly associated with AD and one or more CV RFs, namely body mass index (BMI), type 2 diabetes (T2D), coronary artery disease (CAD), waist hip ratio (WHR), total cholesterol (TC), low-density (LDL) and high-density lipoprotein (HDL). In fold enrichment plots, we observed robust genetic enrichment in AD as a function of plasma lipids (TC, LDL, and HDL); we found minimal AD genetic enrichment conditional on BMI, T2D, CAD, and WHR. Beyond APOE, at conjunction FDR < 0.05 we identified 57 SNPs on 19 different chromosomes that were jointly associated with AD and CV outcomes, including APOA4, ABCA1, ABCG5, LIPG, and MTCH2/SPI1. We found that common genetic variants influencing AD are associated with multiple CV RFs, at times with a different directionality of effect. Expression of these AD/CV pleiotropic genes was enriched for lipid metabolism processes, over-represented within astrocytes and vascular structures, highly co-expressed, and differentially altered within AD brains. Beyond APOE, we show that the polygenic component of AD is enriched for lipid associated RFs. Rather than a single causal link between genetic loci, RF and the outcome, we found that common genetic variants influencing AD are associated with multiple CV RFs. Our collective findings suggest that a network of genes involved in lipid biology also influences Alzheimer’s risk.

P1-262: Genetic Variants Influencing Human Longevity from Late-Onset Alzheimer's Disease (LOAD) Genome-Wide Association Studies (GWAS)

Alzheimer's & Dementia ◽

10.1016/j.jalz.2011.05.542 ◽

2011 ◽

Vol 7 ◽

pp. S195-S195

Author(s):

Hui Shi ◽

Christopher Medway ◽

Kristelle Brown ◽

Noor Kalsheker ◽

Alison Goate ◽

...

Keyword(s):

Alzheimer's Disease ◽

Genetic Variants ◽

Late Onset ◽

Association Studies ◽

Genome Wide Association ◽

Human Longevity ◽

Genome Wide Association Studies ◽

Genome Wide

Monocyte-specific changes in gene expression implicate LACTB2 and PLIN2 in Alzheimer’s disease

10.1101/2020.06.05.136275 ◽

2020 ◽

Author(s):

Janet C. Harwood ◽

Ganna Leonenko ◽

Rebecca Sims ◽

Valentina Escott-Price ◽

Julie Williams ◽

...

Keyword(s):

Gene Expression ◽

Alzheimer's Disease ◽

Association Studies ◽

Genetic Data ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Risk Genes ◽

Genome Wide ◽

Immune Pathways

Abstract: More than 50 genetic loci have been identified as being associated with Alzheimer’s disease (AD) from genome-wide association studies (GWAS), and many of these are involved in immune pathways and lipid metabolism. Therefore, we performed a transcriptome-wide association study (TWAS) of immune-relevant cells to study the mis-regulation of genes implicated in AD. We used expression and genetic data from naive and induced CD14+ monocytes and two GWAS of AD to study genetically controlled gene expression in monocytes at different stages of differentiation, and compared the results with those from TWAS of brain and blood. We identified nine genes with statistically independent TWAS signals: seven are known AD risk genes from GWAS (BIN1, PTK2B, SPI1, MS4A4A, MS4A6E, APOE, and PVR) and two, LACTB2 and PLIN2/ADRP, are novel candidate genes for AD. Three genes, SPI1, PLIN2 and LACTB2, are TWAS significant specifically in monocytes. LACTB2 is a mitochondrial endoribonuclease, and PLIN2/ADRP associates with intracellular neutral lipid storage droplets (LSDs), which have been shown to play a role in the regulation of the immune response. Notably, LACTB2 and PLIN2 were not detected from GWAS alone.

Centenarian Controls Increase Variant Effect-sizes by an average two-fold in an Extreme Case-Extreme Control Analysis of Alzheimer’s Disease

10.1101/298018 ◽

2018 ◽

Author(s):

Niccolò Tesi ◽

Sven J. van der Lee ◽

Marc Hulsman ◽

Iris E. Jansen ◽

Najada Stringa ◽

...

Keyword(s):

Alzheimer's Disease ◽

Effect Size ◽

Genetic Variants ◽

Association Studies ◽

Effect Sizes ◽

Small Samples ◽

Control Analysis ◽

Genome Wide Association Studies ◽

Variant Effect

Abstract: The detection of genetic loci associated with Alzheimer’s disease (AD) requires large numbers of cases and controls because variant effect-sizes are mostly small. We hypothesized that variant effect-sizes should increase when individuals who represent the extreme ends of a disease spectrum are considered, as their genomes are assumed to be maximally enriched or depleted with disease-associated genetic variants. We used 1,073 extensively phenotyped AD cases with relatively young age at onset as extreme cases (66.3±7.9 years), 1,664 age-matched controls (66.0±6.5 years) and 255 cognitively healthy centenarians as extreme controls (101.4±1.3 years). We estimated the effect-size of 29 variants that were previously associated with AD in genome-wide association studies. Comparing extreme AD cases with centenarian controls increased the variant effect-size relative to published effect-sizes by on average 1.90-fold (SE=0.29, p=9.0×10⁻⁴). The effect-size increase was largest for the rare high-impact TREM2 (R74H) variant (6.5-fold), and significant for variants in/near ECHDC3 (4.6-fold), SLC24A4-RIN3 (4.5-fold), NME8 (3.8-fold), PLCG2 (3.3-fold), APOE-ε2 (2.2-fold) and APOE-ε4 (2.0-fold). Comparing extreme phenotypes enabled us to replicate the AD association for 10 variants (p<0.05) in relatively small samples. The increase in effect-sizes depended mainly on using centenarians as extreme controls: the average variant effect-size was not increased in a comparison of extreme AD cases and age-matched controls (0.94-fold, p=6.8×10⁻¹), suggesting that on average the tested genetic variants did not explain the extremity of the AD cases. In conclusion, using centenarians as extreme controls in AD case-control studies boosts the variant effect-size by on average two-fold, allowing the replication of disease associations in relatively small samples.
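
To make the "fold" figures above concrete, here is a small worked sketch. It assumes that a variant's effect-size is the natural log of its odds ratio and that the fold-change is the observed effect-size divided by the published one; the abstract does not spell out either definition. The variant names and odds ratios are invented for illustration and are not values from the study.

```python
import math

def effect_size(odds_ratio):
    """Effect-size on the log-odds scale (assumed definition)."""
    return math.log(odds_ratio)

def fold_change(observed_or, published_or):
    """Ratio of the observed effect-size to the published effect-size."""
    return effect_size(observed_or) / effect_size(published_or)

# Hypothetical variants: (odds ratio in the extreme comparison, published odds ratio).
# One risk-increasing variant (OR > 1) and one protective variant (OR < 1).
variants = {
    "variant_A": (1.44, 1.20),
    "variant_B": (0.64, 0.80),
}

folds = {name: fold_change(obs, pub) for name, (obs, pub) in variants.items()}
mean_fold = sum(folds.values()) / len(folds)

for name, value in folds.items():
    print(f"{name}: {value:.2f}-fold")
print(f"average: {mean_fold:.2f}-fold")
```

Working on the log scale lets risk-increasing and protective variants be compared with the same ratio, because a protective odds ratio moving further below 1 also yields a fold-change above 1.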

Deep learning-based identification of genetic variants: Application to Alzheimer's disease classification

10.1101/2021.07.19.21260789 ◽

2021 ◽

Author(s):

Taeho Jo ◽

Kwangsik Nho ◽

Paula Bice ◽

Andrew J Saykin

Keyword(s):

Alzheimer's Disease ◽

Deep Learning ◽

Genetic Variants ◽

Association Studies ◽

Classification Model ◽

High Dimensional ◽

Optimal Size ◽

Genome Wide Association Studies ◽

Genome Wide

Deep learning is a promising tool that uses nonlinear transformations to extract features from high-dimensional data. Although deep learning has been used in several genetic studies, it is challenging to apply in genome-wide association studies (GWAS) with high-dimensional genomic data. Here we propose a novel three-step deep learning approach to identify phenotype-related single nucleotide polymorphisms (SNPs) and develop accurate classification models. In the first step, we divided the whole genome into non-overlapping fragments of an optimal size and then ran a convolutional neural network (CNN) on each fragment to select phenotype-associated fragments. In the second step, using an overlapping window approach, we ran a CNN on the selected fragments to calculate phenotype influence scores (PIS) and identify phenotype-associated SNPs based on PIS. In the third step, we ran a CNN on all identified SNPs to develop a classification model. We tested our approach using genome-wide genotyping data for Alzheimer's disease (AD) (N=981; cognitively normal older adults (CN)=650 and AD=331). Our approach identified the well-known APOE region as the most significant genetic locus for AD. Our classification model achieved an area under the curve (AUC) of 0.82, which outperformed the traditional machine learning approaches Random Forest and XGBoost. By using a novel deep learning-based GWAS approach, we were able to identify AD-associated SNPs and develop a better classification model for AD.
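
The partitioning used in the first two steps can be sketched as follows. This covers only the index arithmetic: the CNN, the fragment-selection rule, and the PIS calculation are not described in enough detail here to reproduce, so they are left out. The function names, fragment size, window size, and stride are all assumptions chosen for illustration.

```python
def non_overlapping_fragments(n_snps, fragment_size):
    """Step 1 sketch: tile SNP indices [0, n_snps) into consecutive
    half-open (start, end) fragments; the last one may be shorter."""
    return [
        (start, min(start + fragment_size, n_snps))
        for start in range(0, n_snps, fragment_size)
    ]

def overlapping_windows(start, end, window_size, stride):
    """Step 2 sketch: slide a fixed-size window across one selected
    fragment [start, end) so that neighbouring windows overlap."""
    if window_size >= end - start:
        return [(start, end)]
    windows = [
        (s, s + window_size)
        for s in range(start, end - window_size + 1, stride)
    ]
    # Add a final window flush with the fragment end if the stride left a gap.
    if windows[-1][1] < end:
        windows.append((end - window_size, end))
    return windows

# Toy sizes for readability; a real genome-wide array has far more SNPs.
fragments = non_overlapping_fragments(n_snps=10, fragment_size=4)
windows = overlapping_windows(0, 8, window_size=4, stride=2)

print(fragments)  # [(0, 4), (4, 8), (8, 10)]
print(windows)    # [(0, 4), (2, 6), (4, 8)]
```

With a stride smaller than the window size, each SNP away from the fragment edges falls in more than one window, which is what allows a per-SNP score to be aggregated across windows.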

Alzheimer's disease variant portal (ADVP): a catalog of genetic findings for Alzheimer's disease

10.1101/2020.09.29.20203950 ◽

2020 ◽

Author(s):

Pavel P Kuksa ◽

Chia-Lun Lui ◽

Wei Fu ◽

Liming Qu ◽

Yi Zhao ◽

...

Keyword(s):

Alzheimer's Disease ◽

Functional Genomics ◽

Genetic Variants ◽

Disease Risk ◽

Association Studies ◽

Genome Wide Association Studies ◽

Genetic Associations ◽

Genome Wide ◽

Disease Variant

Background: Alzheimer's disease (AD) genetic findings span progressively larger genome-wide association studies (GWASs) for various outcomes and populations. These genetic findings are obtained from a single GWAS or from joint or meta-analyses of multiple GWAS datasets. However, no single resource provides harmonized and searchable information on all AD genetic associations obtained from these analyses, or links the identified genetic variants and reported genes with other supporting functional genomic evidence. Methods: We created the Alzheimer's Disease Variant Portal (ADVP), which provides unified access to a uniquely extensive collection of high-quality GWAS association results for AD. Records in ADVP are curated from the genome-wide significant and suggestive loci reported in the AD genetics literature. ADVP contains curated results from all AD GWAS publications by the Alzheimer's Disease Genetics Consortium (ADGC) since 2009 and AD GWAS publications identified from other public catalogs (GWAS catalog). Genetic association information was systematically extracted from these publications, harmonized, and organized into three types of tables. These tables included structured publication, variant, and association categories to ensure consistent representation of all AD genetic findings. All extracted AD genetic associations were further annotated and integrated with the NIAGADS Genomics DB in order to provide extensive biological and functional genomics annotations. Results: Currently, ADVP contains 6,990 AD-association records curated from >200 AD GWAS publications, corresponding to >900 unique genomic loci and >1,800 unique genetic variants. The ADVP collection contains genetic findings from >80 cohorts and across various populations, including Caucasians, Hispanics, African-Americans, and Asians. Of all the association records, 46% are disease-risk, 13% are related to expression quantitative trait analyses, and 27% are related to AD endophenotypes and neuropathology. The ADVP web interface allows access to AD association records by individual variants, genes, publications, genomic regions of interest, and genome-wide interactive variant views. ADVP is integrated with the NIAGADS Alzheimer's Genomics Database. Researchers can explore additional biological annotations at the genetic variant or gene level and view cross-referenced functional genomics evidence provided by other public resources. Conclusions: ADVP is the largest, most up-to-date, and most comprehensive literature-derived collection of AD genetic associations. All records have been systematically curated, harmonized, and comprehensively annotated. ADVP is freely accessible at https://advp.niagads.org/.

A Systems Biology Approach for Hypothesizing the Effect of Genetic Variants on Neuroimaging Features in Alzheimer’s Disease

Journal of Alzheimer's Disease ◽

10.3233/jad-201397 ◽

2021 ◽

Vol 80 (2) ◽

pp. 831-840

Author(s):

Sepehr Golriz Khatami ◽

Daniel Domingo-Fernández ◽

Sarah Mubeen ◽

Charles Tapley Hoyt ◽

Christine Robinson ◽

...

Keyword(s):

Alzheimer's Disease ◽

Genetic Variants ◽

Large Scale ◽

Multiple Scales ◽

Association Studies ◽

Hippocampal Atrophy ◽

Genome Wide Association Studies ◽

Biological Processes ◽

Functional Interpretation

Background: Neuroimaging markers provide quantitative insight into brain structure and function in neurodegenerative diseases, such as Alzheimer’s disease, where we lack mechanistic insights to explain pathophysiology. These mechanisms are often mediated by genes and genetic variations and are often studied through the lens of genome-wide association studies. Linking these two disparate layers (i.e., imaging and genetic variation) through causal relationships between biological entities involved in the disease’s etiology would pave the way to large-scale mechanistic reasoning and interpretation. Objective: We explore how genetic variants may lead to functional alterations of intermediate molecular traits, which can further impact neuroimaging hallmarks over a series of biological processes across multiple scales. Methods: We present an approach in which knowledge pertaining to single nucleotide polymorphisms (SNPs) and imaging readouts is extracted from the literature, encoded in Biological Expression Language, and used in a novel workflow to assist in the functional interpretation of SNPs in a clinical context. Results: We demonstrate our approach in a case scenario that proposes KANSL1 as a candidate gene accounting for the clinically reported correlation between the incidence of the genetic variants and hippocampal atrophy. We find that the workflow prioritizes multiple mechanisms reported in the literature through which KANSL1 may have an impact on hippocampal atrophy, such as the dysregulation of cell proliferation, synaptic plasticity, and metabolic processes. Conclusion: We have presented an approach that enables pinpointing relevant genetic variants as well as investigating their functional role in biological processes spanning several diverse biological scales.