Gene4PD: a comprehensive genetic database of Parkinson's disease
Abstract BackgroundParkinson's disease (PD) is a complex neurodegenerative disorder with a strong genetic component. A growing number of variants and genes have been reported to be associated with PD; however, there is no database that integrate different type of genetic data, and support analyzing of PD-associated genes (PAGs).MethodsBy systematic review and curation of multiple lines of public studies, we integrate multiple layers of genetic data (rare variants and copy-number variants identified from patients with PD, associated variants identified from genome-wide association studies, differential expression genes, and differential DNA methylation genes) and clinical data in PD. A weighted scoring system was employed to prioritize PAGs. Permutation test and protein-protein interaction network was used to evaluate the interconnectivity and functional correlation among the PAGs. The relationship between AAO and PAGs was further analyzed. A PHP-based web framework was used to construct a database. ResultWe integrated five layers of genetic data with different levels of evidences from more than 3,000 studies and prioritized 124 PAGs with strong or suggestive evidences. These PAGs were identified to be significantly interacted with each other and formed an interconnected functional network enriched in several functional pathways involved in PD, suggesting these genes may contribute to the pathogenesis of PD, which also highlighting the reliability of these genetic data for PD. Furthermore, we identified 10 genes were associated with a juvenile-onset (age ≤ 30 years), 11 genes were associated with an early-onset (age of 30–50 years), whereas another 10 genes were associated with a late-onset (age > 50 years). Notably, the AAOs of patients with loss of function variants in five genes (GCH1, PINK1, PRKN, FBXO7, ATP13A2) were significantly lower than that of patients with deleterious missense variants, while patients with VPS13C (P = 0.01) was opposite. Finally, we developed an online database named Gene4PD (http://genemed.tech/gene4pd) which integrated published genetic data in PD, the PAGs, and 63 popular genomic data sources, as well as an online pipeline for prioritize risk variants in PD.ConclusionGene4PD provides researchers and clinicians comprehensive genetic knowledge and analytic platform for PD, and would also improve the understanding of pathogenesis in PD.Availability and Implementation: Gene4PD can be freely accessed at http://genemed.tech/gene4pd.