scholarly journals Joint variable selection and network modeling for detecting eQTLs

Author(s):  
Xuan Cao ◽  
Lili Ding ◽  
Tesfaye B. Mersha

AbstractIn this study, we conduct a comparison of three most recent statistical methods for joint variable selection and covariance estimation with application of detecting expression quantitative trait loci (eQTL) and gene network estimation, and introduce a new hierarchical Bayesian method to be included in the comparison. Unlike the traditional univariate regression approach in eQTL, all four methods correlate phenotypes and genotypes by multivariate regression models that incorporate the dependence information among phenotypes, and use Bayesian multiplicity adjustment to avoid multiple testing burdens raised by traditional multiple testing correction methods. We presented the performance of three methods (MSSL – Multivariate Spike and Slab Lasso, SSUR – Sparse Seemingly Unrelated Bayesian Regression, and OBFBF – Objective Bayes Fractional Bayes Factor), along with the proposed, JDAG (Joint estimation via a Gaussian Directed Acyclic Graph model) method through simulation experiments, and publicly available HapMap real data, taking asthma as an example. Compared with existing methods, JDAG identified networks with higher sensitivity and specificity under row-wise sparse settings. JDAG requires less execution in small-to-moderate dimensions, but is not currently applicable to high dimensional data. The eQTL analysis in asthma data showed a number of known gene regulations such as STARD3, IKZF3 and PGAP3, all reported in asthma studies. The code of the proposed method is freely available at GitHub (https://github.com/xuan-cao/Joint-estimation-for-eQTL).

2019 ◽  
Vol 4 (1) ◽  
pp. e000273
Author(s):  
Irina Balikova ◽  
Laurence Postelmans ◽  
Brigitte Pasteels ◽  
Pascale Coquelet ◽  
Janet Catherine ◽  
...  

ObjectiveAge-related macular degeneration (ARMD) is a leading cause of visual impairment. Intravitreal injections of anti-vascular endothelial growth factor (VEGF) are the standard treatment for wet ARMD. There is however, variability in patient responses, suggesting patient-specific factors influencing drug efficacy. We tested whether single nucleotide polymorphisms (SNPs) in genes encoding VEGF pathway members contribute to therapy response.Methods and analysisA retrospective cohort of 281 European wet ARMD patients treated with anti-VEGF was genotyped for 138 tagging SNPs in the VEGF pathway. Per patient, we collected best corrected visual acuity at baseline, after three loading injections and at 12 months. We also registered the injection number and changes in retinal morphology after three loading injections (central foveal thickness (CFT), intraretinal cysts and serous neuroepithelium detachment). Changes in CFT after 3 months were our primary outcome measure. Association of SNPs to response was assessed by binomial logistic regression. Replication was attempted by associating visual acuity changes to genotypes in an independent Japanese cohort.ResultsAssociation with treatment response was detected for seven SNPs, including in FLT4 (rs55667289: OR=0.746, 95% CI 0.63 to 0.88, p=0.0005) and KDR (rs7691507: OR=1.056, 95% CI 1.02 to 1.10, p=0.005; and rs2305945: OR=0.963, 95% CI 0.93 to 1.00, p=0.0472). Only association with rs55667289 in FLT4 survived multiple testing correction. This SNP was unavailable for testing in the replication cohort. Of six SNPs tested for replication, one was significant although not after multiple testing correction.ConclusionIdentifying genetic variants that define treatment response can help to develop individualised therapeutic approaches for wet ARMD patients and may point towards new targets in non-responders.


2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Sangyoon Yi ◽  
Xianyang Zhang ◽  
Lu Yang ◽  
Jinyan Huang ◽  
Yuanhang Liu ◽  
...  

AbstractOne challenge facing omics association studies is the loss of statistical power when adjusting for confounders and multiple testing. The traditional statistical procedure involves fitting a confounder-adjusted regression model for each omics feature, followed by multiple testing correction. Here we show that the traditional procedure is not optimal and present a new approach, 2dFDR, a two-dimensional false discovery rate control procedure, for powerful confounder adjustment in multiple testing. Through extensive evaluation, we demonstrate that 2dFDR is more powerful than the traditional procedure, and in the presence of strong confounding and weak signals, the power improvement could be more than 100%.


2021 ◽  
pp. 014662162110146
Author(s):  
Justin L. Kern ◽  
Edison Choe

This study investigates using response times (RTs) with item responses in a computerized adaptive test (CAT) setting to enhance item selection and ability estimation and control for differential speededness. Using van der Linden’s hierarchical framework, an extended procedure for joint estimation of ability and speed parameters for use in CAT is developed following van der Linden; this is called the joint expected a posteriori estimator (J-EAP). It is shown that the J-EAP estimate of ability and speededness outperforms the standard maximum likelihood estimator (MLE) of ability and speededness in terms of correlation, root mean square error, and bias. It is further shown that under the maximum information per time unit item selection method (MICT)—a method which uses estimates for ability and speededness directly—using the J-EAP further reduces average examinee time spent and variability in test times between examinees above the resulting gains of this selection algorithm with the MLE while maintaining estimation efficiency. Simulated test results are further corroborated with test parameters derived from a real data example.


Biomedicines ◽  
2021 ◽  
Vol 9 (5) ◽  
pp. 528
Author(s):  
Shu-Pin Huang ◽  
Yei-Tsung Chen ◽  
Lih-Chyang Chen ◽  
Cheng-Hsueh Lee ◽  
Chao-Yuan Huang ◽  
...  

Neuregulins (NRGs) activate receptor tyrosine kinases of the ErbB family, and play essential roles in the proliferation, survival, and differentiation of normal and malignant tissue cells. We hypothesized that genetic variants of NRG signalling pathway genes may influence treatment outcomes in prostate cancer. To test this hypothesis, we performed a comprehensive analysis to evaluate the associations of 459 single-nucleotide polymorphisms in 19 NRG pathway genes with cancer-specific survival (CSS), overall survival (OS), and progression-free survival (PFS) in 630 patients with prostate cancer receiving androgen-deprivation therapy (ADT). After multivariate Cox regression and multiple testing correction, we found that NRG1 rs144160282 C > T is significantly associated with worsening CSS, OS, and PFS during ADT. Further analysis showed that low expression of NRG1 is closely related to prostate cancer, as indicated by a high Gleason score, an advanced stage, and a shorter PFS rate. Meta-analysis of 16 gene expression datasets of 1,081 prostate cancer samples and 294 adjacent normal samples indicate lower NRG1 expression in the former compared with the latter (p < 0.001). These results suggest that NRG1 rs144160282 might be a prognostic predictor of the efficacy of ADT. Further studies are required to confirm the significance of NRG1 as a biomarker and therapeutic target for prostate cancer.


2018 ◽  
Vol 2018 ◽  
pp. 1-13
Author(s):  
JungJun Lee ◽  
SungHwan Kim ◽  
Jae-Hwan Jhong ◽  
Ja-Yong Koo

In genomic data analysis, it is commonplace that underlying regulatory relationship over multiple genes is hardly ascertained due to unknown genetic complexity and epigenetic regulations. In this paper, we consider a joint mean and constant covariance model (JMCCM) that elucidates conditional dependent structures of genes with controlling for potential genotype perturbations. To this end, the modified Cholesky decomposition is utilized to parametrize entries of a precision matrix. The JMCCM maximizes the likelihood function to estimate parameters involved in the model. We also develop a variable selection algorithm that selects explanatory variables and Cholesky factors by exploiting the combination of the GCV and BIC as benchmarks, together with Rao and Wald statistics. Importantly, we notice that sparse estimation of a precision matrix (or equivalently gene network) is effectively achieved via the proposed variable selection scheme and contributes to exploring significant hub genes shown to be concordant to a priori biological evidence. In simulation studies, we confirm that our model selection efficiently identifies the true underlying networks. With an application to miRNA and SNPs data from yeast (a.k.a. eQTL data), we demonstrate that constructed gene networks reproduce validated biological and clinical knowledge with regard to various pathways including the cell cycle pathway.


PLoS ONE ◽  
2022 ◽  
Vol 17 (1) ◽  
pp. e0262093
Author(s):  
Mary K. Horton ◽  
Shannon McCurdy ◽  
Xiaorong Shao ◽  
Kalliope Bellesis ◽  
Terrence Chinn ◽  
...  

Background Adverse childhood experiences (ACEs) are linked to numerous health conditions but understudied in multiple sclerosis (MS). This study’s objective was to test for the association between ACEs and MS risk and several clinical outcomes. Methods We used a sample of adult, non-Hispanic MS cases (n = 1422) and controls (n = 1185) from Northern California. Eighteen ACEs were assessed including parent divorce, parent death, and abuse. Outcomes included MS risk, age of MS onset, Multiple Sclerosis Severity Scale score, and use of a walking aid. Logistic and linear regression estimated odds ratios (ORs) (and beta coefficients) and 95% confidence intervals (CIs) for ACEs operationalized as any/none, counts, individual events, and latent factors/patterns. Results Overall, more MS cases experienced ≥1 ACE compared to controls (54.5% and 53.8%, respectively). After adjusting for sex, birthyear, and race, this small difference was attenuated (OR = 1.01, 95% CI: 0.87, 1.18). There were no trends of increasing or decreasing odds of MS across ACE count categories. Consistent associations between individual ACEs between ages 0–10 and 11–20 years and MS risk were not detected. Factor analysis identified five latent ACE factors, but their associations with MS risk were approximately null. Age of MS onset and other clinical outcomes were not associated with ACEs after multiple testing correction. Conclusion Despite rich data and multiple approaches to operationalizing ACEs, no consistent and statistically significant effects were observed between ACEs with MS. This highlights the challenges of studying sensitive, retrospective events among adults that occurred decades before data collection.


2012 ◽  
Vol 53 ◽  
Author(s):  
Gintautas Jakimauskas ◽  
Leonidas Sakalauskas

The efficiency of adding an auxiliary regression variable to the logit model in estimation of small probabilities in large populations is considered. Let us consider two models of distribution of unknown probabilities: the probabilities have gamma distribution (model (A)), or logits of the probabilities have Gaussian distribution (model (B)). In modification of model (B) we will use additional regression variable for Gaussian mean (model (BR)). We have selected real data from Database of Indicators of Statistics Lithuania – Working-age persons recognized as disabled for the first time by administrative territory, year 2010 (number of populations K = 60). Additionally, we have used average annual population data by administrative territory. The auxiliary regression variable was based on data – Number of hospital discharges by administrative territory, year 2010. We obtained initial parameters using simple iterative procedures for models (A), (B) and (BR). At the second stage we performed various tests using Monte-Carlo simulation (using models (A), (B) and (BR)). The main goal was to select an appropriate model and to propose some recommendations for using gamma and logit (with or without auxiliary regression variable) models for Bayesian estimation. The results show that a Monte Carlo simulation method enables us to determine which estimation model is preferable.


2018 ◽  
Author(s):  
David M. Howard ◽  
Mark J. Adams ◽  
Toni-Kim Clarke ◽  
Jonathan D. Hafferty ◽  
Jude Gibson ◽  
...  

AbstractMajor depression is a debilitating psychiatric illness that is typically associated with low mood, anhedonia and a range of comorbidities. Depression has a heritable component that has remained difficult to elucidate with current sample sizes due to the polygenic nature of the disorder. To maximise sample size, we meta-analysed data on 807,553 individuals (246,363 cases and 561,190 controls) from the three largest genome-wide association studies of depression. We identified 102 independent variants, 269 genes, and 15 gene-sets associated with depression, including both genes and gene-pathways associated with synaptic structure and neurotransmission. Further evidence of the importance of prefrontal brain regions in depression was provided by an enrichment analysis. In an independent replication sample of 1,306,354 individuals (414,055 cases and 892,299 controls), 87 of the 102 associated variants were significant following multiple testing correction. Based on the putative genes associated with depression this work also highlights several potential drug repositioning opportunities. These findings advance our understanding of the complex genetic architecture of depression and provide several future avenues for understanding aetiology and developing new treatment approaches.


2009 ◽  
Vol 3 (0) ◽  
pp. 633-650 ◽  
Author(s):  
Chuanwen Chen ◽  
Arthur Cohen ◽  
Harold B. Sackrowitz

Sign in / Sign up

Export Citation Format

Share Document