scholarly journals Disentangling genetic feature selection and aggregation in transcriptome-wide association studies

2020 ◽  
Author(s):  
Chen Cao ◽  
Devin Kwok ◽  
Qing Li ◽  
Jingni He ◽  
Xingyi Guo ◽  
...  

ABSTRACTThe success of transcriptome-wide association studies (TWAS) has led to substantial research towards improving its core component of genetically regulated expression (GReX). GReX links expression information with phenotype by serving as both the outcome of genotype-based expression models and the predictor for downstream association testing. In this work, we demonstrate that current linear models of GReX inadvertently combine two separable steps of machine learning - feature selection and aggregation - which can be independently replaced to improve overall power. We show that the monolithic approach of GReX limits the adaptability of TWAS methodology and practice, especially given low expression heritability.

Genetics ◽  
2021 ◽  
Author(s):  
Chen Cao ◽  
Pathum Kossinna ◽  
Devin Kwok ◽  
Qing Li ◽  
Jingni He ◽  
...  

Abstract The success of transcriptome-wide association studies (TWAS) has led to substantial research towards improving the predictive accuracy of its core component of Genetically Regulated eXpression (GReX). GReX links expression information with genotype and phenotype by playing two roles simultaneously: it acts as both the outcome of the genotype-based predictive models (for predicting expressions) and the linear combination of genotypes (as the predicted expressions) for association tests. From the perspective of machine learning (considering SNPs as features), these are actually two separable steps—feature selection and feature aggregation—which can be independently conducted. In this work, we show that the single approach of GReX limits the adaptability of TWAS methodology and practice. By conducting simulations and real data analysis, we demonstrate that disentangled protocols adapting straightforward approaches for feature selection (e.g., simple marker test) and aggregation (e.g., kernel machines) outperform the standard TWAS protocols that rely on GReX. Our development provides more powerful novel tools for conducting TWAS. More importantly, our characterization of the exact nature of TWAS suggests that, instead of questionably binding two distinct steps into the same statistical form (GReX), methodological research focusing on optimal combinations of feature selection and aggregation approaches will bring higher power to TWAS protocols.


2011 ◽  
Vol 51 (4) ◽  
pp. 810-820 ◽  
Author(s):  
Sérgio Francisco da Silva ◽  
Marcela Xavier Ribeiro ◽  
João do E.S. Batista Neto ◽  
Caetano Traina-Jr. ◽  
Agma J.M. Traina

Sign in / Sign up

Export Citation Format

Share Document