eqtl mapping
Recently Published Documents


TOTAL DOCUMENTS

103
(FIVE YEARS 40)

H-INDEX

20
(FIVE YEARS 2)

2021 ◽  
Vol 12 ◽  
Author(s):  
Cheng Gao ◽  
Hairong Wei ◽  
Kui Zhang

Characterization of genetic variations that are associated with gene expression levels is essential to understand cellular mechanisms that underline human complex traits. Expression quantitative trait loci (eQTL) mapping attempts to identify genetic variants, such as single nucleotide polymorphisms (SNPs), that affect the expression of one or more genes. With the availability of a large volume of gene expression data, it is necessary and important to develop fast and efficient statistical and computational methods to perform eQTL mapping for such large scale data. In this paper, we proposed a new method, the low rank penalized regression method (LORSEN), for eQTL mapping. We evaluated and compared the performance of LORSEN with two existing methods for eQTL mapping using extensive simulations as well as real data from the HapMap3 project. Simulation studies showed that our method outperformed two commonly used methods for eQTL mapping, LORS and FastLORS, in many scenarios in terms of area under the curve (AUC). We illustrated the usefulness of our method by applying it to SNP variants data and gene expression levels on four chromosomes from the HapMap3 Project.


2021 ◽  
Vol 12 ◽  
Author(s):  
Xianxian Liu ◽  
Junjie Zhang ◽  
Xinwei Xiong ◽  
Congying Chen ◽  
Yuyun Xing ◽  
...  

Understanding the genetic factors behind meat quality traits is of great significance to animal breeding and production. We previously conducted a genome-wide association study (GWAS) for meat quality traits in a White Duroc × Erhualian F2 pig population using Illumina porcine 60K SNP data. Here, we further investigate the functional candidate genes and their network modules associated with meat quality traits by integrating transcriptomics and GWAS information. Quantitative trait transcript (QTT) analysis, gene expression QTL (eQTL) mapping, and weighted gene co-expression network analysis (WGCNA) were performed using the digital gene expression (DGE) data from 493 F2 pig’s muscle and liver samples. Among the quantified 20,108 liver and 23,728 muscle transcripts, 535 liver and 1,014 muscle QTTs corresponding to 416 and 721 genes, respectively, were found to be significantly (p < 5 × 10−4) correlated with 22 meat quality traits measured on longissiums dorsi muscle (LM) or semimembranosus muscle (SM). Transcripts associated with muscle glycolytic potential (GP) and pH values were enriched for genes involved in metabolic process. There were 42 QTTs (for 32 genes) shared by liver and muscle tissues, of which 10 QTTs represent GP- and/or pH-related genes, such as JUNB, ATF3, and PPP1R3B. Furthermore, a genome-wide eQTL mapping revealed a total of 3,054 eQTLs for all annotated transcripts in muscle (p < 2.08 × 10−5), including 1,283 cis-eQTLs and 1771 trans-eQTLs. In addition, WGCNA identified five modules relevant to glycogen metabolism pathway and highlighted the connections between variations in meat quality traits and genes involved in energy process. Integrative analysis of GWAS loci, eQTL, and QTT demonstrated GALNT15/GALNTL2 and HTATIP2 as strong candidate genes for drip loss and pH drop from postmortem 45 min to 24 h, respectively. Our findings provide valuable insights into the genetic basis of meat quality traits and greatly expand the number of candidate genes that may be valuable for future functional analysis and genetic improvement of meat quality.


Author(s):  
Xin Zhou ◽  
Xiaodong Cai

Abstract Motivation Genetic variations of expression quantitative trait loci (eQTLs) play a critical role in influencing complex traits and diseases development. Two main factors that affect the statistical power of detecting eQTLs are: 1) relatively small size of samples available, and 2) heavy burden of multiple testing due to a very large number of variants to be tested. The later issue is particularly severe when one tries to identify trans-eQTLs that are far away from the genes they influence. If one can exploit co-expressed genes jointly in eQTL-mapping, effective sample size can be increased. Furthermore, using the structure of the gene regulatory network (GRN) may help to identify trans-eQTLs without increasing multiple testing burden. Results In this paper, we employ the structure equation model (SEM) to model both GRN and effect of eQTLs on gene expression, and then develop a novel algorithm, named sparse SEM for eQTL mapping (SSEMQ), to conduct joint eQTL mapping and GRN inference. The SEM can exploit co-expressed genes jointly in eQTL mapping and also use GRN to determine trans-eQTLs. Computer simulations demonstrate that our SSEMQ significantly outperforms nine existing eQTL mapping methods. SSEMQ is further employed to analyze two real datasets of human breast and whole blood tissues, yielding a number of cis- and trans-eQTLs. Availability R package ssemQr is available at https://github.com/Ivis4ml/ssemQr.git. Supplementary information Supplementary data are available at Bioinformatics online.


2021 ◽  
Vol 129 (Suppl_1) ◽  
Author(s):  
Buyan-Ochir Orgil ◽  
Fuyi Xu ◽  
Undral Munkhsaikhan ◽  
Neely R Alberson ◽  
Jason Johnson ◽  
...  

Background: Causal and modifier genes, genetic background and environment underlie clinical heterogeneity in cardiomyopathy (CM). The BXD recombinant inbred (RI) family represents a murine genetic reference population (GRP) that are descendants from crosses between C57BL/6J (B6) and DBA/2J (D2) mice. The parental D2 mouse is a natural model of hypertrophic CM (HCM). The study aimed to dissect genetic architecture of cardiac traits in BXD GRP using systems genetics analysis. Methods: Echocardiography was performed in 88 strains of male (M) and female (F) BXDs (N>5/sex) at 4-5 months of age. Cardiac traits were then associated with heart transcriptome, and expression quantitative trait loci ( eQTL) mapping was performed. Results: More than 2-fold variance in ejection fraction (EF%), fractional shortening (FS%), left ventricular (LV) volumes at end-systole and end-diastole (Vol;s and Vol;d), internal dimensions (ID;s and ID;d), posterior wall (PW), and interventricular septum (IVS) thickness was found among BXDs. Traits seen in dilated CM (DCM) patients such as reduced EF%, FS%, and LVPW and increased Vol;s and ID;s are found in BXD78 (M, F), BXD32, 111, and 68 (F) strains. Strains D2, BXD90 and 155 (M, F), BXD44 and 65 (M), and BXD113, 16, 77 (F) had significantly greater LV mass, LVPW and IVS thickness compared to sex-matched controls, suggestive for traits seen in HCM patients. A 6.4 Mb QTL (peak LRS=18.50) was identified on chromosome (Chr) 8 to be significantly associated with ID;s, ID;d, Vol;s and Vol;d among male BXDs. eQTL mapping for each of 131 genes on Chr8 QTL identified 6 genes ( Coq9 , Ndrg4 , Crnde, Irx3, Rpgrip1l, and Rbl2 ) being cis -regulated and Ndrg, Slc6a2 and Ces1d being significantly (p < 0.05) correlated with LV volumes. In female BXDs, a significant QTL on Chr7 (40.2 Mb) with 9 genes that significantly correlated with LVPW;d was identified. A suggestive 92.6 Mb QTL on Chr3 with Snapin , Tpm3 , and Wars2 correlated with EF% and FS% (p < 0.05). Conclusions: Our study found cardiomyopathy-associated traits are segregated among BXD family and these traits vary among BXD lines. Multiple associated QTLs demonstrate that the BXD family is suitable to map gene variants and identify genetic factors and modifiers that influence cardiomyopathy phenotypes.


2021 ◽  
Vol 22 (S9) ◽  
Author(s):  
Tao Wang ◽  
Yongzhuang Liu ◽  
Junpeng Ruan ◽  
Xianjun Dong ◽  
Yadong Wang ◽  
...  

Abstract Background Advances in the expression quantitative trait loci (eQTL) studies have provided valuable insights into the mechanism of diseases and traits-associated genetic variants. However, it remains challenging to evaluate and control the quality of multi-source heterogeneous eQTL raw data for researchers with limited computational background. There is an urgent need to develop a powerful and user-friendly tool to automatically process the raw datasets in various formats and perform the eQTL mapping afterward. Results In this work, we present a pipeline for eQTL analysis, termed eQTLQC, featured with automated data preprocessing for both genotype data and gene expression data. Our pipeline provides a set of quality control and normalization approaches, and utilizes automated techniques to reduce manual intervention. We demonstrate the utility and robustness of this pipeline by performing eQTL case studies using multiple independent real-world datasets with RNA-seq data and whole genome sequencing (WGS) based genotype data. Conclusions eQTLQC provides a reliable computational workflow for eQTL analysis. It provides standard quality control and normalization as well as eQTL mapping procedures for eQTL raw data in multiple formats. The source code, demo data, and instructions are freely available at https://github.com/stormlovetao/eQTLQC.


2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Anna S. E. Cuomo ◽  
Giordano Alvari ◽  
Christina B. Azodi ◽  
Davis J. McCarthy ◽  
Marc Jan Bonder ◽  
...  

Abstract Background Single-cell RNA sequencing (scRNA-seq) has enabled the unbiased, high-throughput quantification of gene expression specific to cell types and states. With the cost of scRNA-seq decreasing and techniques for sample multiplexing improving, population-scale scRNA-seq, and thus single-cell expression quantitative trait locus (sc-eQTL) mapping, is increasingly feasible. Mapping of sc-eQTL provides additional resolution to study the regulatory role of common genetic variants on gene expression across a plethora of cell types and states and promises to improve our understanding of genetic regulation across tissues in both health and disease. Results While previously established methods for bulk eQTL mapping can, in principle, be applied to sc-eQTL mapping, there are a number of open questions about how best to process scRNA-seq data and adapt bulk methods to optimize sc-eQTL mapping. Here, we evaluate the role of different normalization and aggregation strategies, covariate adjustment techniques, and multiple testing correction methods to establish best practice guidelines. We use both real and simulated datasets across single-cell technologies to systematically assess the impact of these different statistical approaches. Conclusion We provide recommendations for future single-cell eQTL studies that can yield up to twice as many eQTL discoveries as default approaches ported from bulk studies.


2021 ◽  
Author(s):  
Dustin G. Wilkerson ◽  
Chase R. Crowell ◽  
Craig H. Carlson ◽  
Patrick W. McMullen ◽  
Christine D. Smart ◽  
...  

Abstract Background: Melampsora spp. rusts are the greatest pathogen threat to shrub willow (Salix spp.) bioenergy crops. Genetic resistance is key to limit the effects of these foliar diseases on host response and biomass yield, however, the genetic basis of host resistance has not been characterized. The addition of new genomic resources for Salix provides greater power to investigate the interaction between S. purpurea and M. americana, species commonly found in the Northeast US. Here, we utilize 3' RNA-seq to investigate host-pathogen interactions following controlled inoculations of M. americana on resistant and susceptible F2 S. purpurea genotypes identified in a recent QTL mapping study. Differential gene expression, network analysis, and eQTL mapping was used to contrast the response to inoculation and to identify associated candidate genes.Results: Controlled inoculation in a replicated greenhouse study identified 19 and 105 differentially expressed genes between resistant and susceptible genotypes at 42 and 66 HPI, respectively. Defense response gene networks were activated in both resistant and susceptible genotypes and enriched for many of the same defense response genes, yet the hub genes of these common response modules showed greater mean expression among the resistant plants. Further, eight and six eQTL hotspots were identified at 42 and 66 HPI, respectively. The combined results of the three analyses highlight 124 candidate genes in the host for further analysis while analysis of pathogen RNA showed differential expression of 22 genes, two of which are candidate pathogen effectors. Conclusions: We identified two differentially expressed M. americana transcripts and 124 S. purpurea genes that are good candidates for future studies to confirm their role in conferring resistance.


2021 ◽  
Author(s):  
Diptavo Dutta ◽  
Yuan He ◽  
Ashis Saha ◽  
Marios Arvanitis ◽  
Alexis Battle ◽  
...  

Abstract Large scale genetic association studies have identified many trait-associated variants and understanding the role of these variants in downstream regulation of gene-expressions can uncover important mediating biological mechanisms. In this study, we propose Aggregative tRans assoCiation to detect pHenotype specIfic gEne-sets (ARCHIE), as a method to establish links between sets of known genetic variants associated with a trait and sets of co-regulated gene-expressions through trans associations. ARCHIE employs sparse canonical correlation analysis based on summary statistics from trans-eQTL mapping and genotype and expression correlation matrices constructed from external data sources. A resampling based procedure is then used to test for significant trait-specific trans-association patterns in the background of highly polygenic regulation of gene-expression. Simulation studies show that compared to standard trans-eQTL analysis, ARCHIE is better suited to identify “core”-like genes through which effects of many other genes may be mediated and which can explain disease specific patterns of genetic associations. By applying ARCHIE to available trans-eQTL summary statistics reported by the eQTLGen consortium, we identify 71 gene networks which have significant evidence of trans-association with groups of known genetic variants across 29 complex traits. Around half (50.7%) of the selected genes do not have any strong trans-associations and could not have been detected by standard trans-eQTL mapping. We provide further evidence for causal basis of the target genes through a series of follow-up analyses. These results show ARCHIE is a powerful tool for identifying sets of genes whose trans regulation may be related to specific complex traits. The method has potential for broader applications for identification of networks of various types of molecular traits which mediates complex traits genetic associations.


2021 ◽  
Vol 13 (1) ◽  
Author(s):  
Silva Kasela ◽  
◽  
Victor E. Ortega ◽  
Molly Martorella ◽  
Suresh Garudadri ◽  
...  

Abstract Background The large airway epithelial barrier provides one of the first lines of defense against respiratory viruses, including SARS-CoV-2 that causes COVID-19. Substantial inter-individual variability in individual disease courses is hypothesized to be partially mediated by the differential regulation of the genes that interact with the SARS-CoV-2 virus or are involved in the subsequent host response. Here, we comprehensively investigated non-genetic and genetic factors influencing COVID-19-relevant bronchial epithelial gene expression. Methods We analyzed RNA-sequencing data from bronchial epithelial brushings obtained from uninfected individuals. We related ACE2 gene expression to host and environmental factors in the SPIROMICS cohort of smokers with and without chronic obstructive pulmonary disease (COPD) and replicated these associations in two asthma cohorts, SARP and MAST. To identify airway biology beyond ACE2 binding that may contribute to increased susceptibility, we used gene set enrichment analyses to determine if gene expression changes indicative of a suppressed airway immune response observed early in SARS-CoV-2 infection are also observed in association with host factors. To identify host genetic variants affecting COVID-19 susceptibility in SPIROMICS, we performed expression quantitative trait (eQTL) mapping and investigated the phenotypic associations of the eQTL variants. Results We found that ACE2 expression was higher in relation to active smoking, obesity, and hypertension that are known risk factors of COVID-19 severity, while an association with interferon-related inflammation was driven by the truncated, non-binding ACE2 isoform. We discovered that expression patterns of a suppressed airway immune response to early SARS-CoV-2 infection, compared to other viruses, are similar to patterns associated with obesity, hypertension, and cardiovascular disease, which may thus contribute to a COVID-19-susceptible airway environment. eQTL mapping identified regulatory variants for genes implicated in COVID-19, some of which had pheWAS evidence for their potential role in respiratory infections. Conclusions These data provide evidence that clinically relevant variation in the expression of COVID-19-related genes is associated with host factors, environmental exposures, and likely host genetic variation.


Sign in / Sign up

Export Citation Format

Share Document