scholarly journals High-Throughput Screening and CRISPR-Cas9 Modeling of Causal Lipid-Associated Expression Quantitative Trait Locus Variants

2016 ◽  
Author(s):  
Avanthi Raghavan ◽  
Xiao Wang ◽  
Peter Rogov ◽  
Li Wang ◽  
Xiaolan Zhang ◽  
...  

AbstractGenome-wide association studies have identified a number of novel genetic loci linked to serum cholesterol and triglyceride levels. The causal DNA variants at these loci and the mechanisms by which they influence phenotype and disease risk remain largely unexplored. Expression quantitative trait locus analyses of patient liver and fat biopsies indicate that many lipid-associated variants influence gene expression in a cis-regulatory manner. However, linkage disequilibrium among neighboring SNPs at a genome-wide association study-implicated locus makes it challenging to pinpoint the actual variant underlying an association signal. We used a methodological framework for causal variant discovery that involves high-throughput identification of putative disease-causal loci through a functional reporter-based screen, the massively parallel reporter assay, followed by validation of prioritized variants in genome-edited human pluripotent stem cell models generated with CRISPR-Cas9. We complemented the stem cell models with CRISPR interference experiments in vitro and in knock-in mice in vivo. We provide validation for two high-priority SNPs, rs2277862 and rs10889356, being causal for lipid-associated expression quantitative trait loci. We also highlight the challenges inherent in modeling common genetic variation with these experimental approaches.Author SummaryGenome-wide association studies have identified numerous loci linked to a variety of clinical phenotypes. It remains a challenge to identify and validate the causal DNA variants in these loci. We describe the use of a high-throughput technique called the massively parallel reporter assay to analyze thousands of candidate causal DNA variants for their potential effects on gene expression. We use a combination of genome editing in human pluripotent stem cells, “CRISPR interference” experiments in other cultured human cell lines, and genetically modified mice to analyze the two highest-priority candidate DNA variants to emerge from the massively parallel reporter assay, and we confirm the relevance of the variants to nearby gene expression. These findings highlight a methodological framework with which to identify and functionally validate causal DNA variants.

2020 ◽  
Vol 127 (6) ◽  
pp. 761-777 ◽  
Author(s):  
Wilson Lek Wen Tan ◽  
Chukwuemeka George Anene-Nzelu ◽  
Eleanor Wong ◽  
Chang Jie Mick Lee ◽  
Hui San Tan ◽  
...  

Rationale: Identifying genetic markers for heterogeneous complex diseases such as heart failure is challenging and requires prohibitively large cohort sizes in genome-wide association studies to meet the stringent threshold of genome-wide statistical significance. On the other hand, chromatin quantitative trait loci, elucidated by direct epigenetic profiling of specific human tissues, may contribute toward prioritizing subthreshold variants for disease association. Objective: Here, we captured noncoding genetic variants by performing epigenetic profiling for enhancer H3K27ac chromatin immunoprecipitation followed by sequencing in 70 human control and end-stage failing hearts. Methods and Results: We have mapped a comprehensive catalog of 47 321 putative human heart enhancers and promoters. Three thousand eight hundred ninety-seven differential acetylation peaks (FDR [false discovery rate], 5%) pointed to pathways altered in heart failure. To identify cardiac histone acetylation quantitative trait loci (haQTLs), we regressed out confounding factors including heart failure disease status and used the G-SCI (Genotype-independent Signal Correlation and Imbalance) test 1 to call out 1680 haQTLs (FDR, 10%). RNA sequencing performed on the same heart samples proved a subset of haQTLs to have significant association also to gene expression (expression quantitative trait loci), either in cis (180) or through long-range interactions (81), identified by Hi-C (high-throughput chromatin conformation assay) and HiChIP (high-throughput protein centric chromatin) performed on a subset of hearts. Furthermore, a concordant relationship between the gain or disruption of TF (transcription factor)-binding motifs, inferred from alternative alleles at the haQTLs, implied a surprising direct association between these specific TF and local histone acetylation in human hearts. Finally, 62 unique loci were identified by colocalization of haQTLs with the subthreshold loci of heart-related genome-wide association studies datasets. Conclusions: Disease and phenotype association for 62 unique loci are now implicated. These loci may indeed mediate their effect through modification of enhancer H3K27 acetylation enrichment and their corresponding gene expression differences (bioRxiv: https://doi.org/10.1101/536763 ). Graphical Abstract: A graphical abstract is available for this article.


2018 ◽  
Author(s):  
Zhou Shaoqun ◽  
Karl A. Kremling ◽  
Bandillo Nonoy ◽  
Richter Annett ◽  
Ying K. Zhang ◽  
...  

One Sentence SummaryHPLC-MS metabolite profiling of maize seedlings, in combination with genome-wide association studies, identifies numerous quantitative trait loci that influence the accumulation of foliar metabolites.AbstractCultivated maize (Zea mays) retains much of the genetic and metabolic diversity of its wild ancestors. Non-targeted HPLC-MS metabolomics using a diverse panel of 264 maize inbred lines identified a bimodal distribution in the prevalence of foliar metabolites. Although 15% of the detected mass features were present in >90% of the inbred lines, the majority were found in <50% of the samples. Whereas leaf bases and tips were differentiated primarily by flavonoid abundance, maize varieties (stiff-stalk, non-stiff-stalk, tropical, sweet corn, and popcorn) were differentiated predominantly by benzoxazinoid metabolites. Genome-wide association studies (GWAS), performed for 3,991 mass features from the leaf tips and leaf bases, showed that 90% have multiple significantly associated loci scattered across the genome. Several quantitative trait locus hotspots in the maize genome regulate the abundance of multiple, often metabolically related mass features. The utility of maize metabolite GWAS was demonstrated by confirming known benzoxazinoid biosynthesis genes, as well as by mapping isomeric variation in the accumulation of phenylpropanoid hydroxycitric acid esters to a single linkage block in a citrate synthase-like gene. Similar to gene expression databases, this metabolomic GWAS dataset constitutes an important public resource for linking maize metabolites with biosynthetic and regulatory genes.


Circulation ◽  
2015 ◽  
Vol 132 (suppl_3) ◽  
Author(s):  
Nathan R Tucker ◽  
Jiangchuan Ye ◽  
Honghuang Lin ◽  
Michael A McLellan ◽  
Emelia J Benjamin ◽  
...  

Introduction: Genome-wide association studies have identified 14 independent loci for atrial fibrillation (AF). The 4q25 locus upstream of the left-right asymmetry gene PITX2 is, by far, the strongest association signal for AF. However, as with most GWAS loci, the functional variants are noncoding, presumed to be regulatory, and remain unknown. We therefore sought to rapidly identify the functional variants at an AF locus by combining high throughput sequencing and massively parallel reporter assays. Methods and Results: We sequenced a ~750kb region encompassing the PITX2 locus in 462 individuals with early-onset AF from the MGH AF Study and 464 referents from the Framingham Heart Study. The SNP most significantly associated with AF in our sequenced sample was rs2129983, which is 140kb from PITX2 (OR=2.43, P =8.9X10 -16 ). rs2129983 is approximately 1.7kb from the most significantly associated SNP in a prior AF GWAS, rs6817105 (r 2 =0.52). From the targeted sequencing analysis, we identified 262 SNVs with a MAF >0.5% within a genomic region bounded by SNPs with an r2 greater than 0.4 with the top variant. To identify functional variants, we then utilized a massively parallel reporter assay (MPRA) in order to measure enhancer activity at each SNP across the entire AF locus. In both HL-1 and C2C12 myoblasts, MPRA identified many distinct SNP regions with differential enhancer activity. Using AF-association status as a standard, we were able to identify a series of variants that have both differential activity in either cell line tested and also a high level of association (rs17042076, rs4469143). Mechanistically, these functional SNPs are predicted to alter transcription factor binding. Conclusions: We have comprehensively identified the AF-associated variation at 4q25 and determined which of these variants are functional through differential enhancer activity. Here, in addition to identifying the causative variation for AF at 4q25, we provide a generalizable pathway for translating this work to other loci, a method that could expedite the identification of causative genetic variants at other disease loci.


2020 ◽  
Vol 24 ◽  
pp. 100145 ◽  
Author(s):  
Mohsen Mohammadi ◽  
Alencar Xavier ◽  
Travis Beckett ◽  
Savannah Beyer ◽  
Liyang Chen ◽  
...  

Neurology ◽  
2010 ◽  
Vol 74 (6) ◽  
pp. 480-486 ◽  
Author(s):  
F. Zou ◽  
M. M. Carrasquillo ◽  
V. S. Pankratz ◽  
O. Belbin ◽  
K. Morgan ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document