Diverse functions associate with trans-species polymorphisms in humans

AbstractLong-term balancing selection (LTBS) can maintain allelic variation at a locus over millions of years and through speciation events. Variants shared between species, hereafter “trans-species polymorphisms” (TSPs), often result from LTBS due to host-pathogen interactions. For instance, the major histocompatibility complex (MHC) locus contains TSPs present across primates. Several hundred TSPs have been identified in humans and chimpanzees; however, because many are in non-coding regions of the genome, the functions and adaptive roles for most TSPs remain unknown. We integrated diverse genomic annotations to explore the functions of 125 previously identified non-coding TSPs that are likely under LTBS since the common ancestor of humans and chimpanzees. We analyzed genome-wide functional assays, expression quantitative trait loci (eQTL), genome-wide association studies (GWAS), and phenome-wide association studies (PheWAS). We identify functional annotations for 119 TSP regions, including 71 with evidence of gene regulatory function from GTEx or genome-wide functional genomics data and 21 with evidence of trait association from GWAS and PheWAS. TSPs in humans associate with many immune system phenotypes, including response to pathogens, but we also find associations with a range of other phenotypes, including body mass, alcohol intake, urate levels, chronotype, and risk-taking behavior. The diversity of traits associated with non-coding human TSPs suggest that functions beyond the immune system are often subject to LTBS. Furthermore, several of these trait associations provide support and candidate genetic loci for previous hypothesis about behavioral diversity in great ape populations, such as the importance of variation in sleep cycles and risk sensitivity.Significance statementMost genetic variants present in human populations are young (<100,000 years old); however, a few hundred are millions of years old with origins before the divergence of humans and chimpanzees. These trans-species polymorphisms (TSPs) were likely maintained by balancing selection—evolutionary pressure to maintain genetic diversity at a locus. However, the functions driving this selection, especially for non-coding TSPs, are largely unknown. We integrate genome-wide annotation strategies to demonstrate TSP associations with immune system function, behavior (addition, cognition, risky behavior), uric acid metabolism, and many other phenotypes. These results substantially expand our understanding of functions TSPs and suggest a substantial role for balancing selection beyond the immune system.

Download Full-text

Diverse Functions Associate With Non-Coding Trans-species Polymorphisms in Humans

10.21203/rs.3.rs-559297/v1 ◽

2021 ◽

Author(s):

Keila Velazquez-Arcelay ◽

Mary Lauren Benton ◽

John A. Capra

Keyword(s):

Immune System ◽

Allelic Variation ◽

Balancing Selection ◽

Association Studies ◽

Regulatory Function ◽

Genome Wide Association Studies ◽

Functional Annotations ◽

Coding Regions ◽

Genome Wide ◽

Mhc Locus

Abstract Background: Long-term balancing selection (LTBS) can maintain allelic variation at a locus over millions of years and through speciation events. Variants shared between species, hereafter “trans-species polymorphisms” (TSPs), often result from LTBS due to host-pathogen interactions. For instance, the major histocompatibility complex (MHC) locus contains TSPs present across primates. Several hundred candidate TSPs have been identified in humans and chimpanzees; however, because many are in non-coding regions of the genome, the functions and adaptive roles for most TSPs remain unknown. Results: We integrated diverse genomic annotations, with a focus on non-coding regions, to explore the functions of 125 previously identified regions containing multiple TSPs in humans and chimpanzees. We analyzed genome-wide functional assays, expression quantitative trait loci (eQTL), genome-wide association studies (GWAS), and phenome-wide association studies (PheWAS). We identify functional annotations for 119 TSP regions, including 71 with evidence of gene regulatory function from GTEx or genome-wide functional genomics data and 21 with evidence of trait association from GWAS and PheWAS. TSPs in humans associate with many immune system phenotypes, including response to pathogens, but we also find associations with a range of other phenotypes, including body mass, alcohol intake, urate levels, chronotype, and risk-taking behavior. Conclusions: The diversity of traits associated with non-coding human TSPs further support previous hypotheses that functions beyond the immune system are subject to LTBS. Furthermore, several of these trait associations provide support and candidate genetic loci for previous hypothesis about behavioral diversity in great ape populations, such as the importance of variation in sleep cycles and risk sensitivity.

Download Full-text

Contributions of common genetic variants to risk of schizophrenia among individuals of African and Latino ancestry

Molecular Psychiatry ◽

10.1038/s41380-019-0517-y ◽

2019 ◽

Vol 25 (10) ◽

pp. 2455-2467 ◽

Cited By ~ 12

Author(s):

Tim B. Bigdeli ◽

◽

Giulio Genovese ◽

Penelope Georgakopoulos ◽

Jacquelyn L. Meyers ◽

...

Keyword(s):

Genetic Variants ◽

Association Studies ◽

Large Fraction ◽

African Ancestry ◽

Common Variant ◽

Human Populations ◽

Polygenic Risk Score ◽

Genome Wide Association Studies ◽

Genome Wide ◽

Common Genetic Variants

Abstract Schizophrenia is a common, chronic and debilitating neuropsychiatric syndrome affecting tens of millions of individuals worldwide. While rare genetic variants play a role in the etiology of schizophrenia, most of the currently explained liability is within common variation, suggesting that variation predating the human diaspora out of Africa harbors a large fraction of the common variant attributable heritability. However, common variant association studies in schizophrenia have concentrated mainly on cohorts of European descent. We describe genome-wide association studies of 6152 cases and 3918 controls of admixed African ancestry, and of 1234 cases and 3090 controls of Latino ancestry, representing the largest such study in these populations to date. Combining results from the samples with African ancestry with summary statistics from the Psychiatric Genomics Consortium (PGC) study of schizophrenia yielded seven newly genome-wide significant loci, and we identified an additional eight loci by incorporating the results from samples with Latino ancestry. Leveraging population differences in patterns of linkage disequilibrium, we achieve improved fine-mapping resolution at 22 previously reported and 4 newly significant loci. Polygenic risk score profiling revealed improved prediction based on trans-ancestry meta-analysis results for admixed African (Nagelkerke’s R2 = 0.032; liability R2 = 0.017; P < 10−52), Latino (Nagelkerke’s R2 = 0.089; liability R2 = 0.021; P < 10−58), and European individuals (Nagelkerke’s R2 = 0.089; liability R2 = 0.037; P < 10−113), further highlighting the advantages of incorporating data from diverse human populations.

Download Full-text

Genetics of hypertension: discoveries from the bench to human populations

AJP Renal Physiology ◽

10.1152/ajprenal.00334.2013 ◽

2014 ◽

Vol 306 (1) ◽

pp. F1-F11 ◽

Cited By ~ 19

Author(s):

Nora Franceschini ◽

Thu H. Le

Keyword(s):

Association Studies ◽

Complex Trait ◽

Experimental Models ◽

Sodium Reabsorption ◽

Human Populations ◽

Genome Wide Association Studies ◽

Blood Pressure Homeostasis ◽

Genome Wide ◽

Clinical Hypertension ◽

Renal Sodium Reabsorption

Hypertension is a complex trait that is influenced by both heritable and environmental factors. The search for genes accounting for the susceptibility to hypertension has driven parallel efforts in human research and in research using experimental animals in controlled environmental settings. Evidence from rodent models of genetic hypertension and human Mendelian forms of hypertension and hypotension have yielded mechanistic insights into the pathways that are perturbed in blood pressure homeostasis, most of which converge at the level of renal sodium reabsorption. However, the bridging of evidence from these very diverse approaches to identify mechanisms underlying hypertension susceptibility and the translation of these findings to human populations and public health remain a challenge. Furthermore, findings from genome-wide association studies still require functional validation in experimental models. In this review, we highlight results and implications from key studies in experimental and clinical hypertension to date.

Download Full-text

Integrative Tissue-Specific Functional Annotations in the Human Genome Provide Novel Insights on Many Complex Traits and Improve Signal Prioritization in Genome Wide Association Studies

PLoS Genetics ◽

10.1371/journal.pgen.1005947 ◽

2016 ◽

Vol 12 (4) ◽

pp. e1005947 ◽

Cited By ~ 56

Author(s):

Qiongshi Lu ◽

Ryan Lee Powles ◽

Qian Wang ◽

Beixin Julie He ◽

Hongyu Zhao

Keyword(s):

Human Genome ◽

Complex Traits ◽

Association Studies ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Tissue Specific ◽

Functional Annotations ◽

Genome Wide

Download Full-text

Signals of polygenic adaptation on height have been overestimated due to uncorrected population structure in genome-wide association studies

10.1101/355057 ◽

2018 ◽

Cited By ~ 19

Author(s):

Mashaal Sohail ◽

Robert M. Maier ◽

Andrea Ganna ◽

Alex Bloemendal ◽

Alicia R. Martin ◽

...

Keyword(s):

Population Structure ◽

Association Studies ◽

Meta Analysis ◽

Human Populations ◽

Genome Wide Association Studies ◽

Multiple Traits ◽

Large Numbers ◽

Genome Wide ◽

Polygenic Adaptation ◽

The Uk

AbstractGenetic predictions of height differ among human populations and these differences are too large to be explained by genetic drift. This observation has been interpreted as evidence of polygenic adaptation. Differences across populations were detected using SNPs genome-wide significantly associated with height, and many studies also found that the signals grew stronger when large numbers of subsignificant SNPs were analyzed. This has led to excitement about the prospect of analyzing large fractions of the genome to detect subtle signals of selection and claims of polygenic adaptation for multiple traits. Polygenic adaptation studies of height have been based on SNP effect size measurements in the GIANT Consortium meta-analysis. Here we repeat the height analyses in the UK Biobank, a much more homogeneously designed study. Our results show that polygenic adaptation signals based on large numbers of SNPs below genome-wide significance are extremely sensitive to biases due to uncorrected population structure.

Download Full-text

Genome-wide genetic data on ~500,000 UK Biobank participants

10.1101/166298 ◽

2017 ◽

Cited By ~ 303

Author(s):

Clare Bycroft ◽

Colin Freeman ◽

Desislava Petkova ◽

Gavin Band ◽

Lloyd T. Elliott ◽

...

Keyword(s):

Quality Control ◽

Allelic Variation ◽

Association Studies ◽

Genetic Data ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Genotype Data ◽

Uk Biobank ◽

Genome Wide ◽

Wide Range

AbstractThe UK Biobank project is a large prospective cohort study of ~500,000 individuals from across the United Kingdom, aged between 40-69 at recruitment. A rich variety of phenotypic and health-related information is available on each participant, making the resource unprecedented in its size and scope. Here we describe the genome-wide genotype data (~805,000 markers) collected on all individuals in the cohort and its quality control procedures. Genotype data on this scale offers novel opportunities for assessing quality issues, although the wide range of ancestries of the individuals in the cohort also creates particular challenges. We also conducted a set of analyses that reveal properties of the genetic data – such as population structure and relatedness – that can be important for downstream analyses. In addition, we phased and imputed genotypes into the dataset, using computationally efficient methods combined with the Haplotype Reference Consortium (HRC) and UK10K haplotype resource. This increases the number of testable variants by over 100-fold to ~96 million variants. We also imputed classical allelic variation at 11 human leukocyte antigen (HLA) genes, and as a quality control check of this imputation, we replicate signals of known associations between HLA alleles and many common diseases. We describe tools that allow efficient genome-wide association studies (GWAS) of multiple traits and fast phenome-wide association studies (PheWAS), which work together with a new compressed file format that has been used to distribute the dataset. As a further check of the genotyped and imputed datasets, we performed a test-case genome-wide association scan on a well-studied human trait, standing height.

Download Full-text

Genomic dissection of 43 serum urate-associated loci provides multiple insights into molecular mechanisms of urate control

10.1101/743864 ◽

2019 ◽

Cited By ~ 2

Author(s):

James Boocock ◽

Megan Leask ◽

Yukinori Okada ◽

Hirotaka Matsuo ◽

Yusuke Kawamura ◽

...

Keyword(s):

Gene Expression ◽

Molecular Mechanisms ◽

Association Studies ◽

Organic Anion ◽

Serum Urate ◽

Regulatory Function ◽

Eqtl Analysis ◽

Genome Wide Association Studies ◽

Genome Wide ◽

Causal Genes

AbstractSerum urate is the end-product of purine metabolism. Elevated serum urate is causal of gout and a predictor of renal disease, cardiovascular disease and other metabolic conditions. Genome-wide association studies (GWAS) have reported dozens of loci associated with serum urate control, however there has been little progress in understanding the molecular basis of the associated loci. Here we employed trans-ancestral meta-analysis using data from European and East Asian populations to identify ten new loci for serum urate levels. Genome-wide colocalization with cis-expression quantitative trait loci (eQTL) identified a further five new loci. By cis- and trans-eQTL colocalization analysis we identified 24 and 20 genes respectively where the causal eQTL variant has a high likelihood that it is shared with the serum urate-associated locus. One new locus identified was SLC22A9 that encodes organic anion transporter 7 (OAT7). We demonstrate that OAT7 is a very weak urate-butyrate exchanger. Newly implicated genes identified in the eQTL analysis include those encoding proteins that make up the dystrophin complex, a scaffold for signaling proteins and transporters at the cell membrane; MLXIP that, with the previously identified MLXIPL, is a transcription factor that may regulate serum urate via the pentose-phosphate pathway; and MRPS7 and IDH2 that encode proteins necessary for mitochondrial function. Trans-ancestral functional fine-mapping identified six loci (RREB1, INHBC, HLF, UBE2Q2, SFMBT1, HNF4G) with colocalized eQTL that contained putative causal SNPs (posterior probability of causality > 0.8). This systematic analysis of serum urate GWAS loci has identified candidate causal genes at 19 loci and a network of previously unidentified genes likely involved in control of serum urate levels, further illuminating the molecular mechanisms of urate control.Author SummaryHigh serum urate is a prerequisite for gout and a risk factor for metabolic disease. Previous GWAS have identified numerous loci that are associated with serum urate control, however, only a small handful of these loci have known molecular consequences. The majority of loci are within the non-coding regions of the genome and therefore it is difficult to ascertain how these variants might influence serum urate levels without tangible links to gene expression and / or protein function. We have applied a novel bioinformatic pipeline where we combined population-specific GWAS data with gene expression and genome connectivity information to identify putative causal genes for serum urate associated loci. Overall, we identified 15 novel serum urate loci and show that these loci along with previously identified loci are linked to the expression of 44 genes. We show that some of the variants within these loci have strong predicted regulatory function which can be further tested in functional analyses. This study expands on previous GWAS by identifying further loci implicated in serum urate control and new causal mechanisms supported by gene expression changes.

Download Full-text

LPCAT1 controls phosphate homeostasis in a zinc-dependent manner

eLife ◽

10.7554/elife.32077 ◽

2018 ◽

Vol 7 ◽

Cited By ~ 19

Author(s):

Mushtak Kisko ◽

Nadia Bouain ◽

Alaeddine Safi ◽

Anna Medici ◽

Robert C Akkers ◽

...

Keyword(s):

Allelic Variation ◽

Association Studies ◽

Essential Elements ◽

Zn Deficiency ◽

Genome Wide Association Studies ◽

Dependent Manner ◽

Loss Of Function ◽

Genome Wide ◽

Genetic Mechanisms ◽

Living Organisms

All living organisms require a variety of essential elements for their basic biological functions. While the homeostasis of nutrients is highly intertwined, the molecular and genetic mechanisms of these dependencies remain poorly understood. Here, we report a discovery of a molecular pathway that controls phosphate (Pi) accumulation in plants under Zn deficiency. Using genome-wide association studies, we first identified allelic variation of the Lyso-PhosphatidylCholine (PC) AcylTransferase 1 (LPCAT1) gene as the key determinant of shoot Pi accumulation under Zn deficiency. We then show that regulatory variation at the LPCAT1 locus contributes significantly to this natural variation and we further demonstrate that the regulation of LPCAT1 expression involves bZIP23 TF, for which we identified a new binding site sequence. Finally, we show that in Zn deficient conditions loss of function of LPCAT1 increases the phospholipid Lyso-PhosphatidylCholine/PhosphatidylCholine ratio, the expression of the Pi transporter PHT1;1, and that this leads to shoot Pi accumulation.

Download Full-text

Trans-ethnic meta-analysis identifies new loci associated with longitudinal blood pressure traits

Scientific Reports ◽

10.1038/s41598-021-83450-3 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Mateus H. Gouveia ◽

Amy R. Bentley ◽

Hampton Leonard ◽

Karlijn A. C. Meeks ◽

Kenneth Ekoru ◽

...

Keyword(s):

Blood Pressure ◽

Association Studies ◽

Meta Analysis ◽

Regulatory Function ◽

Genome Wide Association Studies ◽

The Novel ◽

Cross Sectional ◽

Age Related ◽

Genome Wide ◽

The Us

AbstractGenome-wide association studies (GWAS) have identified thousands of genetic loci associated with cross-sectional blood pressure (BP) traits; however, GWAS based on longitudinal BP have been underexplored. We performed ethnic-specific and trans-ethnic GWAS meta-analysis using longitudinal and cross-sectional BP data of 33,720 individuals from five cohorts in the US and one in Brazil. In addition to identifying several known loci, we identified thirteen novel loci with nine based on longitudinal and four on cross-sectional BP traits. Most of the novel loci were ethnic- or study-specific, with the majority identified in African Americans (AA). Four of these discoveries showed additional evidence of association in independent datasets, including an intergenic variant (rs4060030, p = 7.3 × 10–9) with reported regulatory function. We observed a high correlation between the meta-analysis results for baseline and longitudinal average BP (rho = 0.48). BP trajectory results were more correlated with those of average BP (rho = 0.35) than baseline BP(rho = 0.18). Heritability estimates trended higher for longitudinal traits than for cross-sectional traits, providing evidence for different genetic architectures. Furthermore, the longitudinal data identified up to 20% more BP known associations than did cross-sectional data. Our analyses of longitudinal BP data in diverse ethnic groups identified novel BP loci associated with BP trajectory, indicating a need for further longitudinal GWAS on BP and other age-related traits.

Download Full-text

GARFIELD - GWAS Analysis of Regulatory or Functional Information Enrichment with LD correction

10.1101/085738 ◽

2016 ◽

Cited By ~ 17

Author(s):

Valentina Iotchkova ◽

Graham R.S. Ritchie ◽

Matthias Geihs ◽

Sandro Morganella ◽

Josine L. Min ◽

...

Keyword(s):

Association Studies ◽

R Package ◽

Genome Wide Association Studies ◽

Protein Coding ◽

Functional Annotations ◽

Novel Approach ◽

Genome Wide ◽

Functional Consequences ◽

Genomic Regions ◽

Coding Variants

Loci discovered by genome-wide association studies (GWAS) predominantly map outside protein-coding genes. The interpretation of functional consequences of non-coding variants can be greatly enhanced by catalogs of regulatory genomic regions in cell lines and primary tissues. However, robust and readily applicable methods are still lacking to systematically evaluate the contribution of these regions to genetic variation implicated in diseases or quantitative traits. Here we propose a novel approach that leverages GWAS findings with regulatory or functional annotations to classify features relevant to a phenotype of interest. Within our framework, we account for major sources of confounding that current methods do not offer. We further assess enrichment statistics for 27 GWAS traits within regulatory regions from the ENCODE and Roadmap projects. We characterise unique enrichment patterns for traits and annotations, driving novel biological insights. The method is implemented in standalone software and R package to facilitate its application by the research community.

Download Full-text