Transcription factor regulation as a mechanism of confounding effects between distinct human traits
Genome-wide association studies (GWAS) to date have discovered thousands of genetic variants linked to human diseases and traits, which hold the potential to unravel the mechanisms of complex phenotypes. However, given that the majority of these associated variants reside in non-coding genomic regions, their predicted cis and trans-regulatory functions remain largely undefined. Here we show that correlation between human diseases and traits can follow geographical distribution of human populations, and that the underlying mechanism is at least partly genetically based. We report two Type 2 Diabetes (T2D) GWAS variants (rs7903146 and rs12255372) in the TCF7L2 locus that regulate expression in skin tissues but not lymphoblastoid or adipose tissues, of the KITLG gene that encodes an important regulator of melanogenesis and light hair color in European populations. We also report extensive binding events of TCF7L2 protein in the promoter region, immediate upstream region and first intron of the KITLG gene, which supports a trans-interaction between TCF7L2 and KITLG. We further show that both light hair color and T2D genetic variants are correlated with geographic latitude. Taken together, our observations suggest that natural variation in transcription factor loci in European human populations may be an underlying and confounding factor for the geographical correlation between human phenotypes, such as type 2 diabetes and light hair color. We postulate that transcription factor regulation may confound the correlation between seemingly diverse human traits. Furthermore, our findings demonstrate the importance of dissecting the genomic architecture of GWAS loci using multiple genetic and genomic datasets.