Mono- and bi-allelic effects of coding variants on disease in 176,899 Finns
Identifying Mendelian diseases with recessive inheritance is challenging as the majority of cases are caused by compound heterozygous genotypes which require sequencing data in families to definitively identify. Bottleneck events, such as in the Finnish population, enrich specific homozygous variants to higher frequencies and thus facilitate identification of disease associations through easily recognized homozygous genotypes. Here, we study homozygous and heterozygous effects of 82,516 coding variants on 2,444 disease endpoints using nationwide electronic health record (EHR) data of 176,899 Finns. We find known and novel associations to homozygous genotypes across a broad spectrum of phenotypes such as retinal dystrophy, adult-onset cataract and female infertility (13/20 of which would have been missed by the traditional additive GWAS model). With these results, and supporting simulations, we demonstrate the added benefit of homozygous scans in GWAS. We further use these results to explore inheritance patterns of known Mendelian variants. We find many Mendelian variants whose inheritance cannot be adequately described with the traditional definition of dominant or recessive. In particular, we find disease risk in heterozygous carriers of variants known to cause disease with recessive inheritance, as well as for variants labeled benign in ClinVar. Our results demonstrate how biobank efforts, particularly in founder populations, can broaden our understanding of the impact of genetic variants.