Prevalence Estimates of Predicted Pathogenic COL4A3 - COL4A5 Variants in a Population Sequencing Database and Their Implications for Alport Syndrome
Background: The prevalence of Alport syndrome varies from one in 5,000 to one in 53,000. This study estimated the frequencies of predicted pathogenic COL4A3- COL4A5 variants in sequencing databases of populations without known kidney disease. Methods: Predicted pathogenic variants were identified using filtering steps based on the ACMG/AMP criteria that considered collagen IV α3-α5 position 1 Gly to be critical domains. The population frequencies of predicted pathogenic COL4A3-COL4A5 variants were then determined per mean number of sequenced alleles. Population frequencies for compound heterozygous and digenic combinations were calculated from the results for heterozygous variants. Results:COL4A3-COL4A5 variants resulting in position 1 Gly substitutions were confirmed associated with haematuria (p each <0.0001). Predicted pathogenic COL4A5 variants were found in at least one in 2,320 individuals. p.(Gly624Asp), represented nearly half (16/33, 48%) the variants in Europeans. Most COL4A5 variants (54/59, 92%) had a biochemical feature that potentially mitigated clinical impact. Predicted pathogenic heterozygous COL4A3 and COL4A4 variants affected one in 106 of the population, consistent with the finding of Thin basement membrane nephropathy in normal donor kidney biopsies. Predicted pathogenic compound heterozygous variants occurred in one in 88,866 individuals and digenic variants in at least one in 44,793. Conclusions: The population frequencies for Alport syndrome are suggested by the frequencies of predicted pathogenic COL4A3-COL4A5 variants but must be adjusted for the disease penetrance of individual variants, as well as the likelihood of already diagnosed disease and non-Gly substitutions. Disease penetrance may depend on other genetic and environmental factors.