Meta-analysis of sequence-based association studies across three cattle breeds reveals 25 QTL for fat and protein percentages in milk at nucleotide resolution
AbstractBackgroundGenotyping and whole-genome sequencing data have been collected in many cattle breeds. The compilation of large reference panels facilitates imputing sequence variant genotypes for animals that have been genotyped using dense genotyping arrays. Association studies with imputed sequence variant genotypes allow characterization of quantitative trait loci (QTL) at nucleotide resolution particularly when individuals from several breeds are included in the mapping populations.ResultsWe imputed genotypes for more than 28 million sequence variants in 17,229 animals of the Braunvieh (BV), Fleckvieh (FV) and Holstein (HOL) cattle breeds in order to generate large mapping populations that are required to identify sequence variants underlying milk production traits. Within-breed association tests between imputed sequence variant genotypes and fat and protein percentages in milk uncovered between six and thirteen QTL (P<1e-8) per breed. Eight of the detected QTL were significant in more than one breed. We combined the association studies across three breeds using meta-analysis and identified 25 QTL including six that were not significant in the within-breed association studies. Closer inspection of the QTL revealed that two well-known causal missense mutations in the ABCG2 (p.Y581S, rs43702337, P=4.3e-34) and GHR (p.F279Y, rs385640152, P=1.6e-74) genes were the top variants at two QTL on chromosomes 6 and 20. Another true causal missense mutation in the DGAT1 gene (p.A232K, rs109326954, P=8.4e-1436) was the second top variant at a QTL on chromosome 14 but its allelic substitution effects were not consistent across three breeds analyzed. It turned out that the conflicting allelic substitution effects resulted from flaws in the imputed genotypes due to the use of a multi-breed reference population for genotype imputation.ConclusionsMany QTL for milk production traits segregate across breeds. Metaanalysis of association studies across breeds has greater power to detect such QTL than within-breed association studies. True causal mutations can be readily detected among the most significantly associated variants at QTL when the accuracy of imputation is high. However, true causal mutations may show conflicting allelic substitution effects across breeds when the imputed sequence variant genotypes contain flaws. Validating the effect of known causal variants is highly recommended in order to assess the ability to detect true causal mutations in association studies with imputed sequence variant genotypes.