scholarly journals A novel linkage-disequilibrium corrected genomic relationship matrix for SNP-heritability estimation and genomic prediction

Heredity ◽  
2017 ◽  
Vol 120 (4) ◽  
pp. 356-368 ◽  
Author(s):  
Boby Mathew ◽  
Jens Léon ◽  
Mikko J. Sillanpää
2019 ◽  
Vol 97 (Supplement_3) ◽  
pp. 49-49
Author(s):  
Andre Garcia ◽  
Yutaka Masuda ◽  
Stephen P Miller ◽  
Ignacy Misztal ◽  
Daniela Lourenco

Abstract With the increasing number of genotyped animals, the algorithm for proven and young (APY) can be used to compute the inverse of the genomic relationship matrix (G-1apy) in genomic BLUP (GBLUP) and single-step GBLUP (ssGBLUP). This algorithm also allows the use of all genotyped animals to calculate SNP effects from genomic EBV (GEBV), which can then be used to obtain indirect predictions (IP) for interim evaluations, or as genomic prediction for animals not included in official evaluations. The objective of the study was to evaluate the quality of IP from GBLUP with increasing number of genotyped animals. Birth weight, weaning weight and post-wearing gain phenotypes and genotypes were provided by the American Angus Association. Phenotypes and genotypes were divided in 3 scenarios based on birth year: genotyped animals born up to 2013 (114,937), 2014 (183,847) and 2015 (280,506). A 3-trait model was fit and GBLUP with APY was used to calculate GEBV and SNP effects. To calculate G-1apy, 19,021 core animals were randomly sampled from animals born up to 2013. Core animals remained the same, whereas the number of non-core animals increased as more genotyped animals were added. Additional analyses had updated core animals for each scenario. SNP effects were also calculated based on G-1apy and G-1 only for core animals (G-1core). IP were computed for all animals in each scenario by multiplying the centered genotypes by the SNP effects. To access the quality of IP, correlation between IP and GEBV was calculated. The Correlations were greater than 0.99 for all traits in all scenarios. Despite the increase of non-core animals in APY, GEBV were successfully retrieved from SNP effects using IP. When SNP effects were calculated based on G-1core, updating the core animals as the number of genotyped animals increase seems to be the best choice.


2017 ◽  
Vol 95 (12) ◽  
pp. 5197-5207 ◽  
Author(s):  
M. W. Iversen ◽  
Ø. Nordbø ◽  
E. Gjerlaug-Enger ◽  
E. Grindflek ◽  
M.S. Lopes ◽  
...  

2021 ◽  
Vol 53 (1) ◽  
Author(s):  
Theo Meuwissen ◽  
Irene van den Berg ◽  
Mike Goddard

Abstract Background Whole-genome sequence (WGS) data are increasingly available on large numbers of individuals in animal and plant breeding and in human genetics through second-generation resequencing technologies, 1000 genomes projects, and large-scale genotype imputation from lower marker densities. Here, we present a computationally fast implementation of a variable selection genomic prediction method, that could handle WGS data on more than 35,000 individuals, test its accuracy for across-breed predictions and assess its quantitative trait locus (QTL) mapping precision. Methods The Monte Carlo Markov chain (MCMC) variable selection model (Bayes GC) fits simultaneously a genomic best linear unbiased prediction (GBLUP) term, i.e. a polygenic effect whose correlations are described by a genomic relationship matrix (G), and a Bayes C term, i.e. a set of single nucleotide polymorphisms (SNPs) with large effects selected by the model. Computational speed is improved by a Metropolis–Hastings sampling that directs computations to the SNPs, which are, a priori, most likely to be included into the model. Speed is also improved by running many relatively short MCMC chains. Memory requirements are reduced by storing the genotype matrix in binary form. The model was tested on a WGS dataset containing Holstein, Jersey and Australian Red cattle. The data contained 4,809,520 genotypes on 35,549 individuals together with their milk, fat and protein yields, and fat and protein percentage traits. Results The prediction accuracies of the Jersey individuals improved by 1.5% when using across-breed GBLUP compared to within-breed predictions. Using WGS instead of 600 k SNP-chip data yielded on average a 3% accuracy improvement for Australian Red cows. QTL were fine-mapped by locating the SNP with the highest posterior probability of being included in the model. Various QTL known from the literature were rediscovered, and a new SNP affecting milk production was discovered on chromosome 20 at 34.501126 Mb. Due to the high mapping precision, it was clear that many of the discovered QTL were the same across the five dairy traits. Conclusions Across-breed Bayes GC genomic prediction improved prediction accuracies compared to GBLUP. The combination of across-breed WGS data and Bayesian genomic prediction proved remarkably effective for the fine-mapping of QTL.


2019 ◽  
Vol 51 (1) ◽  
Author(s):  
Pascal Duenk ◽  
Mario P. L. Calus ◽  
Yvonne C. J. Wientjes ◽  
Vivian P. Breen ◽  
John M. Henshall ◽  
...  

Following publication of original article [1], we noticed that there was an error: Eq. (3) on page 5 is the genomic relationship matrix that


2014 ◽  
Vol 54 (5) ◽  
pp. 544 ◽  
Author(s):  
N. Moghaddar ◽  
A. A. Swan ◽  
J. H. J. van der Werf

The objective of this study was to predict the accuracy of genomic prediction for 26 traits, including weight, muscle, fat, and wool quantity and quality traits, in Australian sheep based on a large, multi-breed reference population. The reference population consisted of two research flocks, with the main breeds being Merino, Border Leicester (BL), Poll Dorset (PD), and White Suffolk (WS). The genomic estimated breeding value (GEBV) was based on GBLUP (genomic best linear unbiased prediction), applying a genomic relationship matrix calculated from the 50K Ovine SNP chip marker genotypes. The accuracy of GEBV was evaluated as the Pearson correlation coefficient between GEBV and accurate estimated breeding value based on progeny records in a set of genotyped industry animals. The accuracies of weight traits were relatively low to moderate in PD and WS breeds (0.11–0.27) and moderate to relatively high in BL and Merino (0.25–0.63). The accuracy of muscle and fat traits was moderate to relatively high across all breeds (between 0.21 and 0.55). The accuracy of GEBV of yearling and adult wool traits in Merino was, on average, high (0.33–0.75). The results showed the accuracy of genomic prediction depends on trait heritability and the effective size of the reference population, whereas the observed GEBV accuracies were more related to the breed proportions in the multi-breed reference population. No extra gain in within-breed GEBV accuracy was observed based on across breed information. More investigations are required to determine the precise effect of across-breed information on within-breed genomic prediction.


2018 ◽  
Vol 53 (6) ◽  
pp. 717-726 ◽  
Author(s):  
Michel Marques Farah ◽  
Marina Rufino Salinas Fortes ◽  
Matthew Kelly ◽  
Laercio Ribeiro Porto-Neto ◽  
Camila Tangari Meira ◽  
...  

Abstract: The objective of this work was to evaluate the effects of genomic information on the genetic evaluation of hip height in Brahman cattle using different matrices built from genomic and pedigree data. Hip height measurements from 1,695 animals, genotyped with high-density SNP chip or imputed from 50 K high-density SNP chip, were used. The numerator relationship matrix (NRM) was compared with the H matrix, which incorporated the NRM and genomic relationship (G) matrix simultaneously. The genotypes were used to estimate three versions of G: observed allele frequency (HGOF), average minor allele frequency (HGMF), and frequency of 0.5 for all markers (HG50). For matrix comparisons, animal data were either used in full or divided into calibration (80% older animals) and validation (20% younger animals) datasets. The accuracy values for the NRM, HGOF, and HG50 were 0.776, 0.813, and 0.594, respectively. The NRM and HGOF showed similar minor variances for diagonal and off-diagonal elements, as well as for estimated breeding values. The use of genomic information resulted in relationship estimates similar to those obtained based on pedigree; however, HGOF is the best option for estimating the genomic relationship matrix and results in a higher prediction accuracy. The ranking of the top 20% animals was very similar for all matrices, but the ranking within them varies depending on the method used.


Sign in / Sign up

Export Citation Format

Share Document