scholarly journals PSXII-37 Validation of single-step GBLUP genomic predictions from threshold models using the linear regression method: an application in chicken mortality

2020 ◽  
Vol 98 (Supplement_4) ◽  
pp. 246-247
Author(s):  
Matias Bermann ◽  
Andres Legarra ◽  
Mary Kate Hollifield ◽  
Yutaka Masuda ◽  
Daniela Lourenco ◽  
...  

Abstract The objective of this study was to determine whether the linear regression (LR) method could be used to validate genomic threshold models used for analysis of categorical data. Statistics for the LR method were computed from estimated breeding values (EBVs) using the whole and truncated datasets. The method was tested using simulated and real chicken datasets. The simulated dataset included ten generations of 4,500 birds each; genotypes were available for the last three generations. Each animal was assigned a continuous trait, which was converted to a binary score assuming an incidence of failure of 7%. The real dataset included the survival status of 186,596 broilers (mortality rate equal to 7.2%) and genotypes of 18,047 birds. Both datasets were analyzed using Best Linear Unbiased Predictor (BLUP) or single-step GBLUP (ssGBLUP). The whole dataset included all phenotypes available, whereas in the partial dataset, phenotypes of the most recent generation were removed. In the simulated dataset, the accuracies based on the LR formulas were 0.45 for BLUP and 0.76 for ssGBLUP, whereas the correlations between true breeding values and EBVs were 0.37 and 0.65, respectively. The gain in accuracy by adding genomic information was overestimated by 0.09 when using the LR method compared to the true increase in accuracy. However, when the estimated ratio between the additive variance computed based on pedigree only and on pedigree and genomic information was considered, the difference between true and estimated gain was less than 0.02. Accuracies of BLUP and ssGBLUP with the real dataset were 0.41 and 0.47, respectively. Smaller improvements in accuracy when using ssGBLUP with the real dataset were due to population structure and lower heritability. The LR method is a useful tool for estimating improvements in accuracy of EBVs due to the inclusion of genomic information when traditional validation methods are not applicable.

2021 ◽  
pp. 3119-3125
Author(s):  
Piriyaporn Sungkhapreecha ◽  
Ignacy Misztal ◽  
Jorge Hidalgo ◽  
Daniela Lourenco ◽  
Sayan Buaban ◽  
...  

Background and Aim: Genomic selection improves accuracy and decreases the generation interval, increasing the selection response. This study was conducted to assess the benefits of using single-step genomic best linear unbiased prediction (ssGBLUP) for genomic evaluations of milk yield and heat tolerance in Thai-Holstein cows and to test the value of old phenotypic data to maintain the accuracy of predictions. Materials and Methods: The dataset included 104,150 milk yield records collected from 1999 to 2018 from 15,380 cows. The pedigree contained 33,799 animals born between 1944 and 2016, of which 882 were genotyped. Analyses were performed with and without genomic information using ssGBLUP and BLUP, respectively. Statistics for bias, dispersion, the ratio of accuracies, and the accuracy of estimated breeding values were calculated using the linear regression (LR) method. A partial dataset excluded the phenotypes of the last generation, and 66 bulls were identified as validation individuals. Results: Bias was considerable for BLUP (0.44) but negligible (–0.04) for ssGBLUP; dispersion was similar for both techniques (0.84 vs. 1.06 for BLUP and ssGBLUP, respectively). The ratio of accuracies was 0.33 for BLUP and 0.97 for ssGBLUP, indicating more stable predictions for ssGBLUP. The accuracy of predictions was 0.18 for BLUP and 0.36 for ssGBLUP. Excluding the first 10 years of phenotypic data (i.e., 1999-2008) decreased the accuracy to 0.09 for BLUP and 0.32 for ssGBLUP. Genomic information doubled the accuracy and increased the persistence of genomic estimated breeding values when old phenotypes were removed. Conclusion: The LR method is useful for estimating accuracies and bias in complex models. When the population size is small, old data are useful, and even a small amount of genomic information can substantially improve the accuracy. The effect of heat stress on first parity milk yield is small.


2020 ◽  
Vol 138 (1) ◽  
pp. 4-13
Author(s):  
Matias Bermann ◽  
Andres Legarra ◽  
Mary Kate Hollifield ◽  
Yutaka Masuda ◽  
Daniela Lourenco ◽  
...  

Author(s):  
I Misztal ◽  
I Aguilar ◽  
D Lourenco ◽  
L Ma ◽  
J Steibel ◽  
...  

Abstract Genomic selection is now practiced successfully across many species. However, many questions remain such as long-term effects, estimations of genomic parameters, robustness of GWAS with small and large datasets, and stability of genomic predictions. This study summarizes presentations from at the 2020 ASAS symposium. The focus of many studies until now is on linkage disequilibrium (LD) between two loci. Ignoring higher level equilibrium may lead to phantom dominance and epistasis. The Bulmer effect leads to a reduction of the additive variance; however, selection for increased recombination rate can release anew genetic variance. With genomic information, estimates of genetic parameters may be biased by genomic preselection, but costs of estimation can increase drastically due to the dense form of the genomic information. To make computation of estimates feasible, genotypes could be retained only for the most important animals, and methods of estimation should use algorithms that can recognize dense blocks in sparse matrices. GWAS studies using small genomic datasets frequently find many marker-trait associations whereas studies using much bigger datasets find only a few. Most current tools use very simple models for GWAS, possibly causing artifacts. These models are adequate for large datasets where pseudo-phenotypes such as deregressed proofs indirectly account for important effects for traits of interest. Artifacts arising in GWAS with small datasets can be minimized by using data from all animals (whether genotyped or not), realistic models, and methods that account for population structure. Recent developments permit computation of p-values from GBLUP, where models can be arbitrarily complex but restricted to genotyped animals only, and to single-step GBLUP that also uses phenotypes from ungenotyped animals. Stability was an important part of nongenomic evaluations, where genetic predictions were stable in the absence of new data even with low prediction accuracies. Unfortunately, genomic evaluations for such animals change because all animals with genotypes are connected. A top ranked animal can easily drop in the next evaluation, causing a crisis of confidence in genomic evaluations. While correlations between consecutive genomic evaluations are high, outliers can have differences as high as one SD. A solution to fluctuating genomic evaluations is to base selection decisions on groups of animals. While many issues in genomic selection have been solved, many new issues that require additional research continue to surface.


2021 ◽  
Vol 99 (2) ◽  
Author(s):  
Yutaka Masuda ◽  
Shogo Tsuruta ◽  
Matias Bermann ◽  
Heather L Bradford ◽  
Ignacy Misztal

Abstract Pedigree information is often missing for some animals in a breeding program. Unknown-parent groups (UPGs) are assigned to the missing parents to avoid biased genetic evaluations. Although the use of UPGs is well established for the pedigree model, it is unclear how UPGs are integrated into the inverse of the unified relationship matrix (H-inverse) required for single-step genomic best linear unbiased prediction. A generalization of the UPG model is the metafounder (MF) model. The objectives of this study were to derive 3 H-inverses and to compare genetic trends among models with UPG and MF H-inverses using a simulated purebred population. All inverses were derived using the joint density function of the random breeding values and genetic groups. The breeding values of genotyped animals (u2) were assumed to be adjusted for UPG effects (g) using matrix Q2 as u2∗=u2+Q2g before incorporating genomic information. The Quaas–Pollak-transformed (QP) H-inverse was derived using a joint density function of u2∗ and g updated with genomic information and assuming nonzero cov(u2∗,g′). The modified QP (altered) H-inverse also assumes that the genomic information updates u2∗ and g, but cov(u2∗,g′)=0. The UPG-encapsulated (EUPG) H-inverse assumed genomic information updates the distribution of u2∗. The EUPG H-inverse had the same structure as the MF H-inverse. Fifty percent of the genotyped females in the simulation had a missing dam, and missing parents were replaced with UPGs by generation. The simulation study indicated that u2∗ and g in models using the QP and altered H-inverses may be inseparable leading to potential biases in genetic trends. Models using the EUPG and MF H-inverses showed no genetic trend biases. These 2 H-inverses yielded the same genomic EBV (GEBV). The predictive ability and inflation of GEBVs from young genotyped animals were nearly identical among models using the QP, altered, EUPG, and MF H-inverses. Although the choice of H-inverse in real applications with enough data may not result in biased genetic trends, the EUPG and MF H-inverses are to be preferred because of theoretical justification and possibility to reduce biases.


2021 ◽  
Vol 12 ◽  
Author(s):  
Hossein Mehrban ◽  
Masoumeh Naserkheil ◽  
Deukhwan Lee ◽  
Noelia Ibáñez-Escriche

There has been a growing interest in the genetic improvement of carcass traits as an important and primary breeding goal in the beef cattle industry over the last few decades. The use of correlated traits and molecular information can aid in obtaining more accurate estimates of breeding values. This study aimed to assess the improvement in the accuracy of genetic predictions for carcass traits by using ultrasound measurements and yearling weight along with genomic information in Hanwoo beef cattle by comparing four evaluation models using the estimators of the recently developed linear regression method. We compared the performance of single-trait pedigree best linear unbiased prediction [ST-BLUP and single-step genomic (ST-ssGBLUP)], as well as multi-trait (MT-BLUP and MT-ssGBLUP) models for the studied traits at birth and yearling date of steers. The data comprised of 15,796 phenotypic records for yearling weight and ultrasound traits as well as 5,622 records for carcass traits (backfat thickness, carcass weight, eye muscle area, and marbling score), resulting in 43,949 single-nucleotide polymorphisms from 4,284 steers and 2,332 bulls. Our results indicated that averaged across all traits, the accuracy of ssGBLUP models (0.52) was higher than that of pedigree-based BLUP (0.34), regardless of the use of single- or multi-trait models. On average, the accuracy of prediction can be further improved by implementing yearling weight and ultrasound data in the MT-ssGBLUP model (0.56) for the corresponding carcass traits compared to the ST-ssGBLUP model (0.49). Moreover, this study has shown the impact of genomic information and correlated traits on predictions at the yearling date (0.61) using MT-ssGBLUP models, which was advantageous compared to predictions at birth date (0.51) in terms of accuracy. Thus, using genomic information and high genetically correlated traits in the multi-trait model is a promising approach for practical genomic selection in Hanwoo cattle, especially for traits that are difficult to measure.


Animals ◽  
2020 ◽  
Vol 10 (4) ◽  
pp. 569
Author(s):  
Chen Wei ◽  
Hanpeng Luo ◽  
Bingru Zhao ◽  
Kechuan Tian ◽  
Xixia Huang ◽  
...  

Genomic evaluations are a method for improving the accuracy of breeding value estimation. This study aimed to compare estimates of genetic parameters and the accuracy of breeding values for wool traits in Merino sheep between pedigree-based best linear unbiased prediction (PBLUP) and single-step genomic best linear unbiased prediction (ssGBLUP) using Bayesian inference. Data were collected from 28,391 yearlings of Chinese Merino sheep (classified in 1992–2018) at the Xinjiang Gonaisi Fine Wool Sheep-Breeding Farm, China. Subjectively-assessed wool traits, namely, spinning count (SC), crimp definition (CRIM), oil (OIL), and body size (BS), and objectively-measured traits, namely, fleece length (FL), greasy fleece weight (GFW), mean fiber diameter (MFD), crimp number (CN), and body weight pre-shearing (BWPS), were analyzed. The estimates of heritability for wool traits were low to moderate. The largest h2 values were observed for FL (0.277) and MFD (0.290) with ssGBLUP. The heritabilities estimated for wool traits with ssGBLUP were slightly higher than those obtained with PBLUP. The accuracies of breeding values were low to moderate, ranging from 0.362 to 0.573 for the whole population and from 0.318 to 0.676 for the genotyped subpopulation. The correlation between the estimated breeding values (EBVs) and genomic EBVs (GEBVs) ranged from 0.717 to 0.862 for the whole population, and the relative increase in accuracy when comparing EBVs with GEBVs ranged from 0.372% to 7.486% for these traits. However, in the genotyped population, the rank correlation between the estimates obtained with PBLUP and ssGBLUP was reduced to 0.525 to 0.769, with increases in average accuracy of 3.016% to 11.736% for the GEBVs in relation to the EBVs. Thus, genomic information could allow us to more accurately estimate the relationships between animals and improve estimates of heritability and the accuracy of breeding values by ssGBLUP.


Animals ◽  
2020 ◽  
Vol 11 (1) ◽  
pp. 24
Author(s):  
Megan Scholtens ◽  
Nicolas Lopez-Villalobos ◽  
Klaus Lehnert ◽  
Russell Snell ◽  
Dorian Garrick ◽  
...  

Selection on genomic breeding values (GBVs) is now readily available for ranking candidates in improvement schemes. Our objective was to quantify benefits in terms of accuracy of prediction from including genomic information in the single-trait estimation of breeding values (BVs) for a New Zealand mixed breed dairy goat herd. The dataset comprised phenotypic and pedigree records of 839 does. The phenotypes comprised estimates of 305-day lactation yields of milk, fat, and protein and average somatic cell score from the 2016 production season. Only 388 of the goats were genotyped with a Caprine 50K SNP chip and 41,981 of the single nucleotide polymorphisms (SNPs) passed quality control. Pedigree-based best linear unbiased prediction (PBLUP) was used to obtain across-breed breeding values (EBVs), whereas a single-step BayesC model (ssBC) was used to estimate across-breed GBVs. The average prediction accuracies ranged from 0.20 to 0.22 for EBVs and 0.34 to 0.43 for GBVs. Accuracies of GBVs were up to 103% greater than EBVs. Breed effects were more reliably estimated in the ssBC model compared with the PBLUP model. The greatest benefit of genomic prediction was for individuals with no pedigree or phenotypic records. Including genomic information improved the prediction accuracy of BVs compared with the current pedigree-based BLUP method currently implemented in the New Zealand dairy goat population.


2020 ◽  
Vol 98 (Supplement_4) ◽  
pp. 14-15
Author(s):  
Natalia Leite ◽  
Daniela Lourenco ◽  
Egbert Knol ◽  
Marcos Lopes ◽  
Fabyano Silva

Abstract Pig survival is an economically important trait with animal welfare implications. Although survival is highly affected by the environment, previous studies reported genetic variability of this trait, indicating the opportunity for improvement through selection. Genomic information is currently included in the major commercial pig breeding programs, which can be incorporated by single-step genomic BLUP (ssGBLUP). The objectives of this study were to: 1) estimate (co)variance components for farrowing, lactation, nursery, and finishing survival; and 2) compare the individual breeding value accuracies obtained using traditional pedigree-based BLUP (BLUP) and ssGBLUP methods. Individual survival records were collected for a crossbred pig population, and two-trait threshold models, which included maternal effects, were used for (co)variance components estimation. Direct and maternal breeding values were estimated using BLUP and ssGBLUP methods, and individual accuracies were obtained based on posterior standard deviation. Heritabilities for the four survival phases were low, ranging from 0.04 to 0.12. Pre-weaning survival was controlled by dam and piglet additive gene effects. The additive direct and maternal components were equally important at farrowing, whereas the piglet’s own genetic merit was the most expressive during lactation. Common environment estimates were higher than maternal genetic effects, indicating early life experiences related to the sow, but independent of the maternal genetic component. Nursery and finishing survival showed the same or higher heritabilities compared to pre-weaning stages. The genetic correlation between the pre-weaning phases was high (0.68), whereas the correlation between the post-weaning measurements was moderate (0.42). The incorporation of genomic information through ssGBLUP increased the individual accuracy, on average, from 0.36 to 0.41 for direct and from 0.29 to 0.37 for maternal effects compared to BLUP. Although the heritabilities for survival in different productive stages are low, genetic gains can be obtained, given that breeding values benefit from the inclusion of genomic information.


2021 ◽  
Vol 99 (Supplement_3) ◽  
pp. 23-23
Author(s):  
Natalia Leite ◽  
Ching-Yi Chen ◽  
Willian O Herring ◽  
Shogo Tsuruta ◽  
Daniela Lourenco

Abstract Phenotyping a large number of crossbred progeny for the evaluation of purebred animals can be expensive. As genotyping with low-density panels is becoming cheaper, we aimed to evaluate the tradeoff between having different percentages of genotypes and phenotypes for crossbred progeny of candidate boars. We used the linear regression (LR) method to investigate changes in accuracy, bias, and inflation of breeding values for crossbred traits in purebred boars. A total of 304,582 purebred and 147,474 crossbred animals were phenotyped for average daily gain (ADG) and backfat thickness (BF), out of which 46,691 purebred and 13,117 crossbred animals were genotyped. Genomic information consisted of imputed genotypes for 40,247 SNP markers after quality control. A four-trait animal model under single-step GBLUP was used that included phenotypes recorded in purebred and crossbred animals as correlated traits. The LR statistics were calculated based on breeding values of young purebred sires from complete and partial data. The first complete data included genotypes for purebreds and phenotypes for purebreds and crossbreds, whereas the second included also genotypes for crossbreds. The partial data included phenotypes on 50% or none of the progeny of validation sires, with or without genotypes for crossbred animals. When 50% of the progeny has phenotypes, adding genotypes for crossbred progeny marginally increased accuracy of ADG (0.77 vs 0.78) for 47 boars with more than 150 progeny with phenotypes. No increase was observed for BF. A small increase in bias and inflation by adding crossbred genotypes was observed for ADG but not for BF. When no phenotypes were available for crossbred progeny, accuracy for both traits was lower but improved with crossbred genotypes for ADG (0.61 vs 0.64) for boars with more than 150 progeny. The tradeoff between phenotypes and genotypes should be further investigated in larger datasets with more validation boars.


Author(s):  
Natália Galoro Leite ◽  
Egbert Frank Knol ◽  
André Luiz Seccatto Garcia ◽  
Marcos Soares Lopes ◽  
Louisa Zak ◽  
...  

Abstract Pig survival is an economically important trait with relevant social welfare implications, thus standing out as an important selection criterion for the current pig farming system. We aimed to estimate (co)variance components for survival in different production phases in a crossbred pig population, as well as to investigate the benefit of including genomic information through single-step genomic BLUP (ssGBLUP) on the prediction accuracy of survival traits compared to results from traditional BLUP. Individual survival records on, at most, 64,894 crossbred piglets were evaluated under two multi-trait threshold models. The first model included farrowing, lactation, and combined post-weaning survival, whereas the second model included nursery and finishing survival. Direct and maternal breeding values were estimated using BLUP and ssGBLUP methods. Further, prediction accuracy, bias, and dispersion were accessed using the Linear Regression validation method. Direct heritability estimates for survival in all studied phases were low (from 0.02 to 0.08). Survival in pre-weaning phases (farrowing and lactation) was controlled by the dam and piglet additive genetic effects, although the maternal side was more important. Post-weaning phases (nursery, finishing, and the combination of both) showed the same or higher direct heritabilities compared to pre-weaning phases. The genetic correlations between survival traits within pre- and post-weaning phases were favorable and strong, but correlations between pre- and post-weaning phases were moderate. The prediction accuracy of survival traits was low, although it increased by including genomic information through ssGBLUP, compared to the prediction accuracy from BLUP. Direct and maternal breeding values were similarly accurate with BLUP, but direct breeding values benefited more from genomic information. Overall, a slight increase in bias was observed when genomic information was included, whereas dispersion of breeding values was greatly reduced. Post-weaning survival (POST) presented higher direct heritability than in the pre-weaning phases and the highest prediction accuracy among all evaluated production phases, therefore standing out as a candidate trait for improving survival. Survival is a complex trait with low heritability; however, important genetic gains can still be obtained, especially under a genomic prediction framework.


Sign in / Sign up

Export Citation Format

Share Document