scholarly journals GPS Coordinates for Modelling Correlated Herd Effects in Genomic Prediction Models Applied to Hanwoo Beef Cattle

Animals ◽  
2021 ◽  
Vol 11 (7) ◽  
pp. 2050
Author(s):  
Beatriz Castro Dias Cuyabano ◽  
Gabriel Rovere ◽  
Dajeong Lim ◽  
Tae Hun Kim ◽  
Hak Kyo Lee ◽  
...  

It is widely known that the environment influences phenotypic expression and that its effects must be accounted for in genetic evaluation programs. The most used method to account for environmental effects is to add herd and contemporary group to the model. Although generally informative, the herd effect treats different farms as independent units. However, if two farms are located physically close to each other, they potentially share correlated environmental factors. We introduce a method to model herd effects that uses the physical distances between farms based on the Global Positioning System (GPS) coordinates as a proxy for the correlation matrix of these effects that aims to account for similarities and differences between farms due to environmental factors. A population of Hanwoo Korean cattle was used to evaluate the impact of modelling herd effects as correlated, in comparison to assuming the farms as completely independent units, on the variance components and genomic prediction. The main result was an increase in the reliabilities of the predicted genomic breeding values compared to reliabilities obtained with traditional models (across four traits evaluated, reliabilities of prediction presented increases that ranged from 0.05 ± 0.01 to 0.33 ± 0.03), suggesting that these models may overestimate heritabilities. Although little to no significant gain was obtained in phenotypic prediction, the increased reliability of the predicted genomic breeding values is of practical relevance for genetic evaluation programs.

Animals ◽  
2019 ◽  
Vol 9 (9) ◽  
pp. 672 ◽  
Author(s):  
Beatriz Castro Dias Castro Dias Cuyabano ◽  
Hanna Wackel ◽  
Donghyun Shin ◽  
Cedric Gondro

Genomic models that incorporate dense marker information have been widely used for predicting genomic breeding values since they were first introduced, and it is known that the relationship between individuals in the reference population and selection candidates affects the prediction accuracy. When genomic evaluation is performed over generations of the same population, prediction accuracy is expected to decay if the reference population is not updated. Therefore, the reference population must be updated in each generation, but little is known about the optimal way to do it. This study presents an empirical assessment of the prediction accuracy of genomic breeding values of production traits, across five generations in two Korean pig breeds. We verified the decay in prediction accuracy over time when the reference population was not updated. Additionally we compared the prediction accuracy using only the previous generation as the reference population, as opposed to using all previous generations as the reference population. Overall, the results suggested that, although there is a clear need to continuously update the reference population, it may not be necessary to keep all ancestral genotypes. Finally, comprehending how the accuracy of genomic prediction evolves over generations within a population adds relevant information to improve the performance of genomic selection.


2010 ◽  
Vol 42 (1) ◽  
Author(s):  
David Habier ◽  
Jens Tetens ◽  
Franz-Reinhold Seefried ◽  
Peter Lichtner ◽  
Georg Thaller

2021 ◽  
Vol 12 ◽  
Author(s):  
Md. Abdullah Al Bari ◽  
Ping Zheng ◽  
Indalecio Viera ◽  
Hannah Worral ◽  
Stephen Szwiec ◽  
...  

Phenotypic evaluation and efficient utilization of germplasm collections can be time-intensive, laborious, and expensive. However, with the plummeting costs of next-generation sequencing and the addition of genomic selection to the plant breeder’s toolbox, we now can more efficiently tap the genetic diversity within large germplasm collections. In this study, we applied and evaluated genomic prediction’s potential to a set of 482 pea (Pisum sativum L.) accessions—genotyped with 30,600 single nucleotide polymorphic (SNP) markers and phenotyped for seed yield and yield-related components—for enhancing selection of accessions from the USDA Pea Germplasm Collection. Genomic prediction models and several factors affecting predictive ability were evaluated in a series of cross-validation schemes across complex traits. Different genomic prediction models gave similar results, with predictive ability across traits ranging from 0.23 to 0.60, with no model working best across all traits. Increasing the training population size improved the predictive ability of most traits, including seed yield. Predictive abilities increased and reached a plateau with increasing number of markers presumably due to extensive linkage disequilibrium in the pea genome. Accounting for population structure effects did not significantly boost predictive ability, but we observed a slight improvement in seed yield. By applying the best genomic prediction model (e.g., RR-BLUP), we then examined the distribution of genotyped but nonphenotyped accessions and the reliability of genomic estimated breeding values (GEBV). The distribution of GEBV suggested that none of the nonphenotyped accessions were expected to perform outside the range of the phenotyped accessions. Desirable breeding values with higher reliability can be used to identify and screen favorable germplasm accessions. Expanding the training set and incorporating additional orthogonal information (e.g., transcriptomics, metabolomics, physiological traits, etc.) into the genomic prediction framework can enhance prediction accuracy.


2021 ◽  
Vol 99 (Supplement_3) ◽  
pp. 261-261
Author(s):  
Hinayah R Oliveira ◽  
Stephen P Miller ◽  
Luiz F Brito ◽  
Flavio S Schenkel

Abstract A recent study showed that longevity based on different culling reasons should be considered as different traits in genetic evaluations. However, it is still necessary to create a pipeline that avoid including/excluding animals culled for different reasons in every genetic evaluation run. This study aimed to: 1) perform a genetic evaluation of the longevity of cows culled due to fertility-related problems including records of animals culled for other reasons (i.e., age, structural problems, disease, and performance) as censored records; and, 2) identify the impact of censored data in the genetic parameters and breeding values estimated. Two longevity indicators were evaluated: traditional (TL; time from first calving to culling) and functional (FL; time period in which the cow was alive and also calving after its first calving) longevity. Both TL and FL were evaluated from 2 to 15 years-old, and codified as binary traits for each age (0 = culled and 1 = alive/calved). Both trait definitions were analyzed using a Bayesian random regression linear model. Animals culled for reasons other than fertility were either excluded from the data (standard) or had their records censored after the culling date reported in the dataset (censored). After the quality control, 154,419 and 450,124 animals had uncensored and censored records, respectively. Heritabilities estimated for TL over the ages ranged from 0.02 to 0.13 for standard, and from 0.01 to 0.12 for censored datasets. Heritabilities estimated for FL ranged from 0.01 to 0.14 (standard), and from 0.01 to 0.13 (censored). Average (SD) correlation of breeding values predicted over all ages, using the standard and censored datasets, was 0.77 (0.16) for TL, and 0.83 (0.11) for FL. Our findings suggest that including censored data in the analyses might impact the genomic evaluations and further work is need to determine the optimal predictive approach.


2012 ◽  
Vol 39 (3) ◽  
pp. 357-364 ◽  
Author(s):  
Seung Hwan Lee ◽  
Heong Cheul Kim ◽  
Dajeong Lim ◽  
Chang Gwan Dang ◽  
Yong Min Cho ◽  
...  

2019 ◽  
Author(s):  
Ainhoa Calleja-Rodriguez ◽  
Jin Pan ◽  
Tomas Funda ◽  
Zhi-Qiang Chen ◽  
John Baison ◽  
...  

ABSTRACTHigher genetic gains can be achieved through genomic selection (GS) by shortening time of progeny testing in tree breeding programs. Genotyping-by-sequencing (GBS), combined with two imputation methods, allowed us to perform the current genomic prediction study in Scots pine (Pinus sylvestrisL.). 694 individuals representing 183 full-sib families were genotyped and phenotyped for growth and wood quality traits. 8719 SNPs were used to compare different genomic prediction models. In addition, the impact on the predictive ability (PA) and prediction accuracy to estimate genomic breeding values was evaluated by assigning different ratios of training and validation sets, as well as different subsets of SNP markers. Genomic Best Linear Unbiased Prediction (GBLUP) and Bayesian Ridge Regression (BRR) combined with expectation maximization (EM) imputation algorithm showed higher PAs and prediction accuracies than Bayesian LASSO (BL). A subset of approximately 4000 markers was sufficient to provide the same PAs and accuracies as the full set of 8719 markers. Furthermore, PAs were similar for both pedigree- and genomic-based estimations, whereas accuracies and heritabilities were slightly higher for pedigree-based estimations. However, prediction accuracies of genomic models were sufficient to achieve a higher selection efficiency per year, varying between 50-87% compared to the traditional pedigree-based selection.


PLoS ONE ◽  
2021 ◽  
Vol 16 (12) ◽  
pp. e0261274
Author(s):  
Harrison J. Lamb ◽  
Ben J. Hayes ◽  
Imtiaz A. S. Randhawa ◽  
Loan T. Nguyen ◽  
Elizabeth M. Ross

Most traits in livestock, crops and humans are polygenic, that is, a large number of loci contribute to genetic variation. Effects at these loci lie along a continuum ranging from common low-effect to rare high-effect variants that cumulatively contribute to the overall phenotype. Statistical methods to calculate the effect of these loci have been developed and can be used to predict phenotypes in new individuals. In agriculture, these methods are used to select superior individuals using genomic breeding values; in humans these methods are used to quantitatively measure an individual’s disease risk, termed polygenic risk scores. Both fields typically use SNP array genotypes for the analysis. Recently, genotyping-by-sequencing has become popular, due to lower cost and greater genome coverage (including structural variants). Oxford Nanopore Technologies’ (ONT) portable sequencers have the potential to combine the benefits genotyping-by-sequencing with portability and decreased turn-around time. This introduces the potential for in-house clinical genetic disease risk screening in humans or calculating genomic breeding values on-farm in agriculture. Here we demonstrate the potential of the later by calculating genomic breeding values for four traits in cattle using low-coverage ONT sequence data and comparing these breeding values to breeding values calculated from SNP arrays. At sequencing coverages between 2X and 4X the correlation between ONT breeding values and SNP array-based breeding values was > 0.92 when imputation was used and > 0.88 when no imputation was used. With an average sequencing coverage of 0.5x the correlation between the two methods was between 0.85 and 0.92 using imputation, depending on the trait. This suggests that ONT sequencing has potential for in clinic or on-farm genomic prediction, however, further work to validate these findings in a larger population still remains.


Author(s):  
Hans-Jürgen Auinger ◽  
Christina Lehermeier ◽  
Daniel Gianola ◽  
Manfred Mayer ◽  
Albrecht E. Melchinger ◽  
...  

Abstract Key message Model training on data from all selection cycles yielded the highest prediction accuracy by attenuating specific effects of individual cycles. Expected reliability was a robust predictor of accuracies obtained with different calibration sets. Abstract The transition from phenotypic to genome-based selection requires a profound understanding of factors that determine genomic prediction accuracy. We analysed experimental data from a commercial maize breeding programme to investigate if genomic measures can assist in identifying optimal calibration sets for model training. The data set consisted of six contiguous selection cycles comprising testcrosses of 5968 doubled haploid lines genotyped with a minimum of 12,000 SNP markers. We evaluated genomic prediction accuracies in two independent prediction sets in combination with calibration sets differing in sample size and genomic measures (effective sample size, average maximum kinship, expected reliability, number of common polymorphic SNPs and linkage phase similarity). Our results indicate that across selection cycles prediction accuracies were as high as 0.57 for grain dry matter yield and 0.76 for grain dry matter content. Including data from all selection cycles in model training yielded the best results because interactions between calibration and prediction sets as well as the effects of different testers and specific years were attenuated. Among genomic measures, the expected reliability of genomic breeding values was the best predictor of empirical accuracies obtained with different calibration sets. For grain yield, a large difference between expected and empirical reliability was observed in one prediction set. We propose to use this difference as guidance for determining the weight phenotypic data of a given selection cycle should receive in model retraining and for selection when both genomic breeding values and phenotypes are available.


Sign in / Sign up

Export Citation Format

Share Document