scholarly journals Simulation of pedigree vs. fully-informative marker based relationships matrices in a loblolly pine breeding population

2021 ◽  
Author(s):  
Adam R Festa ◽  
Ross Whetten

Computer simulations of breeding strategies are an essential resource for tree breeders because they allow exploratory analyses into potential long-term impacts on genetic gain and inbreeding consequences without bearing the cost, time, or resource requirements of field experiments. Previous work has modeled the potential long-term implications on inbreeding and genetic gain using random mating and phenotypic selection. Reduction in sequencing costs has enabled the use of DNA marker-based relationship matrices in addition to or in place of pedigree-based allele sharing estimates; this has been shown to provide a significant increase in the accuracy of progeny breeding value prediction. A potential pitfall of genomic selection using genetic relationship matrices is increased coancestry among selections, leading to the accumulation of deleterious alleles and inbreeding depression. We used simulation to compare the relative genetic gain and risk of inbreeding depression within a breeding program similar to loblolly pine, utilizing pedigree-based or marker-based relationships over ten generations. We saw a faster rate of purging deleterious alleles when using a genomic relationship matrix based on markers that track identity-by-descent of segments of the genome. Additionally, we observed an increase in the rate of genetic gain when using a genomic relationship matrix instead of a pedigree-based relationship matrix. While the genetic variance of populations decreased more rapidly when using genomic-based relationship matrices as opposed to pedigree-based, there appeared to be no long-term consequences on the accumulation of deleterious alleles within the simulated breeding strategy.

2020 ◽  
Vol 10 (6) ◽  
pp. 2069-2078 ◽  
Author(s):  
Christos Palaiokostas ◽  
Shannon M. Clarke ◽  
Henrik Jeuthe ◽  
Rudiger Brauning ◽  
Timothy P. Bilton ◽  
...  

Arctic charr (Salvelinus alpinus) is a species of high economic value for the aquaculture industry, and of high ecological value due to its Holarctic distribution in both marine and freshwater environments. Novel genome sequencing approaches enable the study of population and quantitative genetic parameters even on species with limited or no prior genomic resources. Low coverage genotyping by sequencing (GBS) was applied in a selected strain of Arctic charr in Sweden originating from a landlocked freshwater population. For the needs of the current study, animals from year classes 2013 (171 animals, parental population) and 2017 (759 animals; 13 full sib families) were used as a template for identifying genome wide single nucleotide polymorphisms (SNPs). GBS libraries were constructed using the PstI and MspI restriction enzymes. Approximately 14.5K SNPs passed quality control and were used for estimating a genomic relationship matrix. Thereafter a wide range of analyses were conducted in order to gain insights regarding genetic diversity and investigate the efficiency of the genomic information for parentage assignment and breeding value estimation. Heterozygosity estimates for both year classes suggested a slight excess of heterozygotes. Furthermore, FST estimates among the families of year class 2017 ranged between 0.009 – 0.066. Principal components analysis (PCA) and discriminant analysis of principal components (DAPC) were applied aiming to identify the existence of genetic clusters among the studied population. Results obtained were in accordance with pedigree records allowing the identification of individual families. Additionally, DNA parentage verification was performed, with results in accordance with the pedigree records with the exception of a putative dam where full sib genotypes suggested a potential recording error. Breeding value estimation for juvenile growth through the usage of the estimated genomic relationship matrix clearly outperformed the pedigree equivalent in terms of prediction accuracy (0.51 opposed to 0.31). Overall, low coverage GBS has proven to be a cost-effective genotyping platform that is expected to boost the selection efficiency of the Arctic charr breeding program.


2021 ◽  
Author(s):  
Mitchell J. Feldmann ◽  
Hans-Peter Piepho ◽  
Steven J. Knapp

Many important traits in plants, animals, and microbes are polygenic and are therefore difficult to improve through traditional marker?assisted selection. Genomic prediction addresses this by enabling the inclusion of all genetic data in a mixed model framework. The main method for predicting breeding values is genomic best linear unbiased prediction (GBLUP), which uses the realized genomic relationship or kinship matrix (K) to connect genotype to phenotype. The use of relationship matrices allows information to be shared for estimating the genetic values for observed entries and predicting genetic values for unobserved entries. One of the key parameters of such models is genomic heritability (h2g), or the variance of a trait associated with a genome-wide sample of DNA polymorphisms. Here we discuss the relationship between several common methods for calculating the genomic relationship matrix and propose a new matrix based on the average semivariance that yields accurate estimates of genomic variance in the observed population regardless of the focal population quality as well as accurate breeding value predictions in unobserved samples. Notably, our proposed method is highly similar to the approach presented by Legarra (2016) despite different mathematical derivations and statistical perspectives and only deviates from the classic approach presented in VanRaden (2008) by a scaling factor. With current approaches, we found that the genomic heritability tends to be either over- or underestimated depending on the scaling and centering applied to the marker matrix (Z), the value of the average diagonal element of K, and the assortment of alleles and heterozygosity (H) in the observed population and that, unlike its predecessors, our newly proposed kinship matrix KASV yields accurate estimates of h2g in the observed population, generalizes to larger populations, and produces BLUPs equivalent to common methods in plants and animals.


2013 ◽  
Vol 55 (3) ◽  
pp. 165-171
Author(s):  
Joon-Ho Lee ◽  
Kwang-Hyun Cho ◽  
Chung-Il Cho ◽  
Kyung-Do Park ◽  
Deuk Hwan Lee

2021 ◽  
Vol 53 (1) ◽  
Author(s):  
Richard Bernstein ◽  
Manuel Du ◽  
Andreas Hoppe ◽  
Kaspar Bienefeld

Abstract Background With the completion of a single nucleotide polymorphism (SNP) chip for honey bees, the technical basis of genomic selection is laid. However, for its application in practice, methods to estimate genomic breeding values need to be adapted to the specificities of the genetics and breeding infrastructure of this species. Drone-producing queens (DPQ) are used for mating control, and usually, they head non-phenotyped colonies that will be placed on mating stations. Breeding queens (BQ) head colonies that are intended to be phenotyped and used to produce new queens. Our aim was to evaluate different breeding program designs for the initiation of genomic selection in honey bees. Methods Stochastic simulations were conducted to evaluate the quality of the estimated breeding values. We developed a variation of the genomic relationship matrix to include genotypes of DPQ and tested different sizes of the reference population. The results were used to estimate genetic gain in the initial selection cycle of a genomic breeding program. This program was run over six years, and different numbers of genotyped queens per year were considered. Resources could be allocated to increase the reference population, or to perform genomic preselection of BQ and/or DPQ. Results Including the genotypes of 5000 phenotyped BQ increased the accuracy of predictions of breeding values by up to 173%, depending on the size of the reference population and the trait considered. To initiate a breeding program, genotyping a minimum number of 1000 queens per year is required. In this case, genetic gain was highest when genomic preselection of DPQ was coupled with the genotyping of 10–20% of the phenotyped BQ. For maximum genetic gain per used genotype, more than 2500 genotyped queens per year and preselection of all BQ and DPQ are required. Conclusions This study shows that the first priority in a breeding program is to genotype phenotyped BQ to obtain a sufficiently large reference population, which allows successful genomic preselection of queens. To maximize genetic gain, DPQ should be preselected, and their genotypes included in the genomic relationship matrix. We suggest, that the developed methods for genomic prediction are suitable for implementation in genomic honey bee breeding programs.


2018 ◽  
Vol 98 (4) ◽  
pp. 750-759 ◽  
Author(s):  
Z. Karimi ◽  
M. Sargolzaei ◽  
J.A.B. Robinson ◽  
F.S. Schenkel

A single-nucleotide polymorphisms-based genomic relationship matrix (GSNP) discriminate less identity by state from identity by descent (IBD) alleles compared with a multi-locus haplotype-based relationship matrix (GHAP), which can better capture IBD alleles and recent relationships. We aimed to compare the prediction reliability and prediction bias of genomic best linear unbiased prediction (GBLUP) using either GSNP or GHAP in Holstein cattle. Therefore, a total of 57 traits with a wide range of heritability values were analyzed. Classical validation tests were done using a validation dataset comprised of 50k genotype records of 561–669 proven bulls born in 2010–2011 with an official estimated breeding value (EBV) in 2016 and a training set of 5314–19 678 bulls born before 2010, depending on the trait. The method for building the genomic relationship matrix (G) had significant, but small effect on observed reliability (r2GEBV) (p < 0.0001) and bias (p < 0.0001). A significant interaction between G and the level of trait heritability on r2GEBV and bias was also observed (p < 0.0001). The small gains in r2GEBV and small reductions in the bias by using GHAPBLUP were increased when predicting moderate to high-heritability traits compared with low-heritability traits.


2019 ◽  
Vol 51 (1) ◽  
Author(s):  
Pascal Duenk ◽  
Mario P. L. Calus ◽  
Yvonne C. J. Wientjes ◽  
Vivian P. Breen ◽  
John M. Henshall ◽  
...  

Following publication of original article [1], we noticed that there was an error: Eq. (3) on page 5 is the genomic relationship matrix that


2014 ◽  
Vol 54 (5) ◽  
pp. 544 ◽  
Author(s):  
N. Moghaddar ◽  
A. A. Swan ◽  
J. H. J. van der Werf

The objective of this study was to predict the accuracy of genomic prediction for 26 traits, including weight, muscle, fat, and wool quantity and quality traits, in Australian sheep based on a large, multi-breed reference population. The reference population consisted of two research flocks, with the main breeds being Merino, Border Leicester (BL), Poll Dorset (PD), and White Suffolk (WS). The genomic estimated breeding value (GEBV) was based on GBLUP (genomic best linear unbiased prediction), applying a genomic relationship matrix calculated from the 50K Ovine SNP chip marker genotypes. The accuracy of GEBV was evaluated as the Pearson correlation coefficient between GEBV and accurate estimated breeding value based on progeny records in a set of genotyped industry animals. The accuracies of weight traits were relatively low to moderate in PD and WS breeds (0.11–0.27) and moderate to relatively high in BL and Merino (0.25–0.63). The accuracy of muscle and fat traits was moderate to relatively high across all breeds (between 0.21 and 0.55). The accuracy of GEBV of yearling and adult wool traits in Merino was, on average, high (0.33–0.75). The results showed the accuracy of genomic prediction depends on trait heritability and the effective size of the reference population, whereas the observed GEBV accuracies were more related to the breed proportions in the multi-breed reference population. No extra gain in within-breed GEBV accuracy was observed based on across breed information. More investigations are required to determine the precise effect of across-breed information on within-breed genomic prediction.


Sign in / Sign up

Export Citation Format

Share Document