scholarly journals Haplotype Reconstruction in Connected Tetraploid F1 Populations

Genetics ◽  
2021 ◽  
Author(s):  
Chaozhi Zheng ◽  
Rodrigo R Amadeu ◽  
Patricio R Munoz ◽  
Jeffrey B Endelman

Abstract In diploid species, many multiparental populations have been developed to increase genetic diversity and quantitative trait loci (QTL) mapping resolution. In these populations, haplotype reconstruction has been used as a standard practice to increase the power of QTL detection in comparison with the marker-based association analysis. However, such software tools for polyploid species are few and limited to a single biparental F1 population. In this paper, a statistical framework for haplotype reconstruction has been developed and implemented in the software PolyOrigin for connected tetraploid F1 populations with shared parents, regardless of the number of parents or mating design. Given a genetic or physical map of markers, PolyOrigin first phases parental genotypes, then refines the input marker map, and finally reconstructs offspring haplotypes. PolyOrigin can utilize single nucleotide polymorphism (SNP) data coming from arrays or from sequence-based genotyping; in the latter case, bi-allelic read counts can be used (and are preferred) as input data to minimize the influence of genotype calling errors at low depth. With extensive simulation we show that PolyOrigin is robust to the errors in the input genotypic data and marker map. It works well for various population designs with ≥ offspring per parent and for sequences with read depth as low as 10x. PolyOrigin was further evaluated using an autotetraploid potato dataset with a 3 × 3 half-diallel mating design. In conclusion, PolyOrigin opens up exciting new possibilities for haplotype analysis in tetraploid breeding populations.

Author(s):  
Chaozhi Zheng ◽  
Rodrigo R. Amadeu ◽  
Patricio R. Munoz ◽  
Jeffrey B. Endelman

AbstractIn diploid species, many multi-parental populations have been developed to increase genetic diversity and quantitative trait loci (QTL) mapping resolution. In these populations, haplotype reconstruction has been used as a standard practice to increase QTL detection power in comparison with the marker-based association analysis. To realize similar benefits in tetraploid species (and eventually higher ploidy levels), a statistical framework for haplotype reconstruction has been developed and implemented in the software PolyOrigin for connected tetraploid F1 populations with shared parents. Haplotype reconstruction proceeds in two steps: first, parental genotypes are phased based on multi-locus linkage analysis; second, genotype probabilities for the parental alleles are inferred in the progeny. PolyOrigin can utilize genetic marker data from single nucleotide polymorphism (SNP) arrays or from sequence-based genotyping; in the latter case, bi-allelic read counts can be used (and are preferred) as input data to minimize the influence of genotype call errors at low depth. To account for errors in the input map, PolyOrigin includes functionality for filtering markers, inferring inter-marker distances, and refining local marker ordering. Simulation studies were used to investigate the effect of several variables on the accuracy of haplotype reconstruction, including the mating design, the number of parents, population size, and sequencing depth. PolyOrigin was further evaluated using an autotetraploid potato dataset with a 3×3 half-diallel mating design. In conclusion, PolyOrigin opens up exciting new possibilities for haplotype analysis in tetraploid breeding populations.


Genes ◽  
2021 ◽  
Vol 12 (7) ◽  
pp. 1074
Author(s):  
Joanna Grzegorczyk ◽  
Artur Gurgul ◽  
Maria Oczkowicz ◽  
Tomasz Szmatoła ◽  
Agnieszka Fornal ◽  
...  

Poland is the largest European producer of goose, while goose breeding has become an essential and still increasing branch of the poultry industry. The most frequently bred goose is the White Kołuda® breed, constituting 95% of the country’s population, whereas geese of regional varieties are bred in smaller, conservation flocks. However, a goose’s genetic diversity is inaccurately explored, mainly because the advantages of the most commonly used tools are strongly limited in non-model organisms. One of the most accurate used markers for population genetics is single nucleotide polymorphisms (SNP). A highly efficient strategy for genome-wide SNP detection is genotyping-by-sequencing (GBS), which has been already widely applied in many organisms. This study attempts to use GBS in 12 conservative goose breeds and the White Kołuda® breed maintained in Poland. The GBS method allowed for the detection of 3833 common raw SNPs. Nevertheless, after filtering for read depth and alleles characters, we obtained the final markers panel used for a differentiation analysis that comprised 791 SNPs. These variants were located within 11 different genes, and one of the most diversified variants was associated with the EDAR gene, which is especially interesting as it participates in the plumage development, which plays a crucial role in goose breeding.


Crop Science ◽  
2018 ◽  
Vol 58 (1) ◽  
pp. 180-191 ◽  
Author(s):  
Eugenia M. Munaro ◽  
Abelardo J. de la Vega ◽  
Karina E. D'Andrea ◽  
Carlos D. Messina ◽  
Maria E. Otegui

2021 ◽  
Vol 53 (1) ◽  
Author(s):  
Wim Gorssen ◽  
Roel Meyermans ◽  
Steven Janssens ◽  
Nadine Buys

Abstract Background Runs of homozygosity (ROH) have become the state-of-the-art method for analysis of inbreeding in animal populations. Moreover, ROH are suited to detect signatures of selection via ROH islands and are used in other applications, such as genomic prediction and genome-wide association studies (GWAS). Currently, a vast amount of single nucleotide polymorphism (SNP) data is available online, but most of these data have never been used for ROH analysis. Therefore, we performed a ROH analysis on large medium-density SNP datasets in eight animal species (cat, cattle, dog, goat, horse, pig, sheep and water buffalo; 442 different populations) and make these results publicly available. Results The results include an overview of ROH islands per population and a comparison of the incidence of these ROH islands among populations from the same species, which can assist researchers when studying other (livestock) populations or when looking for similar signatures of selection. We were able to confirm many known ROH islands, for example signatures of selection for the myostatin (MSTN) gene in sheep and horses. However, our results also included multiple other ROH islands, which are common to many populations and not identified to date (e.g. on chromosomes D4 and E2 in cats and on chromosome 6 in sheep). Conclusions We are confident that our repository of ROH islands is a valuable reference for future studies. The discovered ROH island regions represent a unique starting point for new studies or can be used as a reference for future studies. Furthermore, we encourage authors to add their population-specific ROH findings to our repository.


2021 ◽  
Author(s):  
Vishnu Ramasubramanian ◽  
William Beavis

AbstractPlant breeding is a decision making discipline based on understanding project objectives. Genetic improvement projects can have two competing objectives: maximize rate of genetic improvement and minimize loss of useful genetic variance. For commercial plant breeders competition in the marketplace forces greater emphasis on maximizing immediate genetic improvements. In contrast public plant breeders have an opportunity, perhaps an obligation, to place greater emphasis on minimizing loss of useful genetic variance while realizing genetic improvements. Considerable research indicates that short term genetic gains from Genomic Selection (GS) are much greater than Phenotypic Selection (PS), while PS provides better long term genetic gains because PS retains useful genetic diversity during the early cycles of selection. With limited resources must a soybean breeder choose between the two extreme responses provided by GS or PS? Or is it possible to develop novel breeding strategies that will provide a desirable compromise between the competing objectives? To address these questions, we decomposed breeding strategies into decisions about selection methods, mating designs and whether the breeding population should be organized as family islands. For breeding populations organized into islands decisions about possible migration rules among family islands were included. From among 60 possible strategies, genetic improvement is maximized for the first five to ten cycles using GS, a hub network mating design in breeding populations organized as fully connected family islands and migration rules allowing exchange of two lines among islands every other cycle of selection. If the objectives are to maximize both short-term and long-term gains, then the best compromise strategy is similar except a genomic mating design, instead of a hub networked mating design, is used. This strategy also resulted in realizing the greatest proportion of genetic potential of the founder populations. Weighted genomic selection applied to both non-isolated and island populations also resulted in realization of the greatest proportion of genetic potential of the founders, but required more cycles than the best compromise strategy.


2021 ◽  
Vol 12 ◽  
Author(s):  
Vishnu Ramasubramanian ◽  
William D. Beavis

Plant breeding is a decision-making discipline based on understanding project objectives. Genetic improvement projects can have two competing objectives: maximize the rate of genetic improvement and minimize the loss of useful genetic variance. For commercial plant breeders, competition in the marketplace forces greater emphasis on maximizing immediate genetic improvements. In contrast, public plant breeders have an opportunity, perhaps an obligation, to place greater emphasis on minimizing the loss of useful genetic variance while realizing genetic improvements. Considerable research indicates that short-term genetic gains from genomic selection are much greater than phenotypic selection, while phenotypic selection provides better long-term genetic gains because it retains useful genetic diversity during the early cycles of selection. With limited resources, must a soybean breeder choose between the two extreme responses provided by genomic selection or phenotypic selection? Or is it possible to develop novel breeding strategies that will provide a desirable compromise between the competing objectives? To address these questions, we decomposed breeding strategies into decisions about selection methods, mating designs, and whether the breeding population should be organized as family islands. For breeding populations organized into islands, decisions about possible migration rules among family islands were included. From among 60 possible strategies, genetic improvement is maximized for the first five to 10 cycles using genomic selection and a hub network mating design, where the hub parents with the largest selection metric make large parental contributions. It also requires that the breeding populations be organized as fully connected family islands, where every island is connected to every other island, and migration rules allow the exchange of two lines among islands every other cycle of selection. If the objectives are to maximize both short-term and long-term gains, then the best compromise strategy is similar except that the mating design could be hub network, chain rule, or a multi-objective optimization method-based mating design. Weighted genomic selection applied to centralized populations also resulted in the realization of the greatest proportion of the genetic potential of the founders but required more cycles than the best compromise strategy.


Sign in / Sign up

Export Citation Format

Share Document