Haplotype Reconstruction in Connected Tetraploid F1 Populations

Abstract In diploid species, many multiparental populations have been developed to increase genetic diversity and quantitative trait loci (QTL) mapping resolution. In these populations, haplotype reconstruction has been used as a standard practice to increase the power of QTL detection in comparison with the marker-based association analysis. However, such software tools for polyploid species are few and limited to a single biparental F1 population. In this paper, a statistical framework for haplotype reconstruction has been developed and implemented in the software PolyOrigin for connected tetraploid F1 populations with shared parents, regardless of the number of parents or mating design. Given a genetic or physical map of markers, PolyOrigin first phases parental genotypes, then refines the input marker map, and finally reconstructs offspring haplotypes. PolyOrigin can utilize single nucleotide polymorphism (SNP) data coming from arrays or from sequence-based genotyping; in the latter case, bi-allelic read counts can be used (and are preferred) as input data to minimize the influence of genotype calling errors at low depth. With extensive simulation we show that PolyOrigin is robust to the errors in the input genotypic data and marker map. It works well for various population designs with ≥ offspring per parent and for sequences with read depth as low as 10x. PolyOrigin was further evaluated using an autotetraploid potato dataset with a 3 × 3 half-diallel mating design. In conclusion, PolyOrigin opens up exciting new possibilities for haplotype analysis in tetraploid breeding populations.

Download Full-text

Haplotype Reconstruction in Connected Tetraploid F1 Populations

10.1101/2020.12.18.423519 ◽

2020 ◽

Cited By ~ 1

Author(s):

Chaozhi Zheng ◽

Rodrigo R. Amadeu ◽

Patricio R. Munoz ◽

Jeffrey B. Endelman

Keyword(s):

Diploid Species ◽

Haplotype Reconstruction ◽

Single Nucleotide ◽

Ploidy Levels ◽

Mating Design ◽

Statistical Framework ◽

Breeding Populations ◽

Diallel Mating Design ◽

Half Diallel ◽

Genotype Probabilities

AbstractIn diploid species, many multi-parental populations have been developed to increase genetic diversity and quantitative trait loci (QTL) mapping resolution. In these populations, haplotype reconstruction has been used as a standard practice to increase QTL detection power in comparison with the marker-based association analysis. To realize similar benefits in tetraploid species (and eventually higher ploidy levels), a statistical framework for haplotype reconstruction has been developed and implemented in the software PolyOrigin for connected tetraploid F1 populations with shared parents. Haplotype reconstruction proceeds in two steps: first, parental genotypes are phased based on multi-locus linkage analysis; second, genotype probabilities for the parental alleles are inferred in the progeny. PolyOrigin can utilize genetic marker data from single nucleotide polymorphism (SNP) arrays or from sequence-based genotyping; in the latter case, bi-allelic read counts can be used (and are preferred) as input data to minimize the influence of genotype call errors at low depth. To account for errors in the input map, PolyOrigin includes functionality for filtering markers, inferring inter-marker distances, and refining local marker ordering. Simulation studies were used to investigate the effect of several variables on the accuracy of haplotype reconstruction, including the mating design, the number of parents, population size, and sequencing depth. PolyOrigin was further evaluated using an autotetraploid potato dataset with a 3×3 half-diallel mating design. In conclusion, PolyOrigin opens up exciting new possibilities for haplotype analysis in tetraploid breeding populations.

Download Full-text

Estimation of effective population size using single-nucleotide polymorphism (SNP) data in Jeju horse

Journal of Animal Science and Technology ◽

10.1186/2055-0391-56-28 ◽

2014 ◽

Vol 56 (1) ◽

pp. 28 ◽

Cited By ~ 7

Author(s):

Kyoung-Tag Do ◽

Joon-Ho Lee ◽

Hak-Kyo Lee ◽

Jun Kim ◽

Kyung-Do Park

Keyword(s):

Single Nucleotide Polymorphism ◽

Population Size ◽

Effective Population Size ◽

Nucleotide Polymorphism ◽

Effective Population ◽

Single Nucleotide ◽

Snp Data ◽

Jeju Horse

Download Full-text

Single Nucleotide Polymorphism Discovery and Genetic Differentiation Analysis of Geese Bred in Poland, Using Genotyping-by-Sequencing (GBS)

Genes ◽

10.3390/genes12071074 ◽

2021 ◽

Vol 12 (7) ◽

pp. 1074

Author(s):

Joanna Grzegorczyk ◽

Artur Gurgul ◽

Maria Oczkowicz ◽

Tomasz Szmatoła ◽

Agnieszka Fornal ◽

...

Keyword(s):

Genotyping By Sequencing ◽

Read Depth ◽

Model Organisms ◽

Single Nucleotide Polymorphism Discovery ◽

Nucleotide Polymorphisms ◽

Single Nucleotide ◽

Polymorphism Discovery ◽

Genome Wide ◽

Plumage Development ◽

Edar Gene

Poland is the largest European producer of goose, while goose breeding has become an essential and still increasing branch of the poultry industry. The most frequently bred goose is the White Kołuda® breed, constituting 95% of the country’s population, whereas geese of regional varieties are bred in smaller, conservation flocks. However, a goose’s genetic diversity is inaccurately explored, mainly because the advantages of the most commonly used tools are strongly limited in non-model organisms. One of the most accurate used markers for population genetics is single nucleotide polymorphisms (SNP). A highly efficient strategy for genome-wide SNP detection is genotyping-by-sequencing (GBS), which has been already widely applied in many organisms. This study attempts to use GBS in 12 conservative goose breeds and the White Kołuda® breed maintained in Poland. The GBS method allowed for the detection of 3833 common raw SNPs. Nevertheless, after filtering for read depth and alleles characters, we obtained the final markers panel used for a differentiation analysis that comprised 791 SNPs. These variants were located within 11 different genes, and one of the most diversified variants was associated with the EDAR gene, which is especially interesting as it participates in the plumage development, which plays a crucial role in goose breeding.

Download Full-text

Physiological Bases of Across-Environment and Environment-Specific Responses for Grain Yield in Maize Hybrids Obtained from a Full Diallel Mating Design

Crop Science ◽

10.2135/cropsci2017.05.0293 ◽

2018 ◽

Vol 58 (1) ◽

pp. 180-191 ◽

Cited By ~ 1

Author(s):

Eugenia M. Munaro ◽

Abelardo J. de la Vega ◽

Karina E. D'Andrea ◽

Carlos D. Messina ◽

Maria E. Otegui

Keyword(s):

Grain Yield ◽

Maize Hybrids ◽

Mating Design ◽

Diallel Mating Design ◽

Diallel Mating

Download Full-text

A publicly available repository of ROH islands reveals signatures of selection in different livestock and pet species

Genetics Selection Evolution ◽

10.1186/s12711-020-00599-7 ◽

2021 ◽

Vol 53 (1) ◽

Author(s):

Wim Gorssen ◽

Roel Meyermans ◽

Steven Janssens ◽

Nadine Buys

Keyword(s):

Association Studies ◽

Genome Wide Association Studies ◽

Single Nucleotide ◽

Future Studies ◽

Snp Data ◽

Animal Populations ◽

Genome Wide ◽

Starting Point ◽

Different Populations ◽

Signatures Of Selection

Abstract Background Runs of homozygosity (ROH) have become the state-of-the-art method for analysis of inbreeding in animal populations. Moreover, ROH are suited to detect signatures of selection via ROH islands and are used in other applications, such as genomic prediction and genome-wide association studies (GWAS). Currently, a vast amount of single nucleotide polymorphism (SNP) data is available online, but most of these data have never been used for ROH analysis. Therefore, we performed a ROH analysis on large medium-density SNP datasets in eight animal species (cat, cattle, dog, goat, horse, pig, sheep and water buffalo; 442 different populations) and make these results publicly available. Results The results include an overview of ROH islands per population and a comparison of the incidence of these ROH islands among populations from the same species, which can assist researchers when studying other (livestock) populations or when looking for similar signatures of selection. We were able to confirm many known ROH islands, for example signatures of selection for the myostatin (MSTN) gene in sheep and horses. However, our results also included multiple other ROH islands, which are common to many populations and not identified to date (e.g. on chromosomes D4 and E2 in cats and on chromosome 6 in sheep). Conclusions We are confident that our repository of ROH islands is a valuable reference for future studies. The discovered ROH island regions represent a unique starting point for new studies or can be used as a reference for future studies. Furthermore, we encourage authors to add their population-specific ROH findings to our repository.

Download Full-text

Species trees from consensus single nucleotide polymorphism (SNP) data: Testing phylogenetic approaches with simulated and empirical data

Molecular Phylogenetics and Evolution ◽

10.1016/j.ympev.2017.07.018 ◽

2017 ◽

Vol 116 ◽

pp. 192-201 ◽

Cited By ~ 6

Author(s):

Alexander N. Schmidt-Lebuhn ◽

Nicola C. Aitken ◽

Aaron Chuah

Keyword(s):

Single Nucleotide Polymorphism ◽

Empirical Data ◽

Species Trees ◽

Nucleotide Polymorphism ◽

Single Nucleotide ◽

Snp Data

Download Full-text

Combining ability for yield and yield components in six parents and their 15 F1 hybrids of sesame (Sesamum indicum L.) in half diallel mating design

Journal of Plant Breeding and Crop Science ◽

10.5897/jpbcs2014.0481 ◽

2014 ◽

Vol 6 (12) ◽

pp. 179-184 ◽

Cited By ~ 1

Author(s):

Balla M. Ahmed Salah ◽

I. Adam Shahira

Keyword(s):

Combining Ability ◽

Yield Components ◽

Sesamum Indicum ◽

F1 Hybrids ◽

Mating Design ◽

Diallel Mating Design ◽

Yield And Yield Components ◽

Half Diallel ◽

Diallel Mating

Download Full-text

Relatedness and genetic variation in wild and captive populations of Mountain Bongo in Kenya obtained from genome-wide single-nucleotide polymorphism (SNP) data

Global Ecology and Conservation ◽

10.1016/j.gecco.2017.07.001 ◽

2017 ◽

Vol 11 ◽

pp. 196-206 ◽

Cited By ~ 3

Author(s):

Henrik Svengren ◽

Mike Prettejohn ◽

Donald Bunge ◽

Peter Fundi ◽

Mats Björklund

Keyword(s):

Single Nucleotide Polymorphism ◽

Genetic Variation ◽

Nucleotide Polymorphism ◽

Single Nucleotide ◽

Snp Data ◽

Captive Populations ◽

Genome Wide

Download Full-text

Strategies to assure optimal trade-offs among competing objectives for genetic improvement of soybean

10.1101/2021.02.19.431938 ◽

2021 ◽

Author(s):

Vishnu Ramasubramanian ◽

William Beavis

Keyword(s):

Genomic Selection ◽

Genetic Improvement ◽

Genetic Variance ◽

Breeding Strategies ◽

Short Term ◽

Genetic Gains ◽

Mating Design ◽

Genetic Potential ◽

Breeding Populations

AbstractPlant breeding is a decision making discipline based on understanding project objectives. Genetic improvement projects can have two competing objectives: maximize rate of genetic improvement and minimize loss of useful genetic variance. For commercial plant breeders competition in the marketplace forces greater emphasis on maximizing immediate genetic improvements. In contrast public plant breeders have an opportunity, perhaps an obligation, to place greater emphasis on minimizing loss of useful genetic variance while realizing genetic improvements. Considerable research indicates that short term genetic gains from Genomic Selection (GS) are much greater than Phenotypic Selection (PS), while PS provides better long term genetic gains because PS retains useful genetic diversity during the early cycles of selection. With limited resources must a soybean breeder choose between the two extreme responses provided by GS or PS? Or is it possible to develop novel breeding strategies that will provide a desirable compromise between the competing objectives? To address these questions, we decomposed breeding strategies into decisions about selection methods, mating designs and whether the breeding population should be organized as family islands. For breeding populations organized into islands decisions about possible migration rules among family islands were included. From among 60 possible strategies, genetic improvement is maximized for the first five to ten cycles using GS, a hub network mating design in breeding populations organized as fully connected family islands and migration rules allowing exchange of two lines among islands every other cycle of selection. If the objectives are to maximize both short-term and long-term gains, then the best compromise strategy is similar except a genomic mating design, instead of a hub networked mating design, is used. This strategy also resulted in realizing the greatest proportion of genetic potential of the founder populations. Weighted genomic selection applied to both non-isolated and island populations also resulted in realization of the greatest proportion of genetic potential of the founders, but required more cycles than the best compromise strategy.

Download Full-text

Strategies to Assure Optimal Trade-Offs Among Competing Objectives for the Genetic Improvement of Soybean

Frontiers in Genetics ◽

10.3389/fgene.2021.675500 ◽

2021 ◽

Vol 12 ◽

Author(s):

Vishnu Ramasubramanian ◽

William D. Beavis

Keyword(s):

Genomic Selection ◽

Genetic Improvement ◽

Genetic Variance ◽

Phenotypic Selection ◽

Breeding Strategies ◽

Short Term ◽

Mating Design ◽

Breeding Populations ◽

Hub Network

Plant breeding is a decision-making discipline based on understanding project objectives. Genetic improvement projects can have two competing objectives: maximize the rate of genetic improvement and minimize the loss of useful genetic variance. For commercial plant breeders, competition in the marketplace forces greater emphasis on maximizing immediate genetic improvements. In contrast, public plant breeders have an opportunity, perhaps an obligation, to place greater emphasis on minimizing the loss of useful genetic variance while realizing genetic improvements. Considerable research indicates that short-term genetic gains from genomic selection are much greater than phenotypic selection, while phenotypic selection provides better long-term genetic gains because it retains useful genetic diversity during the early cycles of selection. With limited resources, must a soybean breeder choose between the two extreme responses provided by genomic selection or phenotypic selection? Or is it possible to develop novel breeding strategies that will provide a desirable compromise between the competing objectives? To address these questions, we decomposed breeding strategies into decisions about selection methods, mating designs, and whether the breeding population should be organized as family islands. For breeding populations organized into islands, decisions about possible migration rules among family islands were included. From among 60 possible strategies, genetic improvement is maximized for the first five to 10 cycles using genomic selection and a hub network mating design, where the hub parents with the largest selection metric make large parental contributions. It also requires that the breeding populations be organized as fully connected family islands, where every island is connected to every other island, and migration rules allow the exchange of two lines among islands every other cycle of selection. If the objectives are to maximize both short-term and long-term gains, then the best compromise strategy is similar except that the mating design could be hub network, chain rule, or a multi-objective optimization method-based mating design. Weighted genomic selection applied to centralized populations also resulted in the realization of the greatest proportion of the genetic potential of the founders but required more cycles than the best compromise strategy.

Download Full-text