Using Probabilistic Genotypes in Linkage Analysis of polyploids

Abstract Marker genotypes are generally called as discrete values: homozygous versus heterozygous in the case of diploids, or an integer allele dosage in the case of polyploids. Software for linkage map construction and/or QTL analysis usually relies on such discrete genotypes. However, it may not always be possible, or desirable, to assign definite values to genotype observations in the presence of uncertainty in the genotype calling. Here, we present an approach that uses probabilistic marker dosages for linkage map construction in polyploids. We compare our method to an approach based on discrete dosages, using simulated SNP array and sequence reads data with varying levels of data quality. We validate our approach using experimental data from a potato (Solanum tuberosum L.) SNP array applied to an F1 mapping population. In comparison to the approach based on discrete dosages, we mapped an additional 562 markers. All but three of these were mapped to the expected chromosome and marker position. For the remaining three markers, no physical position was known. The use of dosage probabilities is of particular relevance for map construction in polyploids using sequencing data, as these often result in a higher level of uncertainty regarding allele dosage.

Download Full-text

Using probabilistic genotypes in linkage analysis of polyploids

Theoretical and Applied Genetics ◽

10.1007/s00122-021-03834-x ◽

2021 ◽

Author(s):

Yanlin Liao ◽

Roeland E. Voorrips ◽

Peter M. Bourke ◽

Giorgio Tumino ◽

Paul Arens ◽

...

Keyword(s):

Linkage Map ◽

Linkage Mapping ◽

Mapping Population ◽

Snp Array ◽

Linkage Maps ◽

Sequencing Data ◽

Genotype Calling ◽

Map Construction ◽

Allele Dosage ◽

Discrete Values

Abstract Key message In polyploids, linkage mapping is carried out using genotyping with discrete dosage scores. Here, we use probabilistic genotypes and we validate it for the construction of polyploid linkage maps. Abstract Marker genotypes are generally called as discrete values: homozygous versus heterozygous in the case of diploids, or an integer allele dosage in the case of polyploids. Software for linkage map construction and/or QTL analysis usually relies on such discrete genotypes. However, it may not always be possible, or desirable, to assign definite values to genotype observations in the presence of uncertainty in the genotype calling. Here, we present an approach that uses probabilistic marker dosages for linkage map construction in polyploids. We compare our method to an approach based on discrete dosages, using simulated SNP array and sequence reads data with varying levels of data quality. We validate our approach using experimental data from a potato (Solanum tuberosum L.) SNP array applied to an F1 mapping population. In comparison to the approach based on discrete dosages, we mapped an additional 562 markers. All but three of these were mapped to the expected chromosome and marker position. For the remaining three markers, no physical position was known. The use of dosage probabilities is of particular relevance for map construction in polyploids using sequencing data, as these often result in a higher level of uncertainty regarding allele dosage.

Download Full-text

Genomic Resource Development for Hydrangea (Hydrangea macrophylla (Thunb.) Ser.)—A Transcriptome Assembly and a High-Density Genetic Linkage Map

Horticulturae ◽

10.3390/horticulturae7020025 ◽

2021 ◽

Vol 7 (2) ◽

pp. 25

Author(s):

Xingbo Wu ◽

Amanda M. Hulse-Kemp ◽

Phillip A. Wadl ◽

Zach Smith ◽

Keithanne Mockaitis ◽

...

Keyword(s):

Genetic Resources ◽

Linkage Map ◽

Genetic Linkage ◽

Genetic Linkage Map ◽

Mapping Population ◽

High Density ◽

Linkage Maps ◽

Map Construction ◽

Hydrangea Macrophylla ◽

Genetic Linkage Maps

Hydrangea (Hydrangea macrophylla) is an important ornamental crop that has been cultivated for more than 300 years. Despite the economic importance, genetic studies for hydrangea have been limited by the lack of genetic resources. Genetic linkage maps and subsequent trait mapping are essential tools to identify and make markers available for marker-assisted breeding. A transcriptomic study was performed on two important cultivars, Veitchii and Endless Summer, to discover simple sequence repeat (SSR) markers and an F1 population based on the cross ‘Veitchii’ × ‘Endless Summer’ was established for genetic linkage map construction. Genotyping by sequencing (GBS) was performed on the mapping population along with SSR genotyping. From an analysis of 42,682 putative transcripts, 8780 SSRs were identified and 1535 were validated in the mapping parents. A total of 267 polymorphic SSRs were selected for linkage map construction. The GBS yielded 3923 high quality single nucleotide polymorphisms (SNPs) in the mapping population, resulting in a total of 4190 markers that were used to generate maps for each parent and a consensus map. The consensus linkage map contained 1767 positioned markers (146 SSRs and 1621 SNPs), spanned 1383.4 centiMorgans (cM), and was comprised of 18 linkage groups, with an average mapping interval of 0.8 cM. The transcriptome information and large-scale marker development in this study greatly expanded the genetic resources that are available for hydrangea. The high-density genetic linkage maps presented here will serve as an important foundation for quantitative trait loci mapping, map-based gene cloning, and marker-assisted selection of H. macrophylla.

Download Full-text

Genetic Linkage Map Construction and QTL Mapping for Yield and Fi-ber Quality in Upland Cotton (Gossypium hirsutum L.)

ACTA AGRONOMICA SINICA ◽

10.3724/sp.j.1006.2008.01199 ◽

2008 ◽

Vol 34 (7) ◽

pp. 1199-1205 ◽

Cited By ~ 18

Author(s):

Li CHEN

Keyword(s):

Qtl Mapping ◽

Gossypium Hirsutum ◽

Linkage Map ◽

Upland Cotton ◽

Genetic Linkage ◽

Genetic Linkage Map ◽

Map Construction

Download Full-text

Genetic dissection of rhizome yield-related traits in Nelumbo nucifera through genetic linkage map construction and QTL mapping

Plant Physiology and Biochemistry ◽

10.1016/j.plaphy.2021.01.020 ◽

2021 ◽

Vol 160 ◽

pp. 155-165

Author(s):

Longyu Huang ◽

Ming Li ◽

Dingding Cao ◽

Pingfang Yang

Keyword(s):

Qtl Mapping ◽

Linkage Map ◽

Genetic Linkage ◽

Genetic Linkage Map ◽

Nelumbo Nucifera ◽

Genetic Dissection ◽

Map Construction

Download Full-text

Chromosome-Level Assembly of the Common Lizard (Zootoca vivipara) Genome

Genome Biology and Evolution ◽

10.1093/gbe/evaa161 ◽

2020 ◽

Vol 12 (11) ◽

pp. 1953-1960

Author(s):

Andrey A Yurchenko ◽

Hans Recknagel ◽

Kathryn R Elmer

Keyword(s):

Linkage Map ◽

Single Copy ◽

Phenotypic Traits ◽

Sequencing Data ◽

High Coverage ◽

Squamate Reptiles ◽

Common Lizard ◽

Zootoca Vivipara ◽

The Common ◽

Chromosome Level

Abstract Squamate reptiles exhibit high variation in their phenotypic traits and geographical distributions and are therefore fascinating taxa for evolutionary and ecological research. However, genomic resources are very limited for this group of species, consequently inhibiting research efforts. To address this gap, we assembled a high-quality genome of the common lizard, Zootoca vivipara (Lacertidae), using a combination of high coverage Illumina (shotgun and mate-pair) and PacBio sequencing data, coupled with RNAseq data and genetic linkage map generation. The 1.46-Gb genome assembly has a scaffold N50 of 11.52 Mb with N50 contig size of 220.4 kb and only 2.96% gaps. A BUSCO analysis indicates that 97.7% of the single-copy Tetrapoda orthologs were recovered in the assembly. In total, 19,829 gene models were annotated to the genome using a combination of ab initio and homology-based methods. To improve the chromosome-level assembly, we generated a high-density linkage map from wild-caught families and developed a novel analytical pipeline to accommodate multiple paternity and unknown father genotypes. We successfully anchored and oriented almost 90% of the genome on 19 linkage groups. This annotated and oriented chromosome-level reference genome represents a valuable resource to facilitate evolutionary studies in squamate reptiles.

Download Full-text

Identification of a major locus conferring resistance to powdery mildew (Erysiphe polygoni DC) in mungbean (Vigna radiata L. Wilczek) by QTL analysis

Genome ◽

10.1139/g03-057 ◽

2003 ◽

Vol 46 (5) ◽

pp. 738-744 ◽

Cited By ~ 34

Author(s):

M E Humphry ◽

T Magner ◽

C L McIntyre ◽

E A.B Aitken ◽

C J Liu

Keyword(s):

Powdery Mildew ◽

Linkage Map ◽

Qtl Analysis ◽

Vigna Radiata ◽

Mapping Population ◽

Susceptible Variety ◽

Resistance Response ◽

Causal Organism ◽

Major Locus ◽

Erysiphe Polygoni

A major locus conferring resistance to the causal organism of powdery mildew, Erysiphe polygoni DC, in mungbean (Vigna radiata L. Wilczek) was identified using QTL analysis with a population of 147 recombinant inbred individuals. The population was derived from a cross between 'Berken', a highly susceptible variety, and ATF 3640, a highly resistant line. To test for response to powdery mildew, F7 and F8 lines were inoculated by dispersing decaying mungbean leaves with residual conidia of E. polygoni amongst the young plants to create an artificial epidemic and assayed in a glasshouse facility. To generate a linkage map, 322 RFLP clones were tested against the two parents and 51 of these were selected to screen the mapping population. The 51 probes generated 52 mapped loci, which were used to construct a linkage map spanning 350 cM of the mungbean genome over 10 linkage groups. Using these markers, a single locus was identified that explained up to a maximum of 86% of the total variation in the resistance response to the pathogen.Key words: mungbean, powdery mildew, Erysiphe polygoni, QTL, molecular markers.

Download Full-text

Genotyping-by-sequencing (GBS) for SNP-based linkage map construction for two Prunus rootstocks from a peach rootstock breeding program

Acta Horticulturae ◽

10.17660/actahortic.2021.1304.18 ◽

2021 ◽

pp. 113-120 ◽

Cited By ~ 1

Author(s):

V. Guajardo ◽

S. Solís ◽

K. Gasic ◽

C. Saski ◽

C. Font i Forcada ◽

...

Keyword(s):

Linkage Map ◽

Genotyping By Sequencing ◽

Breeding Program ◽

Map Construction ◽

Rootstock Breeding

Download Full-text

Linkage map construction and QTL analysis for Betula platyphylla Suk using RAPD, AFLP, ISSR and SSR

Silvae Genetica ◽

10.1515/sg-2012-0001 ◽

2012 ◽

Vol 61 (1-6) ◽

pp. 1-9 ◽

Cited By ~ 3

Author(s):

Kaixuan Zhang ◽

Dan Wang ◽

Chuanping Yang ◽

Guanjun Liu ◽

Guifeng Liu ◽

...

Keyword(s):

Linkage Map ◽

Average Distance ◽

Interval Mapping ◽

Betula Platyphylla ◽

Stem Height ◽

Trait Variation ◽

Mapping Strategy ◽

Map Construction ◽

Betula Platyphylla Suk ◽

Segregating Population

AbstractA linkage map for Betula platyphylla Suk was constructed based on RAPD, ISSR, AFLP and SSR markers by a pseudo-testcross mapping strategy. A F1segregating population including 80 progenies was obtained from the cross between two superior trees selected from Qinghai and Wangqing provenance, respectively. The paternal map was constructed with 282 markers consisting of 14 major and 15 minor (5 triplets and 10 doublets) linkage groups and spanning 1131 cM at an average distance of 4.0 cM between adjacent markers. The maternal map has 277 markers consisting of 15 major and 8 minor (5 triplets and 3 doublets) groups covering 1288 cM at an average distance of 4.6 cM between adjacent markers. In the same pedigree we investigated association of genetic markers with seedling stem height and circumference. The composite interval mapping was used to detect the number of quantitative trait loci and their position on the genetic linkage maps. Three QTLs (one on the male map and two on the female map) were found explaining 13.4%, 17.5% and 18.8% of the trait variation, respectively.

Download Full-text

High-throughput inference of pairwise coalescence times identifies signals of selection and enriched disease heritability

10.1101/276931 ◽

2018 ◽

Cited By ~ 3

Author(s):

Pier Francesco Palamara ◽

Jonathan Terhorst ◽

Yun S. Song ◽

Alkes L. Price

Keyword(s):

Positive Selection ◽

Snp Array ◽

Complex Trait ◽

Joint Analysis ◽

Background Selection ◽

Sequencing Data ◽

Data Set ◽

Coalescence Time ◽

Recent Positive Selection ◽

Coalescence Times

AbstractInterest in reconstructing demographic histories has motivated the development of methods to estimate locus-specific pairwise coalescence times from whole-genome sequence data. We developed a new method, ASMC, that can estimate coalescence times using only SNP array data, and is 2-4 orders of magnitude faster than previous methods when sequencing data are available. We were thus able to apply ASMC to 113,851 phased British samples from the UK Biobank, aiming to detect recent positive selection by identifying loci with unusually high density of very recent coalescence times. We detected 12 genome-wide significant signals, including 6 loci with previous evidence of positive selection and 6 novel loci, consistent with coalescent simulations showing that our approach is well-powered to detect recent positive selection. We also applied ASMC to sequencing data from 498 Dutch individuals (Genome of the Netherlands data set) to detect background selection at deeper time scales. We observed highly significant correlations between average coalescence time inferred by ASMC and other measures of background selection. We investigated whether this signal translated into an enrichment in disease and complex trait heritability by analyzing summary association statistics from 20 independent diseases and complex traits (average N=86k) using stratified LD score regression. Our background selection annotation based on average coalescence time was strongly enriched for heritability (p = 7×10−153) in a joint analysis conditioned on a broad set of functional annotations (including other background selection annotations), meta-analyzed across traits; SNPs in the top 20% of our annotation were 3.8x enriched for heritability compared to the bottom 20%. These results underscore the widespread effects of background selection on disease and complex trait heritability.

Download Full-text

Integrative DNA copy number detection and genotyping from sequencing and array-based platforms

10.1101/172700 ◽

2017 ◽

Cited By ~ 2

Author(s):

Zilu Zhou ◽

Weixin Wang ◽

Li-San Wang ◽

Nancy Ruonan Zhang

Keyword(s):

Copy Number ◽

Association Studies ◽

Snp Array ◽

Supplementary Information ◽

Detection Accuracy ◽

Sequencing Data ◽

Array Data ◽

Combining Data ◽

Allele Specific ◽

Cnv Detection

AbstractMotivationCopy number variations (CNVs) are gains and losses of DNA segments and have been associated with disease. Many large-scale genetic association studies are performing CNV analysis using whole exome sequencing (WES) and whole genome sequencing (WGS). In many of these studies, previous SNP-array data are available. An integrated cross-platform analysis is expected to improve resolution and accuracy, yet there is no tool for effectively combining data from sequencing and array platforms. The detection of CNVs using sequencing data alone can also be further improved by the utilization of allele-specific reads.ResultsWe propose a statistical framework, integrated Copy Number Variation detection algorithm (iCNV), which can be applied to multiple study designs: WES only, WGS only, SNP array only, or any combination of SNP and sequencing data. iCNV applies platform specific normalization, utilizes allele specific reads from sequencing and integrates matched NGS and SNP-array data by a Hidden Markov Model (HMM). We compare integrated two-platform CNV detection using iCNV to naive intersection or union of platforms and show that iCNV increases sensitivity and robustness. We also assess the accuracy of iCNV on WGS data only, and show that the utilization of allele-specific reads improve CNV detection accuracy compared to existing methods.Availabilityhttps://github.com/zhouzilu/[email protected], [email protected] informationSupplementary data are available at Bioinformatics online.

Download Full-text