pgHMA: Application of the Heteroduplex Mobility Assay Analysis in Phylogenetics and Population Genetics

The Heteroduplex mobility assay (HMA) has proven to be a robust tool for the detection of genetic variation. Here, we describe a simple and rapid application of the HMA by microfluidic capillary electrophoresis, for phylogenetics and population genetic analyses (pgHMA). We show how commonly applied techniques in phylogenetics and population genetics have equivalents with pgHMA: phylogenetic reconstruction with bootstrapping, skyline plots, and mismatch distribution analysis. We assess the performance and accuracy of pgHMA by comparing the results obtained against those obtained using standard methods of analyses applied to sequencing data. The resulting comparisons demonstrate that: (1) there is a significant linear relationship (R = 0.992) between heteroduplex mobility and genetic distance; (2) phylogenetic trees obtained by HMA and nucleotide sequences present nearly identical topologies; (3) clades with high pgHMA parametric bootstrap support also have high bootstrap support on nucleotide phylogenies; (4) skyline plots estimated from the UPGMA trees of HMA and Bayesian trees of nucleotide data reveal similar trends, especially for the median trend estimate of effective population size; and (5) optimized mismatch distributions of HMA are closely fitted to the mismatch distributions of nucleotide sequences. In summary, pgHMA is an easily-applied method for approximating phylogenetic diversity and population trends. KEYWORDS: bootstrap, heteroduplex mobility assay, mismatch distribution, phylogenetics, skyline plot

Download Full-text

Comprehensive genotyping of a Brazilian cassava (Manihot esculenta Crantz) germplasm bank: insights into diversification and domestication

Theoretical and Applied Genetics ◽

10.1007/s00122-021-03775-5 ◽

2021 ◽

Vol 134 (5) ◽

pp. 1343-1362

Author(s):

Alex C. Ogbonna ◽

Luciano Rogerio Braatz de Andrade ◽

Lukas A. Mueller ◽

Eder Jorge de Oliveira ◽

Guillaume J. Bauchet

Keyword(s):

Population Genetics ◽

Population Structure ◽

Manihot Esculenta ◽

Ex Situ ◽

Germplasm Bank ◽

Effective Population ◽

Manihot Esculenta Crantz ◽

Genetic Groups ◽

Population Structure Analysis ◽

Modern Breeding

Abstract Key message Brazilian cassava diversity was characterized through population genetics and clustering approaches, highlighting contrasted genetic groups and spatial genetic differentiation. Abstract Cassava (Manihot esculenta Crantz) is a major staple root crop of the tropics, originating from the Amazonian region. In this study, 3354 cassava landraces and modern breeding lines from the Embrapa Cassava Germplasm Bank (CGB) were characterized. All individuals were subjected to genotyping-by-sequencing (GBS), identifying 27,045 single-nucleotide polymorphisms (SNPs). Identity-by-state and population structure analyses revealed a unique set of 1536 individuals and 10 distinct genetic groups with heterogeneous linkage disequilibrium (LD). On this basis, a density of 1300–4700 SNP markers were selected for large-effect quantitative trait loci (QTL) detection. Identified genetic groups were further characterized for population genetics parameters including minor allele frequency (MAF), observed heterozygosity $$({H}_{o})$$ ( H o ) , effective population size estimate $$\widehat{{(N}_{e}}$$ ( N e ^ ) and polymorphism information content (PIC). Selection footprints and introgressions of M. glaziovii were detected. Spatial population structure analysis revealed five ancestral populations related to distinct Brazilian ecoregions. Estimation of historical relationships among identified populations suggests an early population split from Amazonian to Atlantic forest and Caatinga ecoregions and active gene flows. This study provides a thorough genetic characterization of ex situ germplasm resources from cassava’s center of origin, South America, with results shedding light on Brazilian cassava characteristics and its biogeographical landscape. These findings support and facilitate the use of genetic resources in modern breeding programs including implementation of association mapping and genomic selection strategies.

Download Full-text

Population Genomics of American Mink Using Whole Genome Sequencing Data

Genes ◽

10.3390/genes12020258 ◽

2021 ◽

Vol 12 (2) ◽

pp. 258

Author(s):

Karim Karimi ◽

Duy Ngoc Do ◽

Mehdi Sargolzaei ◽

Younes Miar

Keyword(s):

Population Genomics ◽

Association Studies ◽

American Mink ◽

Population History ◽

Whole Genome Sequencing Data ◽

Whole Genome ◽

Nucleotide Polymorphisms ◽

Sequencing Data ◽

Effective Population ◽

Cross Validation Error

Characterizing the genetic structure and population history can facilitate the development of genomic breeding strategies for the American mink. In this study, we used the whole genome sequences of 100 mink from the Canadian Centre for Fur Animal Research (CCFAR) at the Dalhousie Faculty of Agriculture (Truro, NS, Canada) and Millbank Fur Farm (Rockwood, ON, Canada) to investigate their population structure, genetic diversity and linkage disequilibrium (LD) patterns. Analysis of molecular variance (AMOVA) indicated that the variation among color-types was significant (p < 0.001) and accounted for 18% of the total variation. The admixture analysis revealed that assuming three ancestral populations (K = 3) provided the lowest cross-validation error (0.49). The effective population size (Ne) at five generations ago was estimated to be 99 and 50 for CCFAR and Millbank Fur Farm, respectively. The LD patterns revealed that the average r2 reduced to <0.2 at genomic distances of >20 kb and >100 kb in CCFAR and Millbank Fur Farm suggesting that the density of 120,000 and 24,000 single nucleotide polymorphisms (SNP) would provide the adequate accuracy of genomic evaluation in these populations, respectively. These results indicated that accounting for admixture is critical for designing the SNP panels for genotype-phenotype association studies of American mink.

Download Full-text

Techniques for the verification of minimal phylogenetic trees illustrated with ten mammalian haemoglobin sequences

Biochemical Journal ◽

10.1042/bj1870065 ◽

1980 ◽

Vol 187 (1) ◽

pp. 65-74 ◽

Cited By ~ 12

Author(s):

D Penny ◽

M D Hendy ◽

L R Foulds

Keyword(s):

Amino Acid ◽

Phylogenetic Tree ◽

Protein Sequence ◽

Phylogenetic Trees ◽

Sequence Data ◽

Protein Sequences ◽

Nucleotide Sequences ◽

Amino Acid Sequences ◽

Minimal Tree ◽

Protein Sequence Data

We have recently reported a method to identify the shortest possible phylogenetic tree for a set of protein sequences [Foulds Hendy & Penny (1979) J. Mol. Evol. 13. 127–150; Foulds, Penny & Hendy (1979) J. Mol. Evol. 13, 151–166]. The present paper discusses issues that arise during the construction of minimal phylogenetic trees from protein-sequence data. The conversion of the data from amino acid sequences into nucleotide sequences is shown to be advantageous. A new variation of a method for constructing a minimal tree is presented. Our previous methods have involved first constructing a tree and then either proving that it is minimal or transforming it into a minimal tree. The approach presented in the present paper progressively builds up a tree, taxon by taxon. We illustrate this approach by using it to construct a minimal tree for ten mammalian haemoglobin alpha-chain sequences. Finally we define a measure of the complexity of the data and illustrate a method to derive a directed phylogenetic tree from the minimal tree.

Download Full-text

Molecular markers applying in forest trees gene pool conservation

Glasnik Sumarskog fakulteta ◽

10.2298/gsf0999101m ◽

2009 ◽

pp. 101-113

Author(s):

Jelena Milovanovic ◽

Mirjana Sijacic-Nikolic

Keyword(s):

Population Genetics ◽

Molecular Markers ◽

Gene Flow ◽

Gene Pool ◽

Statistical Power ◽

Forest Trees ◽

Effective Population ◽

Migration Patterns ◽

Geographic Diversity ◽

Marker Loci

Many studies performed during the last years demonstrated the usefulness of neutral molecular markers in the field of conservation and population genetics of forest trees, in particular to understand the importance of migration patterns in shaping current genetic and geographic diversity and to measure important parameters such as effective population size, gene flow and past bottleneck. During the next years, a large amount of data at marker loci or at sequence level is expected to be collected, and to become excellent statistical power for the assessment of biological and evolutionary value.

Download Full-text

Can the Cambrian explosion be inferred through molecular phylogeny?

Development ◽

10.1242/dev.1994.supplement.15 ◽

1994 ◽

Vol 1994 (Supplement) ◽

pp. 15-25

Author(s):

Hervé Philippe ◽

Anne Chenuil ◽

André Adoutte

Keyword(s):

Phylogenetic Trees ◽

18S Rrna ◽

Phylogenetic Reconstruction ◽

Cambrian Explosion ◽

Bootstrap Support ◽

Time Interval ◽

Short Time Interval ◽

Data Set ◽

Molecular Phylogenetic ◽

Animal Phyla

Most of the major invertebrate phyla appear in the fossil record during a relatively short time interval, not exceeding 20 million years (Myr), 540-520 Myr ago. This rapid diversification is known as the `Cambrian explosion'. In the present paper, we ask whether molecular phylogenetic reconstruction provides confirmation for such an evolutionary burst. The expectation is that the molecular phylogenetic trees should take the form of a large unresolved multifurcation of the various animal lineages. Complete 18S rRNA sequences of 69 extant representatives of 15 animal phyla were obtained from data banks. After eliminating a major source of artefact leading to lack of resolution in phylogenetic trees (mutational saturation of sequences), we indeed observe that the major lines of triploblast coelomates (arthropods, molluscs, echinoderms, chordates...) are very poorly resolved i.e. the nodes defining the various clades are not supported by high bootstrap values. Using a previously developed procedure consisting of calculating bootstrap proportions of each node of the tree as a function of increasing amount of nucleotides (Lecointre, G., Philippe, H. Le, H. L. V. and Le Guyader, H. (1994) Mol. Phyl. Evol., in press) we obtain a more informative indication of the robustness of each node. In addition, this procedure allows us to estimate the number of additional nucleotides that would be required to resolve confidently the currently uncertain nodes; this number turns out to be extremely high and experimentally unfeasible. We then take this approach one step further: using parameters derived from the above analysis, assuming a molecular clock and using palaeontological dates for calibration, we establish a relationship between the number of sites contained in a given data set and the time interval that this data set can confidently resolve (with 95% bootstrap support). Under these assumptions, the presently available 18S rRNA database cannot confidently resolve cladogenetic events separated by less than about 40 Myr. Thus, at the present time, the potential resolution by the palaeontological approach is higher than that by the molecular one.

Download Full-text

Genomic reconstruction of the introduction and diversification of golden potato cyst nematode populations in Indonesia

Phytopathology ◽

10.1094/phyto-04-21-0150-r ◽

2021 ◽

Author(s):

Nurul Dwi Handayani ◽

Prabowo Lestari ◽

Wouter Van As ◽

Martijn Holterman ◽

Sven Van den Elsen ◽

...

Keyword(s):

Phylogenetic Trees ◽

18Th Century ◽

Gene Families ◽

Potato Production ◽

Innate Immune ◽

Sequencing Data ◽

Cyst Nematodes ◽

Selective Pressures ◽

Production Areas ◽

Venom Allergen

Potato cyst nematodes (PCN), the umbrella term for Globodera rostochiensis and G. pallida, co-evolved with their Solanaceous hosts in the Andeans. From there, PCN proliferated worldwide to virtually all potato production areas. PCN is a major factor limiting the potato production in Indonesia. In our survey, only G. rostochiensis was found. Fourteen field populations were collected on Java and Sumatra, and unique variants were called by mapping re-sequencing data on a G. rostochiensis reference genome. A phylogenetic tree based on 1.4 million unique variants showed a genotypic separation between the outgroup, a Scottish Ro1 population, and all Indonesian populations. This separation was comparable in size to the genotypic distinction between the Javanese and the Sumatran PCN populations. Next, variants within PCN effector gene families SPRYSEC, 1106, 4D06, and venom allergen-like protein (VAL) that all interfere with the host innate immune system were compared. Distinct selective pressures acted on these effector families; while SPRYSECs (4,341 SNPs/indels) behaved like neutral genes, the phylogenetic trees of 1106, 4D06 and VAL proteins (respectively 235, 790 and 150 SNPs/indels) showed deviating topologies. Our data suggest that PCN was introduced on Java not too long after the introduction of potato in the middle of the 18th century. Soon thereafter, the pathogen established on Sumatra, and started to diversify independently. This scenario was corroborated by diversification patterns of the effector families 1106, 4D06 and VAL. Our data demonstrate how genome re-sequencing data from a non-indigenous pathogen can be used to reconstruct the introduction and diversification process.

Download Full-text

Phylogenomics of orchids and their mycorrhizal fungi : trees, diversity, and the pursuit of symbiosis

10.32469/10355/72205 ◽

2019 ◽

Author(s):

◽

Sarah Unruh

Keyword(s):

Mycorrhizal Fungi ◽

Phylogenetic Trees ◽

High Throughput Sequencing ◽

Genomic Sequence ◽

Sequence Data ◽

Mycorrhizal Symbiosis ◽

Sequencing Data ◽

Phylogenetic Structure ◽

University Of Missouri ◽

Fungal Symbiosis

[ACCESS RESTRICTED TO THE UNIVERSITY OF MISSOURI AT REQUEST OF AUTHOR.] Phylogenetic trees show us how organisms are related and provide frameworks for studying and testing evolutionary hypotheses. To better understand the evolution of orchids and their mycorrhizal fungi, I used high-throughput sequencing data and bioinformatic analyses, to build phylogenetic hypotheses. In Chapter 2, I used transcriptome sequences to both build a phylogeny of the slipper orchid genera and to confirm the placement of a polyploidy event at the base of the orchid family. Polyploidy is hypothesized to be a strong driver of evolution and a source of unique traits so confirming this event leads us closer to explaining extant orchid diversity. The list of orthologous genes generated from this study will provide a less expensive and more powerful method for researchers examining the evolutionary relationships in Orchidaceae. In Chapter 3, I generated genomic sequence data for 32 fungal isolates that were collected from orchids across North America. I inferred the first multi-locus nuclear phylogenetic tree for these fungal clades. The phylogenetic structure of these fungi will improve the taxonomy of these clades by providing evidence for new species and for revising problematic species designations. A robust taxonomy is necessary for studying the role of fungi in the orchid mycorrhizal symbiosis. In chapter 4 I summarize my work and outline the future directions of my lab at Illinois College including addressing the remaining aims of my Community Sequencing Proposal with the Joint Genome Institute by analyzing the 15 fungal reference genomes I generated during my PhD. Together these chapters are the start of a life-long research project into the evolution and function of the orchid/fungal symbiosis.

Download Full-text

Evolution: medicine’s most basic science

Oxford Textbook of Medicine ◽

10.1093/med/9780199204854.003.020102_update_002 ◽

2010 ◽

pp. 12-15 ◽

Cited By ~ 4

Author(s):

Randolph M. Nesse ◽

Richard Dawkins

Keyword(s):

Population Genetics ◽

Basic Science ◽

Evolutionary Biology ◽

Phylogenetic Trees ◽

Evolutionary Methods

The role of evolutionary biology as a basic science for medicine has been expanding rapidly. Some evolutionary methods are already widely applied in medicine, such as population genetics and methods for analysing phylogenetic trees. Newer applications come from seeking evolutionary as well as proximate explanations for disease. ...

Download Full-text

Evolution: Medicine’s most basic science

Oxford Textbook of Medicine ◽

10.1093/med/9780198746690.003.0008 ◽

2020 ◽

pp. 39-42

Author(s):

Randolph M. Nesse ◽

Richard Dawkins

Keyword(s):

Population Genetics ◽

Natural Selection ◽

Basic Science ◽

Evolutionary Biology ◽

Phylogenetic Trees ◽

The Body ◽

Clinical Implications ◽

Trade Offs ◽

The Cost

The role of evolutionary biology as a basic science for medicine is expanding rapidly. Some evolutionary methods are already widely applied in medicine, such as population genetics and methods for analysing phylogenetic trees. Newer applications come from seeking evolutionary as well as proximate explanations for disease. Traditional medical research is restricted to proximate studies of the body’s mechanism, but separate evolutionary explanations are needed for why natural selection has left many aspects of the body vulnerable to disease. There are six main possibilities: mismatch, infection, constraints, trade-offs, reproduction at the cost of health, and adaptive defences. Like other basic sciences, evolutionary biology has limited direct clinical implications, but it provides essential research methods, encourages asking new questions that foster a deeper understanding of disease, and provides a framework that organizes the facts of medicine.

Download Full-text

Complete plastome sequences of Picea asperata and P. crassifolia and comparative analyses with P. abies and P. morrisonicola

Genome ◽

10.1139/gen-2018-0195 ◽

2019 ◽

Vol 62 (5) ◽

pp. 317-328

Author(s):

Fangqun Ouyang ◽

Jiwen Hu ◽

Junchen Wang ◽

Juanjuan Ling ◽

Zhi Wang ◽

...

Keyword(s):

Single Nucleotide Polymorphism ◽

Phylogenetic Trees ◽

Sympatric Species ◽

Sequencing Data ◽

Nucleotide Polymorphism ◽

Single Nucleotide ◽

Variable Regions ◽

Focal Species ◽

Phylogenetic Reconstructions ◽

Picea Asperata

Picea asperata and P. crassifolia have sympatric ranges and are closely related, but the differences between these species at the plastome level are unknown. To better understand the patterns of variation among Picea plastomes, the complete plastomes of P. asperata and P. crassifolia were sequenced. Then, the plastomes were compared with the complete plastomes of P. abies and P. morrisonicola, which are closely and distantly related to the focal species, respectively. We also used these sequences to construct phylogenetic trees to determine the relationships among and between the four species as well as additional taxa from Pinaceae and other gymnosperms. Analysis of our sequencing data allowed us to identify 438 single nucleotide polymorphism (SNPs) point mutation events, 95 indel events, four inversion events, and seven highly variable regions, including six gene spacer regions (psbJ-petA, trnT-psaM, trnS-trnD, trnL-rps4, psaC-ccsA, and rps7-trnL) and one gene (ycf1). The highly variable regions are appropriate targets for future use in the phylogenetic reconstructions of closely related, sympatric species of Picea as well as Pinaceae in general.

Download Full-text