A Primer of Molecular Population Genetics

Author(s):  
Asher D. Cutter

The study of molecular population genetics seeks to understand the micro-evolutionary principles underlying DNA sequence variation and change. It addresses such questions as: Why do individuals differ as much as they do in their DNA sequences? What are the genomic signatures of adaptations? How often does natural selection dictate changes to DNA and accumulate as differences between species? How does the ebb and flow in the abundance of individuals over time get marked onto chromosomes to record genetic history? The concepts used to answer such questions also apply to analysis of personal genomics, genome-wide association studies, phylogenetics, landscape and conservation genetics, forensics, molecular anthropology, and selection scans. This Primer of Molecular Population Genetics introduces the bare essentials of the theory and practice of evolutionary analysis through the lens of DNA sequence change in populations. Intended as an introductory text for upper-level undergraduates and junior graduate students, this Primer also provides an accessible entryway for scientists from other areas of biology to appreciate the ideas and practice of molecular population genetics. With the revolutionary advances in genomic data acquisition, understanding molecular population genetics is now a fundamental requirement for today’s life scientists.

2018 ◽  
Vol 2018 ◽  
pp. 1-14 ◽  
Author(s):  
Afifah Binti Azam ◽  
Elena Aisha Binti Azizan

Primary hypertension is widely believed to be a complex polygenic disorder with the manifestation influenced by the interactions of genomic and environmental factors making identification of susceptibility genes a major challenge. With major advancement in high-throughput genotyping technology, genome-wide association study (GWAS) has become a powerful tool for researchers studying genetically complex diseases. GWASs work through revealing links between DNA sequence variation and a disease or trait with biomedical importance. The human genome is a very long DNA sequence which consists of billions of nucleotides arranged in a unique way. A single base-pair change in the DNA sequence is known as a single nucleotide polymorphism (SNP). With the help of modern genotyping techniques such as chip-based genotyping arrays, thousands of SNPs can be genotyped easily. Large-scale GWASs, in which more than half a million of common SNPs are genotyped and analyzed for disease association in hundreds of thousands of cases and controls, have been broadly successful in identifying SNPs associated with heart diseases, diabetes, autoimmune diseases, and psychiatric disorders. It is however still debatable whether GWAS is the best approach for hypertension. The following is a brief overview on the outcomes of a decade of GWASs on primary hypertension.


2020 ◽  
Vol 79 (Suppl 1) ◽  
pp. 226.1-227
Author(s):  
M. Zervou ◽  
A. Andreou ◽  
C. Matalliotaki ◽  
T. Niewold ◽  
G. Goulielmos ◽  
...  

Background:Genome-wide association studies (GWAS) have identified hundreds of autoimmune diseases-associated loci so far but much of the heritability of these diseases remains unknown. In an attempt to identify potential causal variants, various studies revealed that the missense variant rs35677470 atDNASE1L3is associated with the development of systemic lupus erythematosus (SLE), rheumatoid arthritis (RA) and systemic sclerosis (SSc), thus exhibiting a pleiotropic effect. Deoxyribonuclease I-like 3 (DNase1L3) is a member of human DNase I family, representing a nuclease that cleaves double-stranded DNA during apoptosis and is involved in the development of autoimmune diseases [1].Objectives:To investigate the role of the rs35677470 polymorphism atDNASE1L3gene leading to the R206C mutation in SLE, RA and SSc [2-3] and the mechanism that may affect the loss of function in the protein structure.Methods:TheDNASE1L3evolution was investigated to define conservation elements in the protein sequence using, BLASTP extended searches [4], TCOFFEE [5] multiple sequence alignments, and MEGAX [6] for phylogenetics analysis. Three-dimensional (3D) homology modeling was used to localize the polymorphism under study. The mutant was constructed by molecular modeling using the structures of homologous DNAses (PDB entries 1atn, 4awn, 3d3w; [7-9]). Molecular mechanics/dynamics studies were applied to validate structural/functional changes caused by the R206C substitution. All figures depicting 3D models were generated using the PyMOL molecular-graphics system V.2.2 (Schrodinger, LLC).Results:The evolutionary analysis shows heavily conserved sequence elements among species indicating structural/functional importance. Structural analysis revealed that the rs35677470 SNP codes for a nonconservative amino acid variation, R206C, disrupts the conserved electrostatic network holding protein secondary structure elements to place. Specifically, the R206 to E170 interaction, part of a salt bridge network stabilizing two a-helices, is being interrupted, thereby affecting the molecular architecture (Fig. 1). Indeed, previous studies on the effect of this SNP in Caucasian populations resulting in a lower level of DNAse1L3 activity are consistent with this observation [10].Figure 1.Ribbon representation of the DNAse1L3 homology model showing the position of the stabilizing salt bridge network (E170- R206, R208- D219). Insert figure shows the R206C mutation. Positively charged R (in blue), negatively charged D,E (in red) and C (in yellow) are shown. Distances are in Angstroms.Conclusion:This study represents a comprehensive evaluation of the shared autoimmune loci ofDNASE1L3(rs35677470), reported to produce an inactive form of DNaseIL3 [10]. The structural analysis, explains the potential role of the produced mutation by modifying the placement of structural elements and consequently introducing disorder in the protein folding and affecting biological function. Altogether, this study contributes to the delineation of the genetic architecture of SLE, RA and SSc.References:[1]Sisirak V et al (2016). Cell 166:88–101[2]Westra HJ et al (2018). Nat Genet. 50:1366-74[3]Acosta-Herrera M et al. (2018). Ann Rheum Dis. 78:311-19[4]Altschul SF et al (1997) Nucleic Acids Res. 25:3389-402.[5]Notredame et al (2000) JMB, 302:205-17[6]Kumar S et al (2018) Mol. Biol. Evol. 35:1547-9[7]Kabsch W et al (1990) Nature 347: 37-44[8]Parciegla et al (2012) Biochemistry 51: 10250[9]Sasaki K et al (1993) Acta Cryst., A 49: 111-2[10]Ueki et al (2009) Clinica Chim. Acta 407:20–4Disclosure of Interests:None declared


2018 ◽  
Author(s):  
Venice Juanillas ◽  
Alexis Dereeper ◽  
Nicolas Beaume ◽  
Gaetan Droc ◽  
Joshua Dizon ◽  
...  

AbstractBackgroundRice molecular genetics, breeding, genetic diversity, and allied research (such as rice-pathogen interaction) have adopted sequencing technologies and high density genotyping platforms for genome variation analysis and gene discovery. Germplasm collections representing rice diversity, improved varieties and elite breeding materials are accessible through rice gene banks for use in research and breeding, with many having genome sequences and high density genotype data available. Combining phenotypic and genotypic information on these accessions enables genome-wide association analysis, which is driving quantitative trait loci (QTL) discovery and molecular marker development. Comparative sequence analyses across QTL regions facilitate the discovery of novel alleles. Analyses involving DNA sequences and large genotyping matrices for thousands of samples, however, pose a challenge to non-computer savvy rice researchers.FindingsWe adopted the Galaxy framework to build the federated Rice Galaxy resource, with shared datasets, tools, and analysis workflows relevant to rice research. The shared datasets include high density genotypes from the 3,000 Rice Genomes project and sequences with corresponding annotations from nine published rice genomes. Rice Galaxy includes tools for designing single nucleotide polymorphism (SNP) assays, analyzing genome-wide association studies, population diversity, rice-bacterial pathogen diagnostics, and a suite of published genomic prediction methods. A prototype Rice Galaxy compliant to Open Access, Open Data, and Findable, Accessible, Interoperable, and Reproducible principles is also presented.ConclusionsRice Galaxy is a freely available resource that empowers the plant research community to perform state-of-the-art analyses and utilize publicly available big datasets for both fundamental and applied science.


Stroke ◽  
2017 ◽  
Vol 48 (suppl_1) ◽  
Author(s):  
Guido J Falcone ◽  
Chia-Ling Phuah ◽  
Farid Radmanesh ◽  
Gina M Peloso ◽  
James F Meschia ◽  
...  

Introduction: In observational studies, higher plasma high-density lipoprotein cholesterol (HDL-C) has been associated with increased risk of spontaneous intracerebral hemorrhage (ICH). Common DNA sequence variants within the cholesteryl ester transfer protein ( CETP ) gene decrease CETP protein activity and increase plasma HDL-C; as such, medicines that inhibit CETP and raise HDL-C are in clinical development to combat coronary artery disease. Hypothesis: Common CETP DNA sequence variants associated with higher HDL-C also increase risk for ICH. Methods: We performed a two-stage case-control genetic association study in Caucasians. The discovery phase utilized data on 12 independent loci within CETP (+/- 50 kilobases) from 3 genome-wide association studies of ICH. Replication involved direct genotyping in 5 additional studies. We also constructed a genetic risk score with 7 independent CETP variants and tested it for association with HDL-C and ICH risk. We used principal component analysis to account for population structure and a Bonferroni-adjusted p<0.004 (12 tests) to declare statistical significance. Results: The discovery phase included 1149 ICH cases (43% lobar hemorrhages) and 1238 controls. Twelve variants were nominally associated (p<0.05) with ICH, with the strongest association at the rs173539 locus (Figure 1: OR 1.25, 95%CI 1.11-1.41; p=6.0x10 -4 ) and no heterogeneity across studies (I 2 =0%). This association was replicated in 1625 cases (43% lobar hemorrhages) and 1845 controls (OR 1.12, 95%CI 1.02-1.24; p=0.03). A genetic score of independent CETP variants known to increase HDL-C by ~2.85 mg/dL was strongly associated with ICH risk (OR 1.86, 95%CI 1.44-2.40; p=1.4x10 -6 ). Conclusion: Genetic variants in CETP associated with increased HDL-C raise the risk of ICH. Given ongoing therapeutic development in CETP inhibition and other HDL-raising strategies, further exploration of potential adverse cerebrovascular outcomes is warranted.


2021 ◽  
Vol 80 (Suppl 1) ◽  
pp. 1037.1-1037
Author(s):  
M. Zervou ◽  
A. Andreou ◽  
G. Goulielmos ◽  
E. Eliopoulos

Background:Genome wide association studies (GWAS) have successfully identified novel autoimmune disease-associated loci, with many of them shared by multiple disease-associated pathways but much of the genetics and pathophysiological mechanisms remain still obscure. Considering that most of the potential causal variants are still unknown, many studies showed that the missense variant rs35667974 at interferon induced with helicase C domain 1 (IFIH1) gene is protective for type 1 diabetes (T1D), psoriasis (PS) and psoriatic arthritis (PsA), while it was found to be also associated with ankylosing spondylitis (AS), Crohn’s disease (CD) and ulcerative colitis (UC). IFIH1 gene encodes a cytoplasmic RNA helicase that recognizes viral RNA and is involved in innate immunity through recognition of viral RNA [1].Objectives:To investigate the role of the rare rs35667974 variant of IFIH1 gene, which resides in exon 14 and changes a conserved isoleucine at position #923 to valine in T1D, PS, PsA, AS, CD and UC [2-4] as well as the mechanism that may affect the function in the protein structure.Methods:Evolutionary analysis revealed heavily conserved sequence elements among species, indicating structural/functional importance of the mutation at position #923. In silico mutagenesis and three-dimensional (3D) homology modeling was used to localize the polymorphism under study on the IFIH1 protein. The mutant was constructed using molecular modeling with the program Maestro (Schrodinger, LLC) [5]. Molecular mechanics/dynamics studies were applied to validate structural/functional changes caused by the Ile923V substitution. All figures depicting 3D models were generated using the PyMOL molecular-graphics system V.2.2 [6].Results:Evolutionary and structural analysis revealed that the position of residue Ile923 is located on a protein loop (921-927) directly interacting with mRNA both to the phosphoribose chain and the base pairs. Mutation of Ile to Val at position #923 will directly affect the said interaction with mRNA [7] (Figure 1). Moreover, it has been reported that pre-mRNA or mRNA levels did not correlate with Ile923Val, suggesting that, Ile923Val did not alter the expression of IFIH1 in a major way [7].Figure 1.Structural representation of the interferon-induced helicase C domain-containing protein 1 [Homo sapiens] structure (4GL2 from Protein Data Bank) (in green) bound to the double stranded RNA (in orange-yellow). The location of the mutation (I923V) is highlighted in pink. The proximity of the aminoacid 923 to the nucleotide is apparent.Conclusion:This study represents a comprehensive evaluation of the role of the shared rs35667974 variant of autoimmune locus IFIH1, reported to lead to a loss of function phenotype and, as a consequence, reduced levels of IFIH1 protein and activity that protect against autoimmunity. Structural analysis of rare shared genetic susceptibility or protection loci may provide insight to our understanding of the pathophysiology of autoimmune diseases and the research findings may affect the better management of the diseases under study.References:[1]Nejentsev S et al. (2009). Science 324:387–389.[2]Smyth DJ et al. (2006). Nat Genet. 38: 617–619.[3]Li Y et al. (2010). J Invest Dermatol 130:2768–2772.[4]Ellinghaus D et al. (2016). Nat Genet 48:510–518.[5]Schrödinger Release 2017-1: Maestro, Schrödinger, LLC, New York, NY, 2017.[6]Schrödinger LLC: The PyMOL Molecular Graphics System 2016 version 2.2.[7]Wu B et al. (2013). Cell 152:276-289[8] Downes K et al. (2010). PLoS One 5:e12646.Disclosure of Interests:None declared


1982 ◽  
Vol 39 (1) ◽  
pp. 63-77 ◽  
Author(s):  
Naoyuki Takahata

SummaryA general model of linked genes or a part of a genome is proposed which enables us to study various problems in molecular population genetics in a unified way. Several formulae with special reference to the linkage disequilibrium and genetic distance are derived for neutral mutations in finite populations, based on the method of diffusion equations. It is argued that the model and formulae are useful particularly when observations are made in terms of DNA sequence.


Neurogenetics ◽  
2021 ◽  
Author(s):  
Magdalena Krygier ◽  
Maria Mazurkiewicz-Bełdzińska

AbstractCerebellar ataxias (CAs) comprise a group of rare, neurological disorders characterized by extensive phenotypic and genetic heterogeneity. The core clinical feature is the cerebellar syndrome, which is often accompanied by other neurological or non-neurological signs. In the last 30 years, our understanding of the CA etiology has increased significantly, and numerous ataxia-associated genes have been discovered. Conventional variants or tandem repeat expansions, localized in the coding or non-coding DNA sequences, lead to hereditary ataxia, which can display different patterns of inheritance. Advances in molecular techniques have enabled a rapid and cost-effective detection of causative variants in a significant number of CA patients. However, despite performing extensive investigations, a definite diagnosis is still unknown in the majority of affected individuals. In this review, we discuss the major advances in the genetics of CAs over the last 30 years, focusing on the impact of next-generation sequencing on the genetic landscape of childhood- and adult-onset CAs. Additionally, we outline possible directions for further genetic research in hereditary and sporadic CAs in the era of increasing application of whole-genome sequencing and genome-wide association studies in various neurological disorders.


Sign in / Sign up

Export Citation Format

Share Document