Faculty Opinions recommendation of The Mosaic Ancestry of the Drosophila Genetic Reference Panel and the D. melanogaster Reference Genome Reveals a Network of Epistatic Fitness Interactions.

Author(s):  
Norman Johnson
2015 ◽  
Author(s):  
John E Pool

North American populations of Drosophila melanogaster are thought to derive from both European and African source populations, but despite their importance for genetic research, patterns of admixture along their genomes are essentially undocumented. Here, I infer geographic ancestry along genomes of the Drosophila Genetic Reference Panel (DGRP) and the D. melanogaster reference genome. Overall, the proportion of African ancestry was estimated to be 20% for the DGRP and 9% for the reference genome. Based on the size of admixture tracts and the approximate timing of admixture, I estimate that the DGRP population underwent roughly 13.9 generations per year. Notably, ancestry levels varied strikingly among genomic regions, with significantly less African introgression on the X chromosome, in regions of high recombination, and at genes involved in specific processes such as circadian rhythm. An important role for natural selection during the admixture process was further supported by a genome-wide signal of ancestry disequilibrium, in that many between-chromosome pairs of loci showed a deficiency of Africa-Europe allele combinations. These results support the hypothesis that admixture between partially genetically isolated Drosophila populations led to natural selection against incompatible genetic variants, and that this process is ongoing. The ancestry blocks inferred here may be relevant for the performance of reference alignment in this species, and may bolster the design and interpretation of many population genetic and association mapping studies.


2019 ◽  
Author(s):  
Logan J. Everett ◽  
Wen Huang ◽  
Shanshan Zhou ◽  
Mary Anna Carbone ◽  
Richard F. Lyman ◽  
...  

SummaryA major challenge in modern biology is to understand how naturally occurring variation in DNA sequences affects complex organismal traits through networks of intermediate molecular phenotypes. Here, we performed deep RNA sequencing of 200 Drosophila Genetic Reference Panel inbred lines with complete genome sequences, and mapped expression quantitative trait loci for annotated genes, novel transcribed regions (most of which are long noncoding RNAs), transposable elements and microbial species. We identified host variants that affect expression of transposable elements, independent of their copy number, as well as microbiome composition. We constructed sex-specific expression quantitative trait locus regulatory networks. These networks are enriched for novel transcribed regions and target genes in heterochromatin and euchromatic regions of reduced recombination, and genes regulating transposable element expression. This study provides new insights regarding the role of natural genetic variation in regulating gene expression and generates testable hypotheses for future functional analyses.


Genetics ◽  
2021 ◽  
Author(s):  
Jacinta Davis ◽  
Claire Da Silva Santos ◽  
Narda Caudillo Zavala ◽  
Nicholas Gans ◽  
Daniel Patracuolla ◽  
...  

Abstract Parkinson’s Disease (PD) is primarily characterized by the loss of dopaminergic (DA) neurons in the brain. However, little is known about why DA neurons are selectively vulnerable to PD. To identify genes that are associated with DA neuron loss, we screened through 201 wild-caught populations of Drosophila melanogaster as part of the Drosophila Genetic Reference Panel (DGRP). Here we identify the top associated genes containing SNPs that render DA neurons vulnerable. These genes were further analyzed by using mutant analysis and tissue-specific knockdown for functional validation. We found that this loss of DA neurons caused progressive locomotor dysfunction in mutants and gene knockdown analysis. The identification of genes associated with the progressive loss of DA neurons should help to uncover factors that render these neurons vulnerable in PD, and possibly develop strategies to make these neurons more resilient.


2020 ◽  
Author(s):  
Isis da Costa Hermisdorff ◽  
Raphael Bermal Costa ◽  
Lucia Galvão de Albuquerque ◽  
Hubert Pausch ◽  
Naveen Kumar Kadri

AbstractBackgroundImputation accuracy among other things depends on the size of the reference panel, the marker’s minor allele frequency (MAF), and the correct placement of variants on the reference genome assembly. Using high-density genotypes of 3938 Nellore cattle from Brazil, we investigated the accuracy of imputation from 50K to 777K SNP density, using map positions determined according to the bovine genome assemblies UMD3.1 and ARS-UCD1.2. We assessed the effect of reference and target panel sizes on the pre-phasing-based imputation quality using ten-fold cross-validation. Further, we compared the reliability of the model-based imputation quality score (Rsq) from Minimac3 to empirical imputation accuracy.ResultsThe overall accuracy of imputation measured as the squared correlation between true and imputed allele dosages (R2dose) was virtually identical using either the UMD3.1 or ARS-UCD1.2 genome assembly. When the size of the reference panel increased from 250 to 2000, R2dose increased from 0.845 to 0.917, and the number of polymorphic markers in the imputed data set increased from 586,701 to 618,660. Advantages in both accuracy and marker density were also observed when larger target panels were imputed, likely resulting from more accurate haplotype inference. Imputation accuracy and the marker density in the imputed data increased from 0.903 to 0.913 and from 593,239 to 595,570 when haplotypes were inferred in 500 and 2900 target animals, respectively. The model-based imputation quality scores from Minimac3 (Rsq) were highly correlated to but systematically higher than empirically estimated accuracies. The correlation between these metrics increased with the size of the reference panel and MAF of imputed variants.ConclusionsAccurate imputation of BovineHD BeadChip markers is possible in Nellore cattle using the new bovine reference genome assembly ARS-UCD1.2. The use of large reference and target panels improves the accuracy of the imputed genotypes and provides genotypes for more markers segregating at low frequency for downstream genomic analyses. The model-based imputation quality score from Minimac3 (Rsq) can be used to detect poorly imputed variants but its reliability depends on the size of the reference panel used and MAF of the imputed variants.


2020 ◽  
Vol 30 (3) ◽  
pp. 485-496 ◽  
Author(s):  
Logan J. Everett ◽  
Wen Huang ◽  
Shanshan Zhou ◽  
Mary Anna Carbone ◽  
Richard F. Lyman ◽  
...  

2020 ◽  
Vol 21 (1) ◽  
Author(s):  
Michael V. Frochaux ◽  
Maroun Bou Sleiman ◽  
Vincent Gardeux ◽  
Riccardo Dainese ◽  
Brian Hollis ◽  
...  

2019 ◽  
Vol 1 (12) ◽  
pp. 1226-1242 ◽  
Author(s):  
Roel P. J. Bevers ◽  
Maria Litovchenko ◽  
Adamandia Kapopoulou ◽  
Virginie S. Braman ◽  
Matthew R. Robinson ◽  
...  

BMC Genomics ◽  
2020 ◽  
Vol 21 (1) ◽  
Author(s):  
Isis da Costa Hermisdorff ◽  
Raphael Bermal Costa ◽  
Lucia Galvão de Albuquerque ◽  
Hubert Pausch ◽  
Naveen Kumar Kadri

Abstract Background Imputation accuracy among other things depends on the size of the reference panel, the marker’s minor allele frequency (MAF), and the correct placement of single nucleotide polymorphism (SNP) on the reference genome assembly. Using high-density genotypes of 3938 Nellore cattle from Brazil, we investigated the accuracy of imputation from 50 K to 777 K SNP density using Minimac3, when map positions were determined according to the bovine genome assemblies UMD3.1 and ARS-UCD1.2. We assessed the effect of reference and target panel sizes on the pre-phasing based imputation quality using ten-fold cross-validation. Further, we compared the reliability of the model-based imputation quality score (Rsq) from Minimac3 to the empirical imputation accuracy. Results The overall accuracy of imputation measured as the squared correlation between true and imputed allele dosages (R2dose) was almost identical using either the UMD3.1 or ARS-UCD1.2 genome assembly. When the size of the reference panel increased from 250 to 2000, R2dose increased from 0.845 to 0.917, and the number of polymorphic markers in the imputed data set increased from 586,701 to 618,660. Advantages in both accuracy and marker density were also observed when larger target panels were imputed, likely resulting from more accurate haplotype inference. Imputation accuracy increased from 0.903 to 0.913, and the marker density in the imputed data increased from 593,239 to 595,570 when haplotypes were inferred in 500 and 2900 target animals. The model-based imputation quality scores from Minimac3 (Rsq) were systematically higher than empirically estimated accuracies. However, both metrics were positively correlated and the correlation increased with the size of the reference panel and MAF of imputed variants. Conclusions Accurate imputation of BovineHD BeadChip markers is possible in Nellore cattle using the new bovine reference genome assembly ARS-UCD1.2. The use of large reference and target panels improves the accuracy of the imputed genotypes and provides genotypes for more markers segregating at low frequency for downstream genomic analyses. The model-based imputation quality score from Minimac3 (Rsq) can be used to detect poorly imputed variants but its reliability depends on the size of the reference panel and MAF of the imputed variants.


2020 ◽  
Vol 2 (4) ◽  
pp. 381-381
Author(s):  
Roel P. J. Bevers ◽  
Maria Litovchenko ◽  
Adamandia Kapopoulou ◽  
Virginie S. Braman ◽  
Matthew R. Robinson ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document