scholarly journals Parallel adaptation in autopolyploid Arabidopsis arenosa is dominated by repeated recruitment of shared alleles

2021 ◽  
Author(s):  
Veronika Konečná ◽  
Sian Bray ◽  
Jakub Vlček ◽  
Magdalena Bohutínská ◽  
Doubravka Požárová ◽  
...  

AbstractRelative contributions of pre-existing vs de novo genomic variation to adaptation are poorly understood, especially in polyploid organisms, which maintain increased variation. We assess this in high resolution using autotetraploid Arabidopsis arenosa, which repeatedly adapted to toxic serpentine soils that exhibit skewed elemental profiles. Leveraging a fivefold replicated serpentine invasion, we assess selection on SNPs and structural variants (TEs) in 78 resequenced individuals and discovered substantial parallelism in candidate genes involved in ion homeostasis. We further modelled parallel selection and inferred repeated sweeps on a shared pool of variants in nearly all these loci, supporting theoretical expectations. A single, striking exception is represented by TWO PORE CHANNEL 1, which exhibits convergent evolution from independent de novo mutations at an identical, otherwise conserved site at the calcium channel selectivity gate. Taken together, this suggests that polyploid populations can rapidly adapt to environmental extremes, calling on both pre-existing variation and novel polymorphisms.

2021 ◽  
Vol 12 (1) ◽  
Author(s):  
Veronika Konečná ◽  
Sian Bray ◽  
Jakub Vlček ◽  
Magdalena Bohutínská ◽  
Doubravka Požárová ◽  
...  

AbstractRelative contributions of pre-existing vs de novo genomic variation to adaptation are poorly understood, especially in polyploid organisms. We assess this in high resolution using autotetraploid Arabidopsis arenosa, which repeatedly adapted to toxic serpentine soils that exhibit skewed elemental profiles. Leveraging a fivefold replicated serpentine invasion, we assess selection on SNPs and structural variants (TEs) in 78 resequenced individuals and discover significant parallelism in candidate genes involved in ion homeostasis. We further model parallel selection and infer repeated sweeps on a shared pool of variants in nearly all these loci, supporting theoretical expectations. A single striking exception is represented by TWO PORE CHANNEL 1, which exhibits convergent evolution from independent de novo mutations at an identical, otherwise conserved site at the calcium channel selectivity gate. Taken together, this suggests that polyploid populations can rapidly adapt to environmental extremes, calling on both pre-existing variation and novel polymorphisms.


2018 ◽  
Author(s):  
Alba Sanchis-Juan ◽  
Jonathan Stephens ◽  
Courtney E French ◽  
Nicholas Gleadall ◽  
Karyn Mégy ◽  
...  

AbstractComplex structural variants (cxSVs) are genomic rearrangements comprising multiple structural variants, typically involving three or more breakpoint junctions. They contribute to human genomic variation and can cause Mendelian disease, however they are not typically considered during genetic testing. Here, we investigate the role of cxSVs in Mendelian disease using short-read whole genome sequencing (WGS) data from 1,324 individuals with neurodevelopmental or retinal disorders from the NIHR BioResource project. We present four cases of individuals with a cxSV affecting Mendelian disease-associated genes. Three of the cxSVs are pathogenic: a de novo duplication-inversion-inversion-deletion affecting ARID1B in an individual with Coffin-Siris syndrome, a deletion-inversion-duplication affecting HNRNPU in an individual with intellectual disability and seizures, and a homozygous deletion-inversion-deletion affecting CEP78 in an individual with cone-rod dystrophy. Additionally, we identified a de novo duplication-inversion-duplication overlapping CDKL5 in an individual with neonatal hypoxic-ischaemic encephalopathy. Long-read sequencing technology used to resolve the breakpoints demonstrated the presence of both a disrupted and an intact copy of CDKL5 on the same allele; therefore, it was classified as a variant of uncertain significance. Analysis of sequence flanking all breakpoint junctions in all the cxSVs revealed both microhomology and longer repetitive sequences, suggesting both replication and homology based processes. Accurate resolution of cxSVs is essential for clinical interpretation, and here we demonstrate that long-read WGS is a powerful technology by which to achieve this. Our results show cxSVs are an important although rare cause of Mendelian disease, and we therefore recommend their consideration during research and clinical investigations.


BMC Genomics ◽  
2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Surajit Bhattacharya ◽  
Hayk Barseghyan ◽  
Emmanuèle C. Délot ◽  
Eric Vilain

Abstract Background Whole genome sequencing is effective at identification of small variants, but because it is based on short reads, assessment of structural variants (SVs) is limited. The advent of Optical Genome Mapping (OGM), which utilizes long fluorescently labeled DNA molecules for de novo genome assembly and SV calling, has allowed for increased sensitivity and specificity in SV detection. However, compared to small variant annotation tools, OGM-based SV annotation software has seen little development, and currently available SV annotation tools do not provide sufficient information for determination of variant pathogenicity. Results We developed an R-based package, nanotatoR, which provides comprehensive annotation as a tool for SV classification. nanotatoR uses both external (DGV; DECIPHER; Bionano Genomics BNDB) and internal (user-defined) databases to estimate SV frequency. Human genome reference GRCh37/38-based BED files are used to annotate SVs with overlapping, upstream, and downstream genes. Overlap percentages and distances for nearest genes are calculated and can be used for filtration. A primary gene list is extracted from public databases based on the patient’s phenotype and used to filter genes overlapping SVs, providing the analyst with an easy way to prioritize variants. If available, expression of overlapping or nearby genes of interest is extracted (e.g. from an RNA-Seq dataset, allowing the user to assess the effects of SVs on the transcriptome). Most quality-control filtration parameters are customizable by the user. The output is given in an Excel file format, subdivided into multiple sheets based on SV type and inheritance pattern (INDELs, inversions, translocations, de novo, etc.). nanotatoR passed all quality and run time criteria of Bioconductor, where it was accepted in the April 2019 release. We evaluated nanotatoR’s annotation capabilities using publicly available reference datasets: the singleton sample NA12878, mapped with two types of enzyme labeling, and the NA24143 trio. nanotatoR was also able to accurately filter the known pathogenic variants in a cohort of patients with Duchenne Muscular Dystrophy for which we had previously demonstrated the diagnostic ability of OGM. Conclusions The extensive annotation enables users to rapidly identify potential pathogenic SVs, a critical step toward use of OGM in the clinical setting.


2019 ◽  
Author(s):  
Glenn Hickey ◽  
David Heller ◽  
Jean Monlong ◽  
Jonas A. Sibbesen ◽  
Jouni Sirén ◽  
...  

AbstractStructural variants (SVs) remain challenging to represent and study relative to point mutations despite their demonstrated importance. We show that variation graphs, as implemented in the vg toolkit, provide an effective means for leveraging SV catalogs for short-read SV genotyping experiments. We benchmarked vg against state-of-the-art SV genotypers using three sequence-resolved SV catalogs generated by recent long-read sequencing studies. In addition, we use assemblies from 12 yeast strains to show that graphs constructed directly from aligned de novo assemblies improve genotyping compared to graphs built from intermediate SV catalogs in the VCF format.


2021 ◽  
Vol 12 ◽  
Author(s):  
Wei Zhan ◽  
Manish Muhuri ◽  
Phillip W. L. Tai ◽  
Guangping Gao

Conventional vaccinations and immunotherapies have encountered major roadblocks in preventing infectious diseases like HIV, influenza, and malaria. These challenges are due to the high genomic variation and immunomodulatory mechanisms inherent to these diseases. Passive transfer of broadly neutralizing antibodies may offer partial protection, but these treatments require repeated dosing. Some recombinant viral vectors, such as those based on lentiviruses and adeno-associated viruses (AAVs), can confer long-term transgene expression in the host after a single dose. Particularly, recombinant (r)AAVs have emerged as favorable vectors, given their high in vivo transduction efficiency, proven clinical efficacy, and low immunogenicity profiles. Hence, rAAVs are being explored to deliver recombinant antibodies to confer immunity against infections or to diminish the severity of disease. When used as a vaccination vector for the delivery of antigens, rAAVs enable de novo synthesis of foreign proteins with the conformation and topology that resemble those of natural pathogens. However, technical hurdles like pre-existing immunity to the rAAV capsid and production of anti-drug antibodies can reduce the efficacy of rAAV-vectored immunotherapies. This review summarizes rAAV-based prophylactic and therapeutic strategies developed against infectious diseases that are currently being tested in pre-clinical and clinical studies. Technical challenges and potential solutions will also be discussed.


2020 ◽  
Author(s):  
Cheng Sun ◽  
Jiaxing Huang ◽  
Yun Wang ◽  
Xiaomeng Zhao ◽  
Long Su ◽  
...  

AbstractBumblebees are a diverse group of globally important pollinators in natural ecosystems and for agricultural food production. With both eusocial and solitary lifecycle phases, and some social parasite species, they are especially interesting models to understand social evolution, behavior, and ecology. Reports of many species in decline point to pathogen transmission, habitat loss, pesticide usage, and global climate change, as interconnected causes. These threats to bumblebee diversity make our reliance on a handful of well-studied species for agricultural pollination particularly precarious. To broadly sample bumblebee genomic and phenotypic diversity, we de novo sequenced and assembled the genomes of 17 species, representing all 15 subgenera, producing the first genus-wide quantification of genetic and genomic variation potentially underlying key ecological and behavioral traits. The species phylogeny resolves subgenera relationships while incomplete lineage sorting likely drives high levels of gene tree discordance. Five chromosome-level assemblies show a stable 18-chromosome karyotype, with major rearrangements creating 25 chromosomes in social parasites. Differential transposable element activity drives changes in genome sizes, with putative domestications of repetitive sequences influencing gene coding and regulatory potential. Dynamically evolving gene families and signatures of positive selection point to genus-wide variation in processes linked to foraging, diet and metabolism, immunity and detoxification, as well as adaptations for life at high altitudes. These high-quality genomic resources capture natural genetic and phenotypic variation across bumblebees, offering new opportunities to advance our understanding of their remarkable ecological success and to identify and manage current and future threats.


2016 ◽  
Author(s):  
Shaun D Jackman ◽  
Benjamin P Vandervalk ◽  
Hamid Mohamadi ◽  
Justin Chu ◽  
Sarah Yeo ◽  
...  

AbstractThe assembly of DNA sequences de novo is fundamental to genomics research. It is the first of many steps towards elucidating and characterizing whole genomes. Downstream applications, including analysis of genomic variation between species, between or within individuals critically depends on robustly assembled sequences. In the span of a single decade, the sequence throughput of leading DNA sequencing instruments has increased drastically, and coupled with established and planned large-scale, personalized medicine initiatives to sequence genomes in the thousands and even millions, the development of efficient, scalable and accurate bioinformatics tools for producing high-quality reference draft genomes is timely.With ABySS 1.0, we originally showed that assembling the human genome using short 50 bp sequencing reads was possible by aggregating the half terabyte of compute memory needed over several computers using a standardized message-passing system (MPI). We present here its re-design, which departs from MPI and instead implements algorithms that employ a Bloom filter, a probabilistic data structure, to represent a de Bruijn graph and reduce memory requirements.We present assembly benchmarks of human Genome in a Bottle 250 bp Illumina paired-end and 6 kbp mate-pair libraries from a single individual, yielding a NG50 (NGA50) scaffold contiguity of 3.5 (3.0) Mbp using less than 35 GB of RAM, a modest memory requirement by today’s standard that is often available on a single computer. We also investigate the use of BioNano Genomics and 10x Genomics’ Chromium data to further improve the scaffold contiguity of this assembly to 42 (15) Mbp.


2017 ◽  
Author(s):  
Mircea Cretu Stancu ◽  
Markus J. van Roosmalen ◽  
Ivo Renkens ◽  
Marleen Nieboer ◽  
Sjors Middelkamp ◽  
...  

AbstractStructural genomic variants form a common type of genetic alteration underlying human genetic disease and phenotypic variation. Despite major improvements in genome sequencing technology and data analysis, the detection of structural variants still poses challenges, particularly when variants are of high complexity. Emerging long-read single-molecule sequencing technologies provide new opportunities for detection of structural variants. Here, we demonstrate sequencing of the genomes of two patients with congenital abnormalities using the ONT MinION at 11x and 16x mean coverage, respectively. We developed a bioinformatic pipeline - NanoSV - to efficiently map genomic structural variants (SVs) from the long-read data. We demonstrate that the nanopore data are superior to corresponding short-read data with regard to detection of de novo rearrangements originating from complex chromothripsis events in the patients. Additionally, genome-wide surveillance of SVs, revealed 3,253 (33%) novel variants that were missed in short-read data of the same sample, the majority of which are duplications < 200bp in size. Long sequencing reads enabled efficient phasing of genetic variations, allowing the construction of genome-wide maps of phased SVs and SNVs. We employed read-based phasing to show that all de novo chromothripsis breakpoints occurred on paternal chromosomes and we resolved the long-range structure of the chromothripsis. This work demonstrates the value of long-read sequencing for screening whole genomes of patients for complex structural variants.


2019 ◽  
Author(s):  
Michael A. Martin ◽  
Drishti Kaul ◽  
Gene S. Tan ◽  
Christopher W. Woods ◽  
Katia Koelle

AbstractThe rapid evolution of influenza is an important contributing factor to its high worldwide incidence. The emergence and spread of genetic point mutations has been thoroughly studied both within populations and within individual hosts. In addition, influenza viruses are also known to generate genomic variation during their replication in the form of defective viral genomes (DVGs). These DVGs are formed by internal deletions in at least one gene segment that render them incapable of replication without the presence of wild-type virus. DVGs have previously been identified in natural human infections and may be associated with less severe clinical outcomes. These studies have not been able to address how DVG populations evolve in vivo in individual infections due to their cross-sectional design. Here we present an analysis of DVGs present in samples from two longitudinal influenza A H3N2 human challenge studies. We observe the generation of DVGs in almost all subjects. Although the genetic composition of DVG populations was highly variable, identical DVGs were observed both between multiple samples within single hosts as well as between hosts. Most likely due to stochastic effects, we did not observe clear instances of selection for specific DVGs or for shorter DVGs over the course of infection. Furthermore, DVG presence was not found to be associated with peak viral titer or peak symptom scores. Our analyses highlight the diversity of DVG populations within a host over the course of infection and the apparent role that genetic drift plays in their population dynamics.ImportanceThe evolution of influenza virus, in terms of single nucleotide variants and the reassortment of gene segments, has been studied in detail. However, influenza is known to generate defective viral genomes (DVGs) during replication, and little is known about how these genomes evolve both within hosts and at the population level. Studies in animal models have indicated that prophylactically or therapeutically administered DVGs can impact patterns of disease progression. However, the formation of naturally-occurring DVGs, their evolutionary dynamics, and their contribution to disease severity in human hosts is not well understood. Here, we identify the formation of de novo DVGs in samples from human challenge studies throughout the course of infection. We analyze their evolutionary trajectories, revealing the important role of genetic drift in shaping DVG populations during acute infections with well-adapted viral strains.


2019 ◽  
Vol 7 (2) ◽  
pp. 391-402 ◽  
Author(s):  
Yaoxi He ◽  
Haiyi Lou ◽  
Chaoying Cui ◽  
Lian Deng ◽  
Yang Gao ◽  
...  

Abstract Structural variants (SVs) may play important roles in human adaptation to extreme environments such as high altitude but have been under-investigated. Here, combining long-read sequencing with multiple scaffolding techniques, we assembled a high-quality Tibetan genome (ZF1), with a contig N50 length of 24.57 mega-base pairs (Mb) and a scaffold N50 length of 58.80 Mb. The ZF1 assembly filled 80 remaining N-gaps (0.25 Mb in total length) in the reference human genome (GRCh38). Markedly, we detected 17 900 SVs, among which the ZF1-specific SVs are enriched in GTPase activity that is required for activation of the hypoxic pathway. Further population analysis uncovered a 163-bp intronic deletion in the MKL1 gene showing large divergence between highland Tibetans and lowland Han Chinese. This deletion is significantly associated with lower systolic pulmonary arterial pressure, one of the key adaptive physiological traits in Tibetans. Moreover, with the use of the high-quality de novo assembly, we observed a much higher rate of genome-wide archaic hominid (Altai Neanderthal and Denisovan) shared non-reference sequences in ZF1 (1.32%–1.53%) compared to other East Asian genomes (0.70%–0.98%), reflecting a unique genomic composition of Tibetans. One such archaic hominid shared sequence—a 662-bp intronic insertion in the SCUBE2 gene—is enriched and associated with better lung function (the FEV1/FVC ratio) in Tibetans. Collectively, we generated the first high-resolution Tibetan reference genome, and the identified SVs may serve as valuable resources for future evolutionary and medical studies.


Sign in / Sign up

Export Citation Format

Share Document