A gap-free tomato genome built from complementary PacBio and Nanopore long DNA sequences reveals extensive linkage drag during breeding

2021 ◽  
Author(s):  
Willem M. J. van Rengs ◽  
Maximilian H.-W. Schmidt ◽  
Sieglinde Effgen ◽  
Yazhong Wang ◽  
Mohd Waznul Adly Mohd Zaidan ◽  
...  

The assembly and scaffolding of plant crop genomes facilitates the characterization of genetically diverse cultivated and wild germplasm. The cultivated tomato has been improved through the introgression of genetic material from related wild species, including resistance to pandemic strains of Tobacco Mosaic virus (TMV) from Solanum peruvianum. Here we applied PacBio HiFi and ONT nanopore sequencing to develop independent, highly contiguous and complementary assemblies of an inbred TMV-resistant tomato variety. We merged the HiFi and ONT assemblies to generate a long-read-only assembly where all twelve chromosomes were represented as twelve contiguous sequences (N50=68.5 Mbp). The merged assembly was validated by chromosome conformation capture data and is highly consistent with previous tomato assemblies that made use of genetic maps and HiC for scaffolding. Our long-read-only assembly reveals that a complex series of structural variants linked to the TMV resistance gene likely contributed to linkage drag of a 64.1 Mbp region of the S. peruvianum genome during tomato breeding. We show that this minimal introgression region is present in six cultivated tomato hybrid varieties developed in three commercial breeding programs. Our results suggest that complementary long read technologies can facilitate the rapid generation of near complete genome sequences.
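The N50 statistic quoted above can be illustrated with a minimal sketch. It assumes the common definition: the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the total assembly length. The function name `n50` and the toy lengths are hypothetical and are not taken from the study.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    Assumes the usual definition: sort lengths from longest to shortest and
    return the length at which the running sum first reaches half the total.
    """
    if not lengths:
        raise ValueError("need at least one sequence length")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        # Compare 2 * running with total to avoid fractional halves.
        if 2 * running >= total:
            return length


# Hypothetical contig lengths (arbitrary units), not data from the paper.
toy_lengths = [80, 70, 50, 40, 30, 20]
print(n50(toy_lengths))  # prints 70
```

For an assembly in which every chromosome is a single contiguous sequence, as described above, the N50 is simply the length of one of the chromosomes.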

Genetics ◽  
2000 ◽  
Vol 154 (1) ◽  
pp. 323-332
Author(s):  
David Waddington ◽  
Anthea J Springbett ◽  
David W Burt

Comparative genetic maps of two species allow insights into the rearrangements of their genomes since divergence from a common ancestor. When the map details the positions of genes (or any set of orthologous DNA sequences) on chromosomes, syntenic blocks of one or more genes may be identified and used, with appropriate models, to estimate the number of chromosomal segments whose content is conserved between species. We propose a model for the distribution of the lengths of unobserved segments on each chromosome that allows for widely differing chromosome lengths. The model uses as data either the counts of genes in a syntenic block or the distance between extreme members of a block, or both. The parameters of the proposed segment length distribution, estimated by maximum likelihood, give predictions of the number of conserved segments per chromosome. The model is applied to data from two comparative maps for the chicken, one with human and one with mouse.


2021 ◽  
Author(s):  
Stefano Meucci ◽  
Luise Schulte ◽  
Kathleen R. Stoof-Leichsenring ◽  
Stefan Kruse ◽  
Konstantin Krutovsky ◽  
...  

Siberian larch forests dominate large areas of northern Russia and play important roles in the world's ecosystems. To understand the past dynamics of larches and their adaptive genetic variation, sedimentary ancient DNA (sedaDNA) extracted from lake sediment cores is a crucial source of genetic material. Retrieving extremely rare DNA sequences from samples up to 25,000 years old is challenging. Previous studies (Schulte et al.) showed that hybridization capture enriches targeted sequences by several orders of magnitude compared with shotgun sequencing. We therefore established, for the first time, a hybridization capture method targeting 65 candidate adaptive genes in the Larix nuclear genome. Our preliminary results show that the newly established method can enrich extremely rare DNA sequences of the targeted Larix candidate adaptive genes that were not retrieved by shotgun sequencing of the same samples. Furthermore, the results allowed us to detect and compare specific nucleotide polymorphisms in candidate adaptive genes among sedaDNA samples distributed in space and time. This new method lays the basis for investigating possible adaptive variation that larch species acquired during the dry and cold conditions of the Last Glacial Maximum (LGM), as well as its possible advantages or disadvantages under current environmental change toward dry and warm conditions.


2018 ◽  
Author(s):  
Kiran Lalwani ◽  
Shivani Sheth ◽  
Inayatullah Sheikh ◽  
Afzal Ansari ◽  
Fulesh Kunwar ◽  
...  

Chromosomal translocations involve the exchange of genetic material between non-homologous chromosomes, leading to the formation of a fusion gene with altered function. The clinical consequences of non-random and recurrent chromosomal translocations are so well understood in carcinogenesis that they serve as diagnostic and prognostic markers and also help in therapy decisions, mainly in leukemia and lymphoma. However, the molecular mechanisms underlying these recurrent genetic exchanges are yet to be understood. Approaches employed so far include examining the proximity of the partner chromosomes in the nucleus, the DNA sequences at the breakpoints, etc. The present study addresses the stability of DNA sequences at the breakpoint regions using an in silico approach in terms of physicochemical properties such as AT%, flexibility, melting temperature, enthalpy, entropy, stacking energy and free energy. Changes in these properties may lead to instability of DNA, which could affect gene expression in particular and genome organization in general. Our study indicates that the fusion sequences are comparatively more unstable and hence more prone to breakage. The current study, along with others, could lead to the development of a model for predicting breakage-prone genomic regions using this novel in silico approach.
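Two of the simpler properties listed above, AT% and melting temperature, can be sketched as follows. This is an illustration only: the abstract does not state which formulas or tools the authors used. The melting temperature here uses the Wallace rule (2 °C per A/T and 4 °C per G/C), a rough approximation intended for short oligonucleotides, and the function names `at_percent`, `wallace_tm` and `windowed_at` are hypothetical.

```python
def at_percent(seq):
    """Percentage of A and T bases in a DNA sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("A") + seq.count("T")) / len(seq)


def wallace_tm(seq):
    """Approximate melting temperature in degrees Celsius (Wallace rule).

    Assumption: a short oligonucleotide; the rule is not meant for long
    genomic regions.
    """
    seq = seq.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


def windowed_at(seq, window, step=1):
    """AT% in sliding windows, e.g. to scan the flanks of a breakpoint."""
    return [
        at_percent(seq[i:i + window])
        for i in range(0, len(seq) - window + 1, step)
    ]


# Hypothetical toy sequence, not data from the study.
print(at_percent("AATTGGCC"))          # prints 50.0
print(wallace_tm("AATTGGCC"))          # prints 24
print(windowed_at("AAAAGGGG", 4, 4))   # prints [100.0, 0.0]
```

A windowed profile like this makes it possible to compare a breakpoint region with its flanking sequence; the remaining properties (flexibility, enthalpy, entropy, stacking and free energy) need nearest-neighbour parameter tables and are not shown.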


2021 ◽  
Vol 10 (41) ◽  
Author(s):  
W. E. Moore ◽  
G. K. K. Lai ◽  
S. D. J. Griffin ◽  
F. C. C. Leung

Kosakonia cowanii is a Gram-negative, motile, facultative anaerobic enterobacterium that is found in soil, water, and sewage. K. cowanii SMBL-WEM22 is a halotolerant strain that was isolated from seawater in Hong Kong. The complete genome of SMBL-WEM22 (5,037,617 bp, with a GC content of 55.02%) was determined by hybrid assembly of short- and long-read DNA sequences.


Author(s):  
Theodore L. Roth ◽  
Alexander Marson

Genetic diseases cause numerous complex and intractable pathologies. DNA sequences encoding each human's complexity and many disease risks are contained in the mitochondrial genome, nuclear genome, and microbial metagenome. Diagnosis of these diseases has unified around applications of next-generation DNA sequencing. However, translating specific genetic diagnoses into targeted genetic therapies remains a central goal. To date, genetic therapies have fallen into three broad categories: bulk replacement of affected genetic compartments with a new exogenous genome, nontargeted addition of exogenous genetic material to compensate for genetic errors, and most recently, direct correction of causative genetic alterations using gene editing. Generalized methods of diagnosis, therapy, and reagent delivery into each genetic compartment will accelerate the next generations of curative genetic therapies. We discuss the structure and variability of the mitochondrial, nuclear, and microbial metagenomic compartments, as well as the historical development and current practice of genetic diagnostics and gene therapies targeting each compartment.


2020 ◽  
Vol 10 (10) ◽  
pp. 3565-3574
Author(s):  
Jinping Wang ◽  
Shoule Tian ◽  
Xiaoli Sun ◽  
Xinchao Cheng ◽  
Naibin Duan ◽  
...  

The Chinese chestnut (Castanea mollissima Bl.) is a woody nut crop with a high ecological value. Although many cultivars have been selected from natural seedlings, elite lines with comprehensive agronomic traits and characters remain rare. Exploring genetic resources with the aid of whole-genome sequences will play an important role in modern breeding programs for chestnut. In this study, we generated a high-quality C. mollissima genome assembly by combining 90× Pacific Biosciences long-read and 170× high-throughput chromosome conformation capture data. The assembly was 688.93 Mb in total, with a contig N50 of 2.83 Mb. Most of the assembled sequences (99.75%) were anchored onto 12 chromosomes, and 97.07% of the assemblies were accurately anchored and oriented. A total of 33,638 protein-coding genes were predicted in the C. mollissima genome. Comparative genomic and transcriptomic analyses provided insights into the genes expressed in specific tissues, as well as those associated with burr development in the Chinese chestnut. This highly contiguous assembly of the C. mollissima genome provides a valuable resource for studies aiming to identify and characterize agronomically important traits, and will aid the design of breeding strategies to develop more focused, faster, and more predictable improvement programs.


2019 ◽  
Vol 6 (Supplement_2) ◽  
pp. S289-S289
Author(s):  
Shelby Simar ◽  
Blake Hanson ◽  
German Contreras ◽  
Katherine Reyes ◽  
Pranoti V Sahasrabhojane ◽  
...  

Background: Vancomycin-resistant enterococci (VRE) are a major cause of nosocomial bloodstream infections. Enterococci exhibit remarkable genomic plasticity and can recombine through the acquisition of genetic material via mobile genetic elements (MGEs), including resistance genes. The accessory genome plays a major role in the evolution of enterococci within the human host. Thus, dissecting the entire genome (pan-genome) is of paramount importance to characterize the population structure of enterococci causing disease.
Methods: VENOUS is an ongoing prospective, observational study of adults with enterococcal bacteremia. From September 2016 to March 2018, E. faecalis (Efs) and E. faecium (Efm) were collected in 14 hospitals of a single hospital system and a major cancer center in Houston, TX, and a general hospital in Detroit, MI. Short- and long-read genomic sequencing were performed with Illumina MiSeq and Oxford Nanopore Technologies GridION X5, respectively. A proprietary bioinformatics pipeline was utilized for genome assembly and further analyses.
Results: 156 Efs and 98 Efm isolates from single patients were analyzed. The average proportion of core genes in each genome was 64.6% (53.0–74.1) and 49.1% (45.2–51.0) for Efs and Efm, respectively. The vanA gene cluster was identified in 5.1% (8/157) of Efs and 57.1% (56/98) of Efm. The plasmid-encoded aac(6′)-Ie-aph(2″)-Ia gene conferring high-level resistance to aminoglycosides was found in 37.6% (59/157) Efs, seven of which also possessed vanA. Long-read sequencing of vanA-harboring plasmids from a subset of VRE revealed that the vanA cluster was carried in plasmids ranging from 31.7 to 132.3 kb. Although the vanA operon was fairly conserved, insertions of MGE were identified in the intergenic regions of vanS/vanH and vanX/vanY. Furthermore, a variety of MGE insertions mediated integration of the vanA operon, including IS1216 and IS256 (figure).
Conclusion: Accessory genes, including AMR genes, comprise a significant proportion of the enterococcal pan-genome, indicating major genetic plasticity within these organisms. Acquired resistance genes seem to have a high degree of recombination and play a substantial role in the expansion of the genomic repertoire in clinical isolates.
Disclosures: Samuel L. Aitken, PharmD, Melinta Therapeutics: Grant/Research Support, Research Grant; Merck, Sharpe, and Dohme: Advisory Board; Shionogi: Advisory Board.


2006 ◽  
Vol 33 (8) ◽  
pp. 775 ◽  
Author(s):  
Huyen T. T. Phan ◽  
Simon R. Ellwood ◽  
Rebecca Ford ◽  
Steve Thomas ◽  
Richard Oliver

Orthologous markers transferable between distantly related legume species allow for the rapid generation of genetic maps in species where there is little pre-existing genomic or EST information. We are using the model legume Medicago truncatula Gaertn. to develop such markers in legumes of importance to Australian agriculture. This will enable the construction of comparative genetic maps, help to determine patterns of chromosomal evolution in the legume family, and characterise syntenic relationships between M. truncatula and cultivated legumes. This information can then be used to identify markers that are tightly linked to the genes of interest, identify candidate gene(s) for a trait, and expedite the isolation of such genes. Among the Papilionoideae, we compared ESTs from the phylogenetically distant species M. truncatula, Lupinus albus and Glycine max to produce 500 intron-targeted amplified polymorphic markers (ITAPs). Together with 126 M. truncatula cross-species markers from the Department of Plant Pathology, University of California (USA), these markers were used to generate comparative genetic maps of lentil (Lens culinaris Medik.) and white lupin (Lupinus albus Linn.). Our results showed that 90% of the ITAP markers amplified genomic DNA in M. truncatula, 80% in Lupinus albus, and 70% in Lens culinaris. The comparative map of Lens culinaris was constructed based on 79 ITAP markers. The Lupinus albus comparative map was developed from 105 gene-based markers together with 223 AFLP markers. Although a direct and simple syntenic relationship was observed between M. truncatula and Lens culinaris genomes, there is evidence of moderate chromosomal rearrangement. This may account for the different chromosome numbers in the two species. A more complicated pattern among homologous blocks was apparent between the Lupinus albus and M. truncatula genomes.


2016 ◽  
Author(s):  
Diego D. Cambuy ◽  
Felipe H. Coutinho ◽  
Bas E. Dutilh

In modern-day metagenomics, there is an increasing need for robust taxonomic annotation of long DNA sequences from unknown micro-organisms. Long metagenomic sequences may be derived from assembly of short-read metagenomes, or from long-read single molecule sequencing. Here we introduce CAT, a pipeline for robust taxonomic classification of long DNA sequences. We show that CAT correctly classifies contigs at different taxonomic levels, even in simulated metagenomic datasets that are very distantly related to the sequences in the database. CAT is implemented in Python and the required scripts can be freely downloaded from GitHub.


2017 ◽  
Author(s):  
David Eccles ◽  
Jodie Chandler ◽  
Mali Camberis ◽  
Bernard Henrissat ◽  
Sergey Koren ◽  
...  

Eukaryotic genome assembly remains a challenge in part because of the prevalence of complex DNA repeats. This is a particularly acute problem for holocentric nematodes because of the large number of satellite DNA sequences found throughout their genomes. These have been recalcitrant to most genome sequencing methods. At the same time, many nematodes are parasites and some represent a serious threat to human health. There is a pressing need for better molecular characterization of animal and plant parasitic nematodes. The advent of long-read DNA sequencing methods offers the promise of resolving complex genomes. Using Nippostrongylus brasiliensis as a test case, applying improved base-calling algorithms and assembly methods, we demonstrate the feasibility of de novo genome assembly matching current community standards using only MinION long reads. In doing so, we uncovered an unexpected diversity of very long and complex DNA repeat sequences, including massive tandem repeats of tRNA genes. The method has the added advantage of preserving haplotypic variants and so has the potential to be used in population analyses.

