scholarly journals Metagenomic analysis of nepoviruses: diversity, evolution and identification of a genome region in members of subgroup A that appears to be important for host range

Author(s):  
J. M. Hily ◽  
N. Poulicard ◽  
J. Kubina ◽  
J. S. Reynard ◽  
A. S. Spilmont ◽  
...  

AbstractData mining and metagenomic analysis of 277 open reading frame sequences of bipartite RNA viruses of the genus Nepovirus, family Secoviridae, were performed, documenting how challenging it can be to unequivocally assign a virus to a particular species, especially those in subgroups A and C, based on some of the currently adopted taxonomic demarcation criteria. This work suggests a possible need for their amendment to accommodate pangenome information. In addition, we revealed a host-dependent structure of arabis mosaic virus (ArMV) populations at a cladistic level and confirmed a phylogeographic structure of grapevine fanleaf virus (GFLV) populations. We also identified new putative recombination events in members of subgroups A, B and C. The evolutionary specificity of some capsid regions of ArMV and GFLV that were described previously and biologically validated as determinants of nematode transmission was circumscribed in silico. Furthermore, a C-terminal segment of the RNA-dependent RNA polymerase of members of subgroup A was predicted to be a putative host range determinant based on statistically supported higher π (substitutions per site) values for GFLV and ArMV isolates infecting Vitis spp. compared with non-Vitis-infecting ArMV isolates. This study illustrates how sequence information obtained via high-throughput sequencing can increase our understanding of mechanisms that modulate virus diversity and evolution and create new opportunities for advancing studies on the biology of economically important plant viruses.

2021 ◽  
Author(s):  
jean-michel hily ◽  
Nils Poulicard ◽  
Julie Kubina ◽  
Jean-sebastien Reynard ◽  
Anne-Sophie Spilmont ◽  
...  

Abstract Datamining and metagenomic analyses of 277 open reading frame sequences of bipartite RNA viruses and variants in the genus Nepovirus documented how delicate it can be to unequivocally identify species, in particular subgroup A and C species, based on some of the currently adopted taxonomic demarcation criteria. It suggests a possible need for their amendment to accommodate pangenome information. In addition, we revealed a host-dependent structure of arabis mosaic virus (ArMV) populations at a cladistic level and confirmed a phylogeographic structure of grapevine fanleaf virus (GFLV) populations. We also identified new putative recombinant events for species of subgroups A, B and C. The evolutionary specificity of some capsid regions of ArMV and GFLV that were previously described and biologically validated as vector determinant was circumscribed in silico. Furthermore, a C-terminal segment of the RNA-dependent RNA polymerase of subgroup A species was predicted as a putative host range determinant based on statistically supported higher π values for GFLV and ArMV isolates infecting Vitis spp. compared to non-Vitis infecting ArMV isolates. This study illustrated how sequence information obtained via high throughput sequencing can increase our understanding of mechanisms that modulate virus diversity and evolution and create new opportunities for advancing studies on the biology of economically important plant viruses.


Viruses ◽  
2021 ◽  
Vol 13 (7) ◽  
pp. 1304
Author(s):  
Nicolás Bejerman ◽  
Ralf G. Dietzgen ◽  
Humberto Debat

Rhabdoviruses infect a large number of plant species and cause significant crop diseases. They have a negative-sense, single-stranded unsegmented or bisegmented RNA genome. The number of plant-associated rhabdovirid sequences has grown in the last few years in concert with the extensive use of high-throughput sequencing platforms. Here, we report the discovery of 27 novel rhabdovirus genomes associated with 25 different host plant species and one insect, which were hidden in public databases. These viral sequences were identified through homology searches in more than 3000 plant and insect transcriptomes from the National Center for Biotechnology Information (NCBI) Sequence Read Archive (SRA) using known plant rhabdovirus sequences as the query. The identification, assembly and curation of raw SRA reads resulted in sixteen viral genome sequences with full-length coding regions and ten partial genomes. Highlights of the obtained sequences include viruses with unique and novel genome organizations among known plant rhabdoviruses. Phylogenetic analysis showed that thirteen of the novel viruses were related to cytorhabdoviruses, one to alphanucleorhabdoviruses, five to betanucleorhabdoviruses, one to dichorhaviruses and seven to varicosaviruses. These findings resulted in the most complete phylogeny of plant rhabdoviruses to date and shed new light on the phylogenetic relationships and evolutionary landscape of this group of plant viruses. Furthermore, this study provided additional evidence for the complexity and diversity of plant rhabdovirus genomes and demonstrated that analyzing SRA public data provides an invaluable tool to accelerate virus discovery, gain evolutionary insights and refine virus taxonomy.


Plants ◽  
2021 ◽  
Vol 10 (4) ◽  
pp. 753
Author(s):  
Miroslav Glasa ◽  
Richard Hančinský ◽  
Katarína Šoltys ◽  
Lukáš Predajňa ◽  
Jana Tomašechová ◽  
...  

In recent years, high throughput sequencing (HTS) has brought new possibilities to the study of the diversity and complexity of plant viromes. Mixed infection of a single plant with several viruses is frequently observed in such studies. We analyzed the virome of 10 tomato and sweet pepper samples from Slovakia, all showing the presence of potato virus Y (PVY) infection. Most datasets allow the determination of the nearly complete sequence of a single-variant PVY genome, belonging to one of the PVY recombinant strains (N-Wi, NTNa, or NTNb). However, in three to-mato samples (T1, T40, and T62) the presence of N-type and O-type sequences spanning the same genome region was documented, indicative of mixed infections involving different PVY strains variants, hampering the automated assembly of PVY genomes present in the sample. The N- and O-type in silico data were further confirmed by specific RT-PCR assays targeting UTR-P1 and NIa genomic parts. Although full genomes could not be de novo assembled directly in this situation, their deep coverage by relatively long paired reads allowed their manual re-assembly using very stringent mapping parameters. These results highlight the complexity of PVY infection of some host plants and the challenges that can be met when trying to precisely identify the PVY isolates involved in mixed infection.


2020 ◽  
Vol 16 (11) ◽  
pp. e1008415
Author(s):  
Teresa Maria Rosaria Noviello ◽  
Francesco Ceccarelli ◽  
Michele Ceccarelli ◽  
Luigi Cerulo

Small non-coding RNAs (ncRNAs) are short non-coding sequences involved in gene regulation in many biological processes and diseases. The lack of a complete comprehension of their biological functionality, especially in a genome-wide scenario, has demanded new computational approaches to annotate their roles. It is widely known that secondary structure is determinant to know RNA function and machine learning based approaches have been successfully proven to predict RNA function from secondary structure information. Here we show that RNA function can be predicted with good accuracy from a lightweight representation of sequence information without the necessity of computing secondary structure features which is computationally expensive. This finding appears to go against the dogma of secondary structure being a key determinant of function in RNA. Compared to recent secondary structure based methods, the proposed solution is more robust to sequence boundary noise and reduces drastically the computational cost allowing for large data volume annotations. Scripts and datasets to reproduce the results of experiments proposed in this study are available at: https://github.com/bioinformatics-sannio/ncrna-deep.


2020 ◽  
Vol 110 (1) ◽  
pp. 68-79 ◽  
Author(s):  
Merike Sõmera ◽  
Anders Kvarnheden ◽  
Cécile Desbiez ◽  
Dag-Ragnar Blystad ◽  
Pille Sooväli ◽  
...  

High-throughput sequencing technologies were used to identify plant viruses in cereal samples surveyed from 2012 to 2017. Fifteen genome sequences of a tenuivirus infecting wheat, oats, and spelt in Estonia, Norway, and Sweden were identified and characterized by their distances to other tenuivirus sequences. Like most tenuiviruses, the genome of this tenuivirus contains four genomic segments. The isolates found from different countries shared at least 92% nucleotide sequence identity at the genome level. The planthopper Javesella pellucida was identified as a vector of the virus. Laboratory transmission tests using this vector indicated that wheat, oats, barley, rye, and triticale, but none of the tested pasture grass species (Alopecurus pratensis, Dactylis glomerata, Festuca rubra, Lolium multiflorum, Phleum pratense, and Poa pratensis), are susceptible. Taking into account the vector and host range data, the tenuivirus we have found most probably represents European wheat striate mosaic virus first identified about 60 years ago. Interestingly, whereas we were not able to infect any of the tested cereal species mechanically, Nicotiana benthamiana was infected via mechanical inoculation in laboratory conditions, displaying symptoms of yellow spots and vein clearing evolving into necrosis, eventually leading to plant death. Surprisingly, one of the virus genome segments (RNA2) encoding both a putative host systemic movement enhancer protein and a putative vector transmission factor was not detected in N. benthamiana after several passages even though systemic infection was observed, raising fundamental questions about the role of this segment in the systemic spread in several hosts.


Viruses ◽  
2020 ◽  
Vol 12 (1) ◽  
pp. 111 ◽  
Author(s):  
Benoît Moury ◽  
Cécile Desbiez

Virus host range, i.e., the number and diversity of host species of viruses, is an important determinant of disease emergence and of the efficiency of disease control strategies. However, for plant viruses, little is known about the genetic or ecological factors involved in the evolution of host range. Using available genome sequences and host range data, we performed a phylogenetic analysis of host range evolution in the genus Potyvirus, a large group of plant RNA viruses that has undergone a radiative evolution circa 7000 years ago, contemporaneously with agriculture intensification in mid Holocene. Maximum likelihood inference based on a set of 59 potyviruses and 38 plant species showed frequent host range changes during potyvirus evolution, with 4.6 changes per plant species on average, including 3.1 host gains and 1.5 host loss. These changes were quite recent, 74% of them being inferred on the terminal branches of the potyvirus tree. The most striking result was the high frequency of correlated host gains occurring repeatedly in different branches of the potyvirus tree, which raises the question of the dependence of the molecular and/or ecological mechanisms involved in adaptation to different plant species.


2020 ◽  
Vol 3 (1) ◽  
Author(s):  
Sara Lado ◽  
Jean Pierre Elbers ◽  
Angela Doskocil ◽  
Davide Scaglione ◽  
Emiliano Trucchi ◽  
...  

AbstractDromedaries have been essential for the prosperity of civilizations in arid environments and the dispersal of humans, goods and cultures along ancient, cross-continental trading routes. With increasing desertification their importance as livestock species is rising rapidly, but little is known about their genome-wide diversity and demographic history. As previous studies using few nuclear markers found weak phylogeographic structure, here we detected fine-scale population differentiation in dromedaries across Asia and Africa by adopting a genome-wide approach. Global patterns of effective migration rates revealed pathways of dispersal after domestication, following historic caravan routes like the Silk and Incense Roads. Our results show that a Pleistocene bottleneck and Medieval expansions during the rise of the Ottoman empire have shaped genome-wide diversity in modern dromedaries. By understanding subtle population structure we recognize the value of small, locally adapted populations and appeal for securing genomic diversity for a sustainable utilization of this key desert species.


BMC Genetics ◽  
2019 ◽  
Vol 20 (1) ◽  
Author(s):  
Liping Guan ◽  
Ke Cao ◽  
Yong Li ◽  
Jian Guo ◽  
Qiang Xu ◽  
...  

Abstract Background Peach (Prunus persica L.) is a diploid species and model plant of the Rosaceae family. In the past decade, significant progress has been made in peach genetic research via DNA markers, but the number of these markers remains limited. Results In this study, we performed a genome-wide DNA markers detection based on sequencing data of six distantly related peach accessions. A total of 650,693~1,053,547 single nucleotide polymorphisms (SNPs), 114,227~178,968 small insertion/deletions (InDels), 8386~12,298 structure variants (SVs), 2111~2581 copy number variants (CNVs) and 229,357~346,940 simple sequence repeats (SSRs) were detected and annotated. To demonstrate the application of DNA markers, 944 SNPs were filtered for association study of fruit ripening time and 15 highly polymorphic SSRs were selected to analyze the genetic relationship among 221 accessions. Conclusions The results showed that the use of high-throughput sequencing to develop DNA markers is fast and effective. Comprehensive identification of DNA markers, including SVs and SSRs, would be of benefit to genetic diversity evaluation, genetic mapping, and molecular breeding of peach.


Plants ◽  
2019 ◽  
Vol 8 (8) ◽  
pp. 270 ◽  
Author(s):  
Yun Gyeong Lee ◽  
Sang Chul Choi ◽  
Yuna Kang ◽  
Kyeong Min Kim ◽  
Chon-Sik Kang ◽  
...  

The whole genome sequencing (WGS) has become a crucial tool in understanding genome structure and genetic variation. The MinION sequencing of Oxford Nanopore Technologies (ONT) is an excellent approach for performing WGS and it has advantages in comparison with other Next-Generation Sequencing (NGS): It is relatively inexpensive, portable, has simple library preparation, can be monitored in real-time, and has no theoretical limits on reading length. Sorghum bicolor (L.) Moench is diploid (2n = 2x = 20) with a genome size of about 730 Mb, and its genome sequence information is released in the Phytozome database. Therefore, sorghum can be used as a good reference. However, plant species have complex and large genomes when compared to animals or microorganisms. As a result, complete genome sequencing is difficult for plant species. MinION sequencing that produces long-reads can be an excellent tool for overcoming the weak assembly of short-reads generated from NGS by minimizing the generation of gaps or covering the repetitive sequence that appears on the plant genome. Here, we conducted the genome sequencing for S. bicolor cv. BTx623 while using the MinION platform and obtained 895,678 reads and 17.9 gigabytes (Gb) (ca. 25× coverage of reference) from long-read sequence data. A total of 6124 contigs (covering 45.9%) were generated from Canu, and a total of 2661 contigs (covering 50%) were generated from Minimap and Miniasm with a Racon through a de novo assembly using two different tools and mapped assembled contigs against the sorghum reference genome. Our results provide an optimal series of long-read sequencing analysis for plant species while using the MinION platform and a clue to determine the total sequencing scale for optimal coverage that is based on various genome sizes.


Sign in / Sign up

Export Citation Format

Share Document