Phylogenetic Analysis of HIV-1 Genomes Based on the Position-Weighted K-mers Method

Entropy ◽  
2020 ◽  
Vol 22 (2) ◽  
pp. 255
Author(s):  
Yuanlin Ma ◽  
Zuguo Yu ◽  
Runbin Tang ◽  
Xianhua Xie ◽  
Guosheng Han ◽  
...  

HIV-1 viruses, which are predominant in the family of HIV viruses, have strong pathogenicity and infectivity. They can evolve into many different variants in a very short time. In this study, we propose a new and effective alignment-free method for the phylogenetic analysis of HIV-1 viruses using complete genome sequences. Our method combines the position distribution information and the counts of the k-mers together. We also propose a metric to determine the optimal k value. We name our method the Position-Weighted k-mers (PWkmer) method. Validation and comparison with the Robinson–Foulds distance method and the modified bootstrap method on a benchmark dataset show that our method is reliable for the phylogenetic analysis of HIV-1 viruses. PWkmer can resolve within-group variations for different known subtypes of Group M of HIV-1 viruses. This method is simple and computationally fast for whole genome phylogenetic analysis.
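The abstract does not give the PWkmer formula, so the following is only a minimal sketch of the general idea it describes: recording, for each k-mer, both how often it occurs and where in the sequence it occurs. The function name `kmer_position_profile` and the choice of mean relative position as the positional summary are assumptions made for illustration, not the authors' weighting scheme.

```python
from collections import defaultdict


def kmer_position_profile(seq, k):
    """Return {kmer: (count, mean_relative_position)} for a DNA sequence.

    Hypothetical helper for illustration; the positional summary used here
    (mean of 1-based start positions divided by the number of windows) is an
    assumption, not the published PWkmer weighting.
    """
    seq = seq.upper()
    n_windows = len(seq) - k + 1
    positions = defaultdict(list)
    for i in range(n_windows):
        kmer = seq[i:i + k]
        # Skip windows containing ambiguous bases such as N.
        if set(kmer) <= set("ACGT"):
            positions[kmer].append((i + 1) / n_windows)
    return {
        kmer: (len(pos), sum(pos) / len(pos))
        for kmer, pos in positions.items()
    }


profile = kmer_position_profile("ACGTAC", 2)
for kmer in sorted(profile):
    count, mean_pos = profile[kmer]
    print(kmer, count, round(mean_pos, 3))
```

Two genomes with identical k-mer counts but different k-mer placements produce different profiles under this sketch, which is the kind of information a count-only method discards.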


2019 ◽  
Vol 165 (1) ◽  
pp. 21-31 ◽  
Author(s):  
A. M. Dullemans ◽  
M. Botermans ◽  
M. J. D. de Kock ◽  
C. E. de Krom ◽  
T. A. J. van der Lee ◽  
...  

To obtain insight into the sequence diversity of strawberry latent ringspot virus (SLRSV), isolates from collections and diagnostic samples were sequenced by high-throughput sequencing. For five SLRSV isolates, the complete genome sequences were determined, and for 18 other isolates nearly complete genome sequences were determined. The sequence data were analysed in relation to sequences of SLRSV and related virus isolates available in the NCBI GenBank database. The genome sequences were annotated, and sequences of the protease-polymerase (Pro-Pol) region and coat proteins (CPs) (large and small CP together) were used for phylogenetic analysis. The amino acid sequences of the Pro-Pol region were very similar, whereas the nucleotide sequences of this region were more variable. The amino acid sequences of the CPs were less similar, which was corroborated by the results of a serological comparison performed using antisera raised against different isolates of SLRSV. Based on these results, we propose that SLRSV and related unassigned viruses be assigned to a new genus within the family Secoviridae, named “Stralarivirus”. Based on the phylogenetic analysis, this genus should include at least three viruses, i.e., SLRSV-A, SLRSV-B and lychnis mottle virus. The newly generated sequence data provide a basis for designing molecular tests to screen for SLRSV.



2019 ◽  
Author(s):  
Chengyuan Wu ◽  
Shiquan Ren ◽  
Jie Wu ◽  
Kelin Xia

We introduce an alignment-free method, the Magnus Representation, to analyze genome sequences. The Magnus Representation captures higher-order information in genome sequences. We combine our approach with the idea of k-mers to define an effectively computable Mean Magnus Vector. We perform phylogenetic analysis on three datasets: mosquito-borne viruses, filoviruses, and bacterial genomes. Our results on ebolaviruses are consistent with previous phylogenetic analyses, and confirm the modern viewpoint that the 2014 West African Ebola outbreak likely originated from Central Africa. Our analysis also confirms the close relationship between Bundibugyo ebolavirus and Taï Forest ebolavirus. For bacterial genomes, our method is able to classify relatively well at the family and genus level, as well as at higher levels such as phylum level. The bacterial genomes are also separated well into Gram-positive and Gram-negative subgroups.



Viruses ◽  
2021 ◽  
Vol 13 (9) ◽  
pp. 1842
Author(s):  
Bert Vanmechelen ◽  
Zafeiro Zisi ◽  
Sophie Gryseels ◽  
Joëlle Goüy de Bellocq ◽  
Bram Vrancken ◽  
...  

Recent years have witnessed the discovery of several new viruses belonging to the family Arteriviridae, expanding the known diversity and host range of this group of complex RNA viruses. Although the pathological relevance of these new viruses is not always clear, several well-studied members of the family Arteriviridae are known to be important animal pathogens. Here, we report the complete genome sequences of four new arterivirus variants, belonging to two putative novel species. These new arteriviruses were discovered in African rodents and were given the names Lopma virus and Praja virus. Their genomes follow the characteristic genome organization of all known arteriviruses, even though they are only distantly related to currently known rodent-borne arteriviruses. Phylogenetic analysis shows that Lopma virus clusters in the subfamily Variarterivirinae, while Praja virus clusters near members of the subfamily Heroarterivirinae: the yet undescribed forest pouched giant rat arterivirus and hedgehog arterivirus 1. A co-divergence analysis of rodent-borne arteriviruses confirms that they share similar phylogenetic patterns with their hosts, with only very few cases of host shifting events throughout their evolutionary history. Overall, the genomes described here and their unique clustering with other arteriviruses further illustrate the existence of multiple rodent-borne arterivirus lineages, expanding our knowledge of the evolutionary origin of these viruses.



2019 ◽  
Vol 8 (34) ◽  
Author(s):  
Hazuki Yamashita ◽  
Takayuki Wada ◽  
Yusuke Kato ◽  
Takuji Ikeda ◽  
Masayuki Imajoh

Flavobacterium psychrophilum is a Gram-negative, psychrophilic bacterium within the family Flavobacteriaceae. Here, we report the draft genome sequences of three F. psychrophilum strains isolated from skin ulcers of diseased ayu caught by tomozuri angling at three sites in the Kagami River in Japan.



Pathogens ◽  
2021 ◽  
Vol 10 (1) ◽  
pp. 41
Author(s):  
Marcos Godoy ◽  
Daniel A. Medina ◽  
Rudy Suarez ◽  
Sandro Valenzuela ◽  
Jaime Romero ◽  
...  

Piscine orthoreovirus (PRV) belongs to the family Reoviridae and has been described mainly in association with salmonid infections. The genome of PRV consists of about 23,600 bp, with 10 segments of double-stranded RNA, classified as small (S1 to S4), medium (M1, M2 and M3) and large (L1, L2 and L3); these range approximately from 1000 bp (segment S4) to 4000 bp (segment L1). How the genetic variation among PRV strains affects the virulence for salmonids is still poorly understood. The aim of this study was to describe the molecular phylogeny of PRV based on an extensive sequence analysis of the S1 and M2 segments of PRV available in the GenBank database to date (May 2020). The analysis was extended to include new PRV sequences for S1 and M2 segments. In addition, subgenotype classifications were assigned to previously published unclassified sequences. It was concluded that the phylogenetic trees are consistent with the original classification using the PRV genomic segment S1, which differentiates PRV into two major genotypes, I and II, and each of these into two subgenotypes, designated as Ia and Ib, and IIa and IIb, respectively. Moreover, some clusters of country- and host-specific PRV subgenotypes were observed in the subset of sequences used. This work strengthens the subgenotype classification of PRV based on the S1 segment and can be used to enhance research on the virulence of PRV.



2003 ◽  
Vol 60 (3) ◽  
pp. 533-568 ◽  
Author(s):  
J. C. MANNING ◽  
P. GOLDBLATT ◽  
M. F. FAY

A revised generic synopsis of sub-Saharan Hyacinthaceae is presented, based on a molecular phylogenetic analysis of the family. Generic rank is accorded only to reciprocally monophyletic clades that can be distinguished by recognizable morphological discontinuities, thereby permitting an appropriate generic assignment of species not included in the analysis. Three subfamilies are recognized within the region. Subfamily Ornithogaloideae, characterized by flattened or angular seeds with tightly adhering testa, is considered to include the single genus Ornithogalum, which is expanded to include the genera Albuca, Dipcadi, Galtonia, Neopatersonia and Pseudogaltonia. Recognizing any of these segregates at generic level renders the genus Ornithogalum polyphyletic, while subdivision of Ornithogalum into smaller, morphologically distinguishable segregates in order to preserve the monophyly of each is not possible. Subfamily Urgineoideae, characterized by flattened or winged seeds with brittle, loosely adhering testa, comprises the two mainland African genera Bowiea and Drimia. The latter is well circumscribed by its deciduous, short-lived perianth and includes the previously recognized genera Litanthus, Rhadamanthus, Schizobasis and Tenicroa. The monotypic Madagascan Igidia is provisionally included in the subfamily as a third genus on the basis of its seeds, pending molecular confirmation of its relationships. Subfamily Hyacinthoideae resolves into three clades, distinguished as tribes Hyacintheae (strictly northern hemisphere and not treated further), Massonieae and Pseudoprospereae tribus nov. Full descriptions and a key to their identification are provided for all genera. New combinations reflecting the generic circumscriptions adopted here are made for most African and all Indian and Madagascan species.



Genetics ◽  
2003 ◽  
Vol 165 (2) ◽  
pp. 613-621 ◽  
Author(s):  
Douglas R Dorer ◽  
Jamie A Rudnick ◽  
Etsuko N Moriyama ◽  
Alan C Christensen

Within the unique Triplo-lethal region (Tpl) of the Drosophila melanogaster genome we have found a cluster of 20 genes encoding a novel family of proteins. This family is also present in the Anopheles gambiae genome and displays remarkable synteny and sequence conservation with the Drosophila cluster. The family is also present in the sequenced genome of D. pseudoobscura, and homologs have been found in Aedes aegypti mosquitoes and in four other insect orders, but it is not present in the sequenced genome of any noninsect species. Phylogenetic analysis suggests that the cluster evolved prior to the divergence of Drosophila and Anopheles (250 MYA) and has been highly conserved since. The ratio of synonymous to nonsynonymous substitutions and the high codon bias suggest that there has been selection on this family both for expression level and function. We hypothesize that this gene family is Tpl, name it the Osiris family, and consider possible functions. We also predict that this family of proteins, due to the unique dosage sensitivity and the lack of homologs in noninsect species, would be a good target for genetic engineering or novel insecticides.



2021 ◽  
Vol 95 ◽  
Author(s):  
M.M. Montes ◽  
J. Barneche ◽  
Y. Croci ◽  
D. Balcazar ◽  
A. Almirón ◽  
...  

During a parasitological survey of fishes at Iguazu National Park, Argentina, specimens belonging to the allocreadiid genus Auriculostoma were collected from the intestine of Characidium heirmostigmata. The erection of the new species is based on a unique combination of morphological traits as well as on phylogenetic analysis. Auriculostoma guacurarii n. sp. resembles four congeneric species – Auriculostoma diagonale, Auriculostoma platense, Auriculostoma tica and Auriculostoma totonacapanensis – in having smooth and oblique testes, but can be distinguished by a combination of several morphological features, host association and geographic distribution. Morphologically, the new species can be distinguished from both A. diagonale and A. platense by the egg size (larger in the former and smaller in the latter); from A. tica by a shorter body length, the genital pore position and the extension of the caeca; and from A. totonacapanensis by the size of the oral and ventral suckers and the post-testicular space. Additionally, one specimen of Auriculostoma cf. stenopteri from the characid Charax stenopterus (Characiformes) from La Plata River, Argentina, was sampled and the partial 28S rRNA gene was sequenced. The phylogenetic analysis revealed that A. guacurarii n. sp. clustered with A. tica and these two as sister taxa to A. cf. stenopteri. The new species described herein is the tenth species in the genus and the first one parasitizing a member of the family Crenuchidae.



2009 ◽  
Vol 23 (3) ◽  
pp. 193 ◽  
Author(s):  
Matjaž Kuntner ◽  
Ingi Agnarsson

Phylogenies are underutilised, powerful predictors of traits in unstudied species. We tested phylogenetic predictions of web-related behaviour in Clitaetra Simon, 1889, an Afro-Indian spider genus of the family Nephilidae. Clitaetra is phylogenetically sister to all other nephilids and thus important for understanding ancestral traits. Behavioural information on Clitaetra has been limited to only C. irenae Kuntner, 2006 from South Africa, which constructs ladder webs. A resolved species-level phylogeny unambiguously optimised Clitaetra behavioural biology and predicted web traits in five unstudied species and a uniform intrageneric nephilid web biology. We tested these predictions by studying the ecology and web biology of C. perroti Simon, 1894 on Madagascar and C. episinoides Simon, 1889 on Mayotte. We confirm predicted arboricolous web architecture in these species. The expected ontogenetic allometric transition from orbs in juveniles to elongate ladder webs in adults was statistically significant in C. perroti, but marginally non-significant in C. episinoides. We demonstrate the persistence of the temporary spiral in finished Clitaetra webs. A morphological and behavioural phylogenetic analysis resulted in unchanged topology and persisting unambiguous behavioural synapomorphies. Our results support the homology of Clitaetra hub reinforcement with the nephilid hub-cup. In Clitaetra, behaviour was highly predictable and remained consistent with new observations. Our results confirm that nephilid web biology is evolutionarily conserved within genera.


