nucleotide composition
Recently Published Documents


TOTAL DOCUMENTS

494
(FIVE YEARS 158)

H-INDEX

49
(FIVE YEARS 6)

2021 ◽  
Vol 10 (1) ◽  
pp. 31
Author(s):  
Alina Matsvay ◽  
Marina Dyachkova ◽  
Ivan Mikhaylov ◽  
Daniil Kiselev ◽  
Anna Say ◽  
...  

Discovery and study of viruses carried by migratory birds are tasks of high importance due to the host’s ability to spread infectious diseases over significant distances. With this paper, we present and characterize the first complete genome sequence of atadenovirus from a tern bird (common tern, Sterna hirundo) preliminarily named tern atadenovirus 1 (TeAdV-1). TeAdV-1 genome is a linear double-stranded DNA molecule, 31,334 base pairs which contain 30 methionine-initiated open reading frames with gene structure typical for Atadenovirus genus, and the shortest known inverted terminal repeats (ITRs) within the Atadenovirus genus consisted of 25 bases. The nucleotide composition of the genome is characterized by a low G + C content (33.86%), which is the most AT-rich genome of known avian adenoviruses within Atadenovirus genus. The nucleotide sequence of the TeAdV-1 genome shows high divergence compared to known representatives of the Atadenovirus genus with the highest similarity to the duck atadenovirus 1 (53.7%). Phylogenetic analysis of the protein sequences of core genes confirms the taxonomic affiliation of the new representative to the genus Atadenovirus with the degree of divergence from the known representatives exceeding the interspecies distance within the genus. Thereby we proposed a novel TeAdV-1 to be considered as a separate species.


2021 ◽  
Author(s):  
Megha Mathur ◽  
Sumeet Patiyal ◽  
Anjali Dhall ◽  
Shipra Jain ◽  
Ritu Tomer ◽  
...  

In the past few decades, public repositories on nucleotides have increased with exponential rates. This pose a major challenge to researchers to predict the structure and function of nucleotide sequences. In order to annotate function of nucleotide sequences it is important to compute features/attributes for predicting function of these sequences using machine learning techniques. In last two decades, several software/platforms have been developed to elicit a wide range of features for nucleotide sequences. In order to complement the existing methods, here we present a platform named Nfeature developed for computing wide range of features of DNA and RNA sequences. It comprises of three major modules namely Composition, Correlation, and Binary profiles. Composition module allow to compute different type of compositions that includes mono-/di-tri-nucleotide composition, reverse complement composition, pseudo composition. Correlation module allow to compute various type of correlations that includes auto-correlation, cross-correlation, pseudo-correlation. Similarly, binary profile is developed for computing binary profile based on nucleotides, di-nucleotides, di-/tri-nucleotide properties. Nfeature also allow to compute entropy of sequences, repeats in sequences and distribution of nucleotides in sequences. In addition to compute feature in whole sequence, it also allows to compute features from part of sequence like split-composition, N-terminal, C-terminal. In a nutshell, Nfeature amalgamates existing features as well as number of novel features like nucleotide repeat index, distance distribution, entropy, binary profile, and properties. This tool computes a total of 29217 and 14385 features for DNA and RNA sequence, respectively. In order to provide, a highly efficient and user-friendly tool, we have developed a standalone package and web-based platform (https://webs.iiitd.edu.in/raghava/nfeature).


2021 ◽  
Vol 7 (12) ◽  
Author(s):  
Alex Z. Zaccaron ◽  
Ioannis Stergiopoulos

Powdery mildews comprise a large group of economically important phytopathogenic fungi. However, limited information exists on their mitochondrial genomes. Here, we assembled and compared the mitochondrial genomes of the powdery mildew pathogens Blumeria graminis f. sp. tritici, Erysiphe pisi, and Golovinomyces cichoracearum. Included in the comparative analysis was also the mitochondrial genome of Erysiphe necator that was previously analysed. The mitochondrial genomes of the four Erysiphales exhibit a similar gene content and organization but a large variation in size, with sizes ranging from 109800 bp in B. graminis f. sp. tritici to 332165 bp in G. cichoracearum, which is the largest mitochondrial genome of a fungal pathogen reported to date. Further comparative analysis revealed an unusual bimodal GC distribution in the mitochondrial genomes of B. graminis f. sp. tritici and G. cichoracearum that was not previously observed in fungi. The cytochrome b (cob) genes of E. necator, E. pisi, and G. cichoracearum were also exceptionally rich in introns, which in turn harboured rare open reading frames encoding reverse transcriptases that were likely acquired horizontally. Golovinomyces cichoracearum had also the longest cob gene (45 kb) among 703 fungal cob genes analysed. Collectively, these results provide novel insights into the organization of mitochondrial genomes of powdery mildew pathogens and represent valuable resources for population genetics and evolutionary studies.


Author(s):  
I. V. Stepanyan

The author examines new challenges of ergonomics and occupational health, including unknown risks, issues of prevention, and ethics. The author also presents an overview of modern bioinformatics systems and visualization methods in bioinformatics. The researcher analyzed the health risks of human interaction with large volumes of textual information and advanced computational methods to prevent computer syndrome, including overstrain of the visual analyzer and pain in the back, neck, and hands. The study aims to analyze the representations of hereditary molecular genetic information in the form of graphic patterns available for visual perception, characterizing the initial data, and study the possibility of visualizing large amounts of data using the method of characteristic patterns. The author developed new methods of presenting large volumes of hereditary genetic information in bioinformatic systems. The basis of the method is information processing based on computer algorithms. The methods allow us to visually assess the differences in the genetic structure of various species of living organisms and identify the features of their nucleotide composition. The fixation of the internal ordering of the information signal in an individual graphical quasi-fractal structure is a characteristic feature of the methods considered. It makes it possible to expand the possibilities of visual-analytical thinking of a person when interacting with large amounts of information through bioinformatics tools.


Author(s):  
Ashley Haluck-Kangas ◽  
Monal Patel ◽  
Bidur Paudel ◽  
Aparajitha Vaidyanathan ◽  
Andrea E. Murmann ◽  
...  

Abstractmicro(mi)RNAs are short noncoding RNAs that through their seed sequence (pos. 2–7/8 of the guide strand) regulate cell function by targeting complementary sequences (seed matches) located mostly in the 3′ untranslated region (3′ UTR) of mRNAs. Any short RNA that enters the RNA induced silencing complex (RISC) can kill cells through miRNA-like RNA interference when its 6mer seed sequence (pos. 2–7 of the guide strand) has a G-rich nucleotide composition. G-rich seeds mediate 6mer Seed Toxicity by targeting C-rich seed matches in the 3′ UTR of genes critical for cell survival. The resulting Death Induced by Survival gene Elimination (DISE) predominantly affects cancer cells but may contribute to cell death in other disease contexts. This review summarizes recent findings on the role of DISE/6mer Seed Tox in cancer; its therapeutic potential; its contribution to therapy resistance; its selectivity, and why normal cells are protected. In addition, we explore the connection between 6mer Seed Toxicity and aging in relation to cancer and certain neurodegenerative diseases.


Insects ◽  
2021 ◽  
Vol 12 (12) ◽  
pp. 1049
Author(s):  
Huifeng Zhao ◽  
Ye Chen ◽  
Zitong Wang ◽  
Haifeng Chen ◽  
Yaoguang Qin

The complete mitochondrial genomes of two species of Chalcididae were newly sequenced: Brachymeria lasus and Haltichella nipponensis. Both circular mitogenomes are 15,147 and 15,334 bp in total length, respectively, including 13 protein-coding genes (PCGs), two ribosomal RNA genes (rRNAs), and 22 transfer RNA genes (tRNAs) and an A+T-rich region. The nucleotide composition indicated a strong A/T bias. All PCGs of B. lasus and H. nipponensis began with the start codon ATD, except for B. lasus, which had an abnormal initiation codon TTG in ND1. Most PCGs of the two mitogenomes are terminated by a codon of TAR, and the remaining PCGs by the incomplete stop codon T or TA (ATP6, COX3, and ND4 in both species, with an extra CYTB in B. lasus). Except for trnS1 and trnF, all tRNAs can be folded into a typical clover structure. Both mitogenomes had similar control regions, and two repeat units of 135 bp were found in H. nipponensis. Phylogenetic analyses based on two datasets (PCG123 and PCG12) covering Chalcididae and nine families of Chalcidoidea were conducted using two methods (maximum likelihood and Bayesian inference); all the results support Mymaridae as the sister group of the remaining Chalcidoidea, with Chalcididae as the next successive group. Only analyses of PCG123 generated similar topologies of Mymaridae + (Chalcididae + (Agaonidae + remaining Chalcidoidea)) and provided one relative stable clade as Eulophidae + (Torymidae + (Aphelinidae + Trichogrammatidae)). Our mitogenomic phylogenetic results share one important similarity with earlier molecular phylogenetic efforts: strong support for the monophyly of many families, but a largely unresolved or unstable “backbone” of relationships among families.


Agronomy ◽  
2021 ◽  
Vol 11 (11) ◽  
pp. 2289
Author(s):  
Yuan Niu ◽  
Yanyan Luo ◽  
Chunlei Wang ◽  
Weibiao Liao

Cucumber is the most important vegetable crop in the Cucurbitaceae family. Condon usage bias (CUB) is a valuable character of species evolution. However, there is little research on the CUB of cucumber. Thus, this study analyzes the codon usage patterns of cucumber and its relatives within Cucurbitaceae on the genomic level. The analysis of fundamental indicators of codon characteristics shows that it was slightly GC poor, and there was weak codon usage bias in cucumber. We conduct the analysis of neutrality plot, ENC plot, P2 index, and COA indicates that the nucleotide composition, mutation pressure, and translational selection might play roles in CUB in cucumber and its relatives. Among these factors, nucleotide composition might play the most critical role. Based on these analyses, 30 optimal codons were identified in cucumber, most of them ending with U or A. Meanwhile, based on the RSCU values of species, a cluster tree was constructed, in which the situation of cucumber is consistent with the current taxonomic and evolutionary studies in Cucurbitaceae. This study systematically compared the CUB patterns and shaping factors of cucumber and its relatives, laying a foundation for future research on genetic engineering and evolutionary mechanisms in Cucurbitaceae.


2021 ◽  
Author(s):  
Jonas Koeppel ◽  
Elin Madli Peets ◽  
Juliane Weller ◽  
Ananth Pallaseni ◽  
Fabio Liberante ◽  
...  

Any short sequence can be precisely written into a selected genomic target using prime editing. This ability facilitates protein tagging, correction of pathogenic deletions, and many other exciting applications. However, it remains unclear what types of sequences prime editors can efficiently insert, and how to choose optimal reagents for a desired outcome. To characterize features that influence insertion efficiency, we designed a library of 2,666 sequences up to 69 nt in length and measured the frequency of their insertion into four genomic sites in three human cell lines, using different prime editor systems. We discover that insertion sequence length, nucleotide composition and secondary structure all affect insertion rates, and that mismatch repair proficiency is a strong determinant for the shortest insertions. Combining the sequence and repair features into a machine learning model, we can predict insertion frequency for new sequences with R = 0.69. The tools we provide allow users to choose optimal constructs for DNA insertion using prime editing.


2021 ◽  
pp. 1040-1053
Author(s):  
Nickolai A. Bochkarev ◽  
Elena I. Zuykova ◽  
Lyudmila A. Pestryakova ◽  
Lena A. Ushnitskaya ◽  
Evgeny S. Zakharov ◽  
...  

The results of morphological and genetic analyses of forms/species of the Coregonus lavaretus pidschian (Gmelin, 1789) complex from the Indigirka and Kolyma river basins are presented in the context of there being recent postglacial speciation events. It has been found that the studied whitefishes belong to the sparsely rakered and low lateral-line forms and have previously been described as Coregonus lavaretus pidschian n. jucagiricus Drjagin (Berg), 1932. Based on these characters, this whitefish does not differ from most Arctic whitefish populations (in particular from Coregonus lavaretus glacialis Kirillov, 1972). Analysis of variability of the ND1 gene (mtDNA) showed that whitefishes from the Indigirka and Kolyma basins belong to a distant phylogenetic lineage, which is significantly different from all previously studied whitefish lineages from the Ob, Yenisei, Lena, Anadyr, and Amur river basins. Analysis of variability of the ITS1 fragment (nDNA) showed that all studied forms/species (from Ob River to Amur River basins), including C. l. pidschian n. jucagiricus, have a tandem arrangement of two identical nucleotide fragments and very similar nucleotide composition of the ITS1 region. Based on contemporary data, this phylogenetic lineage of the C. pidschian complex could be considered a young postglacial allopatric species.


Sign in / Sign up

Export Citation Format

Share Document