scholarly journals Defensin-like peptides in wheat analyzed by whole-transcriptome sequencing: a focus on structural diversity and role in induced resistance

PeerJ ◽  
2019 ◽  
Vol 7 ◽  
pp. e6125 ◽  
Author(s):  
Tatyana I. Odintsova ◽  
Marina P. Slezina ◽  
Ekaterina A. Istomina ◽  
Tatyana V. Korostyleva ◽  
Artem S. Kasianov ◽  
...  

Antimicrobial peptides (AMPs) are the main components of the plant innate immune system. Defensins represent the most important AMP family involved in defense and non-defense functions. In this work, global RNA sequencing and de novo transcriptome assembly were performed to explore the diversity of defensin-like (DEFL) genes in the wheat Triticum kiharae and to study their role in induced resistance (IR) mediated by the elicitor metabolites of a non-pathogenic strain FS-94 of Fusarium sambucinum. Using a combination of two pipelines for DEFL mining in transcriptome data sets, as many as 143 DEFL genes were identified in T. kiharae, the vast majority of them represent novel genes. According to the number of cysteine residues and the cysteine motif, wheat DEFLs were classified into ten groups. Classical defensins with a characteristic 8-Cys motif assigned to group 1 DEFLs represent the most abundant group comprising 52 family members. DEFLs with a characteristic 4-Cys motif CX{3,5}CX{8,17}CX{4,6}C named group 4 DEFLs previously found only in legumes were discovered in wheat. Within DEFL groups, subgroups of similar sequences originated by duplication events were isolated. Variation among DEFLs within subgroups is due to amino acid substitutions and insertions/deletions of amino acid sequences. To identify IR-related DEFL genes, transcriptional changes in DEFL gene expression during elicitor-mediated IR were monitored. Transcriptional diversity of DEFL genes in wheat seedlings in response to the fungus Fusarium oxysporum, FS-94 elicitors, and the combination of both (elicitors + fungus) was demonstrated, with specific sets of up- and down-regulated DEFL genes. DEFL expression profiling allowed us to gain insight into the mode of action of the elicitors from F. sambucinum. We discovered that the elicitors up-regulated a set of 24 DEFL genes. After challenge inoculation with F. oxysporum, another set of 22 DEFLs showed enhanced expression in IR-displaying seedlings. These DEFLs, in concert with other defense molecules, are suggested to determine enhanced resistance of elicitor-pretreated wheat seedlings. In addition to providing a better understanding of the mode of action of the elicitors from FS-94 in controlling diseases, up-regulated IR-specific DEFL genes represent novel candidates for genetic transformation of plants and development of pathogen-resistant crops.

2021 ◽  
Author(s):  
Chris Papadopoulos ◽  
Isabelle Callebaut ◽  
Jean-Christophe Gelly ◽  
Isabelle Hatin ◽  
Olivier Namy ◽  
...  

The noncoding genome plays an important role in de novo gene birth and in the emergence of genetic novelty. Nevertheless, how noncoding sequences' properties could promote the birth of novel genes and shape the evolution and the structural diversity of proteins remains unclear. Therefore, by combining different bioinformatic approaches, we characterized the fold potential diversity of the amino acid sequences encoded by all intergenic ORFs (Open Reading Frames) of S. cerevisiae with the aim of (i) exploring whether the large structural diversity observed in proteomes is already present in noncoding sequences, and (ii) estimating the potential of the noncoding genome to produce novel protein bricks that can either give rise to novel genes or be integrated into pre-existing proteins, thus participating in protein structure diversity and evolution. We showed that amino acid sequences encoded by most yeast intergenic ORFs contain the elementary building blocks of protein structures. Moreover, they encompass the large structural diversity of canonical proteins with strikingly the majority predicted as foldable. Then, we investigated the early stages of de novo gene birth by identifying intergenic ORFs with a strong translation signal in ribosome profiling experiments and by reconstructing the ancestral sequences of 70 yeast de novo genes. This enabled us to highlight sequence and structural factors determining de novo gene emergence. Finally, we showed a strong correlation between the fold potential of de novo proteins and the one of their ancestral amino acid sequences, reflecting the relationship between the noncoding genome and the protein structure universe.


2021 ◽  
Author(s):  
◽  
Samaneh Azari

<p>De novo peptide sequencing algorithms have been developed for peptide identification in proteomics from tandem mass spectra (MS/MS), which can be used to identify and discover novel peptides and proteins that do not have a database available. Despite improvements in MS instrumentation and de novo sequencing methods, a significant number of CID MS/MS spectra still remain unassigned with the current algorithms, often leading to low confidence of peptide assignments to the spectra. Moreover, current algorithms often fail to construct the completely matched sequences, and produce partial matches. Therefore, identification of full-length peptides remains challenging. Another major challenge is the existence of noise in MS/MS spectra which makes the data highly imbalanced. Also missing peaks, caused by incomplete MS fragmentation makes it more difficult to infer a full-length peptide sequence. In addition, the large search space of all possible amino acid sequences for each spectrum leads to a high false discovery rate. This thesis focuses on improving the performance of current methods by developing new algorithms corresponding to three steps of preprocessing, sequence optimisation and post-processing using machine learning for more comprehensive interrogation of MS/MS datasets. From the machine learning point of view, the three steps can be addressed by solving different tasks such as classification, optimisation, and symbolic regression. Since Evolutionary Algorithms (EAs), as effective global search techniques, have shown promising results in solving these problems, this thesis investigates the capability of EAs in improving the de novo peptide sequencing. In the preprocessing step, this thesis proposes an effective GP-based method for classification of signal and noise peaks in highly imbalanced MS/MS spectra with the purpose of having a positive influence on the reliability of the peptide identification. The results show that the proposed algorithm is the most stable classification method across various noise ratios, outperforming six other benchmark classification algorithms. The experimental results show a significant improvement in high confidence peptide assignments to MS/MS spectra when the data is preprocessed by the proposed GP method. Moreover, the first multi-objective GP approach for classification of peaks in MS/MS data, aiming at maximising the accuracy of the minority class (signal peaks) and the accuracy of the majority class (noise peaks) is also proposed in this thesis. The results show that the multi-objective GP method outperforms the single objective GP algorithm and a popular multi-objective approach in terms of retaining more signal peaks and removing more noise peaks. The multi-objective GP approach significantly improved the reliability of peptide identification. This thesis proposes a GA-based method to solve the complex optimisation task of de novo peptide sequencing, aiming at constructing full-length sequences. The proposed GA method benefits the GA capability of searching a large search space of potential amino acid sequences to find the most likely full-length sequence. The experimental results show that the proposed method outperforms the most commonly used de novo sequencing method at both amino acid level and peptide level. This thesis also proposes a novel method for re-scoring and re-ranking the peptide spectrum matches (PSMs) from the result of de novo peptide sequencing, aiming at minimising the false discovery rate as a post-processing approach. The proposed GP method evolves the computer programs to perform regression and classification simultaneously in order to generate an effective scoring function for finding the correct PSMs from many incorrect ones. The results show that the new GP-based PSM scoring function significantly improves the identification of full-length peptides when it is used to post-process the de novo sequencing results.</p>


Development ◽  
1989 ◽  
Vol 105 (2) ◽  
pp. 279-298
Author(s):  
H. Herrmann ◽  
B. Fouquet ◽  
W.W. Franke

To provide a basis for studies of the expression of genes encoding the diverse kinds of intermediate-filament (IF) proteins during embryogenesis of Xenopus laevis we have isolated and characterized IF protein cDNA clones. Here we report the identification of two types of Xenopus vimentin, Vim1 and Vim4, with their complete amino acid sequences as deduced from the cloned cDNAs, both of which are expressed during early embryogenesis. In addition, we have obtained two further vimentin cDNAs (Vim2 and 3) which are sequence variants of closely related Vim1. The high evolutionary conservation of the amino acid sequences (Vim1: 458 residues; Mr approximately 52,800; Vim4: 463 residues; Mr approximately 53,500) to avian and mammalian vimentin and, to a lesser degree, to desmin from the same and higher vertebrate species, is emphasized, including conserved oligopeptide motifs in their head domains. Using these cDNAs in RNA blot and ribonuclease protection assays of various embryonic stages, we observed a dramatic increase of vimentin RNA at stage 14, in agreement with immunocytochemical results obtained with antibody VIM-3B4. The significance of very weak mRNA signals detected in earlier stages is discussed in relation to negative immunocytochemical results obtained in these stages. The first appearance of vimentin has been localized to a distinct mesenchymal cell layer underlying the neural plate or tube, respectively. The results are discussed in relation to programs of de novo synthesis of other cytoskeletal proteins in amphibian and mammalian development.


Life ◽  
2019 ◽  
Vol 9 (1) ◽  
pp. 8 ◽  
Author(s):  
Michael S. Wang ◽  
Kenric J. Hoegler ◽  
Michael H. Hecht

Life as we know it would not exist without the ability of protein sequences to bind metal ions. Transition metals, in particular, play essential roles in a wide range of structural and catalytic functions. The ubiquitous occurrence of metalloproteins in all organisms leads one to ask whether metal binding is an evolved trait that occurred only rarely in ancestral sequences, or alternatively, whether it is an innate property of amino acid sequences, occurring frequently in unevolved sequence space. To address this question, we studied 52 proteins from a combinatorial library of novel sequences designed to fold into 4-helix bundles. Although these sequences were neither designed nor evolved to bind metals, the majority of them have innate tendencies to bind the transition metals copper, cobalt, and zinc with high nanomolar to low-micromolar affinity.


2003 ◽  
Vol 185 (1) ◽  
pp. 359-370 ◽  
Author(s):  
Wai-Leung Ng ◽  
Krystyna M. Kazmierczak ◽  
Gregory T. Robertson ◽  
Raymond Gilmour ◽  
Malcolm E. Winkler

ABSTRACT The effects of sublethal concentrations of four different classes of translation inhibitors (puromycin, tetracycline, chloramphenicol, and erythromycin) on global transcription patterns of Streptococcus pneumoniae R6 were determined by microarray analyses. Consistent with the general mode of action of these inhibitors, relative transcript levels of genes that encode ribosomal proteins and translation factors or that mediate tRNA charging and amino acid biosynthesis increased or decreased, respectively. Transcription of the heat shock regulon was induced only by puromycin or streptomycin treatment, which lead to truncation or mistranslation, respectively, but not by other antibiotics that block translation, transcription, or amino acid charging of tRNA. In contrast, relative transcript amounts of certain genes involved in transport, cellular processes, energy metabolism, and purine nucleotide (pur) biosynthesis were changed by different translation inhibitors. In particular, transcript amounts from a pur gene cluster and from purine uptake and salvage genes were significantly elevated by several translation inhibitors, but not by antibiotics that target other cellular processes. Northern blotting confirmed increased transcript amounts from part of the pur gene cluster in cells challenged by translation inhibitors and revealed the presence of a 10-kb transcript. Purine metabolism genes were negatively regulated by a homologue of the PurR regulatory protein, and full derepression in a ΔpurR mutant depended on optimal translation. Unexpectedly, hierarchical clustering of the microarray data distinguished among the global transcription patterns caused by antibiotics that inhibit different steps in the translation cycle. Together, these results show that there is extensive control of transcript amounts by translation in S. pneumoniae, especially for de novo purine nucleotide biosynthesis. In addition, these global transcription patterns form a signature that can be used to classify the mode of action and potential mechanism of new translation inhibitors.


2017 ◽  
Vol 61 (4) ◽  
pp. 421-426 ◽  
Author(s):  
Joanna Kołsut ◽  
Paulina Borówka ◽  
Błażej Marciniak ◽  
Ewelina Wójcik ◽  
Arkadiusz Wojtasik ◽  
...  

AbstractIntroduction: Colibacillosis – the most common disease of poultry, is caused mainly by avian pathogenic Escherichia coli (APEC). However, thus far, no pattern to the molecular basis of the pathogenicity of these bacteria has been established beyond dispute. In this study, genomes of APEC were investigated to ascribe importance and explore the distribution of 16 genes recognised as their virulence factors.Material and Methods: A total of 14 pathogenic for poultry E. coli strains were isolated, and their DNA was sequenced, assembled de novo, and annotated. Amino acid sequences from these bacteria and an additional 16 freely available APEC amino acid sequences were analysed with the DIFFIND tool to define their virulence factors.Results: The DIFFIND tool enabled quick, reliable, and convenient assessment of the differences between compared amino acid sequences from bacterial genomes. The presence of 16 protein sequences indicated as pathogenicity factors in poultry resulted in the generation of a heatmap which categorises genomes in terms of the existence and similarity of the analysed protein sequences.Conclusion: The proposed method of detection of virulence factors using the capabilities of the DIFFIND tool may be useful in the analysis of similarities of E. coli and other sequences deriving from bacteria. Phylogenetic analysis resulted in reliable segregation of 30 APEC strains into five main clusters containing various virulence associated genes (VAGs).


2021 ◽  
Vol 118 (11) ◽  
pp. e2017228118
Author(s):  
Christoffer Norn ◽  
Basile I. M. Wicky ◽  
David Juergens ◽  
Sirui Liu ◽  
David Kim ◽  
...  

The protein design problem is to identify an amino acid sequence that folds to a desired structure. Given Anfinsen’s thermodynamic hypothesis of folding, this can be recast as finding an amino acid sequence for which the desired structure is the lowest energy state. As this calculation involves not only all possible amino acid sequences but also, all possible structures, most current approaches focus instead on the more tractable problem of finding the lowest-energy amino acid sequence for the desired structure, often checking by protein structure prediction in a second step that the desired structure is indeed the lowest-energy conformation for the designed sequence, and typically discarding a large fraction of designed sequences for which this is not the case. Here, we show that by backpropagating gradients through the transform-restrained Rosetta (trRosetta) structure prediction network from the desired structure to the input amino acid sequence, we can directly optimize over all possible amino acid sequences and all possible structures in a single calculation. We find that trRosetta calculations, which consider the full conformational landscape, can be more effective than Rosetta single-point energy estimations in predicting folding and stability of de novo designed proteins. We compare sequence design by conformational landscape optimization with the standard energy-based sequence design methodology in Rosetta and show that the former can result in energy landscapes with fewer alternative energy minima. We show further that more funneled energy landscapes can be designed by combining the strengths of the two approaches: the low-resolution trRosetta model serves to disfavor alternative states, and the high-resolution Rosetta model serves to create a deep energy minimum at the design target structure.


2021 ◽  
Vol 12 ◽  
Author(s):  
Jasmin Gattringer ◽  
Olivier Eteme Ndogo ◽  
Bernhard Retzl ◽  
Carina Ebermann ◽  
Christian W. Gruber ◽  
...  

Traditional medicine and the use of herbal remedies are well established in the African health care system. For instance, Violaceae plants are used for antimicrobial or anti-inflammatory applications in folk medicine. This study describes the phytochemical analysis and bioactivity screening of four species of the violet tribe Allexis found in Cameroon. Allexis cauliflora, Allexis obanensis, Allexis batangae and Allexis zygomorpha were evaluated for the expression of circular peptides (cyclotides) by mass spectrometry. The unique cyclic cystine-rich motif was identified in several peptides of all four species. Knowing that members of this peptide family are protease inhibitors, the plant extracts were evaluated for the inhibition of human prolyl oligopeptidase (POP). Since all four species inhibited POP activity, a bioactivity-guided fractionation approach was performed to isolate peptide inhibitors. These novel cyclotides, alca 1 and alca 2 exhibited IC50 values of 8.5 and 4.4 µM, respectively. To obtain their amino acid sequence information, combinatorial enzymatic proteolysis was performed. The proteolytic fragments were evaluated in MS/MS fragmentation experiments and the full-length amino acid sequences were obtained by de novo annotation of fragment ions. In summary, this study identified inhibitors of the human protease POP, which is a drug target for inflammatory or neurodegenerative disorders.


2011 ◽  
pp. 155-174
Author(s):  
Cristiano Chiarabelli ◽  
Cecilia Portela Pallares ◽  
Anna Quintarelli

Author(s):  
Nai-Tong Yu ◽  
Zhi-Ying Cai ◽  
Zhongguo Xiong ◽  
Yan Yang ◽  
Zhi-Xin Liu

An outbreak of a virus-like disease has caused severe damage to noni plants (Morinda citrifolia L.) in Xishuangbanna area of China's southwestern Yunnan province since 2015. The diseased plants displayed typical mosaic symptom with light and dark green patches on leaves. Flexuous filamentous virus particles of about 800 nm in length were observed from the leaf saps by transmission electron microscope. Illumina transcriptomic sequencing further revealed the presence of a potyvirus and its near complete genome was obtained from de novo assembly. The complete genome of 9,659 nts was obtained by Sanger sequencing of eight amplicons generate by RT-PCR and 5&rsquo; and 3&rsquo; RACE. BLASTp analysis of the polyprotein sequence showed that the virus was most closely related to Tobacco vein banding mosaic virus (TVBMV), but these two viruses only shared 50.7% amino acid sequence similarity. Both phylogenetic analyses of the polyprotein and CP amino acid sequences indicated that this virus is a member of genus Potyvirus. However, the low sequence homology with all known potyviruses established this virus as a new species in the genus, tentatively named as Noni mosaic virus (NoMV). Our field surveys showed that 100% of the symptomatic samples and 28.57% of the asymptomatic samples were infected with this novel potyvirus. Aphids collected from diseased leaves were also detected carrying the virus. In summary, our data indicated that a novel species of potyvirus, NoMV, is prevalent in Yunnan, China and is associated with an emerging mosaic disease on M. citrifolia.


Sign in / Sign up

Export Citation Format

Share Document