sequence motifs Latest Research Papers

Helical structure motifs made searchable for functional peptide design

Nature Communications ◽

10.1038/s41467-021-27655-0 ◽

2022 ◽

Vol 13 (1) ◽

Author(s):

Cheng-Yu Tsai ◽

Emmanuel Oluwatobi Salawu ◽

Hongchun Li ◽

Guan-Yu Lin ◽

Ting-Yu Kuo ◽

...

Keyword(s):

Secondary Structure ◽

Point Mutations ◽

Protein Secondary Structure ◽

Helical Structure ◽

Peptide Design ◽

Sequence Motifs ◽

Helical Structures ◽

Protein Protein Interaction ◽

Functional Peptides ◽

Functional Peptide

AbstractThe systematic design of functional peptides has technological and therapeutic applications. However, there is a need for pattern-based search engines that help locate desired functional motifs in primary sequences regardless of their evolutionary conservation. Existing databases such as The Protein Secondary Structure database (PSS) no longer serves the community, while the Dictionary of Protein Secondary Structure (DSSP) annotates the secondary structures when tertiary structures of proteins are provided. Here, we extract 1.7 million helices from the PDB and compile them into a database (Therapeutic Peptide Design database; TP-DB) that allows queries of compounded patterns to facilitate the identification of sequence motifs of helical structures. We show how TP-DB helps us identify a known purification-tag-specific antibody that can be repurposed into a diagnostic kit for Helicobacter pylori. We also show how the database can be used to design a new antimicrobial peptide that shows better Candida albicans clearance and lower hemolysis than its template homologs. Finally, we demonstrate how TP-DB can suggest point mutations in helical peptide blockers to prevent a targeted tumorigenic protein-protein interaction. TP-DB is made available at http://dyn.life.nthu.edu.tw/design/.

Molecular Determinants of Selectivity in Disordered Complexes May Shed Light on Specificity in Protein Condensates

Biomolecules ◽

10.3390/biom12010092 ◽

2022 ◽

Vol 12 (1) ◽

pp. 92

Author(s):

Alexander Miguel Monzon ◽

Damiano Piovesan ◽

Monika Fuxreiter

Keyword(s):

High Resolution ◽

Selective Recognition ◽

Conformational Heterogeneity ◽

Structural Studies ◽

Sequence Motifs ◽

Conformational Ensembles ◽

Wide Range ◽

Molecular Determinants ◽

High Degree ◽

Shed Light

Biomolecular condensates challenge the classical concepts of molecular recognition. The variable composition and heterogeneous conformations of liquid-like protein droplets are bottlenecks for high-resolution structural studies. To obtain atomistic insights into the organization of these assemblies, here we have characterized the conformational ensembles of specific disordered complexes, including those of droplet-driving proteins. First, we found that these specific complexes exhibit a high degree of conformational heterogeneity. Second, we found that residues forming contacts at the interface also sample many conformations. Third, we found that different patterns of contacting residues form the specific interface. In addition, we observed a wide range of sequence motifs mediating disordered interactions, including charged, hydrophobic and polar contacts. These results demonstrate that selective recognition can be realized by variable patterns of weakly defined interaction motifs in many different binding configurations. We propose that these principles also play roles in determining the selectivity of biomolecular condensates.

Prediction of Human proteins having interactions with HPV16 proteins using virus and host sequence motifs.

Pakistan Journal of Biochemistry and Biotechnology ◽

10.52700/pjbb.v2i2.54 ◽

2021 ◽

Vol 2 (2) ◽

pp. 107-126

Author(s):

Rabbiah Manzoor Malik ◽

Sahar Fazal ◽

Syed Touqeer Abbas ◽

Aamer Bhatti ◽

Mukhtar Ullah ◽

...

Keyword(s):

Cervical Cancer ◽

Hpv Infection ◽

Cervical Region ◽

Sequence Motifs ◽

Host Proteins ◽

Human Host ◽

Linear Peptide ◽

Peptide Motifs ◽

Human Proteins ◽

E6 And E7

Background: Human Papillomavirus (HPV) infection has been found to be the major cause of cancer of cervical region, in females. Genome of HPV codes for 6 functional proteins E1, E2, E4, E5, E6 and E7. These proteins play different roles in development of HPV infection and its progression towards cervical cancer. The interactions of HPV proteins with human DNA and proteins occurs in the presence of short linear peptide motifs on these proteins, have similar sequence to those found on proteins in human cells. Methods: After identification of human motifs in HPV proteins, by use of ELM resource, their counter domains were found from PROSITE. The proteins of human proteome containing these counter domains were predicted as the proteins having possibility of interactions with HPV proteins. Results: we predicted 9468 human proteins for having interactions with HPV proteins. Our predicted proteins were enriched with the host proteins having possibility of being interacted by HPV proteins. 10% of our predicted proteins were already reported to be affected by one or more HPV proteins. The list of predicated proteins can be utilized to find out the connectivity between the virus HPV and human host. It can also be used to determine the pathways involved in pathogenesis of HPV leading towards the cervical cancer Conclusion: The list of predicated proteins can be utilized to find out the connectivity between the virus HPV and human host. It can also be used to determine the pathways involved in pathogenesis of HPV leading towards the cervical cancer.

Genomic environments scale the activities of diverse core promoters

Genome Research ◽

10.1101/gr.276025.121 ◽

2021 ◽

Author(s):

Clarice K.Y. Hong ◽

Barak A. Cohen

Keyword(s):

Core Promoter ◽

Classical Model ◽

Regulatory Elements ◽

Sequence Motifs ◽

Core Promoters ◽

The Core ◽

Genomic Location ◽

Basal Transcriptional Machinery ◽

Promoter Specificity ◽

Genomic Locations

A classical model of gene regulation is that enhancers provide specificity whereas core promoters provide a modular site for the assembly of the basal transcriptional machinery. However, examples of core promoter specificity have led to an alternate hypothesis in which specificity is achieved by core promoters with different sequence motifs that respond differently to genomic environments containing different enhancers and chromatin landscapes. To distinguish between these models, we measured the activities of hundreds of diverse core promoters in four different genomic locations and, in a complementary experiment, six different core promoters at thousands of locations across the genome. Although genomic locations had large effects on expression, the intrinsic activities of different classes of promoters were preserved across genomic locations, suggesting that core promoters are modular regulatory elements whose activities are independently scaled up or down by different genomic locations. This scaling of promoter activities is nonlinear and depends on the genomic location and the strength of the core promoter. Our results support the classical model of regulation in which diverse core promoter motifs set the intrinsic strengths of core promoters, which are then amplified or dampened by the activities of their genomic environments.

Identification of upstream transcription factor binding sites in orthologous genes using mixed Student's t-test statistics

10.1101/2021.12.20.473422 ◽

2021 ◽

Author(s):

Tinghua Huang ◽

Hong Xiao ◽

Qi Tian ◽

Zhen He ◽

Cheng Yuan ◽

...

Keyword(s):

Transcription Factor ◽

Binding Sites ◽

Messenger Rna ◽

Good Alternative ◽

T Test ◽

Upstream Sequence ◽

Sequence Motifs ◽

Orthologous Genes ◽

Student’S T ◽

Student’S T Test

Background: Transcription factor (TF) regulates the transcription of DNA to messenger RNA by binding to upstream sequence motifs. Identifying the locations of known motifs in whole genomes is computationally intensive. Methodology/Principal Findings: This study presents a computational tool, named "Grit", for screening TF-binding sites (TFBS) by coordinating transcription factors to their promoter sequences in orthologous genes. This tool employs a newly developed mixed Student's t-test statistical method that detects high-scoring conserved and non-conserved binding sites among species. The program performs sequence scanning at a rate of 3.2 Mb/s on a quad-core Amazon server and has been benchmarked by the well-established ChIP-Seq datasets, putting Grit amongst the top-ranked TFBS predictors. It marginally outperforms the well-known transcription factor motif scanning tools, Pscan (4.8%) and FIMO (17.8%), in analyzing well-documented ChIP-Atlas human genome Chip-Seq datasets. Significance: Grit is a good alternative to current available motif scanning tools and is publicly available at http://www.thua45.cn/grit under an academic free license.

Common sequence motifs of nascent chains engage the ribosome surface and trigger factor

Proceedings of the National Academy of Sciences ◽

10.1073/pnas.2103015118 ◽

2021 ◽

Vol 118 (52) ◽

pp. e2103015118

Author(s):

Annika Deckert ◽

Anaïs M. E. Cassaignau ◽

Xiaolin Wang ◽

Tomasz Włodarski ◽

Sammy H. S. Chan ◽

...

Keyword(s):

Surface Interactions ◽

Trigger Factor ◽

Structural Basis ◽

Sequence Motifs ◽

Nascent Chain ◽

Dynamics Simulations ◽

Polypeptide Chains ◽

Common Sequence

In the cell, the conformations of nascent polypeptide chains during translation are modulated by both the ribosome and its associated molecular chaperone, trigger factor. The specific interactions that underlie these modulations, however, are still not known in detail. Here, we combine protein engineering, in-cell and in vitro NMR spectroscopy, and molecular dynamics simulations to explore how proteins interact with the ribosome during their biosynthesis before folding occurs. Our observations of α-synuclein nascent chains in living Escherichia coli cells reveal that ribosome surface interactions dictate the dynamics of emerging disordered polypeptides in the crowded cytosol. We show that specific basic and aromatic motifs drive such interactions and directly compete with trigger factor binding while biasing the direction of the nascent chain during its exit out of the tunnel. These results reveal a structural basis for the functional role of the ribosome as a scaffold with holdase characteristics and explain how handover of the nascent chain to specific auxiliary proteins occurs among a host of other factors in the cytosol.

Sequence conservation, domain architectures, and phylogenetic distribution of the HD-GYP type c-di-GMP phosphodiesterases

Journal of Bacteriology ◽

10.1128/jb.00561-21 ◽

2021 ◽

Author(s):

Michael Y. Galperin ◽

Shan-Ho Chou

Keyword(s):

Protein Interactions ◽

Regulatory Protein ◽

Second Messenger ◽

Distribution Patterns ◽

Distribution Functions ◽

Sequence Conservation ◽

Sequence Motifs ◽

Phylogenetic Distribution ◽

Protein Families ◽

Conserved Sequence

The HD-GYP domain, named after two of its conserved sequence motifs, was first described in 1999 as a specialized version of the widespread HD phosphohydrolase domain that had additional highly conserved amino acid residues. Domain associations of HD-GYP indicated its involvement in bacterial signal transduction and distribution patterns of this domain suggested that it could serve as a hydrolase of the bacterial second messenger c-di-GMP, in addition to or instead of the EAL domain. Subsequent studies confirmed the ability of various HD-GYP domains to hydrolyze c-di-GMP to linear pGpG and/or GMP. Certain HD-GYP-containing proteins hydrolyze another second messenger, cGAMP, and some HD-GYP domains participate in regulatory protein-protein interactions. The recently solved structures of HD-GYP domains from four distinct organisms clarified the mechanisms of c-di-GMP binding and metal-assisted hydrolysis. However, the HD-GYP domain is poorly represented in public domain databases, which causes certain confusion about its phylogenetic distribution, functions, and domain architectures. Here, we present a refined sequence model for the HD-GYP domain and describe the roles of its most conserved residues in metal and/or substrate binding. We also calculate the numbers of HD-GYPs encoded in various genomes and list the most common domain combinations involving HD-GYP, such as the RpfG (REC–HD-GYP), Bd1817 (DUF3391– HD-GYP), and PmGH (GAF–HD-GYP) protein families. We also provide the descriptions of six HD-GYP–associated domains, including four novel integral membrane sensor domains. This work is expected to stimulate studies of diverse HD-GYP-containing proteins, their N-terminal sensor domains and the signals to which they respond. IMPORTANCE The HD-GYP domain forms class II of c-di-GMP phosphodiesterases that control the cellular levels of the universal bacterial second messenger c-di-GMP and therefore affect flagellar and/or twitching motility, cell development, biofilm formation, and, often, virulence. Despite more than 20 years of research, HD-GYP domains are insufficiently characterized; they are often confused with ‘classical’ HD domains that are involved in various housekeeping activities and may participate in signaling, hydrolyzing (p)ppGpp and c-di-AMP. This work provides an updated description of the HD-GYP domain, including its sequence conservation, phylogenetic distribution, domain architectures, and the most widespread HD-GYP-containing protein families. This work shows that HD-GYP domains are widespread in many environmental bacteria and are predominant c-di-GMP hydrolases in many lineages, including clostridia and deltaproteobacteria .

The structural repertoire of Fusarium oxysporum f. sp. lycopersici effectors revealed by experimental and computational studies

10.1101/2021.12.14.472499 ◽

2021 ◽

Author(s):

Daniel S. Yu ◽

Megan A Outram ◽

Ashley Smith ◽

Carl L McCombe ◽

Pravin B Khambalkar ◽

...

Keyword(s):

Fusarium Oxysporum ◽

Structure Prediction ◽

Plant Pathogens ◽

Plant Immunity ◽

Sequence Motifs ◽

Structural Class ◽

X Ray Crystallography ◽

Pathogen Virulence ◽

Plant Infection ◽

Dual Domain

Plant pathogens secrete proteins, known as effectors, that function in the apoplast and inside plant cells to promote virulence. Effectors can also be detected by cell-surface and cytosolic receptors, resulting in the activation of defence pathways and plant immunity. Our understanding of fungal effector function and detection by immunity receptors is limited largely due to high sequence diversity and lack of identifiable sequence motifs precluding prediction of structure or function. Recent studies have demonstrated that fungal effectors can be grouped into structural classes despite significant sequence variation. Using protein x-ray crystallography, we identify a new structural class of effectors hidden within the secreted in xylem (SIX) effectors from Fusarium oxysporum f. sp. lycopersici (Fol). The recognised effectors Avr1 (SIX4) and Avr3 (SIX1) represent the founding members of the Fol dual-domain (FOLD) effector class. Using AlphaFold ab initio protein structure prediction, benchmarked against the experimentally determined structures, we demonstrate SIX6 and SIX13 are FOLD effectors. We show that the conserved N-domain of Avr1 and Avr3 is sufficient for recognition by their corresponding, but structurally-distinct, immunity receptors. Additional structural prediction and comparison indicate that 11 of the 14 SIX effectors group into four structural families. This revealed that genetically linked effectors are related structurally, and we provide direct evidence for a physical association between one divergently-transcribed effector pair. Collectively, these data indicate that Fol secretes groups of structurally-related molecules during plant infection, an observation that has broad implications for our understanding of pathogen virulence and the engineering of plant immunity receptors.

Discovery of an ancient MHC category with both class I and class II features

Proceedings of the National Academy of Sciences ◽

10.1073/pnas.2108104118 ◽

2021 ◽

Vol 118 (51) ◽

pp. e2108104118

Author(s):

Kazuhiko Okamura ◽

Johannes M. Dijkstra ◽

Kentaro Tsukamoto ◽

Unni Grimholt ◽

Geert F. Wiegertjes ◽

...

Keyword(s):

Mhc Class I ◽

Class Ii ◽

Class I ◽

Sequence Motifs ◽

Critical Question ◽

Specific Sequence ◽

Mhc Molecules ◽

Mhc Class I Molecule ◽

Chain Organization ◽

Lymphocyte Populations

Two classes of major histocompatibility complex (MHC) molecules, MHC class I and class II, play important roles in our immune system, presenting antigens to functionally distinct T lymphocyte populations. However, the origin of this essential MHC class divergence is poorly understood. Here, we discovered a category of MHC molecules (W-category) in the most primitive jawed vertebrates, cartilaginous fish, and also in bony fish and tetrapods. W-category, surprisingly, possesses class II–type α- and β-chain organization together with class I–specific sequence motifs for interdomain binding, and the W-category α2 domain shows unprecedented, phylogenetic similarity with β2-microglobulin of class I. Based on the results, we propose a model in which the ancestral MHC class I molecule evolved from class II–type W-category. The discovery of the ancient MHC group, W-category, sheds a light on the long-standing critical question of the MHC class divergence and suggests that class II type came first.

Plastics degradation by hydrolytic enzymes: the Plastics-Active Enzymes Database - PAZy

10.22541/au.163924551.18365494/v1 ◽

2021 ◽

Author(s):

Patrick Buchholz ◽

Hongli Zhang ◽

Pablo Perez-Garcia ◽

Lena-Luisa Nover ◽

Jennifer Chow ◽

...

Keyword(s):

Markov Models ◽

Hydrolytic Enzymes ◽

Ecological Niches ◽

Structural Description ◽

Sequence Motifs ◽

Sequence Alignments ◽

Activity Data ◽

Multiple Sequence ◽

Multiple Sequence Alignments ◽

Open Access Database

Petroleum based plastics are durable and accumulate in all ecological niches. Knowledge on enzymatic degradation is sparse. Today, less than 50 verified plastics-active enzymes are known. First examples of enzymes acting on the polymers polyethylene terephthalate (PET) and polyurethane (PUR) have been reported together with a detailed biochemical and structural description. Further, very few polyamide (PA) oligomer active enzymes are known. In this paper, the current known enzymes acting on the synthetic polymers PET and PUR are briefly summarized, their published activity data were collected and integrated into a comprehensive open access database. The Plastics-Active Enzymes Database (PAZy) represents an inventory of known and experimentally verified plastics-active enzymes. Almost 3000 homologues of PET-active enzymes were identified by profile hidden Markov models. Over 2000 homologues of PUR-active enzymes were identified by BLAST. Based on multiple sequence alignments, conservation analysis identified the most conserved amino acids, and sequence motifs for PET- and PUR-active enzymes were derived.

sequence motifs
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Helical structure motifs made searchable for functional peptide design

Molecular Determinants of Selectivity in Disordered Complexes May Shed Light on Specificity in Protein Condensates

Prediction of Human proteins having interactions with HPV16 proteins using virus and host sequence motifs.

Genomic environments scale the activities of diverse core promoters

Identification of upstream transcription factor binding sites in orthologous genes using mixed Student's t-test statistics

Common sequence motifs of nascent chains engage the ribosome surface and trigger factor

Sequence conservation, domain architectures, and phylogenetic distribution of the HD-GYP type c-di-GMP phosphodiesterases

The structural repertoire of Fusarium oxysporum f. sp. lycopersici effectors revealed by experimental and computational studies

Discovery of an ancient MHC category with both class I and class II features

Plastics degradation by hydrolytic enzymes: the Plastics-Active Enzymes Database - PAZy

Export Citation Format

sequence motifsRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Helical structure motifs made searchable for functional peptide design

Molecular Determinants of Selectivity in Disordered Complexes May Shed Light on Specificity in Protein Condensates

Prediction of Human proteins having interactions with HPV16 proteins using virus and host sequence motifs.

Genomic environments scale the activities of diverse core promoters

Identification of upstream transcription factor binding sites in orthologous genes using mixed Student's t-test statistics

Common sequence motifs of nascent chains engage the ribosome surface and trigger factor

Sequence conservation, domain architectures, and phylogenetic distribution of the HD-GYP type c-di-GMP phosphodiesterases

The structural repertoire of Fusarium oxysporum f. sp. lycopersici effectors revealed by experimental and computational studies

Discovery of an ancient MHC category with both class I and class II features

Plastics degradation by hydrolytic enzymes: the Plastics-Active Enzymes Database - PAZy

sequence motifs
Recently Published Documents