amino acid repeats
Recently Published Documents


TOTAL DOCUMENTS

86
(FIVE YEARS 13)

H-INDEX

23
(FIVE YEARS 1)

2021 ◽  
Vol 1192 (1) ◽  
pp. 012005
Author(s):  
A H M Gameil ◽  
F Yusof ◽  
A S Azmi ◽  
N I Mohamad Puad

Abstract Collagens are an important and ubiquitous family of proteins. They have many functions in the human body and similarly have found numerous, potent applications in various industries including the manufacture of biomaterials. The ever-increasing demand for collagen has made necessary the exploration of alternative sources such as bacterial collagen-like proteins which have a triple-helical domain of Gly-X-Y amino acid repeats. Detection and quantification of native collagens have been well-established. However, collagen-like proteins differ in their composition and do not have the unique abundance of hydroxyproline and hydroxylysine found in vertebrate collagens. Thus, this poses a problem in the detection and quantification of collagen-like proteins. This paper evaluates reports on the detection and quantification of collagens and collagen-like proteins. A systematic search of the PubMed database was conducted in May 2021, to which five additional papers were added. The 310 unique search results were then subjected to a screening and elimination process, at the end of which 22 papers were included in the study. The findings were summarized and presented in a table that highlights progress in this field. While novel methods have been developed for the detection and quantitation of collagens in general, mainly using enzyme digestion, hybridization, and fluorescence, there is a need for a rapid, one-step method that selectively and sensitively detects and quantitates collagen and collagen-like protein samples with ease.


2021 ◽  
Author(s):  
Alex X Lu ◽  
Amy X Lu ◽  
Iva Pritisanac ◽  
Taraneh Zarin ◽  
Julie D Forman-Kay ◽  
...  

A major challenge to the characterization of intrinsically disordered regions (IDRs), which are widespread in the proteome, but relatively poorly understood, is the identification of molecular features, such as short motifs, amino acid repeats and physicochemical properties that mediate the functions of these regions. Here, we introduce a proteome-scale feature discovery method for IDRs. Our method, which we call "reverse homology", exploits the principle that important functional features are conserved over evolution as a contrastive learning signal for deep learning: given a set of homologous IDRs, the neural network has to correctly choose a randomly held-out homologue from another set of IDRs sampled randomly from the proteome. We pair reverse homology with a simple architecture and interpretation techniques, and show that the network learns conserved features of IDRs that can be interpreted as motifs, repeats, and other features. We also show that our model can be used to produce specific predictions of what residues and regions are most important to the function, providing a computational strategy for designing mutagenesis experiments in uncharacterized IDRs. Our results suggest that feature discovery using neural networks is a promising avenue to gain systematic insight into poorly understood protein sequences.


2021 ◽  
Author(s):  
Basem Al-Shayeb ◽  
Marie C. Schoelmerich ◽  
Jacob West-Roberts ◽  
Luis E. Valentin-Alvarado ◽  
Rohan Sachdeva ◽  
...  

Anaerobic methane oxidation exerts a key control on greenhouse gas emissions, yet factors that modulate the activity of microorganisms performing this function remain little explored. In studying groundwater, sediments, and wetland soil where methane production and oxidation occur, we discovered extraordinarily large, diverse DNA sequences that primarily encode hypothetical proteins. Four curated, complete genomes are linear, up to ~1 Mbp in length and share genome organization, including replicore structure, long inverted terminal repeats, and genome-wide unique perfect tandem direct repeats that are intergenic or generate amino acid repeats. We infer that these are a new type of archaeal extrachromosomal element with a distinct evolutionary origin. Gene sequence similarity, phylogeny, and local divergence of sequence composition indicate that many of their genes were assimilated from methane-oxidizing Methanoperedens archaea. We refer to these elements as "Borgs". We identified at least 19 different Borg types coexisting with Methanoperedens in four distinct ecosystems. Borg genes expand redox and respiratory capacity (e.g., clusters of multiheme cytochromes), ability to respond to changing environmental conditions, and likely augment Methanoperedens capacity for methane oxidation (e.g., methyl coenzyme M reductase). By this process, Borgs could play a previously unrecognized role in controlling greenhouse gas emissions.


2020 ◽  
Vol 0 (0) ◽  
Author(s):  
David Mary Rajathei ◽  
Subbiah Parthasarathy ◽  
Samuel Selvaraj

AbstractAmino acid repeats are found to play important roles in both structures and functions of the proteins. These are commonly found in all kingdoms of life, especially in eukaryotes and a larger fraction of human proteins composed of repeats. Further, the abnormal expansions of shorter repeats cause various diseases to humans. Therefore, the analysis of repeats of the entire human proteome along with functional, mutational and disease information would help to better understand their roles in proteins. To fulfill this need, we developed a web database HPREP (http://bioinfo.bdu.ac.in/hprep) for human proteome repeats using Perl and HTML programming. We identified different categories of well-characterized repeats and domain repeats that are present in the human proteome of UniProtKB/Swiss-Prot by using in-house Perl programming and novel repeats by using the repeat detection T-REKS tool as well as XSTREAM web server. Further, these proteins are annotated with functional, mutational and disease information and grouped according to specific repeat types. The developed database enables the users to search by specific repeat type in order to understand their involvement in proteins. Thus, the HPREP database is expected to be a useful resource to gain better insight regarding the different repeats in human proteome and their biological roles.


2020 ◽  
Author(s):  
Vincent J. Lynch ◽  
Gunter P. Wagner

AbstractAn enduring problem in biology is explaining how the functions of genes originated and how those functions diverge between species. Despite detailed studies on the functional evolution of a few proteins, the molecular mechanisms by which protein functions have evolved are almost entirely unknown. Here we show that a polyalanine tract in the homeodomain transcription factor HoxA11 arose in the stem-lineage of mammals and functions as an autonomous repressor module by physically interacting with the PAH domains of SIN3 proteins. These results suggest that long polyalanine tracts, which are common in transcription factors and often associated with disease, may generally function as repressor domains and can contribute to the diversification of transcription factor functions despite the deleterious consequences of polyalanine tract expansion.Research HighlightsWe show that a polyalanine track in HoxA11 evolved into a repressor domain in mammals through an increase in alanine repeat number, indicating that transcription factors can evolve novel functions despite the potential deleterious consequences associated with amino acid repeats.


2020 ◽  
Vol 14 ◽  
pp. 117793222090643 ◽  
Author(s):  
Sailen Barik

The tricopeptide (amino acid number in the 30s) repeats constitute some of the most common amino acid repeats in proteins of diverse organisms. The most important representatives of this class are the 34-residue and 35-residue repeats, eponymously known as tetratricopeptide repeat (TPR) and pentatricopeptide repeat (PPR), respectively. The unit motif of both consists of a pair of alpha helices. As members of the large, all-helical repeat classes, TPR and PPR share structural similarities, but also play specific roles in protein function. In this study, a comprehensive bioinformatic analysis of the PPR units and the linkers that connect them was conducted. The results suggested the existence of PPR repeats of various formats, as well as smaller, PPR-unrelated repeats. Besides their length, these repeats differed in amino acid arrangements and location of key amino acids. These findings provide a broader and unified perspective of the pentatricopeptide family while raising provocative questions about the assembly and evolution of these domains.


2019 ◽  
Vol 11 (11) ◽  
pp. 3159-3178
Author(s):  
Ilaria Pelassa ◽  
Marica Cibelli ◽  
Veronica Villeri ◽  
Elena Lilliu ◽  
Serena Vaglietti ◽  
...  

Abstract Homopolymeric amino acid repeats (AARs) like polyalanine (polyA) and polyglutamine (polyQ) in some developmental proteins (DPs) regulate certain aspects of organismal morphology and behavior, suggesting an evolutionary role for AARs as developmental “tuning knobs.” It is still unclear, however, whether these are occasional protein-specific phenomena or hints at the existence of a whole AAR-based regulatory system in DPs. Using novel approaches to trace their functional and evolutionary history, we find quantitative evidence supporting a generalized, combinatorial role of AARs in developmental processes with evolutionary implications. We observe nonrandom AAR distributions and combinations in HOX and other DPs, as well as in their interactomes, defining elements of a proteome-wide combinatorial functional code whereby different AARs and their combinations appear preferentially in proteins involved in the development of specific organs/systems. Such functional associations can be either static or display detectable evolutionary dynamics. These findings suggest that progressive changes in AAR occurrence/combination, by altering embryonic development, may have contributed to taxonomic divergence, leaving detectable traces in the evolutionary history of proteomes. Consistent with this hypothesis, we find that the evolutionary trajectories of the 20 AARs in eukaryotic proteomes are highly interrelated and their individual or compound dynamics can sharply mark taxonomic boundaries, or display clock-like trends, carrying overall a strong phylogenetic signal. These findings provide quantitative evidence and an interpretive framework outlining a combinatorial system of AARs whose compound dynamics mark at the same time DP functions and evolutionary transitions.


2019 ◽  
Vol 19 (1) ◽  
Author(s):  
Juan A. Subirana ◽  
Xavier Messeguer

Abstract Background Satellites or tandem repeats are very abundant in many eukaryotic genomes. Occasionally they have been reported to be present in some prokaryotes, but to our knowledge there is no general comparative study on their occurrence. For this reason we present here an overview of the distribution and properties of satellites in a set of representative species. Our results provide novel insights into the evolutionary relationship between eukaryotes, Archaea and Bacteria. Results We have searched all possible satellites present in the NCBI reference group of genomes in Archaea (142 species) and in Bacteria (119 species), detecting 2735 satellites in Archaea and 1067 in Bacteria. We have found that the distribution of satellites is very variable in different organisms. The archaeal Methanosarcina class stands out for the large amount of satellites in their genomes. Satellites from a few species have similar characteristics to those in eukaryotes, but most species have very few satellites: only 21 species in Archaea and 18 in Bacteria have more than 4 satellites/Mb. The distribution of satellites in these species is reminiscent of what is found in eukaryotes, but we find two significant differences: most satellites have a short length and many of them correspond to segments of genes coding for amino acid repeats. Transposition of non-coding satellites throughout the genome occurs rarely: only in the bacteria Leptospira interrogans and the archaea Methanocella conradii we have detected satellite families of transposed satellites with long repeats. Conclusions Our results demonstrate that the presence of satellites in the genome is not an exclusive feature of eukaryotes. We have described a few prokaryotes which do contain satellites. We present a discussion on their eventual evolutionary significance.


2019 ◽  
Author(s):  
Peng He ◽  
Shuyin Wu ◽  
Yanli Jiang ◽  
Lihua Zhang ◽  
Meiju Tang ◽  
...  

Abstract Background: The pentatricopeptide repeat (PPR) gene family, which contains multiple 35-amino acid repeats, constitutes one of the largest gene families in plants. PPR proteins function in organelles to target specific transcripts and are involved in plant development and growth. However, the function of PPR proteins in cotton is still unknown. Results: In this study, we characterized a PPR gene YELLOW-GREEN LEAF (GhYGL1d) that is required for cotton plastid development. The GhYGL1d gene has a DYW domain in C-terminal and is highly express in leaves, localized to the chloroplast fractions. GhYGL1d share high amino acid-sequence homology with AtECB2. In atecb2 mutant, overexpression of GhYGL1d rescued the seedling lethal phenotype and restored the editing of accD and ndhF transcripts. Silencing of GhYGL1d led to the reduction of chlorophyll and phenotypically yellow-green leaves in cotton. Compared with wild type, GhYGL1d-silenced cotton showed significant deformations of thylakoid structures. Furthermore, the transcription levels of plastid-encoded polymerase (PEP) and nuclear-encoded polymerase (NEP) dependent genes were decreased in GhYGL1d-silenced cotton. Conclusions: Our data indicate that GhYGL1d not only contributes to the editing of accD and ndhF genes, but also affects the expression of NEP- and PEP-dependent genes to regulate the development of thylakoids, and therefore regulates leaf variegation in cotton.


2019 ◽  
Author(s):  
Peng He ◽  
Shuyin Wu ◽  
Yanli Jiang ◽  
Lihua Zhang ◽  
Meiju Tang ◽  
...  

Abstract Background: The pentatricopeptide repeat (PPR) gene family, which contains multiple 35-amino acid repeats, constitutes one of the largest gene families in plants. PPR proteins function in organelles to target specific transcripts and are involved in plant development and growth. However, the function of PPR proteins in cotton is still unknown. Results: In this study, we characterized a PPR gene YELLOW-GREEN LEAF (GhYGL1d) that is required for cotton plastid development. The GhYGL1d gene has a DYW domain in C-terminal and is highly express in leaves, localized to the chloroplast fractions. GhYGL1d share high amino acid-sequence homology with AtECB2. In atecb2 mutant, overexpression of GhYGL1d rescued the seedling lethal phenotype and restored the editing of accD and ndhF transcripts. Silencing of GhYGL1d led to the reduction of chlorophyll and phenotypically yellow-green leaves in cotton. Compared with wild type, GhYGL1d-silenced cotton showed significant deformations of thylakoid structures. Furthermore, the transcription levels of plastid-encoded polymerase (PEP) and nuclear-encoded polymerase (NEP) dependent genes were decreased in GhYGL1d-silenced cotton. Conclusions: Our data indicate that GhYGL1d not only contributes to the editing of accD and ndhF genes, but also affects the expression of NEP- and PEP-dependent genes to regulate the development of thylakoids, and therefore regulates leaf variegation in cotton.


Sign in / Sign up

Export Citation Format

Share Document