short protein
Recently Published Documents


TOTAL DOCUMENTS

26
(FIVE YEARS 9)

H-INDEX

10
(FIVE YEARS 1)

Genes ◽  
2021 ◽  
Vol 12 (12) ◽  
pp. 1864
Author(s):  
Jiwon Lee ◽  
Aaron Wacholder ◽  
Anne-Ruxandra Carvunis

Microproteins (<100 amino acids) are receiving increasing recognition as important participants in numerous biological processes, but their evolutionary dynamics are poorly understood. SPAAR is a recently discovered microprotein that regulates muscle regeneration and angiogenesis through interactions with conserved signaling pathways. Interestingly, SPAAR does not belong to any known protein family and has known homologs exclusively among placental mammals. This lack of distant homology could be caused by challenges in homology detection of short sequences, or it could indicate a recent de novo emergence from a noncoding sequence. By integrating syntenic alignments and homology searches, we identify SPAAR orthologs in marsupials and monotremes, establishing that SPAAR has existed at least since the emergence of mammals. SPAAR shows substantial primary sequence divergence but retains a conserved protein structure. In primates, we infer two independent evolutionary events leading to the de novo origination of 5′ elongated isoforms of SPAAR from a noncoding sequence and find evidence of adaptive evolution in this extended region. Thus, SPAAR may be of ancient origin, but it appears to be experiencing continual evolutionary innovation in mammals.


PLoS ONE ◽  
2021 ◽  
Vol 16 (5) ◽  
pp. e0248841
Author(s):  
Denys Bulavka ◽  
Ariel A. Aptekmann ◽  
Nicolás A. Méndez ◽  
Teresa Krick ◽  
Ignacio E. Sánchez

Linear motifs are short protein subsequences that mediate protein interactions. Hundreds of motif classes including thousands of motif instances are known. Our theory estimates how many motif classes remain undiscovered. As commonly done, we describe motif classes as regular expressions specifying motif length and the allowed amino acids at each motif position. We measure motif specificity for a pair of motif classes by quantifying how many motif-discriminating positions prevent a protein subsequence from matching the two classes at once. We derive theorems for the maximal number of motif classes that can simultaneously maintain a certain number of motif-discriminating positions between all pairs of classes in the motif universe, for a given amino acid alphabet. We also calculate the fraction of all protein subsequences that would belong to a motif class if all potential motif classes came into existence. Naturally occurring pairs of motif classes present most often a single motif-discriminating position. This mild specificity maximizes the potential number of coexisting motif classes, the expansion of the motif universe due to amino acid modifications and the fraction of amino acid sequences that code for a motif instance. As a result, thousands of linear motif classes may remain undiscovered.


Author(s):  
Rachel Kolodny ◽  
Sergey Nepomnyachiy ◽  
Dan S Tawfik ◽  
Nir Ben-Tal
Keyword(s):  

Author(s):  
Rachel Kolodny ◽  
Sergey Nepomnyachiy ◽  
Dan S Tawfik ◽  
Nir Ben-Tal

Abstract The vast majority of theoretically possible polypeptide chains do not fold, let alone confer function. Hence, protein evolution from preexisting building blocks has clear potential advantages over ab initio emergence from random sequences. In support of this view, sequence similarities between different proteins is generally indicative of common ancestry, and we collectively refer to such homologous sequences as ‘themes’. At the domain level, sequence homology is routinely detected. However, short themes which are segments, or fragments of intact domains, are particularly interesting because they may provide hints about the emergence of domains, as opposed to divergence of preexisting domains, or their mixing-and-matching to form multi-domain proteins. Here we identified 525 representative short themes, comprising 20-to-80 residues, that are unexpectedly shared between domains considered to have emerged independently. Among these ‘bridging themes’ are ones shared between the most ancient domains, e.g., Rossmann, P-loop NTPase, TIM-barrel, Flavodoxin, and Ferredoxin-like. We elaborate on several particularly interesting cases, where the bridging themes mediate ligand binding. Ligand binding may have contributed to the stability and the plasticity of these building blocks, and to their ability to invade preexisting domains or serve as starting points for completely new domains.


Author(s):  
Rachel Kolodny ◽  
Sergey Nepomnyachiy ◽  
Dan S. Tawfik ◽  
Nir Ben-Tal

AbstractThe vast majority of theoretically possible polypeptide chains do not fold, let alone confer function. Hence, protein evolution from preexisting building blocks has clear potential advantages over ab initio emergence from random sequences. In support of this view, sequence similarities between different proteins is generally indicative of common ancestry, and we collectively refer to such homologous sequences as ‘themes’. At the domain level, sequence homology is routinely detected. However, short themes which are segments, or fragments of intact domains, are particularly interesting because they may provide hints about the emergence of domains, as opposed to divergence of preexisting domains, or their mixing-and-matching to form multi-domain proteins. Here we identified 525 representative short themes, comprising 20-to-80 residues, that are unexpectedly shared between domains considered to have emerged independently. Among these ‘bridging themes’ are ones shared between the most ancient domains, e.g., Rossmann, P-loop NTPase, TIM-barrel, Flavodoxin, and Ferredoxin-like. We elaborate on several particularly interesting cases, where the bridging themes mediate ligand binding. Ligand binding may have contributed to the stability and the plasticity of these building blocks, and to their ability to invade preexisting domains or serve as starting points for completely new domains.


2020 ◽  
Author(s):  
Qiuye Li ◽  
W. Michael Babinchak ◽  
Witold K Surewicz

Amyotrophic lateral sclerosis and several other neurodegenerative diseases are associated with brain deposits of TDP-43 aggregates. Cryo-EM structure of amyloid formed from the entire TDP-43 low complexity domain reveals single protofilament fibrils containing a large (138-residue), tightly packed core with structural features that differ from those previously found for fibrils formed from short protein fragments. The atomic model provides insight into potential structural perturbations caused by phosphorylation and disease-related mutations.


2020 ◽  
Author(s):  
Izarbe Aísa-Marín ◽  
M José López-Iniesta ◽  
Santiago Milla ◽  
Jaume Lillo ◽  
Gemma Navarro ◽  
...  

ABSTRACTMutations in NR2E3 cause retinitis pigmentosa (RP) and enhanced S-cone syndrome (ESCS) in humans. This gene produces a large isoform encoded in 8 exons and a previously unreported shorter isoform of 7 exons, whose function is unknown. We generated two mouse models by targeting exon 8 of Nr2e3 using CRISPR/Cas9-D10A nickase. Allele Δ27 is an in-frame deletion of 27 bp that ablates the dimerization domain, whereas allele ΔE8 (full deletion of exon 8), produces only the short isoform that lacks the dimerization and repressor domains. The Δ27 mutant shows developmental alterations and a non-progressive electrophysiological dysfunction that resembles the ESCS phenotype. The ΔE8 mutant exhibits progressive retinal degeneration, as occurs in human RP patients. Interestingly, the mutant retinas show invaginations similar to fovea-like pits. Our mutants suggest a role of Nr2e3 as a cone-patterning regulator and provide valuable models for studying mechanisms of NR2E3-associated retinal dystrophies and evaluating potential therapies.Abstract FigureHighlights- Nr2e3 mouse models were generated by exon 8 deletion using CRISPR/Cas9 D10A nickase.- New Nr2e3 mRNA retaining intron 7 encodes a short protein expressed in adult retina.- Deletion of 9 aa of the NR2E3 dimerization domain causes enhanced S-cone syndrome.- Deletion of exon 8 produces a phenotype similar to Retinitis Pigmentosa in mouse.


2018 ◽  
Author(s):  
Roman Mylonas ◽  
Ilan Beer ◽  
Christian Iseli ◽  
Chloe Chong ◽  
HuiSong Pak ◽  
...  

AbstractSpliced peptides are short protein fragments spliced together in the proteasome by peptide bond formation. True estimation of the contribution of proteasome-spliced peptides (PSPs) to the global Human Leukocyte Antigen (HLA) ligandome is critical. A recent study suggested that PSPs contribute up to 30% of the HLA ligandome. We performed a thorough reanalysis of the reported results using multiple computational tools and various validation steps and concluded that only a fraction of the proposed PSPs passes the quality filters. To better estimate the actual number of PSPs, we present an alternative workflow. We performed de-novo sequencing of the HLA-peptide spectra and discarded all de-novo sequences found in the UniProt database. We checked whether the remaining de-novo sequences could match spliced peptides from human proteins. The spliced sequences were appended to the UniProt fasta file, which was searched by two search tools at a FDR of 1%. We find that maximally 2-4% of the HLA ligandome could be explained as spliced protein fragments. The majority of these potential PSPs have good peptide-spectrum match properties and are predicted to bind the respective HLA molecules. However, it remains to be shown how many of these potential PSPs actually originate from proteasomal splicing events.


Sign in / Sign up

Export Citation Format

Share Document