Developing a mathematical method to search for latent periodicity in protein amino-acid sequences with deletions and insertions

BIOPHYSICS ◽  
2015 ◽  
Vol 60 (6) ◽  
pp. 876-885 ◽  
Author(s):  
E. V. Korotkov ◽  
M. A. Korotkova
1993 ◽  
Vol 4 (3) ◽  
pp. 287-292 ◽  
Author(s):  
D.L. Kauffman ◽  
P.J. Keller ◽  
A. Bennick ◽  
M. Blum

Human proline-rich proteins (PRPs) constitute a complex family of salivary proteins that are encoded by a small number of genes. The primary gene product is cleaved by proteases, thereby giving rise to about 20 secreted proteins. To determine the genes for the secreted PRPs, therefore, it is necessary to obtain sequences of both the secreted proteins and the DNA encoding these proteins. We have sequenced most PRPs from one donor (D.K.) and aligned the protein sequences with available DNA sequences from unrelated individuals. Partial sequence data have now been obtained for an additional PRP from D.K. named II-1. This protein was purified from parotid saliva by gel filtration and ion-exchange chromatography. Peptides were obtained by cleavage with trypsin, clostripain, and N-bromosuccinimide, followed by column chromatography. The peptides were sequenced on a gas-phase protein sequenator. Overlapping peptide sequences were obtained for most of II-1 and aligned with translated DNA sequences. The best fit was obtained with clones containing sequences for the allele PRB4" (Lyons et al., 1988). However, there was not complete identity of the protein amino acid sequence and the DNA-derived sequences, indicating that II-1 is not encoded by PRB4". Other PRPs isolated from D.K. also fail to conform to any DNA structure so far reported. This shows the need to obtain amino acid sequences and corresponding DNA sequences from the same person to assign genes for the PRPs and to determine the location of the postribosomal cleavage points in the primary translation product.


2019 ◽  
Vol 20 (4) ◽  
pp. 306-320 ◽  
Author(s):  
Omar Barukab ◽  
Yaser Daanial Khan ◽  
Sher Afzal Khan ◽  
Kuo-Chen Chou

Background: The amino acid residues, in protein, undergo post-translation modification (PTM) during protein synthesis, a process of chemical and physical change in an amino acid that in turn alters behavioral properties of proteins. Tyrosine sulfation is a ubiquitous posttranslational modification which is known to be associated with regulation of various biological functions and pathological processes. Thus its identification is necessary to understand its mechanism. Experimental determination through site-directed mutagenesis and high throughput mass spectrometry is a costly and time taking process, thus, the reliable computational model is required for identification of sulfotyrosine sites. Methodology: In this paper, we present a computational model for the prediction of the sulfotyrosine sites named iSulfoTyr-PseAAC in which feature vectors are constructed using statistical moments of protein amino acid sequences and various position/composition relative features. These features are incorporated into PseAAC. The model is validated by jackknife, cross-validation, self-consistency and independent testing. Results: Accuracy determined through validation was 93.93% for jackknife test, 95.16% for crossvalidation, 94.3% for self-consistency and 94.3% for independent testing. Conclusion: The proposed model has better performance as compared to the existing predictors, however, the accuracy can be improved further, in future, due to increasing number of sulfotyrosine sites in proteins.


2020 ◽  
Author(s):  
Xu Mengmeng ◽  
Xu Yuejuan ◽  
Chen Sun ◽  
Lu Yanan ◽  
Li Fen ◽  
...  

Abstract Background: Conotruncal heart defects (CTDs) are a group of congenital heart malformations that cause anomalies of cardiac outflow tracts. In the past few decades, many genes related to CTDs have been reported. Serum response factor (SRF) is a ubiquitous nuclear protein that acts as transcription factor, and SRF was found to be a critical factor in heart development and to be strongly expressed in the myocardium of the developing mouse and chicken hearts. The targeted inactivation of SRF during heart development leads to embryonic lethality and myocardial defects in mice. Results: To illustrate the relationship between SRF and human heart defects, we screened SRF mutations in 527 CTD patients, a cross sectional study. Two novel mutations (Mut1: c.821A>G p.G274D, the adenine(A) was mutated to guanine(G) at position 821 of the SRF gene coding sequences (CDS), lead to the Glycine(G) mutated to Asparticacid(D) at position 274 of the SRF protein amino acid sequences; Mut2: c.880G>T p.G294C, the guanine(G) was mutated to thymine (T) at position 880 of the SRF CDS, lead to the Glycine(G) mutated to Cysteine (C) at position 294 of the SRF protein amino acid sequences.) of SRF (NM_003131.2) were identified. Western blotting and real-time PCR showed that there were no obvious differences between the protein expression and mRNA transcription of mutants and wild-type SRF. A dual luciferase reporter assay showed that both SRF mutants (G274D and G294C) impaired SRF transcriptional activity at the SRF promoter and atrial natriuretic factor (ANF) promoter (p<0.05), additionally, the mutants displayed reduced synergism with GATA4. Conclusion: These results suggest that SRF-p.G274D and SRF-p.G294C may have potential pathogenic effects.


Antioxidants ◽  
2019 ◽  
Vol 8 (2) ◽  
pp. 50 ◽  
Author(s):  
Yuichiro Suzuki

Biological oxidation plays important roles in the pathogenesis of various diseases and aging. Carbonylation is one mode of protein oxidation. It has been reported that amino acids that are susceptible to carbonylation are arginine (Arg), proline (Pro), lysine, and threonine residues. The carbonylation product of both Arg and Pro residues is glutamyl semialdehyde. While chemically the oxidation reactions of neither Pro to glutamyl semialdehyde nor Arg to glutamyl semialdehyde are reversible, experimental results from our laboratory suggest that the biological system may drive the reduction of glutamyl semialdehyde to Pro in the protein structure. Further, glutamyl semialdehyde can be oxidized to become glutamic acid (Glu). Therefore, I hypothesize that biological oxidation post-translationally converts Arg to Pro, Arg to Glu, and Pro to Glu within the protein structure. Our mass spectrometry experiments provided evidence that, in human cells, 5–10% of peroxiredoxin 6 protein molecules have Pro-45 replaced by Glu. This concept of protein amino acid conversion challenges the dogma that amino acid sequences are strictly defined by nucleic acid sequences. I propose that, in the biological system, amino acid replacements can occur post-translationally through redox regulation, and protein molecules with non-DNA coding sequences confer functions.


1971 ◽  
Vol 121 (3) ◽  
pp. 453-459 ◽  
Author(s):  
L. R. Croft ◽  
S. G. Waley

The amino acid sequences around the cysteine residues in the lens protein, γ-crystallin, were studied. Fraction II of the γ-crystallin from calf lens (Björk, 1964) was used. The protein was oxidized with performic acid and then hydrolysed with trypsin. Six peptides containing cysteic acid were isolated. One of the peptides contained three residues of cysteic acid and the others contained one residue of cysteic acid. We conclude that there are eight unique residues of cysteic acid in the oxidized protein. Amino acid analysis suggests that there are also eight residues of cysteic acid in the molecule, which thus contains only one polypeptide chain.


PLoS ONE ◽  
2012 ◽  
Vol 7 (11) ◽  
pp. e50039 ◽  
Author(s):  
Kenta Motomura ◽  
Tomohiro Fujita ◽  
Motosuke Tsutsumi ◽  
Satsuki Kikuzato ◽  
Morikazu Nakamura ◽  
...  

1983 ◽  
Vol 213 (1) ◽  
pp. 31-38 ◽  
Author(s):  
N Tamiya ◽  
N Maeda ◽  
H G Cogger

The main neurotoxic components, toxins Hydrophis ornatus a and Hydrophis lapemoides a, were isolated from the venoms of the sea snakes Hydrophis ornatus and Hydrophis lapemoides respectively. The amino acid sequence of toxin Hydrophis ornatus a was deduced to be identical with that of toxin Astrotia stokesii a [Maeda & Tamiya (1978) Biochem. J. 175, 507-517] on the basis of identity of the tryptic peptide ‘map’ and the amino acid composition of each peptide. The amino acid sequence of toxin Hydrophis lapemoides a was determined mainly on the basis of identity of the amino acid compositions, mobilities on paper electrophoresis and migration positions on paper chromatography of the tryptic peptides with those of other sea-snake toxins whose sequences are known. Both toxins Hydrophis ornatus a and Hydrophis lapemoides a consisted of 60 amino acid residues and there were six amino acid replacements between them. The taxonomy of sea snakes in the Hydrophis ornatus complex has long been confused, and the above snakes were originally assigned to taxa that proved to be inconsistent with the relationships indicated by the neurotoxin amino acid sequences obtained. A subsequent re-examination of the specimens revealed an error in the original identifications and demonstrated the value of the protein amino acid sequences in systematic and phylogenetic studies. The isolation procedure and results of amino acid analysis of the tryptic peptides have been deposited as Supplementary Publication SUP 50121 (8 pages) with the British Library Lending Division, Boston Spa, Wetherby, West Yorkshire LS23 7BQ, U.K., from whom copies may be obtained as indicated in Biochem. J. (1983) 209, 5.


Sign in / Sign up

Export Citation Format

Share Document