scholarly journals Nucleotide sequence and encoded amino acid sequence of a genomic gene region for a low molecular weight glutenin

1988 ◽  
Vol 16 (23) ◽  
pp. 11376-11376 ◽  
Author(s):  
Ernest G. Pitts ◽  
J. Antoni Rafalski ◽  
Charles Hedgcoth
1989 ◽  
Vol 264 (5) ◽  
pp. 2560-2567
Author(s):  
G Camici ◽  
G Manao ◽  
G Cappugi ◽  
A Modesti ◽  
M Stefani ◽  
...  

FEBS Letters ◽  
1972 ◽  
Vol 22 (1) ◽  
pp. 121-123 ◽  
Author(s):  
Edward C. Franklin ◽  
Mordechai Pras ◽  
Mark Levin ◽  
Blas Frangione

2019 ◽  
Author(s):  
Wenfa Ng

FASTA file format is a common file type for distributing proteome information, especially those obtained from Uniprot. While MATLAB could automatically read fasta files using the built-in function, fastaread, important information such as protein name and organism name remain enmeshed in a character array. Hence, difficulty exists in automatic extraction of protein names from fasta proteome file to help in building a database with fields comprising protein name and its amino acid sequence. The objective of this work was in developing a MATLAB software that could automatically extract protein name and amino acid sequence information from fasta proteome file and assign them to a new database that comprises fields such as protein name, amino acid sequence, number of amino acid residues, molecular weight of protein and nucleotide sequence of protein. Information on number of amino acid residues came from the use of the length built-in function in MATLAB analyzing the length of the amino acid sequence of a protein. The final two fields were provided by MATLAB built-in functions molweight and aa2nt, respectively. Molecular weight of proteins is useful for a variety of applications while nucleotide sequence is essential for gene synthesis applications in molecular cloning. Finally, the MATLAB software is also equipped with an error check function to help detect letters in the amino acid sequence that are not part of the family of 20 natural amino acids. Sequences with such letters would constitute as error inputs to molweight and aa2nt, and would not be processed. Collectively, given that important information such as protein name is enmeshed in a character array in fasta proteome file, this work sets out to develop a MATLAB software that could automatically extract protein name and amino acid sequence information, and assigns them to a new protein database. Using built-in functions, number of amino acid residues, molecular weight and nucleotide sequence of each protein were calculated; thereby, yielding a new protein database with improved functionalities that could support a variety of biology workflows ranging from sequence alignment to molecular cloning.


1976 ◽  
Vol 54 (10) ◽  
pp. 835-842 ◽  
Author(s):  
A. S. Mak ◽  
B. L. Jones

The complete amino acid sequence of β-purothionin, a low molecular weight, very basic, protein isolated from wheat endosperm material, has been determined. β-purothionin is toxic to some bacteria, to yeasts, and to animals when injected. The protein contains 45 amino acid residues and has a molecular weight of 4913. The 8 cysteine and 10 basic residues are distributed throughout the molecule. The primary structure of the protein shows considerable homology to those of the viscotoxins, which are toxic, small, basic proteins found in the leaves and stems of European mistletoe (Viscum album L.).


2019 ◽  
Vol 8 (5) ◽  
pp. 172-177
Author(s):  
Rajeev Kumar ◽  
S. P. Singh ◽  
Mahesh Kumar ◽  
Anil Kumar

Outer membrane of Gram-negative bacteria has complex profile of proteins. The outer membrane proteins (OMPs) isolated from S. typhimurium by urea-EDTA extraction method and analysed through SDS-PAGE showed a complex electrophoretic profile having more than 15 low molecular weight proteins with molecular masses ranging between 3.5 and 43 kDa. The most important outer membrane protein (Omp28) of S. typhimurium with submolecular masses of 12.32kDa of main protein was recovered. The gene responsible for expression of this protein was also amplified through PCR and sequenced, showed 341bp amplicon size and predicted amino acid sequence of this pro-tein was determined. The Antigenic index was calculated from amino acid sequence of same gene and found 2.2 (0.1-2.2) suggesting highly antigenic in nature. The experimentally determined values are close agreement with the theoretically calculated molecular weight 12.32 kDa and pI: 9.61 from the gene sequence of this protein. The antigenic natures of predicted protein values are close agreement with experimental determent of Omp28 of S. typhimurium a possible formula for vaccine developmental of genus Salmonella.


2019 ◽  
Author(s):  
Wenfa Ng

FASTA file format is a common file type for distributing proteome information, especially those obtained from Uniprot. While MATLAB could automatically read fasta files using the built-in function, fastaread, important information such as protein name and organism name remain enmeshed in a character array. Hence, difficulty exists in automatic extraction of protein names from fasta proteome file to help in building a database with fields comprising protein name and its amino acid sequence. The objective of this work was in developing a MATLAB software that could automatically extract protein name and amino acid sequence information from fasta proteome file and assign them to a new database that comprises fields such as protein name, amino acid sequence, number of amino acid residues, molecular weight of protein and nucleotide sequence of protein. Information on number of amino acid residues came from the use of the length built-in function in MATLAB analyzing the length of the amino acid sequence of a protein. The final two fields were provided by MATLAB built-in functions molweight and aa2nt, respectively. Molecular weight of proteins is useful for a variety of applications while nucleotide sequence is essential for gene synthesis applications in molecular cloning. Finally, the MATLAB software is also equipped with an error check function to help detect letters in the amino acid sequence that are not part of the family of 20 natural amino acids. Sequences with such letters would constitute as error inputs to molweight and aa2nt, and would not be processed. Collectively, given that important information such as protein name is enmeshed in a character array in fasta proteome file, this work sets out to develop a MATLAB software that could automatically extract protein name and amino acid sequence information, and assigns them to a new protein database. Using built-in functions, number of amino acid residues, molecular weight and nucleotide sequence of each protein were calculated; thereby, yielding a new protein database with improved functionalities that could support a variety of biology workflows ranging from sequence alignment to molecular cloning.


Sign in / Sign up

Export Citation Format

Share Document