protein code
Recently Published Documents


TOTAL DOCUMENTS

20
(FIVE YEARS 3)

H-INDEX

6
(FIVE YEARS 0)

2021 ◽  
Author(s):  
Yi Yao Tan

This paper presents a method to find structurally high variance segments of the different conformations of a single protein and uses clusters them using different distance metrics and interpretation of coordinate and angle data presented by three different methods: root mean squared derivation RMSD, t-distributed Stochastic Neighbor Embedding (t-SNE) based map, and dihedral-based clustering. The methods were applied on the human cylin-dependent kinase 2 (CDK2) protein, code P24941 uniprot using a series of python scripts and clustering packages. We test our methods on the data of the CDK2 protein as it is a highly researched protein, with practical applications of clustering in cancer research, crucial in the regulation of the cell-cycle, and has a sizeable amount of experimental data collected on the confirmation structures. While using the distance based root mean squared deviation RMSD provides data of structure to structure dissimilarity between different conformations, a simple RMSD matrix lacks to ability to describe the subsequence-wise in shape and absolute position which could be the main identifying elements for a protein's conformation and state. To make up for this loss we explore an intuitive and more flexible method, able to accept multiple high structural variance segments, which takes coordinate based data, through a series of maps and with the help of t-SNE, and maps each segment as a feature in the clustering matrix. This method, however, would require additional testing on other proteins and modifications to verify its consistency and test its robustness. In the end we explore the pros and cons of the three methods applied on the high structural variance regions. Despite the randomness factor by the t-SNE used in mapping the coordinates to lower dimensions, the coordinate-based approach consistently performed better than the RSMD and dihedral based methods in clustering the three groups of the CDK2 protein kinase. We also found that analyzing only the substructures identified by the high variance detection algorithm consistently provided more distinct clusters with higher multi-class F1 scores.


2021 ◽  
Author(s):  
Felix Van der Jeugt ◽  
Rien Maertens ◽  
Aranka Steyaert ◽  
Pieter Verschaffelt ◽  
Caroline De Tender ◽  
...  

Shotgun metagenomics is now commonplace to gain insights into communities from diverse environments, but fast, memory-friendly, and accurate tools are needed for deep taxonomic analysis of the metagenome data. To meet this need we developed UMGAP, a highly versatile open source command line tool implemented in Rust for taxonomic profiling of shotgun metagenomes. It differs from state-of-the-art tools in its use of protein code regions identified in short reads for robust taxonomic identifications, a broad-spectrum index that can identify both archaea, bacteria, eukaryotes and viruses, a non-monolithic design, and support for interactive visualizations of complex biodiversities.


2020 ◽  
Vol 11 (1) ◽  
Author(s):  
Stephen Rong ◽  
Luke Buerer ◽  
Christy L. Rhine ◽  
Jing Wang ◽  
Kamil J. Cygan ◽  
...  
Keyword(s):  

2017 ◽  
Vol 49 (6) ◽  
pp. 848-855 ◽  
Author(s):  
Rachel Soemedi ◽  
Kamil J Cygan ◽  
Christy L Rhine ◽  
Jing Wang ◽  
Charlston Bulacan ◽  
...  

2014 ◽  
Vol 11 (9) ◽  
pp. 888-889
Author(s):  
Tal Nawy
Keyword(s):  

eLife ◽  
2013 ◽  
Vol 2 ◽  
Author(s):  
Zhitao Hu ◽  
Xia-Jing Tong ◽  
Joshua M Kaplan

Synaptic transmission consists of fast and slow components of neurotransmitter release. Here we show that these components are mediated by distinct exocytic proteins. The Caenorhabditis elegans unc-13 gene is required for SV exocytosis, and encodes long and short isoforms (UNC-13L and S). Fast release was mediated by UNC-13L, whereas slow release required both UNC-13 proteins and was inhibited by Tomosyn. The spatial location of each protein correlated with its effect. Proteins adjacent to the dense projection mediated fast release, while those controlling slow release were more distal or diffuse. Two UNC-13L domains accelerated release. C2A, which binds RIM (a protein associated with calcium channels), anchored UNC-13 at active zones and shortened the latency of release. A calmodulin binding site accelerated release but had little effect on UNC-13’s spatial localization. These results suggest that UNC-13L, UNC-13S, and Tomosyn form a molecular code that dictates the timing of neurotransmitter release.


Sign in / Sign up

Export Citation Format

Share Document