scholarly journals Multi-Level DBSCAN: A Hierarchical Density-Based Clustering Method for Analyzing Molecular Dynamics Simulation Trajectories

2021 ◽  
Author(s):  
Song Liu ◽  
Siqin Cao ◽  
Michael Alexander SUAREZ VASQUEZ ◽  
Eshani C Goonetillek ◽  
Xuhui Huang

Molecular Dynamic (MD) simulations have been extensively used as a powerful tool to investigate dynamics of biological molecules in recent decades. Generally, MD simulations generate high-dimensional data that is very hard to visualize and comprehend. As a result, clustering algorithms have been commonly used to reduce the dimensionality of MD data with the key benefit being their ability to reduce the dimensionality of MD data without prior knowledge of structural details or dynamic mechanisms. In this paper, we propose a new algorithm, the Multi-Level Density-Based Spatial Clustering of Applications with Noise (ML-DBSCAN), which combines the clustering results at different resolution of density levels to obtain the hierarchical structure of the free energy landscape and the metastable state assignment. At relatively low resolutions, the ML-DBSCAN can efficiently detect high population regions that contain all metastable states, while at higher resolutions, the ML-DBSCAN can find all metastable states and structural details of the free energy landscape. We demonstrate the powerfulness of the ML-DBSCAN in generating metastable states with a particle moving in a Mexican hat-like potential, and four peptide and protein examples are used to demonstrate how hierarchical structures of free energy landscapes can be found. Furthermore, we developed a GPU implementation of the ML-DBSCAN, which allows the algorithm to handle larger MD datasets and be up to two orders of magnitude faster than the CPU implementation. We demonstrate the power of the ML-DBSCAN on MD simulation datasets of five systems: a 2D-potential, alanine dipeptide, β-hairpin Tryptophan Zipper 2 (Trpzip2), Human Islet Amyloid Polypeptide (hIAPP), and Maltose Binding Protein (MBP). Our code is available at https://github.com/liusong299/ML-DBSCAN.

2018 ◽  
Author(s):  
Navjeet Ahalawat ◽  
Jagannath Mondal

Collective variables (CV), when chosen judiciously, can play an important role in recognizing rate-limiting processes and rare events in any biomolecular systems. However, high dimensionality and inherent complexities associated with such biochemical systems render the identification of an optimal CV a challenging task, which in turn precludes the elucidation of underlying conformational landscape in sufficient details. In this context, a relevant model system is presented by 16residue, β hairpin of GB1 protein. Despite being the target of numerous theoretical and computational studies for understanding the protein folding, the set of CVs optimally characterizing the conformational landscape of, β hairpin of GB1 protein has remained elusive, resulting in a lack of consensus on its folding mechanism. Here we address this by proposing a pair of optimal CVs which can resolve the underlying free energy landscape of GB1 hairpin quite efficiently. Expressed as a linear combination of a number of traditional CVs, the optimal CV for this system is derived by employing recently introduced Timestructured Independent Component Analysis (TICA) approach on a large number of independent unbiased simulations. By projecting the replica-exchange simulated trajectories along these pair of optimized CVs, the resulting free energy landscape of this system are able to resolve four distinct wellseparated metastable states encompassing the extensive ensembles of folded,unfolded and molten globule states. Importantly, the optimized CVs were found to be capable of automatically recovering a novel partial helical state of this protein, without needing to explicitly invoke helicity as a constituent CV. Furthermore, a quantitative sensitivity analysis of each constituent in the optimized CV provided key insights on the relative contributions of the constituent CVs in the overall free energy landscapes. Finally, the kinetic pathways con necting these metastable states, constructed using a Markov State Model, provide an optimum description of underlying folding mechanism of the peptide. Taken together, this work oers a quantitatively robust approach towards comprehensive mapping of the underlying folding landscape of a quintessential model system along its optimized collective variables.


2007 ◽  
Vol 3 ◽  
pp. 757-766
Author(s):  
Masakazu Sekijima ◽  
Jun Doi ◽  
Shinya Honda ◽  
Tamotsu Noguchi ◽  
Shigenori Shimizu ◽  
...  

2021 ◽  
Author(s):  
Hangjin Jiang ◽  
Xuhui Huang ◽  
Han Li ◽  
Wing H Wong ◽  
Xiaodan Fan

Deciphering the free energy landscape of biomolecular structure space is crucial for understanding many complex molecular processes, such as protein-protein interaction, RNA folding, and protein folding. A major source of current dynamic structure data is Molecular Dynamics (MD) simulations. Several methods have been proposed to investigate the free energy landscape from MD data, but all of them rely on the assumption that kinetic similarity is associated with global geometric similarity, which may lead to unsatisfactory results. In this paper, we proposed a new method called Conditional Angle Partition Tree to reveal the hierarchical free energy landscape by correlating local geometric similarity with kinetic similarity. Its application on the benchmark alanine dipeptide MD data showed a much better performance than existing methods in exploring and understanding the free energy landscape. We also applied it to the MD data of Villin HP35. Our results are more reasonable on various aspects than those from other methods and very informative on the hierarchical structure of its energy landscape.


RSC Advances ◽  
2017 ◽  
Vol 7 (46) ◽  
pp. 28580-28590 ◽  
Author(s):  
Peng Sang ◽  
Xing Du ◽  
Li-Quan Yang ◽  
Zhao-Hui Meng ◽  
Shu-Qun Liu

The physicochemical bases for enzyme cold-adaptation remain elusive.


2017 ◽  
Vol 19 (2) ◽  
pp. 1257-1267 ◽  
Author(s):  
Qiang Shao ◽  
Zhijian Xu ◽  
Jinan Wang ◽  
Jiye Shi ◽  
Weiliang Zhu

A combination of a homology modeling technique and an enhanced sampling molecular dynamics simulation implemented using the SITS method is employed to compute a detailed map of the free-energy landscape and explore the conformational transition pathway of B-RAF kinase.


Sign in / Sign up

Export Citation Format

Share Document