Hadoop Spark Based Hydrogen Bond Analysis Tool (H-BAT) for Molecular Dynamics Simulation Trajectory Data

Mapping Intimacies ◽ 10.26434/chemrxiv.12942305.v1 ◽ 2020

Author(s): Sandeep Surendra Malviya ◽ Ramakrishnan Edyapatti Periyasamy ◽ Vinod Jani ◽ Mallikarjunachari Uppuladinne V N ◽ Ankita Sonawane ◽ ...

Keyword(s): Molecular Dynamics ◽ Hydrogen Bond ◽ Molecular Dynamics Simulation ◽ High Performance ◽ Dynamics Simulation ◽ Mode Analysis ◽ Trajectory Data ◽ Simulation Data ◽ Hydrogen Bond Formation ◽ Spark Framework

Molecular dynamics (MD) is a computational technique that uses Newton's equations of motion to study the dynamics of biomolecules and is commonly used by structural biologists. With the development of advanced simulation techniques and increasing computing power, these simulations generate large amounts of data. Enhanced sampling techniques that can capture rare events are now in use and produce simulation data in the form of multiple trajectories. Analyzing this trajectory data and extracting meaningful information with traditional sequential post-simulation analysis methods is becoming increasingly untenable. MD simulation algorithms that scale on high-performance computing clusters are now available and generate huge amounts of MD data in a short span of time. What is needed is a tool built on an advanced, high-performance analytics platform that can analyze this data faster and more efficiently. The Hadoop Spark framework provides a platform that meets these requirements: it handles large amounts of data in parallel and performs analytics with high scalability. In this study, a tool named H-BAT was developed on the Hadoop Spark platform to calculate hydrogen bonding among all solute-solute, solute-solvent and solvent-solvent molecules in large MD simulation trajectories. Vector geometry is used to calculate the angle and distance between atoms, which are organized as triplets of filtered atoms that take part in hydrogen bond formation. Benchmarking was performed up to a data size of 48 GB and showed linear scalability. Additionally, the tool can handle multiple similar trajectories simultaneously.
Future enhancements of the tool would include other analyses such as normal mode analysis, RMSD, 2DRMSD and water density analysis using the Hadoop Spark framework.
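
The geometric test mentioned in the abstract (an angle and a distance computed for each triplet of atoms) can be illustrated with a small sketch. This is not H-BAT's code: the function names, the donor/hydrogen/acceptor ordering of each triplet, and the cutoffs (3.5 Å donor-acceptor distance, 120° donor-hydrogen-acceptor angle) are assumptions chosen for illustration, since the abstract does not state the criteria used.

```python
import math

def sub(a, b):
    """Component-wise difference a - b of two 3D points."""
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])

def norm(v):
    """Euclidean length of a 3D vector."""
    return math.sqrt(v[0] ** 2 + v[1] ** 2 + v[2] ** 2)

def angle_deg(u, v):
    """Angle between vectors u and v in degrees, via the dot product."""
    cos_t = (u[0] * v[0] + u[1] * v[1] + u[2] * v[2]) / (norm(u) * norm(v))
    cos_t = max(-1.0, min(1.0, cos_t))  # guard against rounding error
    return math.degrees(math.acos(cos_t))

def is_hydrogen_bond(donor, hydrogen, acceptor,
                     max_dist=3.5, min_angle=120.0):
    """Hypothetical criterion (cutoffs are assumptions, not from the paper):
    donor-acceptor distance <= max_dist (angstrom) and the angle at the
    hydrogen between donor and acceptor >= min_angle (degrees)."""
    dist = norm(sub(acceptor, donor))
    angle = angle_deg(sub(donor, hydrogen), sub(acceptor, hydrogen))
    return dist <= max_dist and angle >= min_angle

def count_hbonds(triplets, **cutoffs):
    """Count the (donor, hydrogen, acceptor) triplets in one frame
    that satisfy the criterion above."""
    return sum(1 for d, h, a in triplets if is_hydrogen_bond(d, h, a, **cutoffs))

# One frame with three candidate triplets: linear, bent, and too far apart.
frame = [
    ((0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (2.9, 0.0, 0.0)),
    ((0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (1.0, 2.0, 0.0)),
    ((0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (5.0, 0.0, 0.0)),
]
print(count_hbonds(frame))
```

Because each frame is evaluated independently of the others, a per-frame function like `count_hbonds` is the kind of work that a parallel framework could distribute across frames; how H-BAT actually partitions the trajectory is not described in the abstract.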

ROLE OF THE CONSERVATIVE INTERHELICAL HYDROGEN BOND SER74-TRP158 AT THE CHOLESTEROL BINDING SITE IN THE CONFORMATIONAL STABILITY OF THE β2-ADRENERGIC RECEPTOR: MOLECULAR DYNAMICS SIMULATION

Журнал структурной химии ◽ 10.15372/jsc20170224 ◽ 2017

Keyword(s): Molecular Dynamics ◽ Hydrogen Bond ◽ Molecular Dynamics Simulation ◽ Binding Site ◽ Adrenergic Receptor ◽ Conformational Stability ◽ Dynamics Simulation ◽ Cholesterol Binding

Molecular dynamics simulation data of self-diffusion coefficient for Lennard–Jones chain fluids

Fluid Phase Equilibria ◽ 10.1016/j.fluid.2004.04.007 ◽ 2004 ◽ Vol 221 (1-2) ◽ pp. 25-33

Author(s): R.A. Reis ◽ F.C. Silva ◽ R. Nobrega ◽ J. Vladimir Oliveira ◽ F.W. Tavares

Keyword(s): Molecular Dynamics ◽ Diffusion Coefficient ◽ Molecular Dynamics Simulation ◽ Dynamics Simulation ◽ Simulation Data ◽ Self Diffusion ◽ Lennard Jones ◽ Chain Fluids ◽ Self Diffusion Coefficient

Application of normal mode analysis in molecular dynamics simulation of model alkanes

Chemical Physics ◽ 10.1016/0301-0104(90)90013-y ◽ 1990 ◽ Vol 146 (1-2) ◽ pp. 147-153

Author(s): Gianni Cardini ◽ Vincenzo Schettino

Keyword(s): Molecular Dynamics ◽ Molecular Dynamics Simulation ◽ Normal Mode ◽ Normal Mode Analysis ◽ Dynamics Simulation

Voronoi Polyhedron Investigation of the Short-Range Order in Liquid Bismuth Using Molecular Dynamics Simulation Data

Russian Metallurgy (Metally) ◽ 10.1134/s0036029521080085 ◽ 2021 ◽ Vol 2021 (8) ◽ pp. 987-991

Author(s): B. R. Gelchinskii ◽ A. A. Yuryev ◽ E. M. Zhilina ◽ K. V. Beltyukova

Keyword(s): Molecular Dynamics ◽ Molecular Dynamics Simulation ◽ Short Range ◽ Short Range Order ◽ Range Order ◽ Dynamics Simulation ◽ Simulation Data ◽ Voronoi Polyhedron

An Automatic Classification of Molecular Dynamics Simulation Data into States, and Its Application to the Construction of a Markov State Model

Journal of the Physical Society of Japan ◽ 10.7566/jpsj.87.114802 ◽ 2018 ◽ Vol 87 (11) ◽ pp. 114802

Author(s): Reika Ito ◽ Takashi Yoshidome

Keyword(s): Molecular Dynamics ◽ Molecular Dynamics Simulation ◽ Automatic Classification ◽ State Model ◽ Dynamics Simulation ◽ Simulation Data ◽ Markov State Model

Impact of Mutation on the Structural Stability and the Conformational Landscape of Inhibitor-Resistant TEM β-Lactamase: A High-Performance Molecular Dynamics Simulation Study

The Journal of Physical Chemistry B ◽ 10.1021/acs.jpcb.1c05988 ◽ 2021 ◽ Vol 125 (40) ◽ pp. 11188-11196

Author(s): Sandip K. Mukherjee ◽ Mandira Mukherjee ◽ Padmaja P. Mishra

Keyword(s): Molecular Dynamics ◽ Molecular Dynamics Simulation ◽ Structural Stability ◽ Simulation Study ◽ High Performance ◽ Dynamics Simulation ◽ Conformational Landscape

Bringing Molecular Dynamics Simulation Data into View

Trends in Biochemical Sciences ◽ 10.1016/j.tibs.2019.06.004 ◽ 2019 ◽ Vol 44 (11) ◽ pp. 902-913

Author(s): Peter W. Hildebrand ◽ Alexander S. Rose ◽ Johanna K.S. Tiemann

Keyword(s): Molecular Dynamics ◽ Molecular Dynamics Simulation ◽ Dynamics Simulation ◽ Simulation Data

1SK-01 A class library for developing multi-copy, multi-scale molecular dynamics simulation programs (1SK High Performance Computational Approaches to Biological Functions, The 49th Annual Meeting of the Biophysical Society of Japan)

Seibutsu Butsuri ◽ 10.2142/biophys.51.s8_6 ◽ 2011 ◽ Vol 51 (supplement) ◽ pp. S8-S9

Author(s): Tohru Terada ◽ Yasuhiro Matsunaga ◽ Kei Moritsugu ◽ Akinori Kidera

Keyword(s): Molecular Dynamics ◽ Molecular Dynamics Simulation ◽ Annual Meeting ◽ High Performance ◽ Dynamics Simulation ◽ Biological Functions ◽ Computational Approaches ◽ Multi Scale ◽ Class Library ◽ Biophysical Society

Role of the conservative interhelical hydrogen bond Ser74–Trp158 at the cholesterol binding site in the conformational stability of the β2-adrenergic receptor: Molecular dynamics simulation

Journal of Structural Chemistry ◽ 10.1134/s002247661702024x ◽ 2017 ◽ Vol 58 (2) ◽ pp. 384-391

Author(s): T. V. Bogdan ◽ E. S. Alekseev

Keyword(s): Molecular Dynamics ◽ Hydrogen Bond ◽ Molecular Dynamics Simulation ◽ Binding Site ◽ Adrenergic Receptor ◽ Conformational Stability ◽ Dynamics Simulation ◽ β2 Adrenergic Receptor ◽ Cholesterol Binding
