Mind reading of the proteins: Deep-learning to forecast molecular dynamics

AbstractMolecular dynamics (MD) simulations have emerged to become the back-bone of today’s computational biophysics. Simulation tools such as, NAMD, AMBER and GROMACS have accumulated more than 100,000 users. Despite this remarkable success, now also bolstered by compatibility with graphics processor units (GPUs) and exascale computers, even the most scalable simulations cannot access biologically relevant timescales - the number of numerical integration steps necessary for solving differential equations in a million-to-billion-dimensional space is computationally in-tractable. Recent advancements in Deep Learning has made it such that patterns can be found in high dimensional data. In addition, Deep Learning have also been used for simulating physical dynamics. Here, we utilize LSTMs in order to predict future molecular dynamics from current and previous timesteps, and examine how this physics-guided learning can benefit researchers in computational biophysics. In particular, we test fully connected Feed-forward Neural Networks, Recurrent Neural Networks with LSTM / GRU memory cells with TensorFlow and PyTorch frame-works trained on data from NAMD simulations to predict conformational transitions on two different biological systems. We find that non-equilibrium MD is easier to train and performance improves under the assumption that each atom is independent of all other atoms in the system. Our study represents a case study for high-dimensional data that switches stochastically between fast and slow regimes. Applications of resolving these sets will allow real-world applications in the interpretation of data from Atomic Force Microscopy experiments.

Download Full-text

Neural networks trained with high-dimensional functions approximation data in high-dimensional space

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-211417 ◽

2021 ◽

pp. 1-12

Author(s):

Jian Zheng ◽

Jianfeng Wang ◽

Yanping Chen ◽

Shuping Chen ◽

Jingjin Chen ◽

...

Keyword(s):

Neural Networks ◽

Dimensional Space ◽

Data Distribution ◽

High Dimensional ◽

Sufficient Information ◽

Sufficient Data ◽

High Dimensional Space ◽

Positive Effects ◽

The Neural Networks ◽

Using Data

Neural networks can approximate data because of owning many compact non-linear layers. In high-dimensional space, due to the curse of dimensionality, data distribution becomes sparse, causing that it is difficulty to provide sufficient information. Hence, the task becomes even harder if neural networks approximate data in high-dimensional space. To address this issue, according to the Lipschitz condition, the two deviations, i.e., the deviation of the neural networks trained using high-dimensional functions, and the deviation of high-dimensional functions approximation data, are derived. This purpose of doing this is to improve the ability of approximation high-dimensional space using neural networks. Experimental results show that the neural networks trained using high-dimensional functions outperforms that of using data in the capability of approximation data in high-dimensional space. We find that the neural networks trained using high-dimensional functions more suitable for high-dimensional space than that of using data, so that there is no need to retain sufficient data for neural networks training. Our findings suggests that in high-dimensional space, by tuning hidden layers of neural networks, this is hard to have substantial positive effects on improving precision of approximation data.

Download Full-text

Detecting outlying subspaces for high-dimensional data: the new task, algorithms, and performance

Knowledge and Information Systems ◽

10.1007/s10115-006-0020-z ◽

2006 ◽

Vol 10 (3) ◽

pp. 333-355 ◽

Cited By ~ 78

Author(s):

Ji Zhang ◽

Hai Wang

Keyword(s):

High Dimensional Data ◽

High Dimensional ◽

And Performance

Download Full-text

Effective approximation of high-dimensional space using neural networks

The Journal of Supercomputing ◽

10.1007/s11227-021-04038-2 ◽

2021 ◽

Author(s):

Jian Zheng ◽

Jianfeng Wang ◽

Yanping Chen ◽

Shuping Chen ◽

Jingjin Chen ◽

...

Keyword(s):

Neural Networks ◽

Dimensional Space ◽

High Dimensional ◽

High Dimensional Space ◽

Effective Approximation

Download Full-text

Fuzzy Rule Extraction Using Radial Basis Function Neural Networks in High-Dimensional Data

Intelligent Knowledge-Based Systems ◽

10.1007/978-1-4020-7829-3_44 ◽

2005 ◽

pp. 1616-1655

Author(s):

F. Admiraal-Behloul ◽

Johan H. C. Reiber

Keyword(s):

Neural Networks ◽

Radial Basis Function ◽

Basis Function ◽

High Dimensional Data ◽

Fuzzy Rule ◽

Rule Extraction ◽

High Dimensional ◽

Radial Basis

Download Full-text

Dimension Reduction and Clustering of High Dimensional Data using Auto-Associative Neural Networks

International Journal of Computer Applications ◽

10.5120/12540-9090 ◽

2013 ◽

Vol 72 (11) ◽

pp. 31-37 ◽

Cited By ~ 1

Author(s):

Zalhan MohdZin ◽

Rubiyah Yusof ◽

Ehsan Mesbahi

Keyword(s):

Neural Networks ◽

Dimension Reduction ◽

High Dimensional Data ◽

High Dimensional

Download Full-text

Subspace Clustering of High Dimensional Data Using Differential Evolution

Nature-Inspired Algorithms for Big Data Frameworks - Advances in Computational Intelligence and Robotics ◽

10.4018/978-1-5225-5852-1.ch003 ◽

2019 ◽

pp. 47-74 ◽

Cited By ~ 1

Author(s):

Parul Agarwal ◽

Shikha Mehta

Keyword(s):

Differential Evolution ◽

Distance Measure ◽

Dimensional Space ◽

Clustering Algorithms ◽

High Dimensional Data ◽

Subspace Clustering ◽

High Dimensional ◽

Dbscan Clustering ◽

Evolution Algorithms ◽

Self Adaptive

Subspace clustering approaches cluster high dimensional data in different subspaces. It means grouping the data with different relevant subsets of dimensions. This technique has become very effective as a distance measure becomes ineffective in a high dimensional space. This chapter presents a novel evolutionary approach to a bottom up subspace clustering SUBSPACE_DE which is scalable to high dimensional data. SUBSPACE_DE uses a self-adaptive DBSCAN algorithm to perform clustering in data instances of each attribute and maximal subspaces. Self-adaptive DBSCAN clustering algorithms accept input from differential evolution algorithms. The proposed SUBSPACE_DE algorithm is tested on 14 datasets, both real and synthetic. It is compared with 11 existing subspace clustering algorithms. Evaluation metrics such as F1_Measure and accuracy are used. Performance analysis of the proposed algorithms is considerably better on a success rate ratio ranking in both accuracy and F1_Measure. SUBSPACE_DE also has potential scalability on high dimensional datasets.

Download Full-text

Visualization of big high dimensional data in a three dimensional space

Proceedings of the 3rd IEEE/ACM International Conference on Big Data Computing, Applications and Technologies - BDCAT '16 ◽

10.1145/3006299.3006340 ◽

2016 ◽

Cited By ~ 3

Author(s):

Ying Xie ◽

Pooja Chenna ◽

Jing (Selena) He ◽

Linh Le ◽

Jacey Planteen

Keyword(s):

Dimensional Space ◽

High Dimensional Data ◽

Three Dimensional ◽

High Dimensional ◽

Three Dimensional Space

Download Full-text

Dimensionality Reduction by Weighted Connections between Neighborhoods

Abstract and Applied Analysis ◽

10.1155/2014/928136 ◽

2014 ◽

Vol 2014 ◽

pp. 1-5 ◽

Cited By ~ 1

Author(s):

Fuding Xie ◽

Yutao Fan ◽

Ming Zhou

Keyword(s):

Dimensionality Reduction ◽

Dimensional Space ◽

High Dimensional Data ◽

Reduction Technique ◽

Experimental Results ◽

High Dimensional ◽

Reduced Dimensionality ◽

Dimensionality Reduction Technique ◽

Low Dimensionality ◽

Local Topology

Dimensionality reduction is the transformation of high-dimensional data into a meaningful representation of reduced dimensionality. This paper introduces a dimensionality reduction technique by weighted connections between neighborhoods to improveK-Isomap method, attempting to preserve perfectly the relationships between neighborhoods in the process of dimensionality reduction. The validity of the proposal is tested by three typical examples which are widely employed in the algorithms based on manifold. The experimental results show that the local topology nature of dataset is preserved well while transforming dataset in high-dimensional space into a new dataset in low-dimensionality by the proposed method.

Download Full-text