Construction of a neural network energy function for protein physics

2021
Author(s):
Huan Yang
Zhaoping Xiong
Francesco Zonta

Abstract Classical potentials are widely used to describe protein physics due to their simplicity and accuracy, but they are continuously challenged as real applications become more demanding. Deep neural networks could help generate alternative ways of describing protein physics. Here we propose an unsupervised learning method to derive a neural network energy function for proteins. The energy function is a probability density model learned from a large set of 3D local structures that have been extensively explored by evolution. We tested this model on several applications (assessment of protein structures, protein dynamics, and protein sequence design), showing that the neural network can correctly recognize patterns in protein structures. In other words, the neural network learned some aspects of protein physics from experimental data.
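
To make the core construction concrete, here is a minimal sketch of turning a learned density into an energy, assuming some featurization of 3D local environments. It substitutes a Gaussian mixture (via scikit-learn) for the paper's deep network; all names and data are illustrative stand-ins, not the authors' method.

```python
# A minimal sketch of the core idea: learn a probability density over
# local-structure descriptors and use its negative log-likelihood as an energy.
# The descriptors and data are hypothetical; the paper uses a deep network
# rather than the Gaussian mixture used here for brevity.
import numpy as np
from sklearn.mixture import GaussianMixture

rng = np.random.default_rng(0)
# Placeholder: each row is a feature vector describing one 3D local environment
# (e.g., distances/orientations around a residue), extracted from solved structures.
local_descriptors = rng.normal(size=(5000, 16))

density = GaussianMixture(n_components=32, covariance_type="diag", random_state=0)
density.fit(local_descriptors)

def energy(descriptors: np.ndarray) -> np.ndarray:
    """Energy of each local structure: E(x) = -log p(x)."""
    return -density.score_samples(descriptors)

# Lower total energy -> the structure looks more like the evolved structures
# the model was trained on; useful for ranking candidate conformations.
candidate = rng.normal(size=(10, 16))
print(energy(candidate).sum())
```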

2020
Vol 10 (1)
Author(s):
Fereshteh Mataeimoghadam
M. A. Hakim Newton
Abdollah Dehzangi
Abdul Karim
B. Jayaram
...  

Abstract Protein structure prediction is a grand challenge. Prediction of protein structures via representations based on backbone dihedral angles has recently achieved significant progress along with the ongoing surge of deep neural network (DNN) research in general. However, we observe an overall trend in protein backbone angle prediction research to employ ever more complex neural networks and to feed them ever more features. While additional features might add predictive power, we argue that redundant features can instead clutter the picture, and that more complex networks may then merely be counterbalancing the noise. From the perspectives of artificial intelligence and machine learning, problem representations and solution approaches mutually interact and thus affect performance. We also argue that comparatively simple predictors can be reproduced more easily than complex ones. With these arguments in mind, we present a deep learning method named Simpler Angle Predictor (SAP) that trains simpler DNN models to enhance protein backbone angle prediction. We then empirically show that SAP significantly outperforms existing state-of-the-art methods on well-known benchmark datasets: for some types of angles, the differences are 6–8 in terms of mean absolute error (MAE). The SAP program and its data are available from https://gitlab.com/mahnewton/sap.
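
As an illustration of what a "simpler" angle predictor can look like, the sketch below defines a small feed-forward model that outputs (sin, cos) pairs for the phi and psi angles, plus a periodicity-aware MAE in degrees. The input feature size and architecture are assumptions, not SAP's actual design; see the linked repository for the real program.

```python
# A hedged sketch of a simple backbone-angle predictor in the spirit of SAP:
# a small feed-forward network mapping per-residue features to (sin, cos)
# pairs for phi and psi. Feature names and sizes are assumptions.
import torch
import torch.nn as nn

N_FEATURES = 57          # assumed per-residue input size (e.g., PSSM + one-hot)
model = nn.Sequential(
    nn.Linear(N_FEATURES, 256), nn.ReLU(),
    nn.Linear(256, 256), nn.ReLU(),
    nn.Linear(256, 4),   # sin(phi), cos(phi), sin(psi), cos(psi)
)

def angles_from_output(out: torch.Tensor) -> torch.Tensor:
    """Recover angles in degrees from the (sin, cos) parameterization."""
    phi = torch.atan2(out[:, 0], out[:, 1])
    psi = torch.atan2(out[:, 2], out[:, 3])
    return torch.rad2deg(torch.stack([phi, psi], dim=1))

def mae_degrees(pred: torch.Tensor, true: torch.Tensor) -> torch.Tensor:
    """MAE that respects angle periodicity (wraps differences into [-180, 180))."""
    diff = (pred - true + 180.0) % 360.0 - 180.0
    return diff.abs().mean()
```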


2020
Vol 2020
pp. 1-11
Author(s):
Onesimo Meza-Cruz
Isaac Pilatowsky
Agustín Pérez-Ramírez
Carlos Rivera-Blanco
Youness El Hamzaoui
...  

The aim of this work is to present a model of the heat transfer, desorbed refrigerant, and pressure in the thermochemical reactor of an intermittent solar cooling system, based on backpropagation neural networks and mathematical symmetry groups. To this end, a reactor based on the BaCl2–NH3 reaction was designed and built, and experimental data were collected from it, with barium chloride used as the solid absorbent and ammonia as the refrigerant. The neural network was trained using the Levenberg–Marquardt algorithm. The correlation coefficient between the experimental data and the data simulated by the neural network was r = 0.9957. A sensitivity analysis of the neural network showed that two inputs, the reactor's heating temperature and the sorption time, influence the network's learning by 35% and 20%, respectively. It was also found that applying permutations to the experimental data and using multibase mathematical symmetry groups makes the neural network training algorithm converge faster.
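
The sketch below illustrates Levenberg–Marquardt training of a small feed-forward network using SciPy's least-squares solver (method="lm"). The two inputs and the target are synthetic stand-ins for the reactor measurements, and the architecture is an assumption, not the authors' exact model.

```python
# Fitting a one-hidden-layer network with the Levenberg-Marquardt algorithm
# via SciPy. Inputs/targets are placeholders for the reactor data.
import numpy as np
from scipy.optimize import least_squares

rng = np.random.default_rng(1)
X = rng.uniform(size=(200, 2))           # e.g., [heating temperature, sorption time]
y = np.sin(X[:, 0] * 3) + 0.5 * X[:, 1]  # placeholder target (e.g., desorbed mass)

N_HIDDEN = 5

def unpack(theta):
    w1 = theta[: 2 * N_HIDDEN].reshape(2, N_HIDDEN)
    b1 = theta[2 * N_HIDDEN : 3 * N_HIDDEN]
    w2 = theta[3 * N_HIDDEN : 4 * N_HIDDEN]
    b2 = theta[-1]
    return w1, b1, w2, b2

def residuals(theta):
    w1, b1, w2, b2 = unpack(theta)
    hidden = np.tanh(X @ w1 + b1)
    return hidden @ w2 + b2 - y

theta0 = rng.normal(scale=0.5, size=4 * N_HIDDEN + 1)
fit = least_squares(residuals, theta0, method="lm")   # Levenberg-Marquardt

pred = residuals(fit.x) + y
r = np.corrcoef(pred, y)[0, 1]           # correlation coefficient, as reported
print(f"r = {r:.4f}")
```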


Author(s):  
V. N. Gridin
I. A. Evdokimov
B. R. Salem
V. I. Solodovnikov

The key stages, implementation features, and operating principles of neural networks, including deep neural networks, are analyzed. The problems of choosing the number of hidden elements, selecting the internal topology, and setting parameters are considered. It is shown that during training and validation it is possible to control the capacity of a neural network and to evaluate the qualitative characteristics of the constructed model. The automation of the construction process and the optimization of hyperparameters of neural network structures are considered, depending on the user's tasks and the available source data. A number of approaches based on probabilistic programming, evolutionary algorithms, and recurrent neural networks are presented.
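
As one concrete instance of the approaches surveyed, the sketch below runs a toy evolutionary search over hidden-layer topologies. The fitness function is a placeholder that would, in practice, train each candidate network and return a validation score.

```python
# A toy evolutionary search over network topology (number and width of
# hidden layers). Everything here is illustrative.
import random

random.seed(0)

def fitness(topology: list[int]) -> float:
    """Placeholder: would train a model with these hidden-layer sizes and
    return a validation score. Here, a toy preference for ~2 layers of ~64."""
    return -abs(len(topology) - 2) - sum(abs(w - 64) for w in topology) / 100.0

def mutate(topology: list[int]) -> list[int]:
    t = topology.copy()
    op = random.choice(["widen", "narrow", "add", "drop"])
    if op == "add":
        t.insert(random.randrange(len(t) + 1), random.choice([16, 32, 64, 128]))
    elif op == "drop" and len(t) > 1:
        t.pop(random.randrange(len(t)))
    else:
        i = random.randrange(len(t))
        t[i] = max(8, t[i] * 2 if op == "widen" else t[i] // 2)
    return t

population = [[random.choice([16, 32, 64, 128])] for _ in range(8)]
for generation in range(20):
    population.sort(key=fitness, reverse=True)
    survivors = population[:4]                      # keep the best topologies
    population = survivors + [mutate(random.choice(survivors)) for _ in range(4)]

print("best topology:", max(population, key=fitness))
```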


2020
Vol 161
pp. 01031
Author(s):
Aleksandr Nikiforov
Aleksei Kuchumov
Sergei Terentev
Inessa Karamulina
Iraida Romanova
...  

Based on agroecological and technological trials of grain crop varieties of domestic and foreign breeding, winter triticale in particular, conducted on the experimental field of the Smolensk State Agricultural Academy between 2015 and 2019, we present the methodology and results of processing the experimental data used to construct a neural network model. Neural networks are suited to tasks that are difficult both for conventionally designed computers and for humans: processing large volumes of experimental data, automating image recognition, approximating functions, and forecasting. Working with neural networks involves analyzing the subject area and the weight coefficients of neurons, detecting conflicting samples and outliers, normalizing data, determining the number of samples required to train a network and improving learning quality when that number is insufficient, as well as selecting the network type and performing decomposition based on the number of input neurons. We describe a technology for initial data processing and for selecting the optimal neural network structure that significantly reduces modeling errors compared with networks built on unprepared source data. Our accumulated experience with neural networks has produced encouraging results, which indicates the promise of this area, especially for describing processes with large numbers of variables. To verify the resulting neural network model, we carried out a computational experiment, which demonstrated that the scientific results can be applied in practice.
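
A minimal sketch of two of the preprocessing steps described, outlier removal and normalization, is given below. The data are synthetic stand-ins for the trial measurements, and the 3-sigma rule is an illustrative choice rather than the authors' exact procedure.

```python
# Outlier detection followed by normalization before network training.
# Column contents are hypothetical stand-ins for the agronomic trial variables.
import numpy as np

rng = np.random.default_rng(2)
# Placeholder experimental table: rows = trial plots, columns = measured factors.
data = rng.normal(loc=50.0, scale=10.0, size=(120, 4))
data[3, 1] = 500.0                      # inject an outlier to demonstrate removal

# 1. Drop samples more than 3 standard deviations from the column mean.
z = np.abs((data - data.mean(axis=0)) / data.std(axis=0))
clean = data[(z < 3).all(axis=1)]

# 2. Min-max normalize each column into [0, 1] for the network inputs.
lo, hi = clean.min(axis=0), clean.max(axis=0)
normalized = (clean - lo) / (hi - lo)

print(f"kept {len(clean)}/{len(data)} samples;",
      f"input range: [{normalized.min():.2f}, {normalized.max():.2f}]")
```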


2021
Vol 13 (15)
pp. 2908
Author(s):
Do-Hyung Kim
Guzmán López
Diego Kiedanski
Iyke Maduako
Braulio Ríos
...  

Understanding the biases in Deep Neural Network (DNN) based algorithms is gaining paramount importance due to their increasing application to many real-world problems. A known tendency of DNNs to penalize underrepresented populations could undermine the efficacy of development projects that depend on data produced by DNN-based models. In spite of this, biases in DNNs for Land Use and Land Cover Classification (LULCC) have not been the subject of many studies. In this study, we explore ways to quantify such biases, using the identification of school buildings in Colombia from satellite imagery as an example. We implement a DNN-based model by fine-tuning an existing, pre-trained model for school building identification. The model achieved an overall accuracy of 84%. We then used socioeconomic covariates to analyze possible biases in the learned representation. The retrained deep neural network was used to extract visual features (embeddings) from satellite image tiles. The embeddings were clustered into four subtypes of schools, and the accuracy of the neural network model was assessed for each cluster. The distributions of various socioeconomic covariates across clusters were analyzed to identify links between model accuracy and those covariates. Our results indicate that model accuracy is lowest (57%) where the landscape is predominantly characterized by poverty and remoteness, confirming our original assumption about the heterogeneous performance and biases of Artificial Intelligence (AI) algorithms. Based on our findings, we identify possible sources of bias and present suggestions on how to prepare a balanced training dataset that would result in less biased AI algorithms. The framework used in our study to better understand biases in DNN models would be useful when Machine Learning (ML) techniques are adopted in lieu of ground-based data collection for international development programs. Because such programs aim to address social inequality, ML techniques are only applicable when they are transparent and accountable.
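
The clustering-based audit described above can be sketched as follows; embeddings, labels, and predictions are synthetic placeholders for the fine-tuned model's outputs on satellite tiles.

```python
# Cluster image embeddings into four subtypes, then measure accuracy per
# cluster. All arrays here are synthetic stand-ins for the study's data.
import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(3)
embeddings = rng.normal(size=(1000, 128))   # visual features per image tile
y_true = rng.integers(0, 2, size=1000)      # 1 = school, 0 = not-school
y_pred = np.where(rng.random(1000) < 0.84, y_true, 1 - y_true)  # ~84% accurate

clusters = KMeans(n_clusters=4, n_init=10, random_state=0).fit_predict(embeddings)

for c in range(4):
    mask = clusters == c
    acc = (y_pred[mask] == y_true[mask]).mean()
    print(f"cluster {c}: n={mask.sum():4d}, accuracy={acc:.2%}")
# Cross-tabulating these clusters against socioeconomic covariates (poverty,
# remoteness) is what links the accuracy gaps to potential bias sources.
```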


Author(s):  
Ehsan Sarshari
Philippe Mullhaupt

Scour can cause subsidence of bridge piers and may ultimately lead to the total collapse of these structures. Effective bridge design requires reliable information on the equilibrium depth of local scour. The flow field around bridge piers is so complex that deriving a theoretical model to predict the exact equilibrium depth of local scour appears nearly impossible. Empirical models, on the other hand, depend strongly on local conditions and are usually too conservative. In the present study, artificial neural networks are used to estimate the equilibrium depth of local scour around bridge piers. Assuming this equilibrium depth is a function of five variables, we train a neural network model on experimental data to predict it. Multilayer neural networks trained by backpropagation with different learning rules are investigated and implemented. Different methods of data normalization are addressed, along with the effect of initial weights and the overtraining phenomenon. The results show good agreement between the neural network predictions and the experimental data, compared with the estimates of empirical models.
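
A minimal sketch of the described setup, a five-input multilayer backpropagation network with input normalization and early stopping against overtraining, might look as follows. The five variables named in the comments are common choices in the scour literature and are assumptions here, not necessarily the authors' exact inputs.

```python
# Five scour-related variables -> equilibrium scour depth, with input
# normalization and early stopping to curb overtraining. Data are synthetic.
import numpy as np
from sklearn.neural_network import MLPRegressor
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import MinMaxScaler

rng = np.random.default_rng(4)
# Placeholder rows: [pier diameter, flow depth, flow velocity,
#                    median grain size, critical velocity]
X = rng.uniform(size=(300, 5))
depth = 1.5 * X[:, 0] ** 0.65 * X[:, 1] ** 0.3 + 0.05 * rng.normal(size=300)

model = make_pipeline(
    MinMaxScaler(),                                  # data normalization
    MLPRegressor(hidden_layer_sizes=(10,),
                 early_stopping=True,                # guards against overtraining
                 validation_fraction=0.2,
                 max_iter=5000,
                 random_state=0),
)
model.fit(X, depth)
print("R^2 on training data:", model.score(X, depth))
```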


2021
Vol 118 (43)
pp. e2103091118
Author(s):
Cong Fang
Hangfeng He
Qi Long
Weijie J. Su

In this paper, we introduce the Layer-Peeled Model, a nonconvex, yet analytically tractable, optimization program, in a quest to better understand deep neural networks that are trained for a sufficiently long time. As the name suggests, this model is derived by isolating the topmost layer from the remainder of the neural network, followed by imposing certain constraints separately on the two parts of the network. We demonstrate that the Layer-Peeled Model, albeit simple, inherits many characteristics of well-trained neural networks, thereby offering an effective tool for explaining and predicting common empirical patterns of deep-learning training. First, when working on class-balanced datasets, we prove that any solution to this model forms a simplex equiangular tight frame, which, in part, explains the recently discovered phenomenon of neural collapse [V. Papyan, X. Y. Han, D. L. Donoho, Proc. Natl. Acad. Sci. U.S.A. 117, 24652–24663 (2020)]. More importantly, when moving to the imbalanced case, our analysis of the Layer-Peeled Model reveals a hitherto-unknown phenomenon that we term Minority Collapse, which fundamentally limits the performance of deep-learning models on the minority classes. In addition, we use the Layer-Peeled Model to gain insights into how to mitigate Minority Collapse. Interestingly, this phenomenon is first predicted by the Layer-Peeled Model before being confirmed by our computational experiments.
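
For reference, the Layer-Peeled program can be written compactly. The sketch below follows the paper's setup up to notation, with K classes, n_k examples in class k, last-layer features h_{k,i}, classifier rows w_k, and norm budgets E_W and E_H.

```latex
% Sketch of the Layer-Peeled program (notation paraphrased from the paper):
% minimize the training loss over the peeled top layer W = [w_1, ..., w_K]
% and the freed last-layer features H = {h_{k,i}}, under separate norm budgets.
\[
\begin{aligned}
\min_{W,\,H}\quad & \frac{1}{N}\sum_{k=1}^{K}\sum_{i=1}^{n_k}
  \mathcal{L}\bigl(W h_{k,i},\, y_k\bigr)\\
\text{s.t.}\quad & \frac{1}{K}\sum_{k=1}^{K}\lVert w_k\rVert_2^2 \le E_W,
\qquad
\frac{1}{K}\sum_{k=1}^{K}\frac{1}{n_k}\sum_{i=1}^{n_k}\lVert h_{k,i}\rVert_2^2 \le E_H.
\end{aligned}
\]
```

In the class-balanced case (all n_k equal), minimizers of this program form the simplex equiangular tight frame mentioned above; imbalanced n_k is what produces Minority Collapse.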


Author(s):  
Ezra Ameperosa
Pranav A. Bhounsule

Abstract Current manual practices of replacing bolts on structures are time-consuming and costly, especially when the bolts are numerous. An automated method that can visually detect and localize bolt positions would therefore be highly beneficial. We demonstrate the use of deep neural networks trained with domain randomization for detecting and localizing bolts on a workpiece. In contrast to previous approaches that require training on real images, domain randomization enables all training to take place in simulation. The key idea is to create a wide variety of computer-generated synthetic images by varying the texture, color, camera position and orientation, distractor objects, and noise, and to train the neural network on these images so that it is robust to scene variability and hence gives accurate results when deployed on real images. Using domain randomization, we train two neural networks: a faster region-based convolutional neural network (Faster R-CNN) for detecting bolts and placing bounding boxes, and a regression convolutional neural network for estimating the x- and y-positions of the bolts relative to coordinates fixed to the workpiece. Our results indicate that, in the best case, we can detect bolts with 85% accuracy and can predict the positions of 75% of bolts to within 1.27 cm. The novelty of this work lies in using domain randomization to detect and localize (1) multiple instances of a single object and (2) small objects (0.6 cm × 2.5 cm).
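
The two-network pipeline can be sketched as below using torchvision's Faster R-CNN plus a small regression CNN. Input sizes, the crop step, and the regression head are assumptions, and training on the domain-randomized synthetic images is omitted for brevity.

```python
# A Faster R-CNN that proposes bolt bounding boxes, plus a small regression
# CNN mapping an image crop to the bolt's (x, y) position in workpiece
# coordinates. Uses the torchvision >= 0.13 API.
import torch
import torch.nn as nn
import torchvision

# Detector: 2 classes = background + "bolt".
detector = torchvision.models.detection.fasterrcnn_resnet50_fpn(
    weights=None, num_classes=2
)
detector.eval()

# Regressor: crop around a detected bolt -> (x, y) offset on the workpiece.
regressor = nn.Sequential(
    nn.Conv2d(3, 16, 3, stride=2, padding=1), nn.ReLU(),
    nn.Conv2d(16, 32, 3, stride=2, padding=1), nn.ReLU(),
    nn.AdaptiveAvgPool2d(1), nn.Flatten(),
    nn.Linear(32, 2),                 # predicted x- and y-position (e.g., cm)
)

with torch.no_grad():
    image = torch.rand(3, 480, 640)               # placeholder RGB image
    detections = detector([image])[0]             # dict of boxes, labels, scores
    crop = torch.rand(1, 3, 64, 64)               # placeholder crop of one box
    xy = regressor(crop)
print(detections["boxes"].shape, xy.shape)
```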


2020
Vol 2020
pp. 1-7
Author(s):
Xin Long
XiangRong Zeng
Zongcheng Ben
Dianle Zhou
Maojun Zhang

The increasing sophistication of neural network models in recent years has greatly expanded their memory consumption and computational cost, hindering their deployment on ASICs, FPGAs, and other mobile devices. Compressing and accelerating neural networks is therefore necessary. In this study, we introduce a novel strategy for training low-bit networks whose weights and activations are quantized to several bits, and we address two corresponding fundamental issues. One is approximating activations through low-bit discretization to decrease the network's computational cost and dot-product memory. The other is specifying a weight quantization and update mechanism for discrete weights that avoids gradient mismatch. With quantized low-bit weights and activations, costly full-precision operations can be replaced by shift operations. We evaluate the proposed method on common datasets, and the results show that it can dramatically compress a neural network with only a slight loss of accuracy.
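
The two ingredients, low-bit discretization of activations and a gradient-mismatch-free update rule for discrete weights, are commonly realized with a straight-through estimator (STE). The sketch below shows a uniform STE quantizer; bit-widths and value ranges are illustrative assumptions rather than the paper's exact scheme.

```python
# Uniform low-bit quantization with a straight-through estimator (STE), so
# that discrete rounding does not block gradients (the "gradient mismatch"
# issue the abstract mentions). Ranges and bit-widths are illustrative.
import torch

class QuantizeSTE(torch.autograd.Function):
    @staticmethod
    def forward(ctx, x, bits):
        # Uniform quantization of values assumed to lie in [0, 1].
        levels = 2 ** bits - 1
        return torch.round(x.clamp(0, 1) * levels) / levels

    @staticmethod
    def backward(ctx, grad_output):
        # STE: pass the gradient through as if rounding were the identity.
        return grad_output, None

def quantize(x: torch.Tensor, bits: int = 2) -> torch.Tensor:
    return QuantizeSTE.apply(x, bits)

w = torch.randn(4, requires_grad=True)
out = quantize(torch.sigmoid(w), bits=2).sum()   # squash weights into [0, 1] first
out.backward()
print(w.grad)                                     # nonzero thanks to the STE
```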

