Efficient parallelization of perturbative Monte Carlo QM/MM simulations in heterogeneous platforms

Author(s):  
Sebastião Miranda ◽  
Jonas Feldt ◽  
Frederico Pratas ◽  
Ricardo A Mata ◽  
Nuno Roma ◽  
...  

A novel perturbative Monte Carlo mixed quantum mechanics (QM)/molecular mechanics (MM) approach has recently been developed to simulate molecular systems in complex environments. However, the accuracy required to simulate such complex molecular systems is usually attained at the cost of long execution times. To alleviate this problem, a new parallelization strategy for multi-level Monte Carlo molecular simulations on heterogeneous systems is herein proposed. It simultaneously exploits fine-grained (data-level), coarse-grained (Markov chain-level) and task-grained (pure QM, pure MM and QM/MM procedures) parallelism to ensure efficient execution on heterogeneous systems composed of central processing units and multiple, possibly different, graphics processing units. This is achieved by making use of the OpenCL library, together with appropriate dynamic load-balancing schemes. In the conducted evaluation with real benchmarking data, a speed-up of 56x was observed in the computational bottleneck, resulting in a global speed-up of 38x for the whole simulation and reducing the time of a typical simulation from 80 hours to only 2 hours.
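As an illustration of the chain-level (coarse-grained) parallelism described above, the following is a minimal Python sketch that distributes independent Markov chains over worker processes with dynamic load balancing. The `run_chain` energy model is a hypothetical stand-in for the paper's QM/MM evaluation, not the authors' OpenCL implementation.

```python
import math
import random
from concurrent.futures import ProcessPoolExecutor, as_completed

def run_chain(seed, n_steps=10_000):
    """One independent Metropolis Markov chain on a toy 1-D potential.

    Stand-in for the QM/MM energy evaluation in the paper; here the
    'energy' is simply x**2 so the sketch stays self-contained.
    """
    rng = random.Random(seed)
    x, energy, accepted = 0.0, 0.0, 0
    for _ in range(n_steps):
        trial = x + rng.uniform(-0.5, 0.5)
        trial_energy = trial * trial  # hypothetical energy model
        if trial_energy <= energy or rng.random() < math.exp(energy - trial_energy):
            x, energy, accepted = trial, trial_energy, accepted + 1
    return seed, accepted / n_steps

if __name__ == "__main__":
    # Dynamic load balancing: chains are handed to whichever worker
    # (CPU core, or in the paper's setting a CPU/GPU device) frees up first.
    with ProcessPoolExecutor(max_workers=4) as pool:
        futures = [pool.submit(run_chain, seed) for seed in range(16)]
        for fut in as_completed(futures):
            seed, acc = fut.result()
            print(f"chain {seed}: acceptance ratio {acc:.2f}")
```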

Author(s):  
Zhuliang Yao ◽  
Shijie Cao ◽  
Wencong Xiao ◽  
Chen Zhang ◽  
Lanshun Nie

In trained deep neural networks, unstructured pruning can remove redundant weights to lower storage cost, but it requires customized hardware to speed up practical inference. Another line of work accelerates sparse model inference on general-purpose hardware by adopting coarse-grained sparsity, pruning or regularizing consecutive weights for efficient computation; however, this often sacrifices model accuracy. In this paper, we propose a novel fine-grained sparsity approach, Balanced Sparsity, to achieve high model accuracy efficiently on commodity hardware. Our approach adapts to the high-parallelism properties of GPUs, showing strong potential for sparsity in widely deployed deep learning services. Experimental results show that Balanced Sparsity achieves up to a 3.1x practical speedup for model inference on GPUs, while retaining the same high model accuracy as fine-grained sparsity.
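To make the idea concrete, here is a minimal numpy sketch of balanced pruning as the abstract describes it: each weight row is split into equal-sized blocks and the same number of smallest-magnitude weights is zeroed in every block, so the surviving non-zeros stay evenly distributed for GPU-friendly computation. The function name and block size are illustrative, not taken from the paper.

```python
import numpy as np

def balanced_prune(weights, block_size=4, keep=2):
    """Zero all but the `keep` largest-magnitude weights in each block.

    weights: 2-D array (rows x cols); cols must be divisible by block_size.
    Every block keeps exactly `keep` non-zeros, so sparsity is balanced
    across blocks and rows (the property exploited for GPU parallelism),
    unlike global magnitude pruning.
    """
    rows, cols = weights.shape
    blocks = weights.reshape(rows, cols // block_size, block_size)
    # Rank weights inside each block by magnitude; drop the smallest ones.
    order = np.argsort(np.abs(blocks), axis=-1)
    mask = np.ones_like(blocks, dtype=bool)
    np.put_along_axis(mask, order[..., : block_size - keep], False, axis=-1)
    return (blocks * mask).reshape(rows, cols)

w = np.random.randn(2, 8)
print(balanced_prune(w, block_size=4, keep=2))  # 50% sparsity, 2 non-zeros/block
```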


Author(s):  
Jianyong Chen ◽  
Qiuzhen Lin ◽  
Qingbin Hu

In this paper, a novel clonal algorithm for multiobjective optimization (NCMO) is presented, built on improved search operators: a dynamic mutation probability, a dynamic simulated binary crossover (D-SBX) operator, and a hybrid mutation operator combining Gaussian and polynomial mutations (GP-HM). The main idea behind these operators is to perform a more coarse-grained search at the initial stage in order to speed up convergence toward the Pareto-optimal front. Once the solutions get close to the Pareto-optimal front, a more fine-grained search is performed in order to reduce the gaps between the solutions and the Pareto-optimal front. To this end, a cooling schedule is adopted that gradually reduces the operator parameters to a minimal threshold, keeping a desirable balance between coarse-grained and fine-grained search. By this means, the exploratory capabilities of NCMO are enhanced. Simulation results show that NCMO performs remarkably well when compared with various recently developed state-of-the-art multiobjective optimization algorithms.
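The cooling schedule described above can be pictured with a minimal sketch: an operator parameter (here a mutation probability) starts high for coarse-grained exploration and decays toward a floor as generations progress. The geometric decay form and the constants are illustrative assumptions, not taken from the paper.

```python
def cooled_parameter(generation, max_generations, p_start=0.3, p_min=0.01):
    """Geometric cooling of a search-operator parameter.

    Early generations get a large value (coarse-grained search);
    later generations approach p_min (fine-grained search).
    Illustrative constants; the paper's actual schedule may differ.
    """
    ratio = generation / max_generations  # progress in [0, 1]
    return max(p_min, p_start * (p_min / p_start) ** ratio)

for g in (0, 50, 100):
    print(g, round(cooled_parameter(g, 100), 4))  # 0.3 -> ~0.055 -> 0.01
```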


Author(s):  
K. Liagkouras ◽  
K. Metaxiotis

In this paper, we present a novel Interval-Based Mutation (IBMU) operator. The proposed mutation operator performs a coarse-grained search at the initial stage in order to speed up convergence toward more promising regions of the search landscape; then a more fine-grained search is performed in order to guide the solutions towards the Pareto front. Computational experiments indicate that the proposed mutation operator performs better than conventional approaches on several well-known benchmark problems.
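A minimal sketch of the interval idea, under the assumption (mine, not the paper's) that the mutation step is drawn from an interval whose width shrinks as the run progresses:

```python
import random

def interval_mutation(x, generation, max_generations,
                      lower=0.0, upper=1.0, w_start=0.5, w_min=0.01):
    """Mutate x within an interval that narrows over the run.

    Early on the interval is wide (coarse-grained search); later it is
    narrow (fine-grained search near the Pareto front). The linear
    shrinkage and all constants are illustrative assumptions.
    """
    width = w_min + (w_start - w_min) * (1 - generation / max_generations)
    span = (upper - lower) * width
    return min(upper, max(lower, x + random.uniform(-span, span)))

print(interval_mutation(0.5, generation=0, max_generations=100))   # large steps
print(interval_mutation(0.5, generation=95, max_generations=100))  # small steps
```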


2017 ◽  
Vol 91 (7/8) ◽  
pp. 224-235
Author(s):  
Yvonne Krabbe-Alkemade ◽  
Tom Groot

This paper explores the question of how much detail a cost system needs in order to provide reliable cost information at a reasonable price. In general, fine-grained cost systems with a lot of detail (in product definitions, cost drivers and cost pools) are expected to provide more reliable cost information than coarse-grained cost systems with less detail. This paper takes as an example the DBC cost system that has been developed for the Dutch hospital sector. The fine-grained DBC system, with over 40,000 health care products, appears to outperform lower-grained DRG systems with “only” 15,000 and 6,000 health care products on cost homogeneity and predictive validity. It does so, however, at the cost of a high number of products with measurement and specification errors, caused by a large number of outliers and by a low number of observations in product groups. The cost-effectiveness of the DBC system is not very high: only 3% of all DBC codes explain 80% of total costs, whereas the lower-grained DRG system uses 14% of its codes to explain 80% of total costs. Combined with the high administration cost of the DBC system, it was, from an economic perspective, a sensible idea to replace the fine-grained DBC system with the coarse-grained DOT system.
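The 3%-versus-14% concentration figures quoted above come from a simple Pareto-style calculation; a sketch of it on made-up cost data (the distribution is illustrative, not the DBC data set):

```python
import numpy as np

def codes_needed_for_share(costs, share=0.80):
    """Fraction of product codes needed to explain `share` of total cost.

    costs: 1-D array with one total-cost figure per product code.
    Sorts codes by cost (largest first) and counts how many are needed
    before the running sum reaches the requested share of the total.
    """
    ordered = np.sort(costs)[::-1]
    cumulative = np.cumsum(ordered) / ordered.sum()
    n_needed = int(np.searchsorted(cumulative, share) + 1)
    return n_needed / len(costs)

# Made-up, heavily skewed cost distribution standing in for DBC data.
rng = np.random.default_rng(0)
costs = rng.pareto(1.2, size=40_000)
print(f"{codes_needed_for_share(costs):.1%} of codes explain 80% of costs")
```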


2020 ◽  
Vol 644 ◽  
pp. A151
Author(s):  
Mika Juvela

Context. Radiative transfer (RT) modelling is part of many astrophysical simulations. It is used to make synthetic observations and to assist in the analysis of observations. We concentrate on modelling the radio lines emitted by the interstellar medium, which, in connection with high-resolution models, can be a significant computational challenge. Aims. Our aim is to provide a line RT program that makes good use of multi-core central processing units (CPUs) and graphics processing units (GPUs). Parallelisation is essential to speed up computations and to enable large modelling tasks on personal computers. Methods. The program LOC is based on ray-tracing (i.e. not Monte Carlo) and uses standard accelerated lambda iteration methods for faster convergence. The program works on 1D and 3D grids. The 1D version makes use of symmetries to speed up the RT calculations. The 3D version works with octree grids and, to enable calculations with large models, is optimised for low memory usage. Results. Tests show that LOC results agree with other RT codes to within ∼2%. This is typical of code-to-code differences, which are often related to different interpretations of the model set-up. LOC run times compare favourably, especially with those of Monte Carlo codes. In 1D tests, LOC runs were faster by up to a factor of ∼20 on a GPU than on a single CPU core. In spite of the complex path calculations, a speed-up of up to ∼10 was also observed for 3D models using octree discretisation. GPUs enable calculations on models with hundreds of millions of cells, as are encountered in the context of large-scale simulations of interstellar clouds. Conclusions. LOC shows good performance and accuracy and is able to handle many RT modelling tasks on personal computers. It is written in Python, with only the computing-intensive parts implemented as compiled OpenCL kernels. It can therefore also serve as a platform for further experimentation with alternative RT implementation details.
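The Python-plus-OpenCL structure mentioned in the conclusions typically looks like the following minimal PyOpenCL sketch: the host handles set-up and I/O while a compiled kernel does the per-cell arithmetic on the device. This is a generic example of the pattern, not code from LOC, and the attenuation kernel is a toy stand-in for real RT work.

```python
import numpy as np
import pyopencl as cl

KERNEL_SRC = """
__kernel void attenuate(__global float *intensity,
                        __global const float *tau) {
    // Per-cell work item: apply an optical-depth attenuation factor.
    int i = get_global_id(0);
    intensity[i] *= exp(-tau[i]);
}
"""

ctx = cl.create_some_context()   # picks a CPU or GPU device
queue = cl.CommandQueue(ctx)
program = cl.Program(ctx, KERNEL_SRC).build()

n = 1 << 20
intensity = np.ones(n, dtype=np.float32)
tau = np.random.rand(n).astype(np.float32)

mf = cl.mem_flags
i_buf = cl.Buffer(ctx, mf.READ_WRITE | mf.COPY_HOST_PTR, hostbuf=intensity)
t_buf = cl.Buffer(ctx, mf.READ_ONLY | mf.COPY_HOST_PTR, hostbuf=tau)

program.attenuate(queue, (n,), None, i_buf, t_buf)  # one work item per cell
cl.enqueue_copy(queue, intensity, i_buf)            # results back to host
```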


2021 ◽  
Vol 8 (3) ◽  
pp. 1-18
Author(s):  
James Edwards ◽  
Uzi Vishkin

Boolean satisfiability (SAT) is an important performance-hungry problem with applications in many problem domains. However, most work on parallelizing SAT solvers has focused on coarse-grained, mostly embarrassing, parallelism. Here, we study fine-grained parallelism that can speed up existing sequential SAT solvers, which all happen to be of the so-called Conflict-Directed Clause Learning variety. We show the potential for speedups of up to 382× across a variety of problem instances. We hope that these results will stimulate future research, particularly with respect to a computer architecture open problem we present.
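As a flavour of what fine-grained parallelism means inside a CDCL solver, here is a small numpy sketch that checks the status of every clause at once under a partial assignment. It is a data-parallel CPU analogue for illustration only; the paper's platform and algorithms differ.

```python
import numpy as np

def clause_status(clauses, assignment):
    """Data-parallel status check of all clauses under a partial assignment.

    clauses: (n_clauses, max_len) int array of DIMACS-style literals,
             zero-padded. assignment: (n_vars + 1,) int array with
             1 = true, 0 = false, -1 = unassigned (index 0 unused).
    Returns boolean arrays: satisfied, conflicting, unit.
    """
    real = clauses != 0
    value = assignment[np.abs(clauses)]        # truth value of each variable
    negated = clauses < 0
    assigned = real & (value >= 0)
    literal_true = assigned & (np.where(negated, 1 - value, value) == 1)
    satisfied = literal_true.any(axis=1)
    open_lits = (real & ~assigned).sum(axis=1)  # unassigned literals left
    conflicting = ~satisfied & (open_lits == 0)
    unit = ~satisfied & (open_lits == 1)
    return satisfied, conflicting, unit

clauses = np.array([[1, 2, 0], [-1, 3, 0], [-1, -2, 0]])
assignment = np.array([-1, 1, -1, -1])          # x1 = true, rest open
print(clause_status(clauses, assignment))       # clause 0 satisfied, 1 and 2 unit
```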


2020 ◽  
Author(s):  
Jun Zhang ◽  
Yaokun Lei ◽  
Yi Isaac Yang ◽  
Yi Qin Gao

Molecular simulations are widely applied in the study of chemical and bio-physical systems. However, the accessible timescales of atomistic simulations are limited, and extracting equilibrium properties of systems containing rare events remains challenging. Two distinct strategies are usually adopted in this regard: either sticking to the atomistic level and performing enhanced sampling, or trading details for speed by leveraging coarse-grained models. Although both strategies are promising, either of them, if adopted individually, exhibits severe limitations. In this paper we propose a machine-learning approach to ally both strategies so that simulations on different scales can benefit mutually from their cross-talks: accurate coarse-grained (CG) models can be inferred from the fine-grained (FG) simulations through deep generative learning; in turn, FG simulations can be boosted by the guidance of CG models via deep reinforcement learning. Our method defines a variational and adaptive training objective which allows end-to-end training of parametric molecular models using deep neural networks. Through multiple experiments, we show that our method is efficient and flexible, and performs well on challenging chemical and bio-molecular systems.
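For readers unfamiliar with the FG-to-CG relationship these two scales imply, here is a minimal sketch of the standard first step: mapping fine-grained atom coordinates to coarse-grained beads via mass-weighted centres of mass. This is a textbook operation, not the paper's learned model.

```python
import numpy as np

def coarse_grain(fg_coords, masses, bead_members):
    """Map fine-grained coordinates to coarse-grained bead positions.

    fg_coords: (n_atoms, 3) array of atom positions.
    masses: (n_atoms,) array of atomic masses.
    bead_members: list of index arrays, one per CG bead, giving the
                  atoms that each bead represents (centre of mass).
    """
    beads = np.empty((len(bead_members), 3))
    for b, idx in enumerate(bead_members):
        m = masses[idx]
        beads[b] = (m[:, None] * fg_coords[idx]).sum(axis=0) / m.sum()
    return beads

# Toy example: 4 atoms mapped onto 2 beads of 2 atoms each.
coords = np.random.rand(4, 3)
masses = np.array([12.0, 1.0, 12.0, 1.0])
print(coarse_grain(coords, masses, [np.array([0, 1]), np.array([2, 3])]))
```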


2020 ◽  
Author(s):  
Javier Caceres-Delpiano ◽  
Lee-Ping Wang ◽  
Jonathan W. Essex

Atomistic models provide a detailed representation of molecular systems, but are sometimes inadequate for simulations of large systems over long timescales. Coarse-grained models enable accelerated simulations by reducing the number of degrees of freedom, at the cost of reduced accuracy. New optimisation processes to parameterise these models could improve their quality and range of applicability. We present an automated approach for the optimisation of coarse-grained force fields by reproducing free energy data derived from atomistic molecular simulations. To illustrate the approach, we implemented hydration free energy gradients as a new target for force field optimisation in ForceBalance and applied it successfully to optimise the uncharged side-chains and the protein backbone in the SIRAH coarse-grained protein force field. The optimised parameters closely reproduced the hydration free energies of atomistic models and gave improved agreement with experiment.
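The optimisation target described above amounts to minimising the mismatch between coarse-grained and atomistic hydration free energies over the force-field parameters. A minimal numpy sketch of such a least-squares objective with gradient descent follows; the linear surrogate model and all names are illustrative assumptions, not ForceBalance internals.

```python
import numpy as np

def objective(theta, predict_dG, reference_dG):
    """Sum of squared deviations between model and reference free energies."""
    return np.sum((predict_dG(theta) - reference_dG) ** 2)

def optimise(theta, predict_dG, grad_dG, reference_dG, lr=0.01, steps=500):
    """Gradient descent on the free-energy mismatch.

    grad_dG(theta) returns d(dG)/d(theta) per compound, the analogue of
    the hydration free energy gradients used as optimisation targets.
    """
    for _ in range(steps):
        residual = predict_dG(theta) - reference_dG   # (n_compounds,)
        theta = theta - lr * 2.0 * grad_dG(theta).T @ residual
    return theta

# Toy linear surrogate: dG_i(theta) = F_i . theta, with F fixed.
rng = np.random.default_rng(1)
F = rng.normal(size=(8, 3))              # 8 compounds, 3 parameters
true_theta = np.array([0.5, -1.0, 2.0])
ref = F @ true_theta                     # 'atomistic' reference data
theta = optimise(np.zeros(3), lambda t: F @ t, lambda t: F, ref)
print(np.round(theta, 3))                # recovers ~[0.5, -1.0, 2.0]
```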

