fortran implementation Latest Research Papers

Higher-level geophysical modelling

10.5194/egusphere-egu21-2127 ◽

2021 ◽

Author(s):

Roman Nuterman ◽

Dion Häfner ◽

Markus Jochum

Keyword(s):

Machine Learning ◽

Programming Languages ◽

High Performance ◽

Ocean Model ◽

User Friendliness ◽

Model Code ◽

Building Models ◽

Fortran Implementation ◽

High Level ◽

New Generation

<p>Until recently, our pure Python, primitive equation ocean model Veros&#160;<br>has been about 1.5x slower than a corresponding Fortran implementation.&#160;<br>But thanks to a thriving scientific and machine learning library&#160;<br>ecosystem, tremendous speed-ups on GPU, and to a lesser degree CPU, are&#160;<br>within reach. Leveraging Google's JAX library, we find that our Python&#160;<br>model code can reach a 2-5 times higher energy efficiency on GPU&#160;<br>compared to a traditional Fortran model.</p><p>Therefore, we propose a new generation of geophysical models: One that&#160;<br>combines high-level abstractions and user friendliness on one hand, and&#160;<br>that leverages modern developments in high-performance computing and&#160;<br>machine learning research on the other hand.</p><p>We discuss what there is to gain from building models in high-level&#160;<br>programming languages, what we have achieved in Veros, and where we see&#160;<br>the modelling community heading in the future.</p>

Download Full-text

Next-generation geophysical modelling

10.5194/egusphere-egu2020-20536 ◽

2020 ◽

Author(s):

Roman Nuterman ◽

Dion Häfner ◽

Markus Jochum ◽

Brian Vinter

Keyword(s):

Programming Languages ◽

High Performance ◽

Ocean Model ◽

User Friendliness ◽

Model Code ◽

Building Models ◽

Fortran Implementation ◽

High Level ◽

New Generation ◽

Performance Computing

<div>So far, our pure Python, primitive equation ocean model Veros has been</div><div>about 50% slower than a corresponding Fortran implementation. But recent</div><div>benchmarks show that, thanks to a thriving scientific and machine</div><div>learning library ecosystem, tremendous speed-ups on GPU, and to a lesser</div><div>degree CPU, are within reach. On GPU, we find that the same model code</div><div>can reach a 2-5 times higher energy efficiency compared to a traditional</div><div>Fortran model.</div><div>We thus propose a new generation of geophysical models. One that</div><div>combines high-level abstractions and user friendliness on one hand, and</div><div>that leverages modern developments in high-performance computing on the</div><div>other hand.</div><div>We discuss what there is to gain from building models in high-level</div><div>programming languages, what we have achieved, and what the future holds</div><div>for us and the modelling community.</div>

Download Full-text

HOMMEXX 1.0: a performance-portable atmospheric dynamical core for the Energy Exascale Earth System Model

Geoscientific Model Development ◽

10.5194/gmd-12-1423-2019 ◽

2019 ◽

Vol 12 (4) ◽

pp. 1423-1441 ◽

Cited By ~ 5

Author(s):

Luca Bertagna ◽

Michael Deakin ◽

Oksana Guba ◽

Daniel Sunderland ◽

Andrew M. Bradley ◽

...

Keyword(s):

Message Passing ◽

Message Passing Interface ◽

Earth System Model ◽

Parallel Execution ◽

System Model ◽

Earth System ◽

Dynamical Core ◽

Fortran Implementation ◽

Many Core ◽

Intel Xeon

Abstract. We present an architecture-portable and performant implementation of the atmospheric dynamical core (High-Order Methods Modeling Environment, HOMME) of the Energy Exascale Earth System Model (E3SM). The original Fortran implementation is highly performant and scalable on conventional architectures using the Message Passing Interface (MPI) and Open MultiProcessor (OpenMP) programming models. We rewrite the model in C++ and use the Kokkos library to express on-node parallelism in a largely architecture-independent implementation. Kokkos provides an abstraction of a compute node or device, layout-polymorphic multidimensional arrays, and parallel execution constructs. The new implementation achieves the same or better performance on conventional multicore computers and is portable to GPUs. We present performance data for the original and new implementations on multiple platforms, on up to 5400 compute nodes, and study several aspects of the single- and multi-node performance characteristics of the new implementation on conventional CPU (e.g., Intel Xeon), many core CPU (e.g., Intel Xeon Phi Knights Landing), and Nvidia V100 GPU.

Download Full-text

HOMMEXX 1.0: A Performance Portable Atmospheric Dynamical Core for the Energy Exascale Earth System Model

10.5194/gmd-2018-218 ◽

2018 ◽

Author(s):

Luca Bertagna ◽

Michael Deakin ◽

Oksana Guba ◽

Daniel Sunderland ◽

Andrew M. Bradley ◽

...

Keyword(s):

Earth System Model ◽

Parallel Execution ◽

System Model ◽

Xeon Phi ◽

Earth System ◽

Intel Xeon Phi ◽

Dynamical Core ◽

Multidimensional Arrays ◽

Fortran Implementation ◽

Multicore Computers

Abstract. We present an architecture-portable and performant implementation of the atmospheric dynamical core (HOMME) of the Energy Exascale Earth System Model (E3SM). The original Fortran implementation is highly performant and scalable on conventional architectures using MPI and OpenMP. We rewrite the model in C++ and use the Kokkos library to express on-node parallelism in a largely architecture-independent implementation. Kokkos provides an abstraction of a compute node or device, layout-polymorphic multidimensional arrays, and parallel execution constructs. The new implementation achieves the same or better performance on conventional multicore computers and is portable to GPUs. We present performance data for the original and new implementations on multiple platforms, on up to 5400 compute nodes, and study several aspects of the single- and multi-node performance characteristics of the new implementation on conventional CPU, Intel Xeon Phi Knights Landing, and Nvidia V100 GPU.

Download Full-text

A Fortran implementation of isogeometric analysis for thin plate problems with the penalty method

Engineering Computations ◽

10.1108/ec-10-2015-0306 ◽

2016 ◽

Vol 33 (7) ◽

pp. 2149-2164 ◽

Cited By ~ 3

Author(s):

Feng Chang ◽

Weiqiang Wang ◽

Yan Liu ◽

Yanpeng Qu

Keyword(s):

Boundary Conditions ◽

Programming Languages ◽

Thin Plate ◽

Isogeometric Analysis ◽

Penalty Method ◽

Basis Functions ◽

Element Analysis ◽

Content Type ◽

Fortran Implementation ◽

High Level

Purpose As one of the earliest high-level programming languages, Fortran with easy accessibility and computational efficiency is widely used in the engineering field. The purpose of this paper is to present a Fortran implementation of isogeometric analysis (IGA) for thin plate problems. Design/methodology/approach IGA based on non-uniform rational B-splines (NURBS) offers exact geometries and is more accurate than finite element analysis (FEA). Unlike the basis functions in FEA, NURBS basis functions are non-interpolated. Hence, the penalty method is used to enforce boundary conditions. Findings Several thin plate examples based on the Kirchhoff-Love theory were illustrated to demonstrate the accuracy of the implementation in contrast with analytical solutions, and the efficiency was validated in comparison with another open method. Originality/value A Fortran implementation of NURBS-based IGA was developed to solve Kirchhoff-Love plate problems. It easily obtained high-continuity basis functions, which are necessary for Kirchhoff formulation. In comparison with theoretical solutions, the numerical examples demonstrated higher accuracy and faster convergence of the Fortran implementation. The Fortran implementation can well solve the time-consuming problem, and it was validated by the time-consumption comparison with the Matlab implementation. Due to the non-interpolation of NURBS, the penalty method was used to impose boundary conditions. A suggestion of the selection of penalty coefficients was given.

Download Full-text

A Fortran Implementation of Isogeometric Analysis with Superiority in Convergence and Accuracy

Procedia Engineering ◽

10.1016/j.proeng.2015.12.227 ◽

2015 ◽

Vol 130 ◽

pp. 342-354 ◽

Cited By ~ 1

Author(s):

F. Chang ◽

W.Q. Wang ◽

Y. Liu

Keyword(s):

Isogeometric Analysis ◽

Fortran Implementation

Download Full-text

A Coarray Fortran implementation to support data-intensive application development

Cluster Computing ◽

10.1007/s10586-013-0302-7 ◽

2013 ◽

Vol 17 (2) ◽

pp. 569-583 ◽

Cited By ~ 5

Author(s):

Deepak Eachempati ◽

Alan Richardson ◽

Siddhartha Jana ◽

Terrence Liao ◽

Henri Calandra ◽

...

Keyword(s):

Application Development ◽

Data Intensive ◽

Fortran Implementation ◽

Data Intensive Application

Download Full-text

A Coarray Fortran Implementation to Support Data-Intensive Application Development

2012 SC Companion: High Performance Computing, Networking Storage and Analysis ◽

10.1109/sc.companion.2012.106 ◽

2012 ◽

Cited By ~ 1

Author(s):

Deepak Eachempati ◽

Alan Richardson ◽

Terrence Liao ◽

Henri Calandra ◽

Barbara Chapman

Keyword(s):

Application Development ◽

Data Intensive ◽

Fortran Implementation ◽

Data Intensive Application

Download Full-text

AD in Fortran: Implementation via Prepreprocessor

Lecture Notes in Computational Science and Engineering - Recent Advances in Algorithmic Differentiation ◽

10.1007/978-3-642-30023-3_25 ◽

2012 ◽

pp. 273-284

Author(s):

Alexey Radul ◽

Barak A. Pearlmutter ◽

Jeffrey Mark Siskind

Keyword(s):

Fortran Implementation

Download Full-text

Acceleration of a two-dimensional Euler flow solver using commodity graphics hardware

Proceedings of the Institution of Mechanical Engineers Part C Journal of Mechanical Engineering Science ◽

10.1243/09544062jmes813ft ◽

2007 ◽

Vol 221 (12) ◽

pp. 1745-1748 ◽

Cited By ~ 38

Author(s):

T Brandvik ◽

G Pullan

Keyword(s):

Programming Model ◽

Graphics Processing Unit ◽

Graphics Hardware ◽

Test Case ◽

Processing Unit ◽

Two Dimensional ◽

Flow Solver ◽

Fortran Implementation ◽

Euler Solver ◽

Graphics Processing

The implementation of a two-dimensional Euler solver on graphics hardware is described. The graphics processing unit is highly parallelized and uses a programming model that is well suited to flow computation. Results for a transonic turbine cascade test-case are presented. For large grids (106 nodes) a 40 times speed-up compared with a Fortran implementation on a contemporary CPU is observed.

Download Full-text

fortran implementation
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Higher-level geophysical modelling

Next-generation geophysical modelling

HOMMEXX 1.0: a performance-portable atmospheric dynamical core for the Energy Exascale Earth System Model

HOMMEXX 1.0: A Performance Portable Atmospheric Dynamical Core for the Energy Exascale Earth System Model

A Fortran implementation of isogeometric analysis for thin plate problems with the penalty method

A Fortran Implementation of Isogeometric Analysis with Superiority in Convergence and Accuracy

A Coarray Fortran implementation to support data-intensive application development

A Coarray Fortran Implementation to Support Data-Intensive Application Development

AD in Fortran: Implementation via Prepreprocessor

Acceleration of a two-dimensional Euler flow solver using commodity graphics hardware

Export Citation Format

fortran implementationRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Higher-level geophysical modelling

Next-generation geophysical modelling

HOMMEXX 1.0: a performance-portable atmospheric dynamical core for the Energy Exascale Earth System Model

HOMMEXX 1.0: A Performance Portable Atmospheric Dynamical Core for the Energy Exascale Earth System Model

A Fortran implementation of isogeometric analysis for thin plate problems with the penalty method

A Fortran Implementation of Isogeometric Analysis with Superiority in Convergence and Accuracy

A Coarray Fortran implementation to support data-intensive application development

A Coarray Fortran Implementation to Support Data-Intensive Application Development

AD in Fortran: Implementation via Prepreprocessor

Acceleration of a two-dimensional Euler flow solver using commodity graphics hardware

fortran implementation
Recently Published Documents