Large-scale machine learning based on functional networks for biomedical big data with high performance computing platforms

2015 ◽ Vol 11 ◽ pp. 69-81
Author(s): Emad Elsebakhi, Frank Lee, Eric Schendel, Anwar Haque, Nagarajan Kathireason, et al.


2020 ◽ Vol 7 (1)
Author(s): E. A. Huerta, Asad Khan, Edward Davis, Colleen Bushell, William D. Gropp, et al.

Abstract Significant investments to upgrade and construct large-scale scientific facilities demand commensurate investments in R&D to design algorithms and computing approaches to enable scientific and engineering breakthroughs in the big data era. Innovative Artificial Intelligence (AI) applications have powered transformational solutions for big data challenges in industry and technology that now drive a multi-billion-dollar industry, and which play an ever-increasing role in shaping human social patterns. As AI continues to evolve into a computing paradigm endowed with statistical and mathematical rigor, it has become apparent that single-GPU solutions for training, validation, and testing are no longer sufficient for the computational grand challenges brought about by scientific facilities that produce data at a rate and volume that outstrip the computing capabilities of available cyberinfrastructure platforms. This realization has been driving the confluence of AI and high-performance computing (HPC) to reduce time-to-insight and to enable a systematic study of domain-inspired AI architectures and optimization schemes for data-driven discovery. In this article we present a summary of recent developments in this field, and describe specific advances that the authors are spearheading to accelerate and streamline the use of HPC platforms to design and apply accelerated AI algorithms in academia and industry.
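A note on the pattern this abstract alludes to: the step from single-GPU to multi-GPU and multi-node training is, in most frameworks, data parallelism with gradient averaging. The plain-Java sketch below illustrates that pattern with CPU threads standing in for accelerators and a one-parameter model; it is a simplified illustration, not the authors' software, and every name and number in it is invented for the sketch.

import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

/**
 * Illustrative only: data-parallel training with gradient averaging,
 * reduced to CPU threads and a single model parameter.
 */
public class DataParallelSketch {
    public static void main(String[] args) throws Exception {
        int workers = 4;                          // stands in for GPUs or nodes
        int n = 10_000;
        double[] x = new double[n];
        double[] y = new double[n];
        for (int i = 0; i < n; i++) {             // synthetic data: y = 3x
            x[i] = i / 1000.0;
            y[i] = 3.0 * x[i];
        }

        double w = 0.0;                           // shared model parameter
        double lr = 0.01;                         // learning rate
        int shard = n / workers;
        ExecutorService pool = Executors.newFixedThreadPool(workers);

        for (int epoch = 0; epoch < 100; epoch++) {
            final double wNow = w;
            // Each worker computes the gradient of the squared error on its own data shard.
            List<Future<Double>> partials = new ArrayList<>();
            for (int r = 0; r < workers; r++) {
                final int lo = r * shard, hi = lo + shard;
                partials.add(pool.submit(() -> {
                    double g = 0.0;
                    for (int i = lo; i < hi; i++) {
                        g += 2.0 * (wNow * x[i] - y[i]) * x[i];
                    }
                    return g / (hi - lo);
                }));
            }
            // "All-reduce" step: average the per-worker gradients, then apply one update.
            double grad = 0.0;
            for (Future<Double> f : partials) {
                grad += f.get();
            }
            grad /= workers;
            w = wNow - lr * grad;
        }
        pool.shutdown();
        System.out.printf("learned w = %.3f (target 3.0)%n", w);
    }
}

In production systems the averaging step is an all-reduce over a network fabric rather than a loop over futures, but the division of data and the single synchronized update per step are the same.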


2021 ◽ Vol 14 (3) ◽ pp. 1-21
Author(s): Ryota Yasudo, José G. F. Coutinho, Ana-Lucia Varbanescu, Wayne Luk, Hideharu Amano, et al.

Next-generation high-performance computing platforms will handle extreme data- and compute-intensive problems that are intractable with today’s technology. A promising path to achieving the next leap in high-performance computing is to embrace heterogeneity and specialised computing in the form of reconfigurable accelerators such as FPGAs, which have been shown to speed up compute-intensive tasks with reduced power consumption. However, assessing the feasibility of large-scale heterogeneous systems requires fast and accurate performance prediction. This article proposes Performance Estimation for Reconfigurable Kernels and Systems (PERKS), a novel performance estimation framework for reconfigurable dataflow platforms. PERKS makes use of an analytical model with machine and application parameters for predicting the performance of multi-accelerator systems and detecting their bottlenecks. Model calibration is automatic, making the model flexible and usable for different machine configurations and applications, including hypothetical ones. Our experimental results show that PERKS can predict the performance of current workloads on reconfigurable dataflow platforms with an accuracy above 91%. The results also illustrate how the modelling scales to large workloads, and how the performance impact of architectural features can be estimated in seconds.
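The abstract does not reproduce the PERKS model itself, but the roofline-style sketch below gives a rough sense of what an analytical model "with machine and application parameters" can look like: each resource (compute, memory, host link) yields a lower bound on kernel time, and the largest bound is taken as both the estimate and the bottleneck. The parameter names, values, and the max-of-bounds formula are simplifications assumed for illustration only, not the calibrated PERKS model.

/**
 * A hedged, roofline-style sketch of an analytical performance model for a
 * reconfigurable dataflow kernel. Requires Java 16+ (records). All parameters
 * and values are hypothetical.
 */
public class AnalyticalModelSketch {

    /** Machine parameters (hypothetical names and units). */
    record Machine(double peakGflops, double memBandwidthGBs, double hostLinkGBs) {}

    /** Application (kernel) parameters (hypothetical names and units). */
    record Kernel(String name, double gflop, double onChipGB, double overLinkGB) {}

    /** Lower bound on time per resource; the slowest resource bounds a pipelined dataflow kernel. */
    static double[] resourceTimes(Machine m, Kernel k) {
        return new double[] {
            k.gflop() / m.peakGflops(),          // compute
            k.onChipGB() / m.memBandwidthGBs(),  // on-board memory traffic
            k.overLinkGB() / m.hostLinkGBs()     // host-accelerator link traffic
        };
    }

    public static void main(String[] args) {
        Machine board = new Machine(500.0, 60.0, 12.0);   // GFLOP/s, GB/s, GB/s
        Kernel fft = new Kernel("fft", 40.0, 8.0, 2.0);   // GFLOP, GB, GB
        double[] t = resourceTimes(board, fft);
        String[] names = { "compute", "memory", "host link" };
        int worst = 0;
        for (int i = 1; i < t.length; i++) {
            if (t[i] > t[worst]) worst = i;
        }
        // Report the predicted time and which resource dominates it.
        System.out.printf("%s: predicted %.3f s, bottleneck: %s%n",
                fft.name(), t[worst], names[worst]);
    }
}

A framework such as PERKS additionally calibrates such parameters automatically and composes kernel estimates into multi-accelerator system estimates; the sketch above only conveys the shape of the per-kernel reasoning.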


Author(s): Eliu Huerta, Asad Khan, Edward Davis, Colleen Bushell, William Gropp, et al.

Abstract Significant investments to upgrade and construct large-scale scientific facilities demand commensurate investments in R&D to design algorithms and computing approaches to enable scientific and engineering breakthroughs in the big data era. Innovative Artificial Intelligence (AI) applications have powered transformational solutions for big data challenges in industry and technology that now drive a multi-billion-dollar industry, and which play an ever-increasing role in shaping human social patterns. As AI continues to evolve into a computing paradigm endowed with statistical and mathematical rigor, it has become apparent that single-GPU solutions for training, validation, and testing are no longer sufficient for AI applications that aim to provide novel solutions for big-data challenges posed by scientific facilities that produce data at a rate and volume that outstrip the computing capabilities of available cyberinfrastructure platforms. This realization has been driving the confluence of AI and high-performance computing (HPC), which is critical to reduce time-to-insight and to enable a systematic study of domain-inspired AI architectures and optimization schemes for data-driven discovery. In this article we present a summary of recent developments in this field, and discuss avenues to accelerate and streamline the use of HPC platforms to design accelerated AI algorithms.


2021 ◽ Vol 8 (1)
Author(s): Marek Nowicki, Łukasz Górski, Piotr Bała

Abstract With the development of peta- and exascale computing systems, there is growing interest in running Big Data and Artificial Intelligence (AI) applications on them. Big Data and AI applications are implemented in Java, Scala, Python, and other languages that are not widely used in High-Performance Computing (HPC), which is still dominated by C and Fortran. Moreover, they are based on dedicated environments such as Hadoop or Spark, which are difficult to integrate with traditional HPC management systems. We have developed the Parallel Computing in Java (PCJ) library, a tool for scalable high-performance computing and Big Data processing in Java. In this paper, we present the basic functionality of the PCJ library with examples of highly scalable applications running on large-scale resources. Performance results are presented for different classes of applications, including traditional compute-intensive (HPC) workloads such as stencil computations, as well as communication-intensive algorithms such as the Fast Fourier Transform (FFT). We present implementation details and performance results for Big Data processing running on petascale systems. Examples of large-scale AI workloads parallelized using PCJ are also presented.
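To give a feel for the programming model, below is a minimal Monte Carlo pi estimator written in the SPMD style of the PCJ examples published by the library's authors: every PCJ thread samples points locally, publishes its count in a shared variable, and thread 0 gathers the counts after a barrier. The annotations and calls follow the PCJ 5.x API as we understand it; exact signatures and the launcher semantics should be checked against the documentation of the PCJ version in use.

import java.io.File;
import java.util.Random;
import org.pcj.PCJ;
import org.pcj.RegisterStorage;
import org.pcj.StartPoint;
import org.pcj.Storage;

/*
 * Sketch of an SPMD PCJ program (Monte Carlo estimation of pi).
 * API usage follows published PCJ examples; verify against the installed version.
 */
@RegisterStorage(PiMonteCarlo.Shared.class)
public class PiMonteCarlo implements StartPoint {

    @Storage(PiMonteCarlo.class)
    enum Shared { hits }          // PCJ shared variable, one copy per PCJ thread
    long hits;

    @Override
    public void main() {
        long samplesPerThread = 1_000_000L;
        Random random = new Random(PCJ.myId());
        long inCircle = 0;
        for (long i = 0; i < samplesPerThread; i++) {
            double x = 2.0 * random.nextDouble() - 1.0;
            double y = 2.0 * random.nextDouble() - 1.0;
            if (x * x + y * y <= 1.0) inCircle++;
        }
        hits = inCircle;          // publish the local count via the shared variable
        PCJ.barrier();

        if (PCJ.myId() == 0) {    // thread 0 gathers and combines all partial counts
            long total = 0;
            for (int p = 0; p < PCJ.threadCount(); p++) {
                long h = PCJ.<Long>get(p, Shared.hits);
                total += h;
            }
            double pi = 4.0 * total / (samplesPerThread * PCJ.threadCount());
            System.out.println("pi ~= " + pi);
        }
    }

    public static void main(String[] args) {
        // Node list and deploy/start semantics depend on the PCJ version and cluster setup.
        PCJ.executionBuilder(PiMonteCarlo.class)
           .addNodes(new File("nodes.txt"))
           .deploy();
    }
}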

