High-Performance Computing In High-Throughput Sequencing

The NSF-funded Scalable CyberInfrastructure for Artificial Intelligence and Likelihood Free Inference (SCAILFIN) project aims to develop and deploy artificial intelligence (AI) and likelihood-free inference (LFI) techniques and software using scalable cyberinfrastructure (CI) built on top of existing CI elements. Specifically, the project has extended the CERN-based REANA framework, a cloud-based data analysis platform deployed on top of Kubernetes clusters that was originally designed to enable analysis reusability and reproducibility. REANA is capable of orchestrating extremely complicated multi-step workflows, and uses Kubernetes clusters both for scheduling and distributing container-based workloads across a cluster of available machines, as well as instantiating and monitoring the concrete workloads themselves. This work describes the challenges and development efforts involved in extending REANA and the components that were developed in order to enable large scale deployment on High Performance Computing (HPC) resources. Using the Virtual Clusters for Community Computation (VC3) infrastructure as a starting point, we implemented REANA to work with a number of differing workload managers, including both high performance and high throughput, while simultaneously removing REANA’s dependence on Kubernetes support at the workers level.

Download Full-text

High-performance computing and Grid applications in high-throughput protein crystallography

Acta Crystallographica Section A Foundations of Crystallography ◽

10.1107/s0108767304095170 ◽

2004 ◽

Vol 60 (a1) ◽

pp. s243-s243

Author(s):

R. M. Keegan ◽

D. Meredith ◽

G. Winter ◽

M. D. Winn

Keyword(s):

High Performance Computing ◽

High Throughput ◽

High Performance ◽

Protein Crystallography ◽

Grid Applications ◽

Performance Computing

Download Full-text

Running IceCube GPU simulations on Titan

EPJ Web of Conferences ◽

10.1051/epjconf/201921403024 ◽

2019 ◽

Vol 214 ◽

pp. 03024

Author(s):

Vladimir Brik ◽

David Schultz ◽

Gonzalo Merino

Keyword(s):

High Performance Computing ◽

High Throughput ◽

High Performance ◽

Management Systems ◽

Workload Management ◽

High Throughput Computing ◽

Large Numbers ◽

Mpi Applications ◽

Performance Computing

Here we report IceCube’s first experiences of running GPU simulations on the Titan supercomputer. This undertaking was non-trivial because Titan is designed for High Performance Computing (HPC) workloads, whereas IceCube’s workloads fall under the High Throughput Computing (HTC) category. In particular: (i) Titan’s design, policies, and tools are geared heavily toward large MPI applications, while IceCube’s workloads consist of large numbers of relatively small independent jobs, (ii) Titan compute nodes run Cray Linux, which is not directly compatible with IceCube software, and (iii) Titan compute nodes cannot access outside networks, making it impossible to access IceCube’s CVMFS repositories and workload management systems. This report examines our experience of packaging our application in Singularity containers and using HTCondor as the second-level scheduler on the Titan supercomputer.

Download Full-text

Leveraging the Power of High Performance Computing for Next Generation Sequencing Data Analysis: Tricks and Twists from a High Throughput Exome Workflow

PLoS ONE ◽

10.1371/journal.pone.0126321 ◽

2015 ◽

Vol 10 (5) ◽

pp. e0126321 ◽

Cited By ~ 29

Author(s):

Amit Kawalia ◽

Susanne Motameny ◽

Stephan Wonczak ◽

Holger Thiele ◽

Lech Nieroda ◽

...

Keyword(s):

Data Analysis ◽

Next Generation Sequencing ◽

High Performance Computing ◽

High Throughput ◽

High Performance ◽

Next Generation Sequencing Data ◽

Sequencing Data ◽

Performance Computing ◽

Generation Sequencing ◽

Sequencing Data Analysis

Download Full-text

High-Throughput Computing Versus High-Performance Computing for Groundwater Applications

Ground Water ◽

10.1111/gwat.12320 ◽

2015 ◽

Vol 53 (2) ◽

pp. 180-184 ◽

Cited By ~ 11

Author(s):

Michael N. Fienen ◽

Randall J. Hunt

Keyword(s):

High Performance Computing ◽

High Throughput ◽

High Performance ◽

High Throughput Computing ◽

Performance Computing

Download Full-text

Cosmos : A Unified Accounting System both for the HTCondor and Slurm Clusters at IHEP

EPJ Web of Conferences ◽

10.1051/epjconf/202024507060 ◽

2020 ◽

Vol 245 ◽

pp. 07060

Author(s):

Ran Du ◽

Jingyan Shi ◽

Xiaowei Jiang ◽

Jiaheng Zou

Keyword(s):

High Performance Computing ◽

Virtual Machine ◽

High Throughput ◽

High Performance ◽

Accounting System ◽

High Throughput Computing ◽

Parallel Jobs ◽

The Status ◽

Set Up ◽

Performance Computing

HTCondor was adopted to manage the High Throughput Computing (HTC) cluster at IHEP in 2016. In 2017 a Slurm cluster was set up to run High Performance Computing (HPC) jobs. To provide accounting services for these two clusters, we implemented a unified accounting system named Cosmos. Multiple workloads bring different accounting requirements. Briefly speaking, there are four types of jobs to account. First of all, 30 million single-core jobs run in the HTCondor cluster every year. Secondly, Virtual Machine (VM) jobs run in the legacy HTCondor VM cluster. Thirdly, parallel jobs run in the Slurm cluster, and some of these jobs are run on the GPU worker nodes to accelerate computing. Lastly, some selected HTC jobs are migrated from the HTCondor cluster to the Slurm cluster for research purposes. To satisfy all the mentioned requirements, Cosmos is implemented with four layers: acquisition, integration, statistics and presentation. Details about the issues and solutions of each layer will be presented in the paper. Cosmos has run in production for two years, and the status shows that it is a well-functioning system, also meets the requirements of the HTCondor and Slurm clusters.

Download Full-text