CoFilter: A High-Performance Switch-Accelerated Stateful Packet Filter for Bare-Metal Servers

Author(s): Jiamin Cao ◽ Ying Liu ◽ Yu Zhou ◽ Lin He ◽ Chen Sun ◽ Yangyang Wang ◽ ...

2019 ◽ Vol 18 (4) ◽ pp. 31-42
Author(s): Carlos Arango ◽ Rémy Dernat ◽ John Sanabria

Virtualization technologies have evolved along with the development of computational environments. Virtualization offered features that were needed at the time, such as isolation, accountability, resource allocation, and fair resource sharing. Novel processor technologies bring to commodity computers the ability to emulate diverse environments in which a wide range of computational scenarios can be run. Along with this processor evolution, developers have implemented different virtualization mechanisms that outperform earlier virtualized environments. Recently, operating-system-based virtualization technologies have captured the attention of communities worldwide because of their significant performance improvements. In this paper, the features of three container-based operating-system (COS) virtualization tools (LXC, Docker and Singularity) are presented. LXC, Docker, Singularity and bare metal are put under test through a customized single-node HPL benchmark and an MPI-based application for the multi-node testbed. Disk I/O performance, memory (RAM) performance, network bandwidth and GPU performance are also tested for the COS technologies versus bare metal. Preliminary results and the conclusions drawn from them are presented and discussed.
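A minimal sketch of the kind of side-by-side timing comparison this study performs, assuming hypothetical container and image names ("hpl-container", "hpl-image", "hpl.sif") and a stand-in benchmark command; the study itself used a customized HPL benchmark and an MPI application, whose exact invocations are not given here.

```python
import subprocess
import time

# Stand-in for the benchmark binary; the study used a customized HPL run.
BENCH_CMD = ["./xhpl"]

# Hypothetical container/image names for each runtime under test.
RUNNERS = {
    "bare-metal":  BENCH_CMD,
    "lxc":         ["lxc-attach", "-n", "hpl-container", "--"] + BENCH_CMD,
    "docker":      ["docker", "run", "--rm", "hpl-image"] + BENCH_CMD,
    "singularity": ["singularity", "exec", "hpl.sif"] + BENCH_CMD,
}

def time_run(cmd):
    """Run a command once and return its wall-clock time in seconds."""
    start = time.perf_counter()
    subprocess.run(cmd, check=True)
    return time.perf_counter() - start

if __name__ == "__main__":
    for name, cmd in RUNNERS.items():
        print(f"{name:12s} {time_run(cmd):8.2f} s")
```

In practice each configuration would be run several times and averaged; the single-shot timing above only illustrates the shape of the comparison.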


2021 ◽ Vol 17 (7) ◽ pp. e1009244
Author(s): Maximilian Hanussek ◽ Felix Bartusch ◽ Jens Krüger

The large amount of biological data available today makes it necessary to use tools and applications based on the sophisticated and efficient algorithms developed in the field of bioinformatics. Furthermore, access to high-performance computing resources is necessary to achieve results in a reasonable time. To speed up applications and utilize available compute resources as efficiently as possible, software developers make use of parallelization mechanisms such as multithreading. Many of the available bioinformatics tools offer multithreading capabilities, but more compute power is not always helpful. In this study we investigated the behavior of well-known bioinformatics applications with respect to their scaling, different virtual environments, and different datasets, using our benchmarking tool suite BOOTABLE. The tool suite includes BBMap, Bowtie2, BWA, Velvet, IDBA, SPAdes, Clustal Omega, MAFFT, SINA and GROMACS. In addition, we added an application using the machine learning framework TensorFlow. Machine learning is not directly part of bioinformatics but is applied to many biological problems, especially in the context of medical images (X-ray photographs). The mentioned tools were analyzed in two different virtual environments: a virtual machine environment based on the OpenStack cloud software, and a Docker environment. The measured performance values were compared to a bare-metal setup and to each other. The study reveals that the virtual environments used produce an overhead in the range of seven to twenty-five percent compared to the bare-metal environment. The scaling measurements showed that some of the analyzed tools do not benefit from larger amounts of computing resources, whereas others showed almost linear scaling behavior. The findings of this study have been generalized as far as possible and should help users find the right amount of resources for their analyses. Furthermore, the results provide valuable information for resource providers to manage their resources as efficiently as possible and raise the user community's awareness of the efficient usage of computing resources.
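The scaling behavior described here is conventionally summarized as speedup and parallel efficiency over a single-threaded baseline; a minimal sketch of that arithmetic, with placeholder timings that are not measurements from the study:

```python
# Speedup S(n) = T(1)/T(n); parallel efficiency E(n) = S(n)/n.
# The wall-clock times below are placeholders, not data from BOOTABLE.
timings = {1: 960.0, 2: 500.0, 4: 270.0, 8: 160.0, 16: 120.0}

t1 = timings[1]
for n, t in sorted(timings.items()):
    speedup = t1 / t
    efficiency = speedup / n
    print(f"{n:3d} threads: speedup {speedup:5.2f}, efficiency {efficiency:5.2f}")
```

A tool that "does not benefit from larger amounts of computing resources" shows efficiency falling well below 1 as the thread count grows, whereas an almost linearly scaling tool keeps efficiency close to 1.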


2021 ◽ Vol 13 (21) ◽ pp. 11782
Author(s): Taha Al-Jody ◽ Hamza Aagela ◽ Violeta Holmes

There is a tradition at our university of teaching and research in High Performance Computing (HPC) systems engineering. With exascale computing on the horizon and a shortage of HPC talent, there is a need for new specialists to secure the future of research computing. Whilst many institutions provide research computing training for users within their particular domain, few offer HPC engineering and infrastructure-related courses, making it difficult for students to acquire these skills. This paper outlines how and why we are training students in HPC systems engineering, including the technologies used to deliver this goal. We demonstrate the potential of a multi-tenant HPC system for education and research, using a novel container- and cloud-based architecture. This work builds on our previously published work that uses the latest open-source technologies to create sustainable, fast and flexible turn-key HPC environments with secure access via an HPC portal. The proposed multi-tenant HPC resources can be deployed on "bare metal" infrastructure or in the cloud. An evaluation of our activities over the last five years is given in terms of recruitment metrics, skills-audit feedback from students, and research outputs enabled by the multi-tenant usage of the resource.


2021
Author(s): Peini Liu ◽ Jordi Guitart

Containerization technology offers an appealing alternative for encapsulating and operating applications (and all their dependencies) without the performance penalties of Virtual Machines; as a result, it has attracted the interest of the High-Performance Computing (HPC) community as a means to obtain fast, customized, portable, flexible, and reproducible deployments of their workloads. Previous work in this area has demonstrated that containerized HPC applications can exploit InfiniBand networks, but has ignored the potential of multi-container deployments, which partition the processes belonging to each application into multiple containers on each host. Partitioning HPC applications has proven useful with virtual machines, where it constrains each VM to a single NUMA (Non-Uniform Memory Access) domain. This paper conducts a systematic study of the performance of multi-container deployments with different network fabrics and protocols, focusing especially on InfiniBand networks. We analyze the impact of container granularity and its potential to exploit processor and memory affinity to improve application performance. Our results show that default Singularity can achieve near bare-metal performance but does not support fine-grained multi-container deployments. Docker and Singularity-instance behave similarly in terms of the performance of deployment schemes with different container granularity and affinity. This behavior differs across network fabrics and protocols, and also depends on the application's communication patterns and message size. Moreover, deployments on InfiniBand are more strongly affected by computation and memory allocation, and because of that they can exploit affinity better.
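A sketch of how such a fine-grained, NUMA-pinned multi-container deployment might be expressed with Docker's cpuset flags. The host layout (two NUMA domains of eight cores each) and the image name "hpc-app" are assumptions for illustration, not the paper's exact configuration:

```python
import subprocess

# Assumed host layout: 2 NUMA domains, 8 cores each (cores 0-7 on node 0,
# cores 8-15 on node 1). One container per NUMA domain.
NUMA_LAYOUT = {0: "0-7", 1: "8-15"}

for node, cpus in NUMA_LAYOUT.items():
    subprocess.run([
        "docker", "run", "-d",
        "--name", f"hpc-app-node{node}",     # hypothetical container name
        "--cpuset-cpus", cpus,               # restrict the container to these cores
        "--cpuset-mems", str(node),          # allocate memory from this NUMA node only
        "hpc-app",                           # hypothetical application image
    ], check=True)
```

Pinning each container's CPUs and memory to a single NUMA domain is what lets the deployment exploit the processor and memory affinity the paper analyzes.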


Author(s): Andrew E. Bruno ◽ Salvatore J. Guercio ◽ Doris Sajdak ◽ Tony Kew ◽ Matthew D. Jones

2019 ◽ Vol 214 ◽ pp. 07031
Author(s): Belmiro Moreira ◽ Spyridon Trigazis ◽ Theodoros Tsioutsias

The CERN OpenStack cloud provides over 300,000 CPU cores to run data processing analyses for the Large Hadron Collider (LHC) experiments. Delivering these services with high performance and reliable service levels, while at the same time ensuring continuously high resource utilization, has been one of the major challenges for the CERN cloud engineering team. Several optimizations, such as NUMA-aware scheduling and huge pages, have been deployed to improve the performance of scientific workloads, but the CERN Cloud team continues to explore new possibilities such as preemptible instances and containers on bare metal. In this paper we dive into the concept and implementation challenges of preemptible instances and containers on bare metal for scientific workloads. We also explore how they can improve scientific workload throughput and infrastructure resource utilization. We present the ongoing collaboration with the Square Kilometer Array (SKA) community to develop the upstream enhancements necessary to further improve OpenStack Nova to support large-scale scientific workloads.
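In OpenStack, the NUMA-aware scheduling and huge-pages optimizations mentioned here are typically requested through Nova flavor extra specs; a minimal sketch driving the standard openstack CLI, with a hypothetical flavor name ("analysis.large"):

```python
import subprocess

# Hypothetical flavor name; hw:numa_nodes and hw:mem_page_size are
# standard Nova flavor extra specs for NUMA topology and huge pages.
FLAVOR = "analysis.large"
EXTRA_SPECS = {
    "hw:numa_nodes": "2",         # expose two guest NUMA nodes to the VM
    "hw:mem_page_size": "large",  # back guest memory with huge pages
}

cmd = ["openstack", "flavor", "set"]
for key, value in EXTRA_SPECS.items():
    cmd += ["--property", f"{key}={value}"]
cmd.append(FLAVOR)
subprocess.run(cmd, check=True)
```

Instances booted from such a flavor are then placed by the scheduler so that guest NUMA nodes map onto host NUMA nodes, which is the essence of the NUMA-aware scheduling optimization described above.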


Author(s): A. V. Crewe ◽ M. Isaacson ◽ D. Johnson

A double-focusing magnetic spectrometer has been constructed for use with a field-emission electron gun scanning microscope in order to study the electron energy-loss mechanism in thin specimens. It is of the uniform-field sector type with curved pole pieces. The shape of the pole pieces is determined by requiring that all particles be focused to a point at the image slit (point 1). The resultant shape gives perfect focusing in the median plane (Fig. 1) and first-order focusing in the vertical plane (Fig. 2).


Author(s): N. Yoshimura ◽ K. Shirota ◽ T. Etoh

One of the most important requirements for a high-performance EM, especially an analytical EM using a fine beam probe, is to prevent specimen contamination by providing a clean high vacuum in the vicinity of the specimen. However, in almost all commercial EMs, the pressure in the vicinity of the specimen under observation is usually more than ten times higher than the pressure measured at the pumping line. The EM column inevitably requires the use of greased Viton O-rings for fine movement, and specimens and films need to be exchanged frequently; several attachments may also be exchanged. For these reasons, a high-speed pumping system, as well as a clean vacuum system, is now required. A newly developed electron microscope, the JEM-100CX, features a clean high vacuum in the vicinity of the specimen, realized by the use of a CASCADE-type diffusion pump system that is a substantial improvement over its predecessor employed on the JEM-100C.

