Composition and Realization of Source-to-Sink High-Performance Flows: File Systems, Storage, Hosts, LAN and WAN

Non-volatile memory and remote direct memory access (RDMA) provide extremely high performance in storage and network hardware. However, existing distributed file systems strictly isolate file system and network layers, and the heavy layered software designs leave high-speed hardware under-exploited. In this article, we propose an RDMA-enabled distributed persistent memory file system, Octopus + , to redesign file system internal mechanisms by closely coupling non-volatile memory and RDMA features. For data operations, Octopus + directly accesses a shared persistent memory pool to reduce memory copying overhead, and actively fetches and pushes data all in clients to rebalance the load between the server and network. For metadata operations, Octopus + introduces self-identified remote procedure calls for immediate notification between file systems and networking, and an efficient distributed transaction mechanism for consistency. Octopus + is enabled with replication feature to provide better availability. Evaluations on Intel Optane DC Persistent Memory Modules show that Octopus + achieves nearly the raw bandwidth for large I/Os and orders of magnitude better performance than existing distributed file systems.

Download Full-text

Scalable Distributed Metadata Server Based on Nonblocking Transactions

JUCS - Journal of Universal Computer Science ◽

10.3897/jucs.2020.006 ◽

2020 ◽

Vol 26 (1) ◽

pp. 89-106

Author(s):

Kohei Hiraga ◽

Osamu Tatebe ◽

Hideyuki Kawashima

Keyword(s):

High Performance ◽

File Systems ◽

Software Transactional Memory ◽

Server Side ◽

Multiple Servers ◽

Distributed Transaction ◽

Lock Mode ◽

Dynamic Software ◽

Metadata Server ◽

Performance Computing

Metadata performance scalability is critically important in high-performance computing when accessing many small files from millions of clients. This paper proposes a design of a scalable distributed metadata server, PPMDS, for parallel file systems using multiple key-value servers. In PPMDS, hierarchical namespace of a file system is efficiently managed by multiple servers. Multiple entries can be atomically updated using a nonblocking distributed transaction based on an algorithm of dynamic software transactional memory. This paper also proposes optimizations to further improve the metadata performance by introducing a server-side transaction processing, multiple readers, and a shared lock mode, which reduce the number of remote procedure calls and prevent unnecessary blocking. Performance evaluation shows the scalable performance up to 3 servers, and achieves 62,000 operations per second, which is 2.58x performance improvement compared to a single metadata performance.

Download Full-text

High Performance Storage for Big Data Analytics and Visualization

Advances in Data Mining and Database Management - Handbook of Research on Big Data Storage and Visualization Techniques ◽

10.4018/978-1-5225-3142-5.ch010 ◽

2018 ◽

pp. 254-275

Author(s):

Armando Fandango ◽

William Rivera

Keyword(s):

Big Data ◽

High Speed ◽

High Performance ◽

File System ◽

Predictive Analytics ◽

Big Data Analytics ◽

File Systems ◽

Distributed Applications ◽

System Level ◽

File Formats

Scientific Big Data being gathered at exascale needs to be stored, retrieved and manipulated. The storage stack for scientific Big Data includes a file system at the system level for physical organization of the data, and a file format and input/output (I/O) system at the application level for logical organization of the data; both of them of high-performance variety for exascale. The high-performance file system is designed with concurrent access, high-speed transmission and fault tolerance characteristics. High-performance file formats and I/O are designed to allow parallel and distributed applications with easy and fast access to Big Data. These specialized file formats make it easier to store and access Big Data for scientific visualization and predictive analytics. This chapter provides a brief review of the characteristics of high-performance file systems such as Lustre and GPFS, and high-performance file formats such as HDF5, NetCDF, MPI-IO, and HDFS.

Download Full-text

High Performance Cache Management for Parallel File Systems

Vector and Parallel Processing – VECPAR’98 - Lecture Notes in Computer Science ◽

10.1007/10703040_35 ◽

1999 ◽

pp. 466-479 ◽

Cited By ~ 4

Author(s):

F. García ◽

J. Carretero ◽

F. Pérez ◽

P. de Miguel

Keyword(s):

High Performance ◽

File Systems ◽

Cache Management ◽

Parallel File Systems ◽

Parallel File

Download Full-text

Integrating High Performance File Systems in a Cloud Computing Environment

2012 SC Companion: High Performance Computing, Networking Storage and Analysis ◽

10.1109/sc.companion.2012.103 ◽

2012 ◽

Cited By ~ 3

Author(s):

Abhisek Pan ◽

John Paul Walters ◽

Vijay S. Pai ◽

Dong-In D. Kang ◽

Stephen P. Crago

Keyword(s):

Cloud Computing ◽

High Performance ◽

File Systems ◽

Computing Environment ◽

Cloud Computing Environment

Download Full-text

Ad Hoc File Systems for High-Performance Computing

Journal of Computer Science and Technology ◽

10.1007/s11390-020-9801-1 ◽

2020 ◽

Vol 35 (1) ◽

pp. 4-26 ◽

Cited By ~ 2

Author(s):

André Brinkmann ◽

Kathryn Mohror ◽

Weikuan Yu ◽

Philip Carns ◽

Toni Cortes ◽

...

Keyword(s):

High Performance Computing ◽

High Performance ◽

Ad Hoc ◽

File Systems ◽

Performance Computing

Download Full-text

On Rectilinear Distance-Preserving Trees

VLSI Design ◽

10.1155/1998/26574 ◽

1998 ◽

Vol 7 (1) ◽

pp. 15-30

Author(s):

Gustavo E. Téllez ◽

Majid Sarrafzadeh

Keyword(s):

Approximation Algorithm ◽

Heuristic Algorithm ◽

Time Complexity ◽

High Performance ◽

Exact Algorithm ◽

Wire Length ◽

Worst Case ◽

Rectilinear Distance ◽

Source To Sink ◽

Linear Delay

Given a set of terminals on the plane N={s,ν1,…,νn}, with a source terminal s, a Rectilinear Distance-Preserving Tree (RDPT) T(V, E) is defined as a tree rooted at s, connecting all terminals in N. An RDPT has the property that the length of every source to sink path is equal to the rectilinear distance between that source and sink. A Min- Cost Rectilinear Distance-Preserving Tree (MRDPT) minimizes the total wire length while maintaining minimal source to sink linear delay, making it suitable for high performance interconnect applications.This paper studies problems in the construction of RDPTs, including the following contributions. A new exact algorithm for a restricted version of the problem in one quadrant with O(n2) time complexity is proposed. A novel heuristic algorithm, which uses optimally solvable sub-problems, is proposed for the problem in a single quadrant. The average and worst-case time complexity for the proposed heuristic algorithm are O(n3/2) and O(n3), respectively. A 2-approximation of the quadrant merging problem is proposed. The proposed algorithm has time complexity O(α2T(n)+α3) for any constant α > 1, where T(n) is the time complexity of the solution of the RDPT problem on one quadrant. This result improves over the best previous quadrant merging solution which has O(n2T(n)+n3) time complexity.We test our algorithms on randomly uniform point sets and compare our heuristic RDPT construction against a Minimum Cost Rectilinear Steiner (MRST) tree approximation algorithm. Our results show that RDPTs are competitive with Steiner trees in total wire-length when the number of terminals is less than 32. This result makes RDPTs suitable for VLSI routing applications. We also compare our algorithm to the Rao-Shor RDPT approximation algorithm obtaining improvements of up to 10% in total wirelength. These comparisons show that the algorithms proposed herein produce promising results.

Download Full-text

Mapping Cache for High-Performance Memory Mapped File I/O in Memory File Systems

Journal of KIISE ◽

10.5626/jok.2016.43.5.524 ◽

2016 ◽

Vol 43 (5) ◽

pp. 524-530

Author(s):

Jiwon Kim ◽

Jungsik Choi ◽

Hwansoo Han

Keyword(s):

High Performance ◽

File Systems

Download Full-text