Rethinking key–value store for parallel I/O optimization

Author(s):  
Anthony Kougkas ◽  
Hassan Eslami ◽  
Xian-He Sun ◽  
Rajeev Thakur ◽  
William Gropp

Key–value stores are being widely used as the storage system for large-scale internet services and cloud storage systems. However, they are rarely used in HPC systems, where parallel file systems are the dominant storage solution. In this study, we examine the architecture differences and performance characteristics of parallel file systems and key–value stores. We propose using key–value stores to optimize overall Input/Output (I/O) performance, especially for workloads that parallel file systems cannot handle well, such as the cases with intense data synchronization or heavy metadata operations. We conducted experiments with several synthetic benchmarks, an I/O benchmark, and a real application. We modeled the performance of these two systems using collected data from our experiments, and we provide a predictive method to identify which system offers better I/O performance given a specific workload. The results show that we can optimize the I/O performance in HPC systems by utilizing key–value stores.

Author(s):  
Eduardo Inacio ◽  
Mario Antonio Dantas

To meet ever increasing capacity and performance requirements of emerging data-intensive applications, highly distributed and multilayered back-end storage systems have been employed in large-scale high performance computing (HPC) environments. A main component of these storage infrastructures is the parallel file system (PFS), a especially designed file system for absorbing bulk data transfers from applications with thousands of concurrent processes. Load distribution on PFS data servers compose a major source of intra-application input/output (I/O) performance variability. Albeit mitigating variability is desirable, as it is known to harm application-perceived performance, understanding and dealing with I/O performance variability in such complex environments remains a challenging task. In this research, a differentiated approach for evaluating and mitigating intra-application I/O performance variability over PFSs is proposed. More specifically, from the evaluation perspective, a comprehensive approach combining complementary methods is proposed. An analytical model proposal, named DTSMaxLoad, provides estimates for the maximum load in a PFS data server. To complement DTSMaxLoad, modeling conditions and mechanisms hard to represent analytically, the Parallel I/O and Storage System (PIOSS) simulation model was proposed. Finally, for experimental evaluation over real environments, a flexible and distributed I/O performance evaluation tool, coined as IOR-Extended (IORE), was proposed. Furthermore, a high-level file distribution approach for PFSs, called N-N Round-Robin (N2R2), was proposed focusing on mitigating I/O performance variability for distributed applications where each process accesses an individual and independent file. An extensive experimental effort, including measurements on real environments, was conducted in this research work for evaluating each of the proposed approaches. In summary, this evaluation indicated both DTSMaxLoad and PIOSS modeling proposals can represent load distribution behavior on PFSs with significant fidelity. Moreover, results demonstrated N2R2 successfully reduced intra-application I/O performance variability for 270 distinct experimental scenarios, which, ultimately, translated into overall application I/O performance Improvements.


2012 ◽  
Vol 241-244 ◽  
pp. 1556-1561
Author(s):  
Qi Meng Wu ◽  
Ke Xie ◽  
Ming Fa Zhu ◽  
Li Min Xiao ◽  
Li Ruan

Parallel file systems deploy multiple metadata servers to distribute heavy metadata workload from clients. With the increasing number of metadata servers, metadata-intensive operations are facing some problems related with collaboration among them, compromising the performance gain. Consequently, a file system simulator is very helpful to try out some optimization ideas to solve these problems. In this paper, we propose DMFSsim to simulate the metadata-intensive operations on large-scale distributed metadata file systems. DMFSsim can flexibly replay traces of multiple metadata operations, support several commonly used metadata distribution algorithms, simulate file system tree hierarchy and underlying disk blocks management mechanism in real systems. Extensive simulations show that DMFSsim is capable of demonstrating the performance of metadata-intensive operations in distributed metadata file system.


Technologies ◽  
2021 ◽  
Vol 9 (2) ◽  
pp. 28
Author(s):  
Hossam A. Gabbar ◽  
Ahmed M. Othman ◽  
Muhammad R. Abdussami

The evolving global landscape for electrical distribution and use created a need area for energy storage systems (ESS), making them among the fastest growing electrical power system products. A key element in any energy storage system is the capability to monitor, control, and optimize performance of an individual or multiple battery modules in an energy storage system and the ability to control the disconnection of the module(s) from the system in the event of abnormal conditions. This management scheme is known as “battery management system (BMS)”, which is one of the essential units in electrical equipment. BMS reacts with external events, as well with as an internal event. It is used to improve the battery performance with proper safety measures within a system. Therefore, a safe BMS is the prerequisite for operating an electrical system. This report analyzes the details of BMS for electric transportation and large-scale (stationary) energy storage. The analysis includes different aspects of BMS covering testing, component, functionalities, topology, operation, architecture, and BMS safety aspects. Additionally, current related standards and codes related to BMS are also reviewed. The report investigates BMS safety aspects, battery technology, regulation needs, and offer recommendations. It further studies current gaps in respect to the safety requirements and performance requirements of BMS by focusing mainly on the electric transportation and stationary application. The report further provides a framework for developing a new standard on BMS, especially on BMS safety and operational risk. In conclusion, four main areas of (1) BMS construction, (2) Operation Parameters, (3) BMS Integration, and (4) Installation for improvement of BMS safety and performance are identified, and detailed recommendations were provided for each area. It is recommended that a technical review of the BMS be performed for transportation electrification and large-scale (stationary) applications. A comprehensive evaluation of the components, architectures, and safety risks applicable to BMS operation is also presented.


Sign in / Sign up

Export Citation Format

Share Document