UHNVM: A Universal Heterogeneous Cache Design with Non-Volatile Memory

Electronics ◽  
2021 ◽  
Vol 10 (15) ◽  
pp. 1760
Author(s):  
Xiaochang Li ◽  
Zhengjun Zhai

In recent decades, non-volatile memory (NVM) has been anticipated to scale up main memory, improve application performance, and narrow the speed gap between main memory and storage devices, while supporting persistent storage to cope with power outages. However, to fit NVM, all existing DRAM-based applications have to be rewritten by developers. The developer must therefore understand the targeted application code well enough to manually identify and place the data suited to NVM. In order to intelligently facilitate NVM deployment for existing legacy applications, we propose a universal heterogeneous cache hierarchy which is able to automatically select and store the appropriate application data in non-volatile memory (UHNVM), without requiring code understanding. In this article, a program context (PC) technique is proposed in user space to help UHNVM classify data. Compared to conventional hot or cold file categorization, the PC technique categorizes application data in a fine-grained manner, enabling us to store it efficiently in either NVM or SSDs for better performance. Our experimental results using a real Optane dual-inline-memory-module (DIMM) card show that our new heterogeneous architecture reduces elapsed time by about 11% compared to a conventional kernel memory configuration without NVM.
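The abstract includes no code; as a rough, hypothetical illustration of the PC idea, the sketch below derives a program-context signature in user space by hashing the call-stack return addresses at an allocation site, so that data reached through the same call path shares a signature that a placement policy could map to NVM or SSD. All names and the toy placement rule are illustrative assumptions, not taken from the paper.

```c
/* Sketch: deriving a program-context (PC) signature in user space by
 * hashing call-stack return addresses, in the spirit of UHNVM's PC
 * technique. All names here are illustrative, not from the paper. */
#include <execinfo.h>
#include <stdint.h>
#include <stdio.h>

#define MAX_FRAMES 16

/* Combine the return addresses on the current call stack into one hash;
 * allocations reached through the same call path share a signature. */
static uint64_t pc_signature(void)
{
    void *frames[MAX_FRAMES];
    int n = backtrace(frames, MAX_FRAMES);
    uint64_t h = 1469598103934665603ULL;      /* FNV-1a offset basis */
    for (int i = 0; i < n; i++) {
        h ^= (uint64_t)(uintptr_t)frames[i];
        h *= 1099511628211ULL;                /* FNV-1a prime */
    }
    return h;
}

int main(void)
{
    uint64_t sig = pc_signature();
    /* Toy policy: a real system would track per-signature access
     * statistics before routing data to NVM or SSD. */
    printf("PC signature: %016llx -> %s\n",
           (unsigned long long)sig,
           (sig & 1) ? "NVM tier" : "SSD tier");
    return 0;
}
```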

2020 ◽  
Vol 10 (3) ◽  
pp. 999
Author(s):  
Hyokyung Bahn ◽  
Kyungwoon Cho

Recently, non-volatile memory (NVM) has advanced as a fast storage medium, and legacy memory subsystems optimized for DRAM (dynamic random access memory) and HDD (hard disk drive) hierarchies need to be revisited. In this article, we explore memory subsystems that use NVM as the underlying storage device and discuss the challenges and implications of such systems. As storage performance approaches DRAM performance, existing memory configurations and I/O (input/output) mechanisms should be reassessed. This article explores the performance of systems with NVM-based storage, emulated by a RAMDisk, under various configurations. Through our measurement study, we make the following findings. (1) We can decrease the main memory size without performance penalties when NVM storage is adopted instead of HDD. (2) For buffer caching to be effective, judicious management techniques like admission control are necessary. (3) Prefetching is not effective on NVM storage. (4) The effect of synchronous I/O and direct I/O on NVM storage is less significant than on HDD storage. (5) Performance degradation due to multi-thread contention is less severe on NVM-based storage than on HDD. Based on these observations, we discuss a new PC configuration consisting of small memory and fast storage, in comparison with a traditional PC consisting of large memory and slow storage. We show that this new memory-storage configuration can be an alternative solution for ever-growing memory demands and the limited density of DRAM. We anticipate that our results will provide directions for system software development in the presence of ever-faster storage devices.
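As a concrete illustration of the I/O configurations the study compares (finding (4)), the minimal C sketch below issues a direct, synchronous write with O_DIRECT | O_SYNC; the path and sizes are made up for the example, and O_DIRECT requires a sector-aligned buffer.

```c
/* Sketch: issuing a direct (O_DIRECT) and synchronous (O_SYNC) write,
 * the kind of I/O configuration the measurement study compares.
 * The path is illustrative only. */
#define _GNU_SOURCE
#include <fcntl.h>
#include <stdlib.h>
#include <string.h>
#include <unistd.h>

int main(void)
{
    int fd = open("/mnt/nvm/testfile",
                  O_WRONLY | O_CREAT | O_DIRECT | O_SYNC, 0644);
    if (fd < 0) return 1;

    void *buf;
    /* Direct I/O bypasses the page cache, so the user buffer must be
     * aligned to the device's logical block size (typically 512 B or 4 KiB). */
    if (posix_memalign(&buf, 4096, 4096) != 0) return 1;
    memset(buf, 'x', 4096);

    /* With O_SYNC the write returns only once the data is durable; the
     * study finds this costs far less on NVM storage than on HDD. */
    ssize_t n = write(fd, buf, 4096);

    free(buf);
    close(fd);
    return n == 4096 ? 0 : 1;
}
```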


Electronics ◽  
2020 ◽  
Vol 9 (8) ◽  
pp. 1304
Author(s):  
Thomas Haywood Dadzie ◽  
Jiwon Lee ◽  
Jihye Kim ◽  
Hyunok Oh

Non-Volatile Memory (NVM), such as PRAM or STT-MRAM, is often adopted as the main memory in portable embedded systems. The non-volatility, however, raises a security issue against physical attacks: memory contents remain vulnerable to extraction and snapshots. Simply encrypting the NVM degrades memory performance (high energy consumption, short lifetime), since typical encryption causes an avalanche effect while most NVMs pay a high cost for memory-write operations. In this paper, we propose NVM-shelf: Secure Hybrid Encryption with Less Flip (shelf) for Non-Volatile Memory (NVM), a hybrid encryption scheme that reduces the flip penalty. The main idea is that a stream cipher, such as a block cipher in CTR mode, is flip-tolerant when the keystream is reused. By modifying the CTR mode of the AES block cipher, we update the keystream at short intervals and reuse it within each interval, achieving flip reduction while maintaining security against physical attacks. Since CTR mode requires additional storage for the nonce, we classify write-intensive cache blocks, apply our CTR mode to the write-intensive blocks, and apply ECB mode to the remaining blocks. To extend the cache-based NVM-shelf implementation toward SPM-based systems, we also propose an efficient compiler for SA-SPM: Security-Aware Scratch Pad Memory, which ensures the security of main memories in SPM-based embedded systems. Our compiler is the first approach to support full encryption of memory regions (i.e., stack, heap, code, and static variables) in an SPM-based system. By integrating the NVM-shelf framework into the SA-SPM compiler, we obtain NVM-shelf implementations for both cache-based and SPM-based systems. The cache-based experiment shows that NVM-shelf achieves an encryption flip penalty of less than 3%, and the SPM-based experiment shows that NVM-shelf reduces the flip penalty by 31.8% compared to whole-memory encryption.
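To see why keystream reuse is flip-tolerant, note that under CTR encryption C = P ⊕ KS, so if the keystream KS is reused, flipping k plaintext bits flips exactly k ciphertext bits, whereas re-encrypting a block with a fresh cipher invocation avalanches roughly half of them. The toy C sketch below demonstrates this property with a fixed byte array standing in for a real AES-CTR keystream; it illustrates the principle only and is not NVM-shelf's implementation.

```c
/* Toy sketch of the flip-tolerance NVM-shelf exploits: under a reused
 * CTR keystream, C = P XOR KS, so a k-bit plaintext change causes
 * exactly k ciphertext bit flips. The fixed array below stands in for
 * a real AES-CTR keystream. */
#include <stdint.h>
#include <stdio.h>

static int popcount_diff(const uint8_t *a, const uint8_t *b, int len)
{
    int flips = 0;
    for (int i = 0; i < len; i++)
        flips += __builtin_popcount(a[i] ^ b[i]);
    return flips;
}

int main(void)
{
    uint8_t ks[8] = {0x3a, 0x91, 0x5c, 0x07, 0xee, 0x42, 0xb8, 0x6d};
    uint8_t p0[8] = "write-0", p1[8];
    uint8_t c0[8], c1[8];

    /* One-bit plaintext update, e.g. a counter field changing. */
    for (int i = 0; i < 8; i++) p1[i] = p0[i];
    p1[0] ^= 0x01;

    /* Encrypt both versions with the SAME keystream. */
    for (int i = 0; i < 8; i++) {
        c0[i] = p0[i] ^ ks[i];
        c1[i] = p1[i] ^ ks[i];
    }

    printf("plaintext flips:  %d\n", popcount_diff(p0, p1, 8));
    printf("ciphertext flips: %d\n", popcount_diff(c0, c1, 8)); /* identical */
    return 0;
}
```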


2020 ◽  
Vol 245 ◽  
pp. 04037
Author(s):  
Xiaowei Aaron Chu ◽  
Jeff LeFevre ◽  
Aldrin Montana ◽  
Dana Robinson ◽  
Quincey Koziol ◽  
...  

Access libraries such as ROOT[1] and HDF5[2] allow users to interact with datasets using high-level abstractions, like coordinate systems and associated slicing operations. Unfortunately, the implementations of access libraries are based on outdated assumptions about storage system interfaces and are generally unable to fully benefit from modern fast storage devices. For example, access libraries often implement buffering and data layouts that assume large, single-threaded sequential access patterns incur less overall latency than small parallel random accesses: while this is true for spinning media, it is not true for flash media. The situation is getting worse with rapidly evolving storage devices such as non-volatile memory and ever larger datasets. This project explores distributed dataset mapping infrastructures that can integrate and scale out existing access libraries using Ceph’s extensible object model, avoiding re-implementation or modification of these access libraries as much as possible. These programmable storage extensions, coupled with our distributed dataset mapping techniques, enable: 1) access library operations to be offloaded to storage system servers, 2) the independent evolution of access libraries and storage systems, and 3) full use of the existing load balancing, elasticity, and failure management of distributed storage systems like Ceph. They also create more opportunities for storage-server-local optimizations. For example, storage servers might include local key/value stores combined with chunk stores that require different optimizations than a local file system. As storage servers evolve to support new storage devices like non-volatile memory, these server-local optimizations can be implemented while minimizing disruptions to applications. We will report progress on the means by which distributed dataset mapping can be abstracted over particular access libraries, including access libraries for ROOT data, and how we address some of the challenges revolving around data partitioning and composability of access operations.
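As a hedged sketch of the kind of dataset-to-object partitioning such a mapping layer performs, the C fragment below maps a one-dimensional slice onto fixed-size object chunks that could each be read, or offloaded to its server, independently and in parallel; the chunk size and naming scheme are hypothetical, not the project's actual scheme.

```c
/* Sketch: mapping a 1-D dataset slice onto fixed-size storage objects,
 * the kind of partitioning a distributed dataset mapping layer does.
 * Chunk size and object naming are hypothetical. */
#include <stdio.h>

#define CHUNK_ELEMS 1000  /* elements stored per object */

/* Print the object names and intra-object ranges covering [begin, end). */
static void map_slice(const char *dataset, long begin, long end)
{
    for (long chunk = begin / CHUNK_ELEMS; chunk * CHUNK_ELEMS < end; chunk++) {
        long lo = chunk * CHUNK_ELEMS;
        long hi = lo + CHUNK_ELEMS;
        long s = begin > lo ? begin : lo;   /* clamp slice to this chunk */
        long e = end < hi ? end : hi;
        printf("%s.chunk%ld : elements [%ld, %ld)\n",
               dataset, chunk, s - lo, e - lo);
    }
}

int main(void)
{
    map_slice("events/pt", 2500, 4200);  /* spans chunks 2, 3, and 4 */
    return 0;
}
```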


2017 ◽  
Vol 18 (2) ◽  
pp. 220-234 ◽  
Author(s):  
Wen-zhe Zhang ◽  
Kai Lu ◽  
Mikel Luján ◽  
Xiao-ping Wang ◽  
Xu Zhou

Author(s):  
Xianzhang Chen ◽  
Qingfeng Zhuge ◽  
Qiang Sun ◽  
Edwin H.-M. Sha ◽  
Shouzhen Gu ◽  
...  

Author(s):  
Xiongpai Qin ◽  
Yueguo Chen

In the last decade, computer hardware has progressed by leaps and bounds. These advancements include the adoption of multi-core CPUs, the use of GPUs for data-intensive tasks, ever larger main memory capacities, and the maturity and production use of non-volatile memory. Database systems immediately benefit from faster CPUs/GPUs and bigger memory and run faster. However, there are pitfalls. For example, database systems running on multi-core processors may suffer from cache conflicts as the number of concurrently executing DB processes increases. To fully exploit the advantages of new hardware and improve the performance of database systems, database software must be revised to some extent. This chapter introduces some efforts of the database research community in this direction.


Electronics ◽  
2020 ◽  
Vol 9 (11) ◽  
pp. 1913
Author(s):  
Minjong Ha ◽  
Sang-Hoon Kim

Block-based storage devices exhibit different characteristics from main memory, and applications and systems have long been optimized with those characteristics in mind. However, emerging non-volatile memory technologies are about to change the situation. Persistent Memory (PM) provides a huge, persistent, and byte-addressable address space to the system, enabling new opportunities for systems software. Nevertheless, existing applications typically use PM indirectly, as a storage device accessed through file systems. This makes applications and file systems perform unnecessary operations and amplifies I/O traffic, under-utilizing the high performance of PM. In this paper, we make the case for an in-kernel key-value storage service optimized for PM, called InK. While providing data persistence at high performance, InK takes the characteristics of PM into account to guarantee crash consistency. To this end, InK indexes key-value pairs with a B+ tree, which is more efficient on PM. We implemented InK in the Linux kernel and evaluated its performance with the Yahoo! Cloud Serving Benchmark (YCSB) and RocksDB. The evaluation results confirm that InK has advantages over LSM-tree-based key-value store systems in terms of throughput and tail latency.
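Crash consistency on PM is typically built from ordered cache-line flushes and fences, so that a record's payload is durable before the flag that makes it visible; the C sketch below shows this standard publish pattern on x86 (an illustration of the general technique, not InK's actual code).

```c
/* Sketch of the flush-then-fence store ordering that crash-consistent
 * PM structures such as a B+ tree rely on: persist the payload before
 * the flag that makes it visible. Illustrative, not InK's code. */
#include <emmintrin.h>   /* _mm_clflush, _mm_sfence (x86) */
#include <stdint.h>

struct pm_record {
    uint64_t value;
    uint64_t valid;      /* 0 = empty, 1 = committed */
};

static void persist(const void *addr)
{
    _mm_clflush(addr);   /* write the cache line back to PM */
    _mm_sfence();        /* order it before any later stores */
}

/* Crash-consistent publish: if power fails before the final persist, a
 * recovery pass sees valid == 0 and ignores the partial record. */
void pm_commit(struct pm_record *rec, uint64_t v)
{
    rec->value = v;
    persist(&rec->value);   /* payload durable first */
    rec->valid = 1;
    persist(&rec->valid);   /* then the commit flag */
}
```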

