A Novel File-Level Continuous Data Protection System

Continuous Data Protection (CDP) is a data recovery method that can protect file systems against malicious attacks and user mistakes. This paper proposes BCFBS (BUPT Continuous File Backup System), a continuous data protection architecture at the file level. Compared with other approaches, it uses a caching technique to preserve consistency between file versions, thereby speeding up both file-version backup and space recycling. Furthermore, BCFBS combines file-type filtering and adjustment of the backup frequency with incremental backup to compensate for the storage-waste defect of traditional CDP. Experimental results demonstrate that BCFBS can save 50% of storage space.
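
The file-type filtering and backup-frequency adjustment mentioned in the abstract can be illustrated with a minimal sketch. This is not the BCFBS implementation: the class name `BackupPolicy`, its parameters, and the excluded extensions are hypothetical and chosen only to show how the two checks might combine before an incremental backup is triggered.

```python
import os


class BackupPolicy:
    """Hypothetical policy: decides whether a file change yields a new backup version."""

    def __init__(self, excluded_extensions, min_interval_seconds):
        # File types that are never backed up (assumed examples, not from the paper).
        self.excluded_extensions = {ext.lower() for ext in excluded_extensions}
        # Minimum time between two accepted backups of the same file.
        self.min_interval_seconds = min_interval_seconds
        self._last_backup = {}  # path -> timestamp of the last accepted backup

    def should_backup(self, path, now):
        # Step 1: filter by file type.
        ext = os.path.splitext(path)[1].lower()
        if ext in self.excluded_extensions:
            return False
        # Step 2: limit the backup frequency per file.
        last = self._last_backup.get(path)
        if last is not None and now - last < self.min_interval_seconds:
            return False
        self._last_backup[path] = now
        return True


policy = BackupPolicy({".tmp", ".log"}, min_interval_seconds=60)
decisions = [
    policy.should_backup("report.docx", now=0),   # first change: accepted
    policy.should_backup("report.docx", now=30),  # too soon after the last backup
    policy.should_backup("report.docx", now=90),  # interval elapsed: accepted
    policy.should_backup("cache.tmp", now=0),     # excluded file type
]
print(decisions)
```

In a sketch like this, every rejected change is a version that never consumes storage, which is the general idea behind trading recovery granularity for space.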

Incremental Data Recovery Method in Continuous Data Protection System

The Journal of Korean Institute of Information Technology ◽

10.14801/jkiit.2020.18.5.21 ◽

2020 ◽

Vol 18 (5) ◽

pp. 21-29

Author(s):

Jungjoo Moon ◽

Seokil Song

Keyword(s):

Data Protection ◽

Data Recovery ◽

Protection System ◽

Continuous Data ◽

Recovery Method

Boosting the Restoring Performance of Deduplication Data by Classifying Backup Metadata

ACM/IMS Transactions on Data Science ◽

10.1145/3437261 ◽

2021 ◽

Vol 2 (2) ◽

pp. 1-16

Author(s):

Ru Yang ◽

Yuhui Deng ◽

Yi Zhou ◽

Ping Huang

Keyword(s):

State Of The Art ◽

Negative Impact ◽

Storage Systems ◽

Experimental Results ◽

Continuous Data ◽

Storage Space ◽

Data Backup ◽

Salient Features ◽

Storage Characteristics

Restoring data is the main purpose of data backup in storage systems. The fragmentation issue, caused by physically scattering logically continuous data across a variety of disk locations, has a negative impact on the restoring performance of a deduplication system. Rewriting algorithms are used to alleviate the fragmentation problem and thereby improve the restoring speed of a deduplication system. However, rewriting methods sacrifice a large part of the deduplication ratio, leading to a huge waste of storage space. Furthermore, traditional backup approaches treat file metadata and chunk metadata as the same, which causes frequent on-disk metadata accesses. In this article, we start by analyzing the storage characteristics of backup metadata. An intriguing finding shows that with 10 million files, the file metadata takes up only approximately 340 MB. Motivated by this finding, we propose a Classified-Metadata based Restoring method (CMR) that classifies backup metadata into file metadata and chunk metadata. Because the file metadata takes up only a meager amount of space, CMR maintains all file metadata in memory, whereas chunk metadata are aggressively prefetched to memory in a greedy manner. A deduplication system with CMR in place exhibits three salient features: (i) it avoids rewriting algorithms' additional overhead by reducing the number of disk reads in a restoring process, (ii) it increases the restoring throughput without sacrificing the deduplication ratio, and (iii) it thoroughly leverages the hardware resources to boost the restoring performance. To quantitatively evaluate the performance of CMR, we compare it against two state-of-the-art approaches, namely, a history-aware rewriting method (HAR) and a context-based rewriting scheme (CAP). The experimental results show that compared to HAR and CAP, CMR reduces the restoring time by 27.2% and 29.3%, respectively. Moreover, the deduplication ratio is improved by 1.91% and 4.36%, respectively.
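
The split between in-memory file metadata and prefetched chunk metadata can be sketched as follows. This is a simplified illustration under stated assumptions, not the authors' CMR algorithm: the classes `ChunkMetadataStore` and `Restorer`, the fixed-size segments, and the "load the whole segment on a miss" prefetch rule are all hypothetical stand-ins for the greedy prefetching described in the abstract.

```python
class ChunkMetadataStore:
    """Simulated on-disk chunk metadata, grouped into fixed-size segments (assumed layout)."""

    def __init__(self, entries, segment_size):
        # entries: list of (fingerprint, location) pairs in on-disk order.
        self.segments = [
            dict(entries[i:i + segment_size])
            for i in range(0, len(entries), segment_size)
        ]
        self.segment_of = {
            fp: idx for idx, seg in enumerate(self.segments) for fp in seg
        }
        self.disk_reads = 0

    def read_segment(self, fingerprint):
        # One simulated disk read returns the whole segment holding the fingerprint.
        self.disk_reads += 1
        return self.segments[self.segment_of[fingerprint]]


class Restorer:
    def __init__(self, file_metadata, chunk_store):
        self.file_metadata = file_metadata  # small, so kept entirely in memory
        self.chunk_store = chunk_store
        self.chunk_cache = {}               # prefetched chunk metadata

    def locate_chunks(self, filename):
        locations = []
        for fp in self.file_metadata[filename]:
            if fp not in self.chunk_cache:
                # On a miss, prefetch every entry of the segment, not just one.
                self.chunk_cache.update(self.chunk_store.read_segment(fp))
            locations.append(self.chunk_cache[fp])
        return locations


entries = [(f"fp{i}", f"container{i // 4}:offset{i % 4}") for i in range(8)]
store = ChunkMetadataStore(entries, segment_size=4)
restorer = Restorer({"a.txt": ["fp0", "fp1", "fp2", "fp5"]}, store)
locations = restorer.locate_chunks("a.txt")
print(locations, store.disk_reads)
```

Here four chunk lookups cost two simulated disk reads instead of four, because neighbouring entries arrive together with the first miss in each segment.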

Continuous Data Protection as a Strategy for Reduced Data Recovery Time

Journal of Systems Integration ◽

10.20470/jsi.v2i4.102 ◽

2011 ◽

Vol 2 ◽

pp. 54-69 ◽

Cited By ~ 1

Author(s):

Leon Mugoh ◽

Ismail Lukandu Ateya ◽

Bernard Shibwabo Kasamani

Keyword(s):

Data Protection ◽

Recovery Time ◽

Data Recovery ◽

Continuous Data ◽

Reduced Data

Snapshot Method for Continuous Data Protection Systems

Journal of Software ◽

10.3724/sp.j.1001.2011.04048 ◽

2011 ◽

Vol 22 (10) ◽

pp. 2523-2537

Author(s):

Xiao LI ◽

Yu-An TAN ◽

Yuan-Zhang LI

Keyword(s):

Data Protection ◽

Continuous Data ◽

Protection Systems

Design and evaluation of an advanced continuous data level auditing system: A three-layer structure

International Journal of Accounting Information Systems ◽

10.1016/j.accinf.2021.100524 ◽

2021 ◽

Vol 42 ◽

pp. 100524

Author(s):

Kyunghee Yoon ◽

Yue Liu ◽

Tiffany Chiu ◽

Miklos A. Vasarhelyi

Keyword(s):

Layer Structure ◽

Continuous Data ◽

System A ◽

Data Level

Power Data Recovery Method Based on Time Series Model for Understanding the Operation of HVDC Near-zone Assets

2020 IEEE Sustainable Power and Energy Conference (iSPEC) ◽

10.1109/ispec50848.2020.9350989 ◽

2020 ◽

Author(s):

Shipei Zhao ◽

Xiaoyun Wang ◽

Li Liao ◽

Xuan Dai ◽

Xuefei Lu ◽

...

Keyword(s):

Time Series ◽

Time Series Model ◽

Data Recovery ◽

Recovery Method ◽

Near Zone

Thinking of data protection law's subject matter as a complex adaptive system: A heuristic display

Computer Law & Security Review ◽

10.1016/j.clsr.2015.01.007 ◽

2015 ◽

Vol 31 (2) ◽

pp. 201-220 ◽

Cited By ~ 2

Author(s):

Kunbei Zhang ◽

Aernout H.J. Schmidt

Keyword(s):

Data Protection ◽

Subject Matter ◽

Adaptive System ◽

Complex Adaptive System ◽

System A ◽

Complex Adaptive

Continuous Data Protection

Encyclopedia of Database Systems ◽

10.1007/978-1-4614-8265-9_1465 ◽

2018 ◽

pp. 613-614

Author(s):

Kazuhisa Fujimoto

Keyword(s):

Data Protection ◽

Continuous Data

Reliability and Continuous Data Protection, Synchronous Replication

Data Center Storage ◽

10.1201/b10798-20 ◽

2016 ◽

pp. 173-174

Author(s):

Hubbert Smith

Keyword(s):

Data Protection ◽

Continuous Data

A Dockers Storage Performance Evaluation: Impact of Backing File Systems

10.54216/jisiot.030101 ◽

2021 ◽

pp. 8-17

Author(s):

Amer Ramadan ◽

Keyword(s):

Performance Evaluation ◽

Experimental Design ◽

File Systems ◽

Experimental Results ◽

File Server ◽

Storage Performance ◽

File Access ◽

Mail Server ◽

The Impact ◽

E Mail

This paper reports on an in-depth examination of the impact of the backing filesystem on Docker performance in the context of Linux container-based virtualization. The experimental design was a 3x3x4 arrangement, i.e., we considered three different numbers of Docker containers, three filesystems (Ext4, XFS and Btrfs), and four application workloads related to Web server I/O activity, e-mail server I/O activity, file server I/O activity and random file access I/O activity, respectively. The experimental results indicate that Ext4 is the best-performing filesystem among those considered for the given experimental settings. In addition, the XFS filesystem is not suitable for workloads that are dominated by synchronous random write components (e.g., as is characteristic of mail workloads), while the Btrfs filesystem is not suitable for workloads dominated by random write and sequential write components (e.g., file server workloads).
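
The 3x3x4 arrangement amounts to a full factorial design, which can be enumerated in a few lines. The filesystem and workload names come from the abstract; the container counts below are placeholders, since the abstract does not state which three values were used.

```python
from itertools import product

# Placeholder values: the abstract only says "three different numbers of containers".
container_counts = [1, 2, 4]
filesystems = ["Ext4", "XFS", "Btrfs"]
workloads = ["web server", "mail server", "file server", "random file access"]

# Full factorial design: every combination of the three factors.
configurations = list(product(container_counts, filesystems, workloads))

print(len(configurations))
for containers, filesystem, workload in configurations[:3]:
    print(containers, filesystem, workload)
```

The design therefore covers 3 x 3 x 4 = 36 distinct configurations, each of which would need its own benchmark run.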
