Tree-Structured Parallel Regeneration Based on Regenerating Codes for Multiple Data Losses in Distributed Storage Systems

In distributed storage systems, erasure codes represent an attractive data redundancy solution which can provide the same reliability as replication requiring much less storage space. Multiple data losses happens usually and the lost data should be regenerated to maintain data redundancy in distributed storage systems. Regeneration for multiple data losses is expected to be finished as soon as possible, because the regeneration time can influence the data reliability and availability of distributed storage systems. However, multiple data losses is usually regenerated by regenerating single data loss one by one, which brings high entire regeneration time and severely reduces the data reliability and availability of distributed storage systems. In this paper, we propose a tree-structured parallel regeneration scheme based on regenerating codes (TPRORC) for multiple data losses in distributed storage systems. In our scheme, multiple regeneration trees based on regenerating code are constructed. Firstly, these trees are created independently, each of which dose not share any edges from the others and is responsible for one data loss; secondly, every regeneration tree based on regenerating codes owns the least network traffic and bandwidth optimized-paths for regenerating its data loss. Thus it can perform parallel regeneration for multiple data losses by using multiple optimized topology trees, in which network bandwidth is utilized efficiently and entire regeneration is overlapped. Our simulation results show that the tree-structured parallel regeneration scheme reduces the regeneration time significantly, compared to other regular regeneration schemes.

Download Full-text

Repairing Multiple Data Losses by Parallel Max-min Trees Based on Regenerating Codes in Distributed Storage Systems

Algorithms and Architectures for Parallel Processing - Lecture Notes in Computer Science ◽

10.1007/978-3-319-11194-0_25 ◽

2014 ◽

pp. 325-338

Author(s):

Pengfei You ◽

Yuxing Peng ◽

Zhen Huang ◽

Changjian Wang

Keyword(s):

Storage Systems ◽

Distributed Storage ◽

Multiple Data ◽

Distributed Storage Systems ◽

Regenerating Codes

Download Full-text

Pipelined Regeneration with Regenerating Codes for Distributed Storage Systems

2011 International Symposium on Networking Coding ◽

10.1109/isnetcod.2011.5978915 ◽

2011 ◽

Cited By ~ 9

Author(s):

Jun Li ◽

Xin Wang ◽

Baochun Li

Keyword(s):

Storage Systems ◽

Distributed Storage ◽

Distributed Storage Systems ◽

Regenerating Codes

Download Full-text

Cyclic Structure in Regenerating Codes

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.539.416 ◽

2014 ◽

Vol 539 ◽

pp. 416-419

Author(s):

Wen Juan Liang ◽

Ying Du

Keyword(s):

Storage Systems ◽

Distributed Storage ◽

Erasure Codes ◽

Cyclic Structure ◽

Distributed Storage Systems ◽

Regenerating Codes ◽

Original Message

Regenerating codes are a class of erasure codes for distributed storage. The use of regenerating codes not only improves reliability of distributed storage systems, but also minimizes repairing bandwidth when storage nodes failed and need to be repaired. In this paper, we investigate the cyclic structure of hybrid regenerating codes which each node has two fragments with the first fragment stores original message and the second fragment stores parity message. A fast repairing algorithm is also proposed.

Download Full-text

Cooperative repair based on tree structure for multiple failures in distributed storage systems with regenerating codes

Proceedings of the 12th ACM International Conference on Computing Frontiers - CF '15 ◽

10.1145/2742854.2742869 ◽

2015 ◽

Cited By ~ 1

Author(s):

Xiaoqiang Pei ◽

Yijie Wang ◽

Xingkong Ma ◽

Yongquan Fu ◽

Fangliang Xu

Keyword(s):

Storage Systems ◽

Distributed Storage ◽

Tree Structure ◽

Multiple Failures ◽

Distributed Storage Systems ◽

Regenerating Codes

Download Full-text

Tree-structured parallel regeneration for multiple data losses in distributed storage systems based on erasure codes

China Communications ◽

10.1109/cc.2013.6506936 ◽

2013 ◽

Vol 10 (4) ◽

pp. 113-125 ◽

Cited By ~ 17

Author(s):

Sun Weidong ◽

Wang Yijie ◽

Pei Xiaoqiang

Keyword(s):

Storage Systems ◽

Distributed Storage ◽

Erasure Codes ◽

Multiple Data ◽

Distributed Storage Systems

Download Full-text

Asymmetric regenerating codes for heterogeneous distributed storage systems

2018 16th International Symposium on Modeling and Optimization in Mobile, Ad Hoc, and Wireless Networks (WiOpt) ◽

10.23919/wiopt.2018.8362844 ◽

2018 ◽

Author(s):

Shan Qu ◽

Jinbei Zhang ◽

Xinbing Wang

Keyword(s):

Storage Systems ◽

Distributed Storage ◽

Distributed Storage Systems ◽

Regenerating Codes

Download Full-text

Generalised regenerating codes for securing distributed storage systems against eavesdropping

Journal of Information Security and Applications ◽

10.1016/j.jisa.2017.02.002 ◽

2017 ◽

Vol 34 ◽

pp. 225-232

Author(s):

Jian Xu ◽

Yewen Cao ◽

Deqiang Wang

Keyword(s):

Storage Systems ◽

Distributed Storage ◽

Distributed Storage Systems ◽

Regenerating Codes

Download Full-text

Exact minimum-repair-bandwidth cooperative regenerating codes for distributed storage systems

2011 IEEE International Symposium on Information Theory Proceedings ◽

10.1109/isit.2011.6033778 ◽

2011 ◽

Cited By ~ 25

Author(s):

Kenneth W. Shum ◽

Yuchong Hu

Keyword(s):

Storage Systems ◽

Distributed Storage ◽

Distributed Storage Systems ◽

Regenerating Codes

Download Full-text

Multi-Rack Regenerating Codes for Hierarchical Distributed Storage Systems

2018 IEEE International Conference on Communications (ICC) ◽

10.1109/icc.2018.8422112 ◽

2018 ◽

Cited By ~ 1

Author(s):

Shan Qu ◽

Yu Liu ◽

Jinbei Zhang ◽

Haiwen Cao ◽

Xinbing Wang

Keyword(s):

Storage Systems ◽

Distributed Storage ◽

Distributed Storage Systems ◽

Regenerating Codes

Download Full-text

Reliability analysis of distributed storage systems considering data loss and theft

Proceedings of the Institution of Mechanical Engineers Part O Journal of Risk and Reliability ◽

10.1177/1748006x19885508 ◽

2019 ◽

Vol 234 (2) ◽

pp. 303-321

Author(s):

Heping Jia ◽

Rui Peng ◽

Yi Ding ◽

Changzheng Shao

Keyword(s):

System Reliability ◽

Storage Systems ◽

Distributed Storage ◽

System Structure ◽

Time To Failure ◽

Data Loss ◽

Exponential Distributions ◽

Basic Model ◽

Distributed Storage Systems ◽

Dependence Property

With the advancement of cloud computing and internet of things, data are usually stored on distributed computers and these data may risk being lost or stolen. In this article, we consider a common case where the entirety of the data is partitioned into several parts and each data part can be allocated to one or more computers. In the case where a computer fails, all the data parts on it are lost. Before the failure of any computer, the data parts may also be stolen by hackers. The basic model of computer failure and computer intrusion resulting in the theft of all the data parts on the computer is considered first. Then, the case is extended to a general model where computer failure, as well as data part corruption and theft caused by hacking are embedded. It is essential to study the reliability of distributed storage systems considering both data loss and data theft, which can be a basis for decision making on system structure optimization. In this article, a multi-valued decision diagram–based approach is developed to quantitatively evaluate system reliability for both models considering the time-dependence property of sequential events. The proposed method is applicable to systems where the random time to failure, theft, or corruption follows arbitrary distributions including the commonly used exponential distributions. Illustrative examples are provided to validate the proposed method.

Download Full-text