Merkle Hash Tree Based Deduplication in Cloud Storage

2014 ◽  
Vol 556-562 ◽  
pp. 6223-6227 ◽  
Author(s):  
Chao Ling Li ◽  
Yue Chen

To deduplicate sensitive data in a cloud storage center, a scheme called MHT-Dedup, based on the Merkle Hash Tree (MHT), is proposed. It achieves cross-user file-level and local block-level client-side deduplication concurrently. The scheme first encrypts the file at block granularity, then authenticates the file ciphertext through a Proofs-of-oWnership (PoW) protocol to find duplicate files, and checks the hashes of block plaintexts to find duplicate blocks. In the PoW protocol of MHT-Dedup, an authenticating binary tree is generated from the tags of the encrypted blocks so that duplicate files can be identified with assurance. MHT-Dedup resolves the conflict between data deduplication and encryption, achieves file-level and block-level deduplication concurrently, prevents users from misusing the storage system, resists inside and outside attacks on data confidentiality, and defends against target-collision attacks on files and brute-force attacks on blocks.
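
As a rough illustration of the authenticating binary tree described above, the sketch below builds a Merkle root from the tags of encrypted blocks; the SHA-256 tags and the duplicate-last-node rule for odd levels are assumptions, since the abstract does not fix these details.

```python
import hashlib

def block_tag(ciphertext_block: bytes) -> bytes:
    """Leaf of the authenticating tree: the tag of one encrypted block."""
    return hashlib.sha256(ciphertext_block).digest()

def merkle_root(tags: list[bytes]) -> bytes:
    """Fold the block tags pairwise until a single root remains."""
    if not tags:
        raise ValueError("need at least one block tag")
    level = list(tags)
    while len(level) > 1:
        if len(level) % 2:                     # odd level: duplicate the last node
            level.append(level[-1])
        level = [hashlib.sha256(level[i] + level[i + 1]).digest()
                 for i in range(0, len(level), 2)]
    return level[0]
```

In a PoW exchange built on such a tree, the verifier keeps only the root and challenges the prover for the sibling path of randomly chosen leaves, rehashing up to the root to confirm ownership.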

Author(s):  
Jiangjiang Wu ◽  
Cong Liu ◽  
Jun Ma ◽  
Yong Cheng ◽  
Jiangchun Ren ◽  
...  

Cloud computing is well known today for its enormous data storage capacity and fast access to information over the network. It gives an individual user virtually unlimited storage space, with data available and accessible anytime, anywhere. A cloud service provider can make better use of its storage by incorporating data deduplication, which removes the redundant and replicated data that arises in a cloud environment. This paper presents a literature survey of the deduplication techniques that have been built on cloud data storage. To better ensure secure deduplication in the cloud, the paper examines both file-level and block-level data deduplication, as sketched below.
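
To make the file-level versus block-level distinction the survey draws concrete, here is a minimal sketch of the two fingerprinting granularities; the SHA-256 digest and the fixed 4 KiB block size are illustrative assumptions.

```python
import hashlib

def file_fingerprint(data: bytes) -> str:
    """File-level dedup: one fingerprint per file, so only whole-file
    duplicates are ever detected."""
    return hashlib.sha256(data).hexdigest()

def block_fingerprints(data: bytes, block_size: int = 4096) -> list[str]:
    """Block-level dedup: one fingerprint per fixed-size block, so two
    files that share only part of their content still share storage
    for the common blocks."""
    return [hashlib.sha256(data[i:i + block_size]).hexdigest()
            for i in range(0, len(data), block_size)]
```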


Author(s):  
Anil Kumar G. ◽  
Shantala C. P.

Owing to the highly distributed nature of cloud storage systems, incorporating a high degree of security for vulnerable data is a challenging task. Among the various security concerns, data privacy remains one of the unsolved problems in this regard. The prime reason is that existing data-privacy approaches do not offer data integrity and secure data deduplication at the same time, both of which are essential to ensure strong resistance against all forms of dynamic threats over cloud and internet systems. Data integrity and data deduplication are therefore associated phenomena that influence data privacy. This manuscript discusses the explicit research contributions toward data integrity, data privacy, and data deduplication. It also highlights the potential open research issues, followed by a discussion of possible future directions for addressing the existing problems.


2018 ◽  
Vol 10 (2) ◽  
pp. 70-89 ◽  
Author(s):  
Jun Li ◽  
Mengshu Hou

This article describes how deduplication technology is introduced into cloud storage to reduce the amount of stored data. With this technology, duplicated data can be eliminated and users can reduce their storage requirements. However, deduplication also increases the risk of data unavailability, since many files may depend on a single stored chunk. To solve this problem, the authors propose a method to improve data availability in the deduplication storage system. Based on each data chunk's reference count and access frequency, it adds redundant information for the data chunks, ensuring data availability while minimizing storage overhead. Extensive experiments are conducted to evaluate the effectiveness of the improved method, with WFD, CDC, and sliding-block deduplication used for comparison. The experimental results show that the proposed method achieves higher data availability than the conventional method while adding little storage overhead.
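
The abstract does not give the exact formula combining reference count and access frequency, so the following is only a plausible sketch of such a redundancy policy, with hypothetical logarithmic weighting and a hypothetical cap of four replicas.

```python
import math

def replica_count(ref_count: int, access_freq: float,
                  min_replicas: int = 1, max_replicas: int = 4) -> int:
    """Heuristic redundancy policy: chunks that many files reference,
    or that are read often, receive extra copies; cold, rarely shared
    chunks keep a single copy to limit storage overhead."""
    score = math.log2(1 + ref_count) + math.log2(1 + access_freq)
    extra = int(score // 2)                    # damp growth in the score
    return max(min_replicas, min(min_replicas + extra, max_replicas))
```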


2019 ◽  
Vol 30 (04) ◽  
pp. 551-570 ◽  
Author(s):  
Wenjuan Meng ◽  
Jianhua Ge ◽  
Tao Jiang

A cloud storage system that incorporates both secure deletion and deduplication will have security and efficiency advantages over existing solutions that provide only one of them. However, the security models of secure data deletion and data deduplication are not compatible with each other, which causes security and efficiency vulnerabilities under coercive adversaries. To solve these challenges, we define and construct a scheme whose security relies on the proper erasure of keys in the wrapped key tree and on periodic updates of the deduplication encryption keys. Moreover, we enhance the efficiency of the proposed scheme by introducing incremental data updates, in which only the changed part of the data is encrypted/decrypted and uploaded/downloaded. Further security analysis shows that the proposed scheme is secure against coercive attacks. Finally, a practical implementation shows that our scheme is efficient in computation, storage, and communication for both the cloud storage server and its users.
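
A two-level fragment of the wrapped-key idea is sketched below, assuming symmetric keys (Fernet from the `cryptography` package stands in for whatever cipher the paper actually uses): because the file key is stored only wrapped under the master key, erasing the master key renders the ciphertext unrecoverable, which is the deletion guarantee the scheme builds on.

```python
from cryptography.fernet import Fernet

# The master key is held by the user; the file key is stored in the
# cloud only in wrapped (encrypted) form, next to the ciphertext.
master_key = Fernet.generate_key()
file_key = Fernet.generate_key()

wrapped_file_key = Fernet(master_key).encrypt(file_key)
ciphertext = Fernet(file_key).encrypt(b"file contents")

# Normal read: unwrap the file key, then decrypt the data.
plaintext = Fernet(Fernet(master_key).decrypt(wrapped_file_key)).decrypt(ciphertext)
assert plaintext == b"file contents"

# Secure deletion: erase master_key. The wrapped file key can no longer
# be unwrapped, so the ciphertext is unrecoverable even if the cloud
# server retains every byte it was given.
```

A full wrapped key tree repeats this relationship level by level, so erasing one interior key disposes of an entire subtree of files at once.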


IJARCCE ◽  
2017 ◽  
Vol 6 (4) ◽  
pp. 316-323 ◽
Author(s):  
Bhos Komal ◽  
Ingale Karuna ◽  
Hattikatti Susmita ◽  
Jadhav Sachin ◽  
Mirajkar SS ◽  
...  

Cloud computing is an efficient technology that provides storage for huge amounts of data files with security. However, the content owner cannot control data access by unauthorized clients, nor control how the data is stored and used. Some previous approaches support data access control alongside data deduplication for cloud storage systems, but encrypted data is not handled effectively by current industrial deduplication solutions: the deduplication is unguarded against brute-force attacks and fails to support data access control. Data deduplication, a widely used and efficient data-confining technique, eliminates multiple copies of redundant data; it reduces the space needed to store the data and thus saves bandwidth. To overcome the above problems, an Efficient Content Discovery and Preserving Deduplication (ECDPD) algorithm was proposed that detects file-range and block-range duplicates when storing data files in the cloud storage system. Data access control is actively supported by ECDPD. Experimental evaluations show that the proposed ECDPD method reduces Data Uploading Time (DUT) by 3.802 milliseconds and Data Downloading Time (DDT) by 3.318 milliseconds compared with existing approaches.


Webology ◽  
2021 ◽  
Vol 18 (Special Issue 01) ◽  
pp. 288-301 ◽
Author(s):  
G. Sujatha ◽  
Dr. Jeberson Retna Raj

Data storage is one of the most significant cloud services available to cloud users. Since the amount of outsourced information grows extremely quickly, data deduplication techniques need to be implemented in the cloud storage space for efficient utilization. The cloud storage space supports all kinds of digital data, such as text, audio, video, and images. In a hash-based deduplication system, a cryptographic hash value is calculated for every data item, irrespective of its type, and stored in memory for future reference; duplicate copies are identified using these hash values alone. The problem in this existing scenario is the size of the hash table: in the worst case, finding a duplicate copy requires checking every hash value, irrespective of data type, and a single hash-table structure does not suit every kind of digital data. In this study, we propose an approach that maintains multiple hash tables for the different kinds of digital data. Having a dedicated hash table for each data type improves the search time for duplicate data.
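
A minimal sketch of the proposed layout follows; the four media types and the use of plain in-memory dictionaries are assumptions for illustration, since the study does not prescribe a concrete structure here.

```python
import hashlib

# One hash table per media type, so a duplicate lookup only scans
# fingerprints of the same kind of data instead of the whole store.
hash_tables: dict[str, dict[str, str]] = {
    "text": {}, "audio": {}, "video": {}, "image": {},
}

def store(data: bytes, media_type: str, object_id: str) -> str | None:
    """Return the id of an existing duplicate, or record a new entry."""
    fingerprint = hashlib.sha256(data).hexdigest()
    table = hash_tables[media_type]
    if fingerprint in table:
        return table[fingerprint]      # duplicate found: keep a single copy
    table[fingerprint] = object_id
    return None
```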


Author(s):  
Shynu P. G. ◽  
Nadesh R. K. ◽  
Varun G. Menon ◽  
Venu P. ◽  
Mahdi Abbasi ◽  
...  

Data redundancy is a significant issue that wastes a great deal of storage space in integrated cloud-fog storage environments. Most current techniques, which mainly target static scenes such as backup and archive systems, are not appropriate because of the dynamic nature of data in the cloud or in integrated cloud environments. This problem can be effectively reduced and successfully managed by data deduplication techniques, which eliminate duplicate data in cloud storage systems. Implementing data deduplication (DD) over encrypted data is a significant challenge in an integrated cloud-fog storage and computing environment, where storage must be optimized in a highly secure manner. This paper develops a new method using Convergent Encryption and a Modified Elliptic Curve Cryptography (MECC) algorithm over the cloud and fog environment to construct secure deduplication systems. The proposed method focuses on the two most important goals of such systems: on one side, the redundancy of data needs to be reduced to its minimum; on the other, a robust encryption approach must ensure the security of the data. The technique is well suited to operations such as a user uploading new files to fog or cloud storage: the file is first encrypted using the Convergent Encryption (CE) technique and then re-encrypted using the MECC algorithm. The proposed method recognizes data redundancy at the block level, reducing redundancy more effectively. Testing results show that the proposed approach outperforms several state-of-the-art methods in computational efficiency and security level.
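
Only the first, convergent layer of the two-pass design lends itself to a compact sketch; the MECC re-encryption step depends on curve details the abstract does not give. The sketch below assumes AES-GCM as the underlying cipher and a nonce derived deterministically from the key, both illustrative choices.

```python
import hashlib
from cryptography.hazmat.primitives.ciphers.aead import AESGCM

def convergent_encrypt(block: bytes) -> tuple[bytes, bytes]:
    """Convergent Encryption: the key is derived from the content, so
    identical blocks produce identical ciphertexts that the storage
    layer can deduplicate without ever seeing the plaintext."""
    key = hashlib.sha256(block).digest()        # content-derived 256-bit key
    nonce = hashlib.sha256(key).digest()[:12]   # deterministic nonce, acceptable
                                                # here only because the key is
                                                # unique per content
    return key, AESGCM(key).encrypt(nonce, block, None)

# Identical plaintext blocks yield byte-identical ciphertexts:
k1, c1 = convergent_encrypt(b"same block")
k2, c2 = convergent_encrypt(b"same block")
assert c1 == c2 and k1 == k2
```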

