data placement Latest Research Papers

Tree-structured data placement scheme with cluster-aided top-down transmission in erasure-coded distributed storage systems

Computer Networks ◽

10.1016/j.comnet.2021.108714 ◽

2022 ◽

pp. 108714

Author(s):

Anan Zhou ◽

Benshun Yi ◽

Laigan Luo

Keyword(s):

Storage Systems ◽

Distributed Storage ◽

Data Placement ◽

Structured Data ◽

Top Down ◽

Distributed Storage Systems

Optimal Data Placement and Replication Approach for SIoT with Edge

Computer Systems Science and Engineering ◽

10.32604/csse.2022.019507 ◽

2022 ◽

Vol 41 (2) ◽

pp. 661-676

Author(s):

B. Prabhu Shankar ◽

S. Chitra

Keyword(s):

Data Placement

Efficient Key-Value Data Placement for ZNS SSD

Applied Sciences ◽

10.3390/app112411842 ◽

2021 ◽

Vol 11 (24) ◽

pp. 11842

Author(s):

Gijun Oh ◽

Junseok Yang ◽

Sungyong Ahn

Keyword(s):

Performance Evaluation ◽

Solid State ◽

Performance Improvement ◽

Garbage Collection ◽

Data Placement ◽

Input Output ◽

Solid State Drive ◽

Space Efficiency ◽

Output Performance

Log-structured merge-tree (LSM-Tree)-based key–value stores are attracting attention for their high I/O (Input/Output) performance due to their sequential write characteristics. However, excessive writes caused by compaction shorten the lifespan of the Solid-state Drive (SSD). Therefore, there are several studies aimed at reducing garbage collection overhead by using Zoned Namespace ZNS; SSD in which the host can determine data placement. However, the existing studies have limitations in terms of performance improvement because the lifetime and hotness of key–value data are not considered. Therefore, in this paper, we propose a technique to minimize the space efficiency and garbage collection overhead of SSDs by arranging them according to the characteristics of key–value data. The proposed method was implemented by modifying ZenFS of RocksDB and, according to the result of the performance evaluation, the space efficiency could be improved by up to 75%.

A Case for Splitting a File for Data Placement in a Distributed Scientific Workflow

10.1109/iemcon53756.2021.9623232 ◽

2021 ◽

Author(s):

Hindol Bhattacharya ◽

Matangini Chattopadhyay ◽

Samiran Chattopadhay

Keyword(s):

Data Placement ◽

Scientific Workflow

An Intelligent Approach to Resource Allocation on Heterogeneous Cloud Infrastructures

Applied Sciences ◽

10.3390/app11219940 ◽

2021 ◽

Vol 11 (21) ◽

pp. 9940

Author(s):

Jack Marquez ◽

Oscar H. Mondragon ◽

Juan D. Gonzalez

Keyword(s):

Resource Allocation ◽

Virtual Machine ◽

Virtual Machines ◽

Service Providers ◽

Cloud Service ◽

Data Placement ◽

Memory Access ◽

Heterogeneous Hardware ◽

Machine Allocation ◽

Virtual Machine Allocation

Cloud computing systems are rapidly evolving toward multicloud architectures supported on heterogeneous hardware. Cloud service providers are widely offering different types of storage infrastructures and multi-NUMA architecture servers. Existing cloud resource allocation solutions do not comprehensively consider this heterogeneous infrastructure. In this study, we present a novel approach comprised of a hierarchical framework based on genetic programming to solve problems related to data placement and virtual machine allocation for analytics applications running on heterogeneous hardware with a variety of storage types and nonuniform memory access. Our approach optimizes data placement using the Hadoop File System on heterogeneous storage devices on multicloud systems. It guarantees the efficient allocation of virtual machines on physical machines with multiple NUMA (nonuniform memory access) domains by minimizing contention between workloads. We prove that our solutions for data placement and virtual machine allocation outperform other state-of-the-art approaches.

A machine learning assisted data placement mechanism for hybrid storage systems

Journal of Systems Architecture ◽

10.1016/j.sysarc.2021.102295 ◽

2021 ◽

pp. 102295

Author(s):

Jinting Ren ◽

Xianzhang Chen ◽

Duo Liu ◽

Yujuan Tan ◽

Moming Duan ◽

...

Keyword(s):

Machine Learning ◽

Storage Systems ◽

Data Placement ◽

Hybrid Storage

PoBery: Possibly-complete Big Data Queries with Probabilistic Data Placement and Scanning

ACM/IMS Transactions on Data Science ◽

10.1145/3465375 ◽

2021 ◽

Vol 2 (3) ◽

pp. 1-28

Author(s):

Jie Song ◽

Qiang He ◽

Feifei Chen ◽

Ye Yuan ◽

Ge Yu

Keyword(s):

Big Data ◽

Query Processing ◽

State Of The Art ◽

Data Placement ◽

Probabilistic Data ◽

Trade Off ◽

Query Performance ◽

Data Query ◽

Query Efficiency ◽

The Given

In big data query processing, there is a trade-off between query accuracy and query efficiency, for example, sampling query approaches trade-off query completeness for efficiency. In this article, we argue that query performance can be significantly improved by slightly losing the possibility of query completeness, that is, the chance that a query is complete. To quantify the possibility, we define a new concept, Probability of query Completeness (hereinafter referred to as PC). For example, If a query is executed 100 times, PC = 0.95 guarantees that there are no more than 5 incomplete results among 100 results. Leveraging the probabilistic data placement and scanning, we trade off PC for query performance. In the article, we propose PoBery (POssibly-complete Big data quERY), a method that supports neither complete queries nor incomplete queries, but possibly-complete queries. The experimental results conducted on HiBench prove that PoBery can significantly accelerate queries while ensuring the PC. Specifically, it is guaranteed that the percentage of complete queries is larger than the given PC confidence. Through comparison with state-of-the-art key-value stores, we show that while Drill-based PoBery performs as fast as Drill on complete queries, it is 1.7 ×, 1.1 ×, and 1.5 × faster on average than Drill, Impala, and Hive, respectively, on possibly-complete queries.

Multi-objective Optimization of Data Placement in a Storage-as-a-Service Federated Cloud

ACM Transactions on Storage ◽

10.1145/3452741 ◽

2021 ◽

Vol 17 (3) ◽

pp. 1-32

Author(s):

Amina Chikhaoui ◽

Laurent Lemarchand ◽

Kamel Boukhalfa ◽

Jalil Boukhobza

Keyword(s):

Execution Time ◽

Service Providers ◽

Data Placement ◽

Exact Method ◽

Initial Population ◽

Multi Objective Optimization ◽

Multi Objective ◽

Large Sets ◽

Proposed Model ◽

Injection Function

Cloud federation enables service providers to collaborate to provide better services to customers. For cloud storage services, optimizing customer object placement for a member of a federation is a real challenge. Storage, migration, and latency costs need to be considered. These costs are contradictory in some cases. In this article, we modeled object placement as a multi-objective optimization problem. The proposed model takes into account parameters related to the local infrastructure, the federated environment, customer workloads, and their SLAs. For resolving this problem, we propose CDP-NSGAII IR , a Constraint Data Placement matheuristic based on NSGAII with Injection and Repair functions. The injection function aims to enhance the solutions’ quality. It consists to calculate some solutions using an exact method then inject them into the initial population of NSGAII. The repair function ensures that the solutions obey the problem constraints and so prevents from exploring large sets of unfeasible solutions. It reduces drastically the execution time of NSGAII. Experimental results show that the injection function improves the HV of NSGAII and the exact method by up to 94% and 60%, respectively, while the repair function reduces the execution time by an average of 68%.

Optimal data placement strategy considering capacity limitation and load balancing in geographically distributed cloud

Future Generation Computer Systems ◽

10.1016/j.future.2021.08.014 ◽

2021 ◽

Author(s):

Chunlin Li ◽

Qianqian Cai ◽

Luo Youlong

Keyword(s):

Load Balancing ◽

Data Placement ◽

Capacity Limitation ◽

Geographically Distributed ◽

Distributed Cloud

Space-efficient Graph Data Placement to Save Energy of ReRAM Crossbar

2021 IEEE/ACM International Symposium on Low Power Electronics and Design (ISLPED) ◽

10.1109/islped52811.2021.9502482 ◽

2021 ◽

Author(s):

Ting-Shan Lo ◽

Chun-Feng Wu ◽

Yuan-Hao Chang ◽

Tei-Wei Kuo ◽

Wei-Chen Wang

Keyword(s):

Data Placement ◽

Graph Data ◽

Save Energy

data placement
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Tree-structured data placement scheme with cluster-aided top-down transmission in erasure-coded distributed storage systems

Optimal Data Placement and Replication Approach for SIoT with Edge

Efficient Key-Value Data Placement for ZNS SSD

A Case for Splitting a File for Data Placement in a Distributed Scientific Workflow

An Intelligent Approach to Resource Allocation on Heterogeneous Cloud Infrastructures

A machine learning assisted data placement mechanism for hybrid storage systems

PoBery: Possibly-complete Big Data Queries with Probabilistic Data Placement and Scanning

Multi-objective Optimization of Data Placement in a Storage-as-a-Service Federated Cloud

Optimal data placement strategy considering capacity limitation and load balancing in geographically distributed cloud

Space-efficient Graph Data Placement to Save Energy of ReRAM Crossbar

Export Citation Format

data placementRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Tree-structured data placement scheme with cluster-aided top-down transmission in erasure-coded distributed storage systems

Optimal Data Placement and Replication Approach for SIoT with Edge

Efficient Key-Value Data Placement for ZNS SSD

A Case for Splitting a File for Data Placement in a Distributed Scientific Workflow

An Intelligent Approach to Resource Allocation on Heterogeneous Cloud Infrastructures

A machine learning assisted data placement mechanism for hybrid storage systems

PoBery: Possibly-complete Big Data Queries with Probabilistic Data Placement and Scanning

Multi-objective Optimization of Data Placement in a Storage-as-a-Service Federated Cloud

Optimal data placement strategy considering capacity limitation and load balancing in geographically distributed cloud

Space-efficient Graph Data Placement to Save Energy of ReRAM Crossbar

data placement
Recently Published Documents