Predictive File Replication on the Data Grids

Most replication methods either monitor the popularity of files or use complicated functions to calculate the overall cost of whether or not a replication decision or a deletion decision should be issued. However, once the replication decision is issued, the popularity of the files is changed and may have already impacted access latency and resource usage. This article proposes a decision-tree-based predictive file replication strategy that forecasts files’ future popularity based on their characteristics on the Grids. The proposed strategy has shown superb performance in terms of mean job time and effective network usage compared with the other two replication strategies, LRU and Economic under OptorSim simulation environment.

Download Full-text

A Two-Level Fuzzy Value-Based Replica Replacement Algorithm in Data Grids

International Journal of Grid and High Performance Computing ◽

10.4018/ijghpc.2016100105 ◽

2016 ◽

Vol 8 (4) ◽

pp. 78-99 ◽

Cited By ~ 3

Author(s):

Nazanin Saadat ◽

Amir Masoud Rahmani

Keyword(s):

Data Grid ◽

Data Availability ◽

Distributed Data ◽

Similar Data ◽

Data Grids ◽

Replacement Algorithm ◽

Minimum Latency ◽

Network Usage ◽

Effective Network ◽

And Storage

One of the challenges of data grid is to access widely distributed data fast and efficiently and providing maximum data availability with minimum latency. Data replication is an efficient way used to address this challenge by replicating and storing replicas, making it possible to access similar data in different locations of the data grid and can shorten the time of getting the files. However, as the number and storage size of grid sites is limited and restricted, an optimized and effective replacement algorithm is needed to improve the efficiency of replication. In this paper, the authors propose a novel two-level replacement algorithm which uses Fuzzy Replica Preserving Value Evaluator System (FRPVES) for evaluating the value of each replica. The algorithm was tested using a grid simulator, OptorSim developed by European Data Grid projects. Results from simulation procedure show that the authors' proposed algorithm has better performance in comparison with other algorithms in terms of job execution time, total number of replications and effective network usage.

Download Full-text

A dynamic file replication strategy in data grids

TENCON 2007 - 2007 IEEE Region 10 Conference ◽

10.1109/tencon.2007.4428908 ◽

2007 ◽

Cited By ~ 1

Author(s):

Chao-Tung Yang ◽

Chun-Pin Fu ◽

Chien-Jung Huang

Keyword(s):

Data Grids ◽

Replication Strategy ◽

File Replication

Download Full-text

An Intuitionistic Fuzzy Based Novel Approach to CPU Scheduler

Current Medical Imaging Formerly Current Medical Imaging Reviews ◽

10.2174/1573405614666180903120708 ◽

2020 ◽

Vol 16 (4) ◽

pp. 316-328

Author(s):

Supriya Raheja

Keyword(s):

Fuzzy Inference ◽

The Other ◽

Simulation Environment ◽

Inference System ◽

Intuitionistic Fuzzy ◽

Novel Approach ◽

Continuous Feedback ◽

Simulation Results ◽

Effectiveness And Efficiency ◽

Unique Capability

Background: The extension of CPU schedulers with fuzzy has been ascertained better because of its unique capability of handling imprecise information. Though, other generalized forms of fuzzy can be used which can further extend the performance of the scheduler. Objectives: This paper introduces a novel approach to design an intuitionistic fuzzy inference system for CPU scheduler. Methods: The proposed inference system is implemented with a priority scheduler. The proposed scheduler has the ability to dynamically handle the impreciseness of both priority and estimated execution time. It also makes the system adaptive based on the continuous feedback. The proposed scheduler is also capable enough to schedule the tasks according to dynamically generated priority. To demonstrate the performance of proposed scheduler, a simulation environment has been implemented and the performance of proposed scheduler is compared with the other three baseline schedulers (conventional priority scheduler, fuzzy based priority scheduler and vague based priority scheduler). Results: Proposed scheduler is also compared with the shortest job first CPU scheduler as it is known to be an optimized solution for the schedulers. Conclusion: Simulation results prove the effectiveness and efficiency of intuitionistic fuzzy based priority scheduler. Moreover, it provides optimised results as its results are comparable to the results of shortest job first.

Download Full-text

Design and Performance Analysis of File Replication Strategy on Distributed File System Using GridSim

ICT and Critical Infrastructure: Proceedings of the 48th Annual Convention of Computer Society of India- Vol I - Advances in Intelligent Systems and Computing ◽

10.1007/978-3-319-03107-1_69 ◽

2014 ◽

pp. 629-637

Author(s):

Nirmal Singh ◽

Sarbjeet Singh

Keyword(s):

Performance Analysis ◽

File System ◽

Distributed File System ◽

Replication Strategy ◽

File Replication ◽

And Performance

Download Full-text

Policy Driven Negotiation to Improve the QoS in Data Grid

Encyclopedia of E-Business Development and Management in the Global Economy ◽

10.4018/978-1-61520-611-7.ch105 ◽

2010 ◽

pp. 1041-1056

Author(s):

Ghalem Belalem

Keyword(s):

Large Scale ◽

Data Access ◽

Data Grid ◽

Management Service ◽

Replica Placement ◽

Data Grids ◽

Large Scale Systems ◽

Consistency Management ◽

Access Latency ◽

Replicated Data

Data grids have become an interesting and popular domain in grid community (Foster and Kesselmann, 2004). Generally, the grids are proposed as solutions for large scale systems, where data replication is a well-known technique used to reduce access latency and bandwidth, and increase availability. In splitting of the advantages of replication, there are many problems that should be solved such as, • The replica placement that determines the optimal locations of replicated data in order to reduce the storage cost and data access (Xu et al, 2002); • The problem of determining which replica will be accessed to in terms of consistency when we need to execute a read or write operation (Ranganathan and Foster, 2001); • The problem of degree of replication which consists in finding a minimal number of replicas without reducing the performance of user applications; • The problem of replica consistency that concerns the consistency of a set of replicated data. This consistency provides a completely coherent view of all the replicas for a user (Gray et al 1996). Our principal aim, in this article, is to integrate into consistency management service, an approach based on an economic model for resolving conflicts detected in the data grid.

Download Full-text

A Two-Level Fuzzy Value-Based Replica Replacement Algorithm in Data Grids

Fuzzy Systems ◽

10.4018/978-1-5225-1908-9.ch023 ◽

2017 ◽

pp. 516-539

Author(s):

Nazanin Saadat ◽

Amir Masoud Rahmani

Keyword(s):

Data Grid ◽

Data Availability ◽

Distributed Data ◽

Similar Data ◽

Replacement Algorithm ◽

Minimum Latency ◽

Network Usage ◽

Effective Network ◽

And Storage ◽

Replica Replacement

One of the challenges of data grid is to access widely distributed data fast and efficiently and providing maximum data availability with minimum latency. Data replication is an efficient way used to address this challenge by replicating and storing replicas, making it possible to access similar data in different locations of the data grid and can shorten the time of getting the files. However, as the number and storage size of grid sites is limited and restricted, an optimized and effective replacement algorithm is needed to improve the efficiency of replication. In this paper, the authors propose a novel two-level replacement algorithm which uses Fuzzy Replica Preserving Value Evaluator System (FRPVES) for evaluating the value of each replica. The algorithm was tested using a grid simulator, OptorSim developed by European Data Grid projects. Results from simulation procedure show that the authors' proposed algorithm has better performance in comparison with other algorithms in terms of job execution time, total number of replications and effective network usage.

Download Full-text

Performance of File Replication Policies for Real-time File Access in Data Grids

Proceedings of the 1st International ICST Conference on Networks for Grid Applications ◽

10.4108/gridnets.2007.2235 ◽

2007 ◽

Cited By ~ 2

Author(s):

Atakan Dogan

Keyword(s):

Real Time ◽

Data Grids ◽

File Access ◽

File Replication

Download Full-text

A novel astrophysics-based framework for prediction of binding affinity of glucose binder

Modern Physics Letters B ◽

10.1142/s0217984920503467 ◽

2020 ◽

Vol 34 (31) ◽

pp. 2050346

Author(s):

Rajesh Kondabala ◽

Vijay Kumar ◽

Amjad Ali ◽

Manjit Kaur

Keyword(s):

Decision Tree ◽

Binding Affinity ◽

Cross Validation ◽

Learning Strategy ◽

Experimental Results ◽

The Other ◽

Computational Time ◽

Glucose Binding ◽

Regression Algorithms

In this paper, a novel astrophysics-based prediction framework is developed for estimating the binding affinity of a glucose binder. The proposed framework utilizes the molecule properties for predicting the binding affinity. It also uses the astrophysics-learning strategy that incorporates the concepts of Kepler’s law during the prediction process. The proposed framework is compared with 10 regression algorithms over ZINC dataset. Experimental results reveal that the proposed framework provides 99.30% accuracy of predicting binding affinity. However, decision tree provides the prediction with 97.14% accuracy. Cross-validation results show that the proposed framework provides better accuracy than the other existing models. The developed framework enables researchers to screen glucose binder rapidly. It also reduces computational time for designing small glucose binding molecule.

Download Full-text

A replication and migration strategy on the hierarchical architecture in the fog computing environment

Multiagent and Grid Systems ◽

10.3233/mgs-200333 ◽

2020 ◽

Vol 16 (3) ◽

pp. 291-307

Author(s):

Ahmed Berkennou ◽

Ghalem Belalem ◽

Said Limam

Keyword(s):

Fog Computing ◽

Hierarchical Architecture ◽

Computing Environment ◽

Huge Amount ◽

Migration Strategy ◽

Replication Strategy ◽

Promising Solution ◽

Network Usage ◽

And Migration ◽

Do So

Connecting objects have increasingly become popular in recent years, leading to the connection of more than 50 billion objects by the end of 2020. This large number of objects will generate a huge amount of data that is currently being processed and stored in the cloud. Fog Computing presents a promising solution to the problems of high latency and huge network traffic encountered in the cloud. As Fog’s infrastructures are dense, heterogeneous and geo-distributed, managing the data in order to satisfy users demand in such context is very complicated. In this work, we propose a data management strategy called ‘RMS-HaFC’ in which we consider the characteristics of Fog Computing environment. To do so, we proposed a hierarchical multi-layer model, on which we designed a migration and replication strategy based on data popularity. These strategies duplicate files dynamically and store them in different locations to improve the response time of users requests and minimize the system energy consumption without loading network usage. The strategy was evaluated using the iFogSim simulator and the experimental results obtained are very promising.

Download Full-text