rollback recovery Latest Research Papers

For the purpose of high performance computation, several machines are developed at an exascale level. These machines can perform at least one exaflop calculations per second, which corresponds to a billion billon or 108. The universe and nature can be understood in a better manner while addressing certain challenging computational issues by using these machines. However, certain obstacles are faced by these machines. As huge quantity of components is encompassed in the exascale machines, frequent failure may be experienced and also the resilience may be challenging. High progress rate must be maintained for the applications by incorporating certain form of fault tolerance in the system. Power management has to be performed by incorporating the system in a parallel manner. All layers inclusive of fault tolerance layer must adhere to the power limitation in the system. Huge energy bills may be expected on installation of exascale machines due to the high power consumption. For various fault tolerance models, the energy profile must be analyzed. Parallel recovery, message-logging, and restart or checkpoint fault tolerance models for rollback recovery are evaluated in this paper. For execution with failure, the most energy efficient solution is provided by parallel recovery when programs with various programming models are used. The execution is performed faster with parallel recovery when compared to the other techniques. An analytical model is used for exploring these models and their behavior at extreme scales.

Makespan Minimization in Job Shop Scheduling

International Journal of Engineering and Management Research ◽

10.31033/ijemr.11.1.31 ◽

2021 ◽

Vol 11 (1) ◽

pp. 228-230

Author(s):

K. Sathya Sundari

Keyword(s):

Completion Time ◽

Job Shop ◽

Short Term Memory ◽

Job Shop Scheduling ◽

Rollback Recovery ◽

Makespan Minimization ◽

Recovery Strategy ◽

Short Term ◽

Term Memory ◽

Shop Scheduling

In industries, the completion time of job problems in the manufacturing unit has risen significantly. In several types of current study, the job's completion time, or makespan, is reduced by taking straight paths, which is time-consuming. In this paper, we used an Improved Ant Colony Optimization and Tabu Search (ACOTS) algorithm to solve this problem by precisely defining the fault occurrence location in order to rollback. We have used a short-term memory-based rollback recovery strategy to minimise the job's completion time by rolling back to its own short-term memory. The recent movements in Tabu quest are visited using short term memory. As compared to the ACO algorithm, our proposed ACOTS-Cmax solution is more efficient and takes less time to complete.

An Efficient Data Replication Technique with Fault Tolerance Approach using BVAG with Checkpoint and Rollback-Recovery

International Journal of Advanced Computer Science and Applications ◽

10.14569/ijacsa.2021.0120155 ◽

2021 ◽

Vol 12 (1) ◽

Author(s):

Sharifah Hafizah Sy Ahmad Ubaidillah ◽

Basem Alkazemi ◽

A. Noraziah

Keyword(s):

Fault Tolerance ◽

Data Replication ◽

Rollback Recovery ◽

Tolerance Approach ◽

Efficient Data ◽

Replication Technique ◽

Checkpoint And Rollback

Markov Chain-based Modeling and Analysis of Checkpointing with Rollback Recovery for Efficient DSE in Soft Real-time Systems

2020 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT) ◽

10.1109/dft50435.2020.9250892 ◽

2020 ◽

Author(s):

Siva Satyendra Sahoo ◽

Bharadwaj Veeravalli ◽

Akash Kumar

Keyword(s):

Markov Chain ◽

Real Time ◽

Rollback Recovery ◽

Real Time Systems ◽

Modeling And Analysis ◽

Time Systems

A Review of Checkpointing and Rollback Recovery Protocols for Mobile Distributed Computing Systems

Internet of Things and Secure Smart Environments ◽

10.1201/9780367276706-3 ◽

2020 ◽

pp. 111-146

Author(s):

Houssem Mansouri ◽

Al-Sakib Khan Pathan

Keyword(s):

Distributed Computing ◽

Rollback Recovery ◽

Distributed Computing Systems ◽

Computing Systems ◽

Recovery Protocols

A cooperative partial snapshot algorithm for checkpoint‐rollback recovery of large‐scale and dynamic distributed systems and experimental evaluations

Concurrency and Computation Practice and Experience ◽

10.1002/cpe.5647 ◽

2020 ◽

Author(s):

Junya Nakamura ◽

Yonghwan Kim ◽

Yoshiaki Katayama ◽

Toshimitsu Masuzawa

Keyword(s):

Distributed Systems ◽

Large Scale ◽

Rollback Recovery

An Improved Ant Colony Optimized Tabu Search Algorithm for Makespan Improvement in Job Shop

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.b6695.129219 ◽

2019 ◽

Vol 9 (2) ◽

pp. 670-674

Keyword(s):

Tabu Search ◽

Completion Time ◽

Job Shop ◽

Short Term Memory ◽

Search Algorithm ◽

Ant Colony ◽

Rollback Recovery ◽

Short Term ◽

Term Memory ◽

Recovery Technique

In industries, the completion time of job problems is increased drastically in the production unit. In many existing kinds of research, the completion time i.e. makespan of the job is minimized using straight paths which is time-consuming. In this paper, we addressed this problem using an Improved Ant Colony Optimization and Tabu Search (ACOTS) algorithm by identifying the fault occurrence position exactly to rollback. Also, we used a short term memory-based rollback recovery technique to roll back to its own short term memory to reduce the completion time of the job. Short term memory is used to visit the recent movements in Tabu search. Our proposed ACOTS-Cmax approach is efficient and consumed less completion time compared to the ACO algorithm

S3R5: A Snapshot Storage System Based on ROW with Rapid Rollback, Recovery and Read-Write

2019 IEEE 21st International Conference on High Performance Computing and Communications; IEEE 17th International Conference on Smart City; IEEE 5th International Conference on Data Science and Systems (HPCC/SmartCity/DSS) ◽

10.1109/hpcc/smartcity/dss.2019.00292 ◽

2019 ◽

Author(s):

Hongzhang Yang ◽

Yahui Yang ◽

Yaofeng Tu

Keyword(s):

Storage System ◽

Rollback Recovery

SIFAT-SIFAT ROLLBACK RECOVERY MENGGUNAKAN UNCOORDINATED CHECKPOINTING BERBASIS CAUSALITY STRENGTH

Jurnal Matematika Statistika dan Komputasi ◽

10.20956/jmsk.v15i2.5716 ◽

2018 ◽

Vol 15 (2) ◽

pp. 74

Author(s):

Junianto Sesa

Keyword(s):

Fault Tolerance ◽

Rollback Recovery ◽

Domino Effect ◽

Alternative Approach ◽

Tolerance Approach

AbstractFault tolerance approach is the most popular computing application on computer devices in which depends on checkpoint uncoordinated. This alternative approach is based on checkpoint uncoordinated and logging message requiring all records, imposing works, memories and overhead becomes significant to communication. Recent studies have found that many applications on computer are send-determinism which can possibly design a new fault tolerance protocol. Thus, this research uses checkpoint uncoordinated protocol based causality strength, a send-determinism feature to record one part of the messages without restarting the process systematically when the error occurs. By drawing the protocol and proving its validity are required as the effective methods of this research. With this alternative approach, the protocol can functionally work where the only small portion of the message is recorded and domino effect does not occur.Keywords : Causality Strength, Domino Effect, Rollback Recovery, Uncoordinated Checkpointing AbstrakPendekatan toleransi kesalahan yang paling populer untuk aplikasi komputasi pada perangkat komputer bergantung pada checkpoint uncoordinated. Alternatif pendekatan tersebut berdasarkan pada checkpoint uncoordinated dan logging pesan mengharuskan pencatatan semua pesan, memaksakan pekerjaan memori/penyimpanan tinggi dan overhead yang signifikan pada komunikasi. Baru-baru ini telah diamati bahwa banyak aplikasi pada komputer bersifat send-determinism yang memungkinkan untuk mendesain protokol toleransi kesalahan baru. Sehingga penelitian ini menggunakan protokol checkpoint uncoordinated berbasis causality strength yang bersifat send-determinism yang hanya mencatat satu bagian dari pesan dan tidak perlu me-restart secara sistematis semua proses ketika kegagalan terjadi. Untuk menunjukkan bahwa penelitian ini berjalan sesuai dengan metode yang digunakan yaitu dengan menggambarkan protokol dan membuktikan kebenarannya. Dengan menggunakan pendekatan tersebut, dapat ditunjukkan bahwa protokol ini benar-benar berhasil dimana hanya mencatat sebagian kecil dari pesan dan tidak terjadi efek domino.Kata kunci : Causality Strength, Efek Domino, Rollback Recovery, Uncoordinated Checkpointing

SIFAT-SIFAT ROLLBACK RECOVERY MENGGUNAKAN UNCOORDINATED CHECKPOINTING BERBASIS CAUSALITY STRENGTH

Jurnal Matematika Statistika dan Komputasi ◽

10.20956/jmsk.v15i2.5572 ◽

2018 ◽

Vol 15 (2) ◽

pp. 71

Author(s):

Junianto Sesa

Keyword(s):

Rollback Recovery

Pendekatan toleransi kesalahan yang paling populer untuk aplikasi komputasi pada perangkat komputer bergantung pada checkpoint uncoordinated. Alternatif pendekatan tersebut berdasarkan pada checkpoint uncoordinated dan logging pesan mengharuskan pencatatan semua pesan, memaksakan pekerjaan memori/penyimpanan tinggi dan overhead yang signifikan pada komunikasi. Baru-baru ini telah diamati bahwa banyak aplikasi pada komputer bersifat send-determinism yang memungkinkan untuk mendesain protokol toleransi kesalahan baru. Sehingga penelitian ini menggunakan protokol checkpoint uncoordinated berbasis causality strength yang bersifat send-determinism yang hanya mencatat satu bagian dari pesan dan tidak perlu me-restart secara sistematis semua proses ketika kegagalan terjadi. Untuk menunjukkan bahwa penelitian ini berjalan sesuai dengan metode yang digunakan yaitu dengan menggambarkan protokol dan membuktikan kebenarannya. Dengan menggunakan pendekatan tersebut, dapat ditunjukkan bahwa protokol ini benar-benar berhasil dimana hanya mencatat sebagian kecil dari pesan dan tidak terjadi efek domino.

rollback recovery
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

A Comprehensive Review on Power Efficient Fault Tolerance Models in High Performance Computation Systems

Makespan Minimization in Job Shop Scheduling

An Efficient Data Replication Technique with Fault Tolerance Approach using BVAG with Checkpoint and Rollback-Recovery

Markov Chain-based Modeling and Analysis of Checkpointing with Rollback Recovery for Efficient DSE in Soft Real-time Systems

A Review of Checkpointing and Rollback Recovery Protocols for Mobile Distributed Computing Systems

A cooperative partial snapshot algorithm for checkpoint‐rollback recovery of large‐scale and dynamic distributed systems and experimental evaluations

An Improved Ant Colony Optimized Tabu Search Algorithm for Makespan Improvement in Job Shop

S3R5: A Snapshot Storage System Based on ROW with Rapid Rollback, Recovery and Read-Write

SIFAT-SIFAT ROLLBACK RECOVERY MENGGUNAKAN UNCOORDINATED CHECKPOINTING BERBASIS CAUSALITY STRENGTH

SIFAT-SIFAT ROLLBACK RECOVERY MENGGUNAKAN UNCOORDINATED CHECKPOINTING BERBASIS CAUSALITY STRENGTH

Export Citation Format

rollback recoveryRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

A Comprehensive Review on Power Efficient Fault Tolerance Models in High Performance Computation Systems

Makespan Minimization in Job Shop Scheduling

An Efficient Data Replication Technique with Fault Tolerance Approach using BVAG with Checkpoint and Rollback-Recovery

Markov Chain-based Modeling and Analysis of Checkpointing with Rollback Recovery for Efficient DSE in Soft Real-time Systems

A Review of Checkpointing and Rollback Recovery Protocols for Mobile Distributed Computing Systems

A cooperative partial snapshot algorithm for checkpoint‐rollback recovery of large‐scale and dynamic distributed systems and experimental evaluations

An Improved Ant Colony Optimized Tabu Search Algorithm for Makespan Improvement in Job Shop

S3R5: A Snapshot Storage System Based on ROW with Rapid Rollback, Recovery and Read-Write

SIFAT-SIFAT ROLLBACK RECOVERY MENGGUNAKAN UNCOORDINATED CHECKPOINTING BERBASIS CAUSALITY STRENGTH

SIFAT-SIFAT ROLLBACK RECOVERY MENGGUNAKAN UNCOORDINATED CHECKPOINTING BERBASIS CAUSALITY STRENGTH

rollback recovery
Recently Published Documents