scientific workflows
Recently Published Documents


TOTAL DOCUMENTS

793
(FIVE YEARS 164)

H-INDEX

42
(FIVE YEARS 5)

Author(s):  
Jonathan Bader ◽  
Lauritz Thamsen ◽  
Svetlana Kulagina ◽  
Jonathan Will ◽  
Henning Meyerhenke ◽  
...  

2021 ◽  
Vol 17 (4) ◽  
pp. 1-21
Author(s):  
Devarshi Ghoshal ◽  
Lavanya Ramakrishnan

Scientific workflows in High Performance Computing ( HPC ) environments are processing large amounts of data. The storage hierarchy on HPC systems is getting deeper, driven by new technologies (NVRAMs, SSDs, etc.) There is a need for new programming abstractions that allow users to seamlessly manage data at the workflow level on multi-tiered storage systems, and provide optimal workflow performance and use of storage resources. In previous work, we introduced a software architecture Managing Data on Tiered Storage for Scientific Workflows (MaDaTS ) that used a Virtual Data Space ( VDS ) abstraction to hide the complexities of the underlying storage system while allowing users to control data management strategies. In this article, we detail the data-centric programming abstractions that allow users to manage a workflow around its data on the storage layer. The programming abstractions simplify data management for scientific workflows on multi-tiered storage systems, without affecting workflow performance or storage capacity. We measure the overheads and effectiveness introduced by the programming abstractions of MaDaTS. Our results show that these abstractions can optimally use the storage capacity in lesser capacity storage tiers, and simplify data management without adding any performance overheads.


Author(s):  
Mirsaeid Hosseini Shirvani ◽  
Reza Noorian Talouki

AbstractScheduling of scientific workflows on hybrid cloud architecture, which contains private and public clouds, is a challenging task because schedulers should be aware of task inter-dependencies, underlying heterogeneity, cost diversity, and virtual machine (VM) variable configurations during the scheduling process. On the one side, reaching a minimum total execution time or makespan is a favorable issue for users whereas the cost of utilizing quicker VMs may lead to conflict with their budget on the other side. Existing works in the literature scarcely consider VM’s monetary cost in the scheduling process but mainly focus on makespan. Therefore, in this paper, the problem of scientific workflow scheduling running on hybrid cloud architecture is formulated to a bi-objective optimization problem with makespan and monetary cost minimization viewpoint. To address this combinatorial discrete problem, this paper presents a hybrid bi-objective optimization based on simulated annealing and task duplication algorithms (BOSA-TDA) that exploits two important heuristics heterogeneous earliest finish time (HEFT) and duplication techniques to improve canonical SA. The extensive simulation results reported of running different well-known scientific workflows such as LIGO, SIPHT, Cybershake, Montage, and Epigenomics demonstrate that proposed BOSA-TDA has the amount of 12.5%, 14.5%, 17%, 13.5%, and 18.5% average improvement against other existing approaches in terms of makespan, monetary cost, speed up, SLR, and efficiency metrics, respectively.


Author(s):  
Tchimou N'Takpé ◽  
Jean Edgard Gnimassoun ◽  
Souleymane Oumtanaga ◽  
Frédéric Suter
Keyword(s):  

2021 ◽  
Vol 7 ◽  
pp. e747
Author(s):  
Mazen Farid ◽  
Rohaya Latip ◽  
Masnida Hussin ◽  
Nor Asilah Wati Abdul Hamid

Background Recent technological developments have enabled the execution of more scientific solutions on cloud platforms. Cloud-based scientific workflows are subject to various risks, such as security breaches and unauthorized access to resources. By attacking side channels or virtual machines, attackers may destroy servers, causing interruption and delay or incorrect output. Although cloud-based scientific workflows are often used for vital computational-intensive tasks, their failure can come at a great cost. Methodology To increase workflow reliability, we propose the Fault and Intrusion-tolerant Workflow Scheduling algorithm (FITSW). The proposed workflow system uses task executors consisting of many virtual machines to carry out workflow tasks. FITSW duplicates each sub-task three times, uses an intermediate data decision-making mechanism, and then employs a deadline partitioning method to determine sub-deadlines for each sub-task. This way, dynamism is achieved in task scheduling using the resource flow. The proposed technique generates or recycles task executors, keeps the workflow clean, and improves efficiency. Experiments were conducted on WorkflowSim to evaluate the effectiveness of FITSW using metrics such as task completion rate, success rate and completion time. Results The results show that FITSW not only raises the success rate by about 12%, it also improves the task completion rate by 6.2% and minimizes the completion time by about 15.6% in comparison with intrusion tolerant scientific workflow ITSW system.


2021 ◽  
Author(s):  
Devarshi Ghoshal ◽  
Ludovico Bianchi ◽  
Abdelilah Essiari ◽  
Drew Paine ◽  
Sarah S. Poon ◽  
...  
Keyword(s):  

2021 ◽  
Author(s):  
Rafael Ferreira da Silva ◽  
Henri Casanova ◽  
Kyle Chard ◽  
Ilkay Altintas ◽  
Rosa M Badia ◽  
...  

2021 ◽  
Author(s):  
Henri Casanova ◽  
Ewa Deelman ◽  
Sandra Gesing ◽  
Michael Hildreth ◽  
Stephen Hudson ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document