Nonintrusive collection and management of data provenance in scientific workflows

2012 ◽  
Vol 24 (18) ◽  
pp. 2268-2281
Author(s):  
Giorgos Tylissanakis ◽  
Yiannis Cotronis
Author(s):  
Anton Michlmayr ◽  
Florian Rosenberg ◽  
Philipp Leitner ◽  
Schahram Dustdar

In general, provenance describes the origin and well-documented history of a given object. This notion has been applied in information systems, mainly to provide data provenance of scientific workflows. Similar to this, provenance in Service-oriented Computing has also focused on data provenance. However, the authors argue that in service-centric systems the origin and history of services is equally important. This paper presents an approach that addresses service provenance. The authors show how service provenance information can be collected and retrieved, and how security mechanisms guarantee integrity and access to this information, while also providing user-specific views on provenance. Finally, the paper gives a performance evaluation of the authors’ approach, which has been integrated into the VRESCo Web service runtime environment.


Author(s):  
Khalid Belhajjame ◽  
Paolo Missier ◽  
Carole Goble

Data provenance is key to understanding and interpreting the results of scientific experiments. This chapter introduces and characterises data provenance in scientific workflows using illustrative examples taken from real-world workflows. The characterisation takes the form of a taxonomy that is used for comparing and analysing provenance capabilities supplied by existing scientific workflow systems.


2010 ◽  
Vol 7 (2) ◽  
pp. 65-86 ◽  
Author(s):  
Anton Michlmayr ◽  
Florian Rosenberg ◽  
Philipp Leitner ◽  
Schahram Dustdar

In general, provenance describes the origin and well-documented history of a given object. This notion has been applied in information systems, mainly to provide data provenance of scientific workflows. Similar to this, provenance in Service-oriented Computing has also focused on data provenance. However, the authors argue that in service-centric systems the origin and history of services is equally important. This paper presents an approach that addresses service provenance. The authors show how service provenance information can be collected and retrieved, and how security mechanisms guarantee integrity and access to this information, while also providing user-specific views on provenance. Finally, the paper gives a performance evaluation of the authors’ approach, which has been integrated into the VRESCo Web service runtime environment.


Author(s):  
Jingyu Zhang ◽  
Jinhui Yao ◽  
Shiping Chen ◽  
David Levy

This paper proposes the use of a mobile-cloud by combining mobile devices and the cloud together in a biodefense and emerging infectious diseases (BEI) research application scenario. A mobile-cloud framework is developed to facilitate the use of mobile devices to collect data for, and manipulate and interact with the scientific workflows running in the Cloud. In this framework, an independent trusted accountability service is used to provide data provenance and enforce compliance among the participants of a biodefense research workflow. The authors have implemented a prototype which allows the researchers to use mobile devices to design and participate in biodefense workflows. The authors evaluated the effectiveness of the mobile-cloud with a prototype and conducted performance testing with example biodefense workflows.


Author(s):  
Shawn Bowers ◽  
Timothy McPhillips ◽  
Bertram Ludäscher ◽  
Shirley Cohen ◽  
Susan B. Davidson

Author(s):  
Jingyu Zhang ◽  
Jinhui Yao ◽  
Shiping Chen ◽  
David Levy

This paper proposes the use of a mobile-cloud by combining mobile devices and the cloud together in a biodefense and emerging infectious diseases (BEI) research application scenario. A mobile-cloud framework is developed to facilitate the use of mobile devices to collect data for, and manipulate and interact with the scientific workflows running in the Cloud. In this framework, an independent trusted accountability service is used to provide data provenance and enforce compliance among the participants of a biodefense research workflow. The authors have implemented a prototype which allows the researchers to use mobile devices to design and participate in biodefense workflows. The authors evaluated the effectiveness of the mobile-cloud with a prototype and conducted performance testing with example biodefense workflows.


2012 ◽  
pp. 941-955
Author(s):  
Jingyu Zhang ◽  
Jinhui Yao ◽  
Shiping Chen ◽  
David Levy

This paper proposes the use of a mobile-cloud by combining mobile devices and the cloud together in a biodefense and emerging infectious diseases (BEI) research application scenario. A mobile-cloud framework is developed to facilitate the use of mobile devices to collect data for, and manipulate and interact with the scientific workflows running in the Cloud. In this framework, an independent trusted accountability service is used to provide data provenance and enforce compliance among the participants of a biodefense research workflow. The authors have implemented a prototype which allows the researchers to use mobile devices to design and participate in biodefense workflows. The authors evaluated the effectiveness of the mobile-cloud with a prototype and conducted performance testing with example biodefense workflows.


2017 ◽  
Vol 9 (2) ◽  
pp. 20-46 ◽  
Author(s):  
Tassio Ferenzini Martins Sirqueira ◽  
Regina Braga ◽  
Marco Antônio P. Araújo ◽  
José Maria N. David ◽  
Fernanda Campos ◽  
...  

A scientific software ecosystem aims to integrate all stages of an experiment and its related workflows, in order to solve complex problems. In this vein, in order to assure the experiment proper execution, any modification that occurs must be propagated to the associated workflows, which must be maintained and evolved for the successful conduction of the research. One way to ensure this control is through configuration management using data provenance. In this work, the authors use data provenance concepts and models, together with ontologies to provide an architecture for the storage and query of scientific experiment information. Considering the architecture, a proof of concept was conducted using workflows extracted from the myExperiment repository. The results are presented along the paper.


Sign in / Sign up

Export Citation Format

Share Document