Publication | Closed Access
Provenance and scientific workflows
460
Citations
52
References
2008
Year
Unknown Venue
Workflow ExecutionEngineeringScientific Workflow SystemData ScienceScientific CommunityProvenance ManagementKnowledge DiscoveryScientific WorkflowsManagementData IntegrationComputer ScienceSemantic WebProvenance AnalysisData ManagementData ProvenanceData Modeling
Provenance of data and workflow specifications is crucial for reproducibility, sharing, and knowledge reuse, and has been the subject of numerous workshops and research projects. This tutorial surveys current research issues and technologies in provenance for scientific workflows, highlighting recent literature. Targeted at database researchers and scientific data practitioners, the tutorial presents an overview of scientific workflows, discusses provenance support in existing systems, explores emerging applications, and identifies open problems and future research directions.
Provenance in the context of workflows, both for the data they derive and for their specification, is an essential component to allow for result reproducibility, sharing, and knowledge re-use in the scientific community. Several workshops have been held on the topic, and it has been the focus of many research projects and prototype systems. This tutorial provides an overview of research issues in provenance for scientific workflows, with a focus on recent literature and technology in this area. It is aimed at a general database research audience and at people who work with scientific data and workflows. We will (1) provide a general overview of scientific workflows, (2) describe research on provenance for scientific workflows and show in detail how provenance is supported in existing systems; (3) discuss emerging applications that are enabled by provenance; and (4) outline open problems and new directions for database-related research.
| Year | Citations | |
|---|---|---|
Page 1
Page 1