Related resources
Full-text held externally
Search for item elsewhere
University researcher(s)
Enhancing and abstracting scientific workflow provenance for data publishing
Pinar Alper, Khalid Belhajjame, Carole A. Goble, Pinar Karagoz
In: EDBT '13 Proceedings of the Joint EDBT/ICDT 2013 Workshops : EDBT '13 Proceedings of the Joint EDBT/ICDT 2013 Workshops ; 18 Mar 2013-22 Mar 2013; Genoa, Italy. ACM; 2013.
Access to files
- SUPPLEMENTARY-1.PDF (pdf)
Abstract
Many scientists are using workflows to systematically design and run computational experiments. Once the workflow is executed, the scientist may want to publish the dataset generated as a result, to be, e.g., reused by other scientists as input to their experiments. In doing so, the scientist needs to curate such dataset by specifying metadata information that describes it, e.g. its derivation history, origins and ownership. To assist the scientist in this task, we explore in this paper the use of provenance traces collected by workflow management systems when enacting workflows. Specifically, we identify the shortcomings of such raw provenance traces in supporting the data publishing task, and propose an approach whereby distilled, yet more informative, provenance traces that are fit for the data publishing task can be derived.
Keyword(s)
Bibliographic metadata
- eScience-WF-Motifs-Taverna-DataSet http://www.myexperiment.org/files/789.html
- Motif ontology source (outdated) https://github.com/wf4ever/ro/blob/master/motifs.owl
- The Workflow Motif Ontology http://purl.org/net/wf-motifs