Data Lineage Model for Taverna Workflows with Lightweight Annotation Requirements

Missier, P; Belhajjame, K; Zhao, J; Roos, M; Goble, C

In: Freire, J; Koop, D; Moreau, L. 2nd International Provenance and Annotation Workshop; 17 Jun 2008-18 Jun 2008; Salt Lake City, UT. Springer-Verlag Berlin; 2008. p. 17-30.

Abstract

The provenance, or lineage, of a workflow data product can be reconstructed by keeping a complete trace of workflow execution. This lineage information, however, is likely to be both imprecise, because of the black-box nature of the services that compose the workflow, and noisy, because of the many trivial data transformations that obscure the intended purpose of the workflow. In this paper we argue that these shortcomings can be alleviated by introducing, in a principled way, a small set of optional lightweight annotations to the workflow. We begin by presenting a baseline, annotation-free lineage model for the Taverna workflow system, and then show how the proposed annotations improve the results of fundamental lineage queries.

Bibliographic metadata

Content type:
Type of conference contribution:
Publication date:
Conference title:
2nd International Provenance and Annotation Workshop
Conference venue:
Salt Lake City, UT
Conference start date:
2008-06-17
Conference end date:
2008-06-18
Proceedings start page:
17
Proceedings end page:
30
Proceedings pagination:
17-30
Contribution total pages:
14
Proceedings' ISBN:
978-3-540-89964-8 (series ISSN: 0302-9743)
Proceedings' volume:
5272
Language:
English
General notes:
  • Microsoft Corporat, Univ Utah Sci Comp; Missier, Paolo; Belhajjame, Khalid; Zhao, Jun; Roos, Marco; Goble, Carole; 20; BERLIN; BIU90

Institutional metadata

University researcher(s):

Record metadata

Manchester eScholar ID:
uk-ac-man-scw:2f37
Created:
7th September, 2009, 15:27:20
Last modified by:
Goble, Carole
Last modified:
4th November, 2014, 15:28:32
