Right - there are many, many levels of provenance. Provenance of source code
is not very different from provenance of workflow definitions. We have already
done quite a bit of work on that at a social level with our generic workflow
sharing platform myExperiment - http://www.myexperiment.org/home - e.g.
"Version 4 of this workflow - attributed to this other workflow by someone
else".  This might be of interest to other "workflow-like" projects within
Apache, as myExperiment can deal with any workflow type.

For instance, my workflow at http://www.myexperiment.org/workflows/3860
attributes http://www.myexperiment.org/workflows/3369 because I have
embedded it as a nested workflow. I have therefore also given the original
authors credit on my workflow.  Lots of this information can be deduced by
inspecting the definitions, looking at hashes and identifiers, etc.
(Taverna workflow definitions include a chain of identifiers throughout
their evolution - so you can even tell if an earlier, unpublished version of
a workflow has been reused).
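
To make the identifier-chain idea concrete, here is a minimal sketch. It
assumes each workflow definition carries its identifiers as an ordered list,
oldest first. The list representation, the function name shared_ancestor and
the identifier values are all made up for illustration; this is not the
actual Taverna format or API.

```python
from typing import List, Optional


def shared_ancestor(chain_a: List[str], chain_b: List[str]) -> Optional[str]:
    """Return the most recent identifier in chain_b that also occurs in chain_a.

    Both chains are assumed to be ordered oldest first. None means the two
    workflows share no known history.
    """
    seen = set(chain_a)
    for ident in reversed(chain_b):
        if ident in seen:
            return ident
    return None


# Hypothetical identifier chains (made-up values, oldest first).
original = ["id-a1", "id-a2", "id-a3"]          # id-a3 is the published version
derived = ["id-a1", "id-a2", "id-b1", "id-b2"]  # forked before id-a3 existed

ancestor = shared_ancestor(original, derived)
print(ancestor)
```

Here the derived workflow branches off at an intermediate identifier rather
than at the published one, which is the "earlier, unpublished version has
been reused" case.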



Provenance of a workflow *execution* is closely related to, but still
quite distinct from, the higher-level provenance of research data and of
the scientific analysis it has gone through. Similarly, the provenance
of a command line tool can be at the system level - "Ran for 14 seconds on a
Linux host asdkjasd using 1127 MB of memory and these shared libraries" -
or at a semantic level, like "Aligned these two biological sequences from
mouse and rat".

The big challenge is trying to bind these kinds of provenance together, and
to infer one level of provenance from another.
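
As a rough sketch of what "binding" could look like in the simplest case:
assume both the system-level and the semantic-level records happen to carry
the same activity identifier, so they can be joined on it. The record types,
field names and the identifier "run-1" are hypothetical, and this does not
address the harder problem of inferring one level from the other.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class SystemRecord:
    activity_id: str  # hypothetical shared key
    host: str
    duration_s: int
    memory_mb: int


@dataclass
class SemanticRecord:
    activity_id: str  # hypothetical shared key
    description: str


def bind(system: List[SystemRecord],
         semantic: List[SemanticRecord]) -> Dict[str, Dict[str, object]]:
    """Group records of both levels under their shared activity identifier."""
    by_id: Dict[str, Dict[str, object]] = {}
    for rec in system:
        by_id.setdefault(rec.activity_id, {})["system"] = rec
    for rec in semantic:
        by_id.setdefault(rec.activity_id, {})["semantic"] = rec
    return by_id


bound = bind(
    [SystemRecord("run-1", "asdkjasd", 14, 1127)],
    [SemanticRecord("run-1", "Aligned two biological sequences from mouse and rat")],
)
print(sorted(bound["run-1"].keys()))
```

In practice the two levels rarely share a ready-made key like this, which is
exactly where the challenge lies.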


But I am digressing!  My apologies to the rest of the list, but do let me
know if you are interested in workflows, provenance, versioning and
semantics, and we can put together some kind of interest group.



On 23 October 2014 07:41, Bertrand Delacretaz <bdelacre...@apache.org>
wrote:

> Hi,
>
> Thanks for the clarifications.
>
> On Thu, Oct 23, 2014 at 4:43 AM, Stian Soiland-Reyes
> <soiland-re...@cs.manchester.ac.uk> wrote:
> > ...Provenance exchange - I am thinking in particular if it would be
> > possible to combine our W3C PROV-O provenance support -
> > https://github.com/taverna/taverna-prov (which describes a workflow
> > run) - with exposing service-level provenance...
>
> Ok got it now. We sometimes talk about the provenance of our code,
> which must be traceable etc. so I was confused why you'd exchange
> provenance with other projects ;-)
>
> All clear now.
> -Bertrand


-- 
Stian Soiland-Reyes, myGrid team
School of Computer Science
The University of Manchester
http://soiland-reyes.com/stian/work/ http://orcid.org/0000-0001-9842-9718
