Serveur d'exploration Cyberinfrastructure

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Provenance Collection Support in the Kepler Scientific Workflow System

Identifieur interne : 001047 ( Main/Exploration ); précédent : 001046; suivant : 001048

Provenance Collection Support in the Kepler Scientific Workflow System

Auteurs : Ilkay Altintas [États-Unis] ; Oscar Barney [États-Unis] ; Efrat Jaeger-Frank [États-Unis]

Source :

RBID : ISTEX:6035CD2171BD868852B7AAE419F394A2A6102527

Abstract

Abstract: In many data-driven applications, analysis needs to be performed on scientific information obtained from several sources and generated by computations on distributed resources. Systematic analysis of this scientific information unleashes a growing need for automated data-driven applications that also can keep track of the provenance of the data and processes with little user interaction and overhead. Such data analysis can be facilitated by the recent advancements in scientific workflow systems. A major profit when using scientific workflow systems is the ability to make provenance collection a part of the workflow. Specifically, provenance should include not only the standard data lineage information but also information about the context in which the workflow was used, execution that processed the data, and the evolution of the workflow design. In this paper we describe a complete framework for data and process provenance in the Kepler Scientific Workflow System. We outline the requirements and issues related to data and workflow provenance in a multi-disciplinary workflow system and introduce how generic provenance capture can be facilitated in Kepler’s actor-oriented workflow environment. We also describe the usage of the stored provenance information for efficient rerun of scientific workflows.

Url:
DOI: 10.1007/11890850_14


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Provenance Collection Support in the Kepler Scientific Workflow System</title>
<author>
<name sortKey="Altintas, Ilkay" sort="Altintas, Ilkay" uniqKey="Altintas I" first="Ilkay" last="Altintas">Ilkay Altintas</name>
</author>
<author>
<name sortKey="Barney, Oscar" sort="Barney, Oscar" uniqKey="Barney O" first="Oscar" last="Barney">Oscar Barney</name>
</author>
<author>
<name sortKey="Jaeger Frank, Efrat" sort="Jaeger Frank, Efrat" uniqKey="Jaeger Frank E" first="Efrat" last="Jaeger-Frank">Efrat Jaeger-Frank</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:6035CD2171BD868852B7AAE419F394A2A6102527</idno>
<date when="2006" year="2006">2006</date>
<idno type="doi">10.1007/11890850_14</idno>
<idno type="url">https://api.istex.fr/document/6035CD2171BD868852B7AAE419F394A2A6102527/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000281</idno>
<idno type="wicri:Area/Istex/Curation">000281</idno>
<idno type="wicri:Area/Istex/Checkpoint">000696</idno>
<idno type="wicri:doubleKey">0302-9743:2006:Altintas I:provenance:collection:support</idno>
<idno type="wicri:Area/Main/Merge">001060</idno>
<idno type="wicri:Area/Main/Curation">001047</idno>
<idno type="wicri:Area/Main/Exploration">001047</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Provenance Collection Support in the Kepler Scientific Workflow System</title>
<author>
<name sortKey="Altintas, Ilkay" sort="Altintas, Ilkay" uniqKey="Altintas I" first="Ilkay" last="Altintas">Ilkay Altintas</name>
<affiliation wicri:level="2">
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>San Diego Supercomputer Center, University of California, 9500 Gilman Drive, San Diego, 92092-0505, San Diego, CA</wicri:regionArea>
<placeName>
<region type="state">Californie</region>
</placeName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">États-Unis</country>
</affiliation>
</author>
<author>
<name sortKey="Barney, Oscar" sort="Barney, Oscar" uniqKey="Barney O" first="Oscar" last="Barney">Oscar Barney</name>
<affiliation wicri:level="2">
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Scientific Computing and Imaging Institute, University of Utah, 50 S. Central Campus Drive, 84112, Salt Lake City, UT</wicri:regionArea>
<placeName>
<region type="state">Utah</region>
</placeName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">États-Unis</country>
</affiliation>
</author>
<author>
<name sortKey="Jaeger Frank, Efrat" sort="Jaeger Frank, Efrat" uniqKey="Jaeger Frank E" first="Efrat" last="Jaeger-Frank">Efrat Jaeger-Frank</name>
<affiliation wicri:level="2">
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>San Diego Supercomputer Center, University of California, 9500 Gilman Drive, San Diego, 92092-0505, San Diego, CA</wicri:regionArea>
<placeName>
<region type="state">Californie</region>
</placeName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">États-Unis</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<imprint>
<date>2006</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">6035CD2171BD868852B7AAE419F394A2A6102527</idno>
<idno type="DOI">10.1007/11890850_14</idno>
<idno type="ChapterID">14</idno>
<idno type="ChapterID">Chap14</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: In many data-driven applications, analysis needs to be performed on scientific information obtained from several sources and generated by computations on distributed resources. Systematic analysis of this scientific information unleashes a growing need for automated data-driven applications that also can keep track of the provenance of the data and processes with little user interaction and overhead. Such data analysis can be facilitated by the recent advancements in scientific workflow systems. A major profit when using scientific workflow systems is the ability to make provenance collection a part of the workflow. Specifically, provenance should include not only the standard data lineage information but also information about the context in which the workflow was used, execution that processed the data, and the evolution of the workflow design. In this paper we describe a complete framework for data and process provenance in the Kepler Scientific Workflow System. We outline the requirements and issues related to data and workflow provenance in a multi-disciplinary workflow system and introduce how generic provenance capture can be facilitated in Kepler’s actor-oriented workflow environment. We also describe the usage of the stored provenance information for efficient rerun of scientific workflows.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>États-Unis</li>
</country>
<region>
<li>Californie</li>
<li>Utah</li>
</region>
</list>
<tree>
<country name="États-Unis">
<region name="Californie">
<name sortKey="Altintas, Ilkay" sort="Altintas, Ilkay" uniqKey="Altintas I" first="Ilkay" last="Altintas">Ilkay Altintas</name>
</region>
<name sortKey="Altintas, Ilkay" sort="Altintas, Ilkay" uniqKey="Altintas I" first="Ilkay" last="Altintas">Ilkay Altintas</name>
<name sortKey="Barney, Oscar" sort="Barney, Oscar" uniqKey="Barney O" first="Oscar" last="Barney">Oscar Barney</name>
<name sortKey="Barney, Oscar" sort="Barney, Oscar" uniqKey="Barney O" first="Oscar" last="Barney">Oscar Barney</name>
<name sortKey="Jaeger Frank, Efrat" sort="Jaeger Frank, Efrat" uniqKey="Jaeger Frank E" first="Efrat" last="Jaeger-Frank">Efrat Jaeger-Frank</name>
<name sortKey="Jaeger Frank, Efrat" sort="Jaeger Frank, Efrat" uniqKey="Jaeger Frank E" first="Efrat" last="Jaeger-Frank">Efrat Jaeger-Frank</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/CyberinfraV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001047 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001047 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    CyberinfraV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:6035CD2171BD868852B7AAE419F394A2A6102527
   |texte=   Provenance Collection Support in the Kepler Scientific Workflow System
}}

Wicri

This area was generated with Dilib version V0.6.25.
Data generation: Thu Oct 27 09:30:58 2016. Site generation: Sun Mar 10 23:08:40 2024