Serveur d'exploration Cyberinfrastructure

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Scientific workflow management and the Kepler system

Identifieur interne : 001042 ( Main/Exploration ); précédent : 001041; suivant : 001043

Scientific workflow management and the Kepler system

Auteurs : Bertram Lud Scher [États-Unis] ; Ilkay Altintas [États-Unis] ; Chad Berkley [États-Unis] ; Dan Higgins [États-Unis] ; Efrat Jaeger [États-Unis] ; Matthew Jones [États-Unis] ; Edward A. Lee [États-Unis] ; Jing Tao [États-Unis] ; Yang Zhao [États-Unis]

Source :

RBID : ISTEX:30E3B595D35F4B9A8B4605A53FA65B4A123D59F9

English descriptors

Abstract

Many scientific disciplines are now data and information driven, and new scientific knowledge is often gained by scientists putting together data analysis and knowledge discovery ‘pipelines’. A related trend is that more and more scientific communities realize the benefits of sharing their data and computational services, and are thus contributing to a distributed data and computational community infrastructure (a.k.a. ‘the Grid’). However, this infrastructure is only a means to an end and ideally scientists should not be too concerned with its existence. The goal is for scientists to focus on development and use of what we call scientific workflows. These are networks of analytical steps that may involve, e.g., database access and querying steps, data analysis and mining steps, and many other steps including computationally intensive jobs on high‐performance cluster computers. In this paper we describe characteristics of and requirements for scientific workflows as identified in a number of our application projects. We then elaborate on Kepler, a particular scientific workflow system, currently under development across a number of scientific data management projects. We describe some key features of Kepler and its underlying Ptolemy II system, planned extensions, and areas of future research. Kepler is a community‐driven, open source project, and we always welcome related projects and new contributors to join. Copyright © 2005 John Wiley & Sons, Ltd.

Url:
DOI: 10.1002/cpe.994


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Scientific workflow management and the Kepler system</title>
<author>
<name sortKey="Lud Scher, Bertram" sort="Lud Scher, Bertram" uniqKey="Lud Scher B" first="Bertram" last="Lud Scher">Bertram Lud Scher</name>
</author>
<author>
<name sortKey="Altintas, Ilkay" sort="Altintas, Ilkay" uniqKey="Altintas I" first="Ilkay" last="Altintas">Ilkay Altintas</name>
</author>
<author>
<name sortKey="Berkley, Chad" sort="Berkley, Chad" uniqKey="Berkley C" first="Chad" last="Berkley">Chad Berkley</name>
</author>
<author>
<name sortKey="Higgins, Dan" sort="Higgins, Dan" uniqKey="Higgins D" first="Dan" last="Higgins">Dan Higgins</name>
</author>
<author>
<name sortKey="Jaeger, Efrat" sort="Jaeger, Efrat" uniqKey="Jaeger E" first="Efrat" last="Jaeger">Efrat Jaeger</name>
</author>
<author>
<name sortKey="Jones, Matthew" sort="Jones, Matthew" uniqKey="Jones M" first="Matthew" last="Jones">Matthew Jones</name>
</author>
<author>
<name sortKey="Lee, Edward A" sort="Lee, Edward A" uniqKey="Lee E" first="Edward A." last="Lee">Edward A. Lee</name>
</author>
<author>
<name sortKey="Tao, Jing" sort="Tao, Jing" uniqKey="Tao J" first="Jing" last="Tao">Jing Tao</name>
</author>
<author>
<name sortKey="Zhao, Yang" sort="Zhao, Yang" uniqKey="Zhao Y" first="Yang" last="Zhao">Yang Zhao</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:30E3B595D35F4B9A8B4605A53FA65B4A123D59F9</idno>
<date when="2006" year="2006">2006</date>
<idno type="doi">10.1002/cpe.994</idno>
<idno type="url">https://api.istex.fr/document/30E3B595D35F4B9A8B4605A53FA65B4A123D59F9/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000526</idno>
<idno type="wicri:Area/Istex/Curation">000526</idno>
<idno type="wicri:Area/Istex/Checkpoint">000691</idno>
<idno type="wicri:doubleKey">1532-0626:2006:Lud Scher B:scientific:workflow:management</idno>
<idno type="wicri:Area/Main/Merge">001055</idno>
<idno type="wicri:Area/Main/Curation">001042</idno>
<idno type="wicri:Area/Main/Exploration">001042</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Scientific workflow management and the Kepler system</title>
<author>
<name sortKey="Lud Scher, Bertram" sort="Lud Scher, Bertram" uniqKey="Lud Scher B" first="Bertram" last="Lud Scher">Bertram Lud Scher</name>
<affiliation wicri:level="2">
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>San Diego Supercomputer Center, UC San Diego, San Diego, CA 92093</wicri:regionArea>
<placeName>
<region type="state">Californie</region>
</placeName>
</affiliation>
<affiliation wicri:level="2">
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Computer Science and Genome Center, UC Davis, Davis, CA 95616</wicri:regionArea>
<placeName>
<region type="state">Californie</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Altintas, Ilkay" sort="Altintas, Ilkay" uniqKey="Altintas I" first="Ilkay" last="Altintas">Ilkay Altintas</name>
<affiliation wicri:level="2">
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>San Diego Supercomputer Center, UC San Diego, San Diego, CA 92093</wicri:regionArea>
<placeName>
<region type="state">Californie</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Berkley, Chad" sort="Berkley, Chad" uniqKey="Berkley C" first="Chad" last="Berkley">Chad Berkley</name>
<affiliation wicri:level="2">
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>National Center for Ecological Analysis and Synthesis, UC Santa Barbara, Santa Barbara, CA 93101</wicri:regionArea>
<placeName>
<region type="state">Californie</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Higgins, Dan" sort="Higgins, Dan" uniqKey="Higgins D" first="Dan" last="Higgins">Dan Higgins</name>
<affiliation wicri:level="2">
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>National Center for Ecological Analysis and Synthesis, UC Santa Barbara, Santa Barbara, CA 93101</wicri:regionArea>
<placeName>
<region type="state">Californie</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Jaeger, Efrat" sort="Jaeger, Efrat" uniqKey="Jaeger E" first="Efrat" last="Jaeger">Efrat Jaeger</name>
<affiliation wicri:level="2">
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>San Diego Supercomputer Center, UC San Diego, San Diego, CA 92093</wicri:regionArea>
<placeName>
<region type="state">Californie</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Jones, Matthew" sort="Jones, Matthew" uniqKey="Jones M" first="Matthew" last="Jones">Matthew Jones</name>
<affiliation wicri:level="2">
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>National Center for Ecological Analysis and Synthesis, UC Santa Barbara, Santa Barbara, CA 93101</wicri:regionArea>
<placeName>
<region type="state">Californie</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Lee, Edward A" sort="Lee, Edward A" uniqKey="Lee E" first="Edward A." last="Lee">Edward A. Lee</name>
<affiliation wicri:level="2">
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Electrical Engineering and Computer Sciences, UC Berkeley, Berkeley, CA 94720</wicri:regionArea>
<placeName>
<region type="state">Californie</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Tao, Jing" sort="Tao, Jing" uniqKey="Tao J" first="Jing" last="Tao">Jing Tao</name>
<affiliation wicri:level="2">
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>San Diego Supercomputer Center, UC San Diego, San Diego, CA 92093</wicri:regionArea>
<placeName>
<region type="state">Californie</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Zhao, Yang" sort="Zhao, Yang" uniqKey="Zhao Y" first="Yang" last="Zhao">Yang Zhao</name>
<affiliation wicri:level="2">
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Electrical Engineering and Computer Sciences, UC Berkeley, Berkeley, CA 94720</wicri:regionArea>
<placeName>
<region type="state">Californie</region>
</placeName>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="j">Concurrency and Computation: Practice and Experience</title>
<title level="j" type="abbrev">Concurrency Computat.: Pract. Exper.</title>
<idno type="ISSN">1532-0626</idno>
<idno type="eISSN">1532-0634</idno>
<imprint>
<publisher>John Wiley & Sons, Ltd.</publisher>
<pubPlace>Chichester, UK</pubPlace>
<date type="published" when="2006-08-25">2006-08-25</date>
<biblScope unit="volume">18</biblScope>
<biblScope unit="issue">10</biblScope>
<biblScope unit="page" from="1039">1039</biblScope>
<biblScope unit="page" to="1065">1065</biblScope>
</imprint>
<idno type="ISSN">1532-0626</idno>
</series>
<idno type="istex">30E3B595D35F4B9A8B4605A53FA65B4A123D59F9</idno>
<idno type="DOI">10.1002/cpe.994</idno>
<idno type="ArticleID">CPE994</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">1532-0626</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Grid workflows</term>
<term>dataflow networks</term>
<term>problem‐solving environments</term>
<term>scientific data management</term>
<term>scientific workflows</term>
</keywords>
</textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Many scientific disciplines are now data and information driven, and new scientific knowledge is often gained by scientists putting together data analysis and knowledge discovery ‘pipelines’. A related trend is that more and more scientific communities realize the benefits of sharing their data and computational services, and are thus contributing to a distributed data and computational community infrastructure (a.k.a. ‘the Grid’). However, this infrastructure is only a means to an end and ideally scientists should not be too concerned with its existence. The goal is for scientists to focus on development and use of what we call scientific workflows. These are networks of analytical steps that may involve, e.g., database access and querying steps, data analysis and mining steps, and many other steps including computationally intensive jobs on high‐performance cluster computers. In this paper we describe characteristics of and requirements for scientific workflows as identified in a number of our application projects. We then elaborate on Kepler, a particular scientific workflow system, currently under development across a number of scientific data management projects. We describe some key features of Kepler and its underlying Ptolemy II system, planned extensions, and areas of future research. Kepler is a community‐driven, open source project, and we always welcome related projects and new contributors to join. Copyright © 2005 John Wiley & Sons, Ltd.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>États-Unis</li>
</country>
<region>
<li>Californie</li>
</region>
</list>
<tree>
<country name="États-Unis">
<region name="Californie">
<name sortKey="Lud Scher, Bertram" sort="Lud Scher, Bertram" uniqKey="Lud Scher B" first="Bertram" last="Lud Scher">Bertram Lud Scher</name>
</region>
<name sortKey="Altintas, Ilkay" sort="Altintas, Ilkay" uniqKey="Altintas I" first="Ilkay" last="Altintas">Ilkay Altintas</name>
<name sortKey="Berkley, Chad" sort="Berkley, Chad" uniqKey="Berkley C" first="Chad" last="Berkley">Chad Berkley</name>
<name sortKey="Higgins, Dan" sort="Higgins, Dan" uniqKey="Higgins D" first="Dan" last="Higgins">Dan Higgins</name>
<name sortKey="Jaeger, Efrat" sort="Jaeger, Efrat" uniqKey="Jaeger E" first="Efrat" last="Jaeger">Efrat Jaeger</name>
<name sortKey="Jones, Matthew" sort="Jones, Matthew" uniqKey="Jones M" first="Matthew" last="Jones">Matthew Jones</name>
<name sortKey="Lee, Edward A" sort="Lee, Edward A" uniqKey="Lee E" first="Edward A." last="Lee">Edward A. Lee</name>
<name sortKey="Lud Scher, Bertram" sort="Lud Scher, Bertram" uniqKey="Lud Scher B" first="Bertram" last="Lud Scher">Bertram Lud Scher</name>
<name sortKey="Tao, Jing" sort="Tao, Jing" uniqKey="Tao J" first="Jing" last="Tao">Jing Tao</name>
<name sortKey="Zhao, Yang" sort="Zhao, Yang" uniqKey="Zhao Y" first="Yang" last="Zhao">Yang Zhao</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/CyberinfraV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001042 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001042 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    CyberinfraV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:30E3B595D35F4B9A8B4605A53FA65B4A123D59F9
   |texte=   Scientific workflow management and the Kepler system
}}

Wicri

This area was generated with Dilib version V0.6.25.
Data generation: Thu Oct 27 09:30:58 2016. Site generation: Sun Mar 10 23:08:40 2024