Serveur d'exploration Cyberinfrastructure

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

An e‐Science data infrastructure for simulations within Grid computing environment: methods, approaches and practice

Identifieur interne : 000466 ( Istex/Corpus ); précédent : 000465; suivant : 000467

An e‐Science data infrastructure for simulations within Grid computing environment: methods, approaches and practice

Auteurs : Xiaoyu Yang ; Martin T. Dove ; Richard P. Bruin ; Andrew Walkingshaw ; Richard Sinclair ; Dan J. Wilson ; Peter Murray-Rust

Source :

RBID : ISTEX:66D3FE122A3E7348EE0C5014A65BEFD271BC3ACA

Abstract

Grid‐based simulation usually involves large quantities of data at each stage of the simulation process. These data include simulation input and output files, intermediate results files, log and error files, associated metadata, and information capturing the processes that generate the data. The question of how to effectively store and manage data files within a Grid computing environment is increasingly becoming an important issue. This paper illustrates how we built a lightweight e‐Science infrastructure for data management within a Grid computing environment, including the integration of data curation activities into the entire Grid‐based simulation process. Rather than focusing on specific implementation details, we aim to identify the key issues and research challenges, describing how various existing technologies and tools can be best integrated to address these requirements and challenges. Although the case of quantum mechanical simulation of materials properties is used in the paper, much of the discussion is as generic as possible so that approaches, methods and practice (e.g. integrated approach, workflow taxonomy and development approach, simple but useful semantic annotation approach) can be applied to wider domains and disciplines to facilitat the digital research. A comparison between our approach and Cloud computing, and lessons learned in data management within the Grid computing environment, are also presented. Copyright © 2012 John Wiley & Sons, Ltd.

Url:
DOI: 10.1002/cpe.2849

Links to Exploration step

ISTEX:66D3FE122A3E7348EE0C5014A65BEFD271BC3ACA

Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">An e‐Science data infrastructure for simulations within Grid computing environment: methods, approaches and practice</title>
<author>
<name sortKey="Yang, Xiaoyu" sort="Yang, Xiaoyu" uniqKey="Yang X" first="Xiaoyu" last="Yang">Xiaoyu Yang</name>
<affiliation>
<mods:affiliation>Department of Earth Sciences, University of Cambridge, Downing Street, Cambridge CB2 3EQ, UK</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>Correspondence to: Xiaoyu Yang, Senior Member, Wolfson College, University of Cambridge, Barton Road, Cambridge, CB3 9BB, UK.E‐mail:</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: kevin.x.yang@wolfsonemail.com</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Dove, Martin T" sort="Dove, Martin T" uniqKey="Dove M" first="Martin T." last="Dove">Martin T. Dove</name>
<affiliation>
<mods:affiliation>Department of Earth Sciences, University of Cambridge, Downing Street, Cambridge CB2 3EQ, UK</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Bruin, Richard P" sort="Bruin, Richard P" uniqKey="Bruin R" first="Richard P." last="Bruin">Richard P. Bruin</name>
<affiliation>
<mods:affiliation>Department of Earth Sciences, University of Cambridge, Downing Street, Cambridge CB2 3EQ, UK</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Walkingshaw, Andrew" sort="Walkingshaw, Andrew" uniqKey="Walkingshaw A" first="Andrew" last="Walkingshaw">Andrew Walkingshaw</name>
<affiliation>
<mods:affiliation>Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, UK</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Sinclair, Richard" sort="Sinclair, Richard" uniqKey="Sinclair R" first="Richard" last="Sinclair">Richard Sinclair</name>
<affiliation>
<mods:affiliation>Daresbury Laboratory, Science and Technologies Facilities Council, UK</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Wilson, Dan J" sort="Wilson, Dan J" uniqKey="Wilson D" first="Dan J." last="Wilson">Dan J. Wilson</name>
<affiliation>
<mods:affiliation>J. W. Goethe University, Frankfurt, Germany</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Murray Ust, Peter" sort="Murray Ust, Peter" uniqKey="Murray Ust P" first="Peter" last="Murray-Rust">Peter Murray-Rust</name>
<affiliation>
<mods:affiliation>Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, UK</mods:affiliation>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:66D3FE122A3E7348EE0C5014A65BEFD271BC3ACA</idno>
<date when="2013" year="2013">2013</date>
<idno type="doi">10.1002/cpe.2849</idno>
<idno type="url">https://api.istex.fr/document/66D3FE122A3E7348EE0C5014A65BEFD271BC3ACA/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000466</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">An e‐Science data infrastructure for simulations within Grid computing environment: methods, approaches and practice</title>
<author>
<name sortKey="Yang, Xiaoyu" sort="Yang, Xiaoyu" uniqKey="Yang X" first="Xiaoyu" last="Yang">Xiaoyu Yang</name>
<affiliation>
<mods:affiliation>Department of Earth Sciences, University of Cambridge, Downing Street, Cambridge CB2 3EQ, UK</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>Correspondence to: Xiaoyu Yang, Senior Member, Wolfson College, University of Cambridge, Barton Road, Cambridge, CB3 9BB, UK.E‐mail:</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: kevin.x.yang@wolfsonemail.com</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Dove, Martin T" sort="Dove, Martin T" uniqKey="Dove M" first="Martin T." last="Dove">Martin T. Dove</name>
<affiliation>
<mods:affiliation>Department of Earth Sciences, University of Cambridge, Downing Street, Cambridge CB2 3EQ, UK</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Bruin, Richard P" sort="Bruin, Richard P" uniqKey="Bruin R" first="Richard P." last="Bruin">Richard P. Bruin</name>
<affiliation>
<mods:affiliation>Department of Earth Sciences, University of Cambridge, Downing Street, Cambridge CB2 3EQ, UK</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Walkingshaw, Andrew" sort="Walkingshaw, Andrew" uniqKey="Walkingshaw A" first="Andrew" last="Walkingshaw">Andrew Walkingshaw</name>
<affiliation>
<mods:affiliation>Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, UK</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Sinclair, Richard" sort="Sinclair, Richard" uniqKey="Sinclair R" first="Richard" last="Sinclair">Richard Sinclair</name>
<affiliation>
<mods:affiliation>Daresbury Laboratory, Science and Technologies Facilities Council, UK</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Wilson, Dan J" sort="Wilson, Dan J" uniqKey="Wilson D" first="Dan J." last="Wilson">Dan J. Wilson</name>
<affiliation>
<mods:affiliation>J. W. Goethe University, Frankfurt, Germany</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Murray Ust, Peter" sort="Murray Ust, Peter" uniqKey="Murray Ust P" first="Peter" last="Murray-Rust">Peter Murray-Rust</name>
<affiliation>
<mods:affiliation>Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, UK</mods:affiliation>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="j">Concurrency and Computation: Practice and Experience</title>
<title level="j" type="abbrev">Concurrency Computat.: Pract. Exper.</title>
<idno type="ISSN">1532-0626</idno>
<idno type="eISSN">1532-0634</idno>
<imprint>
<publisher>Blackwell Publishing Ltd</publisher>
<date type="published" when="2013-03-10">2013-03-10</date>
<biblScope unit="volume">25</biblScope>
<biblScope unit="issue">3</biblScope>
<biblScope unit="page" from="385">385</biblScope>
<biblScope unit="page" to="409">409</biblScope>
</imprint>
<idno type="ISSN">1532-0626</idno>
</series>
<idno type="istex">66D3FE122A3E7348EE0C5014A65BEFD271BC3ACA</idno>
<idno type="DOI">10.1002/cpe.2849</idno>
<idno type="ArticleID">CPE2849</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">1532-0626</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract">Grid‐based simulation usually involves large quantities of data at each stage of the simulation process. These data include simulation input and output files, intermediate results files, log and error files, associated metadata, and information capturing the processes that generate the data. The question of how to effectively store and manage data files within a Grid computing environment is increasingly becoming an important issue. This paper illustrates how we built a lightweight e‐Science infrastructure for data management within a Grid computing environment, including the integration of data curation activities into the entire Grid‐based simulation process. Rather than focusing on specific implementation details, we aim to identify the key issues and research challenges, describing how various existing technologies and tools can be best integrated to address these requirements and challenges. Although the case of quantum mechanical simulation of materials properties is used in the paper, much of the discussion is as generic as possible so that approaches, methods and practice (e.g. integrated approach, workflow taxonomy and development approach, simple but useful semantic annotation approach) can be applied to wider domains and disciplines to facilitat the digital research. A comparison between our approach and Cloud computing, and lessons learned in data management within the Grid computing environment, are also presented. Copyright © 2012 John Wiley & Sons, Ltd.</div>
</front>
</TEI>
<istex>
<corpusName>wiley</corpusName>
<author>
<json:item>
<name>Xiaoyu Yang</name>
<affiliations>
<json:string>Department of Earth Sciences, University of Cambridge, Downing Street, Cambridge CB2 3EQ, UK</json:string>
<json:string>Correspondence to: Xiaoyu Yang, Senior Member, Wolfson College, University of Cambridge, Barton Road, Cambridge, CB3 9BB, UK.E‐mail:</json:string>
<json:string>E-mail: kevin.x.yang@wolfsonemail.com</json:string>
</affiliations>
</json:item>
<json:item>
<name>Martin T. Dove</name>
<affiliations>
<json:string>Department of Earth Sciences, University of Cambridge, Downing Street, Cambridge CB2 3EQ, UK</json:string>
</affiliations>
</json:item>
<json:item>
<name>Richard P. Bruin</name>
<affiliations>
<json:string>Department of Earth Sciences, University of Cambridge, Downing Street, Cambridge CB2 3EQ, UK</json:string>
</affiliations>
</json:item>
<json:item>
<name>Andrew Walkingshaw</name>
<affiliations>
<json:string>Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, UK</json:string>
</affiliations>
</json:item>
<json:item>
<name>Richard Sinclair</name>
<affiliations>
<json:string>Daresbury Laboratory, Science and Technologies Facilities Council, UK</json:string>
</affiliations>
</json:item>
<json:item>
<name>Dan J. Wilson</name>
<affiliations>
<json:string>J. W. Goethe University, Frankfurt, Germany</json:string>
</affiliations>
</json:item>
<json:item>
<name>Peter Murray‐Rust</name>
<affiliations>
<json:string>Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, UK</json:string>
</affiliations>
</json:item>
</author>
<subject>
<json:item>
<lang>
<json:string>eng</json:string>
</lang>
<value>data infrastructure</value>
</json:item>
<json:item>
<lang>
<json:string>eng</json:string>
</lang>
<value>e‐infrastructure</value>
</json:item>
<json:item>
<lang>
<json:string>eng</json:string>
</lang>
<value>cyberinfrastructure</value>
</json:item>
<json:item>
<lang>
<json:string>eng</json:string>
</lang>
<value>e‐Science</value>
</json:item>
<json:item>
<lang>
<json:string>eng</json:string>
</lang>
<value>data curation</value>
</json:item>
<json:item>
<lang>
<json:string>eng</json:string>
</lang>
<value>ontology</value>
</json:item>
<json:item>
<lang>
<json:string>eng</json:string>
</lang>
<value>workflow</value>
</json:item>
</subject>
<articleId>
<json:string>CPE2849</json:string>
</articleId>
<language>
<json:string>eng</json:string>
</language>
<originalGenre>
<json:string>article</json:string>
</originalGenre>
<abstract>Grid‐based simulation usually involves large quantities of data at each stage of the simulation process. These data include simulation input and output files, intermediate results files, log and error files, associated metadata, and information capturing the processes that generate the data. The question of how to effectively store and manage data files within a Grid computing environment is increasingly becoming an important issue. This paper illustrates how we built a lightweight e‐Science infrastructure for data management within a Grid computing environment, including the integration of data curation activities into the entire Grid‐based simulation process. Rather than focusing on specific implementation details, we aim to identify the key issues and research challenges, describing how various existing technologies and tools can be best integrated to address these requirements and challenges. Although the case of quantum mechanical simulation of materials properties is used in the paper, much of the discussion is as generic as possible so that approaches, methods and practice (e.g. integrated approach, workflow taxonomy and development approach, simple but useful semantic annotation approach) can be applied to wider domains and disciplines to facilitat the digital research. A comparison between our approach and Cloud computing, and lessons learned in data management within the Grid computing environment, are also presented. Copyright © 2012 John Wiley & Sons, Ltd.</abstract>
<qualityIndicators>
<score>8.08</score>
<pdfVersion>1.4</pdfVersion>
<pdfPageSize>595.276 x 782.362 pts</pdfPageSize>
<refBibsNative>true</refBibsNative>
<keywordCount>7</keywordCount>
<abstractCharCount>1492</abstractCharCount>
<pdfWordCount>11609</pdfWordCount>
<pdfCharCount>71726</pdfCharCount>
<pdfPageCount>25</pdfPageCount>
<abstractWordCount>215</abstractWordCount>
</qualityIndicators>
<title>An e‐Science data infrastructure for simulations within Grid computing environment: methods, approaches and practice</title>
<genre>
<json:string>article</json:string>
</genre>
<host>
<volume>25</volume>
<publisherId>
<json:string>CPE</json:string>
</publisherId>
<pages>
<total>25</total>
<last>409</last>
<first>385</first>
</pages>
<issn>
<json:string>1532-0626</json:string>
</issn>
<issue>3</issue>
<subject>
<json:item>
<value>Research Article</value>
</json:item>
</subject>
<genre>
<json:string>journal</json:string>
</genre>
<language>
<json:string>unknown</json:string>
</language>
<eissn>
<json:string>1532-0634</json:string>
</eissn>
<title>Concurrency and Computation: Practice and Experience</title>
<doi>
<json:string>10.1002/(ISSN)1532-0634</json:string>
</doi>
</host>
<publicationDate>2013</publicationDate>
<copyrightDate>2013</copyrightDate>
<doi>
<json:string>10.1002/cpe.2849</json:string>
</doi>
<id>66D3FE122A3E7348EE0C5014A65BEFD271BC3ACA</id>
<score>0.15769838</score>
<fulltext>
<json:item>
<original>true</original>
<mimetype>application/pdf</mimetype>
<extension>pdf</extension>
<uri>https://api.istex.fr/document/66D3FE122A3E7348EE0C5014A65BEFD271BC3ACA/fulltext/pdf</uri>
</json:item>
<json:item>
<original>false</original>
<mimetype>application/zip</mimetype>
<extension>zip</extension>
<uri>https://api.istex.fr/document/66D3FE122A3E7348EE0C5014A65BEFD271BC3ACA/fulltext/zip</uri>
</json:item>
<istex:fulltextTEI uri="https://api.istex.fr/document/66D3FE122A3E7348EE0C5014A65BEFD271BC3ACA/fulltext/tei">
<teiHeader>
<fileDesc>
<titleStmt>
<title level="a" type="main" xml:lang="en">An e‐Science data infrastructure for simulations within Grid computing environment: methods, approaches and practice</title>
</titleStmt>
<publicationStmt>
<authority>ISTEX</authority>
<publisher>Blackwell Publishing Ltd</publisher>
<availability>
<p>Copyright © 2013 John Wiley & Sons, Ltd.Copyright © 2012 John Wiley & Sons, Ltd.</p>
</availability>
<date>2012-06-06</date>
</publicationStmt>
<sourceDesc>
<biblStruct type="inbook">
<analytic>
<title level="a" type="main" xml:lang="en">An e‐Science data infrastructure for simulations within Grid computing environment: methods, approaches and practice</title>
<author xml:id="author-1">
<persName>
<forename type="first">Xiaoyu</forename>
<surname>Yang</surname>
</persName>
<email>kevin.x.yang@wolfsonemail.com</email>
<affiliation>Department of Earth Sciences, University of Cambridge, Downing Street, Cambridge CB2 3EQ, UK</affiliation>
<affiliation>Correspondence to: Xiaoyu Yang, Senior Member, Wolfson College, University of Cambridge, Barton Road, Cambridge, CB3 9BB, UK.E‐mail:</affiliation>
</author>
<author xml:id="author-2">
<persName>
<forename type="first">Martin T.</forename>
<surname>Dove</surname>
</persName>
<affiliation>Department of Earth Sciences, University of Cambridge, Downing Street, Cambridge CB2 3EQ, UK</affiliation>
</author>
<author xml:id="author-3">
<persName>
<forename type="first">Richard P.</forename>
<surname>Bruin</surname>
</persName>
<affiliation>Department of Earth Sciences, University of Cambridge, Downing Street, Cambridge CB2 3EQ, UK</affiliation>
</author>
<author xml:id="author-4">
<persName>
<forename type="first">Andrew</forename>
<surname>Walkingshaw</surname>
</persName>
<affiliation>Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, UK</affiliation>
</author>
<author xml:id="author-5">
<persName>
<forename type="first">Richard</forename>
<surname>Sinclair</surname>
</persName>
<affiliation>Daresbury Laboratory, Science and Technologies Facilities Council, UK</affiliation>
</author>
<author xml:id="author-6">
<persName>
<forename type="first">Dan J.</forename>
<surname>Wilson</surname>
</persName>
<affiliation>J. W. Goethe University, Frankfurt, Germany</affiliation>
</author>
<author xml:id="author-7">
<persName>
<forename type="first">Peter</forename>
<surname>Murray‐Rust</surname>
</persName>
<affiliation>Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, UK</affiliation>
</author>
</analytic>
<monogr>
<title level="j">Concurrency and Computation: Practice and Experience</title>
<title level="j" type="abbrev">Concurrency Computat.: Pract. Exper.</title>
<idno type="pISSN">1532-0626</idno>
<idno type="eISSN">1532-0634</idno>
<idno type="DOI">10.1002/(ISSN)1532-0634</idno>
<imprint>
<publisher>Blackwell Publishing Ltd</publisher>
<date type="published" when="2013-03-10"></date>
<biblScope unit="volume">25</biblScope>
<biblScope unit="issue">3</biblScope>
<biblScope unit="page" from="385">385</biblScope>
<biblScope unit="page" to="409">409</biblScope>
</imprint>
</monogr>
<idno type="istex">66D3FE122A3E7348EE0C5014A65BEFD271BC3ACA</idno>
<idno type="DOI">10.1002/cpe.2849</idno>
<idno type="ArticleID">CPE2849</idno>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<creation>
<date>2012-06-06</date>
</creation>
<langUsage>
<language ident="en">en</language>
</langUsage>
<abstract>
<p>Grid‐based simulation usually involves large quantities of data at each stage of the simulation process. These data include simulation input and output files, intermediate results files, log and error files, associated metadata, and information capturing the processes that generate the data. The question of how to effectively store and manage data files within a Grid computing environment is increasingly becoming an important issue. This paper illustrates how we built a lightweight e‐Science infrastructure for data management within a Grid computing environment, including the integration of data curation activities into the entire Grid‐based simulation process. Rather than focusing on specific implementation details, we aim to identify the key issues and research challenges, describing how various existing technologies and tools can be best integrated to address these requirements and challenges. Although the case of quantum mechanical simulation of materials properties is used in the paper, much of the discussion is as generic as possible so that approaches, methods and practice (e.g. integrated approach, workflow taxonomy and development approach, simple but useful semantic annotation approach) can be applied to wider domains and disciplines to facilitat the digital research. A comparison between our approach and Cloud computing, and lessons learned in data management within the Grid computing environment, are also presented. Copyright © 2012 John Wiley & Sons, Ltd.</p>
</abstract>
<textClass>
<keywords scheme="keyword">
<list>
<head>keywords</head>
<item>
<term>data infrastructure</term>
</item>
<item>
<term>e‐infrastructure</term>
</item>
<item>
<term>cyberinfrastructure</term>
</item>
<item>
<term>e‐Science</term>
</item>
<item>
<term>data curation</term>
</item>
<item>
<term>ontology</term>
</item>
<item>
<term>workflow</term>
</item>
</list>
</keywords>
</textClass>
<textClass>
<keywords scheme="Journal Subject">
<list>
<head>article-category</head>
<item>
<term>Research Article</term>
</item>
</list>
</keywords>
</textClass>
</profileDesc>
<revisionDesc>
<change when="2011-10-31">Received</change>
<change when="2012-04-01">Registration</change>
<change when="2012-06-06">Created</change>
<change when="2013-03-10">Published</change>
</revisionDesc>
</teiHeader>
</istex:fulltextTEI>
<json:item>
<original>false</original>
<mimetype>text/plain</mimetype>
<extension>txt</extension>
<uri>https://api.istex.fr/document/66D3FE122A3E7348EE0C5014A65BEFD271BC3ACA/fulltext/txt</uri>
</json:item>
</fulltext>
<metadata>
<istex:metadataXml wicri:clean="Wiley, elements deleted: body">
<istex:xmlDeclaration>version="1.0" encoding="UTF-8" standalone="yes"</istex:xmlDeclaration>
<istex:document>
<component type="serialArticle" version="2.0" xml:lang="en" xml:id="cpe2849">
<header xml:id="cpe2849-hdr-0001">
<publicationMeta level="product">
<doi>10.1002/(ISSN)1532-0634</doi>
<issn type="print">1532-0626</issn>
<issn type="electronic">1532-0634</issn>
<idGroup>
<id type="product" value="CPE"></id>
</idGroup>
<titleGroup>
<title type="main" sort="CONCURRENCY AND COMPUTATION: PRACTICE AND EXPERIENCE">Concurrency and Computation: Practice and Experience</title>
<title type="short">Concurrency Computat.: Pract. Exper.</title>
</titleGroup>
</publicationMeta>
<publicationMeta level="part" position="30">
<doi>10.1002/cpe.v25.3</doi>
<copyright ownership="publisher">Copyright © 2013 John Wiley & Sons, Ltd.</copyright>
<numberingGroup>
<numbering type="journalVolume" number="25">25</numbering>
<numbering type="journalIssue">3</numbering>
</numberingGroup>
<coverDate startDate="2013-03-10">10 March 2013</coverDate>
</publicationMeta>
<publicationMeta level="unit" position="50" type="article" status="forIssue">
<doi>10.1002/cpe.2849</doi>
<idGroup>
<id type="unit" value="CPE2849"></id>
</idGroup>
<countGroup>
<count type="pageTotal" number="25"></count>
</countGroup>
<titleGroup>
<title type="articleCategory">Research Article</title>
<title type="tocHeading1">Research Articles</title>
</titleGroup>
<copyright ownership="publisher">Copyright © 2012 John Wiley & Sons, Ltd.</copyright>
<eventGroup>
<event type="manuscriptReceived" date="2011-10-31"></event>
<event type="manuscriptRevised" date="2012-03-18"></event>
<event type="manuscriptAccepted" date="2012-04-01"></event>
<event type="xmlCreated" agent="SPi Global" date="2012-06-06"></event>
<event type="publishedOnlineEarlyUnpaginated" date="2012-06-12"></event>
<event type="firstOnline" date="2012-06-12"></event>
<event type="publishedOnlineFinalForm" date="2013-02-08"></event>
<event type="xmlConverted" agent="Converter:WILEY_ML3G_TO_WILEY_ML3GV2 version:3.8.8" date="2014-01-16"></event>
<event type="xmlConverted" agent="Converter:WML3G_To_WML3G version:4.6.4 mode:FullText" date="2015-10-02"></event>
</eventGroup>
<numberingGroup>
<numbering type="pageFirst">385</numbering>
<numbering type="pageLast">409</numbering>
</numberingGroup>
<correspondenceTo>
<lineatedText xml:id="cpe2849-lntext-0001">
<line>Correspondence to: Xiaoyu Yang, Senior Member, Wolfson College, University of Cambridge, Barton Road, Cambridge, CB3 9BB, UK.</line>
<line>E‐mail:
<email>kevin.x.yang@wolfsonemail.com</email>
</line>
</lineatedText>
</correspondenceTo>
<linkGroup>
<link type="toTypesetVersion" href="file:CPE.CPE2849.pdf"></link>
</linkGroup>
</publicationMeta>
<contentMeta>
<titleGroup>
<title type="main">An e‐Science data infrastructure for simulations within Grid computing environment: methods, approaches and practice</title>
<title type="shortAuthors">X. YANG
<i>ET AL.</i>
</title>
<title type="short">E‐SCIENCE DATA INFRASTRUCTURE</title>
</titleGroup>
<creators>
<creator xml:id="cpe2849-cr-0001" creatorRole="author" affiliationRef="#cpe2849-aff-0001" corresponding="yes">
<personName>
<givenNames>Xiaoyu</givenNames>
<familyName>Yang</familyName>
</personName>
</creator>
<creator xml:id="cpe2849-cr-0002" creatorRole="author" affiliationRef="#cpe2849-aff-0001">
<personName>
<givenNames>Martin T.</givenNames>
<familyName>Dove</familyName>
</personName>
</creator>
<creator xml:id="cpe2849-cr-0003" creatorRole="author" affiliationRef="#cpe2849-aff-0001">
<personName>
<givenNames>Richard P.</givenNames>
<familyName>Bruin</familyName>
</personName>
</creator>
<creator xml:id="cpe2849-cr-0004" creatorRole="author" affiliationRef="#cpe2849-aff-0002">
<personName>
<givenNames>Andrew</givenNames>
<familyName>Walkingshaw</familyName>
</personName>
</creator>
<creator xml:id="cpe2849-cr-0005" creatorRole="author" affiliationRef="#cpe2849-aff-0003">
<personName>
<givenNames>Richard</givenNames>
<familyName>Sinclair</familyName>
</personName>
</creator>
<creator xml:id="cpe2849-cr-0006" creatorRole="author" affiliationRef="#cpe2849-aff-0004">
<personName>
<givenNames>Dan J.</givenNames>
<familyName>Wilson</familyName>
</personName>
</creator>
<creator xml:id="cpe2849-cr-0007" creatorRole="author" affiliationRef="#cpe2849-aff-0002">
<personName>
<givenNames>Peter</givenNames>
<familyName>Murray‐Rust</familyName>
</personName>
</creator>
</creators>
<affiliationGroup>
<affiliation xml:id="cpe2849-aff-0001" countryCode="GB" type="organization">
<orgDiv>Department of Earth Sciences</orgDiv>
<orgName>University of Cambridge</orgName>
<address>
<street>Downing Street</street>
<city>Cambridge CB2 3EQ</city>
<country>UK</country>
</address>
</affiliation>
<affiliation xml:id="cpe2849-aff-0002" countryCode="GB" type="organization">
<orgDiv>Department of Chemistry</orgDiv>
<orgName>University of Cambridge</orgName>
<address>
<street>Lensfield Road</street>
<city>Cambridge CB2 1EW</city>
<country>UK</country>
</address>
</affiliation>
<affiliation xml:id="cpe2849-aff-0003" countryCode="GB" type="organization">
<orgDiv>Daresbury Laboratory</orgDiv>
<orgName>Science and Technologies Facilities Council</orgName>
<address>
<country>UK</country>
</address>
</affiliation>
<affiliation xml:id="cpe2849-aff-0004" countryCode="DE" type="organization">
<orgName>J. W. Goethe University</orgName>
<address>
<city>Frankfurt</city>
<country>Germany</country>
</address>
</affiliation>
</affiliationGroup>
<keywordGroup type="author">
<keyword xml:id="cpe2849-kwd-0001">data infrastructure</keyword>
<keyword xml:id="cpe2849-kwd-0002">e‐infrastructure</keyword>
<keyword xml:id="cpe2849-kwd-0003">cyberinfrastructure</keyword>
<keyword xml:id="cpe2849-kwd-0004">e‐Science</keyword>
<keyword xml:id="cpe2849-kwd-0005">data curation</keyword>
<keyword xml:id="cpe2849-kwd-0006">ontology</keyword>
<keyword xml:id="cpe2849-kwd-0007">workflow</keyword>
</keywordGroup>
<abstractGroup>
<abstract type="main" xml:id="cpe2849-abs-0001">
<title type="main">SUMMARY</title>
<p xml:id="cpe2849-para-0001">Grid‐based simulation usually involves large quantities of data at each stage of the simulation process. These data include simulation input and output files, intermediate results files, log and error files, associated metadata, and information capturing the processes that generate the data. The question of how to effectively store and manage data files within a Grid computing environment is increasingly becoming an important issue. This paper illustrates how we built a lightweight e‐Science infrastructure for data management within a Grid computing environment, including the integration of data curation activities into the entire Grid‐based simulation process. Rather than focusing on specific implementation details, we aim to identify the key issues and research challenges, describing how various existing technologies and tools can be best integrated to address these requirements and challenges. Although the case of quantum mechanical simulation of materials properties is used in the paper, much of the discussion is as generic as possible so that approaches, methods and practice (e.g. integrated approach, workflow taxonomy and development approach, simple but useful semantic annotation approach) can be applied to wider domains and disciplines to facilitat the digital research. A comparison between our approach and Cloud computing, and lessons learned in data management within the Grid computing environment, are also presented. Copyright © 2012 John Wiley & Sons, Ltd.</p>
</abstract>
</abstractGroup>
</contentMeta>
</header>
</component>
</istex:document>
</istex:metadataXml>
<mods version="3.6">
<titleInfo lang="en">
<title>An e‐Science data infrastructure for simulations within Grid computing environment: methods, approaches and practice</title>
</titleInfo>
<titleInfo type="abbreviated" lang="en">
<title>E‐SCIENCE DATA INFRASTRUCTURE</title>
</titleInfo>
<titleInfo type="alternative" contentType="CDATA" lang="en">
<title>An e‐Science data infrastructure for simulations within Grid computing environment: methods, approaches and practice</title>
</titleInfo>
<name type="personal">
<namePart type="given">Xiaoyu</namePart>
<namePart type="family">Yang</namePart>
<affiliation>Department of Earth Sciences, University of Cambridge, Downing Street, Cambridge CB2 3EQ, UK</affiliation>
<affiliation>Correspondence to: Xiaoyu Yang, Senior Member, Wolfson College, University of Cambridge, Barton Road, Cambridge, CB3 9BB, UK.E‐mail:</affiliation>
<affiliation>E-mail: kevin.x.yang@wolfsonemail.com</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Martin T.</namePart>
<namePart type="family">Dove</namePart>
<affiliation>Department of Earth Sciences, University of Cambridge, Downing Street, Cambridge CB2 3EQ, UK</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Richard P.</namePart>
<namePart type="family">Bruin</namePart>
<affiliation>Department of Earth Sciences, University of Cambridge, Downing Street, Cambridge CB2 3EQ, UK</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Andrew</namePart>
<namePart type="family">Walkingshaw</namePart>
<affiliation>Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, UK</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Richard</namePart>
<namePart type="family">Sinclair</namePart>
<affiliation>Daresbury Laboratory, Science and Technologies Facilities Council, UK</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Dan J.</namePart>
<namePart type="family">Wilson</namePart>
<affiliation>J. W. Goethe University, Frankfurt, Germany</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Peter</namePart>
<namePart type="family">Murray‐Rust</namePart>
<affiliation>Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, UK</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<typeOfResource>text</typeOfResource>
<genre type="article" displayLabel="article"></genre>
<originInfo>
<publisher>Blackwell Publishing Ltd</publisher>
<dateIssued encoding="w3cdtf">2013-03-10</dateIssued>
<dateCreated encoding="w3cdtf">2012-06-06</dateCreated>
<dateCaptured encoding="w3cdtf">2011-10-31</dateCaptured>
<dateValid encoding="w3cdtf">2012-04-01</dateValid>
<copyrightDate encoding="w3cdtf">2013</copyrightDate>
</originInfo>
<language>
<languageTerm type="code" authority="rfc3066">en</languageTerm>
<languageTerm type="code" authority="iso639-2b">eng</languageTerm>
</language>
<physicalDescription>
<internetMediaType>text/html</internetMediaType>
</physicalDescription>
<abstract>Grid‐based simulation usually involves large quantities of data at each stage of the simulation process. These data include simulation input and output files, intermediate results files, log and error files, associated metadata, and information capturing the processes that generate the data. The question of how to effectively store and manage data files within a Grid computing environment is increasingly becoming an important issue. This paper illustrates how we built a lightweight e‐Science infrastructure for data management within a Grid computing environment, including the integration of data curation activities into the entire Grid‐based simulation process. Rather than focusing on specific implementation details, we aim to identify the key issues and research challenges, describing how various existing technologies and tools can be best integrated to address these requirements and challenges. Although the case of quantum mechanical simulation of materials properties is used in the paper, much of the discussion is as generic as possible so that approaches, methods and practice (e.g. integrated approach, workflow taxonomy and development approach, simple but useful semantic annotation approach) can be applied to wider domains and disciplines to facilitat the digital research. A comparison between our approach and Cloud computing, and lessons learned in data management within the Grid computing environment, are also presented. Copyright © 2012 John Wiley & Sons, Ltd.</abstract>
<subject>
<genre>keywords</genre>
<topic>data infrastructure</topic>
<topic>e‐infrastructure</topic>
<topic>cyberinfrastructure</topic>
<topic>e‐Science</topic>
<topic>data curation</topic>
<topic>ontology</topic>
<topic>workflow</topic>
</subject>
<relatedItem type="host">
<titleInfo>
<title>Concurrency and Computation: Practice and Experience</title>
</titleInfo>
<titleInfo type="abbreviated">
<title>Concurrency Computat.: Pract. Exper.</title>
</titleInfo>
<genre type="journal">journal</genre>
<subject>
<genre>article-category</genre>
<topic>Research Article</topic>
</subject>
<identifier type="ISSN">1532-0626</identifier>
<identifier type="eISSN">1532-0634</identifier>
<identifier type="DOI">10.1002/(ISSN)1532-0634</identifier>
<identifier type="PublisherID">CPE</identifier>
<part>
<date>2013</date>
<detail type="volume">
<caption>vol.</caption>
<number>25</number>
</detail>
<detail type="issue">
<caption>no.</caption>
<number>3</number>
</detail>
<extent unit="pages">
<start>385</start>
<end>409</end>
<total>25</total>
</extent>
</part>
</relatedItem>
<identifier type="istex">66D3FE122A3E7348EE0C5014A65BEFD271BC3ACA</identifier>
<identifier type="DOI">10.1002/cpe.2849</identifier>
<identifier type="ArticleID">CPE2849</identifier>
<accessCondition type="use and reproduction" contentType="copyright">Copyright © 2013 John Wiley & Sons, Ltd.Copyright © 2012 John Wiley & Sons, Ltd.</accessCondition>
<recordInfo>
<recordContentSource>WILEY</recordContentSource>
</recordInfo>
</mods>
</metadata>
<enrichments>
<json:item>
<type>multicat</type>
<uri>https://api.istex.fr/document/66D3FE122A3E7348EE0C5014A65BEFD271BC3ACA/enrichments/multicat</uri>
</json:item>
</enrichments>
<serie></serie>
</istex>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/CyberinfraV1/Data/Istex/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000466 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Istex/Corpus/biblio.hfd -nk 000466 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    CyberinfraV1
   |flux=    Istex
   |étape=   Corpus
   |type=    RBID
   |clé=     ISTEX:66D3FE122A3E7348EE0C5014A65BEFD271BC3ACA
   |texte=   An e‐Science data infrastructure for simulations within Grid computing environment: methods, approaches and practice
}}

Wicri

This area was generated with Dilib version V0.6.25.
Data generation: Thu Oct 27 09:30:58 2016. Site generation: Sun Mar 10 23:08:40 2024