Semantic web data warehousing for caGrid
Internal identifier: 000B89 (Main/Merge); previous: 000B88; next: 000B90
Authors: James P. Mccusker [United States]; Joshua A. Phillips [United States]; Alejandra González Beltrán [United Kingdom]; Anthony Finkelstein [United Kingdom]; Michael Krauthammer [United States]
Source:
- BMC Bioinformatics [ 1471-2105 ] ; 2009.
Abstract
The National Cancer Institute (NCI) is developing caGrid as a means for sharing cancer-related data and services. As more data sets become available on caGrid, we need effective ways of accessing and integrating this information. Although the data models exposed on caGrid are semantically well annotated, it is currently up to the caGrid client to infer relationships between the different models and their classes. In this paper, we present a Semantic Web-based data warehouse (Corvus) for creating relationships among caGrid models. This is accomplished through the transformation of semantically-annotated caBIG® Unified Modeling Language (UML) information models into Web Ontology Language (OWL) ontologies that preserve those semantics. We demonstrate the validity of the approach by Semantic Extraction, Transformation and Loading (SETL) of data from two caGrid data sources, caTissue and caArray, as well as alignment and query of those sources in Corvus. We argue that semantic integration is necessary for integration of data from distributed web services and that Corvus is a useful way of accomplishing this. Our approach is generalizable and of broad utility to researchers facing similar integration challenges.
Url:
DOI: 10.1186/1471-2105-10-S10-S2
PubMed: 19796399
PubMed Central: 2755823
Links toward previous steps (curation, corpus...)
- to stream Pmc, to step Corpus: 000209
- to stream Pmc, to step Curation: 000209
- to stream Pmc, to step Checkpoint: 000617
- to stream Ncbi, to step Merge: 000109
- to stream Ncbi, to step Curation: 000109
- to stream Ncbi, to step Checkpoint: 000109
Links to Exploration step
PMC:2755823
The document in XML format
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en">Semantic web data warehousing for caGrid</title>
<author><name sortKey="Mccusker, James P" sort="Mccusker, James P" uniqKey="Mccusker J" first="James P" last="Mccusker">James P. Mccusker</name>
<affiliation wicri:level="2"><nlm:aff id="I1">Department of Pathology, Yale University School of Medicine, New Haven, CT, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Pathology, Yale University School of Medicine, New Haven, CT</wicri:regionArea>
<placeName><region type="state">Connecticut</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Phillips, Joshua A" sort="Phillips, Joshua A" uniqKey="Phillips J" first="Joshua A" last="Phillips">Joshua A. Phillips</name>
<affiliation wicri:level="2"><nlm:aff id="I2">Semantic Bits, LLC, Reston, VA, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Semantic Bits, LLC, Reston, VA</wicri:regionArea>
<placeName><region type="state">Virginie</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Beltran, Alejandra Gonzalez" sort="Beltran, Alejandra Gonzalez" uniqKey="Beltran A" first="Alejandra González" last="Beltrán">Alejandra González Beltrán</name>
<affiliation wicri:level="4"><nlm:aff id="I3">Department of Computer Science, University College London, London, UK</nlm:aff>
<country xml:lang="fr">Royaume-Uni</country>
<wicri:regionArea>Department of Computer Science, University College London, London</wicri:regionArea>
<placeName><settlement type="city">Londres</settlement>
<region type="country">Angleterre</region>
<region type="région" nuts="1">Grand Londres</region>
</placeName>
<orgName type="university">University College de Londres</orgName>
</affiliation>
</author>
<author><name sortKey="Finkelstein, Anthony" sort="Finkelstein, Anthony" uniqKey="Finkelstein A" first="Anthony" last="Finkelstein">Anthony Finkelstein</name>
<affiliation wicri:level="4"><nlm:aff id="I3">Department of Computer Science, University College London, London, UK</nlm:aff>
<country xml:lang="fr">Royaume-Uni</country>
<wicri:regionArea>Department of Computer Science, University College London, London</wicri:regionArea>
<placeName><settlement type="city">Londres</settlement>
<region type="country">Angleterre</region>
<region type="région" nuts="1">Grand Londres</region>
</placeName>
<orgName type="university">University College de Londres</orgName>
</affiliation>
</author>
<author><name sortKey="Krauthammer, Michael" sort="Krauthammer, Michael" uniqKey="Krauthammer M" first="Michael" last="Krauthammer">Michael Krauthammer</name>
<affiliation wicri:level="2"><nlm:aff id="I1">Department of Pathology, Yale University School of Medicine, New Haven, CT, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Pathology, Yale University School of Medicine, New Haven, CT</wicri:regionArea>
<placeName><region type="state">Connecticut</region>
</placeName>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">PMC</idno>
<idno type="pmid">19796399</idno>
<idno type="pmc">2755823</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2755823</idno>
<idno type="RBID">PMC:2755823</idno>
<idno type="doi">10.1186/1471-2105-10-S10-S2</idno>
<date when="2009">2009</date>
<idno type="wicri:Area/Pmc/Corpus">000209</idno>
<idno type="wicri:Area/Pmc/Curation">000209</idno>
<idno type="wicri:Area/Pmc/Checkpoint">000617</idno>
<idno type="wicri:Area/Ncbi/Merge">000109</idno>
<idno type="wicri:Area/Ncbi/Curation">000109</idno>
<idno type="wicri:Area/Ncbi/Checkpoint">000109</idno>
<idno type="wicri:Area/Main/Merge">000B89</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a" type="main">Semantic web data warehousing for caGrid</title>
<author><name sortKey="Mccusker, James P" sort="Mccusker, James P" uniqKey="Mccusker J" first="James P" last="Mccusker">James P. Mccusker</name>
<affiliation wicri:level="2"><nlm:aff id="I1">Department of Pathology, Yale University School of Medicine, New Haven, CT, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Pathology, Yale University School of Medicine, New Haven, CT</wicri:regionArea>
<placeName><region type="state">Connecticut</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Phillips, Joshua A" sort="Phillips, Joshua A" uniqKey="Phillips J" first="Joshua A" last="Phillips">Joshua A. Phillips</name>
<affiliation wicri:level="2"><nlm:aff id="I2">Semantic Bits, LLC, Reston, VA, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Semantic Bits, LLC, Reston, VA</wicri:regionArea>
<placeName><region type="state">Virginie</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Beltran, Alejandra Gonzalez" sort="Beltran, Alejandra Gonzalez" uniqKey="Beltran A" first="Alejandra González" last="Beltrán">Alejandra González Beltrán</name>
<affiliation wicri:level="4"><nlm:aff id="I3">Department of Computer Science, University College London, London, UK</nlm:aff>
<country xml:lang="fr">Royaume-Uni</country>
<wicri:regionArea>Department of Computer Science, University College London, London</wicri:regionArea>
<placeName><settlement type="city">Londres</settlement>
<region type="country">Angleterre</region>
<region type="région" nuts="1">Grand Londres</region>
</placeName>
<orgName type="university">University College de Londres</orgName>
</affiliation>
</author>
<author><name sortKey="Finkelstein, Anthony" sort="Finkelstein, Anthony" uniqKey="Finkelstein A" first="Anthony" last="Finkelstein">Anthony Finkelstein</name>
<affiliation wicri:level="4"><nlm:aff id="I3">Department of Computer Science, University College London, London, UK</nlm:aff>
<country xml:lang="fr">Royaume-Uni</country>
<wicri:regionArea>Department of Computer Science, University College London, London</wicri:regionArea>
<placeName><settlement type="city">Londres</settlement>
<region type="country">Angleterre</region>
<region type="région" nuts="1">Grand Londres</region>
</placeName>
<orgName type="university">University College de Londres</orgName>
</affiliation>
</author>
<author><name sortKey="Krauthammer, Michael" sort="Krauthammer, Michael" uniqKey="Krauthammer M" first="Michael" last="Krauthammer">Michael Krauthammer</name>
<affiliation wicri:level="2"><nlm:aff id="I1">Department of Pathology, Yale University School of Medicine, New Haven, CT, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Pathology, Yale University School of Medicine, New Haven, CT</wicri:regionArea>
<placeName><region type="state">Connecticut</region>
</placeName>
</affiliation>
</author>
</analytic>
<series><title level="j">BMC Bioinformatics</title>
<idno type="eISSN">1471-2105</idno>
<imprint><date when="2009">2009</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass></textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en"><p>The National Cancer Institute (NCI) is developing caGrid as a means for sharing cancer-related data and services. As more data sets become available on caGrid, we need effective ways of accessing and integrating this information. Although the data models exposed on caGrid are semantically well annotated, it is currently up to the caGrid client to infer relationships between the different models and their classes. In this paper, we present a Semantic Web-based data warehouse (Corvus) for creating relationships among caGrid models. This is accomplished through the transformation of semantically-annotated caBIG<sup>® </sup>
Unified Modeling Language (UML) information models into Web Ontology Language (OWL) ontologies that preserve those semantics. We demonstrate the validity of the approach by Semantic Extraction, Transformation and Loading (SETL) of data from two caGrid data sources, caTissue and caArray, as well as alignment and query of those sources in Corvus. We argue that semantic integration is necessary for integration of data from distributed web services and that Corvus is a useful way of accomplishing this. Our approach is generalizable and of broad utility to researchers facing similar integration challenges.</p>
</div>
</front>
</TEI>
</record>
To manipulate this document under Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/CyberinfraV1/Data/Main/Merge
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000B89 | SxmlIndent | more
Or
HfdSelect -h $EXPLOR_AREA/Data/Main/Merge/biblio.hfd -nk 000B89 | SxmlIndent | more
To link to this page within the Wicri network
{{Explor lien |wiki= Ticri/CIDE |area= CyberinfraV1 |flux= Main |étape= Merge |type= RBID |clé= PMC:2755823 |texte= Semantic web data warehousing for caGrid }}
To generate wiki pages
HfdIndexSelect -h $EXPLOR_AREA/Data/Main/Merge/RBID.i -Sk "pubmed:19796399" \
  | HfdSelect -Kh $EXPLOR_AREA/Data/Main/Merge/biblio.hfd \
  | NlmPubMed2Wicri -a CyberinfraV1
This area was generated with Dilib version V0.6.25.