Serveur d'exploration Cyberinfrastructure

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

A Semantic Web Management Model for Integrative Biomedical Informatics

Identifieur interne : 000312 ( Pmc/Corpus ); précédent : 000311; suivant : 000313

A Semantic Web Management Model for Integrative Biomedical Informatics

Auteurs : Helena F. Deus ; Romesh Stanislaus ; Diogo F. Veiga ; Carmen Behrens ; Ignacio I. Wistuba ; John D. Minna ; Harold R. Garner ; Stephen G. Swisher ; Jack A. Roth ; Arlene M. Correa ; Bradley Broom ; Kevin Coombes ; Allen Chang ; Lynn H. Vogel ; Jonas S. Almeida

Source :

RBID : PMC:2491554

Abstract

Background

Data, data everywhere. The diversity and magnitude of the data generated in the Life Sciences defies automated articulation among complementary efforts. The additional need in this field for managing property and access permissions compounds the difficulty very significantly. This is particularly the case when the integration involves multiple domains and disciplines, even more so when it includes clinical and high throughput molecular data.

Methodology/Principal Findings

The emergence of Semantic Web technologies brings the promise of meaningful interoperation between data and analysis resources. In this report we identify a core model for biomedical Knowledge Engineering applications and demonstrate how this new technology can be used to weave a management model where multiple intertwined data structures can be hosted and managed by multiple authorities in a distributed management infrastructure. Specifically, the demonstration is performed by linking data sources associated with the Lung Cancer SPORE awarded to The University of Texas MDAnderson Cancer Center at Houston and the Southwestern Medical Center at Dallas. A software prototype, available with open source at www.s3db.org, was developed and its proposed design has been made publicly available as an open source instrument for shared, distributed data management.

Conclusions/Significance

The Semantic Web technologies have the potential to addresses the need for distributed and evolvable representations that are critical for systems Biology and translational biomedical research. As this technology is incorporated into application development we can expect that both general purpose productivity software and domain specific software installed on our personal computers will become increasingly integrated with the relevant remote resources. In this scenario, the acquisition of a new dataset should automatically trigger the delegation of its analysis.


Url:
DOI: 10.1371/journal.pone.0002946
PubMed: 18698353
PubMed Central: 2491554

Links to Exploration step

PMC:2491554

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">A Semantic Web Management Model for Integrative Biomedical Informatics</title>
<author>
<name sortKey="Deus, Helena F" sort="Deus, Helena F" uniqKey="Deus H" first="Helena F." last="Deus">Helena F. Deus</name>
<affiliation>
<nlm:aff id="aff1">
<addr-line>Department of Bioinformatics and Computational Biology, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America</addr-line>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="aff2">
<addr-line>Instituto de Tecnologia Química e Biológica, Universidade Nova de Lisboa, Lisboa, Portugal</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Stanislaus, Romesh" sort="Stanislaus, Romesh" uniqKey="Stanislaus R" first="Romesh" last="Stanislaus">Romesh Stanislaus</name>
<affiliation>
<nlm:aff id="aff1">
<addr-line>Department of Bioinformatics and Computational Biology, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Veiga, Diogo F" sort="Veiga, Diogo F" uniqKey="Veiga D" first="Diogo F." last="Veiga">Diogo F. Veiga</name>
<affiliation>
<nlm:aff id="aff1">
<addr-line>Department of Bioinformatics and Computational Biology, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Behrens, Carmen" sort="Behrens, Carmen" uniqKey="Behrens C" first="Carmen" last="Behrens">Carmen Behrens</name>
<affiliation>
<nlm:aff id="aff3">
<addr-line>Department of Thoracic/Head and Neck Medical Oncology, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Wistuba, Ignacio I" sort="Wistuba, Ignacio I" uniqKey="Wistuba I" first="Ignacio I." last="Wistuba">Ignacio I. Wistuba</name>
<affiliation>
<nlm:aff id="aff3">
<addr-line>Department of Thoracic/Head and Neck Medical Oncology, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America</addr-line>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="aff4">
<addr-line>Department of Pathology, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Minna, John D" sort="Minna, John D" uniqKey="Minna J" first="John D." last="Minna">John D. Minna</name>
<affiliation>
<nlm:aff id="aff5">
<addr-line>Hamon Center for Therapeutic Oncology Research, Simmons Cancer Center, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Garner, Harold R" sort="Garner, Harold R" uniqKey="Garner H" first="Harold R." last="Garner">Harold R. Garner</name>
<affiliation>
<nlm:aff id="aff5">
<addr-line>Hamon Center for Therapeutic Oncology Research, Simmons Cancer Center, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America</addr-line>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="aff6">
<addr-line>Department of Internal Medicine, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America</addr-line>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="aff7">
<addr-line>Eugene McDermott Center for Human Growth and Development, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America</addr-line>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="aff8">
<addr-line>Center for Biomedical Inventions, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America</addr-line>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="aff9">
<addr-line>Department of Biochemistry, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Swisher, Stephen G" sort="Swisher, Stephen G" uniqKey="Swisher S" first="Stephen G." last="Swisher">Stephen G. Swisher</name>
<affiliation>
<nlm:aff id="aff10">
<addr-line>Department of Thoracic and Cardiovascular Surgery, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Roth, Jack A" sort="Roth, Jack A" uniqKey="Roth J" first="Jack A." last="Roth">Jack A. Roth</name>
<affiliation>
<nlm:aff id="aff10">
<addr-line>Department of Thoracic and Cardiovascular Surgery, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Correa, Arlene M" sort="Correa, Arlene M" uniqKey="Correa A" first="Arlene M." last="Correa">Arlene M. Correa</name>
<affiliation>
<nlm:aff id="aff10">
<addr-line>Department of Thoracic and Cardiovascular Surgery, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Broom, Bradley" sort="Broom, Bradley" uniqKey="Broom B" first="Bradley" last="Broom">Bradley Broom</name>
<affiliation>
<nlm:aff id="aff1">
<addr-line>Department of Bioinformatics and Computational Biology, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Coombes, Kevin" sort="Coombes, Kevin" uniqKey="Coombes K" first="Kevin" last="Coombes">Kevin Coombes</name>
<affiliation>
<nlm:aff id="aff1">
<addr-line>Department of Bioinformatics and Computational Biology, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Chang, Allen" sort="Chang, Allen" uniqKey="Chang A" first="Allen" last="Chang">Allen Chang</name>
<affiliation>
<nlm:aff id="aff1">
<addr-line>Department of Bioinformatics and Computational Biology, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Vogel, Lynn H" sort="Vogel, Lynn H" uniqKey="Vogel L" first="Lynn H." last="Vogel">Lynn H. Vogel</name>
<affiliation>
<nlm:aff id="aff1">
<addr-line>Department of Bioinformatics and Computational Biology, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America</addr-line>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="aff11">
<addr-line>Department of Biomedical Informatics, Columbia University, New York, New York, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Almeida, Jonas S" sort="Almeida, Jonas S" uniqKey="Almeida J" first="Jonas S." last="Almeida">Jonas S. Almeida</name>
<affiliation>
<nlm:aff id="aff1">
<addr-line>Department of Bioinformatics and Computational Biology, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PMC</idno>
<idno type="pmid">18698353</idno>
<idno type="pmc">2491554</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2491554</idno>
<idno type="RBID">PMC:2491554</idno>
<idno type="doi">10.1371/journal.pone.0002946</idno>
<date when="2008">2008</date>
<idno type="wicri:Area/Pmc/Corpus">000312</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a" type="main">A Semantic Web Management Model for Integrative Biomedical Informatics</title>
<author>
<name sortKey="Deus, Helena F" sort="Deus, Helena F" uniqKey="Deus H" first="Helena F." last="Deus">Helena F. Deus</name>
<affiliation>
<nlm:aff id="aff1">
<addr-line>Department of Bioinformatics and Computational Biology, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America</addr-line>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="aff2">
<addr-line>Instituto de Tecnologia Química e Biológica, Universidade Nova de Lisboa, Lisboa, Portugal</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Stanislaus, Romesh" sort="Stanislaus, Romesh" uniqKey="Stanislaus R" first="Romesh" last="Stanislaus">Romesh Stanislaus</name>
<affiliation>
<nlm:aff id="aff1">
<addr-line>Department of Bioinformatics and Computational Biology, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Veiga, Diogo F" sort="Veiga, Diogo F" uniqKey="Veiga D" first="Diogo F." last="Veiga">Diogo F. Veiga</name>
<affiliation>
<nlm:aff id="aff1">
<addr-line>Department of Bioinformatics and Computational Biology, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Behrens, Carmen" sort="Behrens, Carmen" uniqKey="Behrens C" first="Carmen" last="Behrens">Carmen Behrens</name>
<affiliation>
<nlm:aff id="aff3">
<addr-line>Department of Thoracic/Head and Neck Medical Oncology, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Wistuba, Ignacio I" sort="Wistuba, Ignacio I" uniqKey="Wistuba I" first="Ignacio I." last="Wistuba">Ignacio I. Wistuba</name>
<affiliation>
<nlm:aff id="aff3">
<addr-line>Department of Thoracic/Head and Neck Medical Oncology, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America</addr-line>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="aff4">
<addr-line>Department of Pathology, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Minna, John D" sort="Minna, John D" uniqKey="Minna J" first="John D." last="Minna">John D. Minna</name>
<affiliation>
<nlm:aff id="aff5">
<addr-line>Hamon Center for Therapeutic Oncology Research, Simmons Cancer Center, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Garner, Harold R" sort="Garner, Harold R" uniqKey="Garner H" first="Harold R." last="Garner">Harold R. Garner</name>
<affiliation>
<nlm:aff id="aff5">
<addr-line>Hamon Center for Therapeutic Oncology Research, Simmons Cancer Center, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America</addr-line>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="aff6">
<addr-line>Department of Internal Medicine, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America</addr-line>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="aff7">
<addr-line>Eugene McDermott Center for Human Growth and Development, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America</addr-line>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="aff8">
<addr-line>Center for Biomedical Inventions, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America</addr-line>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="aff9">
<addr-line>Department of Biochemistry, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Swisher, Stephen G" sort="Swisher, Stephen G" uniqKey="Swisher S" first="Stephen G." last="Swisher">Stephen G. Swisher</name>
<affiliation>
<nlm:aff id="aff10">
<addr-line>Department of Thoracic and Cardiovascular Surgery, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Roth, Jack A" sort="Roth, Jack A" uniqKey="Roth J" first="Jack A." last="Roth">Jack A. Roth</name>
<affiliation>
<nlm:aff id="aff10">
<addr-line>Department of Thoracic and Cardiovascular Surgery, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Correa, Arlene M" sort="Correa, Arlene M" uniqKey="Correa A" first="Arlene M." last="Correa">Arlene M. Correa</name>
<affiliation>
<nlm:aff id="aff10">
<addr-line>Department of Thoracic and Cardiovascular Surgery, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Broom, Bradley" sort="Broom, Bradley" uniqKey="Broom B" first="Bradley" last="Broom">Bradley Broom</name>
<affiliation>
<nlm:aff id="aff1">
<addr-line>Department of Bioinformatics and Computational Biology, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Coombes, Kevin" sort="Coombes, Kevin" uniqKey="Coombes K" first="Kevin" last="Coombes">Kevin Coombes</name>
<affiliation>
<nlm:aff id="aff1">
<addr-line>Department of Bioinformatics and Computational Biology, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Chang, Allen" sort="Chang, Allen" uniqKey="Chang A" first="Allen" last="Chang">Allen Chang</name>
<affiliation>
<nlm:aff id="aff1">
<addr-line>Department of Bioinformatics and Computational Biology, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Vogel, Lynn H" sort="Vogel, Lynn H" uniqKey="Vogel L" first="Lynn H." last="Vogel">Lynn H. Vogel</name>
<affiliation>
<nlm:aff id="aff1">
<addr-line>Department of Bioinformatics and Computational Biology, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America</addr-line>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="aff11">
<addr-line>Department of Biomedical Informatics, Columbia University, New York, New York, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Almeida, Jonas S" sort="Almeida, Jonas S" uniqKey="Almeida J" first="Jonas S." last="Almeida">Jonas S. Almeida</name>
<affiliation>
<nlm:aff id="aff1">
<addr-line>Department of Bioinformatics and Computational Biology, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
</analytic>
<series>
<title level="j">PLoS ONE</title>
<idno type="eISSN">1932-6203</idno>
<imprint>
<date when="2008">2008</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">
<sec>
<title>Background</title>
<p>Data, data everywhere. The diversity and magnitude of the data generated in the Life Sciences defies automated articulation among complementary efforts. The additional need in this field for managing property and access permissions compounds the difficulty very significantly. This is particularly the case when the integration involves multiple domains and disciplines, even more so when it includes clinical and high throughput molecular data.</p>
</sec>
<sec>
<title>Methodology/Principal Findings</title>
<p>The emergence of Semantic Web technologies brings the promise of meaningful interoperation between data and analysis resources. In this report we identify a core model for biomedical Knowledge Engineering applications and demonstrate how this new technology can be used to weave a management model where multiple intertwined data structures can be hosted and managed by multiple authorities in a distributed management infrastructure. Specifically, the demonstration is performed by linking data sources associated with the Lung Cancer SPORE awarded to The University of Texas MDAnderson Cancer Center at Houston and the Southwestern Medical Center at Dallas. A software prototype, available with open source at
<ext-link ext-link-type="uri" xlink:href="http://www.s3db.org">www.s3db.org</ext-link>
, was developed and its proposed design has been made publicly available as an open source instrument for shared, distributed data management.</p>
</sec>
<sec>
<title>Conclusions/Significance</title>
<p>The Semantic Web technologies have the potential to addresses the need for distributed and evolvable representations that are critical for systems Biology and translational biomedical research. As this technology is incorporated into application development we can expect that both general purpose productivity software and domain specific software installed on our personal computers will become increasingly integrated with the relevant remote resources. In this scenario, the acquisition of a new dataset should automatically trigger the delegation of its analysis.</p>
</sec>
</div>
</front>
<back>
<div1 type="bibliography">
<listBibl>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
</listBibl>
</div1>
</back>
</TEI>
<pmc article-type="research-article" xml:lang="EN">
<pmc-dir>properties open_access</pmc-dir>
<front>
<journal-meta>
<journal-id journal-id-type="nlm-ta">PLoS ONE</journal-id>
<journal-id journal-id-type="publisher-id">plos</journal-id>
<journal-id journal-id-type="pmc">plosone</journal-id>
<journal-title>PLoS ONE</journal-title>
<issn pub-type="epub">1932-6203</issn>
<publisher>
<publisher-name>Public Library of Science</publisher-name>
<publisher-loc>San Francisco, USA</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="pmid">18698353</article-id>
<article-id pub-id-type="pmc">2491554</article-id>
<article-id pub-id-type="publisher-id">08-PONE-RA-04070</article-id>
<article-id pub-id-type="doi">10.1371/journal.pone.0002946</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Research Article</subject>
</subj-group>
<subj-group subj-group-type="Discipline">
<subject>Oncology</subject>
<subject>Computational Biology/Systems Biology</subject>
<subject>Computer Science/Information Technology</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>A Semantic Web Management Model for Integrative Biomedical Informatics</article-title>
<alt-title alt-title-type="running-head">Integrative Bioinformatics</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Deus</surname>
<given-names>Helena F.</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Stanislaus</surname>
<given-names>Romesh</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Veiga</surname>
<given-names>Diogo F.</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Behrens</surname>
<given-names>Carmen</given-names>
</name>
<xref ref-type="aff" rid="aff3">
<sup>3</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Wistuba</surname>
<given-names>Ignacio I.</given-names>
</name>
<xref ref-type="aff" rid="aff3">
<sup>3</sup>
</xref>
<xref ref-type="aff" rid="aff4">
<sup>4</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Minna</surname>
<given-names>John D.</given-names>
</name>
<xref ref-type="aff" rid="aff5">
<sup>5</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Garner</surname>
<given-names>Harold R.</given-names>
</name>
<xref ref-type="aff" rid="aff5">
<sup>5</sup>
</xref>
<xref ref-type="aff" rid="aff6">
<sup>6</sup>
</xref>
<xref ref-type="aff" rid="aff7">
<sup>7</sup>
</xref>
<xref ref-type="aff" rid="aff8">
<sup>8</sup>
</xref>
<xref ref-type="aff" rid="aff9">
<sup>9</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Swisher</surname>
<given-names>Stephen G.</given-names>
</name>
<xref ref-type="aff" rid="aff10">
<sup>10</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Roth</surname>
<given-names>Jack A.</given-names>
</name>
<xref ref-type="aff" rid="aff10">
<sup>10</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Correa</surname>
<given-names>Arlene M.</given-names>
</name>
<xref ref-type="aff" rid="aff10">
<sup>10</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Broom</surname>
<given-names>Bradley</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Coombes</surname>
<given-names>Kevin</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Chang</surname>
<given-names>Allen</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Vogel</surname>
<given-names>Lynn H.</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff11">
<sup>11</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Almeida</surname>
<given-names>Jonas S.</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="corresp" rid="cor1">
<sup>*</sup>
</xref>
</contrib>
</contrib-group>
<aff id="aff1">
<label>1</label>
<addr-line>Department of Bioinformatics and Computational Biology, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America</addr-line>
</aff>
<aff id="aff2">
<label>2</label>
<addr-line>Instituto de Tecnologia Química e Biológica, Universidade Nova de Lisboa, Lisboa, Portugal</addr-line>
</aff>
<aff id="aff3">
<label>3</label>
<addr-line>Department of Thoracic/Head and Neck Medical Oncology, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America</addr-line>
</aff>
<aff id="aff4">
<label>4</label>
<addr-line>Department of Pathology, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America</addr-line>
</aff>
<aff id="aff5">
<label>5</label>
<addr-line>Hamon Center for Therapeutic Oncology Research, Simmons Cancer Center, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America</addr-line>
</aff>
<aff id="aff6">
<label>6</label>
<addr-line>Department of Internal Medicine, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America</addr-line>
</aff>
<aff id="aff7">
<label>7</label>
<addr-line>Eugene McDermott Center for Human Growth and Development, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America</addr-line>
</aff>
<aff id="aff8">
<label>8</label>
<addr-line>Center for Biomedical Inventions, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America</addr-line>
</aff>
<aff id="aff9">
<label>9</label>
<addr-line>Department of Biochemistry, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America</addr-line>
</aff>
<aff id="aff10">
<label>10</label>
<addr-line>Department of Thoracic and Cardiovascular Surgery, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America</addr-line>
</aff>
<aff id="aff11">
<label>11</label>
<addr-line>Department of Biomedical Informatics, Columbia University, New York, New York, United States of America</addr-line>
</aff>
<contrib-group>
<contrib contrib-type="editor">
<name>
<surname>Ben-Jacob</surname>
<given-names>Eshel</given-names>
</name>
<role>Editor</role>
<xref ref-type="aff" rid="edit1"></xref>
</contrib>
</contrib-group>
<aff id="edit1">Tel Aviv University, Israel</aff>
<author-notes>
<corresp id="cor1">* E-mail:
<email>jalmeida@mdanderson.org</email>
</corresp>
<fn fn-type="con">
<p>Conceived and designed the experiments: HFD CB IW JSA. Performed the experiments: HFD JSA. Analyzed the data: HFD JSA. Contributed reagents/materials/analysis tools: HFD RS DFV CB JDM HRG SGS JAR AMC BB KC AC LHV JSA. Wrote the paper: JSA.</p>
</fn>
</author-notes>
<pub-date pub-type="collection">
<year>2008</year>
</pub-date>
<pub-date pub-type="epub">
<day>13</day>
<month>8</month>
<year>2008</year>
</pub-date>
<volume>3</volume>
<issue>8</issue>
<elocation-id>e2946</elocation-id>
<history>
<date date-type="received">
<day>25</day>
<month>3</month>
<year>2008</year>
</date>
<date date-type="accepted">
<day>12</day>
<month>7</month>
<year>2008</year>
</date>
</history>
<copyright-statement>Deus et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.</copyright-statement>
<copyright-year>2008</copyright-year>
<abstract>
<sec>
<title>Background</title>
<p>Data, data everywhere. The diversity and magnitude of the data generated in the Life Sciences defies automated articulation among complementary efforts. The additional need in this field for managing property and access permissions compounds the difficulty very significantly. This is particularly the case when the integration involves multiple domains and disciplines, even more so when it includes clinical and high throughput molecular data.</p>
</sec>
<sec>
<title>Methodology/Principal Findings</title>
<p>The emergence of Semantic Web technologies brings the promise of meaningful interoperation between data and analysis resources. In this report we identify a core model for biomedical Knowledge Engineering applications and demonstrate how this new technology can be used to weave a management model where multiple intertwined data structures can be hosted and managed by multiple authorities in a distributed management infrastructure. Specifically, the demonstration is performed by linking data sources associated with the Lung Cancer SPORE awarded to The University of Texas MDAnderson Cancer Center at Houston and the Southwestern Medical Center at Dallas. A software prototype, available with open source at
<ext-link ext-link-type="uri" xlink:href="http://www.s3db.org">www.s3db.org</ext-link>
, was developed and its proposed design has been made publicly available as an open source instrument for shared, distributed data management.</p>
</sec>
<sec>
<title>Conclusions/Significance</title>
<p>The Semantic Web technologies have the potential to addresses the need for distributed and evolvable representations that are critical for systems Biology and translational biomedical research. As this technology is incorporated into application development we can expect that both general purpose productivity software and domain specific software installed on our personal computers will become increasingly integrated with the relevant remote resources. In this scenario, the acquisition of a new dataset should automatically trigger the delegation of its analysis.</p>
</sec>
</abstract>
<counts>
<page-count count="10"></page-count>
</counts>
</article-meta>
</front>
<body>
<sec id="s1">
<title>Introduction</title>
<sec id="s1a">
<title>Data management and analysis for the life sciences</title>
<p>“The laws of Nature are written in the language of mathematics” famously said Galileo. However, in recent years efforts to analyze the increasing amount and diversity of data in the Life Sciences has been correspondingly constrained not so much by our ability to read it as by the challenge of organizing it. The urgency of this task and the reward of even partial success in its accomplishment have caused the interoperability between diverse digital representations to take center stage
<xref ref-type="bibr" rid="pone.0002946-Blake1">[1]</xref>
<xref ref-type="bibr" rid="pone.0002946-Hendler1">[5]</xref>
. Presently, for those in the Life Sciences enticed by Galileo's pronouncement, the effort of collecting data is no longer focused solely on field/bench work. Instead, it often consists of painfully squeezing the pieces of the systemic puzzle from the digital media where the raw data is held hostage
<xref ref-type="bibr" rid="pone.0002946-Wiley1">[6]</xref>
. It is only then that a comprehensive representation amenable to mathematical modeling really becomes available
<xref ref-type="bibr" rid="pone.0002946-Wass1">[7]</xref>
. This is not a preoccupation exclusive to the Life Sciences. Integration of software applications is also the driving force behind new information management systems architectures that seek to eliminate the boundaries to interoperability between data and services. This preoccupation indeed underlies the emergence of service oriented architectures
<xref ref-type="bibr" rid="pone.0002946-Foster1">[8]</xref>
<xref ref-type="bibr" rid="pone.0002946-Bridges1">[11]</xref>
, even more so in its event driven dynamic generalization
<xref ref-type="bibr" rid="pone.0002946-Gomadam1">[12]</xref>
. It also underlies the development of novel approaches to software deployment (
<xref ref-type="fig" rid="pone-0002946-g001">Figure 1</xref>
) that juggle data structures between server and client applications. Presently, a particularly popular design pattern is the usage-centric Web 2.0
<xref ref-type="bibr" rid="pone.0002946-Musser1">[13]</xref>
,
<xref ref-type="bibr" rid="pone.0002946-KamelBoulos1">[14]</xref>
which seeks a delicate balance in the distribution of tasks between client and server in order to diminish the perception of a distinction between local and remote computation.</p>
<fig id="pone-0002946-g001" position="float">
<object-id pub-id-type="doi">10.1371/journal.pone.0002946.g001</object-id>
<label>Figure 1</label>
<caption>
<title>Three generations of design patterns for web-based applications.</title>
<p>The original design (“1.0”) consists of collections of hypertext documents that are syntactically (dashed lines) interoperable (traversing between them by clicking on the links), regardless of the domain content. The user centric web 2.0 applications use internal representations of the external data structures. This representation is asynchronously updated from the reference resources which are now free to have a specialized interoperation between domain contents. An example of this approach is that followed by AJAX-based interfaces. Finally, the ongoing emergence of the semantic web promises to produce service oriented systems that are semantically interoperable such that the interface application reacts to domains of knowledge specifically. At this level all applications tend to be web-interoperable with peer-to-peer architectures complementing the client-server design of w1.0 and w2.0.</p>
</caption>
<graphic xlink:href="pone.0002946.g001"></graphic>
</fig>
<p>Semantic web technologies
<xref ref-type="bibr" rid="pone.0002946-Ruttenberg1">[3]</xref>
,
<xref ref-type="bibr" rid="pone.0002946-BernersLee1">[15]</xref>
<xref ref-type="bibr" rid="pone.0002946-Feigenbaum1">[21]</xref>
represent the latest installment of web technology development. In what is being unimaginatively designated as Web 3.0
<xref ref-type="bibr" rid="pone.0002946-Borland1">[22]</xref>
,
<xref ref-type="bibr" rid="pone.0002946-Green1">[23]</xref>
, a software development design pattern is proposed where the interoperability boundaries between data structures, not just between the systems that produce them, is set to disappear. The defining characteristic of this environment is that one can retrieve data and information by specifying their desired properties instead of explicitly (syntactically) specifying their physical location. The desirability of this design can clearly be seen in systems in which clinical records are matched with high throughput molecular profiles, each of which stem from very distinct environments and are often the object of very different access management regulations.</p>
</sec>
<sec id="s1b">
<title>Inadequacy of conventional systems for Translational Research</title>
<p>On the one hand, high throughput molecular Biology core facilities and improved medical record systems are able to document individual data elements with increasing detail. On the other hand, researchers producing the data and models that critically advance the understanding of biological phenomena are increasingly separated from their use by the specialization inherent in each of these activities. Consequently, bridging between the information systems of basic research and their clinical application becomes a necessary foundation for any translational exploits of new biomedical knowledge
<xref ref-type="bibr" rid="pone.0002946-Ruttenberg1">[3]</xref>
,
<xref ref-type="bibr" rid="pone.0002946-Almeida1">[24]</xref>
. The alternative, using conventional data representations where the data models cannot evolve, typically requires the biomedical community to complement the data representation with a clandestine and inefficient flurry of datasets exchanged as spreadsheets through email.</p>
</sec>
<sec id="s1c">
<title>Foundations for a novel solution</title>
<p>As others before us
<xref ref-type="bibr" rid="pone.0002946-Hendler1">[5]</xref>
, we have argued previously for the use of semantic web formats as the foundation for developing more flexible and articulated data management and analytical bioinformatics infrastructures
<xref ref-type="bibr" rid="pone.0002946-Wang1">[20]</xref>
. A software prototype was then produced following those technical specifications to provide a flexible web-based data sharing environment within which a management model can be identified
<xref ref-type="bibr" rid="pone.0002946-Almeida1">[24]</xref>
. In this third report we describe the resulting core model supporting distributed and portable data representation and management. In practice this translates into a small application deployed in multiple locations rather than a large infrastructure at a single central location. The open source prototype application described here has been made public
<xref ref-type="bibr" rid="pone.0002946-s3db1">[25]</xref>
. All deployments support a common data management and analysis infrastructure with no constraints on the actual data structures described.</p>
</sec>
<sec id="s1d">
<title>A very brief history of data</title>
<p>The formatting of data sets as portable text mirrors the same three stages described for web-based applications in
<xref ref-type="fig" rid="pone-0002946-g001">Figure 1</xref>
. As described in
<xref ref-type="fig" rid="pone-0002946-g002">Figure 2</xref>
, data representation has been evolving from tabular text formats (“flat files”), to self described hierarchical trees of tags (extended markup languages, XML), and finally to the subject-predicate-object triples of Resource Description Framework (RDF)
<xref ref-type="bibr" rid="pone.0002946-Robu1">[26]</xref>
. We have been active participants in these transformations
<xref ref-type="bibr" rid="pone.0002946-Almeida1">[24]</xref>
,
<xref ref-type="bibr" rid="pone.0002946-Silva1">[27]</xref>
,
<xref ref-type="bibr" rid="pone.0002946-Stanislaus1">[28]</xref>
, and like many others concluded that in order to bridge the fragmentation between distinct data structures, we needed to break down the data structures themselves
<xref ref-type="bibr" rid="pone.0002946-Wang1">[20]</xref>
, that is, to reduce the interoperable elements to RDF triples
<xref ref-type="bibr" rid="pone.0002946-IvanHerman1">[29]</xref>
. In addition to its directed labeled graph nature, RDF formats
<xref ref-type="bibr" rid="pone.0002946-IvanHerman1">[29]</xref>
have a second defining characteristic: each of the three elements has a Uniform Resource Identifier (URI), which, for the purposes of this very brief introduction, can be thought as a unique locator capable of directing an application to the desired content or service. It is also interesting to note that at each level of this three-stage progression (
<xref ref-type="fig" rid="pone-0002946-g002">Figure 2</xref>
) we find data elements that have “matured”, that is, that present a stable representation which remains useful to specialized tools. When this happens we find that those elements remain convenient representations preserved whole within more fragmented formats. For example, we find no advantages in breaking down mzXML
<xref ref-type="bibr" rid="pone.0002946-Pedrioli1">[30]</xref>
representations of mass spectrometry based proteomics data. Instead, these data structures are used as objects of regular RDF triples. The mzXML proteomics data structure offers an paradigmatic illustration of the evolution of ontologies as efforts to standardize data formats
<xref ref-type="bibr" rid="pone.0002946-Orchard1">[31]</xref>
. It would be interesting to understand if the lengthy effort headed by the Human Proteomics Organization, HUPO, to integrate it reflects the difficulty to justify reforming
<xref ref-type="bibr" rid="pone.0002946-Orchard2">[32]</xref>
a representation that remains useful
<xref ref-type="bibr" rid="pone.0002946-Klimek1">[33]</xref>
.</p>
<fig id="pone-0002946-g002" position="float">
<object-id pub-id-type="doi">10.1371/journal.pone.0002946.g002</object-id>
<label>Figure 2</label>
<caption>
<title>Evolution of formats for individual datasets.</title>
<p>Hexagons, rectangles and small circles indicate data elements, respectively, attributes, their values, and relations. First, flat file formats such as fasta or the GeneBank data model were proposed to collect attribute-value pairs about an individual data entry. The use of tagging by extended markup languages (XML) allowed for the embedding of additional detail and further definition of the nature of the hierarchical structure between data elements. More recently, the resource description framework (RDF) further generalized the XML tree structure into that of a network where the relationship between resources (nodes) is a resource itself. Furthermore, the referencing of each resource by a unique identifier (URI) implies that the data elements can be distributed between distinct documents or even locations.</p>
</caption>
<graphic xlink:href="pone.0002946.g002"></graphic>
</fig>
<p>The advancement towards a more abstract, more global and more flexible representation of data is by no means unique to the Life Sciences. However, because of the exceptional diversity of that domain's fluidity, the Life Sciences are where the Semantic Web may find its most interesting challenge and as well, hopefully, where it will find its most compelling validation
<xref ref-type="bibr" rid="pone.0002946-BernersLee1">[15]</xref>
.</p>
</sec>
<sec id="s1e">
<title>Mathematics for data models</title>
<p>It has not been lost to the swelling ranks of Systems Biologists that the reduction of data interoperability to the ternary representation of
<italic>relations</italic>
<xref ref-type="bibr" rid="pone.0002946-AhoJDU1">[34]</xref>
brings the topic solidly back to the Galilean fold of Mathematics as a language. The reduction of data structures to globally referenced dyadic relations (functions of two variables), such as those of the Entity-Relationship (ER) model, brings in rich feeds from the vein of Logic. In the process, and beyond Galileo's horizon, assigning a description logic value
<xref ref-type="bibr" rid="pone.0002946-Aranguren1">[35]</xref>
<xref ref-type="bibr" rid="pone.0002946-Zhang1">[37]</xref>
to some RDF predicates (for example, specifying that something is part of or, on the contrary, is distinct from something else) allows the definition of procedures. This further elaboration of RDF has the potential to transform data management into an application of knowledge engineering, and more specifically of artificial intelligence (AI). This reclassification reflects the dilution of the distinction between data management and data analysis that is apparent even in an introduction as brief as this one. Another clear indication of this transformation is that it re-ignites the opposition between data-driven and rule-driven designs for semantic web representation
<xref ref-type="bibr" rid="pone.0002946-Miller1">[38]</xref>
<xref ref-type="bibr" rid="pone.0002946-Soldatova1">[42]</xref>
, a recurring topic in AI. It is important to note that the management model proposed here is orthogonal to that discussion. Its purpose is solely to enable the distribution
<xref ref-type="bibr" rid="pone.0002946-Merelli1">[43]</xref>
of a semantic data management system that can withstand changes in the domain of discourse, independently of the rationale for the changes themselves.</p>
</sec>
<sec id="s1f">
<title>Software engineering for Bioinformatics</title>
<p>This overview of modern trends in integrative data management is as significant for what is covered as for what is missed – what management models should be used to control the generation and transformation of the data model? It is interesting to note that the management models that associate access permissions with the population of a data model have traditionally been the province of software engineering. This may at first appear to be a reasonable solution. Since instances of a data structure in conventional databases are contained in a defined digital media, permission management is an issue of access to the system itself. However, this ceases to be the case with the semantic web RDF triples because they weave data structures that can expand indefinitely between multiple machines. Presently, the formalisms to manage data in the semantic web realm are still in the early stages of development, notably by the World Wide Web consortium (W3C) SKOS initiative (Simple Knowledge Organization Systems). This initiative recently issued a call
<xref ref-type="bibr" rid="pone.0002946-AntoineIsaac1">[44]</xref>
for user cases where good design criteria can be abstracted and recommendations be issued on standard formats. As expected
<xref ref-type="bibr" rid="pone.0002946-BernersLee1">[15]</xref>
, the Life Sciences present some of the most convoluted user cases in which a multitude of naïve domain experts effectively need to maintain data structures that are as diverse and fluid as the experimental evidence they describe
<xref ref-type="bibr" rid="pone.0002946-Almeida1">[24]</xref>
.</p>
</sec>
</sec>
<sec sec-type="materials|methods" id="s2">
<title>Materials and Methods</title>
<p>The most extreme combination of heterogeneous data structures and the need for very tight control of access is arguably found in applications to Personalized Medicine, such as those emerging for cancer treatment and prevention. At the Univ. Texas MDAnderson Cancer Center at Houston and the Southwestern Medical Center at Dallas we have deployed the S3DB semantic web prototype to engage the community of translational researchers of the University of Texas Lung Cancer SPORE
<xref ref-type="bibr" rid="pone.0002946-The1">[45]</xref>
in identifying a suitable management model. This exercise involved over one hundred researchers and close to half a million data entries, of clinical and molecular nature. Right at its onset integrating access permissions in the definition of the data models was identified as an absolute necessity by the participants, as anticipated by the SKOS group. As a consequence, a data driven “core model”, S3DBcore, that accommodates management specifications as part of data representation, was developed and is described here. The software used is provided with open source at
<ext-link ext-link-type="uri" xlink:href="http://www.s3db.org">www.s3db.org</ext-link>
. Only open source tools were used in development of this web-based web-service: PHP 5 was used for server side programming and both MySQL and PostgreSQL were tested as the relational backbone for PHP's database abstraction class. At the same location detailed documentation about S3DB's Application Programming Interface (API) is also provided.</p>
</sec>
<sec id="s3">
<title>Results</title>
<sec id="s3a">
<title>Units of representation</title>
<p>The most fundamental representation of data is that of attribute-value (AV) pairs, for example, . The generic data management infrastructure proposed here can be described as that of encapsulating AV pairs through the use of another fundamental unit of representation, the Entity-Relation-Entity model (ER), such as . Each entity can then be associated with one or more AV pairs using the entity-attribute-value EAV model, for example, . Fast forwarding three decades of computer science and knowledge engineering and we reach the present day development of a representation framework where each element of the triple is a resource with a unique identifier, with the third element of the triple having the option of being a literal, that is, of having an actual value rather than a placeholder. This single sentence very broadly describes the Resource Description Framework (RDF) which is at the foundation of the ongoing development of the Semantic Web
<xref ref-type="bibr" rid="pone.0002946-IvanHerman1">[29]</xref>
, just like hypertext (HTML) was the enabling format for the original Web. It is important to note that the evolution of representation formats typically takes place through generalization of the existing ones. For example, extended markup language-based files (XML) are still text files, and RDF documents are still XML structures (
<xref ref-type="fig" rid="pone-0002946-g002">Figure 2</xref>
). As noted earlier, this succession is closely paralleled by refinements of software design patterns (
<xref ref-type="fig" rid="pone-0002946-g001">Figure 1</xref>
). This reification process is often driven by the necessity to maintain increasingly complex data at a simpler level of representation where they remain intelligible for those who generate and use the data. Accordingly, in the next section triple relations will be weaved around the AV pair with that exact purpose: to produce a core model that is simple enough to be usable by naïve users that need to interact with heterogeneous data hosted in a variety of machines (
<xref ref-type="fig" rid="pone-0002946-g003">Figure 3</xref>
), yet sophisticated enough to support automated implementation.</p>
<fig id="pone-0002946-g003" position="float">
<object-id pub-id-type="doi">10.1371/journal.pone.0002946.g003</object-id>
<label>Figure 3</label>
<caption>
<title>Illustration of the desirable functionality: distinct users, with identities (solid icon) managed in distinct S3DB deployments (circular compartments), which they control separately, share a distributed and overlapping data structure (arrows between symbols) that they also manage independently: some data elements are shared (mixed color symbols) others are not.</title>
<p>This will require the identity verification to propagate between deployments peer-to-peer (P2P, dotted lines), including to deployments where neither user maintains an identity (dotted circular compartment). This is in contrast with the conventional approach of having distinct users manage insular deployments with permissions managed at the access point level.</p>
</caption>
<graphic xlink:href="pone.0002946.g003"></graphic>
</fig>
</sec>
<sec id="s3b">
<title>Weaving a distributed information management system</title>
<p>The objective of this exercise is to produce a data management model that can be distributed through multiple deployments of the Database Management Systems (DBMS) which implies a mechanism for migration access permissions. Simultaneously, this model should allow different domain experts to evolve their own data models without compromising pre-existing data. Achieving these two goals simultaneously can only be realized if the proposed distributed system is composed of node applications that are not only syntactically interoperable, but also semantically transparent. For a discussion of the absolute need for evolvable data models in the Life Sciences see
<xref ref-type="bibr" rid="pone.0002946-Almeida1">[24]</xref>
. That report is also where the DBMS prototype, S3DB, was first introduced (version 1.0). Finally, the Application Programming Interface (API) needs to support the semantic interoperability in a way that spans multiple deployments (
<xref ref-type="fig" rid="pone-0002946-g003">Figure 3</xref>
). The data model developed to achieve these goals is described in
<xref ref-type="fig" rid="pone-0002946-g004">Figure 4</xref>
.</p>
<fig id="pone-0002946-g004" position="float">
<object-id pub-id-type="doi">10.1371/journal.pone.0002946.g004</object-id>
<label>Figure 4</label>
<caption>
<title>Core model developed for S3DB (supported by version 3.0 onwards).</title>
<p>This diagram can be read starting from the most fundamental data unit, the Attribute-Value pair (filled hexagonal and square symbols). Each element of the pair is object of two distinct triples, one describing the domain of discourse, the
<italic>Rules</italic>
, and the other made of
<italic>Statements</italic>
where that domain is populated to instantiate relationships between entities. The latter includes the actual Values. Surrounding these two nuclear collection of triples, is the resolution of
<italic>Collection</italic>
and its instantiation as
<italic>Item</italic>
that define the relationship between the individual elements of
<italic>Rules</italic>
and
<italic>Statements</italic>
. The resulting structure is then organized in
<italic>Projects</italic>
in such a way that the domain of discourse can nevertheless be shared with other
<italic>Projects</italic>
, in the same or in a distinct deployment of S3DB. Finally, a propagation of user permissions (dashed line) is defined such that the distribution of the data structures can be traced. See text for a more detailed description.</p>
</caption>
<graphic xlink:href="pone.0002946.g004"></graphic>
</fig>
</sec>
<sec id="s3c">
<title>A Core data management model that is universal and distributed</title>
<p>The directed labeled graph nature of RDF triples, coupled with their reliance on unique identifiers (as URIs), enables data structures to be scattered between multiple machines while permitting different domains of discourse to use the same data elements differently. However, those two characteristics alone do not address the management issue: how to decide when, where and what can be viewed, inserted, deleted and by whom. It is clear that the conventional approach of dealing with permissions at the level of access to the data store is not appropriate to the Life Sciences
<xref ref-type="bibr" rid="pone.0002946-Hendler1">[5]</xref>
where multiple disciplines and facilities are contributing to a partially overlapping representation of the system. It cannot be overstated that this is particularly the case when the system is designed to host clinical data. To solve this problem we have developed a core data model where membership and permission can migrate with the data. We have also developed a prototype application to support such a distributed data management system (
<xref ref-type="fig" rid="pone-0002946-g003">Figure 3</xref>
), which we make freely available with open source
<xref ref-type="bibr" rid="pone.0002946-s3db1">[25]</xref>
.</p>
</sec>
</sec>
<sec id="s4">
<title>Discussion</title>
<p>The proposed core model is detailed in
<xref ref-type="fig" rid="pone-0002946-g004">Figure 4</xref>
and will be now discussed in more detail. This diagram is best understood chronologically, starting with the very basic and nuclear collection of attribute-value pairs and then proceeding to their encapsulation by three consecutive layers – the semantic schema, assignment of membership and, finally the permission propagation.</p>
<sec id="s4a">
<title>Schema</title>
<p>The first layer of encapsulation is the definition and use of a domain of discourse (elements in red in
<xref ref-type="fig" rid="pone-0002946-g004">Figure 4</xref>
). This was achieved in typical RDF fashion by defining two sets of triples, one defining a set of rules and the second, the statements, using them. As discussed elsewhere
<xref ref-type="bibr" rid="pone.0002946-Almeida1">[24]</xref>
, there are good reasons to equip those who generate the data with the tools to define and manage their own domains of knowledge. The ensuing incubation of experimental ontologies was facilitated by an indexing scheme that mimics the use of subject, verb, object in natural languages. This indexing is achieved by recognizing
<italic>Collections</italic>
and the
<italic>Items</italic>
they contain as elements of the two sets of nuclear triples (
<italic>Rules</italic>
and
<italic>Statements</italic>
).</p>
</sec>
<sec id="s4b">
<title>Organization</title>
<p>The second layer of formal encapsulation corresponds to the assignment of membership. This process extends the designation of
<italic>Items</italic>
in the previous level, by assigning the
<italic>Collections</italic>
that contain them and
<italic>Rules</italic>
that relate them to
<italic>Projects</italic>
that are hosted by individual
<italic>Deployments</italic>
of the prototype S3DB application. In the diagram, the membership dependencies are accordingly labeled as
<italic>rdfs</italic>
<italic>subClassOf</italic>
<xref ref-type="bibr" rid="pone.0002946-IvanHerman1">[29]</xref>
. Note that memberships can also be established with remote resources (dotted lines in
<xref ref-type="fig" rid="pone-0002946-g004">Figure 4</xref>
), that is, between resources of distinct deployments. Defining remote memberships presents little dificulty in the RDF format because each element of the triple is refered to by a universal identifier (a URI), unique accross deployments. On the other hand, managing permission to access the remote content is a much harder problem, which we will address by supporting migration of identity. The alternative solution to migration of identities is migrating the contents along membership lines. However, that was, unsurprisingly, found to be objectionable by users with a special attention to privacy and confidentiality issues. It would also present some logistic challenges for larger datasets. In contrast, the definition of a temporary, portable, identity key or token needed for migration of identity is typically incommensurably smaller than the content it permits access.</p>
</sec>
<sec id="s4c">
<title>Permissions</title>
<p>The final layer of encapsulation defines
<italic>Users</italic>
and
<italic>Groups</italic>
within
<italic>Deployments</italic>
and controls their permissions to the data (blue in
<xref ref-type="fig" rid="pone-0002946-g004">Figure 4</xref>
). As with rest of the core model, the identification of proposed management of permissions was directed by user cases. That exercise determined that user identities should be maintained by specific Deployments of S3DB but also that they may be temporarily propagated to other deployments. That solution, illustrated in
<xref ref-type="fig" rid="pone-0002946-g003">Figure 3</xref>
, allows one application to request the verification of an identity in a remote deployment, which then verifies it in the identity's source deployment and assigns it a temporary key or token, say, for one hour. All that is propagated is a unique alphanumeric string, the temporary token, paired with the user's URI. No other user information is exchanged. As a consequence, for the remainder of the hour, the identification will be asynchronously available in both deployments, which enables the solution described in
<xref ref-type="fig" rid="pone-0002946-g003">Figure 3</xref>
, where a single interface can manipulate multiple components of a large, distributed systems level representation of the target data. Interestingly, because the multiple deployments of S3DB are accessed independently by multiple deployments of various applications, the mode of syntactic interoperation is
<italic>de facto</italic>
peer-to-peer. The propagation of permissions flows in the sequence indicated by the dashed blue lines in
<xref ref-type="fig" rid="pone-0002946-g004">Figure 4</xref>
. When a permission level is not defined for a resource, say for a
<italic>Item</italic>
, then it is borrowed from the parent entity, in this example, from the corresponding
<italic>Collection</italic>
. When there is a conflict then the most restrictive option is selected. For example a conflict can arise for a
<italic>Statemen</italic>
t which inherits permissions from both
<italic>Rules</italic>
and
<italic>Collections</italic>
. Another frequent example happens when a user belongs to multiple groups with distinct permissions to a common target resource.</p>
<p>Permission management is a particularly thorny issue in life sciences applications because of the management of multiple data provenances. Relying on distributed hosting of the complementary data sources compounds the management of multiple permissions even further because it also involves multiple permission management systems. Finally, permission management is often treated
<italic>ad hoc</italic>
by the management systems themselves where it is resolved as access permission to the system as a whole rather than being specified in the data representation. Because each source often describes a specialized domain, it is guarded with understandable zeal. We argue here that propagation of permissions is the only practical solution to determine how much information is to be revealed in different contexts. Consequently, whereas the relationships between the 8 S3DB entities (oval symbols in
<xref ref-type="fig" rid="pone-0002946-g004">Figure 4</xref>
) are defined using RDF schema
<xref ref-type="bibr" rid="pone.0002946-Robu1">[26]</xref>
(RDFS), and their tagging uses the well established Dublin Core
<xref ref-type="bibr" rid="pone.0002946-Baker1">[46]</xref>
, the permission propagation layer is a novel component of the proposed management model. In order to respond to widest range of the user cases driving model identification, the propagation was defined by three parameters, view, edit, and use. Each of these parameters can have three values, 0, 1 or 2, corresponding to, respectively, no permission, permission only on entries submitted by the user, and permission on all entries of that resource.
<italic>Users</italic>
and
<italic>Groups</italic>
(blue entities in
<xref ref-type="fig" rid="pone-0002946-g004">Figure 4</xref>
) can have these three types of permissions on
<italic>Projects</italic>
,
<italic>Collections</italic>
,
<italic>Rules</italic>
,
<italic>Items</italic>
and
<italic>Statements</italic>
. Among those five entities, additional permissions can be issued, for example, a
<italic>Project</italic>
may have specific permissions on
<italic>Collections</italic>
and
<italic>Rules</italic>
.
<italic>Collections</italic>
may have further permissions on their
<italic>Items</italic>
. The same reasoning, in reverse, establishes what should happen when permission is not specifically defined for a given entity. For example, for a
<italic>Statement</italic>
the permission would be inherited from the parent entities,
<italic>Item</italic>
and
<italic>Rule</italic>
. If those two entities did not specify specific permissions for the target statement, then those are searched upstream (
<xref ref-type="fig" rid="pone-0002946-g004">Figure 4</xref>
) until reaching the
<italic>Project</italic>
or even
<italic>Deployment</italic>
level. According to this mechanism, the conventional role of a system administrator corresponds to a user with permissions 222 at Deployment level. It is worth recalling that propagation of permissions between data elements in distinct S3DB deployments happens through the sharing the membership in external
<italic>Collections</italic>
and
<italic>Rules</italic>
(dotted lines), not through extending the permission inheritance beyond the local deployment. This is not a behavior explicitly imposed on the distributed deployment; it emerges naturally from the fact that
<italic>Rule</italic>
sharing specifies a permission which, remote or local, interrupts the permission inheritance. In practice both the user of the interface and the programmer using the API can ignore the intricacies of this process, which was identified to be the intuitive, sensible, propagation of permissions that we found naïve users to expect in user-case exercises.</p>
</sec>
<sec id="s4d">
<title>Portability</title>
<p>This discussion would not be complete without unveiling some defining technical details about how portability is addressed by this design. So far we have been loosely equating “unique identifiers” with the use of Uniform Resource Identifiers (URI). More specifically, the right hand side of
<xref ref-type="fig" rid="pone-0002946-g004">Figure 4</xref>
includes a list of eight types of locally unique identifiers that can be assigned to the same number of entities that define the core model. It is easy to see how this indexing can be made globally unique by concatenating them with the
<italic>Deployment</italic>
's ID, itself unique, for example using its URL. Indeed this is what is supported by the accompanying prototype software, with a generalizing twist with very significant consequences:
<italic>Did</italic>
can either be the deployment address or anything that indicates what that address is. For example, it can indicate an HTML document or even an entry in a database where this address is specified. More interestingly, it can also be a simple alphanumeric code that is maintained at
<ext-link ext-link-type="uri" xlink:href="http://www.s3db.org">www.s3db.org</ext-link>
in association with the actual URL of the target deployment. The flexible global indexing achieved by either scenario allows the manipulation of entire databases management systems as portable data structures. It also allows for novel management solutions through manipulation of the DBMS logical structure. For example, defining a
<italic>Did</italic>
as ‘localhost’ would have the effect of severing all logical connections to any usage outside that of the server machine. None of these more fanciful configurations were validated with the Lung Cancer SPORE user community even if they are fully supported by the accompanying prototype. Nevertheless, its possibility enables some interesting scenarios for data management and indeed for Knowledge Engineering.</p>
</sec>
<sec id="s4e">
<title>User Interfaces</title>
<p>The ultimate test for a data management model is the intuitiveness of what it communicates through the user interface
<xref ref-type="bibr" rid="pone.0002946-Good1">[47]</xref>
,
<xref ref-type="bibr" rid="pone.0002946-Neumann2">[48]</xref>
. The structure of S3DBcore offers some useful guidelines in this regard. The experimental values are represented in a combination of
<italic>Items</italic>
and
<italic>Statements</italic>
(
<xref ref-type="fig" rid="pone-0002946-g004">Figure 4</xref>
). There are two routes to that endpoint. One possibility is to take the document management approach of navigating from
<italic>Projects</italic>
to
<italic>Collections</italic>
, then to their
<italic>Items</italic>
and finally to the
<italic>Statements</italic>
. This is the scenario that will suit data centric activities such as querying and updating existing data or inserting new data. A real, working example of how that interface may look is depicted in
<xref ref-type="fig" rid="pone-0002946-g005">Figure 5-B</xref>
, which details an intermediate step between selecting a
<italic>Project</italic>
(
<xref ref-type="fig" rid="pone-0002946-g005">Figure 5-B</xref>
), and identifying and manipulating an individual entry made of multiple statements about an
<italic>Item</italic>
(
<xref ref-type="fig" rid="pone-0002946-g005">Fig. 5-D</xref>
). The mechanism used to distribute rich graphics applications and their interoperation with S3DB is detailed in
<xref ref-type="fig" rid="pone-0002946-g006">Figure 6</xref>
. Another possibility is to navigate from the
<italic>Project</italic>
to the collection of
<italic>Rules</italic>
, most likely represented as a directed labeled graph network, and then browse the
<italic>Statements</italic>
as an instantiation of the
<italic>Rules</italic>
, exemplified by another snapshot of a working application,
<xref ref-type="fig" rid="pone-0002946-g005">Figure 5-A</xref>
. This application is the standard web-based user interface distributed with S3DB package
<xref ref-type="bibr" rid="pone.0002946-s3db1">[25]</xref>
. Unlike the bookkeeping approach of the document centric model (
<xref ref-type="fig" rid="pone-0002946-g005">Figure 5-B</xref>
), the rule centric view (
<xref ref-type="fig" rid="pone-0002946-g005">Figure 5-A</xref>
) is most suitable to investigate the relationship between different parts of the domain of knowledge and to incubate
<xref ref-type="bibr" rid="pone.0002946-Almeida1">[24]</xref>
a more comprehensive and exact version of the ontology. However, and this may be the most relevant point, since S3DB's API returns query results as RDF, any RDF browser can be used to explore it. This point is illustrated in
<xref ref-type="fig" rid="pone-0002946-g005">figures 5E and F</xref>
where, respectively, a commercial semantic web knowledge explorer (Sentient, IO-Informatics Inc) and Welkin, a popular RDF browser developed at the Massachusetts Institute of Technology, are use to visualize the same S3DB Lung Cancer project depicted in
<xref ref-type="fig" rid="pone-0002946-g005">Figs. 5A and B</xref>
. Whereas the former is designed as a tool for knowledge discovery, the latter offers a global view of distributed data structures. The value of the core model described in
<xref ref-type="fig" rid="pone-0002946-g004">Figure 4</xref>
as a management template for individual data elements will be apparent upon close inspection of
<xref ref-type="fig" rid="pone-0002946-g005">Fig. 5E</xref>
. The different colors, automatically set by Sentient KE, distinguish the core model (pink), where permission management takes place, from the instantiation of their entities, in yellow. These two layers describe the context for individual entries specifying the age at surgery of 5 patients. The same display includes access to molecular work on tumor samples, in this case using tissue arrays and DNA extracts. The distinct domains are therefore integrated in an interoperable framework in spite of the fact that they are maintained, and regularly edited, by different communities of researchers. As a consequence, the database can evolve with the diversification of data gathering methodologies and with the advancement in understanding the underlying processes. In
<xref ref-type="fig" rid="pone-0002946-g005">figure 5F</xref>
it can be seen that MIT's Welkin RDF visualizer easily distinguished the query results as the interplay of 4 collections of 380
<italic>Statements</italic>
about 41
<italic>Items</italic>
from 5
<italic>Collections</italic>
related by 40
<italic>Rules</italic>
. For comparison, see
<xref ref-type="fig" rid="pone-0002946-g005">Figure 5E</xref>
where one of its
<italic>Statements</italic>
is labeled (describing that Age of patient providing pathology sample #90 with Clinical Information #I3646 is 90 years old), along with the parent entities. For examples of other
<italic>Statements</italic>
about the same
<italic>Item</italic>
see
<xref ref-type="fig" rid="pone-0002946-g005">Fig. 5D</xref>
. For examples of other statements of the same nature (about the same domain), see 4 statements listed at the bottom-right of
<xref ref-type="fig" rid="pone-0002946-g005">Figure 5E</xref>
.</p>
<fig id="pone-0002946-g005" position="float">
<object-id pub-id-type="doi">10.1371/journal.pone.0002946.g005</object-id>
<label>Figure 5</label>
<caption>
<title>Snapshots of interfaces using S3DB's API (Application Programming Interface).</title>
<p>These applications exemplify why the semantic web designs can be particularly effective at enabling generic tools to assist users in exploring data documenting very specific and very complex relationships. Snapshot A was taken from S3DB's web interface, which is included in the downloadable package
<xref ref-type="bibr" rid="pone.0002946-s3db1">[25]</xref>
. This interface was developed to assist in managing the database model and, therefore, is centered on the visualization and manipulation of the domain of discourse, its
<italic>Collections of Items</italic>
and
<italic>Rules</italic>
defining the documentation of their relations. The application depicted on snapshots B–D describe a document management tool S3DBdoc, freely available as a Bioinformatics Station module (see
<xref ref-type="fig" rid="pone-0002946-g006">Figure 6</xref>
). The navigation is performed starting from the Project (C), then to the
<italic>Collection</italic>
(B) and finally to the editing of the
<italic>Statements</italic>
about an
<italic>Item</italic>
(D). The snapshot B illustrates an intermediate step in the navigation where the list of
<italic>Items</italic>
(in this case samples assayed by tissue arrays, for which there is clinical information about the donor) is being trimmed according to the properties of a distant entity, Age at Diagnosis, which is a property of the Clinical Information
<italic>Collection</italic>
associated with the sample that originated the array results. This interaction would have been difficult and computationally intensive to manage using a relational architecture. The RDF formatted query result produced by the API was also visualized using a commercial tool, Sentient Knowledge Explorer (IO-Informatics Inc), shown in snapshot E, and by Welkin, developed by the digital inter-operability SIMILE project at the Massachusetts Institute of Technology. See text for discussion of graphic representations by these tools. To protect patient confidentiality some values in snapshots B and D are scrambled and numeric sample and patient identifiers elsewhere are altered.</p>
</caption>
<graphic xlink:href="pone.0002946.g005"></graphic>
</fig>
<fig id="pone-0002946-g006" position="float">
<object-id pub-id-type="doi">10.1371/journal.pone.0002946.g006</object-id>
<label>Figure 6</label>
<caption>
<title>Prototype infrastructure for integrated data management and analysis being tested by the Univ. Texas Lung cancer SPORE.</title>
<p>The system is based on two components, a network of universal semantic database servers and a code distribution server that delivers applications in response to the use of ontology. Four distinct user cases are represented, a–d, which rely on a combination of download of interpreted code (green arrows) or direct access to web-based graphic user interfaces or web-based API (blue arrows, in the latter case using Representational State Transfer, REST). The dotted lines represent regular updating of the application, propagating improvements in the application code.</p>
</caption>
<graphic xlink:href="pone.0002946.g006"></graphic>
</fig>
</sec>
<sec id="s4f">
<title>Conclusion</title>
<p>The Semantic Web
<xref ref-type="bibr" rid="pone.0002946-BernersLee1">[15]</xref>
technologies have the potential to addresses the need for distributed and evolvable representations that are critical for systems Biology and translational biomedical research. As this technology is incorporated into application development we can expect that both general purpose productivity software and domain specific software installed on our personal computers will become increasingly integrated with the relevant remote resources. In this scenario, the acquisition of a new dataset should automatically trigger the delegation of its analysis. The relevance of this achievement becomes very clear when we note that what prevents a new microarray result from being of immediate use to the experimental Biologist acquiring it is not the computational capability of the experimentalist's machine. Biostatisticians do not necessarily have more powerful machines than molecular Biologists. Moreover, in neither case is high end computation expected to be performed in the client machine
<xref ref-type="bibr" rid="pone.0002946-Foster1">[8]</xref>
. Rather, once data gathering and data analysis applications become semantically interoperable, at the very least, those who acquire the illustrative microarray data should expect their own machines to automatically trigger its sensible analysis by background subtraction, normalization and basic multivariate exploratory analysis such as dimensionality reduction and clustering. As a consequence, the quantitative scientist's role can be focused on defining the sensibility of alternative contexts of data generation.</p>
<p>The consequences of semantic integration are just as advantageous for those dedicated to data analysis. Statistical analysts typically spend the majority of their time parsing raw datasets rather than assessing the reasonableness of alternative analytical routes. This contrasts with the critical need to validate any given analysis by comparing results produced by alternative configurations applied to independent experimental evidence. It is this final step that ultimately determines the sensibility of the data analysis procedures triggered by the acquisition of data. In summary, any data management and analysis system that will scale for systems level analysis in the Life Sciences has to be semantically interoperable if automated validation is to be attainable.</p>
<p>In this report, we have demonstrated the design of a semantic web data model, S3DBcore, capable of delivering the desired features of distribution and evolvability. This solution relies on RDF triples, the language developed to enable the semantic web in the same fashion that HTML was developed to enable the original web. However, collections of
<italic>subject-predicte-object</italic>
triples do not establish a management model by themselves. That exercise requires the encapsulation of the data within two additional layers, one confining membership and another permitting access. The effort of identifying management models for information systems has conventionally been the property of technology deployment. This is not feasible when the challenge is scaled to the level of complexity and distribution of Systems Biology. This report describes such a working management model and the authors also make its prototype deployment freely available with open source. In conclusion, a distributed integrated data management and analysis system might look like the prototype infrastructure described in
<xref ref-type="fig" rid="pone-0002946-g006">Figure 6</xref>
which is based on a semantic database backbone coupled to a code distribution server reacting to the domain of discourse being used.</p>
</sec>
</sec>
</body>
<back>
<ref-list>
<title>References</title>
<ref id="pone.0002946-Blake1">
<label>1</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Blake</surname>
<given-names>JA</given-names>
</name>
<name>
<surname>Bult</surname>
<given-names>CJ</given-names>
</name>
</person-group>
<year>2006</year>
<article-title>Beyond the data deluge: data integration and bio-ontologies.</article-title>
<source>J Biomed Inform</source>
<volume>39</volume>
<fpage>314</fpage>
<lpage>320</lpage>
<pub-id pub-id-type="pmid">16564748</pub-id>
</citation>
</ref>
<ref id="pone.0002946-Komatsoulis1">
<label>2</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Komatsoulis</surname>
<given-names>GA</given-names>
</name>
<name>
<surname>Warzel</surname>
<given-names>DB</given-names>
</name>
<name>
<surname>Hartel</surname>
<given-names>FW</given-names>
</name>
<name>
<surname>Shanbhag</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Chilukuri</surname>
<given-names>R</given-names>
</name>
<etal></etal>
</person-group>
<year>2007</year>
<article-title>caCORE version 3: Implementation of a model driven, service-oriented architecture for semantic interoperability.</article-title>
<source>J Biomed Inform</source>
</citation>
</ref>
<ref id="pone.0002946-Ruttenberg1">
<label>3</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ruttenberg</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Clark</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Bug</surname>
<given-names>W</given-names>
</name>
<name>
<surname>Samwald</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Bodenreider</surname>
<given-names>O</given-names>
</name>
<etal></etal>
</person-group>
<year>2007</year>
<article-title>Advancing translational research with the Semantic Web.</article-title>
<source>BMC Bioinformatics</source>
<volume>8</volume>
<issue>Suppl 3</issue>
<fpage>S2</fpage>
<pub-id pub-id-type="pmid">17493285</pub-id>
</citation>
</ref>
<ref id="pone.0002946-Brazhnik1">
<label>4</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Brazhnik</surname>
<given-names>O</given-names>
</name>
<name>
<surname>Jones</surname>
<given-names>JF</given-names>
</name>
</person-group>
<year>2007</year>
<article-title>Anatomy of data integration.</article-title>
<source>J Biomed Inform</source>
<volume>40</volume>
<fpage>252</fpage>
<lpage>269</lpage>
<pub-id pub-id-type="pmid">17071142</pub-id>
</citation>
</ref>
<ref id="pone.0002946-Hendler1">
<label>5</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hendler</surname>
<given-names>J</given-names>
</name>
</person-group>
<year>2003</year>
<article-title>Communication. Science and the semantic web.</article-title>
<source>Science</source>
<volume>299</volume>
<fpage>520</fpage>
<lpage>521</lpage>
<pub-id pub-id-type="pmid">12543958</pub-id>
</citation>
</ref>
<ref id="pone.0002946-Wiley1">
<label>6</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wiley</surname>
<given-names>HS</given-names>
</name>
<name>
<surname>Michaels</surname>
<given-names>GS</given-names>
</name>
</person-group>
<year>2004</year>
<article-title>Should software hold data hostage?</article-title>
<source>Nat Biotechnol</source>
<volume>22</volume>
<fpage>1037</fpage>
<lpage>1038</lpage>
<pub-id pub-id-type="pmid">15286656</pub-id>
</citation>
</ref>
<ref id="pone.0002946-Wass1">
<label>7</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wass</surname>
<given-names>J</given-names>
</name>
</person-group>
<year>2006</year>
<article-title>Integrating Knowledge.</article-title>
<source>Bio-IT World</source>
<volume>5</volume>
<fpage>22</fpage>
</citation>
</ref>
<ref id="pone.0002946-Foster1">
<label>8</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Foster</surname>
<given-names>I</given-names>
</name>
</person-group>
<year>2005</year>
<article-title>Service-oriented science.</article-title>
<source>Science</source>
<volume>308</volume>
<fpage>814</fpage>
<lpage>817</lpage>
<pub-id pub-id-type="pmid">15879208</pub-id>
</citation>
</ref>
<ref id="pone.0002946-Hey1">
<label>9</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hey</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Trefethen</surname>
<given-names>AE</given-names>
</name>
</person-group>
<year>2005</year>
<article-title>Cyberinfrastructure for e-Science.</article-title>
<source>Science</source>
<volume>308</volume>
<fpage>817</fpage>
<lpage>821</lpage>
<pub-id pub-id-type="pmid">15879209</pub-id>
</citation>
</ref>
<ref id="pone.0002946-Nadkarni1">
<label>10</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nadkarni</surname>
<given-names>PM</given-names>
</name>
<name>
<surname>Miller</surname>
<given-names>RA</given-names>
</name>
</person-group>
<year>2007</year>
<article-title>Service-oriented architecture in medical software: promises and perils.</article-title>
<source>J Am Med Inform Assoc</source>
<volume>14</volume>
<fpage>244</fpage>
<lpage>246</lpage>
<pub-id pub-id-type="pmid">17213485</pub-id>
</citation>
</ref>
<ref id="pone.0002946-Bridges1">
<label>11</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bridges</surname>
<given-names>MW</given-names>
</name>
</person-group>
<year>2007</year>
<article-title>SOA in healthcare, Sharing system resources while enhancing interoperability within and between healthcare organizations with service-oriented architecture.</article-title>
<source>Health Manag Technol</source>
<volume>28</volume>
<fpage>6, 8, 10</fpage>
<pub-id pub-id-type="pmid">17642342</pub-id>
</citation>
</ref>
<ref id="pone.0002946-Gomadam1">
<label>12</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gomadam</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Ramaswamy</surname>
</name>
<name>
<surname>Sheth</surname>
</name>
<name>
<surname>Verma</surname>
</name>
</person-group>
<year>2007</year>
<article-title>A Semantic Framework for Identifying Events in a Service Oriented Architecture.</article-title>
<source>IEEE International Conference on Web Services ICWS</source>
<volume>2007</volume>
<fpage>545</fpage>
<lpage>552</lpage>
</citation>
</ref>
<ref id="pone.0002946-Musser1">
<label>13</label>
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Musser</surname>
<given-names>J</given-names>
</name>
</person-group>
<year>2006</year>
<article-title>Web 2.0 Principles and Best Practices;</article-title>
<person-group person-group-type="editor">
<name>
<surname>O'Reilly</surname>
<given-names>T</given-names>
</name>
</person-group>
<publisher-name>O'Reilly Media, Inc</publisher-name>
</citation>
</ref>
<ref id="pone.0002946-KamelBoulos1">
<label>14</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kamel Boulos</surname>
<given-names>MN</given-names>
</name>
<name>
<surname>Wheeler</surname>
<given-names>S</given-names>
</name>
</person-group>
<year>2007</year>
<article-title>The emerging Web 2.0 social software: an enabling suite of sociable technologies in health and health care education.</article-title>
<source>Health Info Libr J</source>
<volume>24</volume>
<fpage>2</fpage>
<lpage>23</lpage>
<pub-id pub-id-type="pmid">17331140</pub-id>
</citation>
</ref>
<ref id="pone.0002946-BernersLee1">
<label>15</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Berners-Lee</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Hall</surname>
<given-names>W</given-names>
</name>
<name>
<surname>Hendler</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Shadbolt</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Weitzner</surname>
<given-names>DJ</given-names>
</name>
</person-group>
<year>2006</year>
<article-title>Computer science. Creating a science of the Web.</article-title>
<source>Science</source>
<volume>313</volume>
<fpage>769</fpage>
<lpage>771</lpage>
<pub-id pub-id-type="pmid">16902115</pub-id>
</citation>
</ref>
<ref id="pone.0002946-BernersLee2">
<label>16</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Berners-Lee</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Hendler</surname>
<given-names>J</given-names>
</name>
</person-group>
<year>2001</year>
<article-title>Publishing on the semantic web.</article-title>
<source>Nature</source>
<volume>410</volume>
<fpage>1023</fpage>
<lpage>1024</lpage>
<pub-id pub-id-type="pmid">11323639</pub-id>
</citation>
</ref>
<ref id="pone.0002946-Gordon1">
<label>17</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gordon</surname>
<given-names>PM</given-names>
</name>
<name>
<surname>Trinh</surname>
<given-names>Q</given-names>
</name>
<name>
<surname>Sensen</surname>
<given-names>CW</given-names>
</name>
</person-group>
<year>2007</year>
<article-title>Semantic Web Service provision: a realistic framework for Bioinformatics programmers.</article-title>
<source>Bioinformatics</source>
<volume>23</volume>
<fpage>1178</fpage>
<lpage>1180</lpage>
<pub-id pub-id-type="pmid">17384428</pub-id>
</citation>
</ref>
<ref id="pone.0002946-Neumann1">
<label>18</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Neumann</surname>
<given-names>E</given-names>
</name>
<name>
<surname>Prusak</surname>
<given-names>L</given-names>
</name>
</person-group>
<year>2007</year>
<article-title>Knowledge networks in the age of the Semantic Web.</article-title>
<source>Brief Bioinform</source>
<volume>8</volume>
<fpage>141</fpage>
<lpage>149</lpage>
<pub-id pub-id-type="pmid">17502336</pub-id>
</citation>
</ref>
<ref id="pone.0002946-Post1">
<label>19</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Post</surname>
<given-names>LJ</given-names>
</name>
<name>
<surname>Roos</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Marshall</surname>
<given-names>MS</given-names>
</name>
<name>
<surname>Driel</surname>
<given-names>RV</given-names>
</name>
<name>
<surname>Breit</surname>
<given-names>TM</given-names>
</name>
</person-group>
<year>2007</year>
<article-title>A semantic web approach applied to integrative bioinformatics experimentation: a biological use case with genomics data.</article-title>
<source>Bioinformatics</source>
</citation>
</ref>
<ref id="pone.0002946-Wang1">
<label>20</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Gorlitsky</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Almeida</surname>
<given-names>JS</given-names>
</name>
</person-group>
<year>2005</year>
<article-title>From XML to RDF: how semantic web technologies will change the design of ‘omic’ standards.</article-title>
<source>Nat Biotechnol</source>
<volume>23</volume>
<fpage>1099</fpage>
<lpage>1103</lpage>
<pub-id pub-id-type="pmid">16151403</pub-id>
</citation>
</ref>
<ref id="pone.0002946-Feigenbaum1">
<label>21</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Feigenbaum</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Martin</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Roy</surname>
<given-names>MN</given-names>
</name>
<name>
<surname>Szekely</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Yung</surname>
<given-names>WC</given-names>
</name>
</person-group>
<year>2007</year>
<article-title>Boca: an open-source RDF store for building Semantic Web applications.</article-title>
<source>Brief Bioinform</source>
<volume>8</volume>
<fpage>195</fpage>
<lpage>200</lpage>
<pub-id pub-id-type="pmid">17491005</pub-id>
</citation>
</ref>
<ref id="pone.0002946-Borland1">
<label>22</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Borland</surname>
<given-names>J</given-names>
</name>
</person-group>
<year>2007</year>
<article-title>A Smarter Web.</article-title>
<source>Technology Review March/April</source>
</citation>
</ref>
<ref id="pone.0002946-Green1">
<label>23</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Green</surname>
<given-names>H</given-names>
</name>
</person-group>
<year>2007</year>
<article-title>A Web That Thinks Like You.</article-title>
<source>Businessweek</source>
<volume>28</volume>
</citation>
</ref>
<ref id="pone.0002946-Almeida1">
<label>24</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Almeida</surname>
<given-names>JS</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Gorlitsky</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Stanislaus</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Aires-de-Sousa</surname>
<given-names>M</given-names>
</name>
<etal></etal>
</person-group>
<year>2006</year>
<article-title>Data integration gets ‘Sloppy’.</article-title>
<source>Nat Biotechnol</source>
<volume>24</volume>
<fpage>1070</fpage>
<lpage>1071</lpage>
<pub-id pub-id-type="pmid">16964209</pub-id>
</citation>
</ref>
<ref id="pone.0002946-s3db1">
<label>25</label>
<citation citation-type="other">s3db 2.0</citation>
</ref>
<ref id="pone.0002946-Robu1">
<label>26</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Robu</surname>
<given-names>I</given-names>
</name>
<name>
<surname>Robu</surname>
<given-names>V</given-names>
</name>
<name>
<surname>Thirion</surname>
<given-names>B</given-names>
</name>
</person-group>
<year>2006</year>
<article-title>An introduction to the Semantic Web for health sciences librarians.</article-title>
<source>J Med Libr Assoc</source>
<volume>94</volume>
<fpage>198</fpage>
<lpage>205</lpage>
<pub-id pub-id-type="pmid">16636713</pub-id>
</citation>
</ref>
<ref id="pone.0002946-Silva1">
<label>27</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Silva</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Gouveia-Oliveira</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Maretzek</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Carrico</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Gudnason</surname>
<given-names>T</given-names>
</name>
<etal></etal>
</person-group>
<year>2003</year>
<article-title>EURISWEB–Web-based epidemiological surveillance of antibiotic-resistant pneumococci in day care centers.</article-title>
<source>BMC Med Inform Decis Mak</source>
<volume>3</volume>
<fpage>9</fpage>
<pub-id pub-id-type="pmid">12846930</pub-id>
</citation>
</ref>
<ref id="pone.0002946-Stanislaus1">
<label>28</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Stanislaus</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Franklin</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Arthur</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Almeida</surname>
<given-names>JS</given-names>
</name>
</person-group>
<year>2005</year>
<article-title>AGML Central: web based gel proteomic infrastructure.</article-title>
<source>Bioinformatics</source>
<volume>21</volume>
<fpage>1754</fpage>
<lpage>1757</lpage>
<pub-id pub-id-type="pmid">15647304</pub-id>
</citation>
</ref>
<ref id="pone.0002946-IvanHerman1">
<label>29</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ivan Herman</surname>
<given-names>RS</given-names>
</name>
<name>
<surname>Dan</surname>
<given-names>Brickley</given-names>
</name>
</person-group>
<year>2007</year>
<article-title>Resource Description Framework (RDF).</article-title>
<source>The World Wide Web Consortium</source>
</citation>
</ref>
<ref id="pone.0002946-Pedrioli1">
<label>30</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pedrioli</surname>
<given-names>PG</given-names>
</name>
<name>
<surname>Eng</surname>
<given-names>JK</given-names>
</name>
<name>
<surname>Hubley</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Vogelzang</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Deutsch</surname>
<given-names>EW</given-names>
</name>
<etal></etal>
</person-group>
<year>2004</year>
<article-title>A common open representation of mass spectrometry data and its application to proteomics research.</article-title>
<source>Nat Biotechnol</source>
<volume>22</volume>
<fpage>1459</fpage>
<lpage>1466</lpage>
<pub-id pub-id-type="pmid">15529173</pub-id>
</citation>
</ref>
<ref id="pone.0002946-Orchard1">
<label>31</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Orchard</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Jones</surname>
<given-names>AR</given-names>
</name>
<name>
<surname>Stephan</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Binz</surname>
<given-names>PA</given-names>
</name>
</person-group>
<year>2007</year>
<article-title>The HUPO pre-congress Proteomics Standards Initiative workshop. HUPO 5th annual World Congress. Long Beach, CA, USA 28 October-1 November 2006.</article-title>
<source>Proteomics</source>
<volume>7</volume>
<fpage>1006</fpage>
<lpage>1008</lpage>
<pub-id pub-id-type="pmid">17340643</pub-id>
</citation>
</ref>
<ref id="pone.0002946-Orchard2">
<label>32</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Orchard</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Montechi-Palazzi</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Deutsch</surname>
<given-names>EW</given-names>
</name>
<name>
<surname>Binz</surname>
<given-names>PA</given-names>
</name>
<name>
<surname>Jones</surname>
<given-names>AR</given-names>
</name>
<etal></etal>
</person-group>
<year>2007</year>
<article-title>Five years of progress in the Standardization of Proteomics Data 4(th) Annual Spring Workshop of the HUPO-Proteomics Standards Initiative April 23–25, 2007 Ecole Nationale Superieure (ENS), Lyon, France.</article-title>
<source>Proteomics</source>
<volume>7</volume>
<fpage>3436</fpage>
<lpage>3440</lpage>
<pub-id pub-id-type="pmid">17907277</pub-id>
</citation>
</ref>
<ref id="pone.0002946-Klimek1">
<label>33</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Klimek</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Eddes</surname>
<given-names>JS</given-names>
</name>
<name>
<surname>Hohmann</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Jackson</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Peterson</surname>
<given-names>A</given-names>
</name>
<etal></etal>
</person-group>
<year>2007</year>
<article-title>The Standard Protein Mix Database: A Diverse Data Set To Assist in the Production of Improved Peptide and Protein Identification Software Tools.</article-title>
<source>J Proteome Res</source>
</citation>
</ref>
<ref id="pone.0002946-AhoJDU1">
<label>34</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Aho JDU</surname>
<given-names>AV</given-names>
</name>
</person-group>
<year>1979</year>
<article-title>Universality of data retrieval languages.</article-title>
<source>Proceedings of the 6th ACM SIGACT-SIGPLAN symposium on Principles of programming languages</source>
<fpage>110</fpage>
<lpage>119</lpage>
</citation>
</ref>
<ref id="pone.0002946-Aranguren1">
<label>35</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Aranguren</surname>
<given-names>ME</given-names>
</name>
<name>
<surname>Bechhofer</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Lord</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Sattler</surname>
<given-names>U</given-names>
</name>
<name>
<surname>Stevens</surname>
<given-names>R</given-names>
</name>
</person-group>
<year>2007</year>
<article-title>Understanding and using the meaning of statements in a bio-ontology: recasting the Gene Ontology in OWL.</article-title>
<source>BMC Bioinformatics</source>
<volume>8</volume>
<fpage>57</fpage>
<pub-id pub-id-type="pmid">17311682</pub-id>
</citation>
</ref>
<ref id="pone.0002946-Lam1">
<label>36</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lam</surname>
<given-names>HY</given-names>
</name>
<name>
<surname>Marenco</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Shepherd</surname>
<given-names>GM</given-names>
</name>
<name>
<surname>Miller</surname>
<given-names>PL</given-names>
</name>
<name>
<surname>Cheung</surname>
<given-names>KH</given-names>
</name>
</person-group>
<year>2006</year>
<article-title>Using web ontology language to integrate heterogeneous databases in the neurosciences.</article-title>
<source>AMIA Annu Symp Proc</source>
<fpage>464</fpage>
<lpage>468</lpage>
<pub-id pub-id-type="pmid">17238384</pub-id>
</citation>
</ref>
<ref id="pone.0002946-Zhang1">
<label>37</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Bodenreider</surname>
<given-names>O</given-names>
</name>
<name>
<surname>Golbreich</surname>
<given-names>C</given-names>
</name>
</person-group>
<year>2006</year>
<article-title>Experience in reasoning with the foundational model of anatomy in OWL DL.</article-title>
<source>Pac Symp Biocomput</source>
<fpage>200</fpage>
<lpage>211</lpage>
<pub-id pub-id-type="pmid">17094240</pub-id>
</citation>
</ref>
<ref id="pone.0002946-Miller1">
<label>38</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Miller</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Rifaieh</surname>
<given-names>R</given-names>
</name>
</person-group>
<year>2006</year>
<article-title>Wrestling with SUMO and bio-ontologies.</article-title>
<source>Nat Biotechnol</source>
<volume>24</volume>
<fpage>22</fpage>
<lpage>23; author reply 23</lpage>
<pub-id pub-id-type="pmid">16404383</pub-id>
</citation>
</ref>
<ref id="pone.0002946-Musen1">
<label>39</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Musen</surname>
<given-names>MA</given-names>
</name>
<name>
<surname>Lewis</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Smith</surname>
<given-names>B</given-names>
</name>
</person-group>
<year>2006</year>
<article-title>Wrestling with SUMO and bio-ontologies.</article-title>
<source>Nat Biotechnol</source>
<volume>24</volume>
<fpage>21; author reply 23</fpage>
<pub-id pub-id-type="pmid">16404381</pub-id>
</citation>
</ref>
<ref id="pone.0002946-Stoeckert1">
<label>40</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Stoeckert</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Ball</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Brazma</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Brinkman</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Causton</surname>
<given-names>H</given-names>
</name>
<etal></etal>
</person-group>
<year>2006</year>
<article-title>Wrestling with SUMO and bio-ontologies.</article-title>
<source>Nat Biotechnol</source>
<volume>24</volume>
<fpage>21</fpage>
<lpage>22; author reply 23</lpage>
<pub-id pub-id-type="pmid">16404382</pub-id>
</citation>
</ref>
<ref id="pone.0002946-Blake2">
<label>41</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Blake</surname>
<given-names>J</given-names>
</name>
</person-group>
<year>2004</year>
<article-title>Bio-ontologies-fast and furious.</article-title>
<source>Nat Biotechnol</source>
<volume>22</volume>
<fpage>773</fpage>
<lpage>774</lpage>
<pub-id pub-id-type="pmid">15175701</pub-id>
</citation>
</ref>
<ref id="pone.0002946-Soldatova1">
<label>42</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Soldatova</surname>
<given-names>LN</given-names>
</name>
<name>
<surname>King</surname>
<given-names>RD</given-names>
</name>
</person-group>
<year>2005</year>
<article-title>Are the current ontologies in biology good ontologies?</article-title>
<source>Nat Biotechnol</source>
<volume>23</volume>
<fpage>1095</fpage>
<lpage>1098</lpage>
<pub-id pub-id-type="pmid">16151402</pub-id>
</citation>
</ref>
<ref id="pone.0002946-Merelli1">
<label>43</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Merelli</surname>
<given-names>E</given-names>
</name>
<name>
<surname>Armano</surname>
<given-names>G</given-names>
</name>
<name>
<surname>Cannata</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Corradini</surname>
<given-names>F</given-names>
</name>
<name>
<surname>d'Inverno</surname>
<given-names>M</given-names>
</name>
<etal></etal>
</person-group>
<year>2007</year>
<article-title>Agents in bioinformatics, computational and systems biology.</article-title>
<source>Brief Bioinform</source>
<volume>8</volume>
<fpage>45</fpage>
<lpage>59</lpage>
<pub-id pub-id-type="pmid">16772270</pub-id>
</citation>
</ref>
<ref id="pone.0002946-AntoineIsaac1">
<label>44</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Antoine Isaac</surname>
<given-names>JP</given-names>
</name>
<name>
<surname>Daniel</surname>
<given-names>Rubin</given-names>
</name>
</person-group>
<year>2007</year>
<article-title>SKOS Use Cases and Requirements.</article-title>
</citation>
</ref>
<ref id="pone.0002946-The1">
<label>45</label>
<citation citation-type="other">The University of Texas Lung Cancer SPORE. P50 CA70907</citation>
</ref>
<ref id="pone.0002946-Baker1">
<label>46</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Baker</surname>
<given-names>T</given-names>
</name>
</person-group>
<year>2005</year>
<article-title>A Common Grammar for Diverse Vocabularies: The Abstract Model for Dublin Core.</article-title>
<source>Lecture Notes in Computer Science</source>
<volume>3815</volume>
<fpage>495</fpage>
</citation>
</ref>
<ref id="pone.0002946-Good1">
<label>47</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Good</surname>
<given-names>BM</given-names>
</name>
<name>
<surname>Wilkinson</surname>
<given-names>MD</given-names>
</name>
</person-group>
<year>2006</year>
<article-title>The Life Sciences Semantic Web is full of creeps!</article-title>
<source>Brief Bioinform</source>
<volume>7</volume>
<fpage>275</fpage>
<lpage>286</lpage>
<pub-id pub-id-type="pmid">16899496</pub-id>
</citation>
</ref>
<ref id="pone.0002946-Neumann2">
<label>48</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Neumann</surname>
<given-names>E</given-names>
</name>
</person-group>
<year>2005</year>
<article-title>A life science Semantic Web: are we there yet?</article-title>
<source>Sci STKE</source>
<volume>2005</volume>
<fpage>pe22</fpage>
<pub-id pub-id-type="pmid">15886389</pub-id>
</citation>
</ref>
</ref-list>
<fn-group>
<fn fn-type="conflict">
<p>
<bold>Competing Interests: </bold>
The authors have declared that no competing interests exist.</p>
</fn>
<fn fn-type="financial-disclosure">
<p>
<bold>Funding: </bold>
This work was supported in part by the National Heart, Lung and Blood Institute (NHLBI), by the National Cancer Institute (NCI) of the US National Institutes of Health (NIH), and by the Center for Clinical and Translational Sciences under contracts no. N01-HV-28181, P50 CA70907, and 1UL1RR024148, respectively. The authors also acknowledge support by the PREVIS project, contract number LSHM-CT-2003-503413 from the European Union Commission.</p>
</fn>
</fn-group>
</back>
</pmc>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/CyberinfraV1/Data/Pmc/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000312 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Pmc/Corpus/biblio.hfd -nk 000312 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    CyberinfraV1
   |flux=    Pmc
   |étape=   Corpus
   |type=    RBID
   |clé=     PMC:2491554
   |texte=   A Semantic Web Management Model for Integrative Biomedical Informatics
}}

Pour générer des pages wiki

HfdIndexSelect -h $EXPLOR_AREA/Data/Pmc/Corpus/RBID.i   -Sk "pubmed:18698353" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Pmc/Corpus/biblio.hfd   \
       | NlmPubMed2Wicri -a CyberinfraV1 

Wicri

This area was generated with Dilib version V0.6.25.
Data generation: Thu Oct 27 09:30:58 2016. Site generation: Sun Mar 10 23:08:40 2024