Integration and publication of heterogeneous text-mined relationships on the Semantic Web
Identifieur interne : 000079 ( Pmc/Curation ); précédent : 000078; suivant : 000080Integration and publication of heterogeneous text-mined relationships on the Semantic Web
Auteurs : Adrien Coulet [France, États-Unis] ; Yael Garten [États-Unis] ; Michel Dumontier ; Russ B. Altman [États-Unis] ; Mark A. Musen [États-Unis] ; Nigam H. Shah [États-Unis]Source :
- Journal of Biomedical Semantics [ 2041-1480 ] ; 2011.
Abstract
Advances in Natural Language Processing (NLP) techniques enable the extraction of fine-grained relationships mentioned in biomedical text. The variability and the complexity of natural language in expressing similar relationships causes the extracted relationships to be highly heterogeneous, which makes the construction of knowledge bases difficult and poses a challenge in using these for data mining or question answering.
We report on the semi-automatic construction of the PHARE relationship ontology (the PHArmacogenomic RElationships Ontology) consisting of 200 curated relations from over 40,000 heterogeneous relationships extracted via text-mining. These heterogeneous relations are then mapped to the PHARE ontology using synonyms, entity descriptions and hierarchies of entities and roles. Once mapped, relationships can be normalized and compared using the structure of the ontology to identify relationships that have similar semantics but different syntax. We compare and contrast the manual procedure with a fully automated approach using WordNet to quantify the degree of integration enabled by iterative curation and refinement of the PHARE ontology. The result of such integration is a repository of normalized biomedical relationships, named PHARE-KB, which can be queried using Semantic Web technologies such as SPARQL and can be visualized in the form of a biological network.
The PHARE ontology serves as a common semantic framework to integrate more than 40,000 relationships pertinent to pharmacogenomics. The PHARE ontology forms the foundation of a knowledge base named PHARE-KB. Once populated with relationships, PHARE-KB (
Url:
DOI: 10.1186/2041-1480-2-S2-S10
PubMed: 21624156
PubMed Central: 3102890
Links toward previous steps (curation, corpus...)
- to stream Pmc, to step Corpus: Pour aller vers cette notice dans l'étape Curation :000079
Links to Exploration step
PMC:3102890Curation
No country items
Michel Dumontier<affiliation><nlm:aff id="I4">Department of Biology, Carleton University, 1125 Colonel By Drive, Ottawa, ON, Canada, K1S5B6</nlm:aff>
<wicri:noCountry code="subfield">K1S5B6</wicri:noCountry>
</affiliation>
Le document en format XML
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en">Integration and publication of heterogeneous text-mined relationships on the Semantic Web</title>
<author><name sortKey="Coulet, Adrien" sort="Coulet, Adrien" uniqKey="Coulet A" first="Adrien" last="Coulet">Adrien Coulet</name>
<affiliation wicri:level="1"><nlm:aff id="I1">LORIA – INRIA Nancy – Grand-Est, Campus Scientifique - BP 239 - 54506 Vandoeuvre-lès-Nancy Cedex, France</nlm:aff>
<country xml:lang="fr">France</country>
<wicri:regionArea>LORIA – INRIA Nancy – Grand-Est, Campus Scientifique - BP 239 - 54506 Vandoeuvre-lès-Nancy Cedex</wicri:regionArea>
</affiliation>
<affiliation wicri:level="1"><nlm:aff id="I2">Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305</wicri:regionArea>
</affiliation>
<affiliation wicri:level="1"><nlm:aff id="I3">Department of Genetics, Mail Code 5120, Stanford University, Stanford, CA, 94305, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Genetics, Mail Code 5120, Stanford University, Stanford, CA, 94305</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Garten, Yael" sort="Garten, Yael" uniqKey="Garten Y" first="Yael" last="Garten">Yael Garten</name>
<affiliation wicri:level="1"><nlm:aff id="I2">Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305</wicri:regionArea>
</affiliation>
<affiliation wicri:level="1"><nlm:aff id="I3">Department of Genetics, Mail Code 5120, Stanford University, Stanford, CA, 94305, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Genetics, Mail Code 5120, Stanford University, Stanford, CA, 94305</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Dumontier, Michel" sort="Dumontier, Michel" uniqKey="Dumontier M" first="Michel" last="Dumontier">Michel Dumontier</name>
<affiliation><nlm:aff id="I4">Department of Biology, Carleton University, 1125 Colonel By Drive, Ottawa, ON, Canada, K1S5B6</nlm:aff>
<wicri:noCountry code="subfield">K1S5B6</wicri:noCountry>
</affiliation>
</author>
<author><name sortKey="Altman, Russ B" sort="Altman, Russ B" uniqKey="Altman R" first="Russ B" last="Altman">Russ B. Altman</name>
<affiliation wicri:level="1"><nlm:aff id="I2">Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305</wicri:regionArea>
</affiliation>
<affiliation wicri:level="1"><nlm:aff id="I3">Department of Genetics, Mail Code 5120, Stanford University, Stanford, CA, 94305, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Genetics, Mail Code 5120, Stanford University, Stanford, CA, 94305</wicri:regionArea>
</affiliation>
<affiliation wicri:level="1"><nlm:aff id="I5">Department of Bioengineering, 318 Campus Drive, Mail Code 5444, Stanford University, Stanford, CA, 94305, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Bioengineering, 318 Campus Drive, Mail Code 5444, Stanford University, Stanford, CA, 94305</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Musen, Mark A" sort="Musen, Mark A" uniqKey="Musen M" first="Mark A" last="Musen">Mark A. Musen</name>
<affiliation wicri:level="1"><nlm:aff id="I2">Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Shah, Nigam H" sort="Shah, Nigam H" uniqKey="Shah N" first="Nigam H" last="Shah">Nigam H. Shah</name>
<affiliation wicri:level="1"><nlm:aff id="I2">Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305</wicri:regionArea>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">PMC</idno>
<idno type="pmid">21624156</idno>
<idno type="pmc">3102890</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3102890</idno>
<idno type="RBID">PMC:3102890</idno>
<idno type="doi">10.1186/2041-1480-2-S2-S10</idno>
<date when="2011">2011</date>
<idno type="wicri:Area/Pmc/Corpus">000079</idno>
<idno type="wicri:explorRef" wicri:stream="Pmc" wicri:step="Corpus" wicri:corpus="PMC">000079</idno>
<idno type="wicri:Area/Pmc/Curation">000079</idno>
<idno type="wicri:explorRef" wicri:stream="Pmc" wicri:step="Curation">000079</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a" type="main">Integration and publication of heterogeneous text-mined relationships on the Semantic Web</title>
<author><name sortKey="Coulet, Adrien" sort="Coulet, Adrien" uniqKey="Coulet A" first="Adrien" last="Coulet">Adrien Coulet</name>
<affiliation wicri:level="1"><nlm:aff id="I1">LORIA – INRIA Nancy – Grand-Est, Campus Scientifique - BP 239 - 54506 Vandoeuvre-lès-Nancy Cedex, France</nlm:aff>
<country xml:lang="fr">France</country>
<wicri:regionArea>LORIA – INRIA Nancy – Grand-Est, Campus Scientifique - BP 239 - 54506 Vandoeuvre-lès-Nancy Cedex</wicri:regionArea>
</affiliation>
<affiliation wicri:level="1"><nlm:aff id="I2">Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305</wicri:regionArea>
</affiliation>
<affiliation wicri:level="1"><nlm:aff id="I3">Department of Genetics, Mail Code 5120, Stanford University, Stanford, CA, 94305, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Genetics, Mail Code 5120, Stanford University, Stanford, CA, 94305</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Garten, Yael" sort="Garten, Yael" uniqKey="Garten Y" first="Yael" last="Garten">Yael Garten</name>
<affiliation wicri:level="1"><nlm:aff id="I2">Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305</wicri:regionArea>
</affiliation>
<affiliation wicri:level="1"><nlm:aff id="I3">Department of Genetics, Mail Code 5120, Stanford University, Stanford, CA, 94305, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Genetics, Mail Code 5120, Stanford University, Stanford, CA, 94305</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Dumontier, Michel" sort="Dumontier, Michel" uniqKey="Dumontier M" first="Michel" last="Dumontier">Michel Dumontier</name>
<affiliation><nlm:aff id="I4">Department of Biology, Carleton University, 1125 Colonel By Drive, Ottawa, ON, Canada, K1S5B6</nlm:aff>
<wicri:noCountry code="subfield">K1S5B6</wicri:noCountry>
</affiliation>
</author>
<author><name sortKey="Altman, Russ B" sort="Altman, Russ B" uniqKey="Altman R" first="Russ B" last="Altman">Russ B. Altman</name>
<affiliation wicri:level="1"><nlm:aff id="I2">Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305</wicri:regionArea>
</affiliation>
<affiliation wicri:level="1"><nlm:aff id="I3">Department of Genetics, Mail Code 5120, Stanford University, Stanford, CA, 94305, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Genetics, Mail Code 5120, Stanford University, Stanford, CA, 94305</wicri:regionArea>
</affiliation>
<affiliation wicri:level="1"><nlm:aff id="I5">Department of Bioengineering, 318 Campus Drive, Mail Code 5444, Stanford University, Stanford, CA, 94305, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Bioengineering, 318 Campus Drive, Mail Code 5444, Stanford University, Stanford, CA, 94305</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Musen, Mark A" sort="Musen, Mark A" uniqKey="Musen M" first="Mark A" last="Musen">Mark A. Musen</name>
<affiliation wicri:level="1"><nlm:aff id="I2">Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Shah, Nigam H" sort="Shah, Nigam H" uniqKey="Shah N" first="Nigam H" last="Shah">Nigam H. Shah</name>
<affiliation wicri:level="1"><nlm:aff id="I2">Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305</wicri:regionArea>
</affiliation>
</author>
</analytic>
<series><title level="j">Journal of Biomedical Semantics</title>
<idno type="eISSN">2041-1480</idno>
<imprint><date when="2011">2011</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass></textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en"><sec><title>Background</title>
<p>Advances in Natural Language Processing (NLP) techniques enable the extraction of fine-grained relationships mentioned in biomedical text. The variability and the complexity of natural language in expressing similar relationships causes the extracted relationships to be highly heterogeneous, which makes the construction of knowledge bases difficult and poses a challenge in using these for data mining or question answering.</p>
</sec>
<sec><title>Results</title>
<p>We report on the semi-automatic construction of the PHARE relationship ontology (the PHArmacogenomic RElationships Ontology) consisting of 200 curated relations from over 40,000 heterogeneous relationships extracted via text-mining. These heterogeneous relations are then mapped to the PHARE ontology using synonyms, entity descriptions and hierarchies of entities and roles. Once mapped, relationships can be normalized and compared using the structure of the ontology to identify relationships that have similar semantics but different syntax. We compare and contrast the manual procedure with a fully automated approach using WordNet to quantify the degree of integration enabled by iterative curation and refinement of the PHARE ontology. The result of such integration is a repository of normalized biomedical relationships, named PHARE-KB, which can be queried using Semantic Web technologies such as SPARQL and can be visualized in the form of a biological network.</p>
</sec>
<sec><title>Conclusions</title>
<p>The PHARE ontology serves as a common semantic framework to integrate more than 40,000 relationships pertinent to pharmacogenomics. The PHARE ontology forms the foundation of a knowledge base named PHARE-KB. Once populated with relationships, PHARE-KB (<italic>i</italic>
) can be visualized in the form of a biological network to guide human tasks such as database curation and (<italic>ii</italic>
) can be queried programmatically to guide bioinformatics applications such as the prediction of molecular interactions. PHARE is available at <ext-link ext-link-type="uri" xlink:href="http://purl.bioontology.org/ontology/PHARE">http://purl.bioontology.org/ontology/PHARE</ext-link>
.</p>
</sec>
</div>
</front>
<back><div1 type="bibliography"><listBibl><biblStruct><analytic><author><name sortKey="Groth, P" uniqKey="Groth P">P Groth</name>
</author>
<author><name sortKey="Gibson, A" uniqKey="Gibson A">A Gibson</name>
</author>
<author><name sortKey="Velterop, J" uniqKey="Velterop J">J Velterop</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Klein, T" uniqKey="Klein T">T Klein</name>
</author>
<author><name sortKey="Chang, J" uniqKey="Chang J">J Chang</name>
</author>
<author><name sortKey="Cho, M" uniqKey="Cho M">M Cho</name>
</author>
<author><name sortKey="Easton, K" uniqKey="Easton K">K Easton</name>
</author>
<author><name sortKey="Fergerson, K" uniqKey="Fergerson K">K Fergerson</name>
</author>
<author><name sortKey="Hewett, M" uniqKey="Hewett M">M Hewett</name>
</author>
<author><name sortKey="Lin, Z" uniqKey="Lin Z">Z Lin</name>
</author>
<author><name sortKey="Liu, Y" uniqKey="Liu Y">Y Liu</name>
</author>
<author><name sortKey="Liu, S" uniqKey="Liu S">S Liu</name>
</author>
<author><name sortKey="Oliver, D" uniqKey="Oliver D">D Oliver</name>
</author>
<author><name sortKey="Rubin, D" uniqKey="Rubin D">D Rubin</name>
</author>
<author><name sortKey="Shafa, F" uniqKey="Shafa F">F Shafa</name>
</author>
<author><name sortKey="Stuart, J" uniqKey="Stuart J">J Stuart</name>
</author>
<author><name sortKey="Altman, Rb" uniqKey="Altman R">RB Altman</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Garten, Y" uniqKey="Garten Y">Y Garten</name>
</author>
<author><name sortKey="Coulet, A" uniqKey="Coulet A">A Coulet</name>
</author>
<author><name sortKey="Altman, R" uniqKey="Altman R">R Altman</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Hunter, L" uniqKey="Hunter L">L Hunter</name>
</author>
<author><name sortKey="Lu, Z" uniqKey="Lu Z">Z Lu</name>
</author>
<author><name sortKey="Firby, J" uniqKey="Firby J">J Firby</name>
</author>
<author><name sortKey="Baumgartner, Wa" uniqKey="Baumgartner W">WA Baumgartner</name>
</author>
<author><name sortKey="Johnson, Hl" uniqKey="Johnson H">HL Johnson</name>
</author>
<author><name sortKey="Ogren, P" uniqKey="Ogren P">P Ogren</name>
</author>
<author><name sortKey="Cohen, K" uniqKey="Cohen K">K Cohen</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Friedman, C" uniqKey="Friedman C">C Friedman</name>
</author>
<author><name sortKey="Kra, P" uniqKey="Kra P">P Kra</name>
</author>
<author><name sortKey="Yu, H" uniqKey="Yu H">H Yu</name>
</author>
<author><name sortKey="Krauthammer, M" uniqKey="Krauthammer M">M Krauthammer</name>
</author>
<author><name sortKey="Rzhetsky, A" uniqKey="Rzhetsky A">A Rzhetsky</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Saric, J" uniqKey="Saric J">J Saric</name>
</author>
<author><name sortKey="Jensen, Lj" uniqKey="Jensen L">LJ Jensen</name>
</author>
<author><name sortKey="Ouzounova, R" uniqKey="Ouzounova R">R Ouzounova</name>
</author>
<author><name sortKey="Rojas, I" uniqKey="Rojas I">I Rojas</name>
</author>
<author><name sortKey="Bork, P" uniqKey="Bork P">P Bork</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Ciaramita, M" uniqKey="Ciaramita M">M Ciaramita</name>
</author>
<author><name sortKey="Gangemi, A" uniqKey="Gangemi A">A Gangemi</name>
</author>
<author><name sortKey="Ratsch, E" uniqKey="Ratsch E">E Ratsch</name>
</author>
<author><name sortKey="Saric, J" uniqKey="Saric J">J Saric</name>
</author>
<author><name sortKey="Rojas, I" uniqKey="Rojas I">I Rojas</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Ramakrishnan, C" uniqKey="Ramakrishnan C">C Ramakrishnan</name>
</author>
<author><name sortKey="Mendes, P" uniqKey="Mendes P">P Mendes</name>
</author>
<author><name sortKey="Wang, S" uniqKey="Wang S">S Wang</name>
</author>
<author><name sortKey="Sheth, A" uniqKey="Sheth A">A Sheth</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Tari, L" uniqKey="Tari L">L Tari</name>
</author>
<author><name sortKey="Answar, S" uniqKey="Answar S">S Answar</name>
</author>
<author><name sortKey="Liang, S" uniqKey="Liang S">S Liang</name>
</author>
<author><name sortKey="Cai, J" uniqKey="Cai J">J Cai</name>
</author>
<author><name sortKey="Baral, C" uniqKey="Baral C">C Baral</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Manning, Cd" uniqKey="Manning C">CD Manning</name>
</author>
<author><name sortKey="Schutze, H" uniqKey="Schutze H">H Schütze</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Coulet, A" uniqKey="Coulet A">A Coulet</name>
</author>
<author><name sortKey="Shah, Nh" uniqKey="Shah N">NH Shah</name>
</author>
<author><name sortKey="Garten, Y" uniqKey="Garten Y">Y Garten</name>
</author>
<author><name sortKey="Musen, Ma" uniqKey="Musen M">MA Musen</name>
</author>
<author><name sortKey="Altman, Rb" uniqKey="Altman R">RB Altman</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Agichtein, E" uniqKey="Agichtein E">E Agichtein</name>
</author>
<author><name sortKey="Gravano, L" uniqKey="Gravano L">L Gravano</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Xu, R" uniqKey="Xu R">R Xu</name>
</author>
<author><name sortKey="Supekar, K" uniqKey="Supekar K">K Supekar</name>
</author>
<author><name sortKey="Morgan, A" uniqKey="Morgan A">A Morgan</name>
</author>
<author><name sortKey="Das, A" uniqKey="Das A">A Das</name>
</author>
<author><name sortKey="Garber, A" uniqKey="Garber A">A Garber</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct><analytic><author><name sortKey="De Marneffe, Mc" uniqKey="De Marneffe M">MC de Marneffe</name>
</author>
<author><name sortKey="Manning, Cd" uniqKey="Manning C">CD Manning</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Feebaum, C" uniqKey="Feebaum C">C Feebaum</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct><analytic><author><name sortKey="Smith, B" uniqKey="Smith B">B Smith</name>
</author>
<author><name sortKey="Ceusters, W" uniqKey="Ceusters W">W Ceusters</name>
</author>
<author><name sortKey="Klagges, B" uniqKey="Klagges B">B Klagges</name>
</author>
<author><name sortKey="Kohler, J" uniqKey="Kohler J">J Köhler</name>
</author>
<author><name sortKey="Kumar, A" uniqKey="Kumar A">A Kumar</name>
</author>
<author><name sortKey="Lomax, J" uniqKey="Lomax J">J Lomax</name>
</author>
<author><name sortKey="Mungall, C" uniqKey="Mungall C">C Mungall</name>
</author>
<author><name sortKey="Neuhaus, F" uniqKey="Neuhaus F">F Neuhaus</name>
</author>
<author><name sortKey="Rector, Al" uniqKey="Rector A">AL Rector</name>
</author>
<author><name sortKey="Rosse, C" uniqKey="Rosse C">C Rosse</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct><analytic><author><name sortKey="Ciccarese, P" uniqKey="Ciccarese P">P Ciccarese</name>
</author>
<author><name sortKey="Ocana, M" uniqKey="Ocana M">M Ocana</name>
</author>
<author><name sortKey="Castro, Ljg" uniqKey="Castro L">LJG Castro</name>
</author>
<author><name sortKey="Das, S" uniqKey="Das S">S Das</name>
</author>
<author><name sortKey="Clark, T" uniqKey="Clark T">T Clark</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct><analytic><author><name sortKey="Garten, Y" uniqKey="Garten Y">Y Garten</name>
</author>
</analytic>
</biblStruct>
</listBibl>
</div1>
</back>
</TEI>
<pmc article-type="research-article"><pmc-dir>properties open_access</pmc-dir>
<front><journal-meta><journal-id journal-id-type="nlm-ta">J Biomed Semantics</journal-id>
<journal-title-group><journal-title>Journal of Biomedical Semantics</journal-title>
</journal-title-group>
<issn pub-type="epub">2041-1480</issn>
<publisher><publisher-name>BioMed Central</publisher-name>
</publisher>
</journal-meta>
<article-meta><article-id pub-id-type="pmid">21624156</article-id>
<article-id pub-id-type="pmc">3102890</article-id>
<article-id pub-id-type="publisher-id">2041-1480-2-S2-S10</article-id>
<article-id pub-id-type="doi">10.1186/2041-1480-2-S2-S10</article-id>
<article-categories><subj-group subj-group-type="heading"><subject>Proceedings</subject>
</subj-group>
</article-categories>
<title-group><article-title>Integration and publication of heterogeneous text-mined relationships on the Semantic Web</article-title>
</title-group>
<contrib-group><contrib contrib-type="author" corresp="yes" id="A1"><name><surname>Coulet</surname>
<given-names>Adrien</given-names>
</name>
<xref ref-type="aff" rid="I1">1</xref>
<xref ref-type="aff" rid="I2">2</xref>
<xref ref-type="aff" rid="I3">3</xref>
<email>adrien.coulet@loria.fr</email>
</contrib>
<contrib contrib-type="author" id="A2"><name><surname>Garten</surname>
<given-names>Yael</given-names>
</name>
<xref ref-type="aff" rid="I2">2</xref>
<xref ref-type="aff" rid="I3">3</xref>
<email>ygarten@stanford.edu</email>
</contrib>
<contrib contrib-type="author" id="A3"><name><surname>Dumontier</surname>
<given-names>Michel</given-names>
</name>
<xref ref-type="aff" rid="I4">4</xref>
<email>michel_dumontier@carlton.ca</email>
</contrib>
<contrib contrib-type="author" id="A4"><name><surname>Altman</surname>
<given-names>Russ B</given-names>
</name>
<xref ref-type="aff" rid="I2">2</xref>
<xref ref-type="aff" rid="I3">3</xref>
<xref ref-type="aff" rid="I5">5</xref>
<email>russ.altman@stanford.edu</email>
</contrib>
<contrib contrib-type="author" id="A5"><name><surname>Musen</surname>
<given-names>Mark A</given-names>
</name>
<xref ref-type="aff" rid="I2">2</xref>
<email>musen@stanford.edu</email>
</contrib>
<contrib contrib-type="author" id="A6"><name><surname>Shah</surname>
<given-names>Nigam H</given-names>
</name>
<xref ref-type="aff" rid="I2">2</xref>
<email>nigam@stanford.edu</email>
</contrib>
</contrib-group>
<aff id="I1"><label>1</label>
LORIA – INRIA Nancy – Grand-Est, Campus Scientifique - BP 239 - 54506 Vandoeuvre-lès-Nancy Cedex, France</aff>
<aff id="I2"><label>2</label>
Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305, USA</aff>
<aff id="I3"><label>3</label>
Department of Genetics, Mail Code 5120, Stanford University, Stanford, CA, 94305, USA</aff>
<aff id="I4"><label>4</label>
Department of Biology, Carleton University, 1125 Colonel By Drive, Ottawa, ON, Canada, K1S5B6</aff>
<aff id="I5"><label>5</label>
Department of Bioengineering, 318 Campus Drive, Mail Code 5444, Stanford University, Stanford, CA, 94305, USA</aff>
<pub-date pub-type="collection"><year>2011</year>
</pub-date>
<pub-date pub-type="epub"><day>17</day>
<month>5</month>
<year>2011</year>
</pub-date>
<volume>2</volume>
<issue>Suppl 2</issue>
<supplement><named-content content-type="supplement-title">Proceedings of the Bio-Ontologies Special Interest Group Meeting 2010</named-content>
<named-content content-type="supplement-editor">Larisa Soldatova, Susanna-Assunta Sansone, Susie Stephens and Nigam H Shah</named-content>
</supplement>
<fpage>S10</fpage>
<lpage>S10</lpage>
<permissions><copyright-statement>Copyright ©2011 Coulet et al; licensee BioMed Central Ltd.</copyright-statement>
<copyright-year>2011</copyright-year>
<copyright-holder>Coulet et al; licensee BioMed Central Ltd.</copyright-holder>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/2.0"><license-p>This is an open access article distributed under the terms of the Creative Commons Attribution License (<ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/licenses/by/2.0">http://creativecommons.org/licenses/by/2.0</ext-link>
), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
</license>
</permissions>
<self-uri xlink:href="http://www.jbiomedsem.com/content/2/S2/S10"></self-uri>
<abstract><sec><title>Background</title>
<p>Advances in Natural Language Processing (NLP) techniques enable the extraction of fine-grained relationships mentioned in biomedical text. The variability and the complexity of natural language in expressing similar relationships causes the extracted relationships to be highly heterogeneous, which makes the construction of knowledge bases difficult and poses a challenge in using these for data mining or question answering.</p>
</sec>
<sec><title>Results</title>
<p>We report on the semi-automatic construction of the PHARE relationship ontology (the PHArmacogenomic RElationships Ontology) consisting of 200 curated relations from over 40,000 heterogeneous relationships extracted via text-mining. These heterogeneous relations are then mapped to the PHARE ontology using synonyms, entity descriptions and hierarchies of entities and roles. Once mapped, relationships can be normalized and compared using the structure of the ontology to identify relationships that have similar semantics but different syntax. We compare and contrast the manual procedure with a fully automated approach using WordNet to quantify the degree of integration enabled by iterative curation and refinement of the PHARE ontology. The result of such integration is a repository of normalized biomedical relationships, named PHARE-KB, which can be queried using Semantic Web technologies such as SPARQL and can be visualized in the form of a biological network.</p>
</sec>
<sec><title>Conclusions</title>
<p>The PHARE ontology serves as a common semantic framework to integrate more than 40,000 relationships pertinent to pharmacogenomics. The PHARE ontology forms the foundation of a knowledge base named PHARE-KB. Once populated with relationships, PHARE-KB (<italic>i</italic>
) can be visualized in the form of a biological network to guide human tasks such as database curation and (<italic>ii</italic>
) can be queried programmatically to guide bioinformatics applications such as the prediction of molecular interactions. PHARE is available at <ext-link ext-link-type="uri" xlink:href="http://purl.bioontology.org/ontology/PHARE">http://purl.bioontology.org/ontology/PHARE</ext-link>
.</p>
</sec>
</abstract>
<conference><conf-date>9-10 July 2010</conf-date>
<conf-name>Bio-Ontologies 2010: Semantic Applications in Life Sciences</conf-name>
<conf-loc>Boston, MA, USA</conf-loc>
</conference>
</article-meta>
</front>
</pmc>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Pmc/Curation
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000079 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Pmc/Curation/biblio.hfd -nk 000079 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Wicri/Lorraine |area= InforLorV4 |flux= Pmc |étape= Curation |type= RBID |clé= PMC:3102890 |texte= Integration and publication of heterogeneous text-mined relationships on the Semantic Web }}
Pour générer des pages wiki
HfdIndexSelect -h $EXPLOR_AREA/Data/Pmc/Curation/RBID.i -Sk "pubmed:21624156" \ | HfdSelect -Kh $EXPLOR_AREA/Data/Pmc/Curation/biblio.hfd \ | NlmPubMed2Wicri -a InforLorV4
![]() | This area was generated with Dilib version V0.6.33. | ![]() |