InforLorV4, Pmc, Curation, bibRecord, 000079

Integration and publication of heterogeneous text-mined relationships on the Semantic Web

Identifieur interne : 000079 ( Pmc/Curation ); précédent : 000078; suivant : 000080

Integration and publication of heterogeneous text-mined relationships on the Semantic Web

Auteurs : Adrien Coulet [France, États-Unis] ; Yael Garten [États-Unis] ; Michel Dumontier ; Russ B. Altman [États-Unis] ; Mark A. Musen [États-Unis] ; Nigam H. Shah [États-Unis]

Source :

Journal of Biomedical Semantics [ 2041-1480 ] ; 2011.

RBID : PMC:3102890

Abstract

Background

Advances in Natural Language Processing (NLP) techniques enable the extraction of fine-grained relationships mentioned in biomedical text. The variability and the complexity of natural language in expressing similar relationships causes the extracted relationships to be highly heterogeneous, which makes the construction of knowledge bases difficult and poses a challenge in using these for data mining or question answering.

Results

We report on the semi-automatic construction of the PHARE relationship ontology (the PHArmacogenomic RElationships Ontology) consisting of 200 curated relations from over 40,000 heterogeneous relationships extracted via text-mining. These heterogeneous relations are then mapped to the PHARE ontology using synonyms, entity descriptions and hierarchies of entities and roles. Once mapped, relationships can be normalized and compared using the structure of the ontology to identify relationships that have similar semantics but different syntax. We compare and contrast the manual procedure with a fully automated approach using WordNet to quantify the degree of integration enabled by iterative curation and refinement of the PHARE ontology. The result of such integration is a repository of normalized biomedical relationships, named PHARE-KB, which can be queried using Semantic Web technologies such as SPARQL and can be visualized in the form of a biological network.

Conclusions

The PHARE ontology serves as a common semantic framework to integrate more than 40,000 relationships pertinent to pharmacogenomics. The PHARE ontology forms the foundation of a knowledge base named PHARE-KB. Once populated with relationships, PHARE-KB (i) can be visualized in the form of a biological network to guide human tasks such as database curation and (ii) can be queried programmatically to guide bioinformatics applications such as the prediction of molecular interactions. PHARE is available at http://purl.bioontology.org/ontology/PHARE.

Url:

http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3102890

DOI: 10.1186/2041-1480-2-S2-S10
PubMed: 21624156
PubMed Central: 3102890

Links toward previous steps (curation, corpus...)

to stream Pmc, to step Corpus: Pour aller vers cette notice dans l'étape Curation :000079

Links to Exploration step

PMC:3102890

Curation

No country items

Michel Dumontier

<affiliation><nlm:aff id="I4">Department of Biology, Carleton University, 1125 Colonel By Drive, Ottawa, ON, Canada, K1S5B6</nlm:aff>
<wicri:noCountry code="subfield">K1S5B6</wicri:noCountry>
</affiliation>

Le document en format XML

<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en">Integration and publication of heterogeneous text-mined relationships on the Semantic Web</title>
<author><name sortKey="Coulet, Adrien" sort="Coulet, Adrien" uniqKey="Coulet A" first="Adrien" last="Coulet">Adrien Coulet</name>
<affiliation wicri:level="1"><nlm:aff id="I1">LORIA – INRIA Nancy – Grand-Est, Campus Scientifique - BP 239 - 54506 Vandoeuvre-lès-Nancy Cedex, France</nlm:aff>
<country xml:lang="fr">France</country>
<wicri:regionArea>LORIA – INRIA Nancy – Grand-Est, Campus Scientifique - BP 239 - 54506 Vandoeuvre-lès-Nancy Cedex</wicri:regionArea>
</affiliation>
<affiliation wicri:level="1"><nlm:aff id="I2">Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305</wicri:regionArea>
</affiliation>
<affiliation wicri:level="1"><nlm:aff id="I3">Department of Genetics, Mail Code 5120, Stanford University, Stanford, CA, 94305, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Genetics, Mail Code 5120, Stanford University, Stanford, CA, 94305</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Garten, Yael" sort="Garten, Yael" uniqKey="Garten Y" first="Yael" last="Garten">Yael Garten</name>
<affiliation wicri:level="1"><nlm:aff id="I2">Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305</wicri:regionArea>
</affiliation>
<affiliation wicri:level="1"><nlm:aff id="I3">Department of Genetics, Mail Code 5120, Stanford University, Stanford, CA, 94305, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Genetics, Mail Code 5120, Stanford University, Stanford, CA, 94305</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Dumontier, Michel" sort="Dumontier, Michel" uniqKey="Dumontier M" first="Michel" last="Dumontier">Michel Dumontier</name>
<affiliation><nlm:aff id="I4">Department of Biology, Carleton University, 1125 Colonel By Drive, Ottawa, ON, Canada, K1S5B6</nlm:aff>
<wicri:noCountry code="subfield">K1S5B6</wicri:noCountry>
</affiliation>
</author>
<author><name sortKey="Altman, Russ B" sort="Altman, Russ B" uniqKey="Altman R" first="Russ B" last="Altman">Russ B. Altman</name>
<affiliation wicri:level="1"><nlm:aff id="I2">Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305</wicri:regionArea>
</affiliation>
<affiliation wicri:level="1"><nlm:aff id="I3">Department of Genetics, Mail Code 5120, Stanford University, Stanford, CA, 94305, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Genetics, Mail Code 5120, Stanford University, Stanford, CA, 94305</wicri:regionArea>
</affiliation>
<affiliation wicri:level="1"><nlm:aff id="I5">Department of Bioengineering, 318 Campus Drive, Mail Code 5444, Stanford University, Stanford, CA, 94305, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Bioengineering, 318 Campus Drive, Mail Code 5444, Stanford University, Stanford, CA, 94305</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Musen, Mark A" sort="Musen, Mark A" uniqKey="Musen M" first="Mark A" last="Musen">Mark A. Musen</name>
<affiliation wicri:level="1"><nlm:aff id="I2">Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Shah, Nigam H" sort="Shah, Nigam H" uniqKey="Shah N" first="Nigam H" last="Shah">Nigam H. Shah</name>
<affiliation wicri:level="1"><nlm:aff id="I2">Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305</wicri:regionArea>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">PMC</idno>
<idno type="pmid">21624156</idno>
<idno type="pmc">3102890</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3102890</idno>
<idno type="RBID">PMC:3102890</idno>
<idno type="doi">10.1186/2041-1480-2-S2-S10</idno>
<date when="2011">2011</date>
<idno type="wicri:Area/Pmc/Corpus">000079</idno>
<idno type="wicri:explorRef" wicri:stream="Pmc" wicri:step="Corpus" wicri:corpus="PMC">000079</idno>
<idno type="wicri:Area/Pmc/Curation">000079</idno>
<idno type="wicri:explorRef" wicri:stream="Pmc" wicri:step="Curation">000079</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a" type="main">Integration and publication of heterogeneous text-mined relationships on the Semantic Web</title>
<author><name sortKey="Coulet, Adrien" sort="Coulet, Adrien" uniqKey="Coulet A" first="Adrien" last="Coulet">Adrien Coulet</name>
<affiliation wicri:level="1"><nlm:aff id="I1">LORIA – INRIA Nancy – Grand-Est, Campus Scientifique - BP 239 - 54506 Vandoeuvre-lès-Nancy Cedex, France</nlm:aff>
<country xml:lang="fr">France</country>
<wicri:regionArea>LORIA – INRIA Nancy – Grand-Est, Campus Scientifique - BP 239 - 54506 Vandoeuvre-lès-Nancy Cedex</wicri:regionArea>
</affiliation>
<affiliation wicri:level="1"><nlm:aff id="I2">Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305</wicri:regionArea>
</affiliation>
<affiliation wicri:level="1"><nlm:aff id="I3">Department of Genetics, Mail Code 5120, Stanford University, Stanford, CA, 94305, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Genetics, Mail Code 5120, Stanford University, Stanford, CA, 94305</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Garten, Yael" sort="Garten, Yael" uniqKey="Garten Y" first="Yael" last="Garten">Yael Garten</name>
<affiliation wicri:level="1"><nlm:aff id="I2">Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305</wicri:regionArea>
</affiliation>
<affiliation wicri:level="1"><nlm:aff id="I3">Department of Genetics, Mail Code 5120, Stanford University, Stanford, CA, 94305, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Genetics, Mail Code 5120, Stanford University, Stanford, CA, 94305</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Dumontier, Michel" sort="Dumontier, Michel" uniqKey="Dumontier M" first="Michel" last="Dumontier">Michel Dumontier</name>
<affiliation><nlm:aff id="I4">Department of Biology, Carleton University, 1125 Colonel By Drive, Ottawa, ON, Canada, K1S5B6</nlm:aff>
<wicri:noCountry code="subfield">K1S5B6</wicri:noCountry>
</affiliation>
</author>
<author><name sortKey="Altman, Russ B" sort="Altman, Russ B" uniqKey="Altman R" first="Russ B" last="Altman">Russ B. Altman</name>
<affiliation wicri:level="1"><nlm:aff id="I2">Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305</wicri:regionArea>
</affiliation>
<affiliation wicri:level="1"><nlm:aff id="I3">Department of Genetics, Mail Code 5120, Stanford University, Stanford, CA, 94305, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Genetics, Mail Code 5120, Stanford University, Stanford, CA, 94305</wicri:regionArea>
</affiliation>
<affiliation wicri:level="1"><nlm:aff id="I5">Department of Bioengineering, 318 Campus Drive, Mail Code 5444, Stanford University, Stanford, CA, 94305, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Bioengineering, 318 Campus Drive, Mail Code 5444, Stanford University, Stanford, CA, 94305</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Musen, Mark A" sort="Musen, Mark A" uniqKey="Musen M" first="Mark A" last="Musen">Mark A. Musen</name>
<affiliation wicri:level="1"><nlm:aff id="I2">Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Shah, Nigam H" sort="Shah, Nigam H" uniqKey="Shah N" first="Nigam H" last="Shah">Nigam H. Shah</name>
<affiliation wicri:level="1"><nlm:aff id="I2">Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305</wicri:regionArea>
</affiliation>
</author>
</analytic>
<series><title level="j">Journal of Biomedical Semantics</title>
<idno type="eISSN">2041-1480</idno>
<imprint><date when="2011">2011</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass></textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en"><sec><title>Background</title>
<p>Advances in Natural Language Processing (NLP) techniques enable the extraction of fine-grained relationships mentioned in biomedical text. The variability and the complexity of natural language in expressing similar relationships causes the extracted relationships to be highly heterogeneous, which makes the construction of knowledge bases difficult and poses a challenge in using these for data mining or question answering.</p>
</sec>
<sec><title>Results</title>
<p>We report on the semi-automatic construction of the PHARE relationship ontology (the PHArmacogenomic RElationships Ontology) consisting of 200 curated relations from over 40,000 heterogeneous relationships extracted via text-mining. These heterogeneous relations are then mapped to the PHARE ontology using synonyms, entity descriptions and hierarchies of entities and roles. Once mapped, relationships can be normalized and compared using the structure of the ontology to identify relationships that have similar semantics but different syntax. We compare and contrast the manual procedure with a fully automated approach using WordNet to quantify the degree of integration enabled by iterative curation and refinement of the PHARE ontology. The result of such integration is a repository of normalized biomedical relationships, named PHARE-KB, which can be queried using Semantic Web technologies such as SPARQL and can be visualized in the form of a biological network.</p>
</sec>
<sec><title>Conclusions</title>
<p>The PHARE ontology serves as a common semantic framework to integrate more than 40,000 relationships pertinent to pharmacogenomics. The PHARE ontology forms the foundation of a knowledge base named PHARE-KB. Once populated with relationships, PHARE-KB (<italic>i</italic>
) can be visualized in the form of a biological network to guide human tasks such as database curation and (<italic>ii</italic>
) can be queried programmatically to guide bioinformatics applications such as the prediction of molecular interactions. PHARE is available at <ext-link ext-link-type="uri" xlink:href="http://purl.bioontology.org/ontology/PHARE">http://purl.bioontology.org/ontology/PHARE</ext-link>
.</p>
</sec>
</div>
</front>
<back><div1 type="bibliography"><listBibl><biblStruct><analytic><author><name sortKey="Groth, P" uniqKey="Groth P">P Groth</name>
</author>
<author><name sortKey="Gibson, A" uniqKey="Gibson A">A Gibson</name>
</author>
<author><name sortKey="Velterop, J" uniqKey="Velterop J">J Velterop</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Klein, T" uniqKey="Klein T">T Klein</name>
</author>
<author><name sortKey="Chang, J" uniqKey="Chang J">J Chang</name>
</author>
<author><name sortKey="Cho, M" uniqKey="Cho M">M Cho</name>
</author>
<author><name sortKey="Easton, K" uniqKey="Easton K">K Easton</name>
</author>
<author><name sortKey="Fergerson, K" uniqKey="Fergerson K">K Fergerson</name>
</author>
<author><name sortKey="Hewett, M" uniqKey="Hewett M">M Hewett</name>
</author>
<author><name sortKey="Lin, Z" uniqKey="Lin Z">Z Lin</name>
</author>
<author><name sortKey="Liu, Y" uniqKey="Liu Y">Y Liu</name>
</author>
<author><name sortKey="Liu, S" uniqKey="Liu S">S Liu</name>
</author>
<author><name sortKey="Oliver, D" uniqKey="Oliver D">D Oliver</name>
</author>
<author><name sortKey="Rubin, D" uniqKey="Rubin D">D Rubin</name>
</author>
<author><name sortKey="Shafa, F" uniqKey="Shafa F">F Shafa</name>
</author>
<author><name sortKey="Stuart, J" uniqKey="Stuart J">J Stuart</name>
</author>
<author><name sortKey="Altman, Rb" uniqKey="Altman R">RB Altman</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Garten, Y" uniqKey="Garten Y">Y Garten</name>
</author>
<author><name sortKey="Coulet, A" uniqKey="Coulet A">A Coulet</name>
</author>
<author><name sortKey="Altman, R" uniqKey="Altman R">R Altman</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Hunter, L" uniqKey="Hunter L">L Hunter</name>
</author>
<author><name sortKey="Lu, Z" uniqKey="Lu Z">Z Lu</name>
</author>
<author><name sortKey="Firby, J" uniqKey="Firby J">J Firby</name>
</author>
<author><name sortKey="Baumgartner, Wa" uniqKey="Baumgartner W">WA Baumgartner</name>
</author>
<author><name sortKey="Johnson, Hl" uniqKey="Johnson H">HL Johnson</name>
</author>
<author><name sortKey="Ogren, P" uniqKey="Ogren P">P Ogren</name>
</author>
<author><name sortKey="Cohen, K" uniqKey="Cohen K">K Cohen</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Friedman, C" uniqKey="Friedman C">C Friedman</name>
</author>
<author><name sortKey="Kra, P" uniqKey="Kra P">P Kra</name>
</author>
<author><name sortKey="Yu, H" uniqKey="Yu H">H Yu</name>
</author>
<author><name sortKey="Krauthammer, M" uniqKey="Krauthammer M">M Krauthammer</name>
</author>
<author><name sortKey="Rzhetsky, A" uniqKey="Rzhetsky A">A Rzhetsky</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Saric, J" uniqKey="Saric J">J Saric</name>
</author>
<author><name sortKey="Jensen, Lj" uniqKey="Jensen L">LJ Jensen</name>
</author>
<author><name sortKey="Ouzounova, R" uniqKey="Ouzounova R">R Ouzounova</name>
</author>
<author><name sortKey="Rojas, I" uniqKey="Rojas I">I Rojas</name>
</author>
<author><name sortKey="Bork, P" uniqKey="Bork P">P Bork</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Ciaramita, M" uniqKey="Ciaramita M">M Ciaramita</name>
</author>
<author><name sortKey="Gangemi, A" uniqKey="Gangemi A">A Gangemi</name>
</author>
<author><name sortKey="Ratsch, E" uniqKey="Ratsch E">E Ratsch</name>
</author>
<author><name sortKey="Saric, J" uniqKey="Saric J">J Saric</name>
</author>
<author><name sortKey="Rojas, I" uniqKey="Rojas I">I Rojas</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Ramakrishnan, C" uniqKey="Ramakrishnan C">C Ramakrishnan</name>
</author>
<author><name sortKey="Mendes, P" uniqKey="Mendes P">P Mendes</name>
</author>
<author><name sortKey="Wang, S" uniqKey="Wang S">S Wang</name>
</author>
<author><name sortKey="Sheth, A" uniqKey="Sheth A">A Sheth</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Tari, L" uniqKey="Tari L">L Tari</name>
</author>
<author><name sortKey="Answar, S" uniqKey="Answar S">S Answar</name>
</author>
<author><name sortKey="Liang, S" uniqKey="Liang S">S Liang</name>
</author>
<author><name sortKey="Cai, J" uniqKey="Cai J">J Cai</name>
</author>
<author><name sortKey="Baral, C" uniqKey="Baral C">C Baral</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Manning, Cd" uniqKey="Manning C">CD Manning</name>
</author>
<author><name sortKey="Schutze, H" uniqKey="Schutze H">H Schütze</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Coulet, A" uniqKey="Coulet A">A Coulet</name>
</author>
<author><name sortKey="Shah, Nh" uniqKey="Shah N">NH Shah</name>
</author>
<author><name sortKey="Garten, Y" uniqKey="Garten Y">Y Garten</name>
</author>
<author><name sortKey="Musen, Ma" uniqKey="Musen M">MA Musen</name>
</author>
<author><name sortKey="Altman, Rb" uniqKey="Altman R">RB Altman</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Agichtein, E" uniqKey="Agichtein E">E Agichtein</name>
</author>
<author><name sortKey="Gravano, L" uniqKey="Gravano L">L Gravano</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Xu, R" uniqKey="Xu R">R Xu</name>
</author>
<author><name sortKey="Supekar, K" uniqKey="Supekar K">K Supekar</name>
</author>
<author><name sortKey="Morgan, A" uniqKey="Morgan A">A Morgan</name>
</author>
<author><name sortKey="Das, A" uniqKey="Das A">A Das</name>
</author>
<author><name sortKey="Garber, A" uniqKey="Garber A">A Garber</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct><analytic><author><name sortKey="De Marneffe, Mc" uniqKey="De Marneffe M">MC de Marneffe</name>
</author>
<author><name sortKey="Manning, Cd" uniqKey="Manning C">CD Manning</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Feebaum, C" uniqKey="Feebaum C">C Feebaum</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct><analytic><author><name sortKey="Smith, B" uniqKey="Smith B">B Smith</name>
</author>
<author><name sortKey="Ceusters, W" uniqKey="Ceusters W">W Ceusters</name>
</author>
<author><name sortKey="Klagges, B" uniqKey="Klagges B">B Klagges</name>
</author>
<author><name sortKey="Kohler, J" uniqKey="Kohler J">J Köhler</name>
</author>
<author><name sortKey="Kumar, A" uniqKey="Kumar A">A Kumar</name>
</author>
<author><name sortKey="Lomax, J" uniqKey="Lomax J">J Lomax</name>
</author>
<author><name sortKey="Mungall, C" uniqKey="Mungall C">C Mungall</name>
</author>
<author><name sortKey="Neuhaus, F" uniqKey="Neuhaus F">F Neuhaus</name>
</author>
<author><name sortKey="Rector, Al" uniqKey="Rector A">AL Rector</name>
</author>
<author><name sortKey="Rosse, C" uniqKey="Rosse C">C Rosse</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct><analytic><author><name sortKey="Ciccarese, P" uniqKey="Ciccarese P">P Ciccarese</name>
</author>
<author><name sortKey="Ocana, M" uniqKey="Ocana M">M Ocana</name>
</author>
<author><name sortKey="Castro, Ljg" uniqKey="Castro L">LJG Castro</name>
</author>
<author><name sortKey="Das, S" uniqKey="Das S">S Das</name>
</author>
<author><name sortKey="Clark, T" uniqKey="Clark T">T Clark</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct><analytic><author><name sortKey="Garten, Y" uniqKey="Garten Y">Y Garten</name>
</author>
</analytic>
</biblStruct>
</listBibl>
</div1>
</back>
</TEI>
<pmc article-type="research-article"><pmc-dir>properties open_access</pmc-dir>
  <front><journal-meta><journal-id journal-id-type="nlm-ta">J Biomed Semantics</journal-id>
<journal-title-group><journal-title>Journal of Biomedical Semantics</journal-title>
</journal-title-group>
<issn pub-type="epub">2041-1480</issn>
<publisher><publisher-name>BioMed Central</publisher-name>
</publisher>
</journal-meta>
<article-meta><article-id pub-id-type="pmid">21624156</article-id>
<article-id pub-id-type="pmc">3102890</article-id>
<article-id pub-id-type="publisher-id">2041-1480-2-S2-S10</article-id>
<article-id pub-id-type="doi">10.1186/2041-1480-2-S2-S10</article-id>
<article-categories><subj-group subj-group-type="heading"><subject>Proceedings</subject>
</subj-group>
</article-categories>
<title-group><article-title>Integration and publication of heterogeneous text-mined relationships on the Semantic Web</article-title>
</title-group>
<contrib-group><contrib contrib-type="author" corresp="yes" id="A1"><name><surname>Coulet</surname>
<given-names>Adrien</given-names>
</name>
<xref ref-type="aff" rid="I1">1</xref>
<xref ref-type="aff" rid="I2">2</xref>
<xref ref-type="aff" rid="I3">3</xref>
<email>adrien.coulet@loria.fr</email>
</contrib>
<contrib contrib-type="author" id="A2"><name><surname>Garten</surname>
<given-names>Yael</given-names>
</name>
<xref ref-type="aff" rid="I2">2</xref>
<xref ref-type="aff" rid="I3">3</xref>
<email>ygarten@stanford.edu</email>
</contrib>
<contrib contrib-type="author" id="A3"><name><surname>Dumontier</surname>
<given-names>Michel</given-names>
</name>
<xref ref-type="aff" rid="I4">4</xref>
<email>michel_dumontier@carlton.ca</email>
</contrib>
<contrib contrib-type="author" id="A4"><name><surname>Altman</surname>
<given-names>Russ B</given-names>
</name>
<xref ref-type="aff" rid="I2">2</xref>
<xref ref-type="aff" rid="I3">3</xref>
<xref ref-type="aff" rid="I5">5</xref>
<email>russ.altman@stanford.edu</email>
</contrib>
<contrib contrib-type="author" id="A5"><name><surname>Musen</surname>
<given-names>Mark A</given-names>
</name>
<xref ref-type="aff" rid="I2">2</xref>
<email>musen@stanford.edu</email>
</contrib>
<contrib contrib-type="author" id="A6"><name><surname>Shah</surname>
<given-names>Nigam H</given-names>
</name>
<xref ref-type="aff" rid="I2">2</xref>
<email>nigam@stanford.edu</email>
</contrib>
</contrib-group>
<aff id="I1"><label>1</label>
LORIA – INRIA Nancy – Grand-Est, Campus Scientifique - BP 239 - 54506 Vandoeuvre-lès-Nancy Cedex, France</aff>
<aff id="I2"><label>2</label>
Department of Medicine, 300 Pasteur Drive, Mail Code 5110, Stanford University, Stanford, CA, 94305, USA</aff>
<aff id="I3"><label>3</label>
Department of Genetics, Mail Code 5120, Stanford University, Stanford, CA, 94305, USA</aff>
<aff id="I4"><label>4</label>
Department of Biology, Carleton University, 1125 Colonel By Drive, Ottawa, ON, Canada, K1S5B6</aff>
<aff id="I5"><label>5</label>
Department of Bioengineering, 318 Campus Drive, Mail Code 5444, Stanford University, Stanford, CA, 94305, USA</aff>
<pub-date pub-type="collection"><year>2011</year>
</pub-date>
<pub-date pub-type="epub"><day>17</day>
<month>5</month>
<year>2011</year>
</pub-date>
<volume>2</volume>
<issue>Suppl 2</issue>
<supplement><named-content content-type="supplement-title">Proceedings of the Bio-Ontologies Special Interest Group Meeting 2010</named-content>
<named-content content-type="supplement-editor">Larisa Soldatova, Susanna-Assunta Sansone, Susie Stephens and Nigam H Shah</named-content>
</supplement>
<fpage>S10</fpage>
<lpage>S10</lpage>
<permissions><copyright-statement>Copyright ©2011 Coulet et al; licensee BioMed Central Ltd.</copyright-statement>
<copyright-year>2011</copyright-year>
<copyright-holder>Coulet et al; licensee BioMed Central Ltd.</copyright-holder>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/2.0"><license-p>This is an open access article distributed under the terms of the Creative Commons Attribution License (<ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/licenses/by/2.0">http://creativecommons.org/licenses/by/2.0</ext-link>
), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
</license>
</permissions>
<self-uri xlink:href="http://www.jbiomedsem.com/content/2/S2/S10"></self-uri>
<abstract><sec><title>Background</title>
<p>Advances in Natural Language Processing (NLP) techniques enable the extraction of fine-grained relationships mentioned in biomedical text. The variability and the complexity of natural language in expressing similar relationships causes the extracted relationships to be highly heterogeneous, which makes the construction of knowledge bases difficult and poses a challenge in using these for data mining or question answering.</p>
</sec>
<sec><title>Results</title>
<p>We report on the semi-automatic construction of the PHARE relationship ontology (the PHArmacogenomic RElationships Ontology) consisting of 200 curated relations from over 40,000 heterogeneous relationships extracted via text-mining. These heterogeneous relations are then mapped to the PHARE ontology using synonyms, entity descriptions and hierarchies of entities and roles. Once mapped, relationships can be normalized and compared using the structure of the ontology to identify relationships that have similar semantics but different syntax. We compare and contrast the manual procedure with a fully automated approach using WordNet to quantify the degree of integration enabled by iterative curation and refinement of the PHARE ontology. The result of such integration is a repository of normalized biomedical relationships, named PHARE-KB, which can be queried using Semantic Web technologies such as SPARQL and can be visualized in the form of a biological network.</p>
</sec>
<sec><title>Conclusions</title>
<p>The PHARE ontology serves as a common semantic framework to integrate more than 40,000 relationships pertinent to pharmacogenomics. The PHARE ontology forms the foundation of a knowledge base named PHARE-KB. Once populated with relationships, PHARE-KB (<italic>i</italic>
) can be visualized in the form of a biological network to guide human tasks such as database curation and (<italic>ii</italic>
) can be queried programmatically to guide bioinformatics applications such as the prediction of molecular interactions. PHARE is available at <ext-link ext-link-type="uri" xlink:href="http://purl.bioontology.org/ontology/PHARE">http://purl.bioontology.org/ontology/PHARE</ext-link>
.</p>
</sec>
</abstract>
<conference><conf-date>9-10 July 2010</conf-date>
<conf-name>Bio-Ontologies 2010: Semantic Applications in Life Sciences</conf-name>
<conf-loc>Boston, MA, USA</conf-loc>
</conference>
</article-meta>
</front>
</pmc>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Pmc/Curation

HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000079 | SxmlIndent | more

HfdSelect -h $EXPLOR_AREA/Data/Pmc/Curation/biblio.hfd -nk 000079 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Lorraine
   |area=    InforLorV4
   |flux=    Pmc
   |étape=   Curation
   |type=    RBID
   |clé=     PMC:3102890
   |texte=   Integration and publication of heterogeneous text-mined relationships on the Semantic Web
}}

Pour générer des pages wiki

HfdIndexSelect -h $EXPLOR_AREA/Data/Pmc/Curation/RBID.i   -Sk "pubmed:21624156" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Pmc/Curation/biblio.hfd   \
       | NlmPubMed2Wicri -a InforLorV4

This area was generated with Dilib version V0.6.33.
Data generation: Mon Jun 10 21:56:28 2019. Site generation: Fri Feb 25 15:29:27 2022

	Serveur d'exploration sur la recherche en informatique en Lorraine
	Attention, ce site est en cours de développement ! Attention, site généré par des moyens informatiques à partir de corpus bruts. Les informations ne sont donc pas validées.

Serveur d'exploration sur la recherche en informatique en Lorraine

Integration and publication of heterogeneous text-mined relationships on the Semantic Web

Integration and publication of heterogeneous text-mined relationships on the Semantic Web

Source :

Abstract

Links toward previous steps (curation, corpus...)

Links to Exploration step

Curation

No country items

Le document en format XML

Pour manipuler ce document sous Unix (Dilib)

Pour mettre un lien sur cette page dans le réseau Wicri

Pour générer des pages wiki