Serveur d'exploration Cyberinfrastructure

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

An ontology-driven semantic mash-up of gene and biological pathway information: Application to the domain of nicotine dependence

Identifieur interne : 000045 ( Ncbi/Curation ); précédent : 000044; suivant : 000046

An ontology-driven semantic mash-up of gene and biological pathway information: Application to the domain of nicotine dependence

Auteurs : Satya S. Sahoo [États-Unis] ; Olivier Bodenreider [États-Unis] ; Joni L. Rutter [États-Unis] ; Karen J. Skinner [États-Unis] ; Amit P. Sheth [États-Unis]

Source :

RBID : PMC:2766186

Abstract

Objectives

This paper illustrates how Semantic Web technologies (especially RDF, OWL, and SPARQL) can support information integration and make it easy to create semantic mashups (semantically integrated resources). In the context of understanding the genetic basis of nicotine dependence, we integrate gene and pathway information and show how three complex biological queries can be answered by the integrated knowledge base.

Methods

We use an ontology-driven approach to integrate two gene resources (Entrez Gene and HomoloGene) and three pathway resources (KEGG, Reactome and BioCyc), for five organisms, including humans. We created the Entrez Knowledge Model (EKoM), an information model in OWL for the gene resources, and integrated it with the extant BioPAX ontology designed for pathway resources. The integrated schema is populated with data from the pathway resources, publicly available in BioPAX-compatible format, and gene resources for which a population procedure was created. The SPARQL query language is used to formulate queries over the integrated knowledge base to answer the three biological queries.

Results

Simple SPARQL queries could easily identify hub genes, i.e., those genes whose gene products participate in many pathways or interact with many other gene products. The identification of the genes expressed in the brain turned out to be more difficult, due to the lack of a common identification scheme for proteins.

Conclusion

Semantic Web technologies provide a valid framework for information integration in the life sciences. Ontology-driven integration represents a flexible, sustainable and extensible solution to the integration of large volumes of information. Additional resources, which enable the creation of mappings between information sources, are required to compensate for heterogeneity across namespaces.

Resource page

http://knoesis.wright.edu/research/lifesci/integration/structured_data/JBI-2008/


Url:
DOI: 10.1016/j.jbi.2008.02.006
PubMed: 18395495
PubMed Central: 2766186

Links toward previous steps (curation, corpus...)


Links to Exploration step

PMC:2766186

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">An ontology-driven semantic mash-up of gene and biological pathway information: Application to the domain of nicotine dependence</title>
<author>
<name sortKey="Sahoo, Satya S" sort="Sahoo, Satya S" uniqKey="Sahoo S" first="Satya S." last="Sahoo">Satya S. Sahoo</name>
<affiliation wicri:level="2">
<nlm:aff id="A1">Kno.e.sis Center, Wright State University, Dayton, OH</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<placeName>
<region type="state">Ohio</region>
</placeName>
<wicri:cityArea>Kno.e.sis Center, Wright State University, Dayton</wicri:cityArea>
</affiliation>
<affiliation wicri:level="2">
<nlm:aff id="A2">LHNCBC, National Library of Medicine, Bethesda, MD</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<placeName>
<region type="state">Maryland</region>
</placeName>
<wicri:cityArea>LHNCBC, National Library of Medicine, Bethesda</wicri:cityArea>
</affiliation>
</author>
<author>
<name sortKey="Bodenreider, Olivier" sort="Bodenreider, Olivier" uniqKey="Bodenreider O" first="Olivier" last="Bodenreider">Olivier Bodenreider</name>
<affiliation wicri:level="2">
<nlm:aff id="A2">LHNCBC, National Library of Medicine, Bethesda, MD</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<placeName>
<region type="state">Maryland</region>
</placeName>
<wicri:cityArea>LHNCBC, National Library of Medicine, Bethesda</wicri:cityArea>
</affiliation>
</author>
<author>
<name sortKey="Rutter, Joni L" sort="Rutter, Joni L" uniqKey="Rutter J" first="Joni L." last="Rutter">Joni L. Rutter</name>
<affiliation wicri:level="2">
<nlm:aff id="A3">DBNBR, National Institute on Drug Abuse, Bethesda, MD</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<placeName>
<region type="state">Maryland</region>
</placeName>
<wicri:cityArea>DBNBR, National Institute on Drug Abuse, Bethesda</wicri:cityArea>
</affiliation>
</author>
<author>
<name sortKey="Skinner, Karen J" sort="Skinner, Karen J" uniqKey="Skinner K" first="Karen J." last="Skinner">Karen J. Skinner</name>
<affiliation wicri:level="2">
<nlm:aff id="A3">DBNBR, National Institute on Drug Abuse, Bethesda, MD</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<placeName>
<region type="state">Maryland</region>
</placeName>
<wicri:cityArea>DBNBR, National Institute on Drug Abuse, Bethesda</wicri:cityArea>
</affiliation>
</author>
<author>
<name sortKey="Sheth, Amit P" sort="Sheth, Amit P" uniqKey="Sheth A" first="Amit P." last="Sheth">Amit P. Sheth</name>
<affiliation wicri:level="2">
<nlm:aff id="A1">Kno.e.sis Center, Wright State University, Dayton, OH</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<placeName>
<region type="state">Ohio</region>
</placeName>
<wicri:cityArea>Kno.e.sis Center, Wright State University, Dayton</wicri:cityArea>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PMC</idno>
<idno type="pmid">18395495</idno>
<idno type="pmc">2766186</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2766186</idno>
<idno type="RBID">PMC:2766186</idno>
<idno type="doi">10.1016/j.jbi.2008.02.006</idno>
<date when="2008">2008</date>
<idno type="wicri:Area/Pmc/Corpus">000284</idno>
<idno type="wicri:Area/Pmc/Curation">000284</idno>
<idno type="wicri:Area/Pmc/Checkpoint">000679</idno>
<idno type="wicri:Area/Ncbi/Merge">000045</idno>
<idno type="wicri:Area/Ncbi/Curation">000045</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a" type="main">An ontology-driven semantic mash-up of gene and biological pathway information: Application to the domain of nicotine dependence</title>
<author>
<name sortKey="Sahoo, Satya S" sort="Sahoo, Satya S" uniqKey="Sahoo S" first="Satya S." last="Sahoo">Satya S. Sahoo</name>
<affiliation wicri:level="2">
<nlm:aff id="A1">Kno.e.sis Center, Wright State University, Dayton, OH</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<placeName>
<region type="state">Ohio</region>
</placeName>
<wicri:cityArea>Kno.e.sis Center, Wright State University, Dayton</wicri:cityArea>
</affiliation>
<affiliation wicri:level="2">
<nlm:aff id="A2">LHNCBC, National Library of Medicine, Bethesda, MD</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<placeName>
<region type="state">Maryland</region>
</placeName>
<wicri:cityArea>LHNCBC, National Library of Medicine, Bethesda</wicri:cityArea>
</affiliation>
</author>
<author>
<name sortKey="Bodenreider, Olivier" sort="Bodenreider, Olivier" uniqKey="Bodenreider O" first="Olivier" last="Bodenreider">Olivier Bodenreider</name>
<affiliation wicri:level="2">
<nlm:aff id="A2">LHNCBC, National Library of Medicine, Bethesda, MD</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<placeName>
<region type="state">Maryland</region>
</placeName>
<wicri:cityArea>LHNCBC, National Library of Medicine, Bethesda</wicri:cityArea>
</affiliation>
</author>
<author>
<name sortKey="Rutter, Joni L" sort="Rutter, Joni L" uniqKey="Rutter J" first="Joni L." last="Rutter">Joni L. Rutter</name>
<affiliation wicri:level="2">
<nlm:aff id="A3">DBNBR, National Institute on Drug Abuse, Bethesda, MD</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<placeName>
<region type="state">Maryland</region>
</placeName>
<wicri:cityArea>DBNBR, National Institute on Drug Abuse, Bethesda</wicri:cityArea>
</affiliation>
</author>
<author>
<name sortKey="Skinner, Karen J" sort="Skinner, Karen J" uniqKey="Skinner K" first="Karen J." last="Skinner">Karen J. Skinner</name>
<affiliation wicri:level="2">
<nlm:aff id="A3">DBNBR, National Institute on Drug Abuse, Bethesda, MD</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<placeName>
<region type="state">Maryland</region>
</placeName>
<wicri:cityArea>DBNBR, National Institute on Drug Abuse, Bethesda</wicri:cityArea>
</affiliation>
</author>
<author>
<name sortKey="Sheth, Amit P" sort="Sheth, Amit P" uniqKey="Sheth A" first="Amit P." last="Sheth">Amit P. Sheth</name>
<affiliation wicri:level="2">
<nlm:aff id="A1">Kno.e.sis Center, Wright State University, Dayton, OH</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<placeName>
<region type="state">Ohio</region>
</placeName>
<wicri:cityArea>Kno.e.sis Center, Wright State University, Dayton</wicri:cityArea>
</affiliation>
</author>
</analytic>
<series>
<title level="j">Journal of biomedical informatics</title>
<idno type="ISSN">1532-0464</idno>
<idno type="eISSN">1532-0480</idno>
<imprint>
<date when="2008">2008</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">
<sec id="S1">
<title>Objectives</title>
<p id="P1">This paper illustrates how Semantic Web technologies (especially RDF, OWL, and SPARQL) can support information integration and make it easy to create semantic mashups (semantically integrated resources). In the context of understanding the genetic basis of nicotine dependence, we integrate gene and pathway information and show how three complex biological queries can be answered by the integrated knowledge base.</p>
</sec>
<sec sec-type="methods" id="S2">
<title>Methods</title>
<p id="P2">We use an ontology-driven approach to integrate two gene resources (Entrez Gene and HomoloGene) and three pathway resources (KEGG, Reactome and BioCyc), for five organisms, including humans. We created the Entrez Knowledge Model (EKoM), an information model in OWL for the gene resources, and integrated it with the extant BioPAX ontology designed for pathway resources. The integrated schema is populated with data from the pathway resources, publicly available in BioPAX-compatible format, and gene resources for which a population procedure was created. The SPARQL query language is used to formulate queries over the integrated knowledge base to answer the three biological queries.</p>
</sec>
<sec id="S3">
<title>Results</title>
<p id="P3">Simple SPARQL queries could easily identify hub genes, i.e., those genes whose gene products participate in many pathways or interact with many other gene products. The identification of the genes expressed in the brain turned out to be more difficult, due to the lack of a common identification scheme for proteins.</p>
</sec>
<sec id="S4">
<title>Conclusion</title>
<p id="P4">Semantic Web technologies provide a valid framework for information integration in the life sciences. Ontology-driven integration represents a flexible, sustainable and extensible solution to the integration of large volumes of information. Additional resources, which enable the creation of mappings between information sources, are required to compensate for heterogeneity across namespaces.</p>
</sec>
<sec id="S5">
<title>Resource page</title>
<p id="P5">
<ext-link ext-link-type="uri" xlink:href="http://knoesis.wright.edu/research/lifesci/integration/structured_data/JBI-2008/">http://knoesis.wright.edu/research/lifesci/integration/structured_data/JBI-2008/</ext-link>
</p>
</sec>
</div>
</front>
</TEI>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/CyberinfraV1/Data/Ncbi/Curation
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000045 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Ncbi/Curation/biblio.hfd -nk 000045 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    CyberinfraV1
   |flux=    Ncbi
   |étape=   Curation
   |type=    RBID
   |clé=     PMC:2766186
   |texte=   An ontology-driven semantic mash-up of gene and biological pathway information: Application to the domain of nicotine dependence
}}

Pour générer des pages wiki

HfdIndexSelect -h $EXPLOR_AREA/Data/Ncbi/Curation/RBID.i   -Sk "pubmed:18395495" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Ncbi/Curation/biblio.hfd   \
       | NlmPubMed2Wicri -a CyberinfraV1 

Wicri

This area was generated with Dilib version V0.6.25.
Data generation: Thu Oct 27 09:30:58 2016. Site generation: Sun Mar 10 23:08:40 2024