Serveur d'exploration Cyberinfrastructure

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Structuring and extracting knowledge for the support of hypothesis generation in molecular biology

Identifieur interne : 000B87 ( Main/Merge ); précédent : 000B86; suivant : 000B88

Structuring and extracting knowledge for the support of hypothesis generation in molecular biology

Auteurs : Marco Roos [Pays-Bas] ; M Scott Marshall [Pays-Bas] ; Andrew P. Gibson [Pays-Bas] ; Martijn Schuemie [Pays-Bas] ; Edgar Meij [Pays-Bas] ; Sophia Katrenko [Pays-Bas] ; Willem Robert Van Hage [Pays-Bas] ; Konstantinos Krommydas [Pays-Bas] ; Pieter W. Adriaans [Pays-Bas]

Source :

RBID : PMC:2755830

Abstract

Background

Hypothesis generation in molecular and cellular biology is an empirical process in which knowledge derived from prior experiments is distilled into a comprehensible model. The requirement of automated support is exemplified by the difficulty of considering all relevant facts that are contained in the millions of documents available from PubMed. Semantic Web provides tools for sharing prior knowledge, while information retrieval and information extraction techniques enable its extraction from literature. Their combination makes prior knowledge available for computational analysis and inference. While some tools provide complete solutions that limit the control over the modeling and extraction processes, we seek a methodology that supports control by the experimenter over these critical processes.

Results

We describe progress towards automated support for the generation of biomolecular hypotheses. Semantic Web technologies are used to structure and store knowledge, while a workflow extracts knowledge from text. We designed minimal proto-ontologies in OWL for capturing different aspects of a text mining experiment: the biological hypothesis, text and documents, text mining, and workflow provenance. The models fit a methodology that allows focus on the requirements of a single experiment while supporting reuse and posterior analysis of extracted knowledge from multiple experiments. Our workflow is composed of services from the 'Adaptive Information Disclosure Application' (AIDA) toolkit as well as a few others. The output is a semantic model with putative biological relations, with each relation linked to the corresponding evidence.

Conclusion

We demonstrated a 'do-it-yourself' approach for structuring and extracting knowledge in the context of experimental research on biomolecular mechanisms. The methodology can be used to bootstrap the construction of semantically rich biological models using the results of knowledge extraction processes. Models specific to particular experiments can be constructed that, in turn, link with other semantic models, creating a web of knowledge that spans experiments. Mapping mechanisms can link to other knowledge resources such as OBO ontologies or SKOS vocabularies. AIDA Web Services can be used to design personalized knowledge extraction procedures. In our example experiment, we found three proteins (NF-Kappa B, p21, and Bax) potentially playing a role in the interplay between nutrients and epigenetic gene regulation.


Url:
DOI: 10.1186/1471-2105-10-S10-S9
PubMed: 19796406
PubMed Central: 2755830

Links toward previous steps (curation, corpus...)


Links to Exploration step

PMC:2755830

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Structuring and extracting knowledge for the support of hypothesis generation in molecular biology</title>
<author>
<name sortKey="Roos, Marco" sort="Roos, Marco" uniqKey="Roos M" first="Marco" last="Roos">Marco Roos</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">Informatics Institute, University of Amsterdam, Amsterdam, 1098 SJ, The Netherlands</nlm:aff>
<country xml:lang="fr">Pays-Bas</country>
<wicri:regionArea>Informatics Institute, University of Amsterdam, Amsterdam, 1098 SJ</wicri:regionArea>
<wicri:noRegion>1098 SJ</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Marshall, M Scott" sort="Marshall, M Scott" uniqKey="Marshall M" first="M Scott" last="Marshall">M Scott Marshall</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">Informatics Institute, University of Amsterdam, Amsterdam, 1098 SJ, The Netherlands</nlm:aff>
<country xml:lang="fr">Pays-Bas</country>
<wicri:regionArea>Informatics Institute, University of Amsterdam, Amsterdam, 1098 SJ</wicri:regionArea>
<wicri:noRegion>1098 SJ</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Gibson, Andrew P" sort="Gibson, Andrew P" uniqKey="Gibson A" first="Andrew P" last="Gibson">Andrew P. Gibson</name>
<affiliation wicri:level="1">
<nlm:aff id="I2">Swammerdam Institute for Life Science, University of Amsterdam, Amsterdam, 1018 WB, The Netherlands</nlm:aff>
<country xml:lang="fr">Pays-Bas</country>
<wicri:regionArea>Swammerdam Institute for Life Science, University of Amsterdam, Amsterdam, 1018 WB</wicri:regionArea>
<wicri:noRegion>1018 WB</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Schuemie, Martijn" sort="Schuemie, Martijn" uniqKey="Schuemie M" first="Martijn" last="Schuemie">Martijn Schuemie</name>
<affiliation wicri:level="1">
<nlm:aff id="I3">BioSemantics group, Erasmus University of Rotterdam, Rotterdam, 3000 DR, The Netherlands</nlm:aff>
<country xml:lang="fr">Pays-Bas</country>
<wicri:regionArea>BioSemantics group, Erasmus University of Rotterdam, Rotterdam, 3000 DR</wicri:regionArea>
<wicri:noRegion>3000 DR</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Meij, Edgar" sort="Meij, Edgar" uniqKey="Meij E" first="Edgar" last="Meij">Edgar Meij</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">Informatics Institute, University of Amsterdam, Amsterdam, 1098 SJ, The Netherlands</nlm:aff>
<country xml:lang="fr">Pays-Bas</country>
<wicri:regionArea>Informatics Institute, University of Amsterdam, Amsterdam, 1098 SJ</wicri:regionArea>
<wicri:noRegion>1098 SJ</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Katrenko, Sophia" sort="Katrenko, Sophia" uniqKey="Katrenko S" first="Sophia" last="Katrenko">Sophia Katrenko</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">Informatics Institute, University of Amsterdam, Amsterdam, 1098 SJ, The Netherlands</nlm:aff>
<country xml:lang="fr">Pays-Bas</country>
<wicri:regionArea>Informatics Institute, University of Amsterdam, Amsterdam, 1098 SJ</wicri:regionArea>
<wicri:noRegion>1098 SJ</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Van Hage, Willem Robert" sort="Van Hage, Willem Robert" uniqKey="Van Hage W" first="Willem Robert" last="Van Hage">Willem Robert Van Hage</name>
<affiliation wicri:level="1">
<nlm:aff id="I4">Business Informatics, Faculty of Sciences, Vrije Universiteit, Amsterdam, 1081 HV, The Netherlands</nlm:aff>
<country xml:lang="fr">Pays-Bas</country>
<wicri:regionArea>Business Informatics, Faculty of Sciences, Vrije Universiteit, Amsterdam, 1081 HV</wicri:regionArea>
<wicri:noRegion>1081 HV</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Krommydas, Konstantinos" sort="Krommydas, Konstantinos" uniqKey="Krommydas K" first="Konstantinos" last="Krommydas">Konstantinos Krommydas</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">Informatics Institute, University of Amsterdam, Amsterdam, 1098 SJ, The Netherlands</nlm:aff>
<country xml:lang="fr">Pays-Bas</country>
<wicri:regionArea>Informatics Institute, University of Amsterdam, Amsterdam, 1098 SJ</wicri:regionArea>
<wicri:noRegion>1098 SJ</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Adriaans, Pieter W" sort="Adriaans, Pieter W" uniqKey="Adriaans P" first="Pieter W" last="Adriaans">Pieter W. Adriaans</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">Informatics Institute, University of Amsterdam, Amsterdam, 1098 SJ, The Netherlands</nlm:aff>
<country xml:lang="fr">Pays-Bas</country>
<wicri:regionArea>Informatics Institute, University of Amsterdam, Amsterdam, 1098 SJ</wicri:regionArea>
<wicri:noRegion>1098 SJ</wicri:noRegion>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PMC</idno>
<idno type="pmid">19796406</idno>
<idno type="pmc">2755830</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2755830</idno>
<idno type="RBID">PMC:2755830</idno>
<idno type="doi">10.1186/1471-2105-10-S10-S9</idno>
<date when="2009">2009</date>
<idno type="wicri:Area/Pmc/Corpus">000210</idno>
<idno type="wicri:Area/Pmc/Curation">000210</idno>
<idno type="wicri:Area/Pmc/Checkpoint">000615</idno>
<idno type="wicri:Area/Ncbi/Merge">000110</idno>
<idno type="wicri:Area/Ncbi/Curation">000110</idno>
<idno type="wicri:Area/Ncbi/Checkpoint">000110</idno>
<idno type="wicri:Area/Main/Merge">000B87</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a" type="main">Structuring and extracting knowledge for the support of hypothesis generation in molecular biology</title>
<author>
<name sortKey="Roos, Marco" sort="Roos, Marco" uniqKey="Roos M" first="Marco" last="Roos">Marco Roos</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">Informatics Institute, University of Amsterdam, Amsterdam, 1098 SJ, The Netherlands</nlm:aff>
<country xml:lang="fr">Pays-Bas</country>
<wicri:regionArea>Informatics Institute, University of Amsterdam, Amsterdam, 1098 SJ</wicri:regionArea>
<wicri:noRegion>1098 SJ</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Marshall, M Scott" sort="Marshall, M Scott" uniqKey="Marshall M" first="M Scott" last="Marshall">M Scott Marshall</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">Informatics Institute, University of Amsterdam, Amsterdam, 1098 SJ, The Netherlands</nlm:aff>
<country xml:lang="fr">Pays-Bas</country>
<wicri:regionArea>Informatics Institute, University of Amsterdam, Amsterdam, 1098 SJ</wicri:regionArea>
<wicri:noRegion>1098 SJ</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Gibson, Andrew P" sort="Gibson, Andrew P" uniqKey="Gibson A" first="Andrew P" last="Gibson">Andrew P. Gibson</name>
<affiliation wicri:level="1">
<nlm:aff id="I2">Swammerdam Institute for Life Science, University of Amsterdam, Amsterdam, 1018 WB, The Netherlands</nlm:aff>
<country xml:lang="fr">Pays-Bas</country>
<wicri:regionArea>Swammerdam Institute for Life Science, University of Amsterdam, Amsterdam, 1018 WB</wicri:regionArea>
<wicri:noRegion>1018 WB</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Schuemie, Martijn" sort="Schuemie, Martijn" uniqKey="Schuemie M" first="Martijn" last="Schuemie">Martijn Schuemie</name>
<affiliation wicri:level="1">
<nlm:aff id="I3">BioSemantics group, Erasmus University of Rotterdam, Rotterdam, 3000 DR, The Netherlands</nlm:aff>
<country xml:lang="fr">Pays-Bas</country>
<wicri:regionArea>BioSemantics group, Erasmus University of Rotterdam, Rotterdam, 3000 DR</wicri:regionArea>
<wicri:noRegion>3000 DR</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Meij, Edgar" sort="Meij, Edgar" uniqKey="Meij E" first="Edgar" last="Meij">Edgar Meij</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">Informatics Institute, University of Amsterdam, Amsterdam, 1098 SJ, The Netherlands</nlm:aff>
<country xml:lang="fr">Pays-Bas</country>
<wicri:regionArea>Informatics Institute, University of Amsterdam, Amsterdam, 1098 SJ</wicri:regionArea>
<wicri:noRegion>1098 SJ</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Katrenko, Sophia" sort="Katrenko, Sophia" uniqKey="Katrenko S" first="Sophia" last="Katrenko">Sophia Katrenko</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">Informatics Institute, University of Amsterdam, Amsterdam, 1098 SJ, The Netherlands</nlm:aff>
<country xml:lang="fr">Pays-Bas</country>
<wicri:regionArea>Informatics Institute, University of Amsterdam, Amsterdam, 1098 SJ</wicri:regionArea>
<wicri:noRegion>1098 SJ</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Van Hage, Willem Robert" sort="Van Hage, Willem Robert" uniqKey="Van Hage W" first="Willem Robert" last="Van Hage">Willem Robert Van Hage</name>
<affiliation wicri:level="1">
<nlm:aff id="I4">Business Informatics, Faculty of Sciences, Vrije Universiteit, Amsterdam, 1081 HV, The Netherlands</nlm:aff>
<country xml:lang="fr">Pays-Bas</country>
<wicri:regionArea>Business Informatics, Faculty of Sciences, Vrije Universiteit, Amsterdam, 1081 HV</wicri:regionArea>
<wicri:noRegion>1081 HV</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Krommydas, Konstantinos" sort="Krommydas, Konstantinos" uniqKey="Krommydas K" first="Konstantinos" last="Krommydas">Konstantinos Krommydas</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">Informatics Institute, University of Amsterdam, Amsterdam, 1098 SJ, The Netherlands</nlm:aff>
<country xml:lang="fr">Pays-Bas</country>
<wicri:regionArea>Informatics Institute, University of Amsterdam, Amsterdam, 1098 SJ</wicri:regionArea>
<wicri:noRegion>1098 SJ</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Adriaans, Pieter W" sort="Adriaans, Pieter W" uniqKey="Adriaans P" first="Pieter W" last="Adriaans">Pieter W. Adriaans</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">Informatics Institute, University of Amsterdam, Amsterdam, 1098 SJ, The Netherlands</nlm:aff>
<country xml:lang="fr">Pays-Bas</country>
<wicri:regionArea>Informatics Institute, University of Amsterdam, Amsterdam, 1098 SJ</wicri:regionArea>
<wicri:noRegion>1098 SJ</wicri:noRegion>
</affiliation>
</author>
</analytic>
<series>
<title level="j">BMC Bioinformatics</title>
<idno type="eISSN">1471-2105</idno>
<imprint>
<date when="2009">2009</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">
<sec>
<title>Background</title>
<p>Hypothesis generation in molecular and cellular biology is an empirical process in which knowledge derived from prior experiments is distilled into a comprehensible model. The requirement of automated support is exemplified by the difficulty of considering all relevant facts that are contained in the millions of documents available from PubMed. Semantic Web provides tools for sharing prior knowledge, while information retrieval and information extraction techniques enable its extraction from literature. Their combination makes prior knowledge available for computational analysis and inference. While some tools provide complete solutions that limit the control over the modeling and extraction processes, we seek a methodology that supports control by the experimenter over these critical processes.</p>
</sec>
<sec>
<title>Results</title>
<p>We describe progress towards automated support for the generation of biomolecular hypotheses. Semantic Web technologies are used to structure and store knowledge, while a workflow extracts knowledge from text. We designed minimal proto-ontologies in OWL for capturing different aspects of a text mining experiment: the biological hypothesis, text and documents, text mining, and workflow provenance. The models fit a methodology that allows focus on the requirements of a single experiment while supporting reuse and posterior analysis of extracted knowledge from multiple experiments. Our workflow is composed of services from the 'Adaptive Information Disclosure Application' (AIDA) toolkit as well as a few others. The output is a semantic model with putative biological relations, with each relation linked to the corresponding evidence.</p>
</sec>
<sec>
<title>Conclusion</title>
<p>We demonstrated a 'do-it-yourself' approach for structuring and extracting knowledge in the context of experimental research on biomolecular mechanisms. The methodology can be used to bootstrap the construction of semantically rich biological models using the results of knowledge extraction processes. Models specific to particular experiments can be constructed that, in turn, link with other semantic models, creating a web of knowledge that spans experiments. Mapping mechanisms can link to other knowledge resources such as OBO ontologies or SKOS vocabularies. AIDA Web Services can be used to design personalized knowledge extraction procedures. In our example experiment, we found three proteins (NF-Kappa B, p21, and Bax) potentially playing a role in the interplay between nutrients and epigenetic gene regulation.</p>
</sec>
</div>
</front>
<back>
<div1 type="bibliography">
<listBibl>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
</listBibl>
</div1>
</back>
</TEI>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/CyberinfraV1/Data/Main/Merge
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000B87 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Merge/biblio.hfd -nk 000B87 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    CyberinfraV1
   |flux=    Main
   |étape=   Merge
   |type=    RBID
   |clé=     PMC:2755830
   |texte=   Structuring and extracting knowledge for the support of hypothesis generation in molecular biology
}}

Pour générer des pages wiki

HfdIndexSelect -h $EXPLOR_AREA/Data/Main/Merge/RBID.i   -Sk "pubmed:19796406" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Main/Merge/biblio.hfd   \
       | NlmPubMed2Wicri -a CyberinfraV1 

Wicri

This area was generated with Dilib version V0.6.25.
Data generation: Thu Oct 27 09:30:58 2016. Site generation: Sun Mar 10 23:08:40 2024