Cyberinfrastructure exploration server

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Data Integration for Dynamic and Sustainable Systems Biology Resources: Challenges and Lessons Learned

Internal identifier: 000288 (Pmc/Curation); previous: 000287; next: 000289

Authors: Daniel E. Sullivan; Joseph L. Gabbard; Maulik Shukla; Bruno Sobral

RBID : PMC:2894471

Abstract

Systems biology and infectious disease (host-pathogen-environment) research and development is becoming increasingly dependent on integrating data from diverse and dynamic sources. Maintaining integrated resources over long periods of time presents distinct challenges. This paper describes experiences and lessons learned from integrating data in two five-year projects focused on pathosystems biology: the Pathosystems Resource Integration Center (PATRIC, http://patric.vbi.vt.edu/), with a goal of developing bioinformatics resources for the research and countermeasures development communities based on genomics data, and the Resource Center for Biodefense Proteomics Research (RCBPR, http://www.proteomicsresource.org/), with a goal of developing resources based on experiment data, such as microarray and proteomics data, from diverse sources and technologies. Some challenges include integrating genomic sequence and experiment data, data synchronization, data quality control, and usability engineering. We present examples of a variety of data integration problems drawn from our experiences with PATRIC and RCBPR, as well as open research questions related to long-term sustainability, and describe the next steps toward meeting these challenges. Novel contributions of this work include (1) an approach for addressing discrepancies between experiment results and interpreted results and (2) expanding the range of data integration techniques to include usability engineering at the presentation level.


DOI: 10.1002/cbdv.200900317
PubMed: 20491070
PubMed Central: 2894471

The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Data Integration for Dynamic and Sustainable Systems Biology Resources: Challenges and Lessons Learned</title>
<author>
<name sortKey="Sullivan, Daniel E" sort="Sullivan, Daniel E" uniqKey="Sullivan D" first="Daniel E." last="Sullivan">Daniel E. Sullivan</name>
<affiliation>
<nlm:aff id="A1">CyberInfrastructure Section, Virginia Bioinformatics Institute, Washington Street, MC 0477, Virginia Tech, Blacksburg, Virginia 24061, USA (phone: 540-231-2100; fax: 540-231-2606;
<email>dsulliva@vbi.vt.edu</email>
)</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Gabbard, Joseph L" sort="Gabbard, Joseph L" uniqKey="Gabbard J" first="Joseph L." last="Gabbard">Joseph L. Gabbard</name>
<affiliation>
<nlm:aff id="A2">Center for Human-Computer Interaction, 2202 Kraft Drive, Virginia Tech, Blacksburg, Virginia 24060, USA (phone: 540- 231-3188; fax: (540) 231-6075;
<email>jgabbard@vt.edu</email>
)</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Shukla, Maulik" sort="Shukla, Maulik" uniqKey="Shukla M" first="Maulik" last="Shukla">Maulik Shukla</name>
<affiliation>
<nlm:aff id="A1">CyberInfrastructure Section, Virginia Bioinformatics Institute, Washington Street, MC 0477, Virginia Tech, Blacksburg, Virginia 24061, USA (phone: 540-231-2100; fax: 540-231-2606;
<email>dsulliva@vbi.vt.edu</email>
)</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Sobral, Bruno" sort="Sobral, Bruno" uniqKey="Sobral B" first="Bruno" last="Sobral">Bruno Sobral</name>
<affiliation>
<nlm:aff id="A1">CyberInfrastructure Section, Virginia Bioinformatics Institute, Washington Street, MC 0477, Virginia Tech, Blacksburg, Virginia 24061, USA (phone: 540-231-2100; fax: 540-231-2606;
<email>dsulliva@vbi.vt.edu</email>
)</nlm:aff>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PMC</idno>
<idno type="pmid">20491070</idno>
<idno type="pmc">2894471</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2894471</idno>
<idno type="RBID">PMC:2894471</idno>
<idno type="doi">10.1002/cbdv.200900317</idno>
<date when="2010">2010</date>
<idno type="wicri:Area/Pmc/Corpus">000288</idno>
<idno type="wicri:Area/Pmc/Curation">000288</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a" type="main">Data Integration for Dynamic and Sustainable Systems Biology Resources: Challenges and Lessons Learned</title>
<author>
<name sortKey="Sullivan, Daniel E" sort="Sullivan, Daniel E" uniqKey="Sullivan D" first="Daniel E." last="Sullivan">Daniel E. Sullivan</name>
<affiliation>
<nlm:aff id="A1">CyberInfrastructure Section, Virginia Bioinformatics Institute, Washington Street, MC 0477, Virginia Tech, Blacksburg, Virginia 24061, USA (phone: 540-231-2100; fax: 540-231-2606;
<email>dsulliva@vbi.vt.edu</email>
)</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Gabbard, Joseph L" sort="Gabbard, Joseph L" uniqKey="Gabbard J" first="Joseph L." last="Gabbard">Joseph L. Gabbard</name>
<affiliation>
<nlm:aff id="A2">Center for Human-Computer Interaction, 2202 Kraft Drive, Virginia Tech, Blacksburg, Virginia 24060, USA (phone: 540- 231-3188; fax: (540) 231-6075;
<email>jgabbard@vt.edu</email>
)</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Shukla, Maulik" sort="Shukla, Maulik" uniqKey="Shukla M" first="Maulik" last="Shukla">Maulik Shukla</name>
<affiliation>
<nlm:aff id="A1">CyberInfrastructure Section, Virginia Bioinformatics Institute, Washington Street, MC 0477, Virginia Tech, Blacksburg, Virginia 24061, USA (phone: 540-231-2100; fax: 540-231-2606;
<email>dsulliva@vbi.vt.edu</email>
)</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Sobral, Bruno" sort="Sobral, Bruno" uniqKey="Sobral B" first="Bruno" last="Sobral">Bruno Sobral</name>
<affiliation>
<nlm:aff id="A1">CyberInfrastructure Section, Virginia Bioinformatics Institute, Washington Street, MC 0477, Virginia Tech, Blacksburg, Virginia 24061, USA (phone: 540-231-2100; fax: 540-231-2606;
<email>dsulliva@vbi.vt.edu</email>
)</nlm:aff>
</affiliation>
</author>
</analytic>
<series>
<title level="j">Chemistry &amp; biodiversity</title>
<idno type="ISSN">1612-1872</idno>
<idno type="eISSN">1612-1880</idno>
<imprint>
<date when="2010">2010</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">
<p id="P1">Systems biology and infectious disease (host-pathogen-environment) research and development is becoming increasingly dependent on integrating data from diverse and dynamic sources. Maintaining integrated resources over long periods of time presents distinct challenges. This paper describes experiences and lessons learned from integrating data in two five-year projects focused on pathosystems biology: the Pathosystems Resource Integration Center (PATRIC,
<ext-link ext-link-type="uri" xlink:href="http://patric.vbi.vt.edu/">http://patric.vbi.vt.edu/</ext-link>
), with a goal of developing bioinformatics resources for the research and countermeasures development communities based on genomics data, and the Resource Center for Biodefense Proteomics Research (RCBPR,
<ext-link ext-link-type="uri" xlink:href="http://www.proteomicsresource.org/">http://www.proteomicsresource.org/</ext-link>
), with a goal of developing resources based on experiment data, such as microarray and proteomics data, from diverse sources and technologies. Some challenges include integrating genomic sequence and experiment data, data synchronization, data quality control, and usability engineering. We present examples of a variety of data integration problems drawn from our experiences with PATRIC and RCBPR, as well as open research questions related to long-term sustainability, and describe the next steps toward meeting these challenges. Novel contributions of this work include (1) an approach for addressing discrepancies between experiment results and interpreted results and (2) expanding the range of data integration techniques to include usability engineering at the presentation level.</p>
</div>
</front>
</TEI>
<pmc article-type="research-article" xml:lang="EN">
<pmc-comment>The publisher of this article does not allow downloading of the full text in XML form.</pmc-comment>
<pmc-dir>properties manuscript</pmc-dir>
<front>
<journal-meta>
<journal-id journal-id-type="nlm-journal-id">101197449</journal-id>
<journal-id journal-id-type="pubmed-jr-id">32031</journal-id>
<journal-id journal-id-type="nlm-ta">Chem Biodivers</journal-id>
<journal-title>Chemistry &amp; biodiversity</journal-title>
<issn pub-type="ppub">1612-1872</issn>
<issn pub-type="epub">1612-1880</issn>
</journal-meta>
<article-meta>
<article-id pub-id-type="pmid">20491070</article-id>
<article-id pub-id-type="pmc">2894471</article-id>
<article-id pub-id-type="doi">10.1002/cbdv.200900317</article-id>
<article-id pub-id-type="manuscript">NIHMS208765</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Data Integration for Dynamic and Sustainable Systems Biology Resources: Challenges and Lessons Learned</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Sullivan</surname>
<given-names>Daniel E.</given-names>
</name>
<xref ref-type="aff" rid="A1">a</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Gabbard</surname>
<given-names>Joseph L.</given-names>
<suffix>Jr</suffix>
</name>
<xref ref-type="aff" rid="A2">b</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Shukla</surname>
<given-names>Maulik</given-names>
</name>
<xref ref-type="aff" rid="A1">a</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Sobral</surname>
<given-names>Bruno</given-names>
</name>
<xref ref-type="aff" rid="A1">a</xref>
</contrib>
</contrib-group>
<aff id="A1">
<label>a</label>
CyberInfrastructure Section, Virginia Bioinformatics Institute, Washington Street, MC 0477, Virginia Tech, Blacksburg, Virginia 24061, USA (phone: 540-231-2100; fax: 540-231-2606;
<email>dsulliva@vbi.vt.edu</email>
)</aff>
<aff id="A2">
<label>b</label>
Center for Human-Computer Interaction, 2202 Kraft Drive, Virginia Tech, Blacksburg, Virginia 24060, USA (phone: 540- 231-3188; fax: (540) 231-6075;
<email>jgabbard@vt.edu</email>
)</aff>
<pub-date pub-type="nihms-submitted">
<day>15</day>
<month>6</month>
<year>2010</year>
</pub-date>
<pub-date pub-type="ppub">
<month>5</month>
<year>2010</year>
</pub-date>
<pub-date pub-type="pmc-release">
<day>30</day>
<month>6</month>
<year>2010</year>
</pub-date>
<volume>7</volume>
<issue>5</issue>
<fpage>1124</fpage>
<lpage>1141</lpage>
<abstract>
<p id="P1">Systems biology and infectious disease (host-pathogen-environment) research and development is becoming increasingly dependent on integrating data from diverse and dynamic sources. Maintaining integrated resources over long periods of time presents distinct challenges. This paper describes experiences and lessons learned from integrating data in two five-year projects focused on pathosystems biology: the Pathosystems Resource Integration Center (PATRIC,
<ext-link ext-link-type="uri" xlink:href="http://patric.vbi.vt.edu/">http://patric.vbi.vt.edu/</ext-link>
), with a goal of developing bioinformatics resources for the research and countermeasures development communities based on genomics data, and the Resource Center for Biodefense Proteomics Research (RCBPR,
<ext-link ext-link-type="uri" xlink:href="http://www.proteomicsresource.org/">http://www.proteomicsresource.org/</ext-link>
), with a goal of developing resources based on experiment data, such as microarray and proteomics data, from diverse sources and technologies. Some challenges include integrating genomic sequence and experiment data, data synchronization, data quality control, and usability engineering. We present examples of a variety of data integration problems drawn from our experiences with PATRIC and RCBPR, as well as open research questions related to long-term sustainability, and describe the next steps toward meeting these challenges. Novel contributions of this work include (1) an approach for addressing discrepancies between experiment results and interpreted results and (2) expanding the range of data integration techniques to include usability engineering at the presentation level.</p>
</abstract>
<contract-num rid="AI1">U54 AI057168-05S20025 ||AI</contract-num>
<contract-num rid="AI1">U54 AI057168-019001 ||AI</contract-num>
<contract-sponsor id="AI1">National Institute of Allergy and Infectious Diseases Extramural Activities : NIAID</contract-sponsor>
</article-meta>
</front>
</pmc>
</record>

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/CyberinfraV1/Data/Pmc/Curation
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000288 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Pmc/Curation/biblio.hfd -nk 000288 | SxmlIndent | more
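
Once a record has been extracted this way, its identifiers can also be read programmatically. The following is a minimal Python sketch and is not part of Dilib: it assumes that a fragment of the record (here a hand-copied subset of the publicationStmt element) is available as a string, and the function name collect_idnos is hypothetical. The full record as displayed uses prefixes such as nlm: and xlink: without visible namespace declarations, so a strict XML parser may reject it as-is; the fragment below deliberately avoids them.

```python
import xml.etree.ElementTree as ET

# Assumption: a subset of the <publicationStmt> element from the record above,
# copied by hand. In practice the text would come from the HfdSelect output.
FRAGMENT = """<publicationStmt>
<idno type="wicri:source">PMC</idno>
<idno type="pmid">20491070</idno>
<idno type="pmc">2894471</idno>
<idno type="RBID">PMC:2894471</idno>
<idno type="doi">10.1002/cbdv.200900317</idno>
<date when="2010">2010</date>
</publicationStmt>"""


def collect_idnos(xml_text):
    """Map each <idno> 'type' attribute to its stripped text content."""
    root = ET.fromstring(xml_text)
    return {
        el.get("type"): (el.text or "").strip()
        for el in root.iter("idno")
    }


ids = collect_idnos(FRAGMENT)
for kind, value in ids.items():
    print(f"{kind}: {value}")
```

A dictionary keyed by the type attribute is enough here because each identifier type occurs once in the fragment; a record with repeated types would need a list per key instead.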

To add a link to this page in the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    CyberinfraV1
   |flux=    Pmc
   |étape=   Curation
   |type=    RBID
   |clé=     PMC:2894471
   |texte=   Data Integration for Dynamic and Sustainable Systems Biology Resources: Challenges and Lessons Learned
}}

To generate wiki pages

HfdIndexSelect -h $EXPLOR_AREA/Data/Pmc/Curation/RBID.i   -Sk "pubmed:20491070" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Pmc/Curation/biblio.hfd   \
       | NlmPubMed2Wicri -a CyberinfraV1 

This area was generated with Dilib version V0.6.25.
Data generation: Thu Oct 27 09:30:58 2016. Site generation: Sun Mar 10 23:08:40 2024