Serveur d'exploration Cyberinfrastructure

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Data integration in biological research: an overview

Identifieur interne : 000194 ( Pmc/Corpus ); précédent : 000193; suivant : 000195

Data integration in biological research: an overview

Auteurs : Vasileios Lapatas ; Michalis Stefanidakis ; Rafael C. Jimenez ; Allegra Via ; Maria Victoria Schneider

Source :

RBID : PMC:4557916

Abstract

Data sharing, integration and annotation are essential to ensure the reproducibility of the analysis and interpretation of the experimental findings. Often these activities are perceived as a role that bioinformaticians and computer scientists have to take with no or little input from the experimental biologist. On the contrary, biological researchers, being the producers and often the end users of such data, have a big role in enabling biological data integration. The quality and usefulness of data integration depend on the existence and adoption of standards, shared formats, and mechanisms that are suitable for biological researchers to submit and annotate the data, so it can be easily searchable, conveniently linked and consequently used for further biological analysis and discovery. Here, we provide background on what is data integration from a computational science point of view, how it has been applied to biological research, which key aspects contributed to its success and future directions.


Url:
DOI: 10.1186/s40709-015-0032-5
PubMed: 26336651
PubMed Central: 4557916

Links to Exploration step

PMC:4557916

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Data integration in biological research: an overview</title>
<author>
<name sortKey="Lapatas, Vasileios" sort="Lapatas, Vasileios" uniqKey="Lapatas V" first="Vasileios" last="Lapatas">Vasileios Lapatas</name>
<affiliation>
<nlm:aff id="Aff1">Department of Informatics, Ionian University, 7 Tsirigoti Square, Corfu, 49100 Greece</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Stefanidakis, Michalis" sort="Stefanidakis, Michalis" uniqKey="Stefanidakis M" first="Michalis" last="Stefanidakis">Michalis Stefanidakis</name>
<affiliation>
<nlm:aff id="Aff1">Department of Informatics, Ionian University, 7 Tsirigoti Square, Corfu, 49100 Greece</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Jimenez, Rafael C" sort="Jimenez, Rafael C" uniqKey="Jimenez R" first="Rafael C." last="Jimenez">Rafael C. Jimenez</name>
<affiliation>
<nlm:aff id="Aff2">ELIXIR, Wellcome Trust Genome Campus, Hinxton, CB10 1SD UK</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Via, Allegra" sort="Via, Allegra" uniqKey="Via A" first="Allegra" last="Via">Allegra Via</name>
<affiliation>
<nlm:aff id="Aff3">Biocomputing Group, Sapienza University, Piazzale Aldo Moro 5, Rome, 00185 Italy</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Schneider, Maria Victoria" sort="Schneider, Maria Victoria" uniqKey="Schneider M" first="Maria Victoria" last="Schneider">Maria Victoria Schneider</name>
<affiliation>
<nlm:aff id="Aff4">361° Division, The Genome Analysis Centre, Norwich Research Park, Norwich, NR4 7UH UK</nlm:aff>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PMC</idno>
<idno type="pmid">26336651</idno>
<idno type="pmc">4557916</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4557916</idno>
<idno type="RBID">PMC:4557916</idno>
<idno type="doi">10.1186/s40709-015-0032-5</idno>
<date when="2015">2015</date>
<idno type="wicri:Area/Pmc/Corpus">000194</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a" type="main">Data integration in biological research: an overview</title>
<author>
<name sortKey="Lapatas, Vasileios" sort="Lapatas, Vasileios" uniqKey="Lapatas V" first="Vasileios" last="Lapatas">Vasileios Lapatas</name>
<affiliation>
<nlm:aff id="Aff1">Department of Informatics, Ionian University, 7 Tsirigoti Square, Corfu, 49100 Greece</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Stefanidakis, Michalis" sort="Stefanidakis, Michalis" uniqKey="Stefanidakis M" first="Michalis" last="Stefanidakis">Michalis Stefanidakis</name>
<affiliation>
<nlm:aff id="Aff1">Department of Informatics, Ionian University, 7 Tsirigoti Square, Corfu, 49100 Greece</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Jimenez, Rafael C" sort="Jimenez, Rafael C" uniqKey="Jimenez R" first="Rafael C." last="Jimenez">Rafael C. Jimenez</name>
<affiliation>
<nlm:aff id="Aff2">ELIXIR, Wellcome Trust Genome Campus, Hinxton, CB10 1SD UK</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Via, Allegra" sort="Via, Allegra" uniqKey="Via A" first="Allegra" last="Via">Allegra Via</name>
<affiliation>
<nlm:aff id="Aff3">Biocomputing Group, Sapienza University, Piazzale Aldo Moro 5, Rome, 00185 Italy</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Schneider, Maria Victoria" sort="Schneider, Maria Victoria" uniqKey="Schneider M" first="Maria Victoria" last="Schneider">Maria Victoria Schneider</name>
<affiliation>
<nlm:aff id="Aff4">361° Division, The Genome Analysis Centre, Norwich Research Park, Norwich, NR4 7UH UK</nlm:aff>
</affiliation>
</author>
</analytic>
<series>
<title level="j">Journal of Biological Research</title>
<idno type="ISSN">1790-045X</idno>
<idno type="eISSN">2241-5793</idno>
<imprint>
<date when="2015">2015</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">
<p>Data sharing, integration and annotation are essential to ensure the reproducibility of the analysis and interpretation of the experimental findings. Often these activities are perceived as a role that bioinformaticians and computer scientists have to take with no or little input from the experimental biologist. On the contrary, biological researchers, being the producers and often the end users of such data, have a big role in enabling biological data integration. The quality and usefulness of data integration depend on the existence and adoption of standards, shared formats, and mechanisms that are suitable for biological researchers to submit and annotate the data, so it can be easily searchable, conveniently linked and consequently used for further biological analysis and discovery. Here, we provide background on what is data integration from a computational science point of view, how it has been applied to biological research, which key aspects contributed to its success and future directions.</p>
</div>
</front>
<back>
<div1 type="bibliography">
<listBibl>
<biblStruct>
<analytic>
<author>
<name sortKey="Stamatoyannopoulos, Ja" uniqKey="Stamatoyannopoulos J">JA Stamatoyannopoulos</name>
</author>
<author>
<name sortKey="Snyder, M" uniqKey="Snyder M">M Snyder</name>
</author>
<author>
<name sortKey="Hardison, R" uniqKey="Hardison R">R Hardison</name>
</author>
<author>
<name sortKey="Ren, B" uniqKey="Ren B">B Ren</name>
</author>
<author>
<name sortKey="Gingeras, T" uniqKey="Gingeras T">T Gingeras</name>
</author>
<author>
<name sortKey="Gilbert, Dm" uniqKey="Gilbert D">DM Gilbert</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Gomez Cabrero, D" uniqKey="Gomez Cabrero D">D Gomez-Cabrero</name>
</author>
<author>
<name sortKey="Abugessaisa, I" uniqKey="Abugessaisa I">I Abugessaisa</name>
</author>
<author>
<name sortKey="Maier, D" uniqKey="Maier D">D Maier</name>
</author>
<author>
<name sortKey="Teschendorff, A" uniqKey="Teschendorff A">A Teschendorff</name>
</author>
<author>
<name sortKey="Merkenschlager, M" uniqKey="Merkenschlager M">M Merkenschlager</name>
</author>
<author>
<name sortKey="Gisel, A" uniqKey="Gisel A">A Gisel</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Ma Yan, A" uniqKey="Ma Yan A">A Ma’ayan</name>
</author>
<author>
<name sortKey="Rouillard, Ad" uniqKey="Rouillard A">AD Rouillard</name>
</author>
<author>
<name sortKey="Clark, Nr" uniqKey="Clark N">NR Clark</name>
</author>
<author>
<name sortKey="Wang, Z" uniqKey="Wang Z">Z Wang</name>
</author>
<author>
<name sortKey="Duan, Q" uniqKey="Duan Q">Q Duan</name>
</author>
<author>
<name sortKey="Kou, Y" uniqKey="Kou Y">Y Kou</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Ritchie, Md" uniqKey="Ritchie M">MD Ritchie</name>
</author>
<author>
<name sortKey="Holzinger, Er" uniqKey="Holzinger E">ER Holzinger</name>
</author>
<author>
<name sortKey="Li, R" uniqKey="Li R">R Li</name>
</author>
<author>
<name sortKey="Pendergrass, Sa" uniqKey="Pendergrass S">SA Pendergrass</name>
</author>
<author>
<name sortKey="Kim, D" uniqKey="Kim D">D Kim</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Warde Farley, D" uniqKey="Warde Farley D">D Warde-Farley</name>
</author>
<author>
<name sortKey="Donaldson, Sl" uniqKey="Donaldson S">SL Donaldson</name>
</author>
<author>
<name sortKey="Comes, O" uniqKey="Comes O">O Comes</name>
</author>
<author>
<name sortKey="Zuberi, K" uniqKey="Zuberi K">K Zuberi</name>
</author>
<author>
<name sortKey="Badrawi, R" uniqKey="Badrawi R">R Badrawi</name>
</author>
<author>
<name sortKey="Chao, P" uniqKey="Chao P">P Chao</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Rieping, W" uniqKey="Rieping W">W Rieping</name>
</author>
<author>
<name sortKey="Habeck, M" uniqKey="Habeck M">M Habeck</name>
</author>
<author>
<name sortKey="Bardiaux, B" uniqKey="Bardiaux B">B Bardiaux</name>
</author>
<author>
<name sortKey="Bernard, A" uniqKey="Bernard A">A Bernard</name>
</author>
<author>
<name sortKey="Malliavin, Te" uniqKey="Malliavin T">TE Malliavin</name>
</author>
<author>
<name sortKey="Nilges, M" uniqKey="Nilges M">M Nilges</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Jansen, R" uniqKey="Jansen R">R Jansen</name>
</author>
<author>
<name sortKey="Yu, H" uniqKey="Yu H">H Yu</name>
</author>
<author>
<name sortKey="Greenbaum, D" uniqKey="Greenbaum D">D Greenbaum</name>
</author>
<author>
<name sortKey="Kluger, Y" uniqKey="Kluger Y">Y Kluger</name>
</author>
<author>
<name sortKey="Krogan, Nj" uniqKey="Krogan N">NJ Krogan</name>
</author>
<author>
<name sortKey="Chung, S" uniqKey="Chung S">S Chung</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Hwang, D" uniqKey="Hwang D">D Hwang</name>
</author>
<author>
<name sortKey="Rust, Ag" uniqKey="Rust A">AG Rust</name>
</author>
<author>
<name sortKey="Ramsey, S" uniqKey="Ramsey S">S Ramsey</name>
</author>
<author>
<name sortKey="Smith, Jj" uniqKey="Smith J">JJ Smith</name>
</author>
<author>
<name sortKey="Leslie, Dm" uniqKey="Leslie D">DM Leslie</name>
</author>
<author>
<name sortKey="Weston, Ad" uniqKey="Weston A">AD Weston</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Myers, Cl" uniqKey="Myers C">CL Myers</name>
</author>
<author>
<name sortKey="Troyanskaya, Og" uniqKey="Troyanskaya O">OG Troyanskaya</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Chung, Sy" uniqKey="Chung S">SY Chung</name>
</author>
<author>
<name sortKey="Wong, L" uniqKey="Wong L">L Wong</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Letunic, I" uniqKey="Letunic I">I Letunic</name>
</author>
<author>
<name sortKey="Copley, Rr" uniqKey="Copley R">RR Copley</name>
</author>
<author>
<name sortKey="Schmidt, S" uniqKey="Schmidt S">S Schmidt</name>
</author>
<author>
<name sortKey="Ciccarelli, Fd" uniqKey="Ciccarelli F">FD Ciccarelli</name>
</author>
<author>
<name sortKey="Doerks, T" uniqKey="Doerks T">T Doerks</name>
</author>
<author>
<name sortKey="Schultz, J" uniqKey="Schultz J">J Schultz</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Von Mering, C" uniqKey="Von Mering C">C Von Mering</name>
</author>
<author>
<name sortKey="Jensen, Lj" uniqKey="Jensen L">LJ Jensen</name>
</author>
<author>
<name sortKey="Kuhn, M" uniqKey="Kuhn M">M Kuhn</name>
</author>
<author>
<name sortKey="Chaffron, S" uniqKey="Chaffron S">S Chaffron</name>
</author>
<author>
<name sortKey="Doerks, T" uniqKey="Doerks T">T Doerks</name>
</author>
<author>
<name sortKey="Kruger, B" uniqKey="Kruger B">B Krüger</name>
</author>
<author>
<name sortKey="Snel, B" uniqKey="Snel B">B Snel</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Cheung, K H" uniqKey="Cheung K">K-H Cheung</name>
</author>
<author>
<name sortKey="Yip, Ky" uniqKey="Yip K">KY Yip</name>
</author>
<author>
<name sortKey="Smith, A" uniqKey="Smith A">A Smith</name>
</author>
<author>
<name sortKey="Masiar, A" uniqKey="Masiar A">A Masiar</name>
</author>
<author>
<name sortKey="Gerstein, M" uniqKey="Gerstein M">M Gerstein</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Goldovsky, L" uniqKey="Goldovsky L">L Goldovsky</name>
</author>
<author>
<name sortKey="Janssen, P" uniqKey="Janssen P">P Janssen</name>
</author>
<author>
<name sortKey="Ahren, D" uniqKey="Ahren D">D Ahren</name>
</author>
<author>
<name sortKey="Audit, B" uniqKey="Audit B">B Audit</name>
</author>
<author>
<name sortKey="Cases, I" uniqKey="Cases I">I Cases</name>
</author>
<author>
<name sortKey="Darzentas, N" uniqKey="Darzentas N">N Darzentas</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Kauppinen, T" uniqKey="Kauppinen T">T Kauppinen</name>
</author>
<author>
<name sortKey="De Espindola, Gm" uniqKey="De Espindola G">GM de Espindola</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Chamberlain, Sa" uniqKey="Chamberlain S">SA Chamberlain</name>
</author>
<author>
<name sortKey="Szocs, E" uniqKey="Szocs E">E Szöcs</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Juty, N" uniqKey="Juty N">N Juty</name>
</author>
<author>
<name sortKey="Ali, R" uniqKey="Ali R">R Ali</name>
</author>
<author>
<name sortKey="Glont, M" uniqKey="Glont M">M Glont</name>
</author>
<author>
<name sortKey="Keating, S" uniqKey="Keating S">S Keating</name>
</author>
<author>
<name sortKey="Rodriguez, N" uniqKey="Rodriguez N">N Rodriguez</name>
</author>
<author>
<name sortKey="Swat, M" uniqKey="Swat M">M Swat</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Kenall, A" uniqKey="Kenall A">A Kenall</name>
</author>
<author>
<name sortKey="Edmunds, S" uniqKey="Edmunds S">S Edmunds</name>
</author>
<author>
<name sortKey="Goodman, L" uniqKey="Goodman L">L Goodman</name>
</author>
<author>
<name sortKey="Bal, L" uniqKey="Bal L">L Bal</name>
</author>
<author>
<name sortKey="Flintoft, L" uniqKey="Flintoft L">L Flintoft</name>
</author>
<author>
<name sortKey="Shanahan, Dr" uniqKey="Shanahan D">DR Shanahan</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Garijo, D" uniqKey="Garijo D">D Garijo</name>
</author>
<author>
<name sortKey="Kinnings, S" uniqKey="Kinnings S">S Kinnings</name>
</author>
<author>
<name sortKey="Xie, L" uniqKey="Xie L">L Xie</name>
</author>
<author>
<name sortKey="Xie, L" uniqKey="Xie L">L Xie</name>
</author>
<author>
<name sortKey="Zhang, Y" uniqKey="Zhang Y">Y Zhang</name>
</author>
<author>
<name sortKey="Bourne, Pe" uniqKey="Bourne P">PE Bourne</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Saleem, M" uniqKey="Saleem M">M Saleem</name>
</author>
<author>
<name sortKey="Kamdar, Mr" uniqKey="Kamdar M">MR Kamdar</name>
</author>
<author>
<name sortKey="Iqbal, A" uniqKey="Iqbal A">A Iqbal</name>
</author>
<author>
<name sortKey="Sampath, S" uniqKey="Sampath S">S Sampath</name>
</author>
<author>
<name sortKey="Deus, Hf" uniqKey="Deus H">HF Deus</name>
</author>
<author>
<name sortKey="Ngomo, A Cn" uniqKey="Ngomo A">A-CN Ngomo</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Wandelt, S" uniqKey="Wandelt S">S Wandelt</name>
</author>
<author>
<name sortKey="Rheinl Nder, A" uniqKey="Rheinl Nder A">A Rheinländer</name>
</author>
<author>
<name sortKey="Bux, M" uniqKey="Bux M">M Bux</name>
</author>
<author>
<name sortKey="Thalheim, L" uniqKey="Thalheim L">L Thalheim</name>
</author>
<author>
<name sortKey="Haldemann, B" uniqKey="Haldemann B">B Haldemann</name>
</author>
<author>
<name sortKey="Leser, U" uniqKey="Leser U">U Leser</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Nekrutenko, A" uniqKey="Nekrutenko A">A Nekrutenko</name>
</author>
<author>
<name sortKey="Taylor, J" uniqKey="Taylor J">J Taylor</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Bravo, E" uniqKey="Bravo E">E Bravo</name>
</author>
<author>
<name sortKey="Calzolari, A" uniqKey="Calzolari A">A Calzolari</name>
</author>
<author>
<name sortKey="De Castro, P" uniqKey="De Castro P">P De Castro</name>
</author>
<author>
<name sortKey="Mabile, L" uniqKey="Mabile L">L Mabile</name>
</author>
<author>
<name sortKey="Napolitani, F" uniqKey="Napolitani F">F Napolitani</name>
</author>
<author>
<name sortKey="Rossi, Am" uniqKey="Rossi A">AM Rossi</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Mabile, L" uniqKey="Mabile L">L Mabile</name>
</author>
<author>
<name sortKey="Dalgleish, R" uniqKey="Dalgleish R">R Dalgleish</name>
</author>
<author>
<name sortKey="Thorisson, Ga" uniqKey="Thorisson G">GA Thorisson</name>
</author>
<author>
<name sortKey="Deschenes, M" uniqKey="Deschenes M">M Deschênes</name>
</author>
<author>
<name sortKey="Hewitt, R" uniqKey="Hewitt R">R Hewitt</name>
</author>
<author>
<name sortKey="Carpenter, J" uniqKey="Carpenter J">J Carpenter</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Goble, C" uniqKey="Goble C">C Goble</name>
</author>
<author>
<name sortKey="Stevens, R" uniqKey="Stevens R">R Stevens</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Widom, J" uniqKey="Widom J">J Widom</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Zhuge, Y" uniqKey="Zhuge Y">Y Zhuge</name>
</author>
<author>
<name sortKey="Garcia Molina, H" uniqKey="Garcia Molina H">H García-Molina</name>
</author>
<author>
<name sortKey="Hammer, J" uniqKey="Hammer J">J Hammer</name>
</author>
<author>
<name sortKey="Widom, J" uniqKey="Widom J">J Widom</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Ives, Zg" uniqKey="Ives Z">ZG Ives</name>
</author>
<author>
<name sortKey="Florescu, D" uniqKey="Florescu D">D Florescu</name>
</author>
<author>
<name sortKey="Friedman, M" uniqKey="Friedman M">M Friedman</name>
</author>
<author>
<name sortKey="Levy, A" uniqKey="Levy A">A Levy</name>
</author>
<author>
<name sortKey="Weld, Ds" uniqKey="Weld D">DS Weld</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Halevy, Ay" uniqKey="Halevy A">AY Halevy</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Abiteboul, S" uniqKey="Abiteboul S">S Abiteboul</name>
</author>
<author>
<name sortKey="Duschka, Om" uniqKey="Duschka O">OM Duschka</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Levy, Ay" uniqKey="Levy A">AY Levy</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Beeri, C" uniqKey="Beeri C">C Beeri</name>
</author>
<author>
<name sortKey="Buneman, P" uniqKey="Buneman P">P Buneman</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Etzioni, O" uniqKey="Etzioni O">O Etzioni</name>
</author>
<author>
<name sortKey="Golden, K" uniqKey="Golden K">K Golden</name>
</author>
<author>
<name sortKey="Weld, Ds" uniqKey="Weld D">DS Weld</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Smedley, D" uniqKey="Smedley D">D Smedley</name>
</author>
<author>
<name sortKey="Haider, S" uniqKey="Haider S">S Haider</name>
</author>
<author>
<name sortKey="Ballester, B" uniqKey="Ballester B">B Ballester</name>
</author>
<author>
<name sortKey="Holland, R" uniqKey="Holland R">R Holland</name>
</author>
<author>
<name sortKey="London, D" uniqKey="London D">D London</name>
</author>
<author>
<name sortKey="Thorisson, G" uniqKey="Thorisson G">G Thorisson</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Etzold, T" uniqKey="Etzold T">T Etzold</name>
</author>
<author>
<name sortKey="Argos, P" uniqKey="Argos P">P Argos</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Belleau, F" uniqKey="Belleau F">F Belleau</name>
</author>
<author>
<name sortKey="Nolin, Ma" uniqKey="Nolin M">MA Nolin</name>
</author>
<author>
<name sortKey="Tourigny, N" uniqKey="Tourigny N">N Tourigny</name>
</author>
<author>
<name sortKey="Rigault, P" uniqKey="Rigault P">P Rigault</name>
</author>
<author>
<name sortKey="Morissette, J" uniqKey="Morissette J">J Morissette</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Bateman, A" uniqKey="Bateman A">A Bateman</name>
</author>
<author>
<name sortKey="Martin, Mj" uniqKey="Martin M">MJ Martin</name>
</author>
<author>
<name sortKey="O Onovan, C" uniqKey="O Onovan C">C O’Donovan</name>
</author>
<author>
<name sortKey="Magrane, M" uniqKey="Magrane M">M Magrane</name>
</author>
<author>
<name sortKey="Apweiler, R" uniqKey="Apweiler R">R Apweiler</name>
</author>
<author>
<name sortKey="Alpi, E" uniqKey="Alpi E">E Alpi</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Benson, Da" uniqKey="Benson D">DA Benson</name>
</author>
<author>
<name sortKey="Clark, K" uniqKey="Clark K">K Clark</name>
</author>
<author>
<name sortKey="Karsch Mizrachi, I" uniqKey="Karsch Mizrachi I">I Karsch-Mizrachi</name>
</author>
<author>
<name sortKey="Lipman, Dj" uniqKey="Lipman D">DJ Lipman</name>
</author>
<author>
<name sortKey="Ostell, J" uniqKey="Ostell J">J Ostell</name>
</author>
<author>
<name sortKey="Sayers, Ew" uniqKey="Sayers E">EW Sayers</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Cerami, Eg" uniqKey="Cerami E">EG Cerami</name>
</author>
<author>
<name sortKey="Gross, Be" uniqKey="Gross B">BE Gross</name>
</author>
<author>
<name sortKey="Demir, E" uniqKey="Demir E">E Demir</name>
</author>
<author>
<name sortKey="Rodchenkov, I" uniqKey="Rodchenkov I">I Rodchenkov</name>
</author>
<author>
<name sortKey="Babur, O" uniqKey="Babur O">O Babur</name>
</author>
<author>
<name sortKey="Anwar, N" uniqKey="Anwar N">N Anwar</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Artimo, P" uniqKey="Artimo P">P Artimo</name>
</author>
<author>
<name sortKey="Jonnalagedda, M" uniqKey="Jonnalagedda M">M Jonnalagedda</name>
</author>
<author>
<name sortKey="Arnold, K" uniqKey="Arnold K">K Arnold</name>
</author>
<author>
<name sortKey="Baratin, D" uniqKey="Baratin D">D Baratin</name>
</author>
<author>
<name sortKey="Csardi, G" uniqKey="Csardi G">G Csardi</name>
</author>
<author>
<name sortKey="De Castro, E" uniqKey="De Castro E">E de Castro</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Karp, Pd" uniqKey="Karp P">PD Karp</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Dowell, Rd" uniqKey="Dowell R">RD Dowell</name>
</author>
<author>
<name sortKey="Jokerst, Rm" uniqKey="Jokerst R">RM Jokerst</name>
</author>
<author>
<name sortKey="Day, A" uniqKey="Day A">A Day</name>
</author>
<author>
<name sortKey="Eddy, Sr" uniqKey="Eddy S">SR Eddy</name>
</author>
<author>
<name sortKey="Stein, L" uniqKey="Stein L">L Stein</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Gehlenborg, N" uniqKey="Gehlenborg N">N Gehlenborg</name>
</author>
<author>
<name sortKey="O Onoghue, Si" uniqKey="O Onoghue S">SI O’Donoghue</name>
</author>
<author>
<name sortKey="Baliga, Ns" uniqKey="Baliga N">NS Baliga</name>
</author>
<author>
<name sortKey="Goesmann, A" uniqKey="Goesmann A">A Goesmann</name>
</author>
<author>
<name sortKey="Hibbs, Ma" uniqKey="Hibbs M">MA Hibbs</name>
</author>
<author>
<name sortKey="Kitano, H" uniqKey="Kitano H">H Kitano</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Smith, B" uniqKey="Smith B">B Smith</name>
</author>
<author>
<name sortKey="Ashburner, M" uniqKey="Ashburner M">M Ashburner</name>
</author>
<author>
<name sortKey="Rosse, C" uniqKey="Rosse C">C Rosse</name>
</author>
<author>
<name sortKey="Bard, J" uniqKey="Bard J">J Bard</name>
</author>
<author>
<name sortKey="Bug, W" uniqKey="Bug W">W Bug</name>
</author>
<author>
<name sortKey="Ceusters, W" uniqKey="Ceusters W">W Ceusters</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Musen, Ma" uniqKey="Musen M">MA Musen</name>
</author>
<author>
<name sortKey="Noy, Nf" uniqKey="Noy N">NF Noy</name>
</author>
<author>
<name sortKey="Shah, Nh" uniqKey="Shah N">NH Shah</name>
</author>
<author>
<name sortKey="Whetzel, Pl" uniqKey="Whetzel P">PL Whetzel</name>
</author>
<author>
<name sortKey="Chute, Cg" uniqKey="Chute C">CG Chute</name>
</author>
<author>
<name sortKey="Story, Ma" uniqKey="Story M">MA Story</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Gray, Ka" uniqKey="Gray K">KA Gray</name>
</author>
<author>
<name sortKey="Yates, B" uniqKey="Yates B">B Yates</name>
</author>
<author>
<name sortKey="Seal, Rl" uniqKey="Seal R">RL Seal</name>
</author>
<author>
<name sortKey="Wright, Mw" uniqKey="Wright M">MW Wright</name>
</author>
<author>
<name sortKey="Bruford, Ea" uniqKey="Bruford E">EA Bruford</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Mathew, Jp" uniqKey="Mathew J">JP Mathew</name>
</author>
<author>
<name sortKey="Taylor, Bs" uniqKey="Taylor B">BS Taylor</name>
</author>
<author>
<name sortKey="Bader, Gd" uniqKey="Bader G">GD Bader</name>
</author>
<author>
<name sortKey="Pyarajan, S" uniqKey="Pyarajan S">S Pyarajan</name>
</author>
<author>
<name sortKey="Antoniotti, M" uniqKey="Antoniotti M">M Antoniotti</name>
</author>
<author>
<name sortKey="Chinnaiyan, Am" uniqKey="Chinnaiyan A">AM Chinnaiyan</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Higgins, S" uniqKey="Higgins S">S Higgins</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Field, D" uniqKey="Field D">D Field</name>
</author>
<author>
<name sortKey="Sansone, S" uniqKey="Sansone S">S Sansone</name>
</author>
<author>
<name sortKey="Delong, Ef" uniqKey="Delong E">EF Delong</name>
</author>
<author>
<name sortKey="Sterk, P" uniqKey="Sterk P">P Sterk</name>
</author>
<author>
<name sortKey="Friedberg, I" uniqKey="Friedberg I">I Friedberg</name>
</author>
<author>
<name sortKey="Gaudet, P" uniqKey="Gaudet P">P Gaudet</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Brazma, A" uniqKey="Brazma A">A Brazma</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Brooksbank, C" uniqKey="Brooksbank C">C Brooksbank</name>
</author>
<author>
<name sortKey="Quackenbush, J" uniqKey="Quackenbush J">J Quackenbush</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Piwowar, Ha" uniqKey="Piwowar H">HA Piwowar</name>
</author>
<author>
<name sortKey="Becich, Mj" uniqKey="Becich M">MJ Becich</name>
</author>
<author>
<name sortKey="Bilofsky, H" uniqKey="Bilofsky H">H Bilofsky</name>
</author>
<author>
<name sortKey="Crowley, Rs" uniqKey="Crowley R">RS Crowley</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Chervitz, Sa" uniqKey="Chervitz S">SA Chervitz</name>
</author>
<author>
<name sortKey="Deutsch, Ew" uniqKey="Deutsch E">EW Deutsch</name>
</author>
<author>
<name sortKey="Field, D" uniqKey="Field D">D Field</name>
</author>
<author>
<name sortKey="Parkinson, H" uniqKey="Parkinson H">H Parkinson</name>
</author>
<author>
<name sortKey="Quackenbush, J" uniqKey="Quackenbush J">J Quackenbush</name>
</author>
<author>
<name sortKey="Rocca Serra, P" uniqKey="Rocca Serra P">P Rocca-Serra</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Popplewell, K" uniqKey="Popplewell K">K Popplewell</name>
</author>
<author>
<name sortKey="Harding, J" uniqKey="Harding J">J Harding</name>
</author>
<author>
<name sortKey="Poler, R" uniqKey="Poler R">R Poler</name>
</author>
<author>
<name sortKey="Chalmeta, R" uniqKey="Chalmeta R">R Chalmeta</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Bard, Jb" uniqKey="Bard J">JB Bard</name>
</author>
<author>
<name sortKey="Rhee, Sy" uniqKey="Rhee S">SY Rhee</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Smith, B" uniqKey="Smith B">B Smith</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Chandrasekaran, B" uniqKey="Chandrasekaran B">B Chandrasekaran</name>
</author>
<author>
<name sortKey="Josephson, Jr" uniqKey="Josephson J">JR Josephson</name>
</author>
<author>
<name sortKey="Benjamins, Vr" uniqKey="Benjamins V">VR Benjamins</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Mayer, G" uniqKey="Mayer G">G Mayer</name>
</author>
<author>
<name sortKey="Jones, Ar" uniqKey="Jones A">AR Jones</name>
</author>
<author>
<name sortKey="Binz, P A" uniqKey="Binz P">P-A Binz</name>
</author>
<author>
<name sortKey="Deutsch, Ew" uniqKey="Deutsch E">EW Deutsch</name>
</author>
<author>
<name sortKey="Orchard, S" uniqKey="Orchard S">S Orchard</name>
</author>
<author>
<name sortKey="Montecchi Palazzi, L" uniqKey="Montecchi Palazzi L">L Montecchi-Palazzi</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Blake, Ja" uniqKey="Blake J">JA Blake</name>
</author>
<author>
<name sortKey="Bult, Cj" uniqKey="Bult C">CJ Bult</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Whetzel, Pl" uniqKey="Whetzel P">PL Whetzel</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Jonquet, C" uniqKey="Jonquet C">C Jonquet</name>
</author>
<author>
<name sortKey="Lependu, P" uniqKey="Lependu P">P Lependu</name>
</author>
<author>
<name sortKey="Falconer, S" uniqKey="Falconer S">S Falconer</name>
</author>
<author>
<name sortKey="Coulet, A" uniqKey="Coulet A">A Coulet</name>
</author>
<author>
<name sortKey="Noy, Nf" uniqKey="Noy N">NF Noy</name>
</author>
<author>
<name sortKey="Musen, Ma" uniqKey="Musen M">MA Musen</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Cote, R" uniqKey="Cote R">R Cote</name>
</author>
<author>
<name sortKey="Reisinger, F" uniqKey="Reisinger F">F Reisinger</name>
</author>
<author>
<name sortKey="Martens, L" uniqKey="Martens L">L Martens</name>
</author>
<author>
<name sortKey="Barsnes, H" uniqKey="Barsnes H">H Barsnes</name>
</author>
<author>
<name sortKey="Vizcaino, Ja" uniqKey="Vizcaino J">JA Vizcaino</name>
</author>
<author>
<name sortKey="Hermjakob, H" uniqKey="Hermjakob H">H Hermjakob</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Corpas, M" uniqKey="Corpas M">M Corpas</name>
</author>
<author>
<name sortKey="Fatumo, S" uniqKey="Fatumo S">S Fatumo</name>
</author>
<author>
<name sortKey="Schneider, R" uniqKey="Schneider R">R Schneider</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Baker, M" uniqKey="Baker M">M Baker</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Field, D" uniqKey="Field D">D Field</name>
</author>
<author>
<name sortKey="Garrity, G" uniqKey="Garrity G">G Garrity</name>
</author>
<author>
<name sortKey="Gray, T" uniqKey="Gray T">T Gray</name>
</author>
<author>
<name sortKey="Morrison, N" uniqKey="Morrison N">N Morrison</name>
</author>
<author>
<name sortKey="Selengut, J" uniqKey="Selengut J">J Selengut</name>
</author>
<author>
<name sortKey="Sterk, P" uniqKey="Sterk P">P Sterk</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Parnell, Ld" uniqKey="Parnell L">LD Parnell</name>
</author>
<author>
<name sortKey="Lindenbaum, P" uniqKey="Lindenbaum P">P Lindenbaum</name>
</author>
<author>
<name sortKey="Shameer, K" uniqKey="Shameer K">K Shameer</name>
</author>
<author>
<name sortKey="Dall Lio, Gm" uniqKey="Dall Lio G">GM Dall’Olio</name>
</author>
<author>
<name sortKey="Swan, Dc" uniqKey="Swan D">DC Swan</name>
</author>
<author>
<name sortKey="Jensen, Lj" uniqKey="Jensen L">LJ Jensen</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Achard, F" uniqKey="Achard F">F Achard</name>
</author>
<author>
<name sortKey="Vaysseix, G" uniqKey="Vaysseix G">G Vaysseix</name>
</author>
<author>
<name sortKey="Barillot, E" uniqKey="Barillot E">E Barillot</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Barsnes, H" uniqKey="Barsnes H">H Barsnes</name>
</author>
<author>
<name sortKey="Vizcaino, Ja" uniqKey="Vizcaino J">JA Vizcaino</name>
</author>
<author>
<name sortKey="Eidhammer, I" uniqKey="Eidhammer I">I Eidhammer</name>
</author>
<author>
<name sortKey="Martens, L" uniqKey="Martens L">L Martens</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Martens, L" uniqKey="Martens L">L Martens</name>
</author>
<author>
<name sortKey="Hermjakob, H" uniqKey="Hermjakob H">H Hermjakob</name>
</author>
<author>
<name sortKey="Jones, P" uniqKey="Jones P">P Jones</name>
</author>
<author>
<name sortKey="Adamski, M" uniqKey="Adamski M">M Adamski</name>
</author>
<author>
<name sortKey="Taylor, C" uniqKey="Taylor C">C Taylor</name>
</author>
<author>
<name sortKey="States, D" uniqKey="States D">D States</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Benson, Da" uniqKey="Benson D">DA Benson</name>
</author>
<author>
<name sortKey="Karsch Mizrachi, I" uniqKey="Karsch Mizrachi I">I Karsch-Mizrachi</name>
</author>
<author>
<name sortKey="Lipman, Dj" uniqKey="Lipman D">DJ Lipman</name>
</author>
<author>
<name sortKey="Ostell, J" uniqKey="Ostell J">J Ostell</name>
</author>
<author>
<name sortKey="Rapp, Ba" uniqKey="Rapp B">BA Rapp</name>
</author>
<author>
<name sortKey="Wheeler, Dl" uniqKey="Wheeler D">DL Wheeler</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Apweiler, R" uniqKey="Apweiler R">R Apweiler</name>
</author>
<author>
<name sortKey="Bairoch, A" uniqKey="Bairoch A">A Bairoch</name>
</author>
<author>
<name sortKey="Wu, Ch" uniqKey="Wu C">CH Wu</name>
</author>
<author>
<name sortKey="Barker, Wc" uniqKey="Barker W">WC Barker</name>
</author>
<author>
<name sortKey="Boeckmann, B" uniqKey="Boeckmann B">B Boeckmann</name>
</author>
<author>
<name sortKey="Ferro, S" uniqKey="Ferro S">S Ferro</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Cunningham, F" uniqKey="Cunningham F">F Cunningham</name>
</author>
<author>
<name sortKey="Amode, Mr" uniqKey="Amode M">MR Amode</name>
</author>
<author>
<name sortKey="Barrell, D" uniqKey="Barrell D">D Barrell</name>
</author>
<author>
<name sortKey="Beal, K" uniqKey="Beal K">K Beal</name>
</author>
<author>
<name sortKey="Billis, K" uniqKey="Billis K">K Billis</name>
</author>
<author>
<name sortKey="Brent, S" uniqKey="Brent S">S Brent</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Pruitt, Kd" uniqKey="Pruitt K">KD Pruitt</name>
</author>
<author>
<name sortKey="Tatusova, T" uniqKey="Tatusova T">T Tatusova</name>
</author>
<author>
<name sortKey="Maglott, Dr" uniqKey="Maglott D">DR Maglott</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Juty, N" uniqKey="Juty N">N Juty</name>
</author>
<author>
<name sortKey="Le Novere, N" uniqKey="Le Novere N">N Le Novere</name>
</author>
<author>
<name sortKey="Laibe, C" uniqKey="Laibe C">C Laibe</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Cote, Rg" uniqKey="Cote R">RG Cote</name>
</author>
<author>
<name sortKey="Jones, P" uniqKey="Jones P">P Jones</name>
</author>
<author>
<name sortKey="Martens, L" uniqKey="Martens L">L Martens</name>
</author>
<author>
<name sortKey="Kerrien, S" uniqKey="Kerrien S">S Kerrien</name>
</author>
<author>
<name sortKey="Reisinger, F" uniqKey="Reisinger F">F Reisinger</name>
</author>
<author>
<name sortKey="Lin, Q" uniqKey="Lin Q">Q Lin</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Huang, Daw" uniqKey="Huang D">daW Huang</name>
</author>
<author>
<name sortKey="Sherman, Bt" uniqKey="Sherman B">BT Sherman</name>
</author>
<author>
<name sortKey="Stephens, R" uniqKey="Stephens R">R Stephens</name>
</author>
<author>
<name sortKey="Baseler, Mw" uniqKey="Baseler M">MW Baseler</name>
</author>
<author>
<name sortKey="Lane, Hc" uniqKey="Lane H">HC Lane</name>
</author>
<author>
<name sortKey="Lempicki, Ra" uniqKey="Lempicki R">RA Lempicki</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Orchard, S" uniqKey="Orchard S">S Orchard</name>
</author>
<author>
<name sortKey="Kerrien, S" uniqKey="Kerrien S">S Kerrien</name>
</author>
<author>
<name sortKey="Abbani, S" uniqKey="Abbani S">S Abbani</name>
</author>
<author>
<name sortKey="Aranda, B" uniqKey="Aranda B">B Aranda</name>
</author>
<author>
<name sortKey="Bhate, J" uniqKey="Bhate J">J Bhate</name>
</author>
<author>
<name sortKey="Bidwell, S" uniqKey="Bidwell S">S Bidwell</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Chatr Aryamontri, A" uniqKey="Chatr Aryamontri A">A Chatr-aryamontri</name>
</author>
<author>
<name sortKey="Ceol, A" uniqKey="Ceol A">A Ceol</name>
</author>
<author>
<name sortKey="Palazzi, Lm" uniqKey="Palazzi L">LM Palazzi</name>
</author>
<author>
<name sortKey="Nardelli, G" uniqKey="Nardelli G">G Nardelli</name>
</author>
<author>
<name sortKey="Schneider, Mv" uniqKey="Schneider M">MV Schneider</name>
</author>
<author>
<name sortKey="Castagnoli, L" uniqKey="Castagnoli L">L Castagnoli</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Xenarios, I" uniqKey="Xenarios I">I Xenarios</name>
</author>
<author>
<name sortKey="Salwinski, L" uniqKey="Salwinski L">L Salwinski</name>
</author>
<author>
<name sortKey="Duan, Xj" uniqKey="Duan X">XJ Duan</name>
</author>
<author>
<name sortKey="Higney, P" uniqKey="Higney P">P Higney</name>
</author>
<author>
<name sortKey="Kim, Sm" uniqKey="Kim S">SM Kim</name>
</author>
<author>
<name sortKey="Eisenberg, D" uniqKey="Eisenberg D">D Eisenberg</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Brazma, A" uniqKey="Brazma A">A Brazma</name>
</author>
<author>
<name sortKey="Hingamp, P" uniqKey="Hingamp P">P Hingamp</name>
</author>
<author>
<name sortKey="Quackenbush, J" uniqKey="Quackenbush J">J Quackenbush</name>
</author>
<author>
<name sortKey="Sherlock, G" uniqKey="Sherlock G">G Sherlock</name>
</author>
<author>
<name sortKey="Spellman, P" uniqKey="Spellman P">P Spellman</name>
</author>
<author>
<name sortKey="Stoeckert, C" uniqKey="Stoeckert C">C Stoeckert</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Taylor, Cf" uniqKey="Taylor C">CF Taylor</name>
</author>
<author>
<name sortKey="Field, D" uniqKey="Field D">D Field</name>
</author>
<author>
<name sortKey="Sansone, S A" uniqKey="Sansone S">S-A Sansone</name>
</author>
<author>
<name sortKey="Aerts, J" uniqKey="Aerts J">J Aerts</name>
</author>
<author>
<name sortKey="Apweiler, R" uniqKey="Apweiler R">R Apweiler</name>
</author>
<author>
<name sortKey="Ashburner, M" uniqKey="Ashburner M">M Ashburner</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Sweet, Jj" uniqKey="Sweet J">JJ Sweet</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Orchard, S" uniqKey="Orchard S">S Orchard</name>
</author>
<author>
<name sortKey="Al Lazikani, B" uniqKey="Al Lazikani B">B Al-Lazikani</name>
</author>
<author>
<name sortKey="Bryant, S" uniqKey="Bryant S">S Bryant</name>
</author>
<author>
<name sortKey="Clark, D" uniqKey="Clark D">D Clark</name>
</author>
<author>
<name sortKey="Calder, E" uniqKey="Calder E">E Calder</name>
</author>
<author>
<name sortKey="Dix, I" uniqKey="Dix I">I Dix</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Orchard, S" uniqKey="Orchard S">S Orchard</name>
</author>
<author>
<name sortKey="Salwinski, L" uniqKey="Salwinski L">L Salwinski</name>
</author>
<author>
<name sortKey="Kerrien, S" uniqKey="Kerrien S">S Kerrien</name>
</author>
<author>
<name sortKey="Montecchi Palazzi, L" uniqKey="Montecchi Palazzi L">L Montecchi-Palazzi</name>
</author>
<author>
<name sortKey="Oesterheld, M" uniqKey="Oesterheld M">M Oesterheld</name>
</author>
<author>
<name sortKey="Stumpflen, V" uniqKey="Stumpflen V">V Stumpflen</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Sansone, S A" uniqKey="Sansone S">S-A Sansone</name>
</author>
<author>
<name sortKey="Rocca Serra, P" uniqKey="Rocca Serra P">P Rocca-Serra</name>
</author>
<author>
<name sortKey="Field, D" uniqKey="Field D">D Field</name>
</author>
<author>
<name sortKey="Maguire, E" uniqKey="Maguire E">E Maguire</name>
</author>
<author>
<name sortKey="Taylor, C" uniqKey="Taylor C">C Taylor</name>
</author>
<author>
<name sortKey="Hofmann, O" uniqKey="Hofmann O">O Hofmann</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Hucka, M" uniqKey="Hucka M">M Hucka</name>
</author>
<author>
<name sortKey="Nickerson, Dp" uniqKey="Nickerson D">DP Nickerson</name>
</author>
<author>
<name sortKey="Bader, Gd" uniqKey="Bader G">GD Bader</name>
</author>
<author>
<name sortKey="Bergmann, Ft" uniqKey="Bergmann F">FT Bergmann</name>
</author>
<author>
<name sortKey="Cooper, J" uniqKey="Cooper J">J Cooper</name>
</author>
<author>
<name sortKey="Demir, E" uniqKey="Demir E">E Demir</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Orchard, S" uniqKey="Orchard S">S Orchard</name>
</author>
<author>
<name sortKey="Hermjakob, H" uniqKey="Hermjakob H">H Hermjakob</name>
</author>
<author>
<name sortKey="Apweiler, R" uniqKey="Apweiler R">R Apweiler</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Knoppers, Bm" uniqKey="Knoppers B">BM Knoppers</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Nakamura, Y" uniqKey="Nakamura Y">Y Nakamura</name>
</author>
<author>
<name sortKey="Cochrane, G" uniqKey="Cochrane G">G Cochrane</name>
</author>
<author>
<name sortKey="Karsch Mizrachi, I" uniqKey="Karsch Mizrachi I">I Karsch-Mizrachi</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Hermjakob, H" uniqKey="Hermjakob H">H Hermjakob</name>
</author>
<author>
<name sortKey="Apweiler, R" uniqKey="Apweiler R">R Apweiler</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Demir, E" uniqKey="Demir E">E Demir</name>
</author>
<author>
<name sortKey="Cary, Mp" uniqKey="Cary M">MP Cary</name>
</author>
<author>
<name sortKey="Paley, S" uniqKey="Paley S">S Paley</name>
</author>
<author>
<name sortKey="Fukuda, K" uniqKey="Fukuda K">K Fukuda</name>
</author>
<author>
<name sortKey="Lemer, C" uniqKey="Lemer C">C Lemer</name>
</author>
<author>
<name sortKey="Vastrik, I" uniqKey="Vastrik I">I Vastrik</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Crosswell, Lc" uniqKey="Crosswell L">LC Crosswell</name>
</author>
<author>
<name sortKey="Thornton, Jm" uniqKey="Thornton J">JM Thornton</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Yuille, M" uniqKey="Yuille M">M Yuille</name>
</author>
<author>
<name sortKey="Van Ommen, Gj" uniqKey="Van Ommen G">GJ van Ommen</name>
</author>
<author>
<name sortKey="Brechot, C" uniqKey="Brechot C">C Brechot</name>
</author>
<author>
<name sortKey="Cambon Thomsen, A" uniqKey="Cambon Thomsen A">A Cambon-Thomsen</name>
</author>
<author>
<name sortKey="Dagher, G" uniqKey="Dagher G">G Dagher</name>
</author>
<author>
<name sortKey="Landegren, U" uniqKey="Landegren U">U Landegren</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Margolis, R" uniqKey="Margolis R">R Margolis</name>
</author>
<author>
<name sortKey="Derr, L" uniqKey="Derr L">L Derr</name>
</author>
<author>
<name sortKey="Dunn, M" uniqKey="Dunn M">M Dunn</name>
</author>
<author>
<name sortKey="Huerta, M" uniqKey="Huerta M">M Huerta</name>
</author>
<author>
<name sortKey="Larkin, J" uniqKey="Larkin J">J Larkin</name>
</author>
<author>
<name sortKey="Sheehan, J" uniqKey="Sheehan J">J Sheehan</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Klech, H" uniqKey="Klech H">H Klech</name>
</author>
<author>
<name sortKey="Brooksbank, C" uniqKey="Brooksbank C">C Brooksbank</name>
</author>
<author>
<name sortKey="Price, S" uniqKey="Price S">S Price</name>
</author>
<author>
<name sortKey="Verpillat, P" uniqKey="Verpillat P">P Verpillat</name>
</author>
<author>
<name sortKey="Buhler, Fr" uniqKey="Buhler F">FR Buhler</name>
</author>
<author>
<name sortKey="Dubois, D" uniqKey="Dubois D">D Dubois</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Doiron, D" uniqKey="Doiron D">D Doiron</name>
</author>
<author>
<name sortKey="Burton, P" uniqKey="Burton P">P Burton</name>
</author>
<author>
<name sortKey="Marcon, Y" uniqKey="Marcon Y">Y Marcon</name>
</author>
<author>
<name sortKey="Gaye, A" uniqKey="Gaye A">A Gaye</name>
</author>
<author>
<name sortKey="Wolffenbuttel, Bh" uniqKey="Wolffenbuttel B">BH Wolffenbuttel</name>
</author>
<author>
<name sortKey="Perola, M" uniqKey="Perola M">M Perola</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Basset, A" uniqKey="Basset A">A Basset</name>
</author>
<author>
<name sortKey="Los, W" uniqKey="Los W">W Los</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Pettifer, S" uniqKey="Pettifer S">S Pettifer</name>
</author>
<author>
<name sortKey="Thorne, D" uniqKey="Thorne D">D Thorne</name>
</author>
<author>
<name sortKey="Mcdermott, P" uniqKey="Mcdermott P">P McDermott</name>
</author>
<author>
<name sortKey="Marsh, J" uniqKey="Marsh J">J Marsh</name>
</author>
<author>
<name sortKey="Villeger, A" uniqKey="Villeger A">A Villeger</name>
</author>
<author>
<name sortKey="Kell, Db" uniqKey="Kell D">DB Kell</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Gehlenborg, N" uniqKey="Gehlenborg N">N Gehlenborg</name>
</author>
<author>
<name sortKey="O Onoghue, Si" uniqKey="O Onoghue S">SI O’Donoghue</name>
</author>
<author>
<name sortKey="Baliga, Ns" uniqKey="Baliga N">NS Baliga</name>
</author>
<author>
<name sortKey="Goesmann, A" uniqKey="Goesmann A">A Goesmann</name>
</author>
<author>
<name sortKey="Hibbs, Ma" uniqKey="Hibbs M">MA Hibbs</name>
</author>
<author>
<name sortKey="Kitano, H" uniqKey="Kitano H">H Kitano</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Thorvaldsdottir, H" uniqKey="Thorvaldsdottir H">H Thorvaldsdottir</name>
</author>
<author>
<name sortKey="Robinson, Jt" uniqKey="Robinson J">JT Robinson</name>
</author>
<author>
<name sortKey="Mesirov, Jp" uniqKey="Mesirov J">JP Mesirov</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Kent, Wj" uniqKey="Kent W">WJ Kent</name>
</author>
<author>
<name sortKey="Sugnet, Cw" uniqKey="Sugnet C">CW Sugnet</name>
</author>
<author>
<name sortKey="Furey, Ts" uniqKey="Furey T">TS Furey</name>
</author>
<author>
<name sortKey="Roskin, Km" uniqKey="Roskin K">KM Roskin</name>
</author>
<author>
<name sortKey="Pringle, Th" uniqKey="Pringle T">TH Pringle</name>
</author>
<author>
<name sortKey="Zahler, Am" uniqKey="Zahler A">AM Zahler</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Hubbard, T" uniqKey="Hubbard T">T Hubbard</name>
</author>
<author>
<name sortKey="Barker, D" uniqKey="Barker D">D Barker</name>
</author>
<author>
<name sortKey="Birney, E" uniqKey="Birney E">E Birney</name>
</author>
<author>
<name sortKey="Cameron, G" uniqKey="Cameron G">G Cameron</name>
</author>
<author>
<name sortKey="Chen, Y" uniqKey="Chen Y">Y Chen</name>
</author>
<author>
<name sortKey="Clark, L" uniqKey="Clark L">L Clark</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Engels, R" uniqKey="Engels R">R Engels</name>
</author>
<author>
<name sortKey="Yu, T" uniqKey="Yu T">T Yu</name>
</author>
<author>
<name sortKey="Burge, C" uniqKey="Burge C">C Burge</name>
</author>
<author>
<name sortKey="Mesirov, Jp" uniqKey="Mesirov J">JP Mesirov</name>
</author>
<author>
<name sortKey="Decaprio, D" uniqKey="Decaprio D">D DeCaprio</name>
</author>
<author>
<name sortKey="Galagan, Je" uniqKey="Galagan J">JE Galagan</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Shannon, Pt" uniqKey="Shannon P">PT Shannon</name>
</author>
<author>
<name sortKey="Reiss, Dj" uniqKey="Reiss D">DJ Reiss</name>
</author>
<author>
<name sortKey="Bonneau, R" uniqKey="Bonneau R">R Bonneau</name>
</author>
<author>
<name sortKey="Baliga, Ns" uniqKey="Baliga N">NS Baliga</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Frazer, Ka" uniqKey="Frazer K">KA Frazer</name>
</author>
<author>
<name sortKey="Pachter, L" uniqKey="Pachter L">L Pachter</name>
</author>
<author>
<name sortKey="Poliakov, A" uniqKey="Poliakov A">A Poliakov</name>
</author>
<author>
<name sortKey="Rubin, Em" uniqKey="Rubin E">EM Rubin</name>
</author>
<author>
<name sortKey="Dubchak, I" uniqKey="Dubchak I">I Dubchak</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Pavlopoulos, Ga" uniqKey="Pavlopoulos G">GA Pavlopoulos</name>
</author>
<author>
<name sortKey="Wegener, Al" uniqKey="Wegener A">AL Wegener</name>
</author>
<author>
<name sortKey="Schneider, R" uniqKey="Schneider R">R Schneider</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Andersson, L" uniqKey="Andersson L">L Andersson</name>
</author>
<author>
<name sortKey="Archibald, Al" uniqKey="Archibald A">AL Archibald</name>
</author>
<author>
<name sortKey="Bottema, Cd" uniqKey="Bottema C">CD Bottema</name>
</author>
<author>
<name sortKey="Brauning, R" uniqKey="Brauning R">R Brauning</name>
</author>
<author>
<name sortKey="Burgess, Sc" uniqKey="Burgess S">SC Burgess</name>
</author>
<author>
<name sortKey="Burt, Dw" uniqKey="Burt D">DW Burt</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Howe, D" uniqKey="Howe D">D Howe</name>
</author>
<author>
<name sortKey="Costanzo, M" uniqKey="Costanzo M">M Costanzo</name>
</author>
<author>
<name sortKey="Fey, P" uniqKey="Fey P">P Fey</name>
</author>
<author>
<name sortKey="Gojobori, T" uniqKey="Gojobori T">T Gojobori</name>
</author>
<author>
<name sortKey="Hannick, L" uniqKey="Hannick L">L Hannick</name>
</author>
<author>
<name sortKey="Hide, W" uniqKey="Hide W">W Hide</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Stein, L" uniqKey="Stein L">L Stein</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Haw, R" uniqKey="Haw R">R Haw</name>
</author>
<author>
<name sortKey="Hermjakob, H" uniqKey="Hermjakob H">H Hermjakob</name>
</author>
<author>
<name sortKey="D Ustachio, P" uniqKey="D Ustachio P">P D’Eustachio</name>
</author>
<author>
<name sortKey="Stein, L" uniqKey="Stein L">L Stein</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Tanabe, M" uniqKey="Tanabe M">M Tanabe</name>
</author>
<author>
<name sortKey="Kanehisa, M" uniqKey="Kanehisa M">M Kanehisa</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Wang, J" uniqKey="Wang J">J Wang</name>
</author>
<author>
<name sortKey="Zhang, Y" uniqKey="Zhang Y">Y Zhang</name>
</author>
<author>
<name sortKey="Marian, C" uniqKey="Marian C">C Marian</name>
</author>
<author>
<name sortKey="Ressom, Hw" uniqKey="Ressom H">HW Ressom</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Mlecnik, B" uniqKey="Mlecnik B">B Mlecnik</name>
</author>
<author>
<name sortKey="Scheideler, M" uniqKey="Scheideler M">M Scheideler</name>
</author>
<author>
<name sortKey="Hackl, H" uniqKey="Hackl H">H Hackl</name>
</author>
<author>
<name sortKey="Hartler, J" uniqKey="Hartler J">J Hartler</name>
</author>
<author>
<name sortKey="Sanchez Cabo, F" uniqKey="Sanchez Cabo F">F Sanchez-Cabo</name>
</author>
<author>
<name sortKey="Trajanoski, Z" uniqKey="Trajanoski Z">Z Trajanoski</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Gotz, S" uniqKey="Gotz S">S Gotz</name>
</author>
<author>
<name sortKey="Garcia Gomez, Jm" uniqKey="Garcia Gomez J">JM Garcia-Gomez</name>
</author>
<author>
<name sortKey="Terol, J" uniqKey="Terol J">J Terol</name>
</author>
<author>
<name sortKey="Williams, Td" uniqKey="Williams T">TD Williams</name>
</author>
<author>
<name sortKey="Nagaraj, Sh" uniqKey="Nagaraj S">SH Nagaraj</name>
</author>
<author>
<name sortKey="Nueda, Mj" uniqKey="Nueda M">MJ Nueda</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Huang, Daw" uniqKey="Huang D">daW Huang</name>
</author>
<author>
<name sortKey="Sherman, Bt" uniqKey="Sherman B">BT Sherman</name>
</author>
<author>
<name sortKey="Lempicki, Ra" uniqKey="Lempicki R">RA Lempicki</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Stobbe, Md" uniqKey="Stobbe M">MD Stobbe</name>
</author>
<author>
<name sortKey="Jansen, Ga" uniqKey="Jansen G">GA Jansen</name>
</author>
<author>
<name sortKey="Moerland, Pd" uniqKey="Moerland P">PD Moerland</name>
</author>
<author>
<name sortKey="Van Kampen, Ah" uniqKey="Van Kampen A">AH van Kampen</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Walter, T" uniqKey="Walter T">T Walter</name>
</author>
<author>
<name sortKey="Shattuck, Dw" uniqKey="Shattuck D">DW Shattuck</name>
</author>
<author>
<name sortKey="Baldock, R" uniqKey="Baldock R">R Baldock</name>
</author>
<author>
<name sortKey="Bastin, Me" uniqKey="Bastin M">ME Bastin</name>
</author>
<author>
<name sortKey="Carpenter, Ae" uniqKey="Carpenter A">AE Carpenter</name>
</author>
<author>
<name sortKey="Duce, S" uniqKey="Duce S">S Duce</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Giardine, B" uniqKey="Giardine B">B Giardine</name>
</author>
<author>
<name sortKey="Riemer, C" uniqKey="Riemer C">C Riemer</name>
</author>
<author>
<name sortKey="Hardison, Rc" uniqKey="Hardison R">RC Hardison</name>
</author>
<author>
<name sortKey="Burhans, R" uniqKey="Burhans R">R Burhans</name>
</author>
<author>
<name sortKey="Elnitski, L" uniqKey="Elnitski L">L Elnitski</name>
</author>
<author>
<name sortKey="Shah, P" uniqKey="Shah P">P Shah</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Shannon, P" uniqKey="Shannon P">P Shannon</name>
</author>
<author>
<name sortKey="Markiel, A" uniqKey="Markiel A">A Markiel</name>
</author>
<author>
<name sortKey="Ozier, O" uniqKey="Ozier O">O Ozier</name>
</author>
<author>
<name sortKey="Baliga, Ns" uniqKey="Baliga N">NS Baliga</name>
</author>
<author>
<name sortKey="Wang, Jt" uniqKey="Wang J">JT Wang</name>
</author>
<author>
<name sortKey="Ramage, D" uniqKey="Ramage D">D Ramage</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Smoot, Me" uniqKey="Smoot M">ME Smoot</name>
</author>
<author>
<name sortKey="Ono, K" uniqKey="Ono K">K Ono</name>
</author>
<author>
<name sortKey="Ruscheinski, J" uniqKey="Ruscheinski J">J Ruscheinski</name>
</author>
<author>
<name sortKey="Wang, Pl" uniqKey="Wang P">PL Wang</name>
</author>
<author>
<name sortKey="Ideker, T" uniqKey="Ideker T">T Ideker</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Kohler, J" uniqKey="Kohler J">J Kohler</name>
</author>
<author>
<name sortKey="Baumbach, J" uniqKey="Baumbach J">J Baumbach</name>
</author>
<author>
<name sortKey="Taubert, J" uniqKey="Taubert J">J Taubert</name>
</author>
<author>
<name sortKey="Specht, M" uniqKey="Specht M">M Specht</name>
</author>
<author>
<name sortKey="Skusa, A" uniqKey="Skusa A">A Skusa</name>
</author>
<author>
<name sortKey="Ruegg, A" uniqKey="Ruegg A">A Ruegg</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Goff, Sa" uniqKey="Goff S">SA Goff</name>
</author>
<author>
<name sortKey="Vaughn, M" uniqKey="Vaughn M">M Vaughn</name>
</author>
<author>
<name sortKey="Mckay, S" uniqKey="Mckay S">S McKay</name>
</author>
<author>
<name sortKey="Lyons, E" uniqKey="Lyons E">E Lyons</name>
</author>
<author>
<name sortKey="Stapleton, Ae" uniqKey="Stapleton A">AE Stapleton</name>
</author>
<author>
<name sortKey="Gessler, D" uniqKey="Gessler D">D Gessler</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Luo, W" uniqKey="Luo W">W Luo</name>
</author>
<author>
<name sortKey="Brouwer, C" uniqKey="Brouwer C">C Brouwer</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Attwood, Tk" uniqKey="Attwood T">TK Attwood</name>
</author>
<author>
<name sortKey="Kell, Db" uniqKey="Kell D">DB Kell</name>
</author>
<author>
<name sortKey="Mcdermott, P" uniqKey="Mcdermott P">P McDermott</name>
</author>
<author>
<name sortKey="Marsh, J" uniqKey="Marsh J">J Marsh</name>
</author>
<author>
<name sortKey="Pettifer, Sr" uniqKey="Pettifer S">SR Pettifer</name>
</author>
<author>
<name sortKey="Thorne, D" uniqKey="Thorne D">D Thorne</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Attwood, Tk" uniqKey="Attwood T">TK Attwood</name>
</author>
<author>
<name sortKey="Kell, Db" uniqKey="Kell D">DB Kell</name>
</author>
<author>
<name sortKey="Mcdermott, P" uniqKey="Mcdermott P">P McDermott</name>
</author>
<author>
<name sortKey="Marsh, J" uniqKey="Marsh J">J Marsh</name>
</author>
<author>
<name sortKey="Pettifer, Sr" uniqKey="Pettifer S">SR Pettifer</name>
</author>
<author>
<name sortKey="Thorne, D" uniqKey="Thorne D">D Thorne</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Gomez, J" uniqKey="Gomez J">J Gomez</name>
</author>
<author>
<name sortKey="Garcia, Lj" uniqKey="Garcia L">LJ Garcia</name>
</author>
<author>
<name sortKey="Salazar, Ga" uniqKey="Salazar G">GA Salazar</name>
</author>
<author>
<name sortKey="Villaveces, J" uniqKey="Villaveces J">J Villaveces</name>
</author>
<author>
<name sortKey="Gore, S" uniqKey="Gore S">S Gore</name>
</author>
<author>
<name sortKey="Garcia, A" uniqKey="Garcia A">A Garcia</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Treloar, A" uniqKey="Treloar A">A Treloar</name>
</author>
</analytic>
</biblStruct>
</listBibl>
</div1>
</back>
</TEI>
<pmc article-type="review-article">
<pmc-dir>properties open_access</pmc-dir>
<front>
<journal-meta>
<journal-id journal-id-type="nlm-ta">J Biol Res (Thessalon)</journal-id>
<journal-id journal-id-type="iso-abbrev">J Biol Res (Thessalon)</journal-id>
<journal-title-group>
<journal-title>Journal of Biological Research</journal-title>
</journal-title-group>
<issn pub-type="ppub">1790-045X</issn>
<issn pub-type="epub">2241-5793</issn>
<publisher>
<publisher-name>BioMed Central</publisher-name>
<publisher-loc>London</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="pmid">26336651</article-id>
<article-id pub-id-type="pmc">4557916</article-id>
<article-id pub-id-type="publisher-id">32</article-id>
<article-id pub-id-type="doi">10.1186/s40709-015-0032-5</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Review</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Data integration in biological research: an overview</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Lapatas</surname>
<given-names>Vasileios</given-names>
</name>
<address>
<email>piar301@gmail.com</email>
</address>
<xref ref-type="aff" rid="Aff1"></xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Stefanidakis</surname>
<given-names>Michalis</given-names>
</name>
<address>
<email>mistral@ionio.gr</email>
</address>
<xref ref-type="aff" rid="Aff1"></xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Jimenez</surname>
<given-names>Rafael C.</given-names>
</name>
<address>
<email>Rafael.Jimenez@elixir-europe.org</email>
</address>
<xref ref-type="aff" rid="Aff2"></xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Via</surname>
<given-names>Allegra</given-names>
</name>
<address>
<email>allegra.via@uniroma1.it</email>
</address>
<xref ref-type="aff" rid="Aff3"></xref>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Schneider</surname>
<given-names>Maria Victoria</given-names>
</name>
<address>
<email>Vicky.Schneider@tgac.ac.uk</email>
</address>
<xref ref-type="aff" rid="Aff4"></xref>
</contrib>
<aff id="Aff1">
<label></label>
Department of Informatics, Ionian University, 7 Tsirigoti Square, Corfu, 49100 Greece</aff>
<aff id="Aff2">
<label></label>
ELIXIR, Wellcome Trust Genome Campus, Hinxton, CB10 1SD UK</aff>
<aff id="Aff3">
<label></label>
Biocomputing Group, Sapienza University, Piazzale Aldo Moro 5, Rome, 00185 Italy</aff>
<aff id="Aff4">
<label></label>
361° Division, The Genome Analysis Centre, Norwich Research Park, Norwich, NR4 7UH UK</aff>
</contrib-group>
<pub-date pub-type="epub">
<day>2</day>
<month>9</month>
<year>2015</year>
</pub-date>
<pub-date pub-type="pmc-release">
<day>2</day>
<month>9</month>
<year>2015</year>
</pub-date>
<pub-date pub-type="collection">
<month>12</month>
<year>2015</year>
</pub-date>
<volume>22</volume>
<issue>1</issue>
<elocation-id>9</elocation-id>
<history>
<date date-type="received">
<day>20</day>
<month>4</month>
<year>2015</year>
</date>
<date date-type="accepted">
<day>10</day>
<month>8</month>
<year>2015</year>
</date>
</history>
<permissions>
<copyright-statement>© Lapatas et al. 2015</copyright-statement>
<license license-type="OpenAccess">
<license-p>
<bold>Open Access</bold>
This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (
<ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/licenses/by/4.0/">http://creativecommons.org/licenses/by/4.0/</ext-link>
), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(
<ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/publicdomain/zero/1.0/">http://creativecommons.org/publicdomain/zero/1.0/</ext-link>
) applies to the data made available in this article, unless otherwise stated.</license-p>
</license>
</permissions>
<abstract id="Abs1">
<p>Data sharing, integration and annotation are essential to ensure the reproducibility of the analysis and interpretation of the experimental findings. Often these activities are perceived as a role that bioinformaticians and computer scientists have to take with no or little input from the experimental biologist. On the contrary, biological researchers, being the producers and often the end users of such data, have a big role in enabling biological data integration. The quality and usefulness of data integration depend on the existence and adoption of standards, shared formats, and mechanisms that are suitable for biological researchers to submit and annotate the data, so it can be easily searchable, conveniently linked and consequently used for further biological analysis and discovery. Here, we provide background on what is data integration from a computational science point of view, how it has been applied to biological research, which key aspects contributed to its success and future directions.</p>
</abstract>
<kwd-group xml:lang="en">
<title>Keywords</title>
<kwd>Data integration</kwd>
<kwd>Standards</kwd>
<kwd>Bioinformatics</kwd>
<kwd>Data driven</kwd>
<kwd>Open sciences</kwd>
</kwd-group>
<custom-meta-group>
<custom-meta>
<meta-name>issue-copyright-statement</meta-name>
<meta-value>© The Author(s) 2015</meta-value>
</custom-meta>
</custom-meta-group>
</article-meta>
</front>
<body>
<sec id="Sec1" sec-type="introduction">
<title>Introduction</title>
<p>Data driven biological research has made data integration strategies crucial for the advancements and discovery in a plethora of fields (e.g. genomics, proteomics, metabolomics, environmental sciences, clinical research to name a few) [
<xref ref-type="bibr" rid="CR1">1</xref>
<xref ref-type="bibr" rid="CR6">6</xref>
]. Technically, solutions for data integration have been developed and applied in both corporate and academic sectors. When it comes to biological research, there are different interpretations and levels of data integration people seem to consider [
<xref ref-type="bibr" rid="CR7">7</xref>
<xref ref-type="bibr" rid="CR14">14</xref>
], ranging from genomic data to protein-protein interactions.</p>
<p>Together with data production, there is no doubt that data management, storage and consequently retrieval, analysis and interpretation are at the core of any biological research project. Moreover, the ability to have access to the actual data sets used in a particular study is often crucial for reproducibility and expansion of such study, hence the emphasis in recent years on Open Science and the various initiatives associated [
<xref ref-type="bibr" rid="CR15">15</xref>
<xref ref-type="bibr" rid="CR21">21</xref>
]. Noticeably, in biological research, the difficulties associated with data integration have only expanded with the advent of high throughput technologies [
<xref ref-type="bibr" rid="CR3">3</xref>
,
<xref ref-type="bibr" rid="CR22">22</xref>
,
<xref ref-type="bibr" rid="CR23">23</xref>
]. Anyone working with Next Generation Sequencing (NGS) faces challenges associated with a variety of aspects this type of data brings, one of the major being: the volume of the data [
<xref ref-type="bibr" rid="CR24">24</xref>
,
<xref ref-type="bibr" rid="CR25">25</xref>
].</p>
<p>Here, we refer to data integration as the computational solution allowing users, from end user (GUI) to power users (API), to fetch data from different sources, combine, manipulate and re-analyse them as well as being able to create new datasets and share these again with the scientific community.</p>
<p>With this definition in mind, it is clear that data integration solutions are imperative for the advancement of research in biological sciences as well as the mechanisms to make such processes traceable, shareable hence “integrable” [
<xref ref-type="bibr" rid="CR26">26</xref>
<xref ref-type="bibr" rid="CR28">28</xref>
]. Here, we provide an overview of the strategies most commonly adopted by the biological research community, current challenges and future directions.</p>
<sec id="Sec2">
<title>Key concepts and terminology</title>
<p>Data integration should not just rely on software engineers and computational scientists, but needs to be driven by the actual users whose communities need to define, adopt and use standards, ontologies and annotation best practice. Therefore, it is particularly important for the biological research community to get acquainted with the conceptual basis of data integration, its limitations, challenges and actual terminology.</p>
<p>In order to familiarise the experimental biology community of readers, in Table
<xref rid="Tab1" ref-type="table">1</xref>
we present key concepts, definitions and terms used by bioinformaticians and computer scientists.
<table-wrap id="Tab1">
<label>Table 1</label>
<caption>
<p>Terminology</p>
</caption>
<table frame="hsides" rules="groups">
<tbody>
<tr>
<td align="left">Schema</td>
<td align="left">A structured and “queryable” way of storing data</td>
</tr>
<tr>
<td align="left">Database</td>
<td align="left">A single or collection of schemata</td>
</tr>
<tr>
<td align="left">Sources</td>
<td align="left">A number of databases that contain data. Data that reside in each source can either duplicate and/or complement data from other sources</td>
</tr>
<tr>
<td align="left">Data Integration</td>
<td align="left">The process of combining data that reside in different sources, to provide users with a unified view of such data</td>
</tr>
<tr>
<td align="left">Data Standards</td>
<td align="left">Agreements on representation, format, and definition for common data</td>
</tr>
<tr>
<td align="left">Data Formats</td>
<td align="left">A structured way to represent data and metadata in a file</td>
</tr>
<tr>
<td align="left">Data Warehousing</td>
<td align="left">Model for integrating data where the data from different sources reside on a central repository (aka data warehouse)</td>
</tr>
<tr>
<td align="left">Federated Databases</td>
<td align="left">Model for integrating data where the data reside on the original sources and users are provided with a unified view of the data based on mapping mechanisms of the information</td>
</tr>
<tr>
<td align="left">Linked Data</td>
<td align="left">The network of interlinked data that is available on the web. It is used to automatically share semantically rich information and represents the biggest attempt to convert significant amounts of human knowledge across all fields in a computer readable format</td>
</tr>
<tr>
<td align="left">Ontology</td>
<td align="left">A structured way of describing data, often presented in a computer-readable format. In bioinformatics, ontologies are sets of unambiguous, universally agreed terms used to describe biological phenomena and “entities”, their properties and their relationships</td>
</tr>
<tr>
<td align="left">lled Vocabulary</td>
<td align="left">A collection of terms for describing a certain domain of interest</td>
</tr>
<tr>
<td align="left">Unique Identifier</td>
<td align="left">A unique representation for a biological entity (molecule, organism, ontology term, etc.). Usually an alphanumeric string that is used to refer to this entity and distinguishes it from others (much like ID or passport number in humans).</td>
</tr>
<tr>
<td align="left">Metadata</td>
<td align="left">Data describing data, i.e., additional information (e.g., a comment, explanation, attributes, etc.) for a specific biological entity or process. As an example, in the context of an ontology, this is used to specify significant properties of the ontology</td>
</tr>
<tr>
<td align="left">Annotation</td>
<td align="left">The process of attaching relevant information (metadata) to a raw biological entity</td>
</tr>
<tr>
<td align="left">Automatic Annotation</td>
<td align="left">Automatic means that the annotation is being done by computer software (often by transferring information from a source to another). This is a way of producing a large amount of metadata</td>
</tr>
<tr>
<td align="left">Manual Annotation</td>
<td align="left">As opposed to automatic annotation, manual means that an actual individual does it</td>
</tr>
<tr>
<td align="left">GUI</td>
<td align="left">Graphical User Interface. Is the way that a user interacts with a computer by using graphical icons and visual indicators such as buttons, forms etc. In the scope of this paper we are using the term GUI to refer to interfaces that allow biologists to search/read/edit integrated biological data</td>
</tr>
<tr>
<td align="left">API</td>
<td align="left">Application Programming Interface. Set of tool and protocols that a power user can use in order to automatically gain access to functionality and/or data that have been developed/gathered by another individual/organisation</td>
</tr>
<tr>
<td align="left">UX</td>
<td align="left">User eXperience. The process of improving user satisfaction by focusing on the usability of a given product.</td>
</tr>
<tr>
<td align="left">Visualisation Tools</td>
<td align="left">Applications that help biologists view the data in a more human-friendly way (e.g., Cytoscape for visualising complex networks) like 3D or graph representations of the data</td>
</tr>
</tbody>
</table>
</table-wrap>
</p>
</sec>
</sec>
<sec id="Sec3">
<title>Review</title>
<p>In computational sciences the theoretical frameworks for data integration have been classified into two major categories namely “eager” and “lazy” [
<xref ref-type="bibr" rid="CR29">29</xref>
,
<xref ref-type="bibr" rid="CR30">30</xref>
]. The difference between the two approaches is the way the data get integrated. In the eager approach (warehousing), the data are being copied over to a global schema and stored in a central data warehouse; whereas in the lazy approach the data reside in distributed sources and are integrated on demand based on a global schema used to map the data between sources.</p>
<p>Each of the two main categories of data integration has to deal with its own challenges in order to provide the user with a unified view of the data. In the eager approach, researchers face challenges to keep data updated and consistent, and protect the global schema from having corrupted data [
<xref ref-type="bibr" rid="CR31">31</xref>
,
<xref ref-type="bibr" rid="CR32">32</xref>
]. In the lazy approach, data are queried at sources and the scientific community is trying to find ways of improving the answering query process [
<xref ref-type="bibr" rid="CR33">33</xref>
<xref ref-type="bibr" rid="CR38">38</xref>
] and source completeness [
<xref ref-type="bibr" rid="CR36">36</xref>
,
<xref ref-type="bibr" rid="CR37">37</xref>
,
<xref ref-type="bibr" rid="CR39">39</xref>
,
<xref ref-type="bibr" rid="CR40">40</xref>
]. Which approach should be used and when depends on amount of data, who owns them and the existing infrastructure.</p>
<p>In biology we see a diversity of implementations across these two approaches being used at a variety of levels and forms like data centralisation, federated databases [
<xref ref-type="bibr" rid="CR41">41</xref>
,
<xref ref-type="bibr" rid="CR42">42</xref>
] and linked data [
<xref ref-type="bibr" rid="CR43">43</xref>
]. Figure
<xref rid="Fig1" ref-type="fig">1</xref>
shows the most common schemata used to integrate data in biology.
<fig id="Fig1">
<label>Fig. 1</label>
<caption>
<p>Data integration methodologies. This figure illustrates six major types of data integration methodologies in biology</p>
</caption>
<graphic xlink:href="40709_2015_32_Fig1_HTML" id="MO1"></graphic>
</fig>
</p>
<p>UniProt [
<xref ref-type="bibr" rid="CR44">44</xref>
] and GenBank [
<xref ref-type="bibr" rid="CR45">45</xref>
] are examples of centralised resources (Fig.
<xref rid="Fig1" ref-type="fig">1</xref>
-Data Centralisation), whereas Pathway commons [
<xref ref-type="bibr" rid="CR46">46</xref>
] collects pathways from different databases and stores them to a shared repository that can be used to query and analyse pathway information (Fig.
<xref rid="Fig1" ref-type="fig">1</xref>
-Data Warehousing). Datasets integration can also be made by in-house workflows accessing distributed databases and downloading data to a local repository (Fig.
<xref rid="Fig1" ref-type="fig">1</xref>
-Dataset Integration). ExPASy [
<xref ref-type="bibr" rid="CR47">47</xref>
] is the SIB Bioinformatics Resource Portal through which the user can access databases and tools in different areas of life science (Fig.
<xref rid="Fig1" ref-type="fig">1</xref>
-Hyperlinks). Database links are crucial for interoperability and several efforts have been done in this context [
<xref ref-type="bibr" rid="CR48">48</xref>
]. Regarding the federated database model (Fig.
<xref rid="Fig1" ref-type="fig">1</xref>
-Federated Databases), the Distributed Annotation System (DAS) [
<xref ref-type="bibr" rid="CR49">49</xref>
] represents a valuable example. DAS is a client-server system used to integrate and display in a single view annotation data on biological sequences residing over multiple distant servers. In this case, a translation layer is needed to achieve data integration among heterogeneous databases. There are various ways to do this but in general it refers to ways to transform the data from the database to a common format so they can be interpreted in the same way from a mapping service. As for the linked data integration (Fig.
<xref rid="Fig1" ref-type="fig">1</xref>
-Linked Data), the services offered are graphical interfaces (GUI) that provide the user with hyperlinks connecting related data from multiple data providers in a large network of Linked Data. BIO2RDF [
<xref ref-type="bibr" rid="CR43">43</xref>
] is an example of such integration system.</p>
<p>Data integration in biological research has its challenges associated to a variety of factors such as standards adoption or easy conversion between data/file formats [
<xref ref-type="bibr" rid="CR2">2</xref>
].</p>
<p>Figure
<xref rid="Fig2" ref-type="fig">2</xref>
illustrates a simplified schematic view of the current state of biological research data integration components. Various attempts to integrate the data rely on translation layers that, by applying agreed standards, transform the data in a unified format in order to integrate them. In other words, different formats for the same type of data (e.g. NGS) need to be “translated” into a unified format by applying shared rules. On top of the integration layer, there are various GUIs that make it possible to utilise (download, analyse, represent, etc) the integrated data. Furthermore, there is a myriad of resources and visualisation tools generated that fail to comply with standards and/or are not compatible with each other [
<xref ref-type="bibr" rid="CR50">50</xref>
] On the other hand, controlled vocabularies and ontologies to ease data integration are available for an increasing number of biological domain areas. Some of them can be found at the websites of the OBO (Open Biological and Biomedical Ontologies) foundry [
<xref ref-type="bibr" rid="CR51">51</xref>
], the NCBO (National Center for Biomedical Ontology) BioPortal [
<xref ref-type="bibr" rid="CR52">52</xref>
], and the OLS (Ontology Lookup Service). One successful example is the XML-based proteomic standards defined by the HUPO-PSI (Human Proteome Organisation-Proteomics Standards Initiative) consortium (see Table
<xref rid="Tab2" ref-type="table">2</xref>
). The rest of the paper will discuss key aspects of standards: ontologies, data formats, identifiers, reporting guidelines, consortiums and standard initiatives which will be followed by a section on visualisation.
<fig id="Fig2">
<label>Fig. 2</label>
<caption>
<p>Current state. This figure illustrates a simplified view of the current state of biological data and tools</p>
</caption>
<graphic xlink:href="40709_2015_32_Fig2_HTML" id="MO2"></graphic>
</fig>
<table-wrap id="Tab2">
<label>Table 2</label>
<caption>
<p>List of data standards initiatives</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left">Acronym</th>
<th align="left">Name</th>
<th align="left">Goal</th>
<th align="left">URL</th>
<th align="left">PMID</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">OBO</td>
<td align="left">The Open Biological and</td>
<td align="left">Establish a set of principles for ontology</td>
<td align="left">
<ext-link ext-link-type="uri" xlink:href="http://www.obofoundry.org">http://www.obofoundry.org</ext-link>
</td>
<td align="left">17989687</td>
</tr>
<tr>
<td align="left"></td>
<td align="left">Biomedical Ontologies</td>
<td align="left">development to create a suite of orthogonal</td>
<td align="left"></td>
<td align="left"></td>
</tr>
<tr>
<td align="left"></td>
<td align="left"></td>
<td align="left">interoperable reference ontologies in</td>
<td align="left"></td>
<td align="left"></td>
</tr>
<tr>
<td align="left"></td>
<td align="left"></td>
<td align="left">the biomedical domain</td>
<td align="left"></td>
<td align="left"></td>
</tr>
<tr>
<td align="left">CDISC</td>
<td align="left">Clinical data interchange</td>
<td align="left">Establish standards to support the acquisition,</td>
<td align="left">
<ext-link ext-link-type="uri" xlink:href="http://www.cdisc.org">http://www.cdisc.org</ext-link>
</td>
<td align="left">23833735</td>
</tr>
<tr>
<td align="left"></td>
<td align="left">standards consortium</td>
<td align="left">exchange, submission and archive of</td>
<td align="left"></td>
<td align="left"></td>
</tr>
<tr>
<td align="left"></td>
<td align="left"></td>
<td align="left">clinical research data and metadata</td>
<td align="left"></td>
<td align="left"></td>
</tr>
<tr>
<td align="left">HUPO-PSI</td>
<td align="left">Human Proteome Organisation-</td>
<td align="left">Defines community standards for data</td>
<td align="left">
<ext-link ext-link-type="uri" xlink:href="http://www.psidev.info">http://www.psidev.info</ext-link>
</td>
<td align="left">16901219</td>
</tr>
<tr>
<td align="left"></td>
<td align="left">Proteomics Standards Initiative</td>
<td align="left">representation in proteomics to facilitate</td>
<td align="left"></td>
<td align="left"></td>
</tr>
<tr>
<td align="left"></td>
<td align="left"></td>
<td align="left">data comparison, exchange and verification</td>
<td align="left"></td>
<td align="left"></td>
</tr>
<tr>
<td align="left">GAGH</td>
<td align="left">Global Alliance for Genomics</td>
<td align="left">Create interoperable approaches to catalyze</td>
<td align="left">
<ext-link ext-link-type="uri" xlink:href="http://genomicsandhealth.org/">http://genomicsandhealth.org/</ext-link>
</td>
<td align="left">24896853</td>
</tr>
<tr>
<td align="left"></td>
<td align="left">and Health</td>
<td align="left">projects that will help unlock the great</td>
<td align="left"></td>
<td align="left"></td>
</tr>
<tr>
<td align="left"></td>
<td align="left"></td>
<td align="left">potential of genomic data</td>
<td align="left"></td>
<td align="left"></td>
</tr>
<tr>
<td align="left">COMBINE</td>
<td align="left">Computational Modeling</td>
<td align="left">Coordinate the development of the various</td>
<td align="left">
<ext-link ext-link-type="uri" xlink:href="http://co.mbine.org/">http://co.mbine.org/</ext-link>
</td>
<td align="left">25759811</td>
</tr>
<tr>
<td align="left"></td>
<td align="left">in Biology</td>
<td align="left">community standards and formats for</td>
<td align="left"></td>
<td align="left"></td>
</tr>
<tr>
<td align="left"></td>
<td align="left"></td>
<td align="left">computational models</td>
<td align="left"></td>
<td align="left"></td>
</tr>
<tr>
<td align="left">MSI</td>
<td align="left">Metabolomics Standards</td>
<td align="left">Define community-agreed reporting</td>
<td align="left">
<ext-link ext-link-type="uri" xlink:href="http://msi-workgroups.sourceforge.net">http://msi-workgroups.sourceforge.net</ext-link>
</td>
<td align="left">17687353</td>
</tr>
<tr>
<td align="left"></td>
<td align="left">Initiative</td>
<td align="left">standards, which provided a clear description</td>
<td align="left"></td>
<td align="left"></td>
</tr>
<tr>
<td align="left"></td>
<td align="left"></td>
<td align="left">of the biological system studied and</td>
<td align="left"></td>
<td align="left"></td>
</tr>
<tr>
<td align="left"></td>
<td align="left"></td>
<td align="left">all components of metabolomics studies</td>
<td align="left"></td>
<td align="left"></td>
</tr>
<tr>
<td align="left">RDA</td>
<td align="left">Research Data Alliance</td>
<td align="left">Builds the social and technical bridges that</td>
<td align="left">
<ext-link ext-link-type="uri" xlink:href="https://rd-alliance.org">https://rd-alliance.org</ext-link>
</td>
<td align="left"></td>
</tr>
<tr>
<td align="left"></td>
<td align="left"></td>
<td align="left">enable open sharing of data across multiple</td>
<td align="left"></td>
<td align="left"></td>
</tr>
<tr>
<td align="left"></td>
<td align="left"></td>
<td align="left">scientific disciplines</td>
<td align="left"></td>
<td align="left"></td>
</tr>
</tbody>
</table>
</table-wrap>
</p>
<sec id="Sec4">
<title>Standards</title>
<p>As mentioned above, one of the most important factors for the biological field to thrive is to standardise the data. In computational science a similar problem was encountered for the web and specifically with the way that browsers parse web pages. This was solved by agreeing on W3C standards [
<xref ref-type="bibr" rid="CR53">53</xref>
] so that all the browsers are forced to comply otherwise they may result in poor user experience and they risk losing market share.</p>
<p>In biology there are many different ways of representing similar data and this makes the data harder to be integrated and processed to obtain unified views of such data. Gene naming is an example of poor uniformity in data representation. Despite full guidelines were issued in 1979 to adopt gene nomenclature standards (see [
<xref ref-type="bibr" rid="CR54">54</xref>
]), an assortment of alternate names is still in use across the scientific literature and databases, posing a challenge to data sharing. When it comes to biological research, it is crucial to create (when non existing), adopt and implement standards. Without these it is (nearly) impossible to achieve data integration [
<xref ref-type="bibr" rid="CR55">55</xref>
,
<xref ref-type="bibr" rid="CR56">56</xref>
].</p>
<p>So what do we mean by standards? Standards can be defined as an agreed compliant term or structure to represent a biological entity. Entities are all types of units of biological information. For example we use T, G, A, C as a standard way to refer to the nucleotides that make the DNA, and aa (for amino acids) represented usually by one letter, and consequently, a string of letters to represent a DNA or protein sequence. However, a protein might be known in the scientific literature and referred by researchers by a variety of names, synonyms and abbreviations.</p>
<p>So, which standards exist, who defines them and how are these working? Lots of standard initiatives and efforts seem to exist, sometimes redundant, often non driven by the end users communities. It is out of the scope of this paper (and probably a never ending exercise) to review all of them, which do proliferate but not necessarily in harmonising ways. A snapshot of the variety of standards for metadata can be found at the DCC website [
<xref ref-type="bibr" rid="CR57">57</xref>
] and BioSharing [
<xref ref-type="bibr" rid="CR58">58</xref>
] as an example of the point we are making. Table
<xref rid="Tab2" ref-type="table">2</xref>
reports a list of standard initiatives along with their primary goal, URL and key reference in the omics field.</p>
<p>Standards facilitate data re-use. They make data sharing easier, saving overheads and losses of time in data loading, conversion, getting systems to work properly with data. They help overcome interoperability difficulties across different data formats, architectures, and naming conventions, and at infrastructure level, enabling access systems to work together [
<xref ref-type="bibr" rid="CR59">59</xref>
<xref ref-type="bibr" rid="CR62">62</xref>
]. Absence of standards means substantial loss of productivity and less data available to researchers [
<xref ref-type="bibr" rid="CR63">63</xref>
].</p>
<p>Figure
<xref rid="Fig3" ref-type="fig">3</xref>
illustrates a schematic view of an ideal state of biological research data integration components. This figure emphasises on the importance of standards that is the base of all the top layers of the infrastructure. Without solid foundations, it is very difficult to build and maintain robust tools for the layers above. The arrows point out that the data can be used across all layers and this can go both ways. For example, in an ideal state, all biological data would be integrated from various databases across the world and biologists will be able to use a GUI to locate the entity of their interest. Then, they can use a visualisation tool to have a better representation of the entity by using the same data previously identified through the GUI (like a unique identifier). Furthermore, the biologist will be in a position to annotate or edit the data directly from the visualisation tool, which in turn will be able to commit the changes to the integrated service and from then on go all the way down the pyramid until the data in the proper database get edited and annotated.
<fig id="Fig3">
<label>Fig. 3</label>
<caption>
<p>Ideal state. This figure illustrates a simplified view of an ideal state of biological data and tools</p>
</caption>
<graphic xlink:href="40709_2015_32_Fig3_HTML" id="MO3"></graphic>
</fig>
</p>
<p>Standards are therefore key to the data sharing process since they describe the norms which should be adopted to facilitate interchange and inter-working of information, processes, objects and software. Thus data resources play a major role not just in data management, integration, access, and preservation, but also for providing adequate support to research communities.</p>
<sec id="Sec5">
<title>Ontologies</title>
<p>Ontologies have been proliferating in biological research, and their importance underlined several times [
<xref ref-type="bibr" rid="CR64">64</xref>
<xref ref-type="bibr" rid="CR67">67</xref>
] also in the specific context of data integration [
<xref ref-type="bibr" rid="CR68">68</xref>
]. In order to bring some coordination and consolidation to the proliferation of ontologies across the biological and biomedical research fields, The Open Biological and Biomedical Ontologies (OBO) got together. OBO is a collaborative experiment involving developers of science-based ontologies who are establishing a set of principles for ontology development with the goal of creating a suite of orthogonal interoperable reference ontologies in the biomedical domain. Biological researchers can get involved and provide feedback by getting into the discussion fora OBO provides. Currently there are ten OBO foundry ontologies and more than 120 candidate ontologies or other ontologies of interest [
<xref ref-type="bibr" rid="CR51">51</xref>
].</p>
<p>These efforts need the direct involvement of the actual biologists when it comes to the adoption and implementation of using such ontologies, ensuring these are known and disseminated across communities. Other important initiatives are, the NCBO (National Center for Biomedical Ontology) BioPortal [
<xref ref-type="bibr" rid="CR69">69</xref>
,
<xref ref-type="bibr" rid="CR70">70</xref>
], and the OLS (Ontology Lookup Service) [
<xref ref-type="bibr" rid="CR71">71</xref>
].</p>
<p>With a set of unique common compliant standards in place, it will be possible to create tools to integrate the data on the web using an existing infrastructure like linked data. This will enable querying multiple sources without having to re-invent integration techniques for the integration of each source. As an example, one of the efforts currently trying to attempt this is Bio2RDF [
<xref ref-type="bibr" rid="CR43">43</xref>
]. This is a major effort to integrate biological data using the linked data infrastructure. So far there are no tools that can utilise these data directly but they are mainly accessible via complex queries or low level GUIs.</p>
</sec>
</sec>
<sec id="Sec6">
<title>Formats</title>
<p>Data formats are the concrete way we structure and represent biological information in a file. They are particularly relevant to those who deal with large amount of information such that generated by high throughput experiments. Indeed, a scientist interested in a single or a few genes at a time may extract information about them by manually “parsing” the literature or free-text (i.e. non formatted) documents. The need for storing biological data in formatted files arose from the need for using computers to analyse them. The amounts of genomics and proteomics data, which cannot be manually analysed element by element, are exponentially increasing and the adoption of commonly agreed formats to represent them in computer readable files is nowadays of utter importance. Historically, the scarcity of well structured data standards and schemas, caused the flourishing of many different formats even to represent the same type of data despite the adoption of standards in file formats would be essential to data exchange and integration. Funnily, the Roslin Bioinformatics Law’s First Law declaims: “The first step in developing a new genetic analysis algorithm is to decide how to make the input data file format different from all pre-existing analysis data file formats” [
<xref ref-type="bibr" rid="CR72">72</xref>
].</p>
<p>For the benefit of data integration though, it would be ideal to have well-structured data across few basic formats that would be easily computer readable and therefore easily integrated. In the specific case of NGS data, the lag between the emerging high-throughput screening technologies and the adjusting of the scientific community to settle on a standard format, means time and effort spent on converting raw files across multiple sequencing platforms to make these compatible [
<xref ref-type="bibr" rid="CR73">73</xref>
]. Currently, in NGS there are no really “standards” that people adhere to, but a set of commonly used formats (FASTA/Q, SAM, VCF, GFF/GTF, etc.). There are descriptor standards like MIGS [
<xref ref-type="bibr" rid="CR74">74</xref>
], but these might not be generally adopted. More in general, today an exhaustive “atlas” of the formats used in bioinformatics cannot be found on the Internet. One partial list is available at
<ext-link ext-link-type="uri" xlink:href="http://genome.ucsc.edu/FAQ/FAQformat.html">http://genome.ucsc.edu/FAQ/FAQformat.html</ext-link>
and the description of many formats can be found in the online forum BioStar [
<xref ref-type="bibr" rid="CR75">75</xref>
].</p>
<p>A good format needs to take into account the data themselves (for example the DNA sequence of a gene) and the so called metadata, i.e. additional information describing the data (e.g. gene name, taxonomy information, cross reference to other resources, etc.) and has to adopt strategies (“tricks”) to make metadata unequivocally distinguishable from data by a computer program. This goal is achieved in different ways by different bioinformatics resources, resulting in the large number of formats we observe today. However, despite the large variety of computer readable formats, we realised that the most commonly used ones are ascribable to four main different classes: 1) tables 2) FASTA-like 3) GenBank-like 4) tag-structured. Table
<xref rid="Tab3" ref-type="table">3</xref>
reports examples for each of these classes.
<table-wrap id="Tab3">
<label>Table 3</label>
<caption>
<p>Mostly commonly used data formats in bioinformatics</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left">Data format class</th>
<th align="left">General data-</th>
<th align="left">Nucleotide sequence</th>
<th align="left">Protein sequence</th>
<th align="left">Structural</th>
<th align="left">Sequence</th>
<th align="left">Other data</th>
</tr>
<tr>
<th align="left"></th>
<th align="left">interchange formats</th>
<th align="left">data</th>
<th align="left">data</th>
<th align="left">data</th>
<th align="left">alignment</th>
<th align="left">types (PPI, etc)</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">Tabl</td>
<td align="left">CSV, TSV</td>
<td align="left">BED; GFF</td>
<td align="left">GFF, Uniprot-GFF</td>
<td align="left">PSF(D), MMCIF(D)</td>
<td align="left">SAM(D)</td>
<td align="left"></td>
</tr>
<tr>
<td align="left">FASTA-like</td>
<td align="left"></td>
<td align="left">FASTA; FASTQ</td>
<td align="left">FASTA, PIR</td>
<td align="left"></td>
<td align="left">SAM(M)</td>
<td align="left">Wig</td>
</tr>
<tr>
<td align="left">GenBank-like</td>
<td align="left"></td>
<td align="left">GenBank; EMBL</td>
<td align="left">Uniprot-TEXT</td>
<td align="left">PDB, PSF(M), MMCIF(D)</td>
<td align="left">CLUSTAL, MSF,</td>
<td align="left"></td>
</tr>
<tr>
<td align="left"></td>
<td align="left"></td>
<td align="left"></td>
<td align="left"></td>
<td align="left"></td>
<td align="left">PHYLIP(D)</td>
<td align="left"></td>
</tr>
<tr>
<td align="left">Tag-structured</td>
<td align="left">HTML; XML; JSON</td>
<td align="left">SBOL-XML</td>
<td align="left">Uniprot-XML;</td>
<td align="left"></td>
<td align="left"></td>
<td align="left">PSI MI-XML;</td>
</tr>
<tr>
<td align="left"></td>
<td align="left"></td>
<td align="left"></td>
<td align="left">Uniprot-RDF/XML</td>
<td align="left"></td>
<td align="left"></td>
<td align="left">PSI-PAR</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p>D = data; M = metadata. Formats appearing in more than one class are a mixture of classes</p>
</table-wrap-foot>
</table-wrap>
</p>
<p>In table formats, data are organised in a table in which the columns are separated by tabs, commas, pipes, etc., depending on the source generating the file. FASTA-like files utilise, for each data record, one or more “definition” or “declaration lines”, which contain metadata information or specify the content of the following lines. Definition/declaration lines usually start with a special character or keyword in the first position of the line - a “ >” in FASTA files or a “@” in fastq or SAM files - followed by lines containing the data themselves (Fig.
<xref rid="Fig4" ref-type="fig">4</xref>
). In some cases, declaration lines may be interspersed with data lines. This format is mostly used for sequence data. In the GenBank-like format, each line starts with an identifier that specifies the content of the line (Fig.
<xref rid="Fig5" ref-type="fig">5</xref>
). Tag-structured formatting uses “tags” (“ <”, “ >”, “{”, “}”, etc.) to make data and metadata recognisable (Fig.
<xref rid="Fig6" ref-type="fig">6</xref>
) with high specificity. Tag-structured text files, especially XML and JSON, are being increasingly employed as data interchange formats between different programming languages.
<fig id="Fig4">
<label>Fig. 4</label>
<caption>
<p>Selected parts of a FASTQ file. In this format declaration lines start with two different characters (“@” and “+”) corresponding to different data types (the raw sequence and the sequence quality values, respectively)</p>
</caption>
<graphic xlink:href="40709_2015_32_Fig4_HTML" id="MO4"></graphic>
</fig>
<fig id="Fig5">
<label>Fig. 5</label>
<caption>
<p>Selected parts of the GenBank entry DQ408531. The complete entry can be found at
<ext-link ext-link-type="uri" xlink:href="http://www.ncbi.nlm.nih.gov/nuccore/DQ408531">http://www.ncbi. nlm.nih.gov/nuccore/DQ408531</ext-link>
</p>
</caption>
<graphic xlink:href="40709_2015_32_Fig5_HTML" id="MO5"></graphic>
</fig>
<fig id="Fig6">
<label>Fig. 6</label>
<caption>
<p>Selected parts of the Uniprot entry P01308 in XML format - The complete entry can be found at
<ext-link ext-link-type="uri" xlink:href="http://www.uniprot.org/uniprot/P01308.xml">http://www.uniprot.org/uniprot/P01308.xml</ext-link>
</p>
</caption>
<graphic xlink:href="40709_2015_32_Fig6_HTML" id="MO6"></graphic>
</fig>
</p>
<p>There are also examples of data files using different representations for data and metadata. This means that two or more format classes may be used in the same data file. An example is represented by SAM files, which contain both GenBank-like lines (for the metadata) and table columns (for the data) as shown in Fig.
<xref rid="Fig7" ref-type="fig">7</xref>
.
<fig id="Fig7">
<label>Fig. 7</label>
<caption>
<p>Selected parts of a SAM file</p>
</caption>
<graphic xlink:href="40709_2015_32_Fig7_HTML" id="MO7"></graphic>
</fig>
</p>
<p>Should any of these four data representation classes be preferred over the others? Despite we observe an increasing use of XML and some authors propose to adopt XML for biological data interchange between databases and other sources of data [
<xref ref-type="bibr" rid="CR76">76</xref>
], we believe that there is not an ultimate answer. There are text formats that better suit some specific kind of data and specific computational requirements and purposes. For example, it is difficult to imagine how macromolecule X-ray or NMR coordinates and related annotation, currently stored in PDB files, could fit into the FASTA-like format. On the other hand, if one has to parse big sequence files, the FASTA format, with a single line annotation, will cause them to have a smaller size than differently formatted files and will allow parsing them with just a few lines of code. Notice that some formats (e.g. SAM) can be compressed into a binary version (BAM) for intensive data processing.</p>
<p>Therefore, we believe that the solution is not to urge scientists to conform to a unique “optimal” format but rather to identify a few operational formats and make database and tool developers aware of the importance of sticking to them.</p>
<p>For integration purposes, the scientific community of database and tool developers has begun to adopt some good practices in data file formatting. One example is represented by the FGED Society (
<ext-link ext-link-type="uri" xlink:href="http://fged.org/">http://fged.org/</ext-link>
) formed at a meeting on Microarray Gene Expression Databases (EBI, Hinxton, 1999) with the goal, amongst the others, of facilitating the adoption of standards for DNA microarrays and gene expression data representation. We believe, however, that further efforts should be made in order to achieve a more robust and systematic policy in all the areas where data sharing is essential to utilise these data to make new discoveries and the progress of science possible.</p>
<p>The community of scientists concerned by data sharing and integration, including us, should make the effort of 1) compiling a complete and structured (i.e. organised by data type and purpose) list of the currently available formats with their description and 2) developing guidelines and recommendations for the adoption of standards in file formatting, also discussing which data types fit into each different text format and the related performance implications. This list and the guidelines, which might be integrated in a resource such as BioSharing should encourage database and tool developers to present information in a way that a computer program can parse it, suggest that they avoid inventing new computer readable formats but rather comply with one of the existing ones, and only accept new data, for storage purposes, that meet certain formatting criteria. Such guidelines should be ambitious and forward-looking enough to also advice scientists in both academia and industry to keep in mind data representation in developing high throughput technologies and their information services.</p>
<p>The development of converters translating formats in a unified form should be promoted as well. This would actually make it possible to combine the data across all the formats. A rather isolated example of data format translation is represented by the PRIDE Converter [
<xref ref-type="bibr" rid="CR77">77</xref>
], which makes it easy to translate a large variety of input formats into the unique XML [
<xref ref-type="bibr" rid="CR76">76</xref>
,
<xref ref-type="bibr" rid="CR78">78</xref>
] format for proteomic data submission to the PRIDE repository [
<xref ref-type="bibr" rid="CR79">79</xref>
]. The PRIDE Converter was designed to be suitable for both small and large data submissions and has a very intuitive GUI also for wet-lab scientists without a strong bioinformatics background or informatics support. Format translation faces problems especially with not well-structured data that cannot be translated properly in a computer readable format and therefore rely on human manipulation of the data in order to verify the correctness of the transformation. In the case of NGS data, we rely on tools for conversion between next generation sequencing data formats, such as NGS-FC (
<ext-link ext-link-type="uri" xlink:href="http://sourceforge.net/projects/ngsformaterconv/">http://sourceforge.net/projects/ngsformaterconv/</ext-link>
), to ensure each tool in a workflow can work with the right format.</p>
<sec id="Sec7">
<title>Identifiers</title>
<p>An identifier is a unique representation of a given data entry [
<xref ref-type="bibr" rid="CR80">80</xref>
,
<xref ref-type="bibr" rid="CR81">81</xref>
]. For example the Universal Protein Database (UniProt) uses a “unique identifier” to refer to a protein entity which cannot be used in any other case, thus ensuring no redundancy and one agreed unique term that unequivocally identifies a given protein [
<xref ref-type="bibr" rid="CR82">82</xref>
].</p>
<p>In biological research a variety of data repositories exist and each of them is using its own implementation for generating unique identifiers. As an example, for the same protein, UniProt uses the identifier Q9Y6N8 whereas Ensembl [
<xref ref-type="bibr" rid="CR83">83</xref>
] is referring to it as ENSP00000264463 and RefSeq [
<xref ref-type="bibr" rid="CR84">84</xref>
] as NP_006718.2. If all the researchers could use a single unique identifier to refer to a given protein across their publications and work, data integration would be a step ahead of its current state.</p>
<p>An effort to help with the discoverability of the identifiers and assist the researcher with knowledge on how to query data across databases has be done from identifiers.org [
<xref ref-type="bibr" rid="CR85">85</xref>
]. This is a registry that facilitates the discovery of resources in life sciences and allows to decouple the identification of records by the physical locations on the web where they can be retrieved.</p>
<p>Many biological concepts are described in several databases using different identifiers. To facilitate discoverability and integration, databases have their data entries cross-referenced with external entries using identifiers. This enables users to find a data entry like a protein in UniProt and then find the same biological concept described in other databases (ie. RefSeq) and gather more relevant data about the same entry. Several initiatives like PICR [
<xref ref-type="bibr" rid="CR86">86</xref>
] or the “DAVID ID conversion tool” [
<xref ref-type="bibr" rid="CR87">87</xref>
] provide mapping of such identifiers. It will be beneficial if such service gets integrated in the major bioinformatics databases.</p>
<p>Some organised efforts including distributed resources like IMEx [
<xref ref-type="bibr" rid="CR88">88</xref>
] are very well organised and, though the independent databases that are part of the consortium like IntAct [
<xref ref-type="bibr" rid="CR81">81</xref>
], MINT [
<xref ref-type="bibr" rid="CR89">89</xref>
] and DIP [
<xref ref-type="bibr" rid="CR90">90</xref>
] use their own identifiers, all their entries get assigned a unique IMEx identifier issued by a central authority. The IMEx identifier is assigned to a single biological entity with the purpose of being reused across databases/systems and always link to the same entity regardless the system. The IMEx Central repository coordinates curation effort, assigns identifiers and facilitates the exchange of completed records on molecular interaction data between the IMEx Consortium partners.</p>
<p>Approaches like these can increase discoverability and shareability of data and even enable publications and scientific studies to use a single identifier to refer to a given entity. This entity could be easily traced and further studied by their audience. With an infrastructure like this in place, it will be possible to enforce researchers to submit the unique identifier of the biological entity that they are studying on their research papers. This is happening already for nucleotide sequence data where researchers have to submit newly obtained/sequenced entities to one of the three major sequencing databases [
<xref ref-type="bibr" rid="CR91">91</xref>
] and refer to it in the paper. Most of other data types can be used in publications without such requirement. This also extends to entire datasets.</p>
</sec>
<sec id="Sec8">
<title>Reporting guidelines</title>
<p>Huge steps have been achieved by the creation and adoption of clear recommended guidelines when it comes to depositing and disseminating data and datasets [
<xref ref-type="bibr" rid="CR92">92</xref>
<xref ref-type="bibr" rid="CR95">95</xref>
]. Such guidelines are often the result of several discussions (years of discussions in some occasions) in a field where data efforts for sharing have been maturing. The specification of several standards in life science include documentation and examples of how to use them, but many initiatives additionally include guidelines to agree on what minimum or recommended information should be provided when describing data. Minimum information guidelines have been very popular to ensure that data can be easily interpreted and that results derived from their analysis can be independently verified. These guidelines tend to concentrate on defining the content and structure of the necessary information rather than the technical format for capturing it. A key landmark in the development of guidelines of minimun information in this area comes from the “Minimum Information about a Biomedical or Biological Investigation” (MIBBI) [
<xref ref-type="bibr" rid="CR93">93</xref>
].</p>
<p>It is crucial to have a place where such efforts are listed and shared in order to ensure redundancy is avoided. As an example of reporting guidelines we mention here the efforts done in the topic of protein-protein interactions. Currently we see two reporting guidelines: MIMIx [
<xref ref-type="bibr" rid="CR96">96</xref>
] and IMEx [
<xref ref-type="bibr" rid="CR88">88</xref>
]. A key project that is contributing in this area and where one can look for as well as add “reporting guidelines” is the Registry of guidelines in biosharing.org [
<xref ref-type="bibr" rid="CR58">58</xref>
,
<xref ref-type="bibr" rid="CR97">97</xref>
].</p>
<p>As we have seen, there are different formats when it comes to data files, and these will always evolve according to the needs of the communities as well as the nature of the data and associated technologies. For example, a format that contains 20 fields for which one researcher might have a subset of information versus another that might opt for prioritising a different set. It is clear that having a minimum agreed set of fields that all comply to report using standards is crucial for data integration and reusability across such data. Similarly, other fields might be crucial and informative to a specific set of users. These can be adopted at the level of recommended. For example a protein-protein interaction database wants to capture domain specific information about interactions versus another one that is not interested in such aspect. One also might have optional fields, for those that want to annotate and enrich further the data record with metadata. Doing this in a standard manner means again allowing future reusability and expansion for others to adopt and exchange, integrate data based on this level of information.</p>
</sec>
<sec id="Sec9">
<title>Consortiums and standards initiatives</title>
<p>There are several initiatives coordinating the development of community standards to facilitate data comparison, exchange and verification in bioinformatics. Some of this initiatives are community initiatives or consortia like COMBINE [
<xref ref-type="bibr" rid="CR98">98</xref>
], PSI [
<xref ref-type="bibr" rid="CR99">99</xref>
], GAGH [
<xref ref-type="bibr" rid="CR100">100</xref>
], INSDC [
<xref ref-type="bibr" rid="CR101">101</xref>
], proteomeXchange [
<xref ref-type="bibr" rid="CR102">102</xref>
], IMEx [
<xref ref-type="bibr" rid="CR88">88</xref>
], BioPax [
<xref ref-type="bibr" rid="CR103">103</xref>
] involved in the development of standards in one specific biological domain. Some other community initiatives like RDA are more generic with a potential application in different scientific domains.</p>
<p>Some strategic efforts supported by major service providers and national governments like ELIXIR [
<xref ref-type="bibr" rid="CR104">104</xref>
], BBMRI [
<xref ref-type="bibr" rid="CR105">105</xref>
], BD2K [
<xref ref-type="bibr" rid="CR106">106</xref>
] are also involved in the development of standards in life sciences. Projects supported by specific grants like BioMedBridges [
<xref ref-type="bibr" rid="CR107">107</xref>
], BioSHaRE [
<xref ref-type="bibr" rid="CR108">108</xref>
] do also contribute to this cause but their duration is normally bound to the duration of the grant. All these initiatives play a major role in achieving consensus and agreements which facilitates the development and adoptions of standards.</p>
<p>In biological research, molecular biology has been the field ahead in terms of such efforts and the associated bioinformatics applications. One can only imagine the work yet to be done, learning from existing efforts and initiatives as described here in the field of ecology, biodiversity, marine biology and so on. Examples of large scale efforts that need to talk to each other and ideally apply best practice when it comes to creating an infrastructure that fosters data integration are LifeWatch [
<xref ref-type="bibr" rid="CR109">109</xref>
] and ISBE [
<xref ref-type="bibr" rid="CR110">110</xref>
].</p>
</sec>
<sec id="Sec10">
<title>Visualisation</title>
<p>There is a variety of visualisation tools, but often each tool requires a different file format and the task of feeding back the discovered data is not trivial [
<xref ref-type="bibr" rid="CR111">111</xref>
,
<xref ref-type="bibr" rid="CR112">112</xref>
]. The field of visualisation has its own challenges given the increasing quantity of data, the integration of heterogeneous data and the need for tools that allow representing multiple aspects of the data (e.g. multiple connections between nodes with diverse biological meanings [
<xref ref-type="bibr" rid="CR113">113</xref>
,
<xref ref-type="bibr" rid="CR114">114</xref>
]). There is a myriad of visualisation and analysis tools, ever proliferating, with each tool providing specific features that address different aspects (e.g. genome browsers [
<xref ref-type="bibr" rid="CR115">115</xref>
<xref ref-type="bibr" rid="CR119">119</xref>
]). In 2008 Pavlopoulus et al published a wish list for visualisation of biological data which still remains valid [
<xref ref-type="bibr" rid="CR120">120</xref>
].</p>
<p>Data integration principles are fundamental in providing tools that are user friendly and allow the end users (biologists) to focus their efforts on the actual study of the data instead of being lost in the process of looking for the data they need by querying multiple databases that appear to provide inconsistent results between them. The field of systems biology
<italic>per se</italic>
brought substantial advances in visualisations since the ability to analyse and interpret interactions, networks and pathways relies often in the ability of visualising these accurately [
<xref ref-type="bibr" rid="CR120">120</xref>
].</p>
<p>Overcoming some of the challenges associated with visualisation relies on better standards adoption and improvement in annotation and metadata. This is clearly a two directional effort: bottom up, where data and datasets are annotated and stored following a common set of standards, this extends to the data formats as well as a top down level of standards and adoption of compatible formats and output files that allow comparisons and integrations of results [
<xref ref-type="bibr" rid="CR121">121</xref>
<xref ref-type="bibr" rid="CR123">123</xref>
].</p>
<p>Historically, many domains within biology have relied on visualisation as a way to represent the biological information thus creating what are now considered standards in their domains. Plenty of examples can be found in the areas of phylogenetics [
<xref ref-type="bibr" rid="CR124">124</xref>
] and pathways [
<xref ref-type="bibr" rid="CR125">125</xref>
,
<xref ref-type="bibr" rid="CR126">126</xref>
]. The advent of next generation sequencing brought genomics as a domain were significant effort has been put to develop new visualisation techniques to represent sequences, alignments, expression patterns and ultimately entire genomes [
<xref ref-type="bibr" rid="CR127">127</xref>
<xref ref-type="bibr" rid="CR130">130</xref>
]. However, biological researchers might lack an understanding and awareness about the range of visualisation techniques available and which is the most appropriate visual representation [
<xref ref-type="bibr" rid="CR131">131</xref>
,
<xref ref-type="bibr" rid="CR132">132</xref>
].</p>
<p>An increased dialogue between the computational scientists involved in the creation and development of such tools with the end users (aka the biologists), would be beneficial for the entire community and we hope this paper is one step towards such outcome. Efforts in this direction are also on the way and we cite here the BiVi initiative (
<ext-link ext-link-type="uri" xlink:href="http://bivi.co/">http://bivi.co/</ext-link>
), which is addressing several challenges in the realm of visualisation as well as trying to reduce the gap between the biology, computational sciences and developers of bioinformatics tools. BiVi has grouped many of the most notable visualisation tools produced by biologists and developers across seven domains (though some of the tools cover more than one of these) and provides information as to their provenance, current status and links to websites (
<ext-link ext-link-type="uri" xlink:href="http://bivi.co/visualisations">http://bivi.co/visualisations</ext-link>
). Other community efforts in this area are VizBI (
<ext-link ext-link-type="uri" xlink:href="http://vizbi.org/">http://vizbi.org/</ext-link>
), SciVis (
<ext-link ext-link-type="uri" xlink:href="http://scivis.itn.liu.se/">http://scivis.itn.liu.se/</ext-link>
) and CoVis (
<ext-link ext-link-type="uri" xlink:href="http://www.iwr.uni-heidelberg.de/groups/CoVis/">http://www.iwr.uni-heidelberg.de/groups/CoVis/</ext-link>
).</p>
<p>It would be impossible for us to list the plethora of visualisation tools developed and used in biological research, hence we provide an overview in Table
<xref rid="Tab4" ref-type="table">4</xref>
of some of the most common visualisations tools in the area of “Interaction Network Visualisation” to illustrate the variety and types of resources available for one area.
<table-wrap id="Tab4">
<label>Table 4</label>
<caption>
<p>Common visualisation tools in the area of “Interaction Network Visualisation”</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left">Name of resource</th>
<th align="left">What it does</th>
<th align="left">URL</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">BicOverlapper</td>
<td align="justify">Visualisation of biclusters combined with profile plots and heat maps</td>
<td align="left">
<ext-link ext-link-type="uri" xlink:href="http://vis.usal.es/bicoverlapper/">http://vis.usal.es/bicoverlapper/</ext-link>
</td>
</tr>
<tr>
<td align="left">BiGGEsTS</td>
<td align="justify">Heat map-based bicluster visualisation</td>
<td align="left">
<ext-link ext-link-type="uri" xlink:href="http://tinyurl.com/BiGGEsTS">http://tinyurl.com/BiGGEsTS</ext-link>
</td>
</tr>
<tr>
<td align="left">Brain Explorer</td>
<td align="justify">Visualisation of 3D transcription data in the central nervous system</td>
<td align="left">
<ext-link ext-link-type="uri" xlink:href="http://tinyurl.com/brainExplorer">http://tinyurl.com/brainExplorer</ext-link>
</td>
</tr>
<tr>
<td align="left">Data Matrix Viewer</td>
<td align="justify">Simple profile plot visualisation; supports Gaggle</td>
<td align="left">
<ext-link ext-link-type="uri" xlink:href="http://gaggle.systemsbiology.net/">http://gaggle.systemsbiology.net/</ext-link>
</td>
</tr>
<tr>
<td align="left">EXPANDER</td>
<td align="justify">Heat maps, scatter plots and profile plots of cluster averages</td>
<td align="left">
<ext-link ext-link-type="uri" xlink:href="http://acgt.cs.tau.ac.il/expander">http://acgt.cs.tau.ac.il/expander</ext-link>
</td>
</tr>
<tr>
<td align="left">GENESIS</td>
<td align="justify">Analysis suite; offers several interactive visualisations</td>
<td align="left">
<ext-link ext-link-type="uri" xlink:href="http://genome.tugraz.at/">http://genome.tugraz.at/</ext-link>
</td>
</tr>
<tr>
<td align="left">geWorkbench</td>
<td align="justify">Modular suite; heat maps, dendrograms, profile and scatter plots</td>
<td align="left">
<ext-link ext-link-type="uri" xlink:href="http://tinyurl.com/geWorkbench">http://tinyurl.com/geWorkbench</ext-link>
</td>
</tr>
<tr>
<td align="left">Hierarchical Clustering Explorer</td>
<td align="justify">Linked heat map, profile and scatter plots; systematic exploration</td>
<td align="left">
<ext-link ext-link-type="uri" xlink:href="http://tinyurl.com/HCExplorer">http://tinyurl.com/HCExplorer</ext-link>
</td>
</tr>
<tr>
<td align="left">Java TreeView</td>
<td align="justify">Linked heat maps, karyoscopes, sequence alignments, scatter plots</td>
<td align="left">
<ext-link ext-link-type="uri" xlink:href="http://jtreeview.sourceforge.net/">http://jtreeview.sourceforge.net/</ext-link>
</td>
</tr>
<tr>
<td align="left">Mayday</td>
<td align="justify">Modular suite; many linked visualisations; enhanced heat map113</td>
<td align="left">
<ext-link ext-link-type="uri" xlink:href="http://tinyurl.com/maydaywp">http://tinyurl.com/maydaywp</ext-link>
</td>
</tr>
<tr>
<td align="left">MultiExperiment Viewer</td>
<td align="justify">Analysis suite; heat maps, dendrograms, profile and scatter plots</td>
<td align="left">
<ext-link ext-link-type="uri" xlink:href="http://www.tm4.org/">http://www.tm4.org/</ext-link>
</td>
</tr>
<tr>
<td align="left">PointCloudXplore</td>
<td align="justify">Visualisation of 3D transcription data in Drosophila embryos</td>
<td align="left">
<ext-link ext-link-type="uri" xlink:href="http://tinyurl.com/PointCloudXplore">http://tinyurl.com/PointCloudXplore</ext-link>
</td>
</tr>
<tr>
<td align="left">TimeSearcher</td>
<td align="justify">Exploration and analysis of time series; advanced profile plots</td>
<td align="left">
<ext-link ext-link-type="uri" xlink:href="http://tinyurl.com/timesearcher">http://tinyurl.com/timesearcher</ext-link>
</td>
</tr>
<tr>
<td align="left">R/BioConductor Geneplotter</td>
<td align="justify">Karyoscope-style plots and other visualisations</td>
<td align="left">
<ext-link ext-link-type="uri" xlink:href="http://www.bioconductor.org/">http://www.bioconductor.org/</ext-link>
</td>
</tr>
<tr>
<td align="left">GenePattern</td>
<td align="justify">Modular analysis platform; several visualisation modules available</td>
<td align="left">
<ext-link ext-link-type="uri" xlink:href="http://tinyurl.com/GenePatt">http://tinyurl.com/GenePatt</ext-link>
</td>
</tr>
<tr>
<td align="left">Cytoscape</td>
<td align="justify">Open source software platform for visualizing molecular interaction networks and biological pathways and integrating these networks with annotations, gene expression profiles and other state data</td>
<td align="left">
<ext-link ext-link-type="uri" xlink:href="http://www.cytoscape.org/index.html">http://www.cytoscape.org/index.html</ext-link>
</td>
</tr>
</tbody>
</table>
</table-wrap>
</p>
<p>There are also well known and generally adopted analysis suites that also provide visualisation tools as part of their repertoire of resources such as Galaxy [
<xref ref-type="bibr" rid="CR133">133</xref>
], Cytoscape [
<xref ref-type="bibr" rid="CR134">134</xref>
,
<xref ref-type="bibr" rid="CR135">135</xref>
], Ondex [
<xref ref-type="bibr" rid="CR136">136</xref>
], iPlant Collaborative [
<xref ref-type="bibr" rid="CR137">137</xref>
], Bioconductor [
<xref ref-type="bibr" rid="CR138">138</xref>
]. Other important efforts derive from initiatives that are working towards unlocking the actual visualisations, in other words going from the visualisation to the data and datasets. This is important not only for reproducibility but also to allow access for data and their integration with other data/datasets. A very interesting resource is Utopia Docs [
<xref ref-type="bibr" rid="CR139">139</xref>
,
<xref ref-type="bibr" rid="CR140">140</xref>
], a free PDF reader that connects the static content of scientific articles to the dynamic world of online content. This resources allows the user to interact directly with curated database entries; play with molecular structures; edit sequence and alignment data; even plot and export tabular data. Another totally different but relevant initiative in the world of visualisation is BIOJS, that aims to provide open-source library of JavaScript components to visualise biological data. BIOJS vision is that every online biological dataset in the world should be visualised with BIOJS tools (
<ext-link ext-link-type="uri" xlink:href="http://biojs.net/">http://biojs.net/</ext-link>
) [
<xref ref-type="bibr" rid="CR141">141</xref>
,
<xref ref-type="bibr" rid="CR142">142</xref>
].</p>
</sec>
</sec>
</sec>
<sec id="Sec11" sec-type="conclusion">
<title>Conclusion</title>
<p>Data heterogeneity is one of the biggest challenges in biological data integration. This could be solved with standardising the data structures that are being used. Biologists should get more involved with the aspects described here and working with bioinformaticians and computational scientists to achieve uniformity of their data. With this issue resolved, integration of biological data will greatly boost biological research and the field will gain a more robust structure: computational scientists will be responsible for maintaining and improving the infrastructure of the data; bioinformaticians will be able to build upon this infrastructure; biologists will be able to do research with advanced tools without the overhead of getting acquainted with complex topics of database management and programming tools.</p>
</sec>
</body>
<back>
<fn-group>
<fn>
<p>
<bold>Competing interests</bold>
</p>
<p>The authors declare that they have no competing interests.</p>
</fn>
<fn>
<p>
<bold>Authors’ contributions</bold>
</p>
<p>VL: worked on most of the writing, literature review, all illustrations and contributed to the design of this paper. MS: edited the paper and provided suggestions. RCJ: contributed to the specific aspects related to existing data integration methodologies and key references. AV: contributed with writing some specific sections and bringing the perspective of the biology readership as well as editing the manuscript. MVS worked on the design of the manuscript and some of the writing. All authors read and approved the final manuscript.</p>
</fn>
</fn-group>
<ack>
<title>Acknowledgements</title>
<p>We like to thank The Genome Analysis Centre (TGAC, Norwich, UK) and the Biotechnology and Biological Sciences Research Council (BBSRC, UK). AV acknowledges the King Abdullah University of Science and Technology (KAUST) Award No. KUK-I1-012-43 for funding support.</p>
</ack>
<ref-list id="Bib1">
<title>References</title>
<ref id="CR1">
<label>1</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Stamatoyannopoulos</surname>
<given-names>JA</given-names>
</name>
<name>
<surname>Snyder</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Hardison</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Ren</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Gingeras</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Gilbert</surname>
<given-names>DM</given-names>
</name>
<etal></etal>
</person-group>
<article-title>An encyclopedia of mouse dna elements (mouse encode)</article-title>
<source>Genome Biol</source>
<year>2012</year>
<volume>13</volume>
<issue>8</issue>
<fpage>418</fpage>
<pub-id pub-id-type="doi">10.1186/gb-2012-13-8-418</pub-id>
<pub-id pub-id-type="pmid">22889292</pub-id>
</element-citation>
</ref>
<ref id="CR2">
<label>2</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gomez-Cabrero</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Abugessaisa</surname>
<given-names>I</given-names>
</name>
<name>
<surname>Maier</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Teschendorff</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Merkenschlager</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Gisel</surname>
<given-names>A</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Data integration in the era of omics: current and future challenges</article-title>
<source>BMC Syst Biol</source>
<year>2014</year>
<volume>8</volume>
<issue>Suppl 2</issue>
<fpage>1</fpage>
<pub-id pub-id-type="doi">10.1186/1752-0509-8-S2-I1</pub-id>
<pub-id pub-id-type="pmid">24393148</pub-id>
</element-citation>
</ref>
<ref id="CR3">
<label>3</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ma’ayan</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Rouillard</surname>
<given-names>AD</given-names>
</name>
<name>
<surname>Clark</surname>
<given-names>NR</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Duan</surname>
<given-names>Q</given-names>
</name>
<name>
<surname>Kou</surname>
<given-names>Y</given-names>
</name>
</person-group>
<article-title>Lean big data integration in systems biology and systems pharmacology</article-title>
<source>Trends Pharmacol Sci</source>
<year>2014</year>
<volume>35</volume>
<issue>9</issue>
<fpage>450</fpage>
<lpage>60</lpage>
<pub-id pub-id-type="doi">10.1016/j.tips.2014.07.001</pub-id>
<pub-id pub-id-type="pmid">25109570</pub-id>
</element-citation>
</ref>
<ref id="CR4">
<label>4</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ritchie</surname>
<given-names>MD</given-names>
</name>
<name>
<surname>Holzinger</surname>
<given-names>ER</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Pendergrass</surname>
<given-names>SA</given-names>
</name>
<name>
<surname>Kim</surname>
<given-names>D</given-names>
</name>
</person-group>
<article-title>Methods of integrating data to uncover genotype-phenotype interactions</article-title>
<source>Nat Rev Genet</source>
<year>2015</year>
<volume>16</volume>
<issue>2</issue>
<fpage>85</fpage>
<lpage>97</lpage>
<pub-id pub-id-type="doi">10.1038/nrg3868</pub-id>
<pub-id pub-id-type="pmid">25582081</pub-id>
</element-citation>
</ref>
<ref id="CR5">
<label>5</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Warde-Farley</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Donaldson</surname>
<given-names>SL</given-names>
</name>
<name>
<surname>Comes</surname>
<given-names>O</given-names>
</name>
<name>
<surname>Zuberi</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Badrawi</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Chao</surname>
<given-names>P</given-names>
</name>
<etal></etal>
</person-group>
<article-title>The GeneMANIA prediction server: biological network integration for gene prioritization and predicting gene function</article-title>
<source>Nucleic Acids Res</source>
<year>2010</year>
<volume>38</volume>
<issue>Web Server issue</issue>
<fpage>214</fpage>
<lpage>20</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gkq537</pub-id>
</element-citation>
</ref>
<ref id="CR6">
<label>6</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rieping</surname>
<given-names>W</given-names>
</name>
<name>
<surname>Habeck</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Bardiaux</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Bernard</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Malliavin</surname>
<given-names>TE</given-names>
</name>
<name>
<surname>Nilges</surname>
<given-names>M</given-names>
</name>
</person-group>
<article-title>ARIA2: automated NOE assignment and data integration in NMR structure calculation</article-title>
<source>Bioinformatics</source>
<year>2007</year>
<volume>23</volume>
<issue>3</issue>
<fpage>381</fpage>
<lpage>2</lpage>
<pub-id pub-id-type="doi">10.1093/bioinformatics/btl589</pub-id>
<pub-id pub-id-type="pmid">17121777</pub-id>
</element-citation>
</ref>
<ref id="CR7">
<label>7</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jansen</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Yu</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Greenbaum</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Kluger</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Krogan</surname>
<given-names>NJ</given-names>
</name>
<name>
<surname>Chung</surname>
<given-names>S</given-names>
</name>
<etal></etal>
</person-group>
<article-title>A bayesian networks approach for predicting protein-protein interactions from genomic data</article-title>
<source>Science</source>
<year>2003</year>
<volume>302</volume>
<issue>5644</issue>
<fpage>449</fpage>
<lpage>53</lpage>
<pub-id pub-id-type="doi">10.1126/science.1087361</pub-id>
<pub-id pub-id-type="pmid">14564010</pub-id>
</element-citation>
</ref>
<ref id="CR8">
<label>8</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hwang</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Rust</surname>
<given-names>AG</given-names>
</name>
<name>
<surname>Ramsey</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Smith</surname>
<given-names>JJ</given-names>
</name>
<name>
<surname>Leslie</surname>
<given-names>DM</given-names>
</name>
<name>
<surname>Weston</surname>
<given-names>AD</given-names>
</name>
<etal></etal>
</person-group>
<article-title>A data integration methodology for systems biology</article-title>
<source>Proc Natl Acad Sci U S A</source>
<year>2005</year>
<volume>102</volume>
<issue>48</issue>
<fpage>17296</fpage>
<lpage>301</lpage>
<pub-id pub-id-type="doi">10.1073/pnas.0508647102</pub-id>
<pub-id pub-id-type="pmid">16301537</pub-id>
</element-citation>
</ref>
<ref id="CR9">
<label>9</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Myers</surname>
<given-names>CL</given-names>
</name>
<name>
<surname>Troyanskaya</surname>
<given-names>OG</given-names>
</name>
</person-group>
<article-title>Context-sensitive data integration and prediction of biological networks</article-title>
<source>Bioinformatics</source>
<year>2007</year>
<volume>23</volume>
<issue>17</issue>
<fpage>2322</fpage>
<lpage>30</lpage>
<pub-id pub-id-type="doi">10.1093/bioinformatics/btm332</pub-id>
<pub-id pub-id-type="pmid">17599939</pub-id>
</element-citation>
</ref>
<ref id="CR10">
<label>10</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chung</surname>
<given-names>SY</given-names>
</name>
<name>
<surname>Wong</surname>
<given-names>L</given-names>
</name>
</person-group>
<article-title>Kleisli: a new tool for data integration in biology</article-title>
<source>Trends Biotechnol</source>
<year>1999</year>
<volume>17</volume>
<issue>9</issue>
<fpage>351</fpage>
<lpage>5</lpage>
<pub-id pub-id-type="doi">10.1016/S0167-7799(99)01342-6</pub-id>
<pub-id pub-id-type="pmid">10461180</pub-id>
</element-citation>
</ref>
<ref id="CR11">
<label>11</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Letunic</surname>
<given-names>I</given-names>
</name>
<name>
<surname>Copley</surname>
<given-names>RR</given-names>
</name>
<name>
<surname>Schmidt</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Ciccarelli</surname>
<given-names>FD</given-names>
</name>
<name>
<surname>Doerks</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Schultz</surname>
<given-names>J</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Smart 4.0: towards genomic data integration</article-title>
<source>Nucleic Acids Res</source>
<year>2004</year>
<volume>32</volume>
<issue>suppl 1</issue>
<fpage>142</fpage>
<lpage>4</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gkh088</pub-id>
</element-citation>
</ref>
<ref id="CR12">
<label>12</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Von Mering</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Jensen</surname>
<given-names>LJ</given-names>
</name>
<name>
<surname>Kuhn</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Chaffron</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Doerks</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Krüger</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Snel</surname>
<given-names>B</given-names>
</name>
<etal></etal>
</person-group>
<article-title>String 7—recent developments in the integration and prediction of protein interactions</article-title>
<source>Nucleic Acids Res</source>
<year>2007</year>
<volume>35</volume>
<issue>suppl 1</issue>
<fpage>358</fpage>
<lpage>62</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gkl825</pub-id>
</element-citation>
</ref>
<ref id="CR13">
<label>13</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cheung</surname>
<given-names>K-H</given-names>
</name>
<name>
<surname>Yip</surname>
<given-names>KY</given-names>
</name>
<name>
<surname>Smith</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Masiar</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Gerstein</surname>
<given-names>M</given-names>
</name>
</person-group>
<article-title>Yeasthub: a semantic web use case for integrating data in the life sciences domain</article-title>
<source>Bioinformatics</source>
<year>2005</year>
<volume>21</volume>
<issue>suppl 1</issue>
<fpage>85</fpage>
<lpage>96</lpage>
<pub-id pub-id-type="doi">10.1093/bioinformatics/bti1026</pub-id>
</element-citation>
</ref>
<ref id="CR14">
<label>14</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Goldovsky</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Janssen</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Ahren</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Audit</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Cases</surname>
<given-names>I</given-names>
</name>
<name>
<surname>Darzentas</surname>
<given-names>N</given-names>
</name>
<etal></etal>
</person-group>
<article-title>CoGenT++: an extensive and extensible data environment for computational genomics</article-title>
<source>Bioinformatics</source>
<year>2005</year>
<volume>21</volume>
<issue>19</issue>
<fpage>3806</fpage>
<lpage>810</lpage>
<pub-id pub-id-type="doi">10.1093/bioinformatics/bti579</pub-id>
<pub-id pub-id-type="pmid">16216832</pub-id>
</element-citation>
</ref>
<ref id="CR15">
<label>15</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kauppinen</surname>
<given-names>T</given-names>
</name>
<name>
<surname>de Espindola</surname>
<given-names>GM</given-names>
</name>
</person-group>
<article-title>Linked open science-communicating, sharing and evaluating data, methods and results for executable papers</article-title>
<source>Procedia Comput Sci</source>
<year>2011</year>
<volume>4</volume>
<fpage>726</fpage>
<lpage>31</lpage>
<pub-id pub-id-type="doi">10.1016/j.procs.2011.04.076</pub-id>
</element-citation>
</ref>
<ref id="CR16">
<label>16</label>
<mixed-citation publication-type="other">Neylon C, Wu S. Open science: tools, approaches, and implications: 2008. p. 540–4. doi:
<ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.1038/npre.2008.1633.1">10.1038/npre.2008.1633.1</ext-link>
.</mixed-citation>
</ref>
<ref id="CR17">
<label>17</label>
<mixed-citation publication-type="other">Gentleman R, Temple Lang D. Statistical analyses and reproducible research. In: Bioconductor Project Working Papers. Working Paper 2: 2004.
<ext-link ext-link-type="uri" xlink:href="http://biostats.bepress.com/bioconductor/paper2">http://biostats.bepress.com/bioconductor/paper2</ext-link>
.</mixed-citation>
</ref>
<ref id="CR18">
<label>18</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chamberlain</surname>
<given-names>SA</given-names>
</name>
<name>
<surname>Szöcs</surname>
<given-names>E</given-names>
</name>
</person-group>
<article-title>taxize: taxonomic search and retrieval in R</article-title>
<source>F1000Res</source>
<year>2013</year>
<volume>2</volume>
<fpage>191</fpage>
<pub-id pub-id-type="pmid">24555091</pub-id>
</element-citation>
</ref>
<ref id="CR19">
<label>19</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Juty</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Ali</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Glont</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Keating</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Rodriguez</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Swat</surname>
<given-names>M</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Biomodels: Content, features, functionality, and use</article-title>
<source>CPT: Pharmacometrics Syst Pharmacol</source>
<year>2015</year>
<volume>4</volume>
<issue>2</issue>
<fpage>1</fpage>
<lpage>14</lpage>
</element-citation>
</ref>
<ref id="CR20">
<label>20</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kenall</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Edmunds</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Goodman</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Bal</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Flintoft</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Shanahan</surname>
<given-names>DR</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Better reporting for better research: a checklist for reproducibility</article-title>
<source>BMC Neurosci</source>
<year>2015</year>
<volume>16</volume>
<issue>1</issue>
<fpage>44</fpage>
<pub-id pub-id-type="doi">10.1186/s12868-015-0177-z</pub-id>
<pub-id pub-id-type="pmid">26202681</pub-id>
</element-citation>
</ref>
<ref id="CR21">
<label>21</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Garijo</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Kinnings</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Xie</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Xie</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Bourne</surname>
<given-names>PE</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Quantifying reproducibility in computational biology: the case of the tuberculosis drugome</article-title>
<source>PLoS ONE</source>
<year>2013</year>
<volume>8</volume>
<issue>11</issue>
<fpage>80278</fpage>
<pub-id pub-id-type="doi">10.1371/journal.pone.0080278</pub-id>
</element-citation>
</ref>
<ref id="CR22">
<label>22</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Saleem</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Kamdar</surname>
<given-names>MR</given-names>
</name>
<name>
<surname>Iqbal</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Sampath</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Deus</surname>
<given-names>HF</given-names>
</name>
<name>
<surname>Ngomo</surname>
<given-names>A-CN</given-names>
</name>
</person-group>
<article-title>Big linked cancer data: Integrating linked tcga and pubmed</article-title>
<source>Web Semant Sci Serv Agents World Wide Web</source>
<year>2014</year>
<volume>27</volume>
<fpage>34</fpage>
<lpage>41</lpage>
<pub-id pub-id-type="doi">10.1016/j.websem.2014.07.004</pub-id>
</element-citation>
</ref>
<ref id="CR23">
<label>23</label>
<mixed-citation publication-type="other">Kadadi A, Agrawal R, Nyamful C, Atiq R. Challenges of data integration and interoperability in big data. In: Big Data (Big Data), 2014 IEEE International Conference On. IEEE: 2014. p. 38–40.</mixed-citation>
</ref>
<ref id="CR24">
<label>24</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wandelt</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Rheinländer</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Bux</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Thalheim</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Haldemann</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Leser</surname>
<given-names>U</given-names>
</name>
</person-group>
<article-title>Data management challenges in next generation sequencing</article-title>
<source>Datenbank-Spektrum</source>
<year>2012</year>
<volume>12</volume>
<issue>3</issue>
<fpage>161</fpage>
<lpage>71</lpage>
<pub-id pub-id-type="doi">10.1007/s13222-012-0098-2</pub-id>
</element-citation>
</ref>
<ref id="CR25">
<label>25</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nekrutenko</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Taylor</surname>
<given-names>J</given-names>
</name>
</person-group>
<article-title>Next-generation sequencing data interpretation: enhancing reproducibility and accessibility</article-title>
<source>Nat Rev Genet</source>
<year>2012</year>
<volume>13</volume>
<issue>9</issue>
<fpage>667</fpage>
<lpage>72</lpage>
<pub-id pub-id-type="doi">10.1038/nrg3305</pub-id>
<pub-id pub-id-type="pmid">22898652</pub-id>
</element-citation>
</ref>
<ref id="CR26">
<label>26</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bravo</surname>
<given-names>E</given-names>
</name>
<name>
<surname>Calzolari</surname>
<given-names>A</given-names>
</name>
<name>
<surname>De Castro</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Mabile</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Napolitani</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Rossi</surname>
<given-names>AM</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Developing a guideline to standardize the citation of bioresources in journal articles (cobra)</article-title>
<source>BMC Medicine</source>
<year>2015</year>
<volume>13</volume>
<issue>1</issue>
<fpage>33</fpage>
<pub-id pub-id-type="doi">10.1186/s12916-015-0266-y</pub-id>
<pub-id pub-id-type="pmid">25855867</pub-id>
</element-citation>
</ref>
<ref id="CR27">
<label>27</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mabile</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Dalgleish</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Thorisson</surname>
<given-names>GA</given-names>
</name>
<name>
<surname>Deschênes</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Hewitt</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Carpenter</surname>
<given-names>J</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Quantifying the use of bioresources for promoting their sharing in scientific research</article-title>
<source>GigaScience</source>
<year>2013</year>
<volume>2</volume>
<issue>1</issue>
<fpage>1</fpage>
<lpage>8</lpage>
<pub-id pub-id-type="doi">10.1186/2047-217X-2-7</pub-id>
<pub-id pub-id-type="pmid">23587291</pub-id>
</element-citation>
</ref>
<ref id="CR28">
<label>28</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Goble</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Stevens</surname>
<given-names>R</given-names>
</name>
</person-group>
<article-title>State of the nation in data integration for bioinformatics</article-title>
<source>J Biomed Inform</source>
<year>2008</year>
<volume>41</volume>
<issue>5</issue>
<fpage>687</fpage>
<lpage>93</lpage>
<pub-id pub-id-type="doi">10.1016/j.jbi.2008.01.008</pub-id>
<pub-id pub-id-type="pmid">18358788</pub-id>
</element-citation>
</ref>
<ref id="CR29">
<label>29</label>
<mixed-citation publication-type="other">Widom J. Integrating heterogeneous databases: Lazy or eager?ACM Comput Surv. 1996; 28(4es). doi:
<ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.1145/242224.242344">10.1145/242224.242344</ext-link>
.</mixed-citation>
</ref>
<ref id="CR30">
<label>30</label>
<element-citation publication-type="book">
<person-group person-group-type="author">
<name>
<surname>Widom</surname>
<given-names>J</given-names>
</name>
</person-group>
<article-title>Research problems in data warehousing</article-title>
<source>Proceedings of the Fourth International Conference on Information and Knowledge Management, CIKM ’95</source>
<year>1995</year>
<publisher-loc>New York, NY, USA</publisher-loc>
<publisher-name>ACM</publisher-name>
</element-citation>
</ref>
<ref id="CR31">
<label>31</label>
<mixed-citation publication-type="other">Gupta A, Widom J. Local verification of global integrity constraints in distributed databases. In: ACM SIGMOD International Conference on Management of Data (SIGMOD 1993): 1993.
<ext-link ext-link-type="uri" xlink:href="http://ilpubs.stanford.edu:8090/20/">http://ilpubs.stanford.edu:8090/20/</ext-link>
.</mixed-citation>
</ref>
<ref id="CR32">
<label>32</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhuge</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>García-Molina</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Hammer</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Widom</surname>
<given-names>J</given-names>
</name>
</person-group>
<article-title>View maintenance in a warehousing environment</article-title>
<source>SIGMOD Rec</source>
<year>1995</year>
<volume>24</volume>
<issue>2</issue>
<fpage>316</fpage>
<lpage>27</lpage>
<pub-id pub-id-type="doi">10.1145/568271.223848</pub-id>
</element-citation>
</ref>
<ref id="CR33">
<label>33</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ives</surname>
<given-names>ZG</given-names>
</name>
<name>
<surname>Florescu</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Friedman</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Levy</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Weld</surname>
<given-names>DS</given-names>
</name>
</person-group>
<article-title>An adaptive query execution system for data integration</article-title>
<source>SIGMOD Rec</source>
<year>1999</year>
<volume>28</volume>
<issue>2</issue>
<fpage>299</fpage>
<lpage>310</lpage>
<pub-id pub-id-type="doi">10.1145/304181.304209</pub-id>
</element-citation>
</ref>
<ref id="CR34">
<label>34</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Halevy</surname>
<given-names>AY</given-names>
</name>
</person-group>
<article-title>Answering queries using views: A survey</article-title>
<source>VLDB J</source>
<year>2001</year>
<volume>10</volume>
<issue>4</issue>
<fpage>270</fpage>
<lpage>94</lpage>
<pub-id pub-id-type="doi">10.1007/s007780100054</pub-id>
</element-citation>
</ref>
<ref id="CR35">
<label>35</label>
<mixed-citation publication-type="other">Calvanese D, De Giacomo G, Lenzerini M, Vardi MY. Answering regular path queries using views. In: Proc. of the 16th IEEE Int. Conf. on data engineering (ICDE). IEEE: 2000. p. 389–98.</mixed-citation>
</ref>
<ref id="CR36">
<label>36</label>
<element-citation publication-type="book">
<person-group person-group-type="author">
<name>
<surname>Abiteboul</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Duschka</surname>
<given-names>OM</given-names>
</name>
</person-group>
<article-title>Complexity of answering queries using materialized views</article-title>
<source>Proceedings of the Seventeenth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, PODS ’98</source>
<year>1998</year>
<publisher-loc>New York, NY, USA</publisher-loc>
<publisher-name>ACM</publisher-name>
</element-citation>
</ref>
<ref id="CR37">
<label>37</label>
<element-citation publication-type="book">
<person-group person-group-type="author">
<name>
<surname>Levy</surname>
<given-names>AY</given-names>
</name>
</person-group>
<article-title>Obtaining complete answers from incomplete databases</article-title>
<source>Proceedings of the 22th International Conference on Very Large Data Bases, VLDB ’96</source>
<year>1996</year>
<publisher-loc>San Francisco, CA, USA</publisher-loc>
<publisher-name>Morgan Kaufmann Publishers Inc.</publisher-name>
</element-citation>
</ref>
<ref id="CR38">
<label>38</label>
<element-citation publication-type="book">
<person-group person-group-type="editor">
<name>
<surname>Beeri</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Buneman</surname>
<given-names>P</given-names>
</name>
</person-group>
<source>Tableau techniques for querying information sources through global schemas</source>
<year>1999</year>
<publisher-loc>Berlin Heidelberg</publisher-loc>
<publisher-name>Springer</publisher-name>
</element-citation>
</ref>
<ref id="CR39">
<label>39</label>
<mixed-citation publication-type="other">van der Meyden R. Logics for Databases and Information Systems. vol. 10 In: Chomicki J, Saake G, editors. Kluwer: 1998. p. 307–56.</mixed-citation>
</ref>
<ref id="CR40">
<label>40</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Etzioni</surname>
<given-names>O</given-names>
</name>
<name>
<surname>Golden</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Weld</surname>
<given-names>DS</given-names>
</name>
</person-group>
<article-title>Sound and efficient closed-world reasoning for planning</article-title>
<source>Artif Intell</source>
<year>1997</year>
<volume>89</volume>
<issue>1–2</issue>
<fpage>113</fpage>
<lpage>48</lpage>
<pub-id pub-id-type="doi">10.1016/S0004-3702(96)00026-4</pub-id>
</element-citation>
</ref>
<ref id="CR41">
<label>41</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Smedley</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Haider</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Ballester</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Holland</surname>
<given-names>R</given-names>
</name>
<name>
<surname>London</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Thorisson</surname>
<given-names>G</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Biomart–biological queries made easy</article-title>
<source>BMC Genomics</source>
<year>2009</year>
<volume>10</volume>
<issue>1</issue>
<fpage>22</fpage>
<pub-id pub-id-type="doi">10.1186/1471-2164-10-22</pub-id>
<pub-id pub-id-type="pmid">19144180</pub-id>
</element-citation>
</ref>
<ref id="CR42">
<label>42</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Etzold</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Argos</surname>
<given-names>P</given-names>
</name>
</person-group>
<article-title>SRS–an indexing and retrieval tool for flat file data libraries</article-title>
<source>Comput Appl Biosci</source>
<year>1993</year>
<volume>9</volume>
<issue>1</issue>
<fpage>49</fpage>
<lpage>57</lpage>
<pub-id pub-id-type="pmid">8435768</pub-id>
</element-citation>
</ref>
<ref id="CR43">
<label>43</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Belleau</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Nolin</surname>
<given-names>MA</given-names>
</name>
<name>
<surname>Tourigny</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Rigault</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Morissette</surname>
<given-names>J</given-names>
</name>
</person-group>
<article-title>Bio2RDF: towards a mashup to build bioinformatics knowledge systems</article-title>
<source>J Biomed Inform</source>
<year>2008</year>
<volume>41</volume>
<issue>5</issue>
<fpage>706</fpage>
<lpage>16</lpage>
<pub-id pub-id-type="doi">10.1016/j.jbi.2008.03.004</pub-id>
<pub-id pub-id-type="pmid">18472304</pub-id>
</element-citation>
</ref>
<ref id="CR44">
<label>44</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bateman</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Martin</surname>
<given-names>MJ</given-names>
</name>
<name>
<surname>O’Donovan</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Magrane</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Apweiler</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Alpi</surname>
<given-names>E</given-names>
</name>
<etal></etal>
</person-group>
<article-title>UniProt: a hub for protein information</article-title>
<source>Nucleic Acids Res.</source>
<year>2015</year>
<volume>43</volume>
<issue>Database issue</issue>
<fpage>204</fpage>
<lpage>12</lpage>
</element-citation>
</ref>
<ref id="CR45">
<label>45</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Benson</surname>
<given-names>DA</given-names>
</name>
<name>
<surname>Clark</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Karsch-Mizrachi</surname>
<given-names>I</given-names>
</name>
<name>
<surname>Lipman</surname>
<given-names>DJ</given-names>
</name>
<name>
<surname>Ostell</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Sayers</surname>
<given-names>EW</given-names>
</name>
</person-group>
<article-title>GenBank</article-title>
<source>Nucleic Acids Res</source>
<year>2015</year>
<volume>43</volume>
<issue>Database issue</issue>
<fpage>30</fpage>
<lpage>5</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gku1216</pub-id>
</element-citation>
</ref>
<ref id="CR46">
<label>46</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cerami</surname>
<given-names>EG</given-names>
</name>
<name>
<surname>Gross</surname>
<given-names>BE</given-names>
</name>
<name>
<surname>Demir</surname>
<given-names>E</given-names>
</name>
<name>
<surname>Rodchenkov</surname>
<given-names>I</given-names>
</name>
<name>
<surname>Babur</surname>
<given-names>O</given-names>
</name>
<name>
<surname>Anwar</surname>
<given-names>N</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Pathway Commons, a web resource for biological pathway data</article-title>
<source>Nucleic Acids Res</source>
<year>2011</year>
<volume>39</volume>
<issue>Database issue</issue>
<fpage>685</fpage>
<lpage>90</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gkq1039</pub-id>
</element-citation>
</ref>
<ref id="CR47">
<label>47</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Artimo</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Jonnalagedda</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Arnold</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Baratin</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Csardi</surname>
<given-names>G</given-names>
</name>
<name>
<surname>de Castro</surname>
<given-names>E</given-names>
</name>
<etal></etal>
</person-group>
<article-title>ExPASy: SIB bioinformatics resource portal</article-title>
<source>Nucleic Acids Res</source>
<year>2012</year>
<volume>40</volume>
<issue>Web Server issue</issue>
<fpage>597</fpage>
<lpage>603</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gks400</pub-id>
</element-citation>
</ref>
<ref id="CR48">
<label>48</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Karp</surname>
<given-names>PD</given-names>
</name>
</person-group>
<article-title>Database links are a foundation for interoperability</article-title>
<source>Trends Biotechnol</source>
<year>1996</year>
<volume>14</volume>
<issue>8</issue>
<fpage>273</fpage>
<lpage>9</lpage>
<pub-id pub-id-type="doi">10.1016/0167-7799(96)10044-5</pub-id>
<pub-id pub-id-type="pmid">8987457</pub-id>
</element-citation>
</ref>
<ref id="CR49">
<label>49</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dowell</surname>
<given-names>RD</given-names>
</name>
<name>
<surname>Jokerst</surname>
<given-names>RM</given-names>
</name>
<name>
<surname>Day</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Eddy</surname>
<given-names>SR</given-names>
</name>
<name>
<surname>Stein</surname>
<given-names>L</given-names>
</name>
</person-group>
<article-title>The distributed annotation system</article-title>
<source>BMC Bioinformatics</source>
<year>2001</year>
<volume>2</volume>
<fpage>7</fpage>
<pub-id pub-id-type="doi">10.1186/1471-2105-2-7</pub-id>
<pub-id pub-id-type="pmid">11667947</pub-id>
</element-citation>
</ref>
<ref id="CR50">
<label>50</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gehlenborg</surname>
<given-names>N</given-names>
</name>
<name>
<surname>O’Donoghue</surname>
<given-names>SI</given-names>
</name>
<name>
<surname>Baliga</surname>
<given-names>NS</given-names>
</name>
<name>
<surname>Goesmann</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Hibbs</surname>
<given-names>MA</given-names>
</name>
<name>
<surname>Kitano</surname>
<given-names>H</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Visualization of omics data for systems biology</article-title>
<source>Nat Methods</source>
<year>2010</year>
<volume>7</volume>
<fpage>56</fpage>
<lpage>68</lpage>
<pub-id pub-id-type="doi">10.1038/nmeth.1436</pub-id>
<pub-id pub-id-type="pmid">20010831</pub-id>
</element-citation>
</ref>
<ref id="CR51">
<label>51</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Smith</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Ashburner</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Rosse</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Bard</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Bug</surname>
<given-names>W</given-names>
</name>
<name>
<surname>Ceusters</surname>
<given-names>W</given-names>
</name>
<etal></etal>
</person-group>
<article-title>The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration</article-title>
<source>Nat Biotechnol</source>
<year>2007</year>
<volume>25</volume>
<issue>11</issue>
<fpage>1251</fpage>
<lpage>1255</lpage>
<pub-id pub-id-type="doi">10.1038/nbt1346</pub-id>
<pub-id pub-id-type="pmid">17989687</pub-id>
</element-citation>
</ref>
<ref id="CR52">
<label>52</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Musen</surname>
<given-names>MA</given-names>
</name>
<name>
<surname>Noy</surname>
<given-names>NF</given-names>
</name>
<name>
<surname>Shah</surname>
<given-names>NH</given-names>
</name>
<name>
<surname>Whetzel</surname>
<given-names>PL</given-names>
</name>
<name>
<surname>Chute</surname>
<given-names>CG</given-names>
</name>
<name>
<surname>Story</surname>
<given-names>MA</given-names>
</name>
<etal></etal>
</person-group>
<article-title>NCBO team. The National Center for Biomedical Ontology</article-title>
<source>J Am Med Inform Assoc</source>
<year>2012</year>
<volume>19</volume>
<issue>2</issue>
<fpage>190</fpage>
<lpage>5</lpage>
<pub-id pub-id-type="doi">10.1136/amiajnl-2011-000523</pub-id>
<pub-id pub-id-type="pmid">22081220</pub-id>
</element-citation>
</ref>
<ref id="CR53">
<label>53</label>
<mixed-citation publication-type="other">Berjon R, Faulkner S, Leithead T, Pfeiffer S, O’Connor E, Navara ED. HTML5. Candidate recommendation, W3C. 2014.
<ext-link ext-link-type="uri" xlink:href="http://www.w3.org/TR/2014/CR-html5-20140731/">http://www.w3.org/TR/2014/CR-html5-20140731/</ext-link>
.</mixed-citation>
</ref>
<ref id="CR54">
<label>54</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gray</surname>
<given-names>KA</given-names>
</name>
<name>
<surname>Yates</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Seal</surname>
<given-names>RL</given-names>
</name>
<name>
<surname>Wright</surname>
<given-names>MW</given-names>
</name>
<name>
<surname>Bruford</surname>
<given-names>EA</given-names>
</name>
</person-group>
<article-title>genenames.org: the HGNC resources in 2015</article-title>
<source>Nucleic Acids Res</source>
<year>2015</year>
<volume>43</volume>
<issue>Database issue</issue>
<fpage>D1079</fpage>
<lpage>85</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gku1071</pub-id>
<pub-id pub-id-type="pmid">25361968</pub-id>
</element-citation>
</ref>
<ref id="CR55">
<label>55</label>
<mixed-citation publication-type="other">Kher S, Dickerson J, Rawat N. Biological pathway data integration trends, techniques, issues and challenges: A survey. In: Nature and Biologically Inspired Computing (NaBIC), 2010 Second World Congress On. IEEE: 2010. p. 177–82.</mixed-citation>
</ref>
<ref id="CR56">
<label>56</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mathew</surname>
<given-names>JP</given-names>
</name>
<name>
<surname>Taylor</surname>
<given-names>BS</given-names>
</name>
<name>
<surname>Bader</surname>
<given-names>GD</given-names>
</name>
<name>
<surname>Pyarajan</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Antoniotti</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Chinnaiyan</surname>
<given-names>AM</given-names>
</name>
<etal></etal>
</person-group>
<article-title>From bytes to bedside: Data integration and computational biology for translational cancer research</article-title>
<source>PLoS Comput Biol</source>
<year>2007</year>
<volume>3</volume>
<issue>2</issue>
<fpage>12</fpage>
<pub-id pub-id-type="doi">10.1371/journal.pcbi.0030012</pub-id>
</element-citation>
</ref>
<ref id="CR57">
<label>57</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Higgins</surname>
<given-names>S</given-names>
</name>
</person-group>
<article-title>The dcc curation lifecycle model</article-title>
<source>Int J Digital Curation</source>
<year>2008</year>
<volume>3</volume>
<issue>1</issue>
<fpage>134</fpage>
<lpage>40</lpage>
<pub-id pub-id-type="doi">10.2218/ijdc.v3i1.48</pub-id>
</element-citation>
</ref>
<ref id="CR58">
<label>58</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Field</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Sansone</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Delong</surname>
<given-names>EF</given-names>
</name>
<name>
<surname>Sterk</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Friedberg</surname>
<given-names>I</given-names>
</name>
<name>
<surname>Gaudet</surname>
<given-names>P</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Meeting Report: BioSharing at ISMB 2010</article-title>
<source>Stand Genomic Sci</source>
<year>2010</year>
<volume>3</volume>
<issue>3</issue>
<fpage>254</fpage>
<lpage>8</lpage>
<pub-id pub-id-type="doi">10.4056/sigs/1403501</pub-id>
<pub-id pub-id-type="pmid">21304729</pub-id>
</element-citation>
</ref>
<ref id="CR59">
<label>59</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Brazma</surname>
<given-names>A</given-names>
</name>
</person-group>
<article-title>On the importance of standardisation in life sciences</article-title>
<source>Bioinformatics</source>
<year>2001</year>
<volume>17</volume>
<issue>2</issue>
<fpage>113</fpage>
<lpage>4</lpage>
<pub-id pub-id-type="doi">10.1093/bioinformatics/17.2.113</pub-id>
<pub-id pub-id-type="pmid">11238066</pub-id>
</element-citation>
</ref>
<ref id="CR60">
<label>60</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Brooksbank</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Quackenbush</surname>
<given-names>J</given-names>
</name>
</person-group>
<article-title>Data standards: a call to action</article-title>
<source>OMICS</source>
<year>2006</year>
<volume>10</volume>
<issue>2</issue>
<fpage>94</fpage>
<lpage>9</lpage>
<pub-id pub-id-type="doi">10.1089/omi.2006.10.94</pub-id>
<pub-id pub-id-type="pmid">16901212</pub-id>
</element-citation>
</ref>
<ref id="CR61">
<label>61</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Piwowar</surname>
<given-names>HA</given-names>
</name>
<name>
<surname>Becich</surname>
<given-names>MJ</given-names>
</name>
<name>
<surname>Bilofsky</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Crowley</surname>
<given-names>RS</given-names>
</name>
</person-group>
<article-title>Towards a data sharing culture: recommendations for leadership from academic health centers</article-title>
<source>PLoS Med</source>
<year>2008</year>
<volume>5</volume>
<issue>9</issue>
<fpage>183</fpage>
<pub-id pub-id-type="doi">10.1371/journal.pmed.0050183</pub-id>
</element-citation>
</ref>
<ref id="CR62">
<label>62</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chervitz</surname>
<given-names>SA</given-names>
</name>
<name>
<surname>Deutsch</surname>
<given-names>EW</given-names>
</name>
<name>
<surname>Field</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Parkinson</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Quackenbush</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Rocca-Serra</surname>
<given-names>P</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Data standards for Omics data: the basis of data sharing and reuse</article-title>
<source>Methods Mol Biol</source>
<year>2011</year>
<volume>719</volume>
<fpage>31</fpage>
<lpage>69</lpage>
<pub-id pub-id-type="doi">10.1007/978-1-61779-027-0_2</pub-id>
<pub-id pub-id-type="pmid">21370078</pub-id>
</element-citation>
</ref>
<ref id="CR63">
<label>63</label>
<element-citation publication-type="book">
<person-group person-group-type="editor">
<name>
<surname>Popplewell</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Harding</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Poler</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Chalmeta</surname>
<given-names>R</given-names>
</name>
</person-group>
<source>Developing a science base for enterprise interoperability</source>
<year>2010</year>
<publisher-loc>London</publisher-loc>
<publisher-name>Springer</publisher-name>
</element-citation>
</ref>
<ref id="CR64">
<label>64</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bard</surname>
<given-names>JB</given-names>
</name>
<name>
<surname>Rhee</surname>
<given-names>SY</given-names>
</name>
</person-group>
<article-title>Ontologies in biology: design, applications and future challenges</article-title>
<source>Nat Rev Genet</source>
<year>2004</year>
<volume>5</volume>
<issue>3</issue>
<fpage>213</fpage>
<lpage>22</lpage>
<pub-id pub-id-type="doi">10.1038/nrg1295</pub-id>
<pub-id pub-id-type="pmid">14970823</pub-id>
</element-citation>
</ref>
<ref id="CR65">
<label>65</label>
<element-citation publication-type="book">
<person-group person-group-type="author">
<name>
<surname>Smith</surname>
<given-names>B</given-names>
</name>
</person-group>
<article-title>The logic of biological classification and the foundations of biomedical ontology</article-title>
<source>Invited Papers from the 10th International Conference in Logic Methodology and Philosophy of Science</source>
<year>2003</year>
<publisher-loc>Amsterdam</publisher-loc>
<publisher-name>Elsevier-North-Holland</publisher-name>
</element-citation>
</ref>
<ref id="CR66">
<label>66</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chandrasekaran</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Josephson</surname>
<given-names>JR</given-names>
</name>
<name>
<surname>Benjamins</surname>
<given-names>VR</given-names>
</name>
</person-group>
<article-title>What are ontologies, and why do we need them?</article-title>
<source>IEEE Intell Syst</source>
<year>1999</year>
<volume>14</volume>
<issue>1</issue>
<fpage>20</fpage>
<lpage>6</lpage>
<pub-id pub-id-type="doi">10.1109/5254.747902</pub-id>
</element-citation>
</ref>
<ref id="CR67">
<label>67</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mayer</surname>
<given-names>G</given-names>
</name>
<name>
<surname>Jones</surname>
<given-names>AR</given-names>
</name>
<name>
<surname>Binz</surname>
<given-names>P-A</given-names>
</name>
<name>
<surname>Deutsch</surname>
<given-names>EW</given-names>
</name>
<name>
<surname>Orchard</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Montecchi-Palazzi</surname>
<given-names>L</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Controlled vocabularies and ontologies in proteomics: overview, principles and practice</article-title>
<source>Biochim Biophys Acta (BBA) Protein Proteomics</source>
<year>2014</year>
<volume>1844</volume>
<issue>1</issue>
<fpage>98</fpage>
<lpage>107</lpage>
<pub-id pub-id-type="doi">10.1016/j.bbapap.2013.02.017</pub-id>
</element-citation>
</ref>
<ref id="CR68">
<label>68</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Blake</surname>
<given-names>JA</given-names>
</name>
<name>
<surname>Bult</surname>
<given-names>CJ</given-names>
</name>
</person-group>
<article-title>Beyond the data deluge: data integration and bio-ontologies</article-title>
<source>J Biomed Inform</source>
<year>2006</year>
<volume>39</volume>
<issue>3</issue>
<fpage>314</fpage>
<lpage>20</lpage>
<pub-id pub-id-type="doi">10.1016/j.jbi.2006.01.003</pub-id>
<pub-id pub-id-type="pmid">16564748</pub-id>
</element-citation>
</ref>
<ref id="CR69">
<label>69</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Whetzel</surname>
<given-names>PL</given-names>
</name>
</person-group>
<article-title>NCBO Technology: Powering semantically aware applications</article-title>
<source>J Biomed Semantics</source>
<year>2013</year>
<volume>4</volume>
<issue>Suppl 1</issue>
<fpage>8</fpage>
<pub-id pub-id-type="doi">10.1186/2041-1480-4-S1-S8</pub-id>
<pub-id pub-id-type="pmid">23497538</pub-id>
</element-citation>
</ref>
<ref id="CR70">
<label>70</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jonquet</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Lependu</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Falconer</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Coulet</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Noy</surname>
<given-names>NF</given-names>
</name>
<name>
<surname>Musen</surname>
<given-names>MA</given-names>
</name>
<etal></etal>
</person-group>
<article-title>NCBO Resource Index: Ontology-Based Search and Mining of Biomedical Resources</article-title>
<source>Web Semant</source>
<year>2011</year>
<volume>9</volume>
<issue>3</issue>
<fpage>316</fpage>
<lpage>24</lpage>
<pub-id pub-id-type="doi">10.1016/j.websem.2011.06.005</pub-id>
<pub-id pub-id-type="pmid">21918645</pub-id>
</element-citation>
</ref>
<ref id="CR71">
<label>71</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cote</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Reisinger</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Martens</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Barsnes</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Vizcaino</surname>
<given-names>JA</given-names>
</name>
<name>
<surname>Hermjakob</surname>
<given-names>H</given-names>
</name>
</person-group>
<article-title>The Ontology Lookup Service: bigger and better</article-title>
<source>Nucleic Acids Res</source>
<year>2010</year>
<volume>38</volume>
<issue>Web Server issue</issue>
<fpage>155</fpage>
<lpage>60</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gkq331</pub-id>
</element-citation>
</ref>
<ref id="CR72">
<label>72</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Corpas</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Fatumo</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Schneider</surname>
<given-names>R</given-names>
</name>
</person-group>
<article-title>How not to be a bioinformatician</article-title>
<source>Source Code Biol Med</source>
<year>2012</year>
<volume>7</volume>
<issue>1</issue>
<fpage>3</fpage>
<pub-id pub-id-type="doi">10.1186/1751-0473-7-3</pub-id>
<pub-id pub-id-type="pmid">22640778</pub-id>
</element-citation>
</ref>
<ref id="CR73">
<label>73</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Baker</surname>
<given-names>M</given-names>
</name>
</person-group>
<article-title>Next-generation sequencing: adjusting to data overload</article-title>
<source>Nat Methods</source>
<year>2010</year>
<volume>7</volume>
<issue>7</issue>
<fpage>495</fpage>
<lpage>9</lpage>
<pub-id pub-id-type="doi">10.1038/nmeth0710-495</pub-id>
</element-citation>
</ref>
<ref id="CR74">
<label>74</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Field</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Garrity</surname>
<given-names>G</given-names>
</name>
<name>
<surname>Gray</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Morrison</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Selengut</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Sterk</surname>
<given-names>P</given-names>
</name>
<etal></etal>
</person-group>
<article-title>The minimum information about a genome sequence (MIGS) specification</article-title>
<source>Nat Biotechnol</source>
<year>2008</year>
<volume>26</volume>
<issue>5</issue>
<fpage>541</fpage>
<lpage>7</lpage>
<pub-id pub-id-type="doi">10.1038/nbt1360</pub-id>
<pub-id pub-id-type="pmid">18464787</pub-id>
</element-citation>
</ref>
<ref id="CR75">
<label>75</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Parnell</surname>
<given-names>LD</given-names>
</name>
<name>
<surname>Lindenbaum</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Shameer</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Dall’Olio</surname>
<given-names>GM</given-names>
</name>
<name>
<surname>Swan</surname>
<given-names>DC</given-names>
</name>
<name>
<surname>Jensen</surname>
<given-names>LJ</given-names>
</name>
<etal></etal>
</person-group>
<article-title>BioStar: an online question & answer resource for the bioinformatics community</article-title>
<source>PLoS Comput Biol</source>
<year>2011</year>
<volume>7</volume>
<issue>10</issue>
<fpage>1002216</fpage>
<pub-id pub-id-type="doi">10.1371/journal.pcbi.1002216</pub-id>
</element-citation>
</ref>
<ref id="CR76">
<label>76</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Achard</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Vaysseix</surname>
<given-names>G</given-names>
</name>
<name>
<surname>Barillot</surname>
<given-names>E</given-names>
</name>
</person-group>
<article-title>Xml, bioinformatics and data integration</article-title>
<source>Bioinformatics</source>
<year>2001</year>
<volume>17</volume>
<issue>2</issue>
<fpage>115</fpage>
<lpage>25</lpage>
<pub-id pub-id-type="doi">10.1093/bioinformatics/17.2.115</pub-id>
<pub-id pub-id-type="pmid">11238067</pub-id>
</element-citation>
</ref>
<ref id="CR77">
<label>77</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Barsnes</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Vizcaino</surname>
<given-names>JA</given-names>
</name>
<name>
<surname>Eidhammer</surname>
<given-names>I</given-names>
</name>
<name>
<surname>Martens</surname>
<given-names>L</given-names>
</name>
</person-group>
<article-title>Pride converter: making proteomics data-sharing easy</article-title>
<source>Nat Biotechnol</source>
<year>2009</year>
<volume>27</volume>
<issue>7</issue>
<fpage>598</fpage>
<lpage>9</lpage>
<pub-id pub-id-type="doi">10.1038/nbt0709-598</pub-id>
<pub-id pub-id-type="pmid">19587657</pub-id>
</element-citation>
</ref>
<ref id="CR78">
<label>78</label>
<mixed-citation publication-type="other">Bray T, Sperberg-McQueen M, Paoli J, Yergeau F, Maler E. Extensible markup language (XML) 1.0 (third edition). W3C recommendation, W3C: (February 2004).
<ext-link ext-link-type="uri" xlink:href="http://www.w3.org/TR/2004/REC-xml-20040204">http://www.w3.org/TR/2004/REC-xml-20040204</ext-link>
.</mixed-citation>
</ref>
<ref id="CR79">
<label>79</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Martens</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Hermjakob</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Jones</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Adamski</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Taylor</surname>
<given-names>C</given-names>
</name>
<name>
<surname>States</surname>
<given-names>D</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Pride: the proteomics identifications database</article-title>
<source>Proteomics</source>
<year>2005</year>
<volume>5</volume>
<issue>13</issue>
<fpage>3537</fpage>
<lpage>45</lpage>
<pub-id pub-id-type="doi">10.1002/pmic.200401303</pub-id>
<pub-id pub-id-type="pmid">16041671</pub-id>
</element-citation>
</ref>
<ref id="CR80">
<label>80</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Benson</surname>
<given-names>DA</given-names>
</name>
<name>
<surname>Karsch-Mizrachi</surname>
<given-names>I</given-names>
</name>
<name>
<surname>Lipman</surname>
<given-names>DJ</given-names>
</name>
<name>
<surname>Ostell</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Rapp</surname>
<given-names>BA</given-names>
</name>
<name>
<surname>Wheeler</surname>
<given-names>DL</given-names>
</name>
</person-group>
<article-title>GenBank</article-title>
<source>Nucleic Acids Res</source>
<year>2000</year>
<volume>28</volume>
<issue>1</issue>
<fpage>15</fpage>
<lpage>18</lpage>
<pub-id pub-id-type="doi">10.1093/nar/28.1.15</pub-id>
<pub-id pub-id-type="pmid">10592170</pub-id>
</element-citation>
</ref>
<ref id="CR81">
<label>81</label>
<mixed-citation publication-type="other">Karp PD. A protocol for maintaining multidatabase referential integrity. Pac Symp Biocomput. 1996:438–45.</mixed-citation>
</ref>
<ref id="CR82">
<label>82</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Apweiler</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Bairoch</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>CH</given-names>
</name>
<name>
<surname>Barker</surname>
<given-names>WC</given-names>
</name>
<name>
<surname>Boeckmann</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Ferro</surname>
<given-names>S</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Uniprot: the universal protein knowledgebase</article-title>
<source>Nucleic Acids Res</source>
<year>2004</year>
<volume>32</volume>
<issue>Suppl 1</issue>
<fpage>115</fpage>
<lpage>9</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gkh131</pub-id>
<pub-id pub-id-type="pmid">14704348</pub-id>
</element-citation>
</ref>
<ref id="CR83">
<label>83</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cunningham</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Amode</surname>
<given-names>MR</given-names>
</name>
<name>
<surname>Barrell</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Beal</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Billis</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Brent</surname>
<given-names>S</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Ensembl 2015</article-title>
<source>Nucleic Acids Res</source>
<year>2015</year>
<volume>43</volume>
<issue>Database issue</issue>
<fpage>662</fpage>
<lpage>9</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gku1010</pub-id>
</element-citation>
</ref>
<ref id="CR84">
<label>84</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pruitt</surname>
<given-names>KD</given-names>
</name>
<name>
<surname>Tatusova</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Maglott</surname>
<given-names>DR</given-names>
</name>
</person-group>
<article-title>NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins</article-title>
<source>Nucleic Acids Res</source>
<year>2007</year>
<volume>35</volume>
<issue>Database issue</issue>
<fpage>61</fpage>
<lpage>5</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gkl842</pub-id>
</element-citation>
</ref>
<ref id="CR85">
<label>85</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Juty</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Le Novere</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Laibe</surname>
<given-names>C</given-names>
</name>
</person-group>
<article-title>Identifiers.org and MIRIAM Registry: community resources to provide persistent identification</article-title>
<source>Nucleic Acids Res</source>
<year>2012</year>
<volume>40</volume>
<issue>Database issue</issue>
<fpage>580</fpage>
<lpage>6</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gkr1097</pub-id>
</element-citation>
</ref>
<ref id="CR86">
<label>86</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cote</surname>
<given-names>RG</given-names>
</name>
<name>
<surname>Jones</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Martens</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Kerrien</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Reisinger</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Lin</surname>
<given-names>Q</given-names>
</name>
<etal></etal>
</person-group>
<article-title>The Protein Identifier Cross-Referencing (PICR) service: reconciling protein identifiers across multiple source databases</article-title>
<source>BMC Bioinformatics</source>
<year>2007</year>
<volume>8</volume>
<fpage>401</fpage>
<pub-id pub-id-type="doi">10.1186/1471-2105-8-401</pub-id>
<pub-id pub-id-type="pmid">17945017</pub-id>
</element-citation>
</ref>
<ref id="CR87">
<label>87</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Huang</surname>
<given-names>daW</given-names>
</name>
<name>
<surname>Sherman</surname>
<given-names>BT</given-names>
</name>
<name>
<surname>Stephens</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Baseler</surname>
<given-names>MW</given-names>
</name>
<name>
<surname>Lane</surname>
<given-names>HC</given-names>
</name>
<name>
<surname>Lempicki</surname>
<given-names>RA</given-names>
</name>
</person-group>
<article-title>DAVID gene ID conversion tool</article-title>
<source>Bioinformation</source>
<year>2008</year>
<volume>2</volume>
<issue>10</issue>
<fpage>428</fpage>
<lpage>30</lpage>
<pub-id pub-id-type="doi">10.6026/97320630002428</pub-id>
<pub-id pub-id-type="pmid">18841237</pub-id>
</element-citation>
</ref>
<ref id="CR88">
<label>88</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Orchard</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Kerrien</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Abbani</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Aranda</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Bhate</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Bidwell</surname>
<given-names>S</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Protein interaction data curation: the International Molecular Exchange (IMEx) consortium</article-title>
<source>Nat Methods</source>
<year>2012</year>
<volume>9</volume>
<issue>4</issue>
<fpage>345</fpage>
<lpage>50</lpage>
<pub-id pub-id-type="doi">10.1038/nmeth.1931</pub-id>
<pub-id pub-id-type="pmid">22453911</pub-id>
</element-citation>
</ref>
<ref id="CR89">
<label>89</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chatr-aryamontri</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Ceol</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Palazzi</surname>
<given-names>LM</given-names>
</name>
<name>
<surname>Nardelli</surname>
<given-names>G</given-names>
</name>
<name>
<surname>Schneider</surname>
<given-names>MV</given-names>
</name>
<name>
<surname>Castagnoli</surname>
<given-names>L</given-names>
</name>
<etal></etal>
</person-group>
<article-title>MINT: the Molecular INTeraction database</article-title>
<source>Nucleic Acids Res</source>
<year>2007</year>
<volume>35</volume>
<issue>Database issue</issue>
<fpage>572</fpage>
<lpage>4</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gkl950</pub-id>
<pub-id pub-id-type="pmid">17175531</pub-id>
</element-citation>
</ref>
<ref id="CR90">
<label>90</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xenarios</surname>
<given-names>I</given-names>
</name>
<name>
<surname>Salwinski</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Duan</surname>
<given-names>XJ</given-names>
</name>
<name>
<surname>Higney</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Kim</surname>
<given-names>SM</given-names>
</name>
<name>
<surname>Eisenberg</surname>
<given-names>D</given-names>
</name>
</person-group>
<article-title>DIP, the Database of Interacting Proteins: a research tool for studying cellular networks of protein interactions</article-title>
<source>Nucleic Acids Res</source>
<year>2002</year>
<volume>30</volume>
<issue>1</issue>
<fpage>303</fpage>
<lpage>5</lpage>
<pub-id pub-id-type="doi">10.1093/nar/30.1.303</pub-id>
<pub-id pub-id-type="pmid">11752321</pub-id>
</element-citation>
</ref>
<ref id="CR91">
<label>91</label>
<mixed-citation publication-type="other">Leinonen R, Akhtar R, Birney E, Bower L, Cerdeno-Tárraga A, Cheng Y, et al.The european nucleotide archive. Nucleic Acids Res. 2010:967.</mixed-citation>
</ref>
<ref id="CR92">
<label>92</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Brazma</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Hingamp</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Quackenbush</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Sherlock</surname>
<given-names>G</given-names>
</name>
<name>
<surname>Spellman</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Stoeckert</surname>
<given-names>C</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Minimum information about a microarray experiment (MIAME)-toward standards for microarray data</article-title>
<source>Nat Genet</source>
<year>2001</year>
<volume>29</volume>
<issue>4</issue>
<fpage>365</fpage>
<lpage>71</lpage>
<pub-id pub-id-type="doi">10.1038/ng1201-365</pub-id>
<pub-id pub-id-type="pmid">11726920</pub-id>
</element-citation>
</ref>
<ref id="CR93">
<label>93</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Taylor</surname>
<given-names>CF</given-names>
</name>
<name>
<surname>Field</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Sansone</surname>
<given-names>S-A</given-names>
</name>
<name>
<surname>Aerts</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Apweiler</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Ashburner</surname>
<given-names>M</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Promoting coherent minimum reporting guidelines for biological and biomedical investigations: the mibbi project</article-title>
<source>Nat Biotechnol</source>
<year>2008</year>
<volume>26</volume>
<issue>8</issue>
<fpage>889</fpage>
<lpage>96</lpage>
<pub-id pub-id-type="doi">10.1038/nbt.1411</pub-id>
<pub-id pub-id-type="pmid">18688244</pub-id>
</element-citation>
</ref>
<ref id="CR94">
<label>94</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sweet</surname>
<given-names>JJ</given-names>
</name>
</person-group>
<article-title>Editorial. EQUATOR - reporting guidelines for “Enhancing the QUality and Transparency Of health Research”</article-title>
<source>Clin Neuropsychol</source>
<year>2014</year>
<volume>28</volume>
<issue>4</issue>
<fpage>547</fpage>
<lpage>8</lpage>
<pub-id pub-id-type="doi">10.1080/13854046.2014.934019</pub-id>
<pub-id pub-id-type="pmid">24983313</pub-id>
</element-citation>
</ref>
<ref id="CR95">
<label>95</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Orchard</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Al-Lazikani</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Bryant</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Clark</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Calder</surname>
<given-names>E</given-names>
</name>
<name>
<surname>Dix</surname>
<given-names>I</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Minimum information about a bioactive entity (MIABE)</article-title>
<source>Nat Rev Drug Discov</source>
<year>2011</year>
<volume>10</volume>
<issue>9</issue>
<fpage>661</fpage>
<lpage>9</lpage>
<pub-id pub-id-type="doi">10.1038/nrd3503</pub-id>
<pub-id pub-id-type="pmid">21878981</pub-id>
</element-citation>
</ref>
<ref id="CR96">
<label>96</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Orchard</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Salwinski</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Kerrien</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Montecchi-Palazzi</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Oesterheld</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Stumpflen</surname>
<given-names>V</given-names>
</name>
<etal></etal>
</person-group>
<article-title>The minimum information required for reporting a molecular interaction experiment (MIMIx)</article-title>
<source>Nat Biotechnol</source>
<year>2007</year>
<volume>25</volume>
<issue>8</issue>
<fpage>894</fpage>
<lpage>8</lpage>
<pub-id pub-id-type="doi">10.1038/nbt1324</pub-id>
<pub-id pub-id-type="pmid">17687370</pub-id>
</element-citation>
</ref>
<ref id="CR97">
<label>97</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sansone</surname>
<given-names>S-A</given-names>
</name>
<name>
<surname>Rocca-Serra</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Field</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Maguire</surname>
<given-names>E</given-names>
</name>
<name>
<surname>Taylor</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Hofmann</surname>
<given-names>O</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Toward interoperable bioscience data</article-title>
<source>Nat Genet</source>
<year>2012</year>
<volume>44</volume>
<issue>2</issue>
<fpage>121</fpage>
<lpage>6</lpage>
<pub-id pub-id-type="doi">10.1038/ng.1054</pub-id>
<pub-id pub-id-type="pmid">22281772</pub-id>
</element-citation>
</ref>
<ref id="CR98">
<label>98</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hucka</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Nickerson</surname>
<given-names>DP</given-names>
</name>
<name>
<surname>Bader</surname>
<given-names>GD</given-names>
</name>
<name>
<surname>Bergmann</surname>
<given-names>FT</given-names>
</name>
<name>
<surname>Cooper</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Demir</surname>
<given-names>E</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Promoting Coordinated Development of Community-Based Information Standards for Modeling in Biology: The COMBINE Initiative</article-title>
<source>Front Bioeng Biotechnol</source>
<year>2015</year>
<volume>3</volume>
<fpage>19</fpage>
<pub-id pub-id-type="doi">10.3389/fbioe.2015.00019</pub-id>
<pub-id pub-id-type="pmid">25759811</pub-id>
</element-citation>
</ref>
<ref id="CR99">
<label>99</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Orchard</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Hermjakob</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Apweiler</surname>
<given-names>R</given-names>
</name>
</person-group>
<article-title>The proteomics standards initiative</article-title>
<source>Proteomics</source>
<year>2003</year>
<volume>3</volume>
<issue>7</issue>
<fpage>1374</fpage>
<lpage>1376</lpage>
<pub-id pub-id-type="doi">10.1002/pmic.200300496</pub-id>
<pub-id pub-id-type="pmid">12872238</pub-id>
</element-citation>
</ref>
<ref id="CR100">
<label>100</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Knoppers</surname>
<given-names>BM</given-names>
</name>
</person-group>
<article-title>International ethics harmonization and the global alliance for genomics and health</article-title>
<source>Genome Med</source>
<year>2014</year>
<volume>6</volume>
<issue>2</issue>
<fpage>13</fpage>
<pub-id pub-id-type="doi">10.1186/gm530</pub-id>
<pub-id pub-id-type="pmid">25031613</pub-id>
</element-citation>
</ref>
<ref id="CR101">
<label>101</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nakamura</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Cochrane</surname>
<given-names>G</given-names>
</name>
<name>
<surname>Karsch-Mizrachi</surname>
<given-names>I</given-names>
</name>
</person-group>
<article-title>The International Nucleotide Sequence Database Collaboration</article-title>
<source>Nucleic Acids Res</source>
<year>2013</year>
<volume>41</volume>
<issue>Database issue</issue>
<fpage>21</fpage>
<lpage>4</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gks1084</pub-id>
<pub-id pub-id-type="pmid">23093591</pub-id>
</element-citation>
</ref>
<ref id="CR102">
<label>102</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hermjakob</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Apweiler</surname>
<given-names>R</given-names>
</name>
</person-group>
<article-title>The Proteomics Identifications Database (PRIDE) and the ProteomExchange Consortium: making proteomics data accessible</article-title>
<source>Expert Rev Proteomics</source>
<year>2006</year>
<volume>3</volume>
<issue>1</issue>
<fpage>1</fpage>
<lpage>3</lpage>
<pub-id pub-id-type="doi">10.1586/14789450.3.1.1</pub-id>
<pub-id pub-id-type="pmid">16445344</pub-id>
</element-citation>
</ref>
<ref id="CR103">
<label>103</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Demir</surname>
<given-names>E</given-names>
</name>
<name>
<surname>Cary</surname>
<given-names>MP</given-names>
</name>
<name>
<surname>Paley</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Fukuda</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Lemer</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Vastrik</surname>
<given-names>I</given-names>
</name>
<etal></etal>
</person-group>
<article-title>The BioPAX community standard for pathway data sharing</article-title>
<source>Nat Biotechnol</source>
<year>2010</year>
<volume>28</volume>
<issue>9</issue>
<fpage>935</fpage>
<lpage>42</lpage>
<pub-id pub-id-type="doi">10.1038/nbt.1666</pub-id>
<pub-id pub-id-type="pmid">20829833</pub-id>
</element-citation>
</ref>
<ref id="CR104">
<label>104</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Crosswell</surname>
<given-names>LC</given-names>
</name>
<name>
<surname>Thornton</surname>
<given-names>JM</given-names>
</name>
</person-group>
<article-title>ELIXIR: a distributed infrastructure for European biological data</article-title>
<source>Trends Biotechnol</source>
<year>2012</year>
<volume>30</volume>
<issue>5</issue>
<fpage>241</fpage>
<lpage>2</lpage>
<pub-id pub-id-type="doi">10.1016/j.tibtech.2012.02.002</pub-id>
<pub-id pub-id-type="pmid">22417641</pub-id>
</element-citation>
</ref>
<ref id="CR105">
<label>105</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yuille</surname>
<given-names>M</given-names>
</name>
<name>
<surname>van Ommen</surname>
<given-names>GJ</given-names>
</name>
<name>
<surname>Brechot</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Cambon-Thomsen</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Dagher</surname>
<given-names>G</given-names>
</name>
<name>
<surname>Landegren</surname>
<given-names>U</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Biobanking for Europe</article-title>
<source>Brief. Bioinformatics</source>
<year>2008</year>
<volume>9</volume>
<issue>1</issue>
<fpage>14</fpage>
<lpage>24</lpage>
<pub-id pub-id-type="doi">10.1093/bib/bbm050</pub-id>
<pub-id pub-id-type="pmid">17959611</pub-id>
</element-citation>
</ref>
<ref id="CR106">
<label>106</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Margolis</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Derr</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Dunn</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Huerta</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Larkin</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Sheehan</surname>
<given-names>J</given-names>
</name>
<etal></etal>
</person-group>
<article-title>The National Institutes of Health’s Big Data to Knowledge (BD2K) initiative: capitalizing on biomedical big data</article-title>
<source>J Am Med Inform Assoc</source>
<year>2014</year>
<volume>21</volume>
<issue>6</issue>
<fpage>957</fpage>
<lpage>8</lpage>
<pub-id pub-id-type="doi">10.1136/amiajnl-2014-002974</pub-id>
<pub-id pub-id-type="pmid">25008006</pub-id>
</element-citation>
</ref>
<ref id="CR107">
<label>107</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Klech</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Brooksbank</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Price</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Verpillat</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Buhler</surname>
<given-names>FR</given-names>
</name>
<name>
<surname>Dubois</surname>
<given-names>D</given-names>
</name>
<etal></etal>
</person-group>
<article-title>European initiative towards quality standards in education and training for discovery, development and use of medicines</article-title>
<source>Eur J Pharm Sci</source>
<year>2012</year>
<volume>45</volume>
<issue>5</issue>
<fpage>515</fpage>
<lpage>20</lpage>
<pub-id pub-id-type="doi">10.1016/j.ejps.2011.12.005</pub-id>
<pub-id pub-id-type="pmid">22178534</pub-id>
</element-citation>
</ref>
<ref id="CR108">
<label>108</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Doiron</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Burton</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Marcon</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Gaye</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Wolffenbuttel</surname>
<given-names>BH</given-names>
</name>
<name>
<surname>Perola</surname>
<given-names>M</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Data harmonization and federated analysis of population-based studies: the BioSHaRE project</article-title>
<source>Emerg Themes Epidemiol</source>
<year>2013</year>
<volume>10</volume>
<issue>1</issue>
<fpage>12</fpage>
<pub-id pub-id-type="doi">10.1186/1742-7622-10-12</pub-id>
<pub-id pub-id-type="pmid">24257327</pub-id>
</element-citation>
</ref>
<ref id="CR109">
<label>109</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Basset</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Los</surname>
<given-names>W</given-names>
</name>
</person-group>
<article-title>Biodiversity e-science: Lifewatch, the european infrastructure on biodiversity and ecosystem research</article-title>
<source>Plant Biosystems-An Int J Dealing Aspects Plant Biol</source>
<year>2012</year>
<volume>146</volume>
<issue>4</issue>
<fpage>780</fpage>
<lpage>2</lpage>
<pub-id pub-id-type="doi">10.1080/11263504.2012.740091</pub-id>
</element-citation>
</ref>
<ref id="CR110">
<label>110</label>
<mixed-citation publication-type="other">Krajewski P, Chen D, Ćwiek H, van Dijk AD, Fiorani F, Kersey P, et al.Towards recommendations for metadata and data handling in plant phenotyping. J Exp Bot. 2015:271.</mixed-citation>
</ref>
<ref id="CR111">
<label>111</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pettifer</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Thorne</surname>
<given-names>D</given-names>
</name>
<name>
<surname>McDermott</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Marsh</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Villeger</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Kell</surname>
<given-names>DB</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Visualising biological data: a semantic approach to tool and database integration</article-title>
<source>BMC Bioinformatics</source>
<year>2009</year>
<volume>10</volume>
<issue>Suppl 6</issue>
<fpage>19</fpage>
<pub-id pub-id-type="doi">10.1186/1471-2105-10-S6-S19</pub-id>
<pub-id pub-id-type="pmid">19146673</pub-id>
</element-citation>
</ref>
<ref id="CR112">
<label>112</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gehlenborg</surname>
<given-names>N</given-names>
</name>
<name>
<surname>O’Donoghue</surname>
<given-names>SI</given-names>
</name>
<name>
<surname>Baliga</surname>
<given-names>NS</given-names>
</name>
<name>
<surname>Goesmann</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Hibbs</surname>
<given-names>MA</given-names>
</name>
<name>
<surname>Kitano</surname>
<given-names>H</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Visualization of omics data for systems biology</article-title>
<source>Nat Methods</source>
<year>2010</year>
<volume>7</volume>
<issue>3 Suppl</issue>
<fpage>56</fpage>
<lpage>68</lpage>
<pub-id pub-id-type="doi">10.1038/nmeth.1436</pub-id>
<pub-id pub-id-type="pmid">20010831</pub-id>
</element-citation>
</ref>
<ref id="CR113">
<label>113</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Thorvaldsdottir</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Robinson</surname>
<given-names>JT</given-names>
</name>
<name>
<surname>Mesirov</surname>
<given-names>JP</given-names>
</name>
</person-group>
<article-title>Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration</article-title>
<source>Brief Bioinformatics</source>
<year>2013</year>
<volume>14</volume>
<issue>2</issue>
<fpage>178</fpage>
<lpage>92</lpage>
<pub-id pub-id-type="doi">10.1093/bib/bbs017</pub-id>
<pub-id pub-id-type="pmid">22517427</pub-id>
</element-citation>
</ref>
<ref id="CR114">
<label>114</label>
<mixed-citation publication-type="other">Johnson C, Moorhead R, Munzner T, Pfister H, Rheingans P, Yoo TS. Nih/nsf visualization research challenges report: 2006.
<ext-link ext-link-type="uri" xlink:href="http://nrs.harvard.edu/urn-3:HUL.InstRepos:4138744">http://nrs.harvard.edu/urn-3:HUL.InstRepos:4138744</ext-link>
.</mixed-citation>
</ref>
<ref id="CR115">
<label>115</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kent</surname>
<given-names>WJ</given-names>
</name>
<name>
<surname>Sugnet</surname>
<given-names>CW</given-names>
</name>
<name>
<surname>Furey</surname>
<given-names>TS</given-names>
</name>
<name>
<surname>Roskin</surname>
<given-names>KM</given-names>
</name>
<name>
<surname>Pringle</surname>
<given-names>TH</given-names>
</name>
<name>
<surname>Zahler</surname>
<given-names>AM</given-names>
</name>
<etal></etal>
</person-group>
<article-title>The human genome browser at ucsc</article-title>
<source>Genome Res</source>
<year>2002</year>
<volume>12</volume>
<issue>6</issue>
<fpage>996</fpage>
<lpage>1006</lpage>
<pub-id pub-id-type="doi">10.1101/gr.229102.ArticlepublishedonlinebeforeprintinMay2002</pub-id>
<pub-id pub-id-type="pmid">12045153</pub-id>
</element-citation>
</ref>
<ref id="CR116">
<label>116</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hubbard</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Barker</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Birney</surname>
<given-names>E</given-names>
</name>
<name>
<surname>Cameron</surname>
<given-names>G</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Clark</surname>
<given-names>L</given-names>
</name>
<etal></etal>
</person-group>
<article-title>The ensembl genome database project</article-title>
<source>Nucleic Acids Res</source>
<year>2002</year>
<volume>30</volume>
<issue>1</issue>
<fpage>38</fpage>
<lpage>41</lpage>
<pub-id pub-id-type="doi">10.1093/nar/30.1.38</pub-id>
<pub-id pub-id-type="pmid">11752248</pub-id>
</element-citation>
</ref>
<ref id="CR117">
<label>117</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Engels</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Yu</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Burge</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Mesirov</surname>
<given-names>JP</given-names>
</name>
<name>
<surname>DeCaprio</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Galagan</surname>
<given-names>JE</given-names>
</name>
</person-group>
<article-title>Combo: a whole genome comparative browser</article-title>
<source>Bioinformatics</source>
<year>2006</year>
<volume>22</volume>
<issue>14</issue>
<fpage>1782</fpage>
<lpage>3</lpage>
<pub-id pub-id-type="doi">10.1093/bioinformatics/btl193</pub-id>
<pub-id pub-id-type="pmid">16709588</pub-id>
</element-citation>
</ref>
<ref id="CR118">
<label>118</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Shannon</surname>
<given-names>PT</given-names>
</name>
<name>
<surname>Reiss</surname>
<given-names>DJ</given-names>
</name>
<name>
<surname>Bonneau</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Baliga</surname>
<given-names>NS</given-names>
</name>
</person-group>
<article-title>The Gaggle: an open-source software system for integrating bioinformatics software and data sources</article-title>
<source>BMC Bioinformatics</source>
<year>2006</year>
<volume>7</volume>
<fpage>176</fpage>
<pub-id pub-id-type="doi">10.1186/1471-2105-7-176</pub-id>
<pub-id pub-id-type="pmid">16569235</pub-id>
</element-citation>
</ref>
<ref id="CR119">
<label>119</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Frazer</surname>
<given-names>KA</given-names>
</name>
<name>
<surname>Pachter</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Poliakov</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Rubin</surname>
<given-names>EM</given-names>
</name>
<name>
<surname>Dubchak</surname>
<given-names>I</given-names>
</name>
</person-group>
<article-title>VISTA: computational tools for comparative genomics</article-title>
<source>Nucleic Acids Res</source>
<year>2004</year>
<volume>32</volume>
<issue>Web Server issue</issue>
<fpage>273</fpage>
<lpage>9</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gkh458</pub-id>
</element-citation>
</ref>
<ref id="CR120">
<label>120</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pavlopoulos</surname>
<given-names>GA</given-names>
</name>
<name>
<surname>Wegener</surname>
<given-names>AL</given-names>
</name>
<name>
<surname>Schneider</surname>
<given-names>R</given-names>
</name>
</person-group>
<article-title>A survey of visualization tools for biological network analysis</article-title>
<source>BioData Min</source>
<year>2008</year>
<volume>1</volume>
<fpage>12</fpage>
<pub-id pub-id-type="doi">10.1186/1756-0381-1-12</pub-id>
<pub-id pub-id-type="pmid">19040716</pub-id>
</element-citation>
</ref>
<ref id="CR121">
<label>121</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Andersson</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Archibald</surname>
<given-names>AL</given-names>
</name>
<name>
<surname>Bottema</surname>
<given-names>CD</given-names>
</name>
<name>
<surname>Brauning</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Burgess</surname>
<given-names>SC</given-names>
</name>
<name>
<surname>Burt</surname>
<given-names>DW</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Coordinated international action to accelerate genome-to-phenome with FAANG, the Functional Annotation of Animal Genomes project</article-title>
<source>Genome Biol</source>
<year>2015</year>
<volume>16</volume>
<fpage>57</fpage>
<pub-id pub-id-type="doi">10.1186/s13059-015-0622-4</pub-id>
<pub-id pub-id-type="pmid">25854118</pub-id>
</element-citation>
</ref>
<ref id="CR122">
<label>122</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Howe</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Costanzo</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Fey</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Gojobori</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Hannick</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Hide</surname>
<given-names>W</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Big data: The future of biocuration</article-title>
<source>Nature</source>
<year>2008</year>
<volume>455</volume>
<issue>7209</issue>
<fpage>47</fpage>
<lpage>50</lpage>
<pub-id pub-id-type="doi">10.1038/455047a</pub-id>
<pub-id pub-id-type="pmid">18769432</pub-id>
</element-citation>
</ref>
<ref id="CR123">
<label>123</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Stein</surname>
<given-names>L</given-names>
</name>
</person-group>
<article-title>Genome annotation: from sequence to biology</article-title>
<source>Nat Rev Genet</source>
<year>2001</year>
<volume>2</volume>
<issue>7</issue>
<fpage>493</fpage>
<lpage>503</lpage>
<pub-id pub-id-type="doi">10.1038/35080529</pub-id>
<pub-id pub-id-type="pmid">11433356</pub-id>
</element-citation>
</ref>
<ref id="CR124">
<label>124</label>
<mixed-citation publication-type="other">Phylogeny Programs.
<ext-link ext-link-type="uri" xlink:href="http://evolution.genetics.washington.edu/phylip/software.html">http://evolution.genetics.washington.edu/phylip/software.html</ext-link>
.</mixed-citation>
</ref>
<ref id="CR125">
<label>125</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Haw</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Hermjakob</surname>
<given-names>H</given-names>
</name>
<name>
<surname>D’Eustachio</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Stein</surname>
<given-names>L</given-names>
</name>
</person-group>
<article-title>Reactome pathway analysis to enrich biological discovery in proteomics data sets</article-title>
<source>Proteomics</source>
<year>2011</year>
<volume>11</volume>
<issue>18</issue>
<fpage>3598</fpage>
<lpage>613</lpage>
<pub-id pub-id-type="doi">10.1002/pmic.201100066</pub-id>
<pub-id pub-id-type="pmid">21751369</pub-id>
</element-citation>
</ref>
<ref id="CR126">
<label>126</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tanabe</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Kanehisa</surname>
<given-names>M</given-names>
</name>
</person-group>
<article-title>Using the KEGG database resource</article-title>
<source>Curr Protoc Bioinformatics</source>
<year>2012</year>
<volume>Chapter 1</volume>
<fpage>1</fpage>
<lpage>12</lpage>
</element-citation>
</ref>
<ref id="CR127">
<label>127</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Marian</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Ressom</surname>
<given-names>HW</given-names>
</name>
</person-group>
<article-title>Identification of aberrant pathways and network activities from high-throughput data</article-title>
<source>Brief Bioinformatics</source>
<year>2012</year>
<volume>13</volume>
<issue>4</issue>
<fpage>406</fpage>
<lpage>19</lpage>
<pub-id pub-id-type="doi">10.1093/bib/bbs001</pub-id>
<pub-id pub-id-type="pmid">22287794</pub-id>
</element-citation>
</ref>
<ref id="CR128">
<label>128</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mlecnik</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Scheideler</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Hackl</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Hartler</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Sanchez-Cabo</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Trajanoski</surname>
<given-names>Z</given-names>
</name>
</person-group>
<article-title>PathwayExplorer: web service for visualizing high-throughput expression data on biological pathways</article-title>
<source>Nucleic Acids Res</source>
<year>2005</year>
<volume>33</volume>
<issue>Web Server issue</issue>
<fpage>633</fpage>
<lpage>7</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gki391</pub-id>
</element-citation>
</ref>
<ref id="CR129">
<label>129</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gotz</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Garcia-Gomez</surname>
<given-names>JM</given-names>
</name>
<name>
<surname>Terol</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Williams</surname>
<given-names>TD</given-names>
</name>
<name>
<surname>Nagaraj</surname>
<given-names>SH</given-names>
</name>
<name>
<surname>Nueda</surname>
<given-names>MJ</given-names>
</name>
<etal></etal>
</person-group>
<article-title>High-throughput functional annotation and data mining with the Blast2GO suite</article-title>
<source>Nucleic Acids Res</source>
<year>2008</year>
<volume>36</volume>
<issue>10</issue>
<fpage>3420</fpage>
<lpage>435</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gkn176</pub-id>
<pub-id pub-id-type="pmid">18445632</pub-id>
</element-citation>
</ref>
<ref id="CR130">
<label>130</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Huang</surname>
<given-names>daW</given-names>
</name>
<name>
<surname>Sherman</surname>
<given-names>BT</given-names>
</name>
<name>
<surname>Lempicki</surname>
<given-names>RA</given-names>
</name>
</person-group>
<article-title>Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources</article-title>
<source>Nat Protoc</source>
<year>2009</year>
<volume>4</volume>
<issue>1</issue>
<fpage>44</fpage>
<lpage>57</lpage>
<pub-id pub-id-type="doi">10.1038/nprot.2008.211</pub-id>
<pub-id pub-id-type="pmid">19131956</pub-id>
</element-citation>
</ref>
<ref id="CR131">
<label>131</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Stobbe</surname>
<given-names>MD</given-names>
</name>
<name>
<surname>Jansen</surname>
<given-names>GA</given-names>
</name>
<name>
<surname>Moerland</surname>
<given-names>PD</given-names>
</name>
<name>
<surname>van Kampen</surname>
<given-names>AH</given-names>
</name>
</person-group>
<article-title>Knowledge representation in metabolic pathway databases</article-title>
<source>Brief Bioinformatics</source>
<year>2014</year>
<volume>15</volume>
<issue>3</issue>
<fpage>455</fpage>
<lpage>70</lpage>
<pub-id pub-id-type="doi">10.1093/bib/bbs060</pub-id>
<pub-id pub-id-type="pmid">23202525</pub-id>
</element-citation>
</ref>
<ref id="CR132">
<label>132</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Walter</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Shattuck</surname>
<given-names>DW</given-names>
</name>
<name>
<surname>Baldock</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Bastin</surname>
<given-names>ME</given-names>
</name>
<name>
<surname>Carpenter</surname>
<given-names>AE</given-names>
</name>
<name>
<surname>Duce</surname>
<given-names>S</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Visualization of image data from cells to organisms</article-title>
<source>Nat Methods</source>
<year>2010</year>
<volume>7</volume>
<issue>3 Suppl</issue>
<fpage>26</fpage>
<lpage>41</lpage>
<pub-id pub-id-type="doi">10.1038/nmeth.1431</pub-id>
</element-citation>
</ref>
<ref id="CR133">
<label>133</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Giardine</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Riemer</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Hardison</surname>
<given-names>RC</given-names>
</name>
<name>
<surname>Burhans</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Elnitski</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Shah</surname>
<given-names>P</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Galaxy: a platform for interactive large-scale genome analysis</article-title>
<source>Genome Res</source>
<year>2005</year>
<volume>15</volume>
<issue>10</issue>
<fpage>1451</fpage>
<lpage>5</lpage>
<pub-id pub-id-type="doi">10.1101/gr.4086505</pub-id>
<pub-id pub-id-type="pmid">16169926</pub-id>
</element-citation>
</ref>
<ref id="CR134">
<label>134</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Shannon</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Markiel</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Ozier</surname>
<given-names>O</given-names>
</name>
<name>
<surname>Baliga</surname>
<given-names>NS</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>JT</given-names>
</name>
<name>
<surname>Ramage</surname>
<given-names>D</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Cytoscape: a software environment for integrated models of biomolecular interaction networks</article-title>
<source>Genome Res</source>
<year>2003</year>
<volume>13</volume>
<issue>11</issue>
<fpage>2498</fpage>
<lpage>504</lpage>
<pub-id pub-id-type="doi">10.1101/gr.1239303</pub-id>
<pub-id pub-id-type="pmid">14597658</pub-id>
</element-citation>
</ref>
<ref id="CR135">
<label>135</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Smoot</surname>
<given-names>ME</given-names>
</name>
<name>
<surname>Ono</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Ruscheinski</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>PL</given-names>
</name>
<name>
<surname>Ideker</surname>
<given-names>T</given-names>
</name>
</person-group>
<article-title>Cytoscape 2.8: new features for data integration and network visualization</article-title>
<source>Bioinformatics</source>
<year>2011</year>
<volume>27</volume>
<issue>3</issue>
<fpage>431</fpage>
<lpage>2</lpage>
<pub-id pub-id-type="doi">10.1093/bioinformatics/btq675</pub-id>
<pub-id pub-id-type="pmid">21149340</pub-id>
</element-citation>
</ref>
<ref id="CR136">
<label>136</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kohler</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Baumbach</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Taubert</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Specht</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Skusa</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Ruegg</surname>
<given-names>A</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Graph-based analysis and visualization of experimental results with ONDEX</article-title>
<source>Bioinformatics</source>
<year>2006</year>
<volume>22</volume>
<issue>11</issue>
<fpage>1383</fpage>
<lpage>90</lpage>
<pub-id pub-id-type="doi">10.1093/bioinformatics/btl081</pub-id>
<pub-id pub-id-type="pmid">16533819</pub-id>
</element-citation>
</ref>
<ref id="CR137">
<label>137</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Goff</surname>
<given-names>SA</given-names>
</name>
<name>
<surname>Vaughn</surname>
<given-names>M</given-names>
</name>
<name>
<surname>McKay</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Lyons</surname>
<given-names>E</given-names>
</name>
<name>
<surname>Stapleton</surname>
<given-names>AE</given-names>
</name>
<name>
<surname>Gessler</surname>
<given-names>D</given-names>
</name>
<etal></etal>
</person-group>
<article-title>The iPlant Collaborative: Cyberinfrastructure for Plant Biology</article-title>
<source>Front Plant Sci</source>
<year>2011</year>
<volume>2</volume>
<fpage>34</fpage>
<pub-id pub-id-type="doi">10.3389/fpls.2011.00034</pub-id>
<pub-id pub-id-type="pmid">22645531</pub-id>
</element-citation>
</ref>
<ref id="CR138">
<label>138</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Luo</surname>
<given-names>W</given-names>
</name>
<name>
<surname>Brouwer</surname>
<given-names>C</given-names>
</name>
</person-group>
<article-title>Pathview: an R/Bioconductor package for pathway-based data integration and visualization</article-title>
<source>Bioinformatics</source>
<year>2013</year>
<volume>29</volume>
<issue>14</issue>
<fpage>1830</fpage>
<lpage>1</lpage>
<pub-id pub-id-type="doi">10.1093/bioinformatics/btt285</pub-id>
<pub-id pub-id-type="pmid">23740750</pub-id>
</element-citation>
</ref>
<ref id="CR139">
<label>139</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Attwood</surname>
<given-names>TK</given-names>
</name>
<name>
<surname>Kell</surname>
<given-names>DB</given-names>
</name>
<name>
<surname>McDermott</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Marsh</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Pettifer</surname>
<given-names>SR</given-names>
</name>
<name>
<surname>Thorne</surname>
<given-names>D</given-names>
</name>
</person-group>
<article-title>Utopia documents: linking scholarly literature with research data</article-title>
<source>Bioinformatics</source>
<year>2010</year>
<volume>26</volume>
<issue>18</issue>
<fpage>568</fpage>
<lpage>74</lpage>
<pub-id pub-id-type="doi">10.1093/bioinformatics/btq383</pub-id>
<pub-id pub-id-type="pmid">20007739</pub-id>
</element-citation>
</ref>
<ref id="CR140">
<label>140</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Attwood</surname>
<given-names>TK</given-names>
</name>
<name>
<surname>Kell</surname>
<given-names>DB</given-names>
</name>
<name>
<surname>McDermott</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Marsh</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Pettifer</surname>
<given-names>SR</given-names>
</name>
<name>
<surname>Thorne</surname>
<given-names>D</given-names>
</name>
</person-group>
<article-title>Calling International Rescue: knowledge lost in literature and data landslide!</article-title>
<source>Biochem J</source>
<year>2009</year>
<volume>424</volume>
<issue>3</issue>
<fpage>317</fpage>
<lpage>33</lpage>
<pub-id pub-id-type="doi">10.1042/BJ20091474</pub-id>
<pub-id pub-id-type="pmid">19929850</pub-id>
</element-citation>
</ref>
<ref id="CR141">
<label>141</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gomez</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Garcia</surname>
<given-names>LJ</given-names>
</name>
<name>
<surname>Salazar</surname>
<given-names>GA</given-names>
</name>
<name>
<surname>Villaveces</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Gore</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Garcia</surname>
<given-names>A</given-names>
</name>
<etal></etal>
</person-group>
<article-title>BioJS: an open source JavaScript framework for biological data visualization</article-title>
<source>Bioinformatics</source>
<year>2013</year>
<volume>29</volume>
<issue>8</issue>
<fpage>1103</fpage>
<lpage>4</lpage>
<pub-id pub-id-type="doi">10.1093/bioinformatics/btt100</pub-id>
<pub-id pub-id-type="pmid">23435069</pub-id>
</element-citation>
</ref>
<ref id="CR142">
<label>142</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Treloar</surname>
<given-names>A</given-names>
</name>
</person-group>
<article-title>The research data alliance: Globally co-ordinated action against barriers to data publishing and sharing</article-title>
<source>Learned Publishing</source>
<year>2014</year>
<volume>27</volume>
<issue>5</issue>
<fpage>9</fpage>
<lpage>13</lpage>
<pub-id pub-id-type="doi">10.1087/20140503</pub-id>
</element-citation>
</ref>
</ref-list>
</back>
</pmc>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/CyberinfraV1/Data/Pmc/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000194 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Pmc/Corpus/biblio.hfd -nk 000194 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    CyberinfraV1
   |flux=    Pmc
   |étape=   Corpus
   |type=    RBID
   |clé=     PMC:4557916
   |texte=   Data integration in biological research: an overview
}}

Pour générer des pages wiki

HfdIndexSelect -h $EXPLOR_AREA/Data/Pmc/Corpus/RBID.i   -Sk "pubmed:26336651" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Pmc/Corpus/biblio.hfd   \
       | NlmPubMed2Wicri -a CyberinfraV1 

Wicri

This area was generated with Dilib version V0.6.25.
Data generation: Thu Oct 27 09:30:58 2016. Site generation: Sun Mar 10 23:08:40 2024