Serveur d'exploration Cyberinfrastructure

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Meta4: a web application for sharing and annotating metagenomic gene predictions using web services

Identifieur interne : 000528 ( Pmc/Corpus ); précédent : 000527; suivant : 000529

Meta4: a web application for sharing and annotating metagenomic gene predictions using web services

Auteurs : Emily J. Richardson ; Franck Escalettes ; Ian Fotheringham ; Robert J. Wallace ; Mick Watson

Source :

RBID : PMC:3763215

Abstract

Whole-genome shotgun metagenomics experiments produce DNA sequence data from entire ecosystems, and provide a huge amount of novel information. Gene discovery projects require up-to-date information about sequence homology and domain structure for millions of predicted proteins to be presented in a simple, easy-to-use system. There is a lack of simple, open, flexible tools that allow the rapid sharing of metagenomics datasets with collaborators in a format they can easily interrogate. We present Meta4, a flexible and extensible web application that can be used to share and annotate metagenomic gene predictions. Proteins and predicted domains are stored in a simple relational database, with a dynamic front-end which displays the results in an internet browser. Web services are used to provide up-to-date information about the proteins from homology searches against public databases. Information about Meta4 can be found on the project website1, code is available on Github2, a cloud image is available, and an example implementation can be seen at


Url:
DOI: 10.3389/fgene.2013.00168
PubMed: 24046776
PubMed Central: 3763215

Links to Exploration step

PMC:3763215

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Meta4: a web application for sharing and annotating metagenomic gene predictions using web services</title>
<author>
<name sortKey="Richardson, Emily J" sort="Richardson, Emily J" uniqKey="Richardson E" first="Emily J." last="Richardson">Emily J. Richardson</name>
<affiliation>
<nlm:aff id="aff1">
<institution>ARK-Genomics, The Roslin Institute and R(D)SVS, University of Edinburgh</institution>
<country>Easter Bush, Midlothian, UK</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Escalettes, Franck" sort="Escalettes, Franck" uniqKey="Escalettes F" first="Franck" last="Escalettes">Franck Escalettes</name>
<affiliation>
<nlm:aff id="aff2">
<institution>Ingenza Ltd., Roslin BioCentre</institution>
<country>Midlothian, UK</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Fotheringham, Ian" sort="Fotheringham, Ian" uniqKey="Fotheringham I" first="Ian" last="Fotheringham">Ian Fotheringham</name>
<affiliation>
<nlm:aff id="aff2">
<institution>Ingenza Ltd., Roslin BioCentre</institution>
<country>Midlothian, UK</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Wallace, Robert J" sort="Wallace, Robert J" uniqKey="Wallace R" first="Robert J." last="Wallace">Robert J. Wallace</name>
<affiliation>
<nlm:aff id="aff3">
<institution>Rowett Institute of Nutrition and Health, University of Aberdeen</institution>
<country>Aberdeen, UK</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Watson, Mick" sort="Watson, Mick" uniqKey="Watson M" first="Mick" last="Watson">Mick Watson</name>
<affiliation>
<nlm:aff id="aff1">
<institution>ARK-Genomics, The Roslin Institute and R(D)SVS, University of Edinburgh</institution>
<country>Easter Bush, Midlothian, UK</country>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="aff4">
<institution>Edinburgh Genomics, University of Edinburgh</institution>
<country>Edinburgh, UK</country>
</nlm:aff>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PMC</idno>
<idno type="pmid">24046776</idno>
<idno type="pmc">3763215</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3763215</idno>
<idno type="RBID">PMC:3763215</idno>
<idno type="doi">10.3389/fgene.2013.00168</idno>
<date when="2013">2013</date>
<idno type="wicri:Area/Pmc/Corpus">000528</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a" type="main">Meta4: a web application for sharing and annotating metagenomic gene predictions using web services</title>
<author>
<name sortKey="Richardson, Emily J" sort="Richardson, Emily J" uniqKey="Richardson E" first="Emily J." last="Richardson">Emily J. Richardson</name>
<affiliation>
<nlm:aff id="aff1">
<institution>ARK-Genomics, The Roslin Institute and R(D)SVS, University of Edinburgh</institution>
<country>Easter Bush, Midlothian, UK</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Escalettes, Franck" sort="Escalettes, Franck" uniqKey="Escalettes F" first="Franck" last="Escalettes">Franck Escalettes</name>
<affiliation>
<nlm:aff id="aff2">
<institution>Ingenza Ltd., Roslin BioCentre</institution>
<country>Midlothian, UK</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Fotheringham, Ian" sort="Fotheringham, Ian" uniqKey="Fotheringham I" first="Ian" last="Fotheringham">Ian Fotheringham</name>
<affiliation>
<nlm:aff id="aff2">
<institution>Ingenza Ltd., Roslin BioCentre</institution>
<country>Midlothian, UK</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Wallace, Robert J" sort="Wallace, Robert J" uniqKey="Wallace R" first="Robert J." last="Wallace">Robert J. Wallace</name>
<affiliation>
<nlm:aff id="aff3">
<institution>Rowett Institute of Nutrition and Health, University of Aberdeen</institution>
<country>Aberdeen, UK</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Watson, Mick" sort="Watson, Mick" uniqKey="Watson M" first="Mick" last="Watson">Mick Watson</name>
<affiliation>
<nlm:aff id="aff1">
<institution>ARK-Genomics, The Roslin Institute and R(D)SVS, University of Edinburgh</institution>
<country>Easter Bush, Midlothian, UK</country>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="aff4">
<institution>Edinburgh Genomics, University of Edinburgh</institution>
<country>Edinburgh, UK</country>
</nlm:aff>
</affiliation>
</author>
</analytic>
<series>
<title level="j">Frontiers in Genetics</title>
<idno type="eISSN">1664-8021</idno>
<imprint>
<date when="2013">2013</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">
<p>Whole-genome shotgun metagenomics experiments produce DNA sequence data from entire ecosystems, and provide a huge amount of novel information. Gene discovery projects require up-to-date information about sequence homology and domain structure for millions of predicted proteins to be presented in a simple, easy-to-use system. There is a lack of simple, open, flexible tools that allow the rapid sharing of metagenomics datasets with collaborators in a format they can easily interrogate. We present Meta4, a flexible and extensible web application that can be used to share and annotate metagenomic gene predictions. Proteins and predicted domains are stored in a simple relational database, with a dynamic front-end which displays the results in an internet browser. Web services are used to provide up-to-date information about the proteins from homology searches against public databases. Information about Meta4 can be found on the project website
<sup>
<xref ref-type="fn" rid="fn01">1</xref>
</sup>
, code is available on Github
<sup>
<xref ref-type="fn" rid="fn02">2</xref>
</sup>
, a cloud image is available, and an example implementation can be seen at
<ext-link ext-link-type="uri" xlink:href="http://www.ark-genomics.org/tools/meta4"></ext-link>
</p>
</div>
</front>
<back>
<div1 type="bibliography">
<listBibl>
<biblStruct>
<analytic>
<author>
<name sortKey="Afgan, E" uniqKey="Afgan E">E. Afgan</name>
</author>
<author>
<name sortKey="Chapman, B" uniqKey="Chapman B">B. Chapman</name>
</author>
<author>
<name sortKey="Jadan, M" uniqKey="Jadan M">M. Jadan</name>
</author>
<author>
<name sortKey="Franke, V" uniqKey="Franke V">V. Franke</name>
</author>
<author>
<name sortKey="Taylor, J" uniqKey="Taylor J">J. Taylor</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Boisvert, S" uniqKey="Boisvert S">S. Boisvert</name>
</author>
<author>
<name sortKey="Raymond, F" uniqKey="Raymond F">F. Raymond</name>
</author>
<author>
<name sortKey="Godzaridis, E" uniqKey="Godzaridis E">E. Godzaridis</name>
</author>
<author>
<name sortKey="Laviolette, F" uniqKey="Laviolette F">F. Laviolette</name>
</author>
<author>
<name sortKey="Corbeil, J" uniqKey="Corbeil J">J. Corbeil</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Cowan, D" uniqKey="Cowan D">D. Cowan</name>
</author>
<author>
<name sortKey="Meyer, Q" uniqKey="Meyer Q">Q. Meyer</name>
</author>
<author>
<name sortKey="Stafford, W" uniqKey="Stafford W">W. Stafford</name>
</author>
<author>
<name sortKey="Muyanga, S" uniqKey="Muyanga S">S. Muyanga</name>
</author>
<author>
<name sortKey="Cameron, R" uniqKey="Cameron R">R. Cameron</name>
</author>
<author>
<name sortKey="Wittwer, P" uniqKey="Wittwer P">P. Wittwer</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Cowan, D A" uniqKey="Cowan D">D. A. Cowan</name>
</author>
<author>
<name sortKey="Arslanoglu, A" uniqKey="Arslanoglu A">A. Arslanoglu</name>
</author>
<author>
<name sortKey="Burton, S G" uniqKey="Burton S">S. G. Burton</name>
</author>
<author>
<name sortKey="Baker, G C" uniqKey="Baker G">G. C. Baker</name>
</author>
<author>
<name sortKey="Cameron, R A" uniqKey="Cameron R">R. A. Cameron</name>
</author>
<author>
<name sortKey="Smith, J J" uniqKey="Smith J">J. J. Smith</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Eddy, S R" uniqKey="Eddy S">S. R. Eddy</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Hess, M" uniqKey="Hess M">M. Hess</name>
</author>
<author>
<name sortKey="Sczyrba, A" uniqKey="Sczyrba A">A. Sczyrba</name>
</author>
<author>
<name sortKey="Egan, R" uniqKey="Egan R">R. Egan</name>
</author>
<author>
<name sortKey="Kim, T W" uniqKey="Kim T">T. W. Kim</name>
</author>
<author>
<name sortKey="Chokhawala, H" uniqKey="Chokhawala H">H. Chokhawala</name>
</author>
<author>
<name sortKey="Schroth, G" uniqKey="Schroth G">G. Schroth</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Hoff, K J" uniqKey="Hoff K">K. J. Hoff</name>
</author>
<author>
<name sortKey="Lingner, T" uniqKey="Lingner T">T. Lingner</name>
</author>
<author>
<name sortKey="Meinicke, P" uniqKey="Meinicke P">P. Meinicke</name>
</author>
<author>
<name sortKey="Tech, M" uniqKey="Tech M">M. Tech</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Jain, E" uniqKey="Jain E">E. Jain</name>
</author>
<author>
<name sortKey="Bairoch, A" uniqKey="Bairoch A">A. Bairoch</name>
</author>
<author>
<name sortKey="Duvaud, S" uniqKey="Duvaud S">S. Duvaud</name>
</author>
<author>
<name sortKey="Phan, I" uniqKey="Phan I">I. Phan</name>
</author>
<author>
<name sortKey="Redaschi, N" uniqKey="Redaschi N">N. Redaschi</name>
</author>
<author>
<name sortKey="Suzek, B E" uniqKey="Suzek B">B. E. Suzek</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Kelley, D R" uniqKey="Kelley D">D. R. Kelley</name>
</author>
<author>
<name sortKey="Liu, B" uniqKey="Liu B">B. Liu</name>
</author>
<author>
<name sortKey="Delcher, A L" uniqKey="Delcher A">A. L. Delcher</name>
</author>
<author>
<name sortKey="Pop, M" uniqKey="Pop M">M. Pop</name>
</author>
<author>
<name sortKey="Salzberg, S L" uniqKey="Salzberg S">S. L. Salzberg</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Li, R" uniqKey="Li R">R. Li</name>
</author>
<author>
<name sortKey="Zhu, H" uniqKey="Zhu H">H. Zhu</name>
</author>
<author>
<name sortKey="Ruan, J" uniqKey="Ruan J">J. Ruan</name>
</author>
<author>
<name sortKey="Qian, W" uniqKey="Qian W">W. Qian</name>
</author>
<author>
<name sortKey="Fang, X" uniqKey="Fang X">X. Fang</name>
</author>
<author>
<name sortKey="Shi, Z" uniqKey="Shi Z">Z. Shi</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Mackelprang, R" uniqKey="Mackelprang R">R. Mackelprang</name>
</author>
<author>
<name sortKey="Waldrop, M P" uniqKey="Waldrop M">M. P. Waldrop</name>
</author>
<author>
<name sortKey="Deangelis, K M" uniqKey="Deangelis K">K. M. Deangelis</name>
</author>
<author>
<name sortKey="David, M M" uniqKey="David M">M. M. David</name>
</author>
<author>
<name sortKey="Chavarria, K L" uniqKey="Chavarria K">K. L. Chavarria</name>
</author>
<author>
<name sortKey="Blazewicz, S J" uniqKey="Blazewicz S">S. J. Blazewicz</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Magrane, M" uniqKey="Magrane M">M. Magrane</name>
</author>
<author>
<name sortKey="Consortium, U" uniqKey="Consortium U">U. Consortium</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Markowitz, V M" uniqKey="Markowitz V">V. M. Markowitz</name>
</author>
<author>
<name sortKey="Chen, I M" uniqKey="Chen I">I. M. Chen</name>
</author>
<author>
<name sortKey="Chu, K" uniqKey="Chu K">K. Chu</name>
</author>
<author>
<name sortKey="Szeto, E" uniqKey="Szeto E">E. Szeto</name>
</author>
<author>
<name sortKey="Palaniappan, K" uniqKey="Palaniappan K">K. Palaniappan</name>
</author>
<author>
<name sortKey="Grechkin, Y" uniqKey="Grechkin Y">Y. Grechkin</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Mcwilliam, H" uniqKey="Mcwilliam H">H. McWilliam</name>
</author>
<author>
<name sortKey="Valentin, F" uniqKey="Valentin F">F. Valentin</name>
</author>
<author>
<name sortKey="Goujon, M" uniqKey="Goujon M">M. Goujon</name>
</author>
<author>
<name sortKey="Li, W" uniqKey="Li W">W. Li</name>
</author>
<author>
<name sortKey="Narayanasamy, M" uniqKey="Narayanasamy M">M. Narayanasamy</name>
</author>
<author>
<name sortKey="Martin, J" uniqKey="Martin J">J. Martin</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Meyer, F" uniqKey="Meyer F">F. Meyer</name>
</author>
<author>
<name sortKey="Paarmann, D" uniqKey="Paarmann D">D. Paarmann</name>
</author>
<author>
<name sortKey="D Ouza, M" uniqKey="D Ouza M">M. D’souza</name>
</author>
<author>
<name sortKey="Olson, R" uniqKey="Olson R">R. Olson</name>
</author>
<author>
<name sortKey="Glass, E M" uniqKey="Glass E">E. M. Glass</name>
</author>
<author>
<name sortKey="Kubal, M" uniqKey="Kubal M">M. Kubal</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Mulder, N" uniqKey="Mulder N">N. Mulder</name>
</author>
<author>
<name sortKey="Apweiler, R" uniqKey="Apweiler R">R. Apweiler</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Namiki, T" uniqKey="Namiki T">T. Namiki</name>
</author>
<author>
<name sortKey="Hachiya, T" uniqKey="Hachiya T">T. Hachiya</name>
</author>
<author>
<name sortKey="Tanaka, H" uniqKey="Tanaka H">H. Tanaka</name>
</author>
<author>
<name sortKey="Sakakibara, Y" uniqKey="Sakakibara Y">Y. Sakakibara</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Noguchi, H" uniqKey="Noguchi H">H. Noguchi</name>
</author>
<author>
<name sortKey="Taniguchi, T" uniqKey="Taniguchi T">T. Taniguchi</name>
</author>
<author>
<name sortKey="Itoh, T" uniqKey="Itoh T">T. Itoh</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Pell, J" uniqKey="Pell J">J. Pell</name>
</author>
<author>
<name sortKey="Hintze, A" uniqKey="Hintze A">A. Hintze</name>
</author>
<author>
<name sortKey="Canino Koning, R" uniqKey="Canino Koning R">R. Canino-Koning</name>
</author>
<author>
<name sortKey="Howe, A" uniqKey="Howe A">A. Howe</name>
</author>
<author>
<name sortKey="Tiedje, J M" uniqKey="Tiedje J">J. M. Tiedje</name>
</author>
<author>
<name sortKey="Brown, C T" uniqKey="Brown C">C. T. Brown</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Peng, Y" uniqKey="Peng Y">Y. Peng</name>
</author>
<author>
<name sortKey="Leung, H C" uniqKey="Leung H">H. C. Leung</name>
</author>
<author>
<name sortKey="Yiu, S M" uniqKey="Yiu S">S. M. Yiu</name>
</author>
<author>
<name sortKey="Chin, F Y" uniqKey="Chin F">F. Y. Chin</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Punta, M" uniqKey="Punta M">M. Punta</name>
</author>
<author>
<name sortKey="Coggill, P C" uniqKey="Coggill P">P. C. Coggill</name>
</author>
<author>
<name sortKey="Eberhardt, R Y" uniqKey="Eberhardt R">R. Y. Eberhardt</name>
</author>
<author>
<name sortKey="Mistry, J" uniqKey="Mistry J">J. Mistry</name>
</author>
<author>
<name sortKey="Tate, J" uniqKey="Tate J">J. Tate</name>
</author>
<author>
<name sortKey="Boursnell, C" uniqKey="Boursnell C">C. Boursnell</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Rho, M" uniqKey="Rho M">M. Rho</name>
</author>
<author>
<name sortKey="Tang, H" uniqKey="Tang H">H. Tang</name>
</author>
<author>
<name sortKey="Ye, Y" uniqKey="Ye Y">Y. Ye</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Richardson, E J" uniqKey="Richardson E">E. J. Richardson</name>
</author>
<author>
<name sortKey="Watson, M" uniqKey="Watson M">M. Watson</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Sigrist, C J" uniqKey="Sigrist C">C. J. Sigrist</name>
</author>
<author>
<name sortKey="Cerutti, L" uniqKey="Cerutti L">L. Cerutti</name>
</author>
<author>
<name sortKey="De Castro, E" uniqKey="De Castro E">E. De Castro</name>
</author>
<author>
<name sortKey="Langendijk Genevaux, P S" uniqKey="Langendijk Genevaux P">P. S. Langendijk-Genevaux</name>
</author>
<author>
<name sortKey="Bulliard, V" uniqKey="Bulliard V">V. Bulliard</name>
</author>
<author>
<name sortKey="Bairoch, A" uniqKey="Bairoch A">A. Bairoch</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Sun, S" uniqKey="Sun S">S. Sun</name>
</author>
<author>
<name sortKey="Chen, J" uniqKey="Chen J">J. Chen</name>
</author>
<author>
<name sortKey="Li, W" uniqKey="Li W">W. Li</name>
</author>
<author>
<name sortKey="Altintas, I" uniqKey="Altintas I">I. Altintas</name>
</author>
<author>
<name sortKey="Lin, A" uniqKey="Lin A">A. Lin</name>
</author>
<author>
<name sortKey="Peltier, S" uniqKey="Peltier S">S. Peltier</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Treangen, T J" uniqKey="Treangen T">T. J. Treangen</name>
</author>
<author>
<name sortKey="Koren, S" uniqKey="Koren S">S. Koren</name>
</author>
<author>
<name sortKey="Sommer, D D" uniqKey="Sommer D">D. D. Sommer</name>
</author>
<author>
<name sortKey="Liu, B" uniqKey="Liu B">B. Liu</name>
</author>
<author>
<name sortKey="Astrovskaya, I" uniqKey="Astrovskaya I">I. Astrovskaya</name>
</author>
<author>
<name sortKey="Ondov, B" uniqKey="Ondov B">B. Ondov</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Venter, J C" uniqKey="Venter J">J. C. Venter</name>
</author>
<author>
<name sortKey="Remington, K" uniqKey="Remington K">K. Remington</name>
</author>
<author>
<name sortKey="Heidelberg, J F" uniqKey="Heidelberg J">J. F. Heidelberg</name>
</author>
<author>
<name sortKey="Halpern, A L" uniqKey="Halpern A">A. L. Halpern</name>
</author>
<author>
<name sortKey="Rusch, D" uniqKey="Rusch D">D. Rusch</name>
</author>
<author>
<name sortKey="Eisen, J A" uniqKey="Eisen J">J. A. Eisen</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Yatsunenko, T" uniqKey="Yatsunenko T">T. Yatsunenko</name>
</author>
<author>
<name sortKey="Rey, F E" uniqKey="Rey F">F. E. Rey</name>
</author>
<author>
<name sortKey="Manary, M J" uniqKey="Manary M">M. J. Manary</name>
</author>
<author>
<name sortKey="Trehan, I" uniqKey="Trehan I">I. Trehan</name>
</author>
<author>
<name sortKey="Dominguez Bello, M G" uniqKey="Dominguez Bello M">M. G. Dominguez-Bello</name>
</author>
<author>
<name sortKey="Contreras, M" uniqKey="Contreras M">M. Contreras</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Yok, N G" uniqKey="Yok N">N. G. Yok</name>
</author>
<author>
<name sortKey="Rosen, G L" uniqKey="Rosen G">G. L. Rosen</name>
</author>
</analytic>
</biblStruct>
</listBibl>
</div1>
</back>
</TEI>
<pmc article-type="research-article">
<pmc-dir>properties open_access</pmc-dir>
<front>
<journal-meta>
<journal-id journal-id-type="nlm-ta">Front Genet</journal-id>
<journal-id journal-id-type="iso-abbrev">Front Genet</journal-id>
<journal-id journal-id-type="publisher-id">Front. Genet.</journal-id>
<journal-title-group>
<journal-title>Frontiers in Genetics</journal-title>
</journal-title-group>
<issn pub-type="epub">1664-8021</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="pmid">24046776</article-id>
<article-id pub-id-type="pmc">3763215</article-id>
<article-id pub-id-type="doi">10.3389/fgene.2013.00168</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Genetics</subject>
<subj-group>
<subject>Methods Article</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Meta4: a web application for sharing and annotating metagenomic gene predictions using web services</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Richardson</surname>
<given-names>Emily J.</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Escalettes</surname>
<given-names>Franck</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Fotheringham</surname>
<given-names>Ian</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Wallace</surname>
<given-names>Robert J.</given-names>
</name>
<xref ref-type="aff" rid="aff3">
<sup>3</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Watson</surname>
<given-names>Mick</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff4">
<sup>4</sup>
</xref>
<xref ref-type="author-notes" rid="fn001">
<sup>*</sup>
</xref>
</contrib>
</contrib-group>
<aff id="aff1">
<sup>1</sup>
<institution>ARK-Genomics, The Roslin Institute and R(D)SVS, University of Edinburgh</institution>
<country>Easter Bush, Midlothian, UK</country>
</aff>
<aff id="aff2">
<sup>2</sup>
<institution>Ingenza Ltd., Roslin BioCentre</institution>
<country>Midlothian, UK</country>
</aff>
<aff id="aff3">
<sup>3</sup>
<institution>Rowett Institute of Nutrition and Health, University of Aberdeen</institution>
<country>Aberdeen, UK</country>
</aff>
<aff id="aff4">
<sup>4</sup>
<institution>Edinburgh Genomics, University of Edinburgh</institution>
<country>Edinburgh, UK</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>Edited by:
<italic>John Hancock, University of Cambridge, UK</italic>
</p>
</fn>
<fn fn-type="edited-by">
<p>Reviewed by:
<italic>Ugur Sezerman, Sabanci University, Turkey; Pascale Gaudet, Swiss Institute of Bioinformatics, Switzerland</italic>
</p>
</fn>
<corresp id="fn001">*Correspondence:
<italic>Mick Watson, ARK-Genomics, The Roslin Institute and R(D)SVS, University of Edinburgh, Division of Genetics and Genomics, Easter Bush, Midlothian EH25 9RG, UK e-mail:
<email xlink:type="simple">mick.watson@roslin.ed.ac.uk</email>
</italic>
</corresp>
<fn fn-type="other" id="fn002">
<p>This article was submitted to Bioinformatics and Computational Biology, a section of the journal Frontiers in Genetics.</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>05</day>
<month>9</month>
<year>2013</year>
</pub-date>
<pub-date pub-type="collection">
<year>2013</year>
</pub-date>
<volume>4</volume>
<elocation-id>168</elocation-id>
<history>
<date date-type="received">
<day>01</day>
<month>5</month>
<year>2013</year>
</date>
<date date-type="accepted">
<day>13</day>
<month>8</month>
<year>2013</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright © Richardson, Escalettes, Fotheringham,Wallace andWatson.</copyright-statement>
<copyright-year>2013</copyright-year>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/3.0/">
<license-p> This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</license-p>
</license>
</permissions>
<abstract>
<p>Whole-genome shotgun metagenomics experiments produce DNA sequence data from entire ecosystems, and provide a huge amount of novel information. Gene discovery projects require up-to-date information about sequence homology and domain structure for millions of predicted proteins to be presented in a simple, easy-to-use system. There is a lack of simple, open, flexible tools that allow the rapid sharing of metagenomics datasets with collaborators in a format they can easily interrogate. We present Meta4, a flexible and extensible web application that can be used to share and annotate metagenomic gene predictions. Proteins and predicted domains are stored in a simple relational database, with a dynamic front-end which displays the results in an internet browser. Web services are used to provide up-to-date information about the proteins from homology searches against public databases. Information about Meta4 can be found on the project website
<sup>
<xref ref-type="fn" rid="fn01">1</xref>
</sup>
, code is available on Github
<sup>
<xref ref-type="fn" rid="fn02">2</xref>
</sup>
, a cloud image is available, and an example implementation can be seen at
<ext-link ext-link-type="uri" xlink:href="http://www.ark-genomics.org/tools/meta4"></ext-link>
</p>
</abstract>
<kwd-group>
<kwd>metagenomics</kwd>
<kwd>database</kwd>
<kwd>web service</kwd>
<kwd>gene discovery</kwd>
<kwd>bioinformatics</kwd>
</kwd-group>
<counts>
<fig-count count="2"></fig-count>
<table-count count="0"></table-count>
<equation-count count="0"></equation-count>
<ref-count count="29"></ref-count>
<page-count count="6"></page-count>
<word-count count="0"></word-count>
</counts>
</article-meta>
</front>
<body>
<sec>
<title>INTRODUCTION</title>
<p>Whole-genome shotgun (WGS) metagenomics can be defined as the application of high-throughput sequencing technologies to whole environmental samples, enabling scientists to assay the genomes of all organisms within a particular ecosystem, be it the human gut microbiome (
<xref ref-type="bibr" rid="B28">Yatsunenko et al., 2012</xref>
), permafrost (
<xref ref-type="bibr" rid="B11">Mackelprang et al., 2011</xref>
), or the Sargasso Sea (
<xref ref-type="bibr" rid="B27">Venter et al., 2004</xref>
). One of the aims of such endeavors is to discover novel enzymes that may have be of use to the biotechnology industry (
<xref ref-type="bibr" rid="B3">Cowan et al., 2005</xref>
), and metagenomics has been identified as a major mechanism for increasing the “sequencing space” from which to discover new biocatalysts (
<xref ref-type="bibr" rid="B4">Cowan et al., 2004</xref>
).</p>
<p>Whole-genome shotgun metagenomics experiments routinely produce hundreds of gigabases of sequencing data. A generalized analysis pipeline for such data is to (i) assemble the genomic data
<italic>de novo</italic>
; (ii) predict genes and proteins on the resulting contigs and scaffolds; (iii) assign domains and function to those proteins; (iv) interpret those findings within the biological context. It is not unusual for such studies to generate several million novel genes/proteins –
<xref ref-type="bibr" rid="B27">Venter et al. (2004)</xref>
reported over 1.2 million novel genes, and
<xref ref-type="bibr" rid="B6">Hess et al. (2011)</xref>
reported over 2.5 million putative genes, 27755 containing a domain of interest: those relevant to biomass degradation.</p>
<p>Metagenomic assembly poses specific problems over and above those of single genome assembly. The attempt to simultaneously assemble thousands of different genomes often results in large and complex assembly graphs. These require more memory to create and query, and also often require extra information in order to find true paths through the graphs. Ray Meta (
<xref ref-type="bibr" rid="B2">Boisvert et al., 2012</xref>
) is a massively distributed metagenome assembler that uses message passing, whereas
<xref ref-type="bibr" rid="B19">Pell et al. (2012)</xref>
reduce memory requirements using a bloom filter and use kmer connectivity to improve the assembly process. Other tools attempt to partition the assembly graph – Meta IDBA using graph connectivity (
<xref ref-type="bibr" rid="B20">Peng et al., 2011</xref>
) and MetaVelvet using both coverage and connectivity (
<xref ref-type="bibr" rid="B17">Namiki et al., 2012</xref>
). Finally, MetAMOS (
<xref ref-type="bibr" rid="B26">Treangen et al., 2013</xref>
) is a metagenomics pipeline that combines a number of published tools for metagenomic analysis.</p>
<p>Once the raw metagenomic reads have been assembled into contigs and scaffolds, the next stage is an attempt to predict the location of genes. Here again, metagenomics poses particular problems when compared to single bacterial genome annotation (recently reviewed in
<xref ref-type="bibr" rid="B23">Richardson and Watson, 2013</xref>
). Specifically, traditional bacterial gene predictors use models trained on a single, related genome; as with metagenomics we sequence thousands of genomes simultaneously, this is no longer appropriate. A number of tools have been published for metagenomic gene prediction, including MetaGeneAnnotator (
<xref ref-type="bibr" rid="B18">Noguchi et al., 2008</xref>
), Orphelia (
<xref ref-type="bibr" rid="B7">Hoff et al., 2009</xref>
), FragGeneScan (
<xref ref-type="bibr" rid="B22">Rho et al., 2010</xref>
), and Glimmer-MG (
<xref ref-type="bibr" rid="B9">Kelley et al., 2012</xref>
).
<xref ref-type="bibr" rid="B29">Yok and Rosen (2011)</xref>
propose a combination of tools.</p>
<p>Once genes have been annotated, domains can be assigned to protein-coding genes using traditional approaches, such as HMMER (
<xref ref-type="bibr" rid="B5">Eddy, 2009</xref>
) searches of domain databases such as Pfam (
<xref ref-type="bibr" rid="B21">Punta et al., 2012</xref>
), and the use of tools such as InterProScan (
<xref ref-type="bibr" rid="B16">Mulder and Apweiler, 2007</xref>
).</p>
<p>After raw reads from metagenomics experiments have been assembled and annotated, researchers are left with a very large and rich dataset which can be difficult to query and share. Tools that allow multiple users to browse and query such datasets, either privately within a consortium, or as part of a public collaboration, remain under-developed. It is essential that simple, open, and flexible tools are provided to allow scientists to easily access the outputs of metagenomic gene discovery projects. Here we describe Meta4, a web application that is easy to install, that should work on any standard LAMP (Linux, Apache, MySQL, PHP) server, and which allows users to search and browse large collections of metagenomic gene predictions in a user-friendly web interface. In addition, Meta4 makes use of web services to provide up-to-date annotation.</p>
<p>There are a few existing tools for organizing and analyzing metagenomic data on the web; however, despite being feature-rich, many are closed systems. The integrated microbial genomes and metagenomes (IMG/M) system (
<xref ref-type="bibr" rid="B13">Markowitz et al., 2012</xref>
) allows comprehensive analysis of genomes and metagenomes sequenced at the Joint Genome Institute (JGI). However, the system is not open-source, it is not possible to download the code and create a local installation, the software is only extensible by the authors and it is not easy to integrate your own data – one must e-mail the authors and request integration. Similarly, the Community cyberinfrastructure for Advanced Microbial Ecology Research and Analysis (CAMERA;
<xref ref-type="bibr" rid="B25">Sun et al., 2011</xref>
) is a workflow-based, feature-rich website for metagenomic analysis; however, the same issues remain in that it is not open-source, it is only extensible by the authors, it is not possible to create a local installation, and users must e-mail the authors to request integration of their data. Luckily, the metagenomics RAST server (MG-RAST;
<xref ref-type="bibr" rid="B15">Meyer et al., 2008</xref>
), a very popular and comprehensive tool for metagenomic data analysis, is far more open, with users encouraged to submit their own data, and the code is available on github
<sup>
<xref ref-type="fn" rid="fn03">3</xref>
</sup>
. However, even the authors admit, local installations of the tool are difficult, they advise against it, and no support for such an undertaking is available
<sup>
<xref ref-type="fn" rid="fn04">4</xref>
</sup>
.</p>
<p>All three tools are feature- and function-rich, and aim to be complete systems for the assembly, annotation, and comparison of multiple metagenomic samples. One problem with systems such as IMG/M and CAMERA is an inability for users to maintain data privacy; once data is uploaded to these systems, it is available for the public to see. MG-RAST does have the option to submit to a private queue, but this is a low priority queue. As such, these tools are not designed for the simple task of sharing large amounts of data quickly and simply. Meta4 is not designed to compete with these tools in terms of functionality; rather, it is a simple tool allowing the rapid sharing of metagenomic results that is easily extensible by the addition of web services. It is possible to set up a Meta4 database in less than 30 min on a simple Linux server such as an Amazon EC2 micro instance. Meta4 is a lightweight tool, completely open-source, easy to install locally and easy to add additional functionality through web services.</p>
<p>Meta4 was developed on an Amazon EC2 micro instance using a CloudBioLinux (
<xref ref-type="bibr" rid="B1">Afgan et al., 2012</xref>
) image. All code is available via Github. An example Meta4 database can be queried at
<ext-link ext-link-type="uri" xlink:href="http://www.ark-genomics.org/tools/meta4"></ext-link>
containing an assembly of the
<xref ref-type="bibr" rid="B6">Hess et al. (2011)</xref>
data.</p>
</sec>
<sec sec-type="materials|methods" id="s1">
<title>MATERIALS AND METHODS</title>
<p>The overall structure of Meta4 is shown in
<bold>Figure
<xref ref-type="fig" rid="F1">1</xref>
</bold>
. Central to the system is the Meta4 MySQL database, which stores information on samples, assemblies, gene predictions, and protein domain information. The choice to store some basic annotation in the database itself allows users to query the available gene predictions on domains of interest. Without such annotation, it would be very difficult for users to filter the large numbers of gene predictions in metagenomic datasets. We have chosen to store information on protein domains, rather than the results of homology searches (e.g., BLAST), as often domain searches are more sensitive to distant homology. Information can be loaded into the database from common formats using the database loading scripts, including GFF3 (gene predictions) and fasta (contigs and scaffolds). A web form is provided that allows users to query the database and information is presented in two ways: firstly, data extracted directly from the Meta4 database is presented in the browser; secondly, data extracted from the Meta4 database is provided to a range of web services, and the results of those web services presented in the browser. This allows for the latest, live, up-to-date annotation to be displayed for each gene prediction, and is a key feature of Meta4.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption>
<p>
<bold>The overall structure of Meta4, which shows the relationship between the MySQL database, the data loading scripts, the web interface, external web services, and the users</bold>
.</p>
</caption>
<graphic xlink:href="fgene-04-00168-g001"></graphic>
</fig>
<sec>
<title>INTERFACE AND WEB SERVICES</title>
<p>The dynamic web interface is written in Perl/CGI and should run on any apache web-server with minimal setup. The user is presented with a form including several parameters for search and retrieval of genes/proteins within the database. The results are returned as an HTML table, and consist of two parts – those that return information stored in the database, and those returned from web services.</p>
<p>We have implemented three web services in Meta4. The first uses the EBI’s SOAP wublast interface (
<xref ref-type="bibr" rid="B14">McWilliam et al., 2009</xref>
), querying Uniprot (
<xref ref-type="bibr" rid="B12">Magrane and Consortium, 2011</xref>
) with a protein sequence retrieved from the database. The top 10 results are returned and these represent the most up-to-date homology information for that protein within Uniprot.</p>
<p>The second uses the Uniprot REST web service (
<xref ref-type="bibr" rid="B8">Jain et al., 2009</xref>
). Domains associated with a particular protein are extracted from the database and used as input to search Uniprot. In this way, known proteins with a similar domain structure to that being queried are returned and presented to the user. Users are then able to see the protein name and species of similar proteins, and can click through to the Uniprot entry.</p>
<p>The third uses the EBI’s InterproScan (
<xref ref-type="bibr" rid="B16">Mulder and Apweiler, 2007</xref>
) SOAP interface (
<xref ref-type="bibr" rid="B14">McWilliam et al., 2009</xref>
), querying up to 14 separate protein domain databases with a protein sequence retrieved from the database. The image and text returned also represent the most up-to-date information publicly available for the domains predicted within the query protein.</p>
</sec>
<sec>
<title>DATABASE STRUCTURE</title>
<p>The Meta4 MySQL database models the following specific entities and their relationships:</p>
<list list-type="simple">
<list-item>
<label>(i)</label>
<p>Sample: information about a specific biological sample that has been sequenced. In reality we imagine most researchers will store this information in some other database [e.g., a laboratory information management system (LIMS)], but this table allows metagenomic data to be linked to specific samples.</p>
</list-item>
<list-item>
<label>(ii)</label>
<p>Assembly: information about a
<italic>de novo</italic>
assembly of data from a biological sample. This allows for multiple different assemblies of the same sample. The parameters of the assembly can be stored as tag = value pairs in an assembly_param table.</p>
</list-item>
<list-item>
<label>(iii)</label>
<p>Contig: models the contigs that are output as the result of an assembly. We do not explicitly differentiate between contigs and scaffolds. In this instance, a contig simply describes a single, contiguous sequence obtained from a metagenomic assembly.</p>
</list-item>
<list-item>
<label>(iv)</label>
<p>Gene prediction: information on the genes predicted on any given contig, including the location on the contig, and the DNA and protein sequence.</p>
</list-item>
<list-item>
<label>(v)</label>
<p>Domain database: contains information on the domain database used and allows each gene prediction to have hits to multiple domain databases [e.g., PROSITE (
<xref ref-type="bibr" rid="B24">Sigrist et al., 2010</xref>
) and Pfam (
<xref ref-type="bibr" rid="B21">Punta et al., 2012</xref>
)] or multiple versions of the same domain database.</p>
</list-item>
<list-item>
<label>(vi)</label>
<p>Protein domain: information on the domains within each domain database.</p>
</list-item>
<list-item>
<label>(vii)</label>
<p>Domain match: storage of the link between gene predictions and protein domains, including location of the match, bit score and e-value.</p>
</list-item>
</list>
<p>Crucially, this structure allows multiple assemblies of the same biological sample, as it is common to carry out multiple genome assemblies using different software and parameter sets (which can be flexibly stored in the assembly_param table). Domain matches from multiple databases may also be stored.</p>
</sec>
<sec>
<title>CODE STRUCTURE AND DEVELOPMENT</title>
<p>We have implemented the Meta4 data model in MySQL with an interface written in Perl and Perl CGI. The code has been tested on CloudBioLinux (
<xref ref-type="bibr" rid="B1">Afgan et al., 2012</xref>
) and a local Scientific Linux server, and should work on any standard LAMP server. The github repository contains the following folders:</p>
<list list-type="simple">
<list-item>
<label>(i)</label>
<p>sql: SQL for creating the MySQL database.</p>
</list-item>
<list-item>
<label>(ii)</label>
<p>examples: example files used to create a simple instance of Meta4.</p>
</list-item>
<list-item>
<label>(iii)</label>
<p>scripts: perl scripts to load information and data into a Meta4 database.</p>
</list-item>
<list-item>
<label>(iv)</label>
<p>cgi_scripts: perl CGI scripts that provide an interface to query the data within a Meta4 database.</p>
</list-item>
</list>
<p>A README file is included in the distribution which gives accurate instructions on how to create a Meta4 database that is accessible via a web browser. If the import scripts are run with no parameters, simple instructions are printed to the terminal.</p>
<p>Meta4 is released under an open-source license and we welcome active participation in the project. Whilst Meta4 is suitable for release and publication in its current form, there are many ways in which Meta4 could be developed. For example, currently users must import data using Linux command-line scripts, rather than a graphical user interface (GUI); also, we present scripts to import data from the output of pfam_scan.pl
<sup>
<xref ref-type="fn" rid="fn05">5</xref>
</sup>
, and we welcome contributions that are able to import data from other software formats.</p>
</sec>
</sec>
<sec>
<title>RESULTS</title>
<sec>
<title>EXAMPLE DATASET</title>
<p>We have created an example Meta4 database and the results can be browsed at
<ext-link ext-link-type="uri" xlink:href="http://www.ark-genomics.org/tools/meta4"></ext-link>
. Briefly, we downloaded data from
<xref ref-type="bibr" rid="B6">Hess et al. (2011)</xref>
(SRA accession SRA023560) and assembled the reads using SOAPdenovo (
<xref ref-type="bibr" rid="B10">Li et al., 2010</xref>
). Open-reading frames greater than 200 bp in length were extracted as putative genes. Pfam-A domains were annotated using pfam_scan.pl
<sup>
<xref ref-type="fn" rid="fn05">5</xref>
</sup>
. As the experiment was designed to find novel biomass degrading genes, we encourage users to enter “glyco_hydro” into the “Name” field and click “Submit.”</p>
</sec>
<sec>
<title>BROWSING GENE PREDICTIONS</title>
<p>Meta4 allows users to browse information on particular gene predictions. An example screenshot of such information can be seen in
<bold>Figure
<xref ref-type="fig" rid="F2">2</xref>
</bold>
. Basic information such as the gene name, description, and sequence lengths are extracted from the database. Protein domains annotated within the database are also extracted, and presented as both a table and an image. Furthermore, the actual gene and protein sequences are presented, and formatted correctly. Afterward, live information is presented from the three web services. Firstly, proteins with the same domain structure are extracted from Uniprot, and presented as a table. Secondly, the top 10 BLAST hits against Uniprot/TREMBL are presented. In this way, users are able to see similar proteins in Unprot by domain structure and by sequence homology, and can click through to the relevant entries. Finally, results from the InterProScan web service are presented, both as an image and as text. As InterProScan searches 14 different domain databases, we are able to view more information here than the simple domain information stored in the Meta4 database. A key advantage of Meta4 is that information and annotation about the protein in question is served to the user in real time, and therefore represents the most up-to-date information possible.</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption>
<p>
<bold>Screenshot of the Meta4 results interface, showing information extracted from the Meta4 database, and information from web services (marked as “live” in the table)</bold>
.</p>
</caption>
<graphic xlink:href="fgene-04-00168-g002"></graphic>
</fig>
</sec>
<sec>
<title>WEB INTERFACE</title>
<p>The web interface has been tested on Firefox (Windows, Linux, Android), Safari (Windows, Mac), Opera (Windows, Android), Konqueror (Linux), Chrome (Windows), the Android native browser, and Internet Explorer (Windows). All features work on all browsers, except Internet Explorer 8 (Windows). Our implementation of the EBI’s InterproScan web service produces an in-line image using the data URI (uniform resource identifier) scheme, and we understand Internet Explorer 8 to have a 32 Kb limit for these. This is fixed in Internet Explorer version 9.</p>
</sec>
<sec>
<title>AMAZON EC2 CLOUD IMAGE</title>
<p>An Amazon Machine Image (AMI) is available (EU-WEST: ami-46687f32). The AMI is based on Ubuntu Precise 12.04 (64 Bit) with additional dependencies installed, including Meta4. We have loaded the example data packaged with Meta4, and the system is available from the cgi-bin of the installed Apache2 web-server. Full instructions on how this was set up are available here:
<ext-link ext-link-type="uri" xlink:href="http://www.ark-genomics.org/services-bioinformatics-meta4/creating-meta4-amazon-machine-image-ami"></ext-link>
</p>
</sec>
</sec>
<sec>
<title>DISCUSSION</title>
<p>The role of Meta4 is to allow bioinformaticians to share the results of metagenomic assembly and annotation with collaborators, and to provide those collaborators with a simple web-based interface with which to query and browse the data. It is not intended to compete with tools that aim to assemble, annotate, and functionally or taxonomically compare multiple metagenomic datasets; rather, it is a simple web application that can be used to search and browse large amounts of information quickly, and retrieve genes and proteins that may be of interest for further studies.</p>
<p>The key advantages of Meta4 are:</p>
<list list-type="simple">
<list-item>
<label>(i)</label>
<p>Simplicity: Meta4 is incredibly simple and can be installed in minutes on a standard LAMP server, either using the git repository or by using the Amazon EC2 image. A new Meta4 instance can be created rapidly from standard formats using the scripts provided. In addition, Meta4 is completely open-source.</p>
</list-item>
<list-item>
<label>(ii)</label>
<p>Use of web services: by using web services, Meta4 ensures the latest annotation results are delivered to users. In contrast, other systems store pre-computed results which can rapidly become out-of-date. By using web services, it is easy to extend the functionality of Meta4.</p>
</list-item>
<list-item>
<label>(iii)</label>
<p>Separation of data delivery from data analysis: existing web-based systems combine assembly and annotation with results presentation. By separating the search/browse function from data analysis, Meta4 allows bioinformaticians to use an assembly and annotation pipeline of their choice, and still share their results with collaborators through a user-friendly web interface.</p>
</list-item>
<list-item>
<label>(iv)</label>
<p>Access control: often when one submits data to a public web-server, a commitment is made to make the data publicly available. Meta4 can be set up on a private intranet in minutes, ensuring data privacy; alternatively, cloud Meta4 instances can be limited to specific IP addresses. Thus Meta4 allows both public and private sharing of data.</p>
</list-item>
</list>
<p>Managing the large amounts of data from WGS metagenomics projects is a challenge and there is a need for simple tools that enable scientists to access and query the results. We present Meta4, a simple database for the storage of proteins and their domains predicted from metagenomics experiments. Meta4 is lightweight, easy to install and deploy, and can handle large amounts of data. The system presents information to scientists in a format they understand via a web interface. Meta4 is easily extensible through the addition of web services, and despite not being as feature-rich as some existing systems, benefits from being open-source, lightweight and easy to install and deploy. The use of web services means that the data served to users is as up-to-date as the underlying primary database, which is an advantage over large data warehouses whose data may become out-of-sync with the primary data source. Meta4 is available under an open-source license at
<ext-link ext-link-type="uri" xlink:href="http://www.ark-genomics.org/bioinformatics/meta4"></ext-link>
.</p>
<p>Despite the increasing number of published algorithms for metagenomic assembly and annotation, the complexity of the problem is such that errors are common. Attempts must be made to assess the quality of metagenomic assemblies prior to annotation, especially to ensure inappropriate joins are not made during the contig and scaffold production steps. Metagenomic assemblies are often highly fragmented, and this can affect gene prediction and protein domain annotation. Once specific protein targets have been identified from metagenomic datasets, we recommend a manual annotation step to ensure the gene location (start and end) and protein domain structures are correctly defined.</p>
</sec>
<sec>
<title>Conflict of Interest Statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</body>
<back>
<ack>
<p>This research was supported by the Biotechnology and Biological Sciences Research Council (BBSRC; BB/J004243/1, BB/J004235/1), and by the Technology Strategy Board (TS/J000108/1, TS/J000116/1).</p>
</ack>
<fn-group>
<fn id="fn01">
<label>1</label>
<p>
<ext-link ext-link-type="uri" xlink:href="http://www.ark-genomics.org/bioinformatics/meta4"></ext-link>
</p>
</fn>
<fn id="fn02">
<label>2</label>
<p>
<ext-link ext-link-type="uri" xlink:href="https://github.com/mw55309/meta4"></ext-link>
</p>
</fn>
<fn id="fn03">
<label>3</label>
<p>
<ext-link ext-link-type="uri" xlink:href="https://github.com/MG-RAST/"></ext-link>
</p>
</fn>
<fn id="fn04">
<label>4</label>
<p>
<ext-link ext-link-type="uri" xlink:href="http://blog.metagenomics.anl.gov/mg-rast-v3-2-faq/#local_install"></ext-link>
</p>
</fn>
<fn id="fn05">
<label>5</label>
<p>
<ext-link ext-link-type="ftp" xlink:href="ftp://ftp.sanger.ac.uk/pub/databases/Pfam/Tools/PfamScan.tar.gz"></ext-link>
</p>
</fn>
</fn-group>
<ref-list>
<title>REFERENCES</title>
<ref id="B1">
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Afgan</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Chapman</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Jadan</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Franke</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Taylor</surname>
<given-names>J.</given-names>
</name>
</person-group>
(
<year>2012</year>
).
<article-title>Using cloud computing infrastructure with CloudBioLinux, CloudMan, and Galaxy.</article-title>
<source>
<italic>Curr. Protoc. Bioinformatics</italic>
</source>
<comment>Chapter 11, Unit11.</comment>
<volume>9</volume>
<pub-id pub-id-type="doi">10.1002/0471250953.bi1109s38</pub-id>
</mixed-citation>
</ref>
<ref id="B2">
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Boisvert</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Raymond</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Godzaridis</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Laviolette</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Corbeil</surname>
<given-names>J.</given-names>
</name>
</person-group>
(
<year>2012</year>
).
<article-title>Ray Meta: scalable de novo metagenome assembly and profiling.</article-title>
<source>
<italic>Genome Biol.</italic>
</source>
<volume>13</volume>
<issue>R122</issue>
<pub-id pub-id-type="doi">10.1186/gb-2012-13-12-r122</pub-id>
</mixed-citation>
</ref>
<ref id="B3">
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cowan</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Meyer</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Stafford</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Muyanga</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Cameron</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Wittwer</surname>
<given-names>P.</given-names>
</name>
</person-group>
(
<year>2005</year>
).
<article-title>Metagenomic gene discovery: past, present and future.</article-title>
<source>
<italic>Trends Biotechnol.</italic>
</source>
<volume>23</volume>
<fpage>321</fpage>
<lpage>329</lpage>
<pub-id pub-id-type="doi">10.1016/j.tibtech.2005.04.001</pub-id>
<pub-id pub-id-type="pmid">15922085</pub-id>
</mixed-citation>
</ref>
<ref id="B4">
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cowan</surname>
<given-names>D. A.</given-names>
</name>
<name>
<surname>Arslanoglu</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Burton</surname>
<given-names>S. G.</given-names>
</name>
<name>
<surname>Baker</surname>
<given-names>G. C.</given-names>
</name>
<name>
<surname>Cameron</surname>
<given-names>R. A.</given-names>
</name>
<name>
<surname>Smith</surname>
<given-names>J. J.</given-names>
</name>
<etal></etal>
</person-group>
(
<year>2004</year>
).
<article-title>Metagenomics, gene discovery and the ideal biocatalyst.</article-title>
<source>
<italic>Biochem. Soc. Trans.</italic>
</source>
<volume>32</volume>
<fpage>298</fpage>
<lpage>302</lpage>
<pub-id pub-id-type="doi">10.1042/BST0320298</pub-id>
<pub-id pub-id-type="pmid">15046593</pub-id>
</mixed-citation>
</ref>
<ref id="B5">
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Eddy</surname>
<given-names>S. R.</given-names>
</name>
</person-group>
(
<year>2009</year>
).
<article-title>A new generation of homology search tools based on probabilistic inference.</article-title>
<source>
<italic>Genome Inform.</italic>
</source>
<volume>23</volume>
<fpage>205</fpage>
<lpage>211</lpage>
<pub-id pub-id-type="doi">10.1142/9781848165632_0019</pub-id>
<pub-id pub-id-type="pmid">20180275</pub-id>
</mixed-citation>
</ref>
<ref id="B6">
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hess</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Sczyrba</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Egan</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Kim</surname>
<given-names>T. W.</given-names>
</name>
<name>
<surname>Chokhawala</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Schroth</surname>
<given-names>G.</given-names>
</name>
<etal></etal>
</person-group>
(
<year>2011</year>
).
<article-title>Metagenomic discovery of biomass-degrading genes and genomes from cow rumen.</article-title>
<source>
<italic>Science</italic>
</source>
<volume>331</volume>
<fpage>463</fpage>
<lpage>467</lpage>
<pub-id pub-id-type="doi">10.1126/science.1200387</pub-id>
<pub-id pub-id-type="pmid">21273488</pub-id>
</mixed-citation>
</ref>
<ref id="B7">
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hoff</surname>
<given-names>K. J.</given-names>
</name>
<name>
<surname>Lingner</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Meinicke</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Tech</surname>
<given-names>M.</given-names>
</name>
</person-group>
(
<year>2009</year>
).
<article-title>Orphelia: predicting genes in metagenomic sequencing reads.</article-title>
<source>
<italic>Nucleic Acids Res.</italic>
</source>
<volume>37</volume>
<fpage>W101</fpage>
<lpage>W105</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gkp327</pub-id>
<pub-id pub-id-type="pmid">19429689</pub-id>
</mixed-citation>
</ref>
<ref id="B8">
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jain</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Bairoch</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Duvaud</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Phan</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Redaschi</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Suzek</surname>
<given-names>B. E.</given-names>
</name>
<etal></etal>
</person-group>
(
<year>2009</year>
).
<article-title>Infrastructure for the life sciences: design and implementation of the UniProt website.</article-title>
<source>
<italic>BMC Bioinformatics</italic>
</source>
<volume>10</volume>
:
<issue>136</issue>
<pub-id pub-id-type="doi">10.1186/1471-2105-10-136</pub-id>
</mixed-citation>
</ref>
<ref id="B9">
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kelley</surname>
<given-names>D. R.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Delcher</surname>
<given-names>A. L.</given-names>
</name>
<name>
<surname>Pop</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Salzberg</surname>
<given-names>S. L.</given-names>
</name>
</person-group>
(
<year>2012</year>
).
<article-title>Gene prediction with Glimmer for metagenomic sequences augmented by classification and clustering.</article-title>
<source>
<italic>Nucleic Acids Res.</italic>
</source>
<volume>40</volume>
<issue>e9</issue>
<pub-id pub-id-type="doi">10.1093/nar/gkr1067</pub-id>
</mixed-citation>
</ref>
<ref id="B10">
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Zhu</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Ruan</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Qian</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Fang</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Shi</surname>
<given-names>Z.</given-names>
</name>
<etal></etal>
</person-group>
(
<year>2010</year>
).
<article-title>De novo assembly of human genomes with massively parallel short read sequencing.</article-title>
<source>
<italic>Genome Res.</italic>
</source>
<volume>20</volume>
<fpage>265</fpage>
<lpage>272</lpage>
<pub-id pub-id-type="doi">10.1101/gr.097261.109</pub-id>
<pub-id pub-id-type="pmid">20019144</pub-id>
</mixed-citation>
</ref>
<ref id="B11">
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mackelprang</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Waldrop</surname>
<given-names>M. P.</given-names>
</name>
<name>
<surname>Deangelis</surname>
<given-names>K. M.</given-names>
</name>
<name>
<surname>David</surname>
<given-names>M. M.</given-names>
</name>
<name>
<surname>Chavarria</surname>
<given-names>K. L.</given-names>
</name>
<name>
<surname>Blazewicz</surname>
<given-names>S. J.</given-names>
</name>
<etal></etal>
</person-group>
(
<year>2011</year>
).
<article-title>Metagenomic analysis of a permafrost microbial community reveals a rapid response to thaw.</article-title>
<source>
<italic>Nature</italic>
</source>
<volume>480</volume>
<fpage>368</fpage>
<lpage>371</lpage>
<pub-id pub-id-type="doi">10.1038/nature10576</pub-id>
<pub-id pub-id-type="pmid">22056985</pub-id>
</mixed-citation>
</ref>
<ref id="B12">
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Magrane</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Consortium</surname>
<given-names>U.</given-names>
</name>
</person-group>
(
<year>2011</year>
).
<article-title>UniProt Knowledgebase: a hub of integrated protein data.</article-title>
<source>
<italic>Database (Oxford)</italic>
</source>
<volume>2011</volume>
<issue>bar009</issue>
<pub-id pub-id-type="doi">10.1093/database/bar009</pub-id>
</mixed-citation>
</ref>
<ref id="B13">
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Markowitz</surname>
<given-names>V. M.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>I. M.</given-names>
</name>
<name>
<surname>Chu</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Szeto</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Palaniappan</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Grechkin</surname>
<given-names>Y.</given-names>
</name>
<etal></etal>
</person-group>
(
<year>2012</year>
).
<article-title>IMG/M: the integrated metagenome data management and comparative analysis system.</article-title>
<source>
<italic>Nucleic Acids Res.</italic>
</source>
<volume>40</volume>
<fpage>D123</fpage>
<lpage>D129</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gkr975</pub-id>
<pub-id pub-id-type="pmid">22086953</pub-id>
</mixed-citation>
</ref>
<ref id="B14">
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>McWilliam</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Valentin</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Goujon</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Narayanasamy</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Martin</surname>
<given-names>J.</given-names>
</name>
<etal></etal>
</person-group>
(
<year>2009</year>
).
<article-title>Web services at the European Bioinformatics Institute-2009.</article-title>
<source>
<italic>Nucleic Acids Res.</italic>
</source>
<volume>37</volume>
<fpage>W6</fpage>
<lpage>W10</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gkp302</pub-id>
<pub-id pub-id-type="pmid">19435877</pub-id>
</mixed-citation>
</ref>
<ref id="B15">
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Meyer</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Paarmann</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>D’souza</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Olson</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Glass</surname>
<given-names>E. M.</given-names>
</name>
<name>
<surname>Kubal</surname>
<given-names>M.</given-names>
</name>
<etal></etal>
</person-group>
(
<year>2008</year>
).
<article-title>The metagenomics RAST server – a public resource for the automatic phylogenetic and functional analysis of metagenomes.</article-title>
<source>
<italic>BMC Bioinformatics</italic>
</source>
<volume>9</volume>
:
<issue>386</issue>
<pub-id pub-id-type="doi">10.1186/1471-2105-9-386</pub-id>
</mixed-citation>
</ref>
<ref id="B16">
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mulder</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Apweiler</surname>
<given-names>R.</given-names>
</name>
</person-group>
(
<year>2007</year>
).
<article-title>InterPro and InterProScan: tools for protein sequence classification and comparison.</article-title>
<source>
<italic>Methods Mol. Biol.</italic>
</source>
<volume>396</volume>
<fpage>59</fpage>
<lpage>70</lpage>
<pub-id pub-id-type="doi">10.1007/978-1-59745-515-2_5</pub-id>
<pub-id pub-id-type="pmid">18025686</pub-id>
</mixed-citation>
</ref>
<ref id="B17">
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Namiki</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Hachiya</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Tanaka</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Sakakibara</surname>
<given-names>Y.</given-names>
</name>
</person-group>
(
<year>2012</year>
).
<article-title>MetaVelvet: an extension of Velvet assembler to de novo metagenome assembly from short sequence reads.</article-title>
<source>
<italic>Nucleic Acids Res.</italic>
</source>
<volume>40</volume>
<issue>e155</issue>
<pub-id pub-id-type="doi">10.1093/nar/gks678</pub-id>
</mixed-citation>
</ref>
<ref id="B18">
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Noguchi</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Taniguchi</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Itoh</surname>
<given-names>T.</given-names>
</name>
</person-group>
(
<year>2008</year>
).
<article-title>MetaGeneAnnotator: detecting species-specific patterns of ribosomal binding site for precise gene prediction in anonymous prokaryotic and phage genomes.</article-title>
<source>
<italic>DNA Res.</italic>
</source>
<volume>15</volume>
<fpage>387</fpage>
<lpage>396</lpage>
<pub-id pub-id-type="doi">10.1093/dnares/dsn027</pub-id>
<pub-id pub-id-type="pmid">18940874</pub-id>
</mixed-citation>
</ref>
<ref id="B19">
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pell</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Hintze</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Canino-Koning</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Howe</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Tiedje</surname>
<given-names>J. M.</given-names>
</name>
<name>
<surname>Brown</surname>
<given-names>C. T.</given-names>
</name>
</person-group>
(
<year>2012</year>
).
<article-title>Scaling metagenome sequence assembly with probabilistic de Bruijn graphs.</article-title>
<source>
<italic>Proc. Natl. Acad. Sci. U.S.A.</italic>
</source>
<volume>109</volume>
<fpage>13272</fpage>
<lpage>13277</lpage>
<pub-id pub-id-type="doi">10.1073/pnas.1121464109</pub-id>
<pub-id pub-id-type="pmid">22847406</pub-id>
</mixed-citation>
</ref>
<ref id="B20">
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Peng</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Leung</surname>
<given-names>H. C.</given-names>
</name>
<name>
<surname>Yiu</surname>
<given-names>S. M.</given-names>
</name>
<name>
<surname>Chin</surname>
<given-names>F. Y.</given-names>
</name>
</person-group>
(
<year>2011</year>
).
<article-title>Meta-IDBA: a de Novo assembler for metagenomic data.</article-title>
<source>
<italic>Bioinformatics</italic>
</source>
<volume>27</volume>
<fpage>i94</fpage>
<lpage>i101</lpage>
<pub-id pub-id-type="doi">10.1093/bioinformatics/btr216</pub-id>
<pub-id pub-id-type="pmid">21685107</pub-id>
</mixed-citation>
</ref>
<ref id="B21">
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Punta</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Coggill</surname>
<given-names>P. C.</given-names>
</name>
<name>
<surname>Eberhardt</surname>
<given-names>R. Y.</given-names>
</name>
<name>
<surname>Mistry</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Tate</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Boursnell</surname>
<given-names>C.</given-names>
</name>
<etal></etal>
</person-group>
(
<year>2012</year>
).
<article-title>The Pfam protein families database.</article-title>
<source>
<italic>Nucleic Acids Res.</italic>
</source>
<volume>40</volume>
<fpage>D290</fpage>
<lpage>D301</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gkr1065</pub-id>
<pub-id pub-id-type="pmid">22127870</pub-id>
</mixed-citation>
</ref>
<ref id="B22">
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rho</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Tang</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Ye</surname>
<given-names>Y.</given-names>
</name>
</person-group>
(
<year>2010</year>
).
<article-title>FragGeneScan: predicting genes in short and error-prone reads.</article-title>
<source>
<italic>Nucleic Acids Res.</italic>
</source>
<volume>38</volume>
<issue>e191</issue>
<pub-id pub-id-type="doi">10.1093/nar/gkq747</pub-id>
</mixed-citation>
</ref>
<ref id="B23">
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Richardson</surname>
<given-names>E. J.</given-names>
</name>
<name>
<surname>Watson</surname>
<given-names>M.</given-names>
</name>
</person-group>
(
<year>2013</year>
).
<article-title>The automatic annotation of bacterial genomes.</article-title>
<source>
<italic>Brief Bioinform.</italic>
</source>
<volume>14</volume>
<fpage>1</fpage>
<lpage>12</lpage>
<pub-id pub-id-type="doi">10.1093/bib/bbs007</pub-id>
<pub-id pub-id-type="pmid">22408191</pub-id>
</mixed-citation>
</ref>
<ref id="B24">
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sigrist</surname>
<given-names>C. J.</given-names>
</name>
<name>
<surname>Cerutti</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>De Castro</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Langendijk-Genevaux</surname>
<given-names>P. S.</given-names>
</name>
<name>
<surname>Bulliard</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Bairoch</surname>
<given-names>A.</given-names>
</name>
<etal></etal>
</person-group>
(
<year>2010</year>
).
<article-title>PROSITE, a protein domain database for functional characterization and annotation.</article-title>
<source>
<italic>Nucleic Acids Res.</italic>
</source>
<volume>38</volume>
<fpage>D161</fpage>
<lpage>D166</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gkp885</pub-id>
<pub-id pub-id-type="pmid">19858104</pub-id>
</mixed-citation>
</ref>
<ref id="B25">
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sun</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Altintas</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Lin</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Peltier</surname>
<given-names>S.</given-names>
</name>
<etal></etal>
</person-group>
(
<year>2011</year>
).
<article-title>Community cyberinfrastructure for Advanced Microbial Ecology Research and Analysis: the CAMERA resource.</article-title>
<source>
<italic>Nucleic Acids Res.</italic>
</source>
<volume>39</volume>
<fpage>D546</fpage>
<lpage>D551</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gkq1102</pub-id>
<pub-id pub-id-type="pmid">21045053</pub-id>
</mixed-citation>
</ref>
<ref id="B26">
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Treangen</surname>
<given-names>T. J.</given-names>
</name>
<name>
<surname>Koren</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Sommer</surname>
<given-names>D. D.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Astrovskaya</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Ondov</surname>
<given-names>B.</given-names>
</name>
<etal></etal>
</person-group>
(
<year>2013</year>
).
<article-title>MetAMOS: a modular and open source metagenomic assembly and analysis pipeline.</article-title>
<source>
<italic>Genome Biol.</italic>
</source>
<volume>14</volume>
<issue>R2</issue>
<pub-id pub-id-type="doi">10.1186/gb-2013-14-1-r2</pub-id>
</mixed-citation>
</ref>
<ref id="B27">
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Venter</surname>
<given-names>J. C.</given-names>
</name>
<name>
<surname>Remington</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Heidelberg</surname>
<given-names>J. F.</given-names>
</name>
<name>
<surname>Halpern</surname>
<given-names>A. L.</given-names>
</name>
<name>
<surname>Rusch</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Eisen</surname>
<given-names>J. A.</given-names>
</name>
<etal></etal>
</person-group>
(
<year>2004</year>
).
<article-title>Environmental genome shotgun sequencing of the Sargasso Sea.</article-title>
<source>
<italic>Science</italic>
</source>
<volume>304</volume>
<fpage>66</fpage>
<lpage>74</lpage>
<pub-id pub-id-type="doi">10.1126/science.1093857</pub-id>
<pub-id pub-id-type="pmid">15001713</pub-id>
</mixed-citation>
</ref>
<ref id="B28">
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yatsunenko</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Rey</surname>
<given-names>F. E.</given-names>
</name>
<name>
<surname>Manary</surname>
<given-names>M. J.</given-names>
</name>
<name>
<surname>Trehan</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Dominguez-Bello</surname>
<given-names>M. G.</given-names>
</name>
<name>
<surname>Contreras</surname>
<given-names>M.</given-names>
</name>
<etal></etal>
</person-group>
(
<year>2012</year>
).
<article-title>Human gut microbiome viewed across age and geography.</article-title>
<source>
<italic>Nature</italic>
</source>
<volume>486</volume>
<fpage>222</fpage>
<lpage>227</lpage>
<pub-id pub-id-type="doi">10.1038/nature11053</pub-id>
<pub-id pub-id-type="pmid">22699611</pub-id>
</mixed-citation>
</ref>
<ref id="B29">
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yok</surname>
<given-names>N. G.</given-names>
</name>
<name>
<surname>Rosen</surname>
<given-names>G. L.</given-names>
</name>
</person-group>
(
<year>2011</year>
).
<article-title>Combining gene prediction methods to improve metagenomic gene annotation.</article-title>
<source>
<italic>BMC Bioinformatics</italic>
</source>
<volume>12</volume>
:
<issue>20</issue>
<pub-id pub-id-type="doi">10.1186/1471-2105-12-20</pub-id>
</mixed-citation>
</ref>
</ref-list>
</back>
</pmc>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/CyberinfraV1/Data/Pmc/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000528 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Pmc/Corpus/biblio.hfd -nk 000528 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    CyberinfraV1
   |flux=    Pmc
   |étape=   Corpus
   |type=    RBID
   |clé=     PMC:3763215
   |texte=   Meta4: a web application for sharing and annotating metagenomic gene predictions using web services
}}

Pour générer des pages wiki

HfdIndexSelect -h $EXPLOR_AREA/Data/Pmc/Corpus/RBID.i   -Sk "pubmed:24046776" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Pmc/Corpus/biblio.hfd   \
       | NlmPubMed2Wicri -a CyberinfraV1 

Wicri

This area was generated with Dilib version V0.6.25.
Data generation: Thu Oct 27 09:30:58 2016. Site generation: Sun Mar 10 23:08:40 2024