The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">JCoast – A biologist-centric software tool for data mining and comparison of prokaryotic (meta)genomes</title>
<author>
<name sortKey="Richter, Michael" sort="Richter, Michael" uniqKey="Richter M" first="Michael" last="Richter">Michael Richter</name>
<affiliation>
<nlm:aff id="I1">Microbial Genomics Group, Max Planck Institute for Marine Microbiology, Celsiusstrasse 1, D-28359 Bremen, Germany</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="I2">Jacobs University Bremen gGmbH, D-28759 Bremen, Germany</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Lombardot, Thierry" sort="Lombardot, Thierry" uniqKey="Lombardot T" first="Thierry" last="Lombardot">Thierry Lombardot</name>
<affiliation>
<nlm:aff id="I1">Microbial Genomics Group, Max Planck Institute for Marine Microbiology, Celsiusstrasse 1, D-28359 Bremen, Germany</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Kostadinov, Ivaylo" sort="Kostadinov, Ivaylo" uniqKey="Kostadinov I" first="Ivaylo" last="Kostadinov">Ivaylo Kostadinov</name>
<affiliation>
<nlm:aff id="I1">Microbial Genomics Group, Max Planck Institute for Marine Microbiology, Celsiusstrasse 1, D-28359 Bremen, Germany</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="I2">Jacobs University Bremen gGmbH, D-28759 Bremen, Germany</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Kottmann, Renzo" sort="Kottmann, Renzo" uniqKey="Kottmann R" first="Renzo" last="Kottmann">Renzo Kottmann</name>
<affiliation>
<nlm:aff id="I1">Microbial Genomics Group, Max Planck Institute for Marine Microbiology, Celsiusstrasse 1, D-28359 Bremen, Germany</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="I2">Jacobs University Bremen gGmbH, D-28759 Bremen, Germany</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Duhaime, Melissa Beth" sort="Duhaime, Melissa Beth" uniqKey="Duhaime M" first="Melissa Beth" last="Duhaime">Melissa Beth Duhaime</name>
<affiliation>
<nlm:aff id="I1">Microbial Genomics Group, Max Planck Institute for Marine Microbiology, Celsiusstrasse 1, D-28359 Bremen, Germany</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="I2">Jacobs University Bremen gGmbH, D-28759 Bremen, Germany</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Peplies, Jorg" sort="Peplies, Jorg" uniqKey="Peplies J" first="Jörg" last="Peplies">Jörg Peplies</name>
<affiliation>
<nlm:aff id="I3">Ribocon GmbH D-28359 Bremen, Germany</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Glockner, Frank Oliver" sort="Glockner, Frank Oliver" uniqKey="Glockner F" first="Frank Oliver" last="Glöckner">Frank Oliver Glöckner</name>
<affiliation>
<nlm:aff id="I1">Microbial Genomics Group, Max Planck Institute for Marine Microbiology, Celsiusstrasse 1, D-28359 Bremen, Germany</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="I2">Jacobs University Bremen gGmbH, D-28759 Bremen, Germany</nlm:aff>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PMC</idno>
<idno type="pmid">18380896</idno>
<idno type="pmc">2311307</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2311307</idno>
<idno type="RBID">PMC:2311307</idno>
<idno type="doi">10.1186/1471-2105-9-177</idno>
<date when="2008">2008</date>
<idno type="wicri:Area/Pmc/Corpus">000198</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a" type="main">JCoast – A biologist-centric software tool for data mining and comparison of prokaryotic (meta)genomes</title>
<author>
<name sortKey="Richter, Michael" sort="Richter, Michael" uniqKey="Richter M" first="Michael" last="Richter">Michael Richter</name>
<affiliation>
<nlm:aff id="I1">Microbial Genomics Group, Max Planck Institute for Marine Microbiology, Celsiusstrasse 1, D-28359 Bremen, Germany</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="I2">Jacobs University Bremen gGmbH, D-28759 Bremen, Germany</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Lombardot, Thierry" sort="Lombardot, Thierry" uniqKey="Lombardot T" first="Thierry" last="Lombardot">Thierry Lombardot</name>
<affiliation>
<nlm:aff id="I1">Microbial Genomics Group, Max Planck Institute for Marine Microbiology, Celsiusstrasse 1, D-28359 Bremen, Germany</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Kostadinov, Ivaylo" sort="Kostadinov, Ivaylo" uniqKey="Kostadinov I" first="Ivaylo" last="Kostadinov">Ivaylo Kostadinov</name>
<affiliation>
<nlm:aff id="I1">Microbial Genomics Group, Max Planck Institute for Marine Microbiology, Celsiusstrasse 1, D-28359 Bremen, Germany</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="I2">Jacobs University Bremen gGmbH, D-28759 Bremen, Germany</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Kottmann, Renzo" sort="Kottmann, Renzo" uniqKey="Kottmann R" first="Renzo" last="Kottmann">Renzo Kottmann</name>
<affiliation>
<nlm:aff id="I1">Microbial Genomics Group, Max Planck Institute for Marine Microbiology, Celsiusstrasse 1, D-28359 Bremen, Germany</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="I2">Jacobs University Bremen gGmbH, D-28759 Bremen, Germany</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Duhaime, Melissa Beth" sort="Duhaime, Melissa Beth" uniqKey="Duhaime M" first="Melissa Beth" last="Duhaime">Melissa Beth Duhaime</name>
<affiliation>
<nlm:aff id="I1">Microbial Genomics Group, Max Planck Institute for Marine Microbiology, Celsiusstrasse 1, D-28359 Bremen, Germany</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="I2">Jacobs University Bremen gGmbH, D-28759 Bremen, Germany</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Peplies, Jorg" sort="Peplies, Jorg" uniqKey="Peplies J" first="Jörg" last="Peplies">Jörg Peplies</name>
<affiliation>
<nlm:aff id="I3">Ribocon GmbH D-28359 Bremen, Germany</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Glockner, Frank Oliver" sort="Glockner, Frank Oliver" uniqKey="Glockner F" first="Frank Oliver" last="Glöckner">Frank Oliver Glöckner</name>
<affiliation>
<nlm:aff id="I1">Microbial Genomics Group, Max Planck Institute for Marine Microbiology, Celsiusstrasse 1, D-28359 Bremen, Germany</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="I2">Jacobs University Bremen gGmbH, D-28759 Bremen, Germany</nlm:aff>
</affiliation>
</author>
</analytic>
<series>
<title level="j">BMC Bioinformatics</title>
<idno type="eISSN">1471-2105</idno>
<imprint>
<date when="2008">2008</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">
<sec>
<title>Background</title>
<p>Current sequencing technologies give access to sequence information for genomes and metagenomes at a tremendous speed. Subsequent data processing is mainly performed by automatic pipelines provided by the sequencing centers. Although standardised workflows are desirable and useful in many respects, rational data mining, comparative genomics, and especially the interpretation of the sequence information in its biological context demand intuitive, flexible, and extensible solutions.</p>
</sec>
<sec>
<title>Results</title>
<p>The JCoast software tool was primarily designed to analyse and compare (meta)genome sequences of prokaryotes. Based on a pre-computed GenDB database project, JCoast offers a flexible graphical user interface (GUI), as well as an application programming interface (API) that facilitates back-end data access. JCoast offers individual-genome, cross-genome, and metagenome analysis, and assists the biologist in the exploration of large and complex datasets.</p>
</sec>
<sec>
<title>Conclusion</title>
<p>JCoast combines all functions required for the mining, annotation, and interpretation of (meta)genomic data. The lightweight software solution allows the user to easily take advantage of advanced back-end database structures by providing a programming and graphical user interface to answer biological questions. JCoast is available at the project homepage.</p>
</sec>
</div>
</front>
<back>
<div1 type="bibliography">
<listBibl>
</listBibl>
</div1>
</back>
</TEI>
<pmc article-type="product-review">
<pmc-dir>properties open_access</pmc-dir>
<front>
<journal-meta>
<journal-id journal-id-type="nlm-ta">BMC Bioinformatics</journal-id>
<journal-title>BMC Bioinformatics</journal-title>
<issn pub-type="epub">1471-2105</issn>
<publisher>
<publisher-name>BioMed Central</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="pmid">18380896</article-id>
<article-id pub-id-type="pmc">2311307</article-id>
<article-id pub-id-type="publisher-id">1471-2105-9-177</article-id>
<article-id pub-id-type="doi">10.1186/1471-2105-9-177</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Software</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>JCoast – A biologist-centric software tool for data mining and comparison of prokaryotic (meta)genomes</article-title>
</title-group>
<contrib-group>
<contrib id="A1" contrib-type="author">
<name>
<surname>Richter</surname>
<given-names>Michael</given-names>
</name>
<xref ref-type="aff" rid="I1">1</xref>
<xref ref-type="aff" rid="I2">2</xref>
<email>mrichter@mpi-bremen.de</email>
</contrib>
<contrib id="A2" contrib-type="author">
<name>
<surname>Lombardot</surname>
<given-names>Thierry</given-names>
</name>
<xref ref-type="aff" rid="I1">1</xref>
<email>tlombard@mpi-bremen.de</email>
</contrib>
<contrib id="A3" contrib-type="author">
<name>
<surname>Kostadinov</surname>
<given-names>Ivaylo</given-names>
</name>
<xref ref-type="aff" rid="I1">1</xref>
<xref ref-type="aff" rid="I2">2</xref>
<email>ikostadi@mpi-bremen.de</email>
</contrib>
<contrib id="A4" contrib-type="author">
<name>
<surname>Kottmann</surname>
<given-names>Renzo</given-names>
</name>
<xref ref-type="aff" rid="I1">1</xref>
<xref ref-type="aff" rid="I2">2</xref>
<email>rkottman@mpi-bremen.de</email>
</contrib>
<contrib id="A5" contrib-type="author">
<name>
<surname>Duhaime</surname>
<given-names>Melissa Beth</given-names>
</name>
<xref ref-type="aff" rid="I1">1</xref>
<xref ref-type="aff" rid="I2">2</xref>
<email>mduhaime@mpi-bremen.de</email>
</contrib>
<contrib id="A6" contrib-type="author">
<name>
<surname>Peplies</surname>
<given-names>Jörg</given-names>
</name>
<xref ref-type="aff" rid="I3">3</xref>
<email>jpeplies@ribocon.com</email>
</contrib>
<contrib id="A7" corresp="yes" contrib-type="author">
<name>
<surname>Glöckner</surname>
<given-names>Frank Oliver</given-names>
</name>
<xref ref-type="aff" rid="I1">1</xref>
<xref ref-type="aff" rid="I2">2</xref>
<email>fog@mpi-bremen.de</email>
</contrib>
</contrib-group>
<aff id="I1">
<label>1</label>
Microbial Genomics Group, Max Planck Institute for Marine Microbiology, Celsiusstrasse 1, D-28359 Bremen, Germany</aff>
<aff id="I2">
<label>2</label>
Jacobs University Bremen gGmbH, D-28759 Bremen, Germany</aff>
<aff id="I3">
<label>3</label>
Ribocon GmbH D-28359 Bremen, Germany</aff>
<pub-date pub-type="collection">
<year>2008</year>
</pub-date>
<pub-date pub-type="epub">
<day>1</day>
<month>4</month>
<year>2008</year>
</pub-date>
<volume>9</volume>
<fpage>177</fpage>
<lpage>177</lpage>
<ext-link ext-link-type="uri" xlink:href="http://www.biomedcentral.com/1471-2105/9/177"></ext-link>
<history>
<date date-type="received">
<day>10</day>
<month>1</month>
<year>2008</year>
</date>
<date date-type="accepted">
<day>1</day>
<month>4</month>
<year>2008</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright © 2008 Richter et al; licensee BioMed Central Ltd.</copyright-statement>
<copyright-year>2008</copyright-year>
<copyright-holder>Richter et al; licensee BioMed Central Ltd.</copyright-holder>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/2.0">
<p>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (
<ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/licenses/by/2.0"></ext-link>
), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</p>
</license>
</permissions>
<abstract>
<sec>
<title>Background</title>
<p>Current sequencing technologies give access to sequence information for genomes and metagenomes at a tremendous speed. Subsequent data processing is mainly performed by automatic pipelines provided by the sequencing centers. Although standardised workflows are desirable and useful in many respects, rational data mining, comparative genomics, and especially the interpretation of the sequence information in its biological context demand intuitive, flexible, and extensible solutions.</p>
</sec>
<sec>
<title>Results</title>
<p>The JCoast software tool was primarily designed to analyse and compare (meta)genome sequences of prokaryotes. Based on a pre-computed GenDB database project, JCoast offers a flexible graphical user interface (GUI), as well as an application programming interface (API) that facilitates back-end data access. JCoast offers individual-genome, cross-genome, and metagenome analysis, and assists the biologist in the exploration of large and complex datasets.</p>
</sec>
<sec>
<title>Conclusion</title>
<p>JCoast combines all functions required for the mining, annotation, and interpretation of (meta)genomic data. The lightweight software solution allows the user to easily take advantage of advanced back-end database structures by providing a programming and graphical user interface to answer biological questions. JCoast is available at the project homepage.</p>
</sec>
</abstract>
</article-meta>
</front>
<body>
<sec>
<title>Background</title>
<p>The sequencing of genomes and metagenomes has become a standard technology in molecular biology. Currently, over 700 sequenced genomes of bacterial and archaeal origin are publicly available [
<xref ref-type="bibr" rid="B1">1</xref>
]. Initiatives such as the Community Sequencing Program at the Joint Genome Institute (JGI), the Microbial Genome Sequencing Project funded by the Gordon and Betty Moore Foundation, or collaborations with Genoscope, enable researchers worldwide to get their genome or metagenome of interest easily sequenced. With the acceptance of a sequencing project, initial bioinformatic support is often granted through web-based systems, such as the Integrated Microbial Genomes (IMG and IMG/M) system [
<xref ref-type="bibr" rid="B2">2</xref>
] or Magnifying Genomes [
<xref ref-type="bibr" rid="B3">3</xref>
] to give two examples. To cope with the flood of data generated by community sequencing projects such as the Venter cruises [
<xref ref-type="bibr" rid="B4">4</xref>
,
<xref ref-type="bibr" rid="B5">5</xref>
], the CAMERA (Community Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analysis) consortium was recently established to provide access to both the data and pre-computed information [
<xref ref-type="bibr" rid="B6">6</xref>
].</p>
<p>Standardised steps for data processing in (meta)genome analysis are highly appreciated for their ability to make results comparable and the processing transparent. Nevertheless, after the first round of data mining using web-based annotation systems, biologists typically raise specific requests that call for alternative views of the data. To deal with such demands, full access to the tools and databases is required. This is best handled through the use of "rich clients", which take full advantage of the native facilities of the user's computer. Making use of existing graphics hardware acceleration, rich clients can serve as a graphical front-end to display complex and interactive visualisations [
<xref ref-type="bibr" rid="B7">7</xref>
]. One of the most popular, stable, flexible, and publicly available genome visualisation tools is Artemis [
<xref ref-type="bibr" rid="B8">8</xref>
]. Although it can be extended for computations, it lacks a central storage system and is therefore, in most cases, used only as a viewer for genomic data.</p>
<p>A "state-of-the-art" infrastructure for (meta)genome analysis should be based on a relational database system that stores and organises assembled DNA sequence data, gene predictions, results from automatic analysis, and manual annotations [
<xref ref-type="bibr" rid="B9">9</xref>
]. The analysis should integrate similarity searches against a variety of different data sources based on established algorithms to get a comprehensive overview of the available information for each gene and gene family. Consistent data processing and storage are a prerequisite for flexible data analysis, which is essential when addressing specific requests of the biologists. Project and user management is also necessary to organise data access on all levels (administrator, annotator, guest). In 2003, the GenDB system [
<xref ref-type="bibr" rid="B10">10</xref>
] was released as an open source solution for high quality whole genome annotation. The system relies on a relational database system for back-end storage and takes advantage of Grid technology for massive distributed computing. In addition to project and user management, GenDB offers visualisation, annotation, and search capabilities via a web front-end. GenDB has already been successfully used in many annotation projects [e.g. [
<xref ref-type="bibr" rid="B11">11</xref>
-
<xref ref-type="bibr" rid="B13">13</xref>
]] and adopted by the Network of Excellence "Marine Genomics Europe" [
<xref ref-type="bibr" rid="B14">14</xref>
] as their standard tool for genome analysis.</p>
<p>Although equipped with an advanced backbone for data processing and storage, the available GenDB web-visualisation and analysis capabilities were not able to cover all user-specific requests such as parallel analysis of several genomic sources or a sortable tabular representation of the gene content. Further modules for the calculation of group-specific genes, COG statistics, advanced search functionalities, as well as gene grouping were continuously requested.</p>
<p>In order to address this issue, we developed JCoast, a
<underline>Co</underline>
mparative
<underline>A</underline>
nalysis and
<underline>S</underline>
earch
<underline>T</underline>
ool for prokaryotic genomes. JCoast is a standalone tool that makes use of the standardised genome processing and storage system, GenDB. JCoast offers individual-genome, cross-genome, and metagenome analysis by handling several projects simultaneously. It provides a graphical user interface (GUI) and an application programming interface (API), including a plug-in facility for user extensions. JCoast can also work on local databases following the GenDB schema, independently of a full GenDB installation. It is publicly available and can be easily installed using the Java Web Start technology. The low system requirements (especially when pre-computed databases are used) and the very limited need for maintenance, combined with highly flexible data analysis options, leave biologists free to concentrate on biological questions rather than on solving computational problems.</p>
</sec>
<sec>
<title>Implementation</title>
<p>JCoast is written in the platform-independent, object-oriented programming language, Java [
<xref ref-type="bibr" rid="B15">15</xref>
]. It can be started using the Java Web Start technology, which automatically downloads and installs the software locally. This ensures that the user always has access to the latest available version. Alternatively, it can be downloaded and installed manually.</p>
<p>JCoast offers two entry points to access the genomic data and bioinformatic results:</p>
<p>i. the GUI, which is implemented with the Java-Swing extension, SwingX [
<xref ref-type="bibr" rid="B16">16</xref>
]. The GUI provides all functions necessary to analyse, search, and manipulate genomic information, and includes dedicated modules for tasks such as identifying group-specific genes (GSG) or computing comparative statistics based on profile-HMMs.</p>
<p>ii. the API, which provides object relational mapping to the underlying database and specific classes for advanced searches, data mining and statistics.</p>
<p>The JCoast source code is organised in:</p>
<p>i. the 'api' package, which contains all classes describing the core functionality;</p>
<p>ii. the 'gui' package, which contains all classes describing the graphical user interface and related inherited functionalities;</p>
<p>iii. the 'scripts' package, which contains ready-to-use methods for project maintenance, time-consuming calculations (e.g. calculation of reciprocal best matches), and specific data transfer methods, such as the export of the database in NCBI Sequin format for genome data submissions.</p>
<p>By default, JCoast supports five bioinformatic tools: BLAST [
<xref ref-type="bibr" rid="B17">17</xref>
], Pfam [
<xref ref-type="bibr" rid="B18">18</xref>
], InterPro [
<xref ref-type="bibr" rid="B19">19</xref>
], SignalP [
<xref ref-type="bibr" rid="B20">20</xref>
], and TMHMM [
<xref ref-type="bibr" rid="B21">21</xref>
], and offers direct access to the Geographic-BLAST tool provided by megx.net [
<xref ref-type="bibr" rid="B22">22</xref>
].</p>
<p>The current JCoast implementation relies on MySQL 5.0 [
<xref ref-type="bibr" rid="B23">23</xref>
] and a GenDB 2.2 compatible database schema. A detailed description of the GenDB system can be found in Meyer
<italic>et al.</italic>
, 2003 [
<xref ref-type="bibr" rid="B10">10</xref>
].</p>
<p>The following extensions to the GenDB database schema are necessary to support all features offered by JCoast:</p>
<sec>
<title>Gene groups</title>
<p>JCoast allows a set of genes to be assigned to "gene groups". This allows visualisation and analysis of the (meta)genomic information across projects. In order to store gene groups in JCoast, the GenDB database model needs to be extended by two tables,
<italic>Gene_Group </italic>
and
<italic>Gene_Group_Region</italic>
.
<italic>Gene_Group </italic>
contains the description of the group, whereas
<italic>Gene_Group_Region </italic>
contains the corresponding genes belonging to a defined
<italic>Gene_Group</italic>
.</p>
</sec>
<sec>
<title>Codon offset</title>
<p>Partial protein coding genes that result from missing start codons are common in draft- and metagenomic datasets. To handle this, the GenDB database needs to be extended by the table
<italic>Region_CDS_Codon_Offset</italic>
. This ensures that the translation will always start on the first complete codon.</p>
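<p>As a rough illustration of the codon-offset idea (not JCoast's actual code), the sketch below skips the leading bases of a partial CDS so that codon splitting begins at the first complete codon. The class name CodonOffsetSketch, the method completeCodons, and the convention that the offset is the number of bases (0, 1 or 2) to skip are assumptions made for this example; how the Region_CDS_Codon_Offset table actually encodes the offset is not described here.</p>
<preformat><![CDATA[
```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical helper for illustration; not part of the JCoast API. */
final class CodonOffsetSketch {

    private CodonOffsetSketch() {}

    /**
     * Splits a partial CDS into complete codons.
     * Assumption: 'offset' is the number of leading bases (0, 1 or 2)
     * that belong to a truncated first codon and must be skipped.
     */
    static List<String> completeCodons(String cds, int offset) {
        if (offset < 0 || offset > 2) {
            throw new IllegalArgumentException("offset must be 0, 1 or 2");
        }
        List<String> codons = new ArrayList<>();
        // Start at the first complete codon; a trailing partial codon is dropped.
        for (int i = offset; i + 3 <= cds.length(); i += 3) {
            codons.add(cds.substring(i, i + 3));
        }
        return codons;
    }
}
```
]]></preformat>
<p>For example, calling completeCodons("GGATGAAATAA", 2) skips the two leading bases and yields the codons ATG, AAA and TAA, which could then be passed to any translation table.</p>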
</sec>
</sec>
<sec>
<title>Results</title>
<sec>
<title>The Graphical User Interface (GUI)</title>
<p>The main window of JCoast is composed of six modules (Fig
<xref ref-type="fig" rid="F1">1</xref>
). The 'Browser' module is the central component of JCoast, which is re-used by most other modules to display results. It is separated into three panels, the 'Genome Browser' on top, the 'Table Browser' in the middle, and the 'Observation Browser' at the bottom.</p>
<fig position="float" id="F1">
<label>Figure 1</label>
<caption>
<p>
<bold>JCoast overview</bold>
. The JCoast main window is separated into three panels, the 'Genome Browser' on top, the 'Table Browser' in the middle, and the 'Observation Browser' at the bottom. The 'Genome Browser' provides a graphical representation of genes on the genomic or metagenomic contigs under investigation. The 'Table Browser' displays different types of regions (CDS, contig, tRNA and rRNA) belonging to a project. A button panel implements rapid switching between regions. The 'Observation Browser' at the bottom displays the different similarity search results for a CDS.</p>
</caption>
<graphic xlink:href="1471-2105-9-177-1"></graphic>
</fig>
<p>The 'Genome Browser' provides a graphical representation of genes on the genomic or metagenomic contigs under investigation. It is directly linked to the 'Table Browser' where the corresponding region is shown. The 'Table Browser' displays different types of regions (coding sequence (CDS), contig, tRNA and rRNA) belonging to a project, where a project is defined as all contigs belonging to, for example, a single organism or a metagenomic sample. A button panel implements rapid switching between regions. The panel also offers the ability to extract sequence information for a single gene, region, or for a whole project, as either amino acids or nucleotides. For each CDS entry, an 'Annotation dialog' can be opened. This dialog allows annotation of the gene products, the EC numbers, gene names, and additional comments for each entry. All annotations are immediately stored and subsequently available to other annotators. All user changes are tracked with the history function of GenDB. Additionally, regions can be deleted via the panel. The 'Observation Browser' at the bottom displays the different similarity search results for a CDS. Depending on the selected tools, additional functions are available for each entry. For example, every similarity search result is linked to its original entry in public repositories (e.g. GenBank), if available. The public entry is shown in the standard browser of the host system.</p>
<p>For the similarity search results against genomesDB (see below for more details), preconfigured charts are available for visualisation. For example, the taxonomic distribution of BLAST hits can be shown in a pie chart, grouped by phylum, class, order, family, or species (Fig
<xref ref-type="fig" rid="F2">2</xref>
).</p>
<fig position="float" id="F2">
<label>Figure 2</label>
<caption>
<p>
<bold>Taxonomy distribution chart</bold>
. JCoast uses the database genomesDB for the calculation of GSGs and for drawing taxonomy distribution charts. For each CDS such a chart can be calculated on the fly, based on different taxonomic levels, e.g. phylum, class, order, family or species. In addition, contextual information can also be used for this calculation.</p>
</caption>
<graphic xlink:href="1471-2105-9-177-2"></graphic>
</fig>
<p>All tables have a common set of functions to enhance the usability:</p>
<p>i. an alphanumeric sorter for each column of the table.</p>
<p>ii. the column control button in the upper left corner of each table; this button enables the user to hide and unhide a column within the table.</p>
<p>iii. a panel for text search in order to search within the visible content of the tables.</p>
<p>The module 'Statistics' includes three kinds of "on-the-fly" calculations based on:</p>
<p>i. Cluster of Orthologous Groups of Genes (COG) [
<xref ref-type="bibr" rid="B24">24</xref>
], which counts the absolute number of genes belonging to one of the COG categories.</p>
<p>ii. Pfam [
<xref ref-type="bibr" rid="B18">18</xref>
], which counts the absolute occurrence of a defined Pfam model in a project.</p>
<p>iii. project content, which calculates statistics about the number of CDS, contigs, tRNAs, rRNAs, nucleotide usage and coding percentage.</p>
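<p>The "on-the-fly" statistics listed above amount to simple counting. The following minimal sketch shows one way such counts could be computed in Java; the class ProjectStatsSketch and its methods are hypothetical and are not taken from the JCoast source code. It assumes that each gene has already been assigned a single COG category letter and that the number of coding bases has been summed beforehand without double-counting overlapping genes.</p>
<preformat><![CDATA[
```java
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

/** Hypothetical helper for illustration; not part of the JCoast API. */
final class ProjectStatsSketch {

    private ProjectStatsSketch() {}

    /**
     * Counts the absolute number of genes per COG category.
     * Assumption: the input holds one category letter per gene.
     */
    static Map<String, Integer> countByCategory(List<String> categoryPerGene) {
        Map<String, Integer> counts = new TreeMap<>(); // sorted by category letter
        for (String category : categoryPerGene) {
            counts.merge(category, 1, Integer::sum);
        }
        return counts;
    }

    /**
     * Coding percentage of a project.
     * Assumption: codingBases is already free of double-counted overlaps.
     */
    static double codingPercentage(long codingBases, long totalBases) {
        if (totalBases <= 0) {
            throw new IllegalArgumentException("totalBases must be positive");
        }
        return 100.0 * codingBases / totalBases;
    }
}
```
]]></preformat>
<p>The same counting pattern would apply to Pfam model occurrences, with Pfam accessions in place of COG category letters.</p>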
<p>The 'Text Search' module has been designed for string and regular expression searches in annotations, comments, gene names, and EC numbers (Annotation Search), as well as within the complex set of similarity search results (Observation Search); the latter search is filtered by applying an E-value cutoff. The results view makes use of the main browser panel and displays matching regions as subsets.</p>
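<p>To make the regular expression search concrete, the sketch below filters gene annotations with java.util.regex. It is only an illustration under stated assumptions: the class AnnotationSearchSketch, the method matchingGenes, the representation of annotations as a map from gene identifier to annotation text, and the case-insensitive matching are all invented for this example and do not reflect JCoast's internal implementation.</p>
<preformat><![CDATA[
```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.regex.Pattern;

/** Hypothetical helper for illustration; not part of the JCoast API. */
final class AnnotationSearchSketch {

    private AnnotationSearchSketch() {}

    /**
     * Returns the identifiers of all genes whose annotation text contains
     * a match for the given regular expression (case-insensitive).
     * Results follow the iteration order of the supplied map.
     */
    static List<String> matchingGenes(Map<String, String> annotationByGene, String regex) {
        Pattern pattern = Pattern.compile(regex, Pattern.CASE_INSENSITIVE);
        List<String> matches = new ArrayList<>();
        for (Map.Entry<String, String> entry : annotationByGene.entrySet()) {
            // find() accepts a match anywhere in the text, not only a full match.
            if (pattern.matcher(entry.getValue()).find()) {
                matches.add(entry.getKey());
            }
        }
        return matches;
    }
}
```
]]></preformat>
<p>A plain string such as "sulfatase" works as a pattern too, so the same method covers both the string and the regular expression case.</p>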
<p>The 'Pfam Search' module allows searching within Pfam models by applying an E-value cutoff. This allows, for instance, consistent cross genome comparisons. The results can be displayed as a subset of regions or as a graphical output of all Pfam models of each CDS (Fig
<xref ref-type="fig" rid="F3">3</xref>
).</p>
<fig position="float" id="F3">
<label>Figure 3</label>
<caption>
<p>
<bold>Pfam model search</bold>
. JCoast supports extensive Pfam model search functionality, including graphical domain structure display.</p>
</caption>
<graphic xlink:href="1471-2105-9-177-3"></graphic>
</fig>
<p>The 'Group-Specific Genes' (GSG) module was designed to enable the researcher to search for genes with a limited occurrence in a given taxonomic group or any group defined by the user. This module is based on the custom database, genomesDB (see below). Each CDS of a reference genome is tested for group-specificity by examining the observations produced by protein-level similarity searches against genomesDB. By definition, a GSG shows significant similarity only to genes in the same taxonomic unit. Therefore, an E-value cutoff was implemented to evaluate significance. To be group-specific, a CDS must have at least one "in-taxon" observation and no "out-of-taxon" observations below the defined threshold. Self-hits are filtered out; this behaviour can be adjusted in the preferences.</p>
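<p>The group-specificity rule described above can be sketched in a few lines of Java. This is a simplified reading of the rule, not the JCoast implementation: the class GroupSpecificGeneSketch, the nested Hit type and all parameter names are hypothetical. The sketch further assumes that an observation counts as significant when its E-value is at or below the cutoff, and that a self-hit is a hit against the query's own genome.</p>
<preformat><![CDATA[
```java
import java.util.List;

/** Hypothetical helper for illustration; not part of the JCoast API. */
final class GroupSpecificGeneSketch {

    private GroupSpecificGeneSketch() {}

    /** One simplified similarity-search observation for a CDS. */
    static final class Hit {
        final String subjectGenome;
        final String subjectTaxon;
        final double eValue;

        Hit(String subjectGenome, String subjectTaxon, double eValue) {
            this.subjectGenome = subjectGenome;
            this.subjectTaxon = subjectTaxon;
            this.eValue = eValue;
        }
    }

    /**
     * A CDS is group-specific if it has at least one significant in-taxon
     * hit and no significant out-of-taxon hit.
     */
    static boolean isGroupSpecific(String queryGenome, String taxon, List<Hit> hits,
                                   double eValueCutoff, boolean filterSelfHits) {
        boolean hasInTaxonHit = false;
        for (Hit hit : hits) {
            if (hit.eValue > eValueCutoff) {
                continue; // not significant (assumed rule: significant means eValue <= cutoff)
            }
            if (filterSelfHits && hit.subjectGenome.equals(queryGenome)) {
                continue; // assumed definition of a self-hit
            }
            if (hit.subjectTaxon.equals(taxon)) {
                hasInTaxonHit = true;
            } else {
                return false; // a significant out-of-taxon hit rules the CDS out
            }
        }
        return hasInTaxonHit;
    }
}
```
]]></preformat>
<p>With self-hit filtering switched on, a CDS whose only significant match lies in its own genome is not reported as group-specific, because no other member of the taxon supports it.</p>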
</sec>
<sec>
<title>The 'genomesDB' database</title>
<p>GenomesDB is a custom-designed relational database, which includes a Java interface for maintenance. It is built from the proteome FASTA files obtained from the NCBI Reference Sequence database (RefSeq) for all fully sequenced bacterial and archaeal genomes (621 genomes, Jan 2008). Each genome, chromosome, and protein in the database is tagged with a unique internal numerical identifier. In addition, taxonomic and contextual information is parsed from the NCBI Entrez Genome Project database. For every entry, taxonomic information is collected for the corresponding kingdom, phylum, class, order, family, genus and species. Further contextual data available pertain to genome size, guanine-cytosine content, Gram staining, shape, arrangement, endospore formation, motility, salinity, oxygen, habitat and temperature range. Genomes of interest can be selected for export via the interface, including the protein sequences as a multiple-sequence FASTA file.</p>
<p>In contrast to the general-purpose database NCBI-nr, the focus of genomesDB is to provide manually curated phylogenetic affiliations, plus as much additional contextual information as possible. The database is used by JCoast to determine GSGs and to calculate distribution charts of selected properties. The current version of genomesDB can be downloaded from the JCoast homepage.</p>
</sec>
<sec>
<title>Web-Service: Geographic-BLAST</title>
<p>To allow researchers to systematically study the geographic distribution of particular genes in the environment, a click on the 'Geographic BLAST' button in JCoast starts a remote BLAST search of the database for marine ecological genomix (megx.net) [
<xref ref-type="bibr" rid="B22">22</xref>
] for a selected gene. The results are shown on the Genomes Mapserver, which integrates sequence data with contextual information, such as physical, chemical, and biological data based on geography. In addition to the geographical distribution of particular genes, statistics are provided pertaining to the presence/absence and abundance of the gene of interest with respect to sampling sites and environmental conditions.</p>
</sec>
<sec>
<title>The Application Programming Interface (API)</title>
<p>The JCoast core API comprises strictly defined objects for all the important GenDB database tables. All sequence-related features, such as contigs or CDS definitions, annotations, and bioinformatic tool results, are encoded within these Java objects. The core API also contains the complete SQL code required to communicate with the database. The building of all required Java objects has been merged into a single class. This encapsulation renders JCoast flexible, making it possible to work on data sources other than GenDB. The JCoast core API is used extensively by the graphical user interface and the 'scripts' package, which is included in the JCoast source code. The classes within this package deliver an easy-to-use environment, mainly designed for users with little Java programming experience who want to use JCoast to address complex biological questions. The 'scripts' package includes template classes, which manage the database communication and user identification issues for the user. It also includes ready-to-use classes for maintaining projects, exporting data, and performing statistical calculations. Many of these classes are able to store the results directly in the database as 'GeneGroups' for subsequent evaluation using the JCoast GUI.</p>
</sec>
<sec>
<title>Case studies</title>
<p>JCoast has already been extensively used and evaluated in recently published comparative genomic and metagenomic projects.</p>
<p>In the field of genomics, JCoast was used to analyse the finished genome of the marine Bacteroidetes
<italic>Gramella forsetii </italic>
[
<xref ref-type="bibr" rid="B25">25</xref>
] in the context of 15 other Bacteroidetes draft sequences provided by the Moore Foundation. Extensive analysis was successfully performed on the draft genome of
<italic>Magnetospirillum gryphiswaldense </italic>
strain MSR-1 for comparison with three other draft genomes of magnetotactic bacteria [
<xref ref-type="bibr" rid="B26">26</xref>
]. Aside from standard annotation support in both projects, detailed statistical analysis of the presence/absence and abundance of specific Pfam profiles in the genomes was performed to identify specialisation and niche adaptation of the organisms. Cross-comparisons of all genes within the phenotypic group of magnetotactic bacteria revealed a set of group-specific genes that are now the subject of targeted lab experiments. The implementation of the group-specific genes module, in combination with the genomesDB database, proved extremely helpful by significantly accelerating the transition from
<italic>in silico </italic>
predictions to lab work. Furthermore, the system was used to assist in the annotation and ongoing comparative analysis of
<italic>Congregibacter litoralis </italic>
KT 71 [
<xref ref-type="bibr" rid="B27">27</xref>
].</p>
<p>In the field of metagenomics, a prototype of the software was used to analyse and compare 30 Mb of DNA on 511 scaffolds (comprising 21,077 ORFs) from the symbiotic community of the marine oligochaete,
<italic>Olavius </italic>
sp., sequenced by a community shot-gun approach [
<xref ref-type="bibr" rid="B28">28</xref>
], as well as 9 Mb of DNA on 7,860 scaffolds from single filaments of
<italic>Beggiatoa </italic>
sp. determined by combined genome amplification and pyro- and Sanger sequencing [
<xref ref-type="bibr" rid="B29">29</xref>
]. Both projects had a challenging and heterogeneous set of short to medium-sized DNA scaffolds and contigs that needed to be analysed. The marginal quality of the data was problematic and led to the development of the codon offset table to cope with partial genes. Moreover, JCoast is currently used to analyse fosmid-sized clone libraries from different marine sampling sites.</p>
<p>In all projects, the system helped the biologists to generate results faster by providing custom-tailored solutions. In turn, close contact with the users provided valuable feedback for software enhancements.</p>
</sec>
</sec>
<sec>
<title>Discussion</title>
<p>Many data mining, annotation, and visualisation systems have been developed in recent years, each with its own advantages and disadvantages; for a review, see [
<xref ref-type="bibr" rid="B30">30</xref>
,
<xref ref-type="bibr" rid="B7">7</xref>
]. In addition, several stand-alone tools have been introduced, which rely on object-oriented programming languages (e.g. Strainer: Software for analysis of population variation in community genomic datasets [
<xref ref-type="bibr" rid="B31">31</xref>
] or MetaLook: a 3D visualisation software for marine ecological genomics [
<xref ref-type="bibr" rid="B32">32</xref>
]). They have been designed to facilitate the exploration and analysis of highly specific datasets. Moreover, they are user-friendly and offer simple installation procedures. A shift from procedural to object-oriented programming languages is a current trend in biologist-centric software development.</p>
<p>The JCoast software tool offers the unique combination of a standardised, open source relational database system in the back-end and a user-oriented rich client in a lightweight, stand-alone solution. The widespread use of the GenDB system in academia and industry has made it easy to find a collaboration partner providing initial data processing and database access. The company Ribocon GmbH already offers custom-tailored genome analysis in GenDB/JCoast format on a commercial basis [
<xref ref-type="bibr" rid="B33">33</xref>
].</p>
<p>The possibility of running JCoast locally, with either on-site or remote access to pre-computed databases, frees the biologist from the need to acquire specialised knowledge about how to install and maintain complex annotation pipelines, while taking advantage of an advanced database structure and a tightly linked graphical user interface. As a result, biologists can focus on their research with little or no programming effort.</p>
<p>With the advent of next-generation sequencing technologies, even independent working groups or individual researchers can get easy access to genomic sequence data. In such cases, researchers often look for specific genes of interest in a defined set of genomes or metagenomes, rather than performing a time-consuming comprehensive annotation of an entire (meta)genome. Therefore, easy-to-use and flexible data mining software systems will most likely be favoured over complex annotation systems. Biologist-centric software tools such as JCoast facilitate these tasks by providing components with sophisticated bioinformatic functionalities without requiring prior programming knowledge from the biologist [
<xref ref-type="bibr" rid="B34">34</xref>
].</p>
<p>Furthermore, the handling of sensitive data, as is often the case in commercial applications, demands locally installed software systems. Nevertheless, there is no doubt that the analysis of thousands of genes and large-scale comparisons with published data are not trivial and will always require an appropriate cyberinfrastructure.</p>
<p>Currently, JCoast is primarily used for the analysis of prokaryotic genome data. In general, the GenDB system supports the analysis of eukaryotic data as well, but handling the additional information for such projects is not yet implemented in JCoast. Several extensions are planned, for example an importer for standardised sequence description files, which can currently be imported only through the import capabilities of the GenDB backbone. A project and user management system will be necessary to enhance usability in this respect. Common genome linguistic methods (GC-skew, oligonucleotide statistics), data quality checks for 454/Solexa [
<xref ref-type="bibr" rid="B35">35</xref>
] sequences and further incorporation of contextual (meta)data standards are envisioned.</p>
</sec>
<sec>
<title>Conclusion</title>
<p>JCoast is a biologist-oriented graphical software tool that provides a powerful API to manipulate and mine genomic information and bioinformatic results using the Java object-oriented programming language. The GUI is able to handle large metagenomes, as well as minimally assembled single genome projects. It provides a full-featured genome browser and sophisticated statistical and search functionalities. JCoast is developed as an extension to the GenDB back-end, but can also be used as standalone software. Pre-computed databases can be obtained through an academic collaboration or from a dedicated service company [
<xref ref-type="bibr" rid="B33">33</xref>
]. This lightweight software solution allows the biologist to concentrate on transforming genomic data into biological knowledge with minimal programming experience. JCoast has been successfully applied to several genome and metagenome projects and has proven to be both stable and easy to use. The JCoast software tool is publicly available from the project website via Java Webstart or as a Kubuntu [
<xref ref-type="bibr" rid="B36">36</xref>
] JCoast-LiveCD. The source code is available upon request from the authors.</p>
</sec>
<sec>
<title>Availability and Requirements</title>
<p>-
<bold>Project name</bold>
: JCoast – Comparative Analysis and Search Tool</p>
<p>-
<bold>Project homepage</bold>
:
<ext-link ext-link-type="uri" xlink:href="http://www.megx.net/jcoast"></ext-link>
</p>
<p>-
<bold>Operating systems</bold>
: Linux and Windows</p>
<p>-
<bold>Programming language</bold>
: Java JRE 1.5 or higher</p>
<p>-
<bold>Other requirements</bold>
: Pre-computed GenDB V2.2 MySQL database</p>
<p>-
<bold>License</bold>
: GNU General Public License version 3 (GPL3)</p>
</sec>
<sec>
<title>Authors' contributions</title>
<p>MR designed and implemented most of the API and GUI. TL implemented the core search functionalities of the API and GUI. IK implemented the extraction of group-specific genes and helped with genomesDB. RK gave expert advice on programming design and code optimisation. MBD contributed necessary classes for the calculation of statistics of large data sets and helped with the website. JP tested the GUI and helped to improve it. FOG supervised the work and helped with writing the manuscript.</p>
</sec>
</body>
<back>
<ack>
<sec>
<title>Acknowledgements</title>
<p>We thank Marga Schüler, Stefka Tyanova, Christian Quast, Elmar Prüsse, Anke Meyerdierks, and Beatriz Fernández Gómez for testing the application at early stages of the development and for giving feedback and helpful comments. This study was supported by the Max Planck Society.</p>
</sec>
</ack>
<ref-list>
<ref id="B1">
<citation citation-type="other">
<article-title>Genomes OnLine Database</article-title>
<ext-link ext-link-type="uri" xlink:href="http://www.genomesonline.org"></ext-link>
</citation>
</ref>
<ref id="B2">
<citation citation-type="other">
<person-group person-group-type="author">
<name>
<surname>Markowitz</surname>
<given-names>VM</given-names>
</name>
<name>
<surname>Szeto</surname>
<given-names>E</given-names>
</name>
<name>
<surname>Palaniappan</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Grechkin</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Chu</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>IA</given-names>
</name>
<name>
<surname>Dubchak</surname>
<given-names>I</given-names>
</name>
<name>
<surname>Anderson</surname>
<given-names>I</given-names>
</name>
<name>
<surname>Lykidis</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Mavromatis</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Ivanova</surname>
<given-names>NN</given-names>
</name>
<name>
<surname>Kyrpides</surname>
<given-names>NC</given-names>
</name>
</person-group>
<article-title>The integrated microbial genomes (IMG) system in 2007: data content and analysis tool extensions</article-title>
<source>Nucleic Acids Res</source>
<comment>Advance Access published on October 12, 2007.</comment>
</citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Vallenet</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Labarre</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Rouy</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Barbe</surname>
<given-names>V</given-names>
</name>
<name>
<surname>Bocs</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Cruveiller</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Lajus</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Pascal</surname>
<given-names>G</given-names>
</name>
<name>
<surname>Scarpelli</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Médigue</surname>
<given-names>C</given-names>
</name>
</person-group>
<article-title>MaGe: a microbial genome annotation system supported by synteny results</article-title>
<source>Nucleic Acids Res</source>
<year>2006</year>
<volume>34</volume>
<fpage>53</fpage>
<lpage>65</lpage>
<pub-id pub-id-type="pmid">16407324</pub-id>
<pub-id pub-id-type="doi">10.1093/nar/gkj406</pub-id>
</citation>
</ref>
<ref id="B4">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Venter</surname>
<given-names>JC</given-names>
</name>
<name>
<surname>Remington</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Heidelberg</surname>
<given-names>JF</given-names>
</name>
<name>
<surname>Halpern</surname>
<given-names>AL</given-names>
</name>
<name>
<surname>Rusch</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Eisen</surname>
<given-names>JA</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Paulsen</surname>
<given-names>I</given-names>
</name>
<name>
<surname>Nelson</surname>
<given-names>KE</given-names>
</name>
<name>
<surname>Nelson</surname>
<given-names>W</given-names>
</name>
<name>
<surname>Fouts</surname>
<given-names>DE</given-names>
</name>
<name>
<surname>Levy</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Knap</surname>
<given-names>AH</given-names>
</name>
<name>
<surname>Lomas</surname>
<given-names>MW</given-names>
</name>
<name>
<surname>Nealson</surname>
<given-names>K</given-names>
</name>
<name>
<surname>White</surname>
<given-names>O</given-names>
</name>
<name>
<surname>Peterson</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Hoffman</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Parsons</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Baden-Tillson</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Pfannkoch</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Rogers</surname>
<given-names>YH</given-names>
</name>
<name>
<surname>Smith</surname>
<given-names>HO</given-names>
</name>
</person-group>
<article-title>Environmental genome shotgun sequencing of the Sargasso Sea</article-title>
<source>Science</source>
<year>2004</year>
<volume>304</volume>
<fpage>66</fpage>
<lpage>74</lpage>
<pub-id pub-id-type="pmid">15001713</pub-id>
<pub-id pub-id-type="doi">10.1126/science.1093857</pub-id>
</citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rusch</surname>
<given-names>DB</given-names>
</name>
<name>
<surname>Halpern</surname>
<given-names>AL</given-names>
</name>
<name>
<surname>Sutton</surname>
<given-names>G</given-names>
</name>
<name>
<surname>Heidelberg</surname>
<given-names>KB</given-names>
</name>
<name>
<surname>Williamson</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Yooseph</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Eisen</surname>
<given-names>JA</given-names>
</name>
<name>
<surname>Hoffman</surname>
<given-names>JM</given-names>
</name>
<name>
<surname>Remington</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Beeson</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Tran</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Smith</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Baden-Tillson</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Stewart</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Thorpe</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Freeman</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Andrews-Pfannkoch</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Venter</surname>
<given-names>JE</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Kravitz</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Heidelberg</surname>
<given-names>JF</given-names>
</name>
<name>
<surname>Utterback</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Rogers</surname>
<given-names>YH</given-names>
</name>
<name>
<surname>Falcon</surname>
<given-names>LI</given-names>
</name>
<name>
<surname>Souza</surname>
<given-names>V</given-names>
</name>
<name>
<surname>Bonilla-Rosso</surname>
<given-names>G</given-names>
</name>
<name>
<surname>Eguiarte</surname>
<given-names>LE</given-names>
</name>
<name>
<surname>Karl</surname>
<given-names>DM</given-names>
</name>
<name>
<surname>Sathyendranath</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Platt</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Bermingham</surname>
<given-names>E</given-names>
</name>
<name>
<surname>Gallardo</surname>
<given-names>V</given-names>
</name>
<name>
<surname>Tamayo-Castillo</surname>
<given-names>G</given-names>
</name>
<name>
<surname>Ferrari</surname>
<given-names>MR</given-names>
</name>
<name>
<surname>Strausberg</surname>
<given-names>RL</given-names>
</name>
<name>
<surname>Nealson</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Friedman</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Frazier</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Venter</surname>
<given-names>JC</given-names>
</name>
</person-group>
<article-title>The Sorcerer II Global Ocean Sampling Expedition: Northwest Atlantic through Eastern Tropical Pacific</article-title>
<source>PLoS Biol</source>
<year>2007</year>
<volume>5</volume>
<fpage>e77</fpage>
<pub-id pub-id-type="pmid">17355176</pub-id>
<pub-id pub-id-type="doi">10.1371/journal.pbio.0050077</pub-id>
</citation>
</ref>
<ref id="B6">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Seshadri</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Kravitz</surname>
<given-names>SA</given-names>
</name>
<name>
<surname>Smarr</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Gilna</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Frazier</surname>
<given-names>M</given-names>
</name>
</person-group>
<article-title>CAMERA: a community resource for metagenomics</article-title>
<source>PLoS Biol</source>
<year>2007</year>
<volume>5</volume>
<fpage>e75</fpage>
<pub-id pub-id-type="pmid">17355175</pub-id>
<pub-id pub-id-type="doi">10.1371/journal.pbio.0050075</pub-id>
</citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gans</surname>
<given-names>JD</given-names>
</name>
<name>
<surname>Wolinsky</surname>
<given-names>M</given-names>
</name>
</person-group>
<article-title>Genomorama: genome visualization and analysis</article-title>
<source>BMC Bioinformatics</source>
<year>2007</year>
<volume>8</volume>
<fpage>204</fpage>
<pub-id pub-id-type="pmid">17570856</pub-id>
<pub-id pub-id-type="doi">10.1186/1471-2105-8-204</pub-id>
</citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rutherford</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Parkhill</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Crook</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Horsnell</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Rice</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Rajandream</surname>
<given-names>MA</given-names>
</name>
<name>
<surname>Barrell</surname>
<given-names>B</given-names>
</name>
</person-group>
<article-title>Artemis: sequence visualization and annotation</article-title>
<source>Bioinformatics</source>
<year>2000</year>
<volume>16</volume>
<fpage>944</fpage>
<lpage>945</lpage>
<pub-id pub-id-type="pmid">11120685</pub-id>
<pub-id pub-id-type="doi">10.1093/bioinformatics/16.10.944</pub-id>
</citation>
</ref>
<ref id="B9">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bairoch</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Boeckmann</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Ferro</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Gasteiger</surname>
<given-names>E</given-names>
</name>
</person-group>
<article-title>Swiss-Prot: juggling between evolution and stability</article-title>
<source>Brief Bioinform</source>
<year>2004</year>
<volume>5</volume>
<fpage>39</fpage>
<lpage>55</lpage>
<pub-id pub-id-type="pmid">15153305</pub-id>
<pub-id pub-id-type="doi">10.1093/bib/5.1.39</pub-id>
</citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Meyer</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Goesmann</surname>
<given-names>A</given-names>
</name>
<name>
<surname>McHardy</surname>
<given-names>AC</given-names>
</name>
<name>
<surname>Bartels</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Bekel</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Clausen</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Kalinowski</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Linke</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Rupp</surname>
<given-names>O</given-names>
</name>
<name>
<surname>Giegerich</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Pühler</surname>
<given-names>A</given-names>
</name>
</person-group>
<article-title>GenDB – an open source genome annotation system for prokaryote genomes</article-title>
<source>Nucleic Acids Res</source>
<year>2003</year>
<volume>31</volume>
<fpage>2187</fpage>
<lpage>95</lpage>
<pub-id pub-id-type="pmid">12682369</pub-id>
<pub-id pub-id-type="doi">10.1093/nar/gkg312</pub-id>
</citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Moran</surname>
<given-names>MA</given-names>
</name>
<name>
<surname>Belas</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Schell</surname>
<given-names>MA</given-names>
</name>
<name>
<surname>González</surname>
<given-names>JM</given-names>
</name>
<name>
<surname>Sun</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Sun</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Binder</surname>
<given-names>BJ</given-names>
</name>
<name>
<surname>Edmonds</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Ye</surname>
<given-names>W</given-names>
</name>
<name>
<surname>Orcutt</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Howard</surname>
<given-names>EC</given-names>
</name>
<name>
<surname>Meile</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Palefsky</surname>
<given-names>W</given-names>
</name>
<name>
<surname>Goesmann</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Ren</surname>
<given-names>Q</given-names>
</name>
<name>
<surname>Paulsen</surname>
<given-names>I</given-names>
</name>
<name>
<surname>Ulrich</surname>
<given-names>LE</given-names>
</name>
<name>
<surname>Thompson</surname>
<given-names>LS</given-names>
</name>
<name>
<surname>Saunders</surname>
<given-names>E</given-names>
</name>
<name>
<surname>Buchan</surname>
<given-names>A</given-names>
</name>
</person-group>
<article-title>Ecological genomics of marine Roseobacters</article-title>
<source>Appl Environ Microbiol</source>
<year>2007</year>
<volume>73</volume>
<fpage>4559</fpage>
<lpage>4569</lpage>
<pub-id pub-id-type="pmid">17526795</pub-id>
<pub-id pub-id-type="doi">10.1128/AEM.02580-06</pub-id>
</citation>
</ref>
<ref id="B12">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Scott</surname>
<given-names>KM</given-names>
</name>
<name>
<surname>Sievert</surname>
<given-names>SM</given-names>
</name>
<name>
<surname>Abril</surname>
<given-names>FN</given-names>
</name>
<name>
<surname>Ball</surname>
<given-names>LA</given-names>
</name>
<name>
<surname>Barrett</surname>
<given-names>CJ</given-names>
</name>
<name>
<surname>Blake</surname>
<given-names>RA</given-names>
</name>
<name>
<surname>Boller</surname>
<given-names>AJ</given-names>
</name>
<name>
<surname>Chain</surname>
<given-names>PSG</given-names>
</name>
<name>
<surname>Clark</surname>
<given-names>JA</given-names>
</name>
<name>
<surname>Davis</surname>
<given-names>CR</given-names>
</name>
<name>
<surname>Detter</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Do</surname>
<given-names>KF</given-names>
</name>
<name>
<surname>Dobrinski</surname>
<given-names>KP</given-names>
</name>
<name>
<surname>Faza</surname>
<given-names>BI</given-names>
</name>
<name>
<surname>Fitzpatrick</surname>
<given-names>KA</given-names>
</name>
<name>
<surname>Freyermuth</surname>
<given-names>SK</given-names>
</name>
<name>
<surname>Harmer</surname>
<given-names>TL</given-names>
</name>
<name>
<surname>Hauser</surname>
<given-names>LJ</given-names>
</name>
<name>
<surname>Hügler</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Kerfeld</surname>
<given-names>CA</given-names>
</name>
<name>
<surname>Klotz</surname>
<given-names>MG</given-names>
</name>
<name>
<surname>Kong</surname>
<given-names>WW</given-names>
</name>
<name>
<surname>Land</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Lapidus</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Larimer</surname>
<given-names>FW</given-names>
</name>
<name>
<surname>Longo</surname>
<given-names>DL</given-names>
</name>
<name>
<surname>Lucas</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Malfatti</surname>
<given-names>SA</given-names>
</name>
<name>
<surname>Massey</surname>
<given-names>SE</given-names>
</name>
<name>
<surname>Martin</surname>
<given-names>DD</given-names>
</name>
<name>
<surname>McCuddin</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Meyer</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Moore</surname>
<given-names>JL</given-names>
</name>
<name>
<surname>Ocampo</surname>
<given-names>LH</given-names>
</name>
<name>
<surname>Paul</surname>
<given-names>JH</given-names>
</name>
<name>
<surname>Paulsen</surname>
<given-names>IT</given-names>
</name>
<name>
<surname>Reep</surname>
<given-names>DK</given-names>
</name>
<name>
<surname>Ren</surname>
<given-names>Q</given-names>
</name>
<name>
<surname>Ross</surname>
<given-names>RL</given-names>
</name>
<name>
<surname>Sato</surname>
<given-names>PY</given-names>
</name>
<name>
<surname>Thomas</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Tinkham</surname>
<given-names>LE</given-names>
</name>
<name>
<surname>Zeruth</surname>
<given-names>GT</given-names>
</name>
</person-group>
<article-title>The genome of deep-sea vent chemolithoautotroph Thiomicrospira crunogena XCL-2</article-title>
<source>PLoS Biol</source>
<year>2006</year>
<volume>4</volume>
<fpage>e383</fpage>
<pub-id pub-id-type="pmid">17105352</pub-id>
<pub-id pub-id-type="doi">10.1371/journal.pbio.0040383</pub-id>
</citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Baar</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Eppinger</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Raddatz</surname>
<given-names>G</given-names>
</name>
<name>
<surname>Simon</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Lanz</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Klimmek</surname>
<given-names>O</given-names>
</name>
<name>
<surname>Nandakumar</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Gross</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Rosinus</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Keller</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Jagtap</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Linke</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Meyer</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Lederer</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Schuster</surname>
<given-names>SC</given-names>
</name>
</person-group>
<article-title>Complete genome sequence and analysis of
<italic>Wolinella succinogenes</italic>
</article-title>
<source>Proc Natl Acad Sci USA</source>
<year>2003</year>
<volume>100</volume>
<fpage>11690</fpage>
<lpage>11695</lpage>
<pub-id pub-id-type="pmid">14500908</pub-id>
<pub-id pub-id-type="doi">10.1073/pnas.1932838100</pub-id>
</citation>
</ref>
<ref id="B14">
<citation citation-type="other">
<article-title>Marine Genomics Europe</article-title>
<ext-link ext-link-type="uri" xlink:href="http://www.marine-genomics-europe.org"></ext-link>
</citation>
</ref>
<ref id="B15">
<citation citation-type="other">
<article-title>Sun Java</article-title>
<ext-link ext-link-type="uri" xlink:href="http://java.sun.com"></ext-link>
</citation>
</ref>
<ref id="B16">
<citation citation-type="other">
<article-title>SwingLabs – Java Desktop Technology</article-title>
<ext-link ext-link-type="uri" xlink:href="http://swinglabs.org"></ext-link>
</citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Altschul</surname>
<given-names>SF</given-names>
</name>
<name>
<surname>Madden</surname>
<given-names>TL</given-names>
</name>
<name>
<surname>Schaffer</surname>
<given-names>AA</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Miller</surname>
<given-names>W</given-names>
</name>
<name>
<surname>Lipman</surname>
<given-names>DJ</given-names>
</name>
</person-group>
<article-title>Gapped BLAST and PSI-BLAST: a new generation of protein database search programs</article-title>
<source>Nucleic Acids Res</source>
<year>1997</year>
<volume>25</volume>
<fpage>3389</fpage>
<lpage>3402</lpage>
<pub-id pub-id-type="pmid">9254694</pub-id>
<pub-id pub-id-type="doi">10.1093/nar/25.17.3389</pub-id>
</citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bateman</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Coin</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Durbin</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Finn</surname>
<given-names>RD</given-names>
</name>
<name>
<surname>Hollich</surname>
<given-names>V</given-names>
</name>
<name>
<surname>Griffiths-Jones</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Khanna</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Marshall</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Moxon</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Sonnhammer</surname>
<given-names>ELL</given-names>
</name>
<name>
<surname>Studholme</surname>
<given-names>DJ</given-names>
</name>
<name>
<surname>Yeats</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Eddy</surname>
<given-names>SR</given-names>
</name>
</person-group>
<article-title>The Pfam protein families database</article-title>
<source>Nucleic Acids Res</source>
<year>2004</year>
<volume>32</volume>
<fpage>D138</fpage>
<lpage>D141</lpage>
<pub-id pub-id-type="pmid">14681378</pub-id>
<pub-id pub-id-type="doi">10.1093/nar/gkh121</pub-id>
</citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mulder</surname>
<given-names>NJ</given-names>
</name>
<name>
<surname>Apweiler</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Attwood</surname>
<given-names>TK</given-names>
</name>
<name>
<surname>Bairoch</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Bateman</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Binns</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Bradley</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Bork</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Bucher</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Cerutti</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Copley</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Courcelle</surname>
<given-names>E</given-names>
</name>
<name>
<surname>Das</surname>
<given-names>U</given-names>
</name>
<name>
<surname>Durbin</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Fleischmann</surname>
<given-names>W</given-names>
</name>
<name>
<surname>Gough</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Haft</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Harte</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Hulo</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Kahn</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Kanapin</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Krestyaninova</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Lonsdale</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Lopez</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Letunic</surname>
<given-names>I</given-names>
</name>
<name>
<surname>Madera</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Maslen</surname>
<given-names>J</given-names>
</name>
<name>
<surname>McDowall</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Mitchell</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Nikolskaya</surname>
<given-names>AN</given-names>
</name>
<name>
<surname>Orchard</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Pagni</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Ponting</surname>
<given-names>CP</given-names>
</name>
<name>
<surname>Quevillon</surname>
<given-names>E</given-names>
</name>
<name>
<surname>Selengut</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Sigrist</surname>
<given-names>CJA</given-names>
</name>
<name>
<surname>Silventoinen</surname>
<given-names>V</given-names>
</name>
<name>
<surname>Studholme</surname>
<given-names>DJ</given-names>
</name>
<name>
<surname>Vaughan</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>CH</given-names>
</name>
</person-group>
<article-title>InterPro, progress and status in 2005</article-title>
<source>Nucleic Acids Res</source>
<year>2005</year>
<volume>33</volume>
<fpage>D201</fpage>
<lpage>D205</lpage>
<pub-id pub-id-type="pmid">15608177</pub-id>
<pub-id pub-id-type="doi">10.1093/nar/gki106</pub-id>
</citation>
</ref>
<ref id="B20">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bendtsen</surname>
<given-names>JD</given-names>
</name>
<name>
<surname>Nielsen</surname>
<given-names>H</given-names>
</name>
<name>
<surname>von Heijne</surname>
<given-names>G</given-names>
</name>
<name>
<surname>Brunak</surname>
<given-names>S</given-names>
</name>
</person-group>
<article-title>Improved prediction of signal peptides: SignalP 3.0</article-title>
<source>J Mol Biol</source>
<year>2004</year>
<volume>340</volume>
<fpage>783</fpage>
<lpage>795</lpage>
<pub-id pub-id-type="pmid">15223320</pub-id>
<pub-id pub-id-type="doi">10.1016/j.jmb.2004.05.028</pub-id>
</citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Krogh</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Larsson</surname>
<given-names>B</given-names>
</name>
<name>
<surname>von Heijne</surname>
<given-names>G</given-names>
</name>
<name>
<surname>Sonnhammer</surname>
<given-names>EL</given-names>
</name>
</person-group>
<article-title>Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes</article-title>
<source>J Mol Biol</source>
<year>2001</year>
<volume>305</volume>
<fpage>567</fpage>
<lpage>580</lpage>
<pub-id pub-id-type="pmid">11152613</pub-id>
<pub-id pub-id-type="doi">10.1006/jmbi.2000.4315</pub-id>
</citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lombardot</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Kottmann</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Pfeffer</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Richter</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Teeling</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Quast</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Glöckner</surname>
<given-names>FO</given-names>
</name>
</person-group>
<article-title>Megx.net – database resources for marine ecological genomics</article-title>
<source>Nucleic Acids Res</source>
<year>2006</year>
<volume>34</volume>
<fpage>D390</fpage>
<lpage>D393</lpage>
<pub-id pub-id-type="pmid">16381894</pub-id>
<pub-id pub-id-type="doi">10.1093/nar/gkj070</pub-id>
</citation>
</ref>
<ref id="B23">
<citation citation-type="other">
<article-title>MySQL – Open Source database</article-title>
<ext-link ext-link-type="uri" xlink:href="http://www.mysql.com"></ext-link>
</citation>
</ref>
<ref id="B24">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tatusov</surname>
<given-names>RL</given-names>
</name>
<name>
<surname>Fedorova</surname>
<given-names>ND</given-names>
</name>
<name>
<surname>Jackson</surname>
<given-names>JD</given-names>
</name>
<name>
<surname>Jacobs</surname>
<given-names>AR</given-names>
</name>
<name>
<surname>Kiryutin</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Koonin</surname>
<given-names>EV</given-names>
</name>
<name>
<surname>Krylov</surname>
<given-names>DM</given-names>
</name>
<name>
<surname>Mazumder</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Mekhedov</surname>
<given-names>SL</given-names>
</name>
<name>
<surname>Nikolskaya</surname>
<given-names>AN</given-names>
</name>
<name>
<surname>Rao</surname>
<given-names>BS</given-names>
</name>
<name>
<surname>Smirnov</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Sverdlov</surname>
<given-names>AV</given-names>
</name>
<name>
<surname>Vasudevan</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Wolf</surname>
<given-names>YI</given-names>
</name>
<name>
<surname>Yin</surname>
<given-names>JJ</given-names>
</name>
<name>
<surname>Natale</surname>
<given-names>DA</given-names>
</name>
</person-group>
<article-title>The COG database: an updated version includes eukaryotes</article-title>
<source>BMC Bioinformatics</source>
<year>2003</year>
<volume>4</volume>
<fpage>41</fpage>
<pub-id pub-id-type="pmid">12969510</pub-id>
<pub-id pub-id-type="doi">10.1186/1471-2105-4-41</pub-id>
</citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bauer</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Kube</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Teeling</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Richter</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Lombardot</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Allers</surname>
<given-names>E</given-names>
</name>
<name>
<surname>Würdemann</surname>
<given-names>CA</given-names>
</name>
<name>
<surname>Quast</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Kuhl</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Knaust</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Woebken</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Bischof</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Mussmann</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Choudhuri</surname>
<given-names>JV</given-names>
</name>
<name>
<surname>Meyer</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Reinhardt</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Amann</surname>
<given-names>RI</given-names>
</name>
<name>
<surname>Glöckner</surname>
<given-names>FO</given-names>
</name>
</person-group>
<article-title>Whole genome analysis of the marine Bacteroidetes 'Gramella forsetii' reveals adaptations to degradation of polymeric organic matter</article-title>
<source>Environ Microbiol</source>
<year>2006</year>
<volume>8</volume>
<fpage>2201</fpage>
<lpage>2213</lpage>
<pub-id pub-id-type="pmid">17107561</pub-id>
<pub-id pub-id-type="doi">10.1111/j.1462-2920.2006.01152.x</pub-id>
</citation>
</ref>
<ref id="B26">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Richter</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Kube</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Bazylinski</surname>
<given-names>DA</given-names>
</name>
<name>
<surname>Lombardot</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Glöckner</surname>
<given-names>FO</given-names>
</name>
<name>
<surname>Reinhardt</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Schüler</surname>
<given-names>D</given-names>
</name>
</person-group>
<article-title>Comparative genome analysis of four magnetotactic bacteria reveals a complex set of group-specific genes implicated in magnetosome biomineralization and function</article-title>
<source>J Bacteriol</source>
<year>2007</year>
<volume>189</volume>
<fpage>4899</fpage>
<lpage>4910</lpage>
<pub-id pub-id-type="pmid">17449609</pub-id>
<pub-id pub-id-type="doi">10.1128/JB.00119-07</pub-id>
</citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Fuchs</surname>
<given-names>BM</given-names>
</name>
<name>
<surname>Spring</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Teeling</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Quast</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Wulf</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Schattenhofer</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Yan</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Ferriera</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Johnson</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Glöckner</surname>
<given-names>FO</given-names>
</name>
<name>
<surname>Amann</surname>
<given-names>R</given-names>
</name>
</person-group>
<article-title>Characterization of a marine gammaproteobacterium capable of aerobic anoxygenic photosynthesis</article-title>
<source>Proc Natl Acad Sci USA</source>
<year>2007</year>
<volume>104</volume>
<fpage>2891</fpage>
<lpage>2896</lpage>
<pub-id pub-id-type="pmid">17299055</pub-id>
<pub-id pub-id-type="doi">10.1073/pnas.0608046104</pub-id>
</citation>
</ref>
<ref id="B28">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Woyke</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Teeling</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Ivanova</surname>
<given-names>NN</given-names>
</name>
<name>
<surname>Huntemann</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Richter</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Gloeckner</surname>
<given-names>FO</given-names>
</name>
<name>
<surname>Boffelli</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Anderson</surname>
<given-names>IJ</given-names>
</name>
<name>
<surname>Barry</surname>
<given-names>KW</given-names>
</name>
<name>
<surname>Shapiro</surname>
<given-names>HJ</given-names>
</name>
<name>
<surname>Szeto</surname>
<given-names>E</given-names>
</name>
<name>
<surname>Kyrpides</surname>
<given-names>NC</given-names>
</name>
<name>
<surname>Mussmann</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Amann</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Bergin</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Ruehland</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Rubin</surname>
<given-names>EM</given-names>
</name>
<name>
<surname>Dubilier</surname>
<given-names>N</given-names>
</name>
</person-group>
<article-title>Symbiosis insights through metagenomic analysis of a microbial consortium</article-title>
<source>Nature</source>
<year>2006</year>
<volume>443</volume>
<fpage>950</fpage>
<lpage>955</lpage>
<pub-id pub-id-type="pmid">16980956</pub-id>
<pub-id pub-id-type="doi">10.1038/nature05192</pub-id>
</citation>
</ref>
<ref id="B29">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mussmann</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Hu</surname>
<given-names>FZ</given-names>
</name>
<name>
<surname>Richter</surname>
<given-names>M</given-names>
</name>
<name>
<surname>de Beer</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Preisler</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Jørgensen</surname>
<given-names>BB</given-names>
</name>
<name>
<surname>Huntemann</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Glöckner</surname>
<given-names>FO</given-names>
</name>
<name>
<surname>Amann</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Koopman</surname>
<given-names>WJH</given-names>
</name>
<name>
<surname>Lasken</surname>
<given-names>RS</given-names>
</name>
<name>
<surname>Janto</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Hogg</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Stoodley</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Boissy</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Ehrlich</surname>
<given-names>GD</given-names>
</name>
</person-group>
<article-title>Insights into the genome of large sulfur bacteria revealed by analysis of single filaments</article-title>
<source>PLoS Biol</source>
<year>2007</year>
<volume>5</volume>
<fpage>e230</fpage>
<pub-id pub-id-type="pmid">17760503</pub-id>
<pub-id pub-id-type="doi">10.1371/journal.pbio.0050230</pub-id>
</citation>
</ref>
<ref id="B30">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bryson</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Loux</surname>
<given-names>V</given-names>
</name>
<name>
<surname>Bossy</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Nicolas</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Chaillou</surname>
<given-names>S</given-names>
</name>
<name>
<surname>van de Guchte</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Penaud</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Maguin</surname>
<given-names>E</given-names>
</name>
<name>
<surname>Hoebeke</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Bessières</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Gibrat</surname>
<given-names>J</given-names>
</name>
</person-group>
<article-title>AGMIAL: implementing an annotation strategy for prokaryote genomes as a distributed system</article-title>
<source>Nucleic Acids Res</source>
<year>2006</year>
<volume>34</volume>
<fpage>3533</fpage>
<lpage>3545</lpage>
<pub-id pub-id-type="pmid">16855290</pub-id>
<pub-id pub-id-type="doi">10.1093/nar/gkl471</pub-id>
</citation>
</ref>
<ref id="B31">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Eppley</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Tyson</surname>
<given-names>G</given-names>
</name>
<name>
<surname>Getz</surname>
<given-names>W</given-names>
</name>
<name>
<surname>Banfield</surname>
<given-names>J</given-names>
</name>
</person-group>
<article-title>Strainer: Software for analysis of population variation in community genomic datasets</article-title>
<source>BMC Bioinformatics</source>
<year>2007</year>
<volume>8</volume>
<fpage>398</fpage>
<pub-id pub-id-type="pmid">17941997</pub-id>
<pub-id pub-id-type="doi">10.1186/1471-2105-8-398</pub-id>
</citation>
</ref>
<ref id="B32">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lombardot</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Kottmann</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Giuliani</surname>
<given-names>G</given-names>
</name>
<name>
<surname>de Bono</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Addor</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Glöckner</surname>
<given-names>F</given-names>
</name>
</person-group>
<article-title>MetaLook: a 3D visualisation software for marine ecological genomics</article-title>
<source>BMC Bioinformatics</source>
<year>2007</year>
<volume>8</volume>
<fpage>406</fpage>
<pub-id pub-id-type="pmid">17953757</pub-id>
<pub-id pub-id-type="doi">10.1186/1471-2105-8-406</pub-id>
</citation>
</ref>
<ref id="B33">
<citation citation-type="other">
<article-title>The Ribocon GmbH-Bioinformatics and Molecular Diagnostics</article-title>
<ext-link ext-link-type="uri" xlink:href="http://www.ribocon.com"></ext-link>
</citation>
</ref>
<ref id="B34">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kumar</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Dudley</surname>
<given-names>J</given-names>
</name>
</person-group>
<article-title>Bioinformatics software for biologists in the genomics era</article-title>
<source>Bioinformatics</source>
<year>2007</year>
<volume>23</volume>
<fpage>1713</fpage>
<lpage>1717</lpage>
<pub-id pub-id-type="pmid">17485425</pub-id>
<pub-id pub-id-type="doi">10.1093/bioinformatics/btm239</pub-id>
</citation>
</ref>
<ref id="B35">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hall</surname>
<given-names>N</given-names>
</name>
</person-group>
<article-title>Advanced sequencing technologies and their wider impact in microbiology</article-title>
<source>J Exp Biol</source>
<year>2007</year>
<volume>210</volume>
<fpage>1518</fpage>
<lpage>1525</lpage>
<pub-id pub-id-type="pmid">17449817</pub-id>
<pub-id pub-id-type="doi">10.1242/jeb.001370</pub-id>
</citation>
</ref>
<ref id="B36">
<citation citation-type="other">
<article-title>Kubuntu – A user friendly operating system</article-title>
<ext-link ext-link-type="uri" xlink:href="http://www.kubuntu.org"></ext-link>
</citation>
</ref>
</ref-list>
</back>
</pmc>
</record>

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/CyberinfraV1/Data/Pmc/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 0001980 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Pmc/Corpus/biblio.hfd -nk 0001980 | SxmlIndent | more
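
Once the record has been extracted with one of the commands above, its reference list can be processed with ordinary XML tooling. The following is a minimal sketch in Python, assuming `<ref>` elements shaped like those in this record. The function name `list_references` and the inline sample (reference B17 with a shortened author list) are illustrative only and are not part of Dilib.

```python
import xml.etree.ElementTree as ET

# Inline sample modelled on reference B17 of this record (author list shortened).
SAMPLE = """<ref-list>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name><surname>Altschul</surname><given-names>SF</given-names></name>
<name><surname>Madden</surname><given-names>TL</given-names></name>
</person-group>
<article-title>Gapped BLAST and PSI-BLAST: a new generation of protein database search programs</article-title>
<source>Nucleic Acids Res</source>
<year>1997</year>
<pub-id pub-id-type="pmid">9254694</pub-id>
<pub-id pub-id-type="doi">10.1093/nar/25.17.3389</pub-id>
</citation>
</ref>
</ref-list>"""


def list_references(xml_text):
    """Return one dict per <ref>: id, first author, year, title, and DOI."""
    root = ET.fromstring(xml_text)
    rows = []
    for ref in root.iter("ref"):
        citation = ref.find("citation")
        if citation is None:
            continue  # skip malformed entries without a <citation> child
        rows.append({
            "id": ref.get("id"),
            # findtext returns the text of the first match, or None if absent
            "first_author": citation.findtext("person-group/name/surname"),
            "year": citation.findtext("year"),
            "title": citation.findtext("article-title"),
            "doi": citation.findtext("pub-id[@pub-id-type='doi']"),
        })
    return rows


if __name__ == "__main__":
    for row in list_references(SAMPLE):
        print(row["id"], row["first_author"], row["year"], row["doi"])
```

Entries of type "other" (web resources such as B14 or B23) have no authors, year, or DOI, so those fields come back as `None`. When parsing the full record rather than a fragment, the `xlink` prefix used by `<ext-link>` must be declared somewhere in the input, otherwise the parser rejects it as an unbound prefix.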

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    CyberinfraV1
   |flux=    Pmc
   |étape=   Corpus
   |type=    RBID
   |clé=     
   |texte=   
}}

Wicri

This area was generated with Dilib version V0.6.25.
Data generation: Thu Oct 27 09:30:58 2016. Site generation: Sun Mar 10 23:08:40 2024