Serveur d'exploration MERS

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Real Time Metagenomics: Using k-mers to annotate metagenomes

Identifieur interne : 000A02 ( Ncbi/Curation ); précédent : 000A01; suivant : 000A03

Real Time Metagenomics: Using k-mers to annotate metagenomes

Auteurs : Robert A. Edwards [États-Unis] ; Robert Olson [États-Unis] ; Terry Disz [États-Unis] ; Gordon D. Pusch [États-Unis] ; Veronika Vonstein [États-Unis] ; Rick Stevens [États-Unis] ; Ross Overbeek [États-Unis]

Source :

RBID : PMC:3519453

Descripteurs français

English descriptors

Abstract

Summary: Annotation of metagenomes involves comparing the individual sequence reads with a database of known sequences and assigning a unique function to each read. This is a time-consuming task that is computationally intensive (though not computationally complex). Here we present a novel approach to annotate metagenomes using unique k-mer oligopeptide sequences from 7 to 12 amino acids long. We demonstrate that k-mer-based annotations are faster and approach the sensitivity and precision of blastx-based annotations without loosing accuracy. A last-common ancestor approach was also developed to describe the members of the community.

Availability and implementation: This open-source application was implemented in Perl and can be accessed via a user-friendly website at http://edwards.sdsu.edu/rtmg. In addition, code to access the annotation servers is available for download from http://www.theseed.org/. FIGfams and k-mers are available for download from ftp://ftp.theseed.org/FIGfams/.

Contact:redwards@mail.sdsu.edu

Supplementary information:Supplementary data are available at Bioinformatics online.


Url:
DOI: 10.1093/bioinformatics/bts599
PubMed: 23047562
PubMed Central: 3519453

Links toward previous steps (curation, corpus...)


Links to Exploration step

PMC:3519453

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Real Time Metagenomics: Using
<italic>k</italic>
-mers to annotate metagenomes</title>
<author>
<name sortKey="Edwards, Robert A" sort="Edwards, Robert A" uniqKey="Edwards R" first="Robert A." last="Edwards">Robert A. Edwards</name>
<affiliation wicri:level="2">
<nlm:aff id="bts599-AFF1">Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL 60439, USA,</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL 60439</wicri:regionArea>
<placeName>
<region type="state">Illinois</region>
</placeName>
</affiliation>
<affiliation>
<nlm:aff id="bts599-AFF1">Department of Biology,</nlm:aff>
</affiliation>
<affiliation wicri:level="2">
<nlm:aff id="bts599-AFF1">Department of Computer Science, San Diego State University, San Diego, CA 92182, USA,</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Computer Science, San Diego State University, San Diego, CA 92182</wicri:regionArea>
<placeName>
<region type="state">Californie</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Olson, Robert" sort="Olson, Robert" uniqKey="Olson R" first="Robert" last="Olson">Robert Olson</name>
<affiliation wicri:level="2">
<nlm:aff id="bts599-AFF1">Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL 60439, USA,</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL 60439</wicri:regionArea>
<placeName>
<region type="state">Illinois</region>
</placeName>
</affiliation>
<affiliation wicri:level="4">
<nlm:aff wicri:cut=" and" id="bts599-AFF1">Computation Institute, University of Chicago, Chicago, IL 60637, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Computation Institute, University of Chicago, Chicago, IL 60637</wicri:regionArea>
<placeName>
<region type="state">Illinois</region>
<settlement type="city">Chicago</settlement>
</placeName>
<orgName type="university">Université de Chicago</orgName>
</affiliation>
</author>
<author>
<name sortKey="Disz, Terry" sort="Disz, Terry" uniqKey="Disz T" first="Terry" last="Disz">Terry Disz</name>
<affiliation wicri:level="2">
<nlm:aff id="bts599-AFF1">Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL 60439, USA,</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL 60439</wicri:regionArea>
<placeName>
<region type="state">Illinois</region>
</placeName>
</affiliation>
<affiliation wicri:level="4">
<nlm:aff wicri:cut=" and" id="bts599-AFF1">Computation Institute, University of Chicago, Chicago, IL 60637, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Computation Institute, University of Chicago, Chicago, IL 60637</wicri:regionArea>
<placeName>
<region type="state">Illinois</region>
<settlement type="city">Chicago</settlement>
</placeName>
<orgName type="university">Université de Chicago</orgName>
</affiliation>
</author>
<author>
<name sortKey="Pusch, Gordon D" sort="Pusch, Gordon D" uniqKey="Pusch G" first="Gordon D." last="Pusch">Gordon D. Pusch</name>
<affiliation wicri:level="2">
<nlm:aff id="bts599-AFF1">Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL 60439, USA,</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL 60439</wicri:regionArea>
<placeName>
<region type="state">Illinois</region>
</placeName>
</affiliation>
<affiliation wicri:level="4">
<nlm:aff wicri:cut=" and" id="bts599-AFF1">Computation Institute, University of Chicago, Chicago, IL 60637, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Computation Institute, University of Chicago, Chicago, IL 60637</wicri:regionArea>
<placeName>
<region type="state">Illinois</region>
<settlement type="city">Chicago</settlement>
</placeName>
<orgName type="university">Université de Chicago</orgName>
</affiliation>
</author>
<author>
<name sortKey="Vonstein, Veronika" sort="Vonstein, Veronika" uniqKey="Vonstein V" first="Veronika" last="Vonstein">Veronika Vonstein</name>
<affiliation wicri:level="2">
<nlm:aff id="bts599-AFF1">Fellowship for the Interpretation of Genomes, Burr Ridge, IL 60527, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Fellowship for the Interpretation of Genomes, Burr Ridge, IL 60527</wicri:regionArea>
<placeName>
<region type="state">Illinois</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Stevens, Rick" sort="Stevens, Rick" uniqKey="Stevens R" first="Rick" last="Stevens">Rick Stevens</name>
<affiliation wicri:level="2">
<nlm:aff id="bts599-AFF1">Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL 60439, USA,</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL 60439</wicri:regionArea>
<placeName>
<region type="state">Illinois</region>
</placeName>
</affiliation>
<affiliation wicri:level="4">
<nlm:aff wicri:cut=" and" id="bts599-AFF1">Computation Institute, University of Chicago, Chicago, IL 60637, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Computation Institute, University of Chicago, Chicago, IL 60637</wicri:regionArea>
<placeName>
<region type="state">Illinois</region>
<settlement type="city">Chicago</settlement>
</placeName>
<orgName type="university">Université de Chicago</orgName>
</affiliation>
</author>
<author>
<name sortKey="Overbeek, Ross" sort="Overbeek, Ross" uniqKey="Overbeek R" first="Ross" last="Overbeek">Ross Overbeek</name>
<affiliation wicri:level="2">
<nlm:aff id="bts599-AFF1">Fellowship for the Interpretation of Genomes, Burr Ridge, IL 60527, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Fellowship for the Interpretation of Genomes, Burr Ridge, IL 60527</wicri:regionArea>
<placeName>
<region type="state">Illinois</region>
</placeName>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PMC</idno>
<idno type="pmid">23047562</idno>
<idno type="pmc">3519453</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3519453</idno>
<idno type="RBID">PMC:3519453</idno>
<idno type="doi">10.1093/bioinformatics/bts599</idno>
<date when="2012">2012</date>
<idno type="wicri:Area/Pmc/Corpus">000B09</idno>
<idno type="wicri:explorRef" wicri:stream="Pmc" wicri:step="Corpus" wicri:corpus="PMC">000B09</idno>
<idno type="wicri:Area/Pmc/Curation">000B09</idno>
<idno type="wicri:explorRef" wicri:stream="Pmc" wicri:step="Curation">000B09</idno>
<idno type="wicri:Area/Pmc/Checkpoint">001284</idno>
<idno type="wicri:explorRef" wicri:stream="Pmc" wicri:step="Checkpoint">001284</idno>
<idno type="wicri:source">PubMed</idno>
<idno type="RBID">pubmed:23047562</idno>
<idno type="wicri:Area/PubMed/Corpus">001D42</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Corpus" wicri:corpus="PubMed">001D42</idno>
<idno type="wicri:Area/PubMed/Curation">001D42</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Curation">001D42</idno>
<idno type="wicri:Area/PubMed/Checkpoint">001C41</idno>
<idno type="wicri:explorRef" wicri:stream="Checkpoint" wicri:step="PubMed">001C41</idno>
<idno type="wicri:Area/Ncbi/Merge">000A02</idno>
<idno type="wicri:Area/Ncbi/Curation">000A02</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a" type="main">Real Time Metagenomics: Using
<italic>k</italic>
-mers to annotate metagenomes</title>
<author>
<name sortKey="Edwards, Robert A" sort="Edwards, Robert A" uniqKey="Edwards R" first="Robert A." last="Edwards">Robert A. Edwards</name>
<affiliation wicri:level="2">
<nlm:aff id="bts599-AFF1">Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL 60439, USA,</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL 60439</wicri:regionArea>
<placeName>
<region type="state">Illinois</region>
</placeName>
</affiliation>
<affiliation>
<nlm:aff id="bts599-AFF1">Department of Biology,</nlm:aff>
</affiliation>
<affiliation wicri:level="2">
<nlm:aff id="bts599-AFF1">Department of Computer Science, San Diego State University, San Diego, CA 92182, USA,</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Computer Science, San Diego State University, San Diego, CA 92182</wicri:regionArea>
<placeName>
<region type="state">Californie</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Olson, Robert" sort="Olson, Robert" uniqKey="Olson R" first="Robert" last="Olson">Robert Olson</name>
<affiliation wicri:level="2">
<nlm:aff id="bts599-AFF1">Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL 60439, USA,</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL 60439</wicri:regionArea>
<placeName>
<region type="state">Illinois</region>
</placeName>
</affiliation>
<affiliation wicri:level="4">
<nlm:aff wicri:cut=" and" id="bts599-AFF1">Computation Institute, University of Chicago, Chicago, IL 60637, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Computation Institute, University of Chicago, Chicago, IL 60637</wicri:regionArea>
<placeName>
<region type="state">Illinois</region>
<settlement type="city">Chicago</settlement>
</placeName>
<orgName type="university">Université de Chicago</orgName>
</affiliation>
</author>
<author>
<name sortKey="Disz, Terry" sort="Disz, Terry" uniqKey="Disz T" first="Terry" last="Disz">Terry Disz</name>
<affiliation wicri:level="2">
<nlm:aff id="bts599-AFF1">Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL 60439, USA,</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL 60439</wicri:regionArea>
<placeName>
<region type="state">Illinois</region>
</placeName>
</affiliation>
<affiliation wicri:level="4">
<nlm:aff wicri:cut=" and" id="bts599-AFF1">Computation Institute, University of Chicago, Chicago, IL 60637, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Computation Institute, University of Chicago, Chicago, IL 60637</wicri:regionArea>
<placeName>
<region type="state">Illinois</region>
<settlement type="city">Chicago</settlement>
</placeName>
<orgName type="university">Université de Chicago</orgName>
</affiliation>
</author>
<author>
<name sortKey="Pusch, Gordon D" sort="Pusch, Gordon D" uniqKey="Pusch G" first="Gordon D." last="Pusch">Gordon D. Pusch</name>
<affiliation wicri:level="2">
<nlm:aff id="bts599-AFF1">Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL 60439, USA,</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL 60439</wicri:regionArea>
<placeName>
<region type="state">Illinois</region>
</placeName>
</affiliation>
<affiliation wicri:level="4">
<nlm:aff wicri:cut=" and" id="bts599-AFF1">Computation Institute, University of Chicago, Chicago, IL 60637, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Computation Institute, University of Chicago, Chicago, IL 60637</wicri:regionArea>
<placeName>
<region type="state">Illinois</region>
<settlement type="city">Chicago</settlement>
</placeName>
<orgName type="university">Université de Chicago</orgName>
</affiliation>
</author>
<author>
<name sortKey="Vonstein, Veronika" sort="Vonstein, Veronika" uniqKey="Vonstein V" first="Veronika" last="Vonstein">Veronika Vonstein</name>
<affiliation wicri:level="2">
<nlm:aff id="bts599-AFF1">Fellowship for the Interpretation of Genomes, Burr Ridge, IL 60527, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Fellowship for the Interpretation of Genomes, Burr Ridge, IL 60527</wicri:regionArea>
<placeName>
<region type="state">Illinois</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Stevens, Rick" sort="Stevens, Rick" uniqKey="Stevens R" first="Rick" last="Stevens">Rick Stevens</name>
<affiliation wicri:level="2">
<nlm:aff id="bts599-AFF1">Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL 60439, USA,</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL 60439</wicri:regionArea>
<placeName>
<region type="state">Illinois</region>
</placeName>
</affiliation>
<affiliation wicri:level="4">
<nlm:aff wicri:cut=" and" id="bts599-AFF1">Computation Institute, University of Chicago, Chicago, IL 60637, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Computation Institute, University of Chicago, Chicago, IL 60637</wicri:regionArea>
<placeName>
<region type="state">Illinois</region>
<settlement type="city">Chicago</settlement>
</placeName>
<orgName type="university">Université de Chicago</orgName>
</affiliation>
</author>
<author>
<name sortKey="Overbeek, Ross" sort="Overbeek, Ross" uniqKey="Overbeek R" first="Ross" last="Overbeek">Ross Overbeek</name>
<affiliation wicri:level="2">
<nlm:aff id="bts599-AFF1">Fellowship for the Interpretation of Genomes, Burr Ridge, IL 60527, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Fellowship for the Interpretation of Genomes, Burr Ridge, IL 60527</wicri:regionArea>
<placeName>
<region type="state">Illinois</region>
</placeName>
</affiliation>
</author>
</analytic>
<series>
<title level="j">Bioinformatics</title>
<idno type="ISSN">1367-4803</idno>
<idno type="eISSN">1367-4811</idno>
<imprint>
<date when="2012">2012</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Algorithms</term>
<term>Metagenome</term>
<term>Metagenomics (methods)</term>
<term>Molecular Sequence Annotation</term>
<term>Sequence Analysis, DNA</term>
</keywords>
<keywords scheme="KwdFr" xml:lang="fr">
<term>Algorithmes</term>
<term>Analyse de séquence d'ADN</term>
<term>Annotation de séquence moléculaire</term>
<term>Métagénome</term>
<term>Métagénomique ()</term>
</keywords>
<keywords scheme="MESH" qualifier="methods" xml:lang="en">
<term>Metagenomics</term>
</keywords>
<keywords scheme="MESH" xml:lang="en">
<term>Algorithms</term>
<term>Metagenome</term>
<term>Molecular Sequence Annotation</term>
<term>Sequence Analysis, DNA</term>
</keywords>
<keywords scheme="MESH" xml:lang="fr">
<term>Algorithmes</term>
<term>Analyse de séquence d'ADN</term>
<term>Annotation de séquence moléculaire</term>
<term>Métagénome</term>
<term>Métagénomique</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">
<p>
<bold>Summary:</bold>
Annotation of metagenomes involves comparing the individual sequence reads with a database of known sequences and assigning a unique function to each read. This is a time-consuming task that is computationally intensive (though not computationally complex). Here we present a novel approach to annotate metagenomes using unique
<italic>k</italic>
-mer oligopeptide sequences from 7 to 12 amino acids long. We demonstrate that
<italic>k-</italic>
mer-based annotations are faster and approach the sensitivity and precision of blastx-based annotations without loosing accuracy. A last-common ancestor approach was also developed to describe the members of the community.</p>
<p>
<bold>Availability and implementation:</bold>
This open-source application was implemented in Perl and can be accessed via a user-friendly website at
<ext-link ext-link-type="uri" xlink:href="http://edwards.sdsu.edu/rtmg">http://edwards.sdsu.edu/rtmg</ext-link>
. In addition, code to access the annotation servers is available for download from
<ext-link ext-link-type="uri" xlink:href="http://www.theseed.org/">http://www.theseed.org/</ext-link>
. FIGfams and
<italic>k</italic>
-mers are available for download from
<ext-link ext-link-type="ftp" xlink:href="ftp://ftp.theseed.org/FIGfams/">ftp://ftp.theseed.org/FIGfams/</ext-link>
.</p>
<p>
<bold>Contact:</bold>
<email>redwards@mail.sdsu.edu</email>
</p>
<p>
<bold>Supplementary information:</bold>
<ext-link ext-link-type="uri" xlink:href="http://bioinformatics.oxfordjournals.org/cgi/content/full/bts599/DC1">Supplementary data</ext-link>
are available at
<italic>Bioinformatics</italic>
online.</p>
</div>
</front>
<back>
<div1 type="bibliography">
<listBibl>
<biblStruct>
<analytic>
<author>
<name sortKey="Angly, F E" uniqKey="Angly F">F.E. Angly</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Dinsdale, E A" uniqKey="Dinsdale E">E.A. Dinsdale</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Meyer, F" uniqKey="Meyer F">F. Meyer</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Meyer, F" uniqKey="Meyer F">F. Meyer</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Overbeek, R" uniqKey="Overbeek R">R. Overbeek</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Wilkening, J" uniqKey="Wilkening J">J. Wilkening</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Wommack, K E" uniqKey="Wommack K">K.E. Wommack</name>
</author>
</analytic>
</biblStruct>
</listBibl>
</div1>
</back>
</TEI>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Sante/explor/MersV1/Data/Ncbi/Curation
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000A02 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Ncbi/Curation/biblio.hfd -nk 000A02 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Sante
   |area=    MersV1
   |flux=    Ncbi
   |étape=   Curation
   |type=    RBID
   |clé=     PMC:3519453
   |texte=   Real Time Metagenomics: Using k-mers to annotate metagenomes
}}

Pour générer des pages wiki

HfdIndexSelect -h $EXPLOR_AREA/Data/Ncbi/Curation/RBID.i   -Sk "pubmed:23047562" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Ncbi/Curation/biblio.hfd   \
       | NlmPubMed2Wicri -a MersV1 

Wicri

This area was generated with Dilib version V0.6.33.
Data generation: Mon Apr 20 23:26:43 2020. Site generation: Sat Mar 27 09:06:09 2021