Serveur d'exploration MERS

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Kraken: ultrafast metagenomic sequence classification using exact alignments.

Identifieur interne : 001A39 ( PubMed/Corpus ); précédent : 001A38; suivant : 001A40

Kraken: ultrafast metagenomic sequence classification using exact alignments.

Auteurs : Derrick E. Wood ; Steven L. Salzberg

Source :

RBID : pubmed:24580807

English descriptors

Abstract

Kraken is an ultrafast and highly accurate program for assigning taxonomic labels to metagenomic DNA sequences. Previous programs designed for this task have been relatively slow and computationally expensive, forcing researchers to use faster abundance estimation programs, which only classify small subsets of metagenomic data. Using exact alignment of k-mers, Kraken achieves classification accuracy comparable to the fastest BLAST program. In its fastest mode, Kraken classifies 100 base pair reads at a rate of over 4.1 million reads per minute, 909 times faster than Megablast and 11 times faster than the abundance estimation program MetaPhlAn. Kraken is available at http://ccb.jhu.edu/software/kraken/.

DOI: 10.1186/gb-2014-15-3-r46
PubMed: 24580807

Links to Exploration step

pubmed:24580807

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Kraken: ultrafast metagenomic sequence classification using exact alignments.</title>
<author>
<name sortKey="Wood, Derrick E" sort="Wood, Derrick E" uniqKey="Wood D" first="Derrick E" last="Wood">Derrick E. Wood</name>
</author>
<author>
<name sortKey="Salzberg, Steven L" sort="Salzberg, Steven L" uniqKey="Salzberg S" first="Steven L" last="Salzberg">Steven L. Salzberg</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PubMed</idno>
<date when="2014">2014</date>
<idno type="RBID">pubmed:24580807</idno>
<idno type="pmid">24580807</idno>
<idno type="doi">10.1186/gb-2014-15-3-r46</idno>
<idno type="wicri:Area/PubMed/Corpus">001A39</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Corpus" wicri:corpus="PubMed">001A39</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Kraken: ultrafast metagenomic sequence classification using exact alignments.</title>
<author>
<name sortKey="Wood, Derrick E" sort="Wood, Derrick E" uniqKey="Wood D" first="Derrick E" last="Wood">Derrick E. Wood</name>
</author>
<author>
<name sortKey="Salzberg, Steven L" sort="Salzberg, Steven L" uniqKey="Salzberg S" first="Steven L" last="Salzberg">Steven L. Salzberg</name>
</author>
</analytic>
<series>
<title level="j">Genome biology</title>
<idno type="eISSN">1474-760X</idno>
<imprint>
<date when="2014" type="published">2014</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Archaea (classification)</term>
<term>Archaea (genetics)</term>
<term>Bacteria (classification)</term>
<term>Bacteria (genetics)</term>
<term>Classification</term>
<term>Humans</term>
<term>Metagenome</term>
<term>Metagenomics (methods)</term>
<term>Sensitivity and Specificity</term>
<term>Sequence Alignment (methods)</term>
<term>Sequence Analysis, DNA (methods)</term>
<term>Software</term>
</keywords>
<keywords scheme="MESH" qualifier="classification" xml:lang="en">
<term>Archaea</term>
<term>Bacteria</term>
</keywords>
<keywords scheme="MESH" qualifier="genetics" xml:lang="en">
<term>Archaea</term>
<term>Bacteria</term>
</keywords>
<keywords scheme="MESH" qualifier="methods" xml:lang="en">
<term>Metagenomics</term>
<term>Sequence Alignment</term>
<term>Sequence Analysis, DNA</term>
</keywords>
<keywords scheme="MESH" xml:lang="en">
<term>Classification</term>
<term>Humans</term>
<term>Metagenome</term>
<term>Sensitivity and Specificity</term>
<term>Software</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Kraken is an ultrafast and highly accurate program for assigning taxonomic labels to metagenomic DNA sequences. Previous programs designed for this task have been relatively slow and computationally expensive, forcing researchers to use faster abundance estimation programs, which only classify small subsets of metagenomic data. Using exact alignment of k-mers, Kraken achieves classification accuracy comparable to the fastest BLAST program. In its fastest mode, Kraken classifies 100 base pair reads at a rate of over 4.1 million reads per minute, 909 times faster than Megablast and 11 times faster than the abundance estimation program MetaPhlAn. Kraken is available at http://ccb.jhu.edu/software/kraken/. </div>
</front>
</TEI>
<pubmed>
<MedlineCitation Status="MEDLINE" Owner="NLM">
<PMID Version="1">24580807</PMID>
<DateCompleted>
<Year>2015</Year>
<Month>03</Month>
<Day>30</Day>
</DateCompleted>
<DateRevised>
<Year>2019</Year>
<Month>12</Month>
<Day>10</Day>
</DateRevised>
<Article PubModel="Electronic">
<Journal>
<ISSN IssnType="Electronic">1474-760X</ISSN>
<JournalIssue CitedMedium="Internet">
<Volume>15</Volume>
<Issue>3</Issue>
<PubDate>
<Year>2014</Year>
<Month>Mar</Month>
<Day>03</Day>
</PubDate>
</JournalIssue>
<Title>Genome biology</Title>
<ISOAbbreviation>Genome Biol.</ISOAbbreviation>
</Journal>
<ArticleTitle>Kraken: ultrafast metagenomic sequence classification using exact alignments.</ArticleTitle>
<Pagination>
<MedlinePgn>R46</MedlinePgn>
</Pagination>
<ELocationID EIdType="doi" ValidYN="Y">10.1186/gb-2014-15-3-r46</ELocationID>
<Abstract>
<AbstractText>Kraken is an ultrafast and highly accurate program for assigning taxonomic labels to metagenomic DNA sequences. Previous programs designed for this task have been relatively slow and computationally expensive, forcing researchers to use faster abundance estimation programs, which only classify small subsets of metagenomic data. Using exact alignment of k-mers, Kraken achieves classification accuracy comparable to the fastest BLAST program. In its fastest mode, Kraken classifies 100 base pair reads at a rate of over 4.1 million reads per minute, 909 times faster than Megablast and 11 times faster than the abundance estimation program MetaPhlAn. Kraken is available at http://ccb.jhu.edu/software/kraken/. </AbstractText>
</Abstract>
<AuthorList CompleteYN="Y">
<Author ValidYN="Y">
<LastName>Wood</LastName>
<ForeName>Derrick E</ForeName>
<Initials>DE</Initials>
</Author>
<Author ValidYN="Y">
<LastName>Salzberg</LastName>
<ForeName>Steven L</ForeName>
<Initials>SL</Initials>
</Author>
</AuthorList>
<Language>eng</Language>
<GrantList CompleteYN="Y">
<Grant>
<GrantID>R01 GM083873</GrantID>
<Acronym>GM</Acronym>
<Agency>NIGMS NIH HHS</Agency>
<Country>United States</Country>
</Grant>
<Grant>
<GrantID>R01 HG006677</GrantID>
<Acronym>HG</Acronym>
<Agency>NHGRI NIH HHS</Agency>
<Country>United States</Country>
</Grant>
</GrantList>
<PublicationTypeList>
<PublicationType UI="D003160">Comparative Study</PublicationType>
<PublicationType UI="D023362">Evaluation Study</PublicationType>
<PublicationType UI="D016428">Journal Article</PublicationType>
<PublicationType UI="D052061">Research Support, N.I.H., Extramural</PublicationType>
</PublicationTypeList>
<ArticleDate DateType="Electronic">
<Year>2014</Year>
<Month>03</Month>
<Day>03</Day>
</ArticleDate>
</Article>
<MedlineJournalInfo>
<Country>England</Country>
<MedlineTA>Genome Biol</MedlineTA>
<NlmUniqueID>100960660</NlmUniqueID>
<ISSNLinking>1474-7596</ISSNLinking>
</MedlineJournalInfo>
<CitationSubset>IM</CitationSubset>
<MeshHeadingList>
<MeshHeading>
<DescriptorName UI="D001105" MajorTopicYN="N">Archaea</DescriptorName>
<QualifierName UI="Q000145" MajorTopicYN="N">classification</QualifierName>
<QualifierName UI="Q000235" MajorTopicYN="N">genetics</QualifierName>
</MeshHeading>
<MeshHeading>
<DescriptorName UI="D001419" MajorTopicYN="N">Bacteria</DescriptorName>
<QualifierName UI="Q000145" MajorTopicYN="N">classification</QualifierName>
<QualifierName UI="Q000235" MajorTopicYN="N">genetics</QualifierName>
</MeshHeading>
<MeshHeading>
<DescriptorName UI="D002965" MajorTopicYN="N">Classification</DescriptorName>
</MeshHeading>
<MeshHeading>
<DescriptorName UI="D006801" MajorTopicYN="N">Humans</DescriptorName>
</MeshHeading>
<MeshHeading>
<DescriptorName UI="D054892" MajorTopicYN="N">Metagenome</DescriptorName>
</MeshHeading>
<MeshHeading>
<DescriptorName UI="D056186" MajorTopicYN="N">Metagenomics</DescriptorName>
<QualifierName UI="Q000379" MajorTopicYN="Y">methods</QualifierName>
</MeshHeading>
<MeshHeading>
<DescriptorName UI="D012680" MajorTopicYN="N">Sensitivity and Specificity</DescriptorName>
</MeshHeading>
<MeshHeading>
<DescriptorName UI="D016415" MajorTopicYN="N">Sequence Alignment</DescriptorName>
<QualifierName UI="Q000379" MajorTopicYN="Y">methods</QualifierName>
</MeshHeading>
<MeshHeading>
<DescriptorName UI="D017422" MajorTopicYN="N">Sequence Analysis, DNA</DescriptorName>
<QualifierName UI="Q000379" MajorTopicYN="Y">methods</QualifierName>
</MeshHeading>
<MeshHeading>
<DescriptorName UI="D012984" MajorTopicYN="Y">Software</DescriptorName>
</MeshHeading>
</MeshHeadingList>
</MedlineCitation>
<PubmedData>
<History>
<PubMedPubDate PubStatus="received">
<Year>2013</Year>
<Month>11</Month>
<Day>17</Day>
</PubMedPubDate>
<PubMedPubDate PubStatus="accepted">
<Year>2014</Year>
<Month>03</Month>
<Day>03</Day>
</PubMedPubDate>
<PubMedPubDate PubStatus="entrez">
<Year>2014</Year>
<Month>3</Month>
<Day>4</Day>
<Hour>6</Hour>
<Minute>0</Minute>
</PubMedPubDate>
<PubMedPubDate PubStatus="pubmed">
<Year>2014</Year>
<Month>3</Month>
<Day>4</Day>
<Hour>6</Hour>
<Minute>0</Minute>
</PubMedPubDate>
<PubMedPubDate PubStatus="medline">
<Year>2015</Year>
<Month>3</Month>
<Day>31</Day>
<Hour>6</Hour>
<Minute>0</Minute>
</PubMedPubDate>
</History>
<PublicationStatus>epublish</PublicationStatus>
<ArticleIdList>
<ArticleId IdType="pubmed">24580807</ArticleId>
<ArticleId IdType="pii">gb-2014-15-3-r46</ArticleId>
<ArticleId IdType="doi">10.1186/gb-2014-15-3-r46</ArticleId>
<ArticleId IdType="pmc">PMC4053813</ArticleId>
</ArticleIdList>
<ReferenceList>
<Reference>
<Citation>Nat Methods. 2011 May;8(5):367</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">21527926</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Bioinformatics. 2013 Sep 15;29(18):2253-60</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">23828782</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Mol Oral Microbiol. 2012 Oct;27(5):362-72</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">22958385</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Bioinformatics. 2013 Nov 1;29(21):2669-77</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">23990416</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Bioinformatics. 2011 Mar 15;27(6):764-70</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">21217122</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Nat Methods. 2012 Aug;9(8):811-4</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">22688413</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Antimicrob Agents Chemother. 1993 Apr;37(4):804-9</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">8494378</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>BMC Bioinformatics. 2009;10:421</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">20003500</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Nature. 2004 Mar 4;428(6978):37-43</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">14961025</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Bioinformatics. 2004 Dec 12;20(18):3363-9</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">15256412</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Genome Res. 2007 Mar;17(3):377-86</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">17255551</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>J Mol Biol. 1990 Oct 5;215(3):403-10</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">2231712</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Adv Bioinformatics. 2008;2008:205969</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">19956701</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Nat Methods. 2009 Sep;6(9):673-6</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">19648916</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Science. 2004 Apr 2;304(5667):66-74</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">15001713</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Bioinformatics. 2013 Jul 15;29(14):1718-25</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">23665771</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Oral Microbiol Immunol. 1994 Oct;9(5):310-4</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">7808775</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Genome Biol. 2013;14(1):R2</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">23320958</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>BMC Genomics. 2011;12 Suppl 2:S4</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">21989143</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Nat Methods. 2007 Jun;4(6):495-500</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">17468765</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Nucleic Acids Res. 2012 Jan;40(Database issue):D130-5</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">22121212</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>BMC Bioinformatics. 2011;12:385</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">21961884</ArticleId>
</ArticleIdList>
</Reference>
</ReferenceList>
</PubmedData>
</pubmed>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Sante/explor/MersV1/Data/PubMed/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001A39 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/PubMed/Corpus/biblio.hfd -nk 001A39 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Sante
   |area=    MersV1
   |flux=    PubMed
   |étape=   Corpus
   |type=    RBID
   |clé=     pubmed:24580807
   |texte=   Kraken: ultrafast metagenomic sequence classification using exact alignments.
}}

Pour générer des pages wiki

HfdIndexSelect -h $EXPLOR_AREA/Data/PubMed/Corpus/RBID.i   -Sk "pubmed:24580807" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/PubMed/Corpus/biblio.hfd   \
       | NlmPubMed2Wicri -a MersV1 

Wicri

This area was generated with Dilib version V0.6.33.
Data generation: Mon Apr 20 23:26:43 2020. Site generation: Sat Mar 27 09:06:09 2021