MyTaxa: an advanced taxonomic classifier for genomic and metagenomic sequences
Identifieur interne : 000581 ( Pmc/Curation ); précédent : 000580; suivant : 000582MyTaxa: an advanced taxonomic classifier for genomic and metagenomic sequences
Auteurs : Chengwei Luo [États-Unis] ; Luis M. Rodriguez-R [États-Unis] ; Konstantinos T. Konstantinidis [États-Unis]Source :
- Nucleic Acids Research [ 0305-1048 ] ; 2014.
Abstract
Determining the taxonomic affiliation of sequences assembled from metagenomes remains a major bottleneck that affects research across the fields of environmental, clinical and evolutionary microbiology. Here, we introduce MyTaxa, a homology-based bioinformatics framework to classify metagenomic and genomic sequences with unprecedented accuracy. The distinguishing aspect of MyTaxa is that it employs all genes present in an unknown sequence as classifiers, weighting each gene based on its (predetermined) classifying power at a given taxonomic level and frequency of horizontal gene transfer. MyTaxa also implements a novel classification scheme based on the genome-aggregate average amino acid identity concept to determine the degree of novelty of sequences representing uncharacterized taxa, i.e. whether they represent novel species, genera or phyla. Application of MyTaxa on
Url:
DOI: 10.1093/nar/gku169
PubMed: 24589583
PubMed Central: 4005636
Links toward previous steps (curation, corpus...)
- to stream Pmc, to step Corpus: Pour aller vers cette notice dans l'étape Curation :000581
Links to Exploration step
PMC:4005636Le document en format XML
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en">MyTaxa: an advanced taxonomic classifier for genomic and metagenomic sequences</title>
<author><name sortKey="Luo, Chengwei" sort="Luo, Chengwei" uniqKey="Luo C" first="Chengwei" last="Luo">Chengwei Luo</name>
<affiliation wicri:level="1"><nlm:aff wicri:cut=" and" id="gku169-AFF1">Centre for Bioinformatics and Computational Genomics, and School of Biology, Georgia Institute of Technology, Atlanta, GA 30332, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Centre for Bioinformatics and Computational Genomics, and School of Biology, Georgia Institute of Technology, Atlanta, GA 30332</wicri:regionArea>
</affiliation>
<affiliation wicri:level="1"><nlm:aff id="gku169-AFF1">School of Civil and Environmental Engineering, Georgia Institute of Technology, Atlanta, GA 30332, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>School of Civil and Environmental Engineering, Georgia Institute of Technology, Atlanta, GA 30332</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Rodriguez R, Luis M" sort="Rodriguez R, Luis M" uniqKey="Rodriguez R L" first="Luis M." last="Rodriguez-R">Luis M. Rodriguez-R</name>
<affiliation wicri:level="1"><nlm:aff wicri:cut=" and" id="gku169-AFF1">Centre for Bioinformatics and Computational Genomics, and School of Biology, Georgia Institute of Technology, Atlanta, GA 30332, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Centre for Bioinformatics and Computational Genomics, and School of Biology, Georgia Institute of Technology, Atlanta, GA 30332</wicri:regionArea>
</affiliation>
<affiliation wicri:level="1"><nlm:aff id="gku169-AFF1">School of Civil and Environmental Engineering, Georgia Institute of Technology, Atlanta, GA 30332, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>School of Civil and Environmental Engineering, Georgia Institute of Technology, Atlanta, GA 30332</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Konstantinidis, Konstantinos T" sort="Konstantinidis, Konstantinos T" uniqKey="Konstantinidis K" first="Konstantinos T." last="Konstantinidis">Konstantinos T. Konstantinidis</name>
<affiliation wicri:level="1"><nlm:aff wicri:cut=" and" id="gku169-AFF1">Centre for Bioinformatics and Computational Genomics, and School of Biology, Georgia Institute of Technology, Atlanta, GA 30332, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Centre for Bioinformatics and Computational Genomics, and School of Biology, Georgia Institute of Technology, Atlanta, GA 30332</wicri:regionArea>
</affiliation>
<affiliation wicri:level="1"><nlm:aff id="gku169-AFF1">School of Civil and Environmental Engineering, Georgia Institute of Technology, Atlanta, GA 30332, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>School of Civil and Environmental Engineering, Georgia Institute of Technology, Atlanta, GA 30332</wicri:regionArea>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">PMC</idno>
<idno type="pmid">24589583</idno>
<idno type="pmc">4005636</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4005636</idno>
<idno type="RBID">PMC:4005636</idno>
<idno type="doi">10.1093/nar/gku169</idno>
<date when="2014">2014</date>
<idno type="wicri:Area/Pmc/Corpus">000581</idno>
<idno type="wicri:Area/Pmc/Curation">000581</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a" type="main">MyTaxa: an advanced taxonomic classifier for genomic and metagenomic sequences</title>
<author><name sortKey="Luo, Chengwei" sort="Luo, Chengwei" uniqKey="Luo C" first="Chengwei" last="Luo">Chengwei Luo</name>
<affiliation wicri:level="1"><nlm:aff wicri:cut=" and" id="gku169-AFF1">Centre for Bioinformatics and Computational Genomics, and School of Biology, Georgia Institute of Technology, Atlanta, GA 30332, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Centre for Bioinformatics and Computational Genomics, and School of Biology, Georgia Institute of Technology, Atlanta, GA 30332</wicri:regionArea>
</affiliation>
<affiliation wicri:level="1"><nlm:aff id="gku169-AFF1">School of Civil and Environmental Engineering, Georgia Institute of Technology, Atlanta, GA 30332, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>School of Civil and Environmental Engineering, Georgia Institute of Technology, Atlanta, GA 30332</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Rodriguez R, Luis M" sort="Rodriguez R, Luis M" uniqKey="Rodriguez R L" first="Luis M." last="Rodriguez-R">Luis M. Rodriguez-R</name>
<affiliation wicri:level="1"><nlm:aff wicri:cut=" and" id="gku169-AFF1">Centre for Bioinformatics and Computational Genomics, and School of Biology, Georgia Institute of Technology, Atlanta, GA 30332, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Centre for Bioinformatics and Computational Genomics, and School of Biology, Georgia Institute of Technology, Atlanta, GA 30332</wicri:regionArea>
</affiliation>
<affiliation wicri:level="1"><nlm:aff id="gku169-AFF1">School of Civil and Environmental Engineering, Georgia Institute of Technology, Atlanta, GA 30332, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>School of Civil and Environmental Engineering, Georgia Institute of Technology, Atlanta, GA 30332</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Konstantinidis, Konstantinos T" sort="Konstantinidis, Konstantinos T" uniqKey="Konstantinidis K" first="Konstantinos T." last="Konstantinidis">Konstantinos T. Konstantinidis</name>
<affiliation wicri:level="1"><nlm:aff wicri:cut=" and" id="gku169-AFF1">Centre for Bioinformatics and Computational Genomics, and School of Biology, Georgia Institute of Technology, Atlanta, GA 30332, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Centre for Bioinformatics and Computational Genomics, and School of Biology, Georgia Institute of Technology, Atlanta, GA 30332</wicri:regionArea>
</affiliation>
<affiliation wicri:level="1"><nlm:aff id="gku169-AFF1">School of Civil and Environmental Engineering, Georgia Institute of Technology, Atlanta, GA 30332, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>School of Civil and Environmental Engineering, Georgia Institute of Technology, Atlanta, GA 30332</wicri:regionArea>
</affiliation>
</author>
</analytic>
<series><title level="j">Nucleic Acids Research</title>
<idno type="ISSN">0305-1048</idno>
<idno type="eISSN">1362-4962</idno>
<imprint><date when="2014">2014</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass></textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en"><p>Determining the taxonomic affiliation of sequences assembled from metagenomes remains a major bottleneck that affects research across the fields of environmental, clinical and evolutionary microbiology. Here, we introduce MyTaxa, a homology-based bioinformatics framework to classify metagenomic and genomic sequences with unprecedented accuracy. The distinguishing aspect of MyTaxa is that it employs all genes present in an unknown sequence as classifiers, weighting each gene based on its (predetermined) classifying power at a given taxonomic level and frequency of horizontal gene transfer. MyTaxa also implements a novel classification scheme based on the genome-aggregate average amino acid identity concept to determine the degree of novelty of sequences representing uncharacterized taxa, i.e. whether they represent novel species, genera or phyla. Application of MyTaxa on <italic>in silico</italic>
generated (mock) and real metagenomes of varied read length (100–2000 bp) revealed that it correctly classified at least 5% more sequences than any other tool. The analysis also showed that ∼10% of the assembled sequences from human gut metagenomes represent novel species with no sequenced representatives, several of which were highly abundant <italic>in situ</italic>
such as members of the <italic>Prevotella</italic>
genus. Thus, MyTaxa can find several important applications in microbial identification and diversity studies.</p>
</div>
</front>
<back><div1 type="bibliography"><listBibl><biblStruct><analytic><author><name sortKey="Wrighton, Kc" uniqKey="Wrighton K">KC Wrighton</name>
</author>
<author><name sortKey="Thomas, Bc" uniqKey="Thomas B">BC Thomas</name>
</author>
<author><name sortKey="Sharon, I" uniqKey="Sharon I">I Sharon</name>
</author>
<author><name sortKey="Miller, Cs" uniqKey="Miller C">CS Miller</name>
</author>
<author><name sortKey="Castelle, Cj" uniqKey="Castelle C">CJ Castelle</name>
</author>
<author><name sortKey="Verberkmoes, Nc" uniqKey="Verberkmoes N">NC VerBerkmoes</name>
</author>
<author><name sortKey="Wilkins, Mj" uniqKey="Wilkins M">MJ Wilkins</name>
</author>
<author><name sortKey="Hettich, Rl" uniqKey="Hettich R">RL Hettich</name>
</author>
<author><name sortKey="Lipton, Ms" uniqKey="Lipton M">MS Lipton</name>
</author>
<author><name sortKey="Williams, Kh" uniqKey="Williams K">KH Williams</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Arumugam, M" uniqKey="Arumugam M">M Arumugam</name>
</author>
<author><name sortKey="Raes, J" uniqKey="Raes J">J Raes</name>
</author>
<author><name sortKey="Pelletier, E" uniqKey="Pelletier E">E Pelletier</name>
</author>
<author><name sortKey="Le Paslier, D" uniqKey="Le Paslier D">D Le Paslier</name>
</author>
<author><name sortKey="Yamada, T" uniqKey="Yamada T">T Yamada</name>
</author>
<author><name sortKey="Mende, Dr" uniqKey="Mende D">DR Mende</name>
</author>
<author><name sortKey="Fernandes, Gr" uniqKey="Fernandes G">GR Fernandes</name>
</author>
<author><name sortKey="Tap, J" uniqKey="Tap J">J Tap</name>
</author>
<author><name sortKey="Bruls, T" uniqKey="Bruls T">T Bruls</name>
</author>
<author><name sortKey="Batto, Jm" uniqKey="Batto J">JM Batto</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Iverson, V" uniqKey="Iverson V">V Iverson</name>
</author>
<author><name sortKey="Morris, Rm" uniqKey="Morris R">RM Morris</name>
</author>
<author><name sortKey="Frazar, Cd" uniqKey="Frazar C">CD Frazar</name>
</author>
<author><name sortKey="Berthiaume, Ct" uniqKey="Berthiaume C">CT Berthiaume</name>
</author>
<author><name sortKey="Morales, Rl" uniqKey="Morales R">RL Morales</name>
</author>
<author><name sortKey="Armbrust, Ev" uniqKey="Armbrust E">EV Armbrust</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Glass, Em" uniqKey="Glass E">EM Glass</name>
</author>
<author><name sortKey="Wilkening, J" uniqKey="Wilkening J">J Wilkening</name>
</author>
<author><name sortKey="Wilke, A" uniqKey="Wilke A">A Wilke</name>
</author>
<author><name sortKey="Antonopoulos, D" uniqKey="Antonopoulos D">D Antonopoulos</name>
</author>
<author><name sortKey="Meyer, F" uniqKey="Meyer F">F Meyer</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Sun, S" uniqKey="Sun S">S Sun</name>
</author>
<author><name sortKey="Chen, J" uniqKey="Chen J">J Chen</name>
</author>
<author><name sortKey="Li, W" uniqKey="Li W">W Li</name>
</author>
<author><name sortKey="Altintas, I" uniqKey="Altintas I">I Altintas</name>
</author>
<author><name sortKey="Lin, A" uniqKey="Lin A">A Lin</name>
</author>
<author><name sortKey="Peltier, S" uniqKey="Peltier S">S Peltier</name>
</author>
<author><name sortKey="Stocks, K" uniqKey="Stocks K">K Stocks</name>
</author>
<author><name sortKey="Allen, Ee" uniqKey="Allen E">EE Allen</name>
</author>
<author><name sortKey="Ellisman, M" uniqKey="Ellisman M">M Ellisman</name>
</author>
<author><name sortKey="Grethe, J" uniqKey="Grethe J">J Grethe</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Amann, Ri" uniqKey="Amann R">RI Amann</name>
</author>
<author><name sortKey="Ludwig, W" uniqKey="Ludwig W">W Ludwig</name>
</author>
<author><name sortKey="Schleifer, Kh" uniqKey="Schleifer K">KH Schleifer</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Stepanauskas, R" uniqKey="Stepanauskas R">R Stepanauskas</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Desantis, Tz" uniqKey="Desantis T">TZ DeSantis</name>
</author>
<author><name sortKey="Hugenholtz, P" uniqKey="Hugenholtz P">P Hugenholtz</name>
</author>
<author><name sortKey="Larsen, N" uniqKey="Larsen N">N Larsen</name>
</author>
<author><name sortKey="Rojas, M" uniqKey="Rojas M">M Rojas</name>
</author>
<author><name sortKey="Brodie, El" uniqKey="Brodie E">EL Brodie</name>
</author>
<author><name sortKey="Keller, K" uniqKey="Keller K">K Keller</name>
</author>
<author><name sortKey="Huber, T" uniqKey="Huber T">T Huber</name>
</author>
<author><name sortKey="Dalevi, D" uniqKey="Dalevi D">D Dalevi</name>
</author>
<author><name sortKey="Hu, P" uniqKey="Hu P">P Hu</name>
</author>
<author><name sortKey="Andersen, Gl" uniqKey="Andersen G">GL Andersen</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Cole, Jr" uniqKey="Cole J">JR Cole</name>
</author>
<author><name sortKey="Wang, Q" uniqKey="Wang Q">Q Wang</name>
</author>
<author><name sortKey="Cardenas, E" uniqKey="Cardenas E">E Cardenas</name>
</author>
<author><name sortKey="Fish, J" uniqKey="Fish J">J Fish</name>
</author>
<author><name sortKey="Chai, B" uniqKey="Chai B">B Chai</name>
</author>
<author><name sortKey="Farris, Rj" uniqKey="Farris R">RJ Farris</name>
</author>
<author><name sortKey="Kulam Syed Mohideen, As" uniqKey="Kulam Syed Mohideen A">AS Kulam-Syed-Mohideen</name>
</author>
<author><name sortKey="Mcgarrell, Dm" uniqKey="Mcgarrell D">DM McGarrell</name>
</author>
<author><name sortKey="Marsh, T" uniqKey="Marsh T">T Marsh</name>
</author>
<author><name sortKey="Garrity, Gm" uniqKey="Garrity G">GM Garrity</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Woyke, T" uniqKey="Woyke T">T Woyke</name>
</author>
<author><name sortKey="Xie, G" uniqKey="Xie G">G Xie</name>
</author>
<author><name sortKey="Copeland, A" uniqKey="Copeland A">A Copeland</name>
</author>
<author><name sortKey="Gonzalez, Jm" uniqKey="Gonzalez J">JM Gonzalez</name>
</author>
<author><name sortKey="Han, C" uniqKey="Han C">C Han</name>
</author>
<author><name sortKey="Kiss, H" uniqKey="Kiss H">H Kiss</name>
</author>
<author><name sortKey="Saw, Jh" uniqKey="Saw J">JH Saw</name>
</author>
<author><name sortKey="Senin, P" uniqKey="Senin P">P Senin</name>
</author>
<author><name sortKey="Yang, C" uniqKey="Yang C">C Yang</name>
</author>
<author><name sortKey="Chatterji, S" uniqKey="Chatterji S">S Chatterji</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Konstantinidis, Kt" uniqKey="Konstantinidis K">KT Konstantinidis</name>
</author>
<author><name sortKey="Tiedje, Jm" uniqKey="Tiedje J">JM Tiedje</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Rosen, Gl" uniqKey="Rosen G">GL Rosen</name>
</author>
<author><name sortKey="Reichenberger, Er" uniqKey="Reichenberger E">ER Reichenberger</name>
</author>
<author><name sortKey="Rosenfeld, Am" uniqKey="Rosenfeld A">AM Rosenfeld</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Patil, Kr" uniqKey="Patil K">KR Patil</name>
</author>
<author><name sortKey="Haider, P" uniqKey="Haider P">P Haider</name>
</author>
<author><name sortKey="Pope, Pb" uniqKey="Pope P">PB Pope</name>
</author>
<author><name sortKey="Turnbaugh, Pj" uniqKey="Turnbaugh P">PJ Turnbaugh</name>
</author>
<author><name sortKey="Morrison, M" uniqKey="Morrison M">M Morrison</name>
</author>
<author><name sortKey="Scheffer, T" uniqKey="Scheffer T">T Scheffer</name>
</author>
<author><name sortKey="Mchardy, Ac" uniqKey="Mchardy A">AC McHardy</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Huson, Dh" uniqKey="Huson D">DH Huson</name>
</author>
<author><name sortKey="Mitra, S" uniqKey="Mitra S">S Mitra</name>
</author>
<author><name sortKey="Ruscheweyh, Hj" uniqKey="Ruscheweyh H">HJ Ruscheweyh</name>
</author>
<author><name sortKey="Weber, N" uniqKey="Weber N">N Weber</name>
</author>
<author><name sortKey="Schuster, Sc" uniqKey="Schuster S">SC Schuster</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Monzoorul Haque, M" uniqKey="Monzoorul Haque M">M Monzoorul Haque</name>
</author>
<author><name sortKey="Ghosh, Ts" uniqKey="Ghosh T">TS Ghosh</name>
</author>
<author><name sortKey="Komanduri, D" uniqKey="Komanduri D">D Komanduri</name>
</author>
<author><name sortKey="Mande, Ss" uniqKey="Mande S">SS Mande</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Krause, L" uniqKey="Krause L">L Krause</name>
</author>
<author><name sortKey="Diaz, Nn" uniqKey="Diaz N">NN Diaz</name>
</author>
<author><name sortKey="Goesmann, A" uniqKey="Goesmann A">A Goesmann</name>
</author>
<author><name sortKey="Kelley, S" uniqKey="Kelley S">S Kelley</name>
</author>
<author><name sortKey="Nattkemper, Tw" uniqKey="Nattkemper T">TW Nattkemper</name>
</author>
<author><name sortKey="Rohwer, F" uniqKey="Rohwer F">F Rohwer</name>
</author>
<author><name sortKey="Edwards, Ra" uniqKey="Edwards R">RA Edwards</name>
</author>
<author><name sortKey="Stoye, J" uniqKey="Stoye J">J Stoye</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Altschul, Sf" uniqKey="Altschul S">SF Altschul</name>
</author>
<author><name sortKey="Madden, Tl" uniqKey="Madden T">TL Madden</name>
</author>
<author><name sortKey="Schaffer, Aa" uniqKey="Schaffer A">AA Schaffer</name>
</author>
<author><name sortKey="Zhang, J" uniqKey="Zhang J">J Zhang</name>
</author>
<author><name sortKey="Zhang, Z" uniqKey="Zhang Z">Z Zhang</name>
</author>
<author><name sortKey="Miller, W" uniqKey="Miller W">W Miller</name>
</author>
<author><name sortKey="Lipman, Dj" uniqKey="Lipman D">DJ Lipman</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Sonnhammer, El" uniqKey="Sonnhammer E">EL Sonnhammer</name>
</author>
<author><name sortKey="Eddy, Sr" uniqKey="Eddy S">SR Eddy</name>
</author>
<author><name sortKey="Birney, E" uniqKey="Birney E">E Birney</name>
</author>
<author><name sortKey="Bateman, A" uniqKey="Bateman A">A Bateman</name>
</author>
<author><name sortKey="Durbin, R" uniqKey="Durbin R">R Durbin</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Brady, A" uniqKey="Brady A">A Brady</name>
</author>
<author><name sortKey="Salzberg, L" uniqKey="Salzberg L">L Salzberg</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Zhu, W" uniqKey="Zhu W">W Zhu</name>
</author>
<author><name sortKey="Lomsadze, A" uniqKey="Lomsadze A">A Lomsadze</name>
</author>
<author><name sortKey="Borodovsky, M" uniqKey="Borodovsky M">M Borodovsky</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Hyatt, D" uniqKey="Hyatt D">D Hyatt</name>
</author>
<author><name sortKey="Locascio, Pf" uniqKey="Locascio P">PF LoCascio</name>
</author>
<author><name sortKey="Hauser, Lj" uniqKey="Hauser L">LJ Hauser</name>
</author>
<author><name sortKey="Uberbacher, Ec" uniqKey="Uberbacher E">EC Uberbacher</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Rho, M" uniqKey="Rho M">M Rho</name>
</author>
<author><name sortKey="Tang, H" uniqKey="Tang H">H Tang</name>
</author>
<author><name sortKey="Ye, Y" uniqKey="Ye Y">Y Ye</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Edgar, Rc" uniqKey="Edgar R">RC Edgar</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Kent, Wj" uniqKey="Kent W">WJ Kent</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Ondov, Bd" uniqKey="Ondov B">BD Ondov</name>
</author>
<author><name sortKey="Bergman, Nh" uniqKey="Bergman N">NH Bergman</name>
</author>
<author><name sortKey="Phillippy, Am" uniqKey="Phillippy A">AM Phillippy</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Konstantinidis, Kt" uniqKey="Konstantinidis K">KT Konstantinidis</name>
</author>
<author><name sortKey="Tiedje, Jm" uniqKey="Tiedje J">JM Tiedje</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Konstantinidis, Kt" uniqKey="Konstantinidis K">KT Konstantinidis</name>
</author>
<author><name sortKey="Tiedje, Jm" uniqKey="Tiedje J">JM Tiedje</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Bowman, Aw" uniqKey="Bowman A">AW Bowman</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Price, Mn" uniqKey="Price M">MN Price</name>
</author>
<author><name sortKey="Dehal, Ps" uniqKey="Dehal P">PS Dehal</name>
</author>
<author><name sortKey="Arkin, Ap" uniqKey="Arkin A">AP Arkin</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Edgar, Rc" uniqKey="Edgar R">RC Edgar</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Schluter, A" uniqKey="Schluter A">A Schluter</name>
</author>
<author><name sortKey="Bekel, T" uniqKey="Bekel T">T Bekel</name>
</author>
<author><name sortKey="Diaz, Nn" uniqKey="Diaz N">NN Diaz</name>
</author>
<author><name sortKey="Dondrup, M" uniqKey="Dondrup M">M Dondrup</name>
</author>
<author><name sortKey="Eichenlaub, R" uniqKey="Eichenlaub R">R Eichenlaub</name>
</author>
<author><name sortKey="Gartemann, Kh" uniqKey="Gartemann K">KH Gartemann</name>
</author>
<author><name sortKey="Krahn, I" uniqKey="Krahn I">I Krahn</name>
</author>
<author><name sortKey="Krause, L" uniqKey="Krause L">L Krause</name>
</author>
<author><name sortKey="Kromeke, H" uniqKey="Kromeke H">H Kromeke</name>
</author>
<author><name sortKey="Kruse, O" uniqKey="Kruse O">O Kruse</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Konstantinidis, Kt" uniqKey="Konstantinidis K">KT Konstantinidis</name>
</author>
<author><name sortKey="Tiedje, Jm" uniqKey="Tiedje J">JM Tiedje</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Goris, J" uniqKey="Goris J">J Goris</name>
</author>
<author><name sortKey="Konstantinidis, Kt" uniqKey="Konstantinidis K">KT Konstantinidis</name>
</author>
<author><name sortKey="Klappenbach, Ja" uniqKey="Klappenbach J">JA Klappenbach</name>
</author>
<author><name sortKey="Coenye, T" uniqKey="Coenye T">T Coenye</name>
</author>
<author><name sortKey="Vandamme, P" uniqKey="Vandamme P">P Vandamme</name>
</author>
<author><name sortKey="Tiedje, Jm" uniqKey="Tiedje J">JM Tiedje</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Luo, C" uniqKey="Luo C">C Luo</name>
</author>
<author><name sortKey="Tsementzi, D" uniqKey="Tsementzi D">D Tsementzi</name>
</author>
<author><name sortKey="Kyrpides, N" uniqKey="Kyrpides N">N Kyrpides</name>
</author>
<author><name sortKey="Read, T" uniqKey="Read T">T Read</name>
</author>
<author><name sortKey="Konstantinidis, Kt" uniqKey="Konstantinidis K">KT Konstantinidis</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Mande, Ss" uniqKey="Mande S">SS Mande</name>
</author>
<author><name sortKey="Mohammed, Mh" uniqKey="Mohammed M">MH Mohammed</name>
</author>
<author><name sortKey="Ghosh, Ts" uniqKey="Ghosh T">TS Ghosh</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Konstantinidis, Kt" uniqKey="Konstantinidis K">KT Konstantinidis</name>
</author>
<author><name sortKey="Delong, Ef" uniqKey="Delong E">EF DeLong</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Linnaeus, C" uniqKey="Linnaeus C">C Linnaeus</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Sunagawa, S" uniqKey="Sunagawa S">S Sunagawa</name>
</author>
<author><name sortKey="Mende, Dr" uniqKey="Mende D">DR Mende</name>
</author>
<author><name sortKey="Zeller, G" uniqKey="Zeller G">G Zeller</name>
</author>
<author><name sortKey="Izquierdo Carrasco, F" uniqKey="Izquierdo Carrasco F">F Izquierdo-Carrasco</name>
</author>
<author><name sortKey="Berger, Sa" uniqKey="Berger S">SA Berger</name>
</author>
<author><name sortKey="Kultima, Jr" uniqKey="Kultima J">JR Kultima</name>
</author>
<author><name sortKey="Coelho, Lp" uniqKey="Coelho L">LP Coelho</name>
</author>
<author><name sortKey="Arumugam, M" uniqKey="Arumugam M">M Arumugam</name>
</author>
<author><name sortKey="Tap, J" uniqKey="Tap J">J Tap</name>
</author>
<author><name sortKey="Nielsen, Hb" uniqKey="Nielsen H">HB Nielsen</name>
</author>
</analytic>
</biblStruct>
</listBibl>
</div1>
</back>
</TEI>
<pmc article-type="research-article"><pmc-dir>properties open_access</pmc-dir>
<front><journal-meta><journal-id journal-id-type="nlm-ta">Nucleic Acids Res</journal-id>
<journal-id journal-id-type="iso-abbrev">Nucleic Acids Res</journal-id>
<journal-id journal-id-type="publisher-id">nar</journal-id>
<journal-id journal-id-type="hwp">nar</journal-id>
<journal-title-group><journal-title>Nucleic Acids Research</journal-title>
</journal-title-group>
<issn pub-type="ppub">0305-1048</issn>
<issn pub-type="epub">1362-4962</issn>
<publisher><publisher-name>Oxford University Press</publisher-name>
</publisher>
</journal-meta>
<article-meta><article-id pub-id-type="pmid">24589583</article-id>
<article-id pub-id-type="pmc">4005636</article-id>
<article-id pub-id-type="doi">10.1093/nar/gku169</article-id>
<article-id pub-id-type="publisher-id">gku169</article-id>
<article-categories><subj-group subj-group-type="heading"><subject>Methods Online</subject>
</subj-group>
</article-categories>
<title-group><article-title>MyTaxa: an advanced taxonomic classifier for genomic and metagenomic sequences</article-title>
</title-group>
<contrib-group><contrib contrib-type="author"><name><surname>Luo</surname>
<given-names>Chengwei</given-names>
</name>
<xref ref-type="aff" rid="gku169-AFF1"><sup>1</sup>
</xref>
<xref ref-type="aff" rid="gku169-AFF1"><sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author"><name><surname>Rodriguez-R</surname>
<given-names>Luis M.</given-names>
</name>
<xref ref-type="aff" rid="gku169-AFF1"><sup>1</sup>
</xref>
<xref ref-type="aff" rid="gku169-AFF1"><sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author"><name><surname>Konstantinidis</surname>
<given-names>Konstantinos T.</given-names>
</name>
<xref ref-type="aff" rid="gku169-AFF1"><sup>1</sup>
</xref>
<xref ref-type="aff" rid="gku169-AFF1"><sup>2</sup>
</xref>
<xref ref-type="corresp" rid="gku169-COR1">*</xref>
</contrib>
<aff id="gku169-AFF1"><sup>1</sup>
Centre for Bioinformatics and Computational Genomics, and School of Biology, Georgia Institute of Technology, Atlanta, GA 30332, USA and<sup>2</sup>
School of Civil and Environmental Engineering, Georgia Institute of Technology, Atlanta, GA 30332, USA</aff>
</contrib-group>
<author-notes><corresp id="gku169-COR1">*To whom correspondence should be addressed. Tel: <phone>+1 404 385 3628</phone>
; Fax: <fax>+1 404 894 8266</fax>
; Email: <email>kostas@ce.gatech.edu</email>
</corresp>
</author-notes>
<pub-date pub-type="ppub"><month>4</month>
<year>2014</year>
</pub-date>
<pub-date pub-type="epub"><day>3</day>
<month>3</month>
<year>2014</year>
</pub-date>
<pub-date pub-type="pmc-release"><day>3</day>
<month>3</month>
<year>2014</year>
</pub-date>
<pmc-comment> PMC Release delay is 0 months and 0 days and was based on the
. </pmc-comment>
<volume>42</volume>
<issue>8</issue>
<fpage>e73</fpage>
<lpage>e73</lpage>
<history><date date-type="received"><day>30</day>
<month>9</month>
<year>2013</year>
</date>
<date date-type="rev-recd"><day>9</day>
<month>2</month>
<year>2014</year>
</date>
<date date-type="accepted"><day>10</day>
<month>2</month>
<year>2014</year>
</date>
</history>
<permissions><copyright-statement>© The Author(s) 2014. Published by Oxford University Press.</copyright-statement>
<copyright-year>2014</copyright-year>
<license license-type="creative-commons" xlink:href="http://creativecommons.org/licenses/by/3.0/"><license-p><pmc-comment>CREATIVE COMMONS</pmc-comment>
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/licenses/by/3.0/">http://creativecommons.org/licenses/by/3.0/</ext-link>
), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
</license>
</permissions>
<abstract><p>Determining the taxonomic affiliation of sequences assembled from metagenomes remains a major bottleneck that affects research across the fields of environmental, clinical and evolutionary microbiology. Here, we introduce MyTaxa, a homology-based bioinformatics framework to classify metagenomic and genomic sequences with unprecedented accuracy. The distinguishing aspect of MyTaxa is that it employs all genes present in an unknown sequence as classifiers, weighting each gene based on its (predetermined) classifying power at a given taxonomic level and frequency of horizontal gene transfer. MyTaxa also implements a novel classification scheme based on the genome-aggregate average amino acid identity concept to determine the degree of novelty of sequences representing uncharacterized taxa, i.e. whether they represent novel species, genera or phyla. Application of MyTaxa on <italic>in silico</italic>
generated (mock) and real metagenomes of varied read length (100–2000 bp) revealed that it correctly classified at least 5% more sequences than any other tool. The analysis also showed that ∼10% of the assembled sequences from human gut metagenomes represent novel species with no sequenced representatives, several of which were highly abundant <italic>in situ</italic>
such as members of the <italic>Prevotella</italic>
genus. Thus, MyTaxa can find several important applications in microbial identification and diversity studies.</p>
</abstract>
<counts><page-count count="12"></page-count>
</counts>
</article-meta>
</front>
</pmc>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/CyberinfraV1/Data/Pmc/Curation
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000581 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Pmc/Curation/biblio.hfd -nk 000581 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Ticri/CIDE |area= CyberinfraV1 |flux= Pmc |étape= Curation |type= RBID |clé= PMC:4005636 |texte= MyTaxa: an advanced taxonomic classifier for genomic and metagenomic sequences }}
Pour générer des pages wiki
HfdIndexSelect -h $EXPLOR_AREA/Data/Pmc/Curation/RBID.i -Sk "pubmed:24589583" \ | HfdSelect -Kh $EXPLOR_AREA/Data/Pmc/Curation/biblio.hfd \ | NlmPubMed2Wicri -a CyberinfraV1
This area was generated with Dilib version V0.6.25. |