Metagenomics: Facts and Artifacts, and Computational Challenges*
Identifieur interne : 000147 ( Ncbi/Merge ); précédent : 000146; suivant : 000148Metagenomics: Facts and Artifacts, and Computational Challenges*
Auteurs : John C. Wooley [États-Unis] ; Yuzhen YeSource :
- Journal of computer science and technology [ 1000-9000 ] ; 2009.
Abstract
Metagenomics is the study of microbial communities sampled directly from their natural environment, without prior culturing. By enabling an analysis of populations including many (so-far) unculturable and often unknown microbes, metagenomics is revolutionizing the field of microbiology, and has excited researchers in many disciplines that could benefit from the study of environmental microbes, including those in ecology, environmental sciences, and biomedicine. Specific computational and statistical tools have been developed for metagenomic data analysis and comparison. New studies, however, have revealed various kinds of artifacts present in metagenomics data caused by limitations in the experimental protocols and/or inadequate data analysis procedures, which often lead to incorrect conclusions about a microbial community. Here, we review some of the artifacts, such as overestimation of species diversity and incorrect estimation of gene family frequencies, and discuss emerging computational approaches to address them. We also review potential challenges that metagenomics may encounter with the extensive application of next-generation sequencing (NGS) techniques.
Url:
DOI: 10.1007/s11390-010-9306-4
PubMed: 20648230
PubMed Central: 2905821
Links toward previous steps (curation, corpus...)
- to stream Pmc, to step Corpus: 000296
- to stream Pmc, to step Curation: 000296
- to stream Pmc, to step Checkpoint: 000629
Links to Exploration step
PMC:2905821Le document en format XML
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en">Metagenomics: Facts and Artifacts, and Computational Challenges*</title>
<author><name sortKey="Wooley, John C" sort="Wooley, John C" uniqKey="Wooley J" first="John C." last="Wooley">John C. Wooley</name>
<affiliation wicri:level="2"><nlm:aff id="A1"> Center for Research on BioSystems, Calit2, UC San Diego, La Jolla CA 92093</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<placeName><region type="state">Californie</region>
</placeName>
<wicri:cityArea> Center for Research on BioSystems, Calit2, UC San Diego</wicri:cityArea>
</affiliation>
</author>
<author><name sortKey="Ye, Yuzhen" sort="Ye, Yuzhen" uniqKey="Ye Y" first="Yuzhen" last="Ye">Yuzhen Ye</name>
<affiliation><nlm:aff id="A2"> School of Informatics and Computing, Indiana University, Bloomington, Indiana, 47408</nlm:aff>
<wicri:noCountry code="subfield">47408</wicri:noCountry>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">PMC</idno>
<idno type="pmid">20648230</idno>
<idno type="pmc">2905821</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2905821</idno>
<idno type="RBID">PMC:2905821</idno>
<idno type="doi">10.1007/s11390-010-9306-4</idno>
<date when="2009">2009</date>
<idno type="wicri:Area/Pmc/Corpus">000296</idno>
<idno type="wicri:Area/Pmc/Curation">000296</idno>
<idno type="wicri:Area/Pmc/Checkpoint">000629</idno>
<idno type="wicri:Area/Ncbi/Merge">000147</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a" type="main">Metagenomics: Facts and Artifacts, and Computational Challenges*</title>
<author><name sortKey="Wooley, John C" sort="Wooley, John C" uniqKey="Wooley J" first="John C." last="Wooley">John C. Wooley</name>
<affiliation wicri:level="2"><nlm:aff id="A1"> Center for Research on BioSystems, Calit2, UC San Diego, La Jolla CA 92093</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<placeName><region type="state">Californie</region>
</placeName>
<wicri:cityArea> Center for Research on BioSystems, Calit2, UC San Diego</wicri:cityArea>
</affiliation>
</author>
<author><name sortKey="Ye, Yuzhen" sort="Ye, Yuzhen" uniqKey="Ye Y" first="Yuzhen" last="Ye">Yuzhen Ye</name>
<affiliation><nlm:aff id="A2"> School of Informatics and Computing, Indiana University, Bloomington, Indiana, 47408</nlm:aff>
<wicri:noCountry code="subfield">47408</wicri:noCountry>
</affiliation>
</author>
</analytic>
<series><title level="j">Journal of computer science and technology</title>
<idno type="ISSN">1000-9000</idno>
<idno type="eISSN">1860-4749</idno>
<imprint><date when="2009">2009</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass></textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en"><p id="P1">Metagenomics is the study of microbial communities sampled directly from their natural environment, without prior culturing. By enabling an analysis of populations including many (so-far) unculturable and often unknown microbes, metagenomics is revolutionizing the field of microbiology, and has excited researchers in many disciplines that could benefit from the study of environmental microbes, including those in ecology, environmental sciences, and biomedicine. Specific computational and statistical tools have been developed for metagenomic data analysis and comparison. New studies, however, have revealed various kinds of artifacts present in metagenomics data caused by limitations in the experimental protocols and/or inadequate data analysis procedures, which often lead to incorrect conclusions about a microbial community. Here, we review some of the artifacts, such as overestimation of species diversity and incorrect estimation of gene family frequencies, and discuss emerging computational approaches to address them. We also review potential challenges that metagenomics may encounter with the extensive application of next-generation sequencing (NGS) techniques.</p>
</div>
</front>
</TEI>
<pmc article-type="research-article" xml:lang="EN"><pmc-comment>The publisher of this article does not allow downloading of the full text in XML form.</pmc-comment>
<pmc-dir>properties manuscript</pmc-dir>
<front><journal-meta><journal-id journal-id-type="nlm-journal-id">101530317</journal-id>
<journal-id journal-id-type="pubmed-jr-id">37826</journal-id>
<journal-id journal-id-type="nlm-ta">J Comput Sci Technol</journal-id>
<journal-title>Journal of computer science and technology</journal-title>
<issn pub-type="ppub">1000-9000</issn>
<issn pub-type="epub">1860-4749</issn>
</journal-meta>
<article-meta><article-id pub-id-type="pmid">20648230</article-id>
<article-id pub-id-type="pmc">2905821</article-id>
<article-id pub-id-type="doi">10.1007/s11390-010-9306-4</article-id>
<article-id pub-id-type="manuscript">NIHMS175389</article-id>
<article-categories><subj-group subj-group-type="heading"><subject>Article</subject>
</subj-group>
</article-categories>
<title-group><article-title>Metagenomics: Facts and Artifacts, and Computational Challenges*</article-title>
</title-group>
<contrib-group><contrib contrib-type="author"><name><surname>Wooley</surname>
<given-names>John C.</given-names>
</name>
<xref rid="A1" ref-type="aff">1</xref>
<email>jwooley@ucsd.edu</email>
</contrib>
<contrib contrib-type="author"><name><surname>Ye</surname>
<given-names>Yuzhen</given-names>
</name>
<xref rid="A2" ref-type="aff">2</xref>
<email>yye@indiana.edu</email>
</contrib>
</contrib-group>
<aff id="A1"><label>1</label>
Center for Research on BioSystems, Calit2, UC San Diego, La Jolla CA 92093</aff>
<aff id="A2"><label>2</label>
School of Informatics and Computing, Indiana University, Bloomington, Indiana, 47408</aff>
<pub-date pub-type="nihms-submitted"><day>1</day>
<month>7</month>
<year>2010</year>
</pub-date>
<pub-date pub-type="ppub"><month>1</month>
<year>2009</year>
</pub-date>
<pub-date pub-type="pmc-release"><day>19</day>
<month>7</month>
<year>2010</year>
</pub-date>
<volume>25</volume>
<issue>1</issue>
<fpage>71</fpage>
<lpage>81</lpage>
<abstract><p id="P1">Metagenomics is the study of microbial communities sampled directly from their natural environment, without prior culturing. By enabling an analysis of populations including many (so-far) unculturable and often unknown microbes, metagenomics is revolutionizing the field of microbiology, and has excited researchers in many disciplines that could benefit from the study of environmental microbes, including those in ecology, environmental sciences, and biomedicine. Specific computational and statistical tools have been developed for metagenomic data analysis and comparison. New studies, however, have revealed various kinds of artifacts present in metagenomics data caused by limitations in the experimental protocols and/or inadequate data analysis procedures, which often lead to incorrect conclusions about a microbial community. Here, we review some of the artifacts, such as overestimation of species diversity and incorrect estimation of gene family frequencies, and discuss emerging computational approaches to address them. We also review potential challenges that metagenomics may encounter with the extensive application of next-generation sequencing (NGS) techniques.</p>
</abstract>
<kwd-group><kwd>Metagenomics</kwd>
<kwd>next-generation sequencing (NGS)</kwd>
<kwd>taxonomic/functional profiling</kwd>
<kwd>statistical approaches</kwd>
<kwd>comparative metagenomics</kwd>
</kwd-group>
<contract-num rid="HG1">R01 HG004908-02
||HG</contract-num>
<contract-num rid="HG1">R01 HG004908-01
||HG</contract-num>
<contract-sponsor id="HG1">National Human Genome Research Institute : NHGRI</contract-sponsor>
</article-meta>
</front>
</pmc>
<affiliations><list><country><li>États-Unis</li>
</country>
<region><li>Californie</li>
</region>
</list>
<tree><noCountry><name sortKey="Ye, Yuzhen" sort="Ye, Yuzhen" uniqKey="Ye Y" first="Yuzhen" last="Ye">Yuzhen Ye</name>
</noCountry>
<country name="États-Unis"><region name="Californie"><name sortKey="Wooley, John C" sort="Wooley, John C" uniqKey="Wooley J" first="John C." last="Wooley">John C. Wooley</name>
</region>
</country>
</tree>
</affiliations>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/CyberinfraV1/Data/Ncbi/Merge
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000147 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Ncbi/Merge/biblio.hfd -nk 000147 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Ticri/CIDE |area= CyberinfraV1 |flux= Ncbi |étape= Merge |type= RBID |clé= PMC:2905821 |texte= Metagenomics: Facts and Artifacts, and Computational Challenges* }}
Pour générer des pages wiki
HfdIndexSelect -h $EXPLOR_AREA/Data/Ncbi/Merge/RBID.i -Sk "pubmed:20648230" \ | HfdSelect -Kh $EXPLOR_AREA/Data/Ncbi/Merge/biblio.hfd \ | NlmPubMed2Wicri -a CyberinfraV1
This area was generated with Dilib version V0.6.25. |