Serveur d'exploration Cyberinfrastructure

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Metagenomics: Facts and Artifacts, and Computational Challenges*

Identifieur interne : 000147 ( Ncbi/Merge ); précédent : 000146; suivant : 000148

Metagenomics: Facts and Artifacts, and Computational Challenges*

Auteurs : John C. Wooley [États-Unis] ; Yuzhen Ye

Source :

RBID : PMC:2905821

Abstract

Metagenomics is the study of microbial communities sampled directly from their natural environment, without prior culturing. By enabling an analysis of populations including many (so-far) unculturable and often unknown microbes, metagenomics is revolutionizing the field of microbiology, and has excited researchers in many disciplines that could benefit from the study of environmental microbes, including those in ecology, environmental sciences, and biomedicine. Specific computational and statistical tools have been developed for metagenomic data analysis and comparison. New studies, however, have revealed various kinds of artifacts present in metagenomics data caused by limitations in the experimental protocols and/or inadequate data analysis procedures, which often lead to incorrect conclusions about a microbial community. Here, we review some of the artifacts, such as overestimation of species diversity and incorrect estimation of gene family frequencies, and discuss emerging computational approaches to address them. We also review potential challenges that metagenomics may encounter with the extensive application of next-generation sequencing (NGS) techniques.


Url:
DOI: 10.1007/s11390-010-9306-4
PubMed: 20648230
PubMed Central: 2905821

Links toward previous steps (curation, corpus...)


Links to Exploration step

PMC:2905821

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Metagenomics: Facts and Artifacts, and Computational Challenges*</title>
<author>
<name sortKey="Wooley, John C" sort="Wooley, John C" uniqKey="Wooley J" first="John C." last="Wooley">John C. Wooley</name>
<affiliation wicri:level="2">
<nlm:aff id="A1"> Center for Research on BioSystems, Calit2, UC San Diego, La Jolla CA 92093</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<placeName>
<region type="state">Californie</region>
</placeName>
<wicri:cityArea> Center for Research on BioSystems, Calit2, UC San Diego</wicri:cityArea>
</affiliation>
</author>
<author>
<name sortKey="Ye, Yuzhen" sort="Ye, Yuzhen" uniqKey="Ye Y" first="Yuzhen" last="Ye">Yuzhen Ye</name>
<affiliation>
<nlm:aff id="A2"> School of Informatics and Computing, Indiana University, Bloomington, Indiana, 47408</nlm:aff>
<wicri:noCountry code="subfield">47408</wicri:noCountry>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PMC</idno>
<idno type="pmid">20648230</idno>
<idno type="pmc">2905821</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2905821</idno>
<idno type="RBID">PMC:2905821</idno>
<idno type="doi">10.1007/s11390-010-9306-4</idno>
<date when="2009">2009</date>
<idno type="wicri:Area/Pmc/Corpus">000296</idno>
<idno type="wicri:Area/Pmc/Curation">000296</idno>
<idno type="wicri:Area/Pmc/Checkpoint">000629</idno>
<idno type="wicri:Area/Ncbi/Merge">000147</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a" type="main">Metagenomics: Facts and Artifacts, and Computational Challenges*</title>
<author>
<name sortKey="Wooley, John C" sort="Wooley, John C" uniqKey="Wooley J" first="John C." last="Wooley">John C. Wooley</name>
<affiliation wicri:level="2">
<nlm:aff id="A1"> Center for Research on BioSystems, Calit2, UC San Diego, La Jolla CA 92093</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<placeName>
<region type="state">Californie</region>
</placeName>
<wicri:cityArea> Center for Research on BioSystems, Calit2, UC San Diego</wicri:cityArea>
</affiliation>
</author>
<author>
<name sortKey="Ye, Yuzhen" sort="Ye, Yuzhen" uniqKey="Ye Y" first="Yuzhen" last="Ye">Yuzhen Ye</name>
<affiliation>
<nlm:aff id="A2"> School of Informatics and Computing, Indiana University, Bloomington, Indiana, 47408</nlm:aff>
<wicri:noCountry code="subfield">47408</wicri:noCountry>
</affiliation>
</author>
</analytic>
<series>
<title level="j">Journal of computer science and technology</title>
<idno type="ISSN">1000-9000</idno>
<idno type="eISSN">1860-4749</idno>
<imprint>
<date when="2009">2009</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">
<p id="P1">Metagenomics is the study of microbial communities sampled directly from their natural environment, without prior culturing. By enabling an analysis of populations including many (so-far) unculturable and often unknown microbes, metagenomics is revolutionizing the field of microbiology, and has excited researchers in many disciplines that could benefit from the study of environmental microbes, including those in ecology, environmental sciences, and biomedicine. Specific computational and statistical tools have been developed for metagenomic data analysis and comparison. New studies, however, have revealed various kinds of artifacts present in metagenomics data caused by limitations in the experimental protocols and/or inadequate data analysis procedures, which often lead to incorrect conclusions about a microbial community. Here, we review some of the artifacts, such as overestimation of species diversity and incorrect estimation of gene family frequencies, and discuss emerging computational approaches to address them. We also review potential challenges that metagenomics may encounter with the extensive application of next-generation sequencing (NGS) techniques.</p>
</div>
</front>
</TEI>
<pmc article-type="research-article" xml:lang="EN">
<pmc-comment>The publisher of this article does not allow downloading of the full text in XML form.</pmc-comment>
<pmc-dir>properties manuscript</pmc-dir>
<front>
<journal-meta>
<journal-id journal-id-type="nlm-journal-id">101530317</journal-id>
<journal-id journal-id-type="pubmed-jr-id">37826</journal-id>
<journal-id journal-id-type="nlm-ta">J Comput Sci Technol</journal-id>
<journal-title>Journal of computer science and technology</journal-title>
<issn pub-type="ppub">1000-9000</issn>
<issn pub-type="epub">1860-4749</issn>
</journal-meta>
<article-meta>
<article-id pub-id-type="pmid">20648230</article-id>
<article-id pub-id-type="pmc">2905821</article-id>
<article-id pub-id-type="doi">10.1007/s11390-010-9306-4</article-id>
<article-id pub-id-type="manuscript">NIHMS175389</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Metagenomics: Facts and Artifacts, and Computational Challenges*</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Wooley</surname>
<given-names>John C.</given-names>
</name>
<xref rid="A1" ref-type="aff">1</xref>
<email>jwooley@ucsd.edu</email>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Ye</surname>
<given-names>Yuzhen</given-names>
</name>
<xref rid="A2" ref-type="aff">2</xref>
<email>yye@indiana.edu</email>
</contrib>
</contrib-group>
<aff id="A1">
<label>1</label>
Center for Research on BioSystems, Calit2, UC San Diego, La Jolla CA 92093</aff>
<aff id="A2">
<label>2</label>
School of Informatics and Computing, Indiana University, Bloomington, Indiana, 47408</aff>
<pub-date pub-type="nihms-submitted">
<day>1</day>
<month>7</month>
<year>2010</year>
</pub-date>
<pub-date pub-type="ppub">
<month>1</month>
<year>2009</year>
</pub-date>
<pub-date pub-type="pmc-release">
<day>19</day>
<month>7</month>
<year>2010</year>
</pub-date>
<volume>25</volume>
<issue>1</issue>
<fpage>71</fpage>
<lpage>81</lpage>
<abstract>
<p id="P1">Metagenomics is the study of microbial communities sampled directly from their natural environment, without prior culturing. By enabling an analysis of populations including many (so-far) unculturable and often unknown microbes, metagenomics is revolutionizing the field of microbiology, and has excited researchers in many disciplines that could benefit from the study of environmental microbes, including those in ecology, environmental sciences, and biomedicine. Specific computational and statistical tools have been developed for metagenomic data analysis and comparison. New studies, however, have revealed various kinds of artifacts present in metagenomics data caused by limitations in the experimental protocols and/or inadequate data analysis procedures, which often lead to incorrect conclusions about a microbial community. Here, we review some of the artifacts, such as overestimation of species diversity and incorrect estimation of gene family frequencies, and discuss emerging computational approaches to address them. We also review potential challenges that metagenomics may encounter with the extensive application of next-generation sequencing (NGS) techniques.</p>
</abstract>
<kwd-group>
<kwd>Metagenomics</kwd>
<kwd>next-generation sequencing (NGS)</kwd>
<kwd>taxonomic/functional profiling</kwd>
<kwd>statistical approaches</kwd>
<kwd>comparative metagenomics</kwd>
</kwd-group>
<contract-num rid="HG1">R01 HG004908-02 ||HG</contract-num>
<contract-num rid="HG1">R01 HG004908-01 ||HG</contract-num>
<contract-sponsor id="HG1">National Human Genome Research Institute : NHGRI</contract-sponsor>
</article-meta>
</front>
</pmc>
<affiliations>
<list>
<country>
<li>États-Unis</li>
</country>
<region>
<li>Californie</li>
</region>
</list>
<tree>
<noCountry>
<name sortKey="Ye, Yuzhen" sort="Ye, Yuzhen" uniqKey="Ye Y" first="Yuzhen" last="Ye">Yuzhen Ye</name>
</noCountry>
<country name="États-Unis">
<region name="Californie">
<name sortKey="Wooley, John C" sort="Wooley, John C" uniqKey="Wooley J" first="John C." last="Wooley">John C. Wooley</name>
</region>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/CyberinfraV1/Data/Ncbi/Merge
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000147 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Ncbi/Merge/biblio.hfd -nk 000147 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    CyberinfraV1
   |flux=    Ncbi
   |étape=   Merge
   |type=    RBID
   |clé=     PMC:2905821
   |texte=   Metagenomics: Facts and Artifacts, and Computational Challenges*
}}

Pour générer des pages wiki

HfdIndexSelect -h $EXPLOR_AREA/Data/Ncbi/Merge/RBID.i   -Sk "pubmed:20648230" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Ncbi/Merge/biblio.hfd   \
       | NlmPubMed2Wicri -a CyberinfraV1 

Wicri

This area was generated with Dilib version V0.6.25.
Data generation: Thu Oct 27 09:30:58 2016. Site generation: Sun Mar 10 23:08:40 2024