MERS exploration server

Warning: this site is under development.
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Detection of low-abundance bacterial strains in metagenomic datasets by eigengenome partitioning.

Internal identifier: 001833 (Main/Exploration); previous: 001832; next: 001834

Authors: Brian Cleary [United States]; Ilana Lauren Brito [United States]; Katherine Huang [United States]; Dirk Gevers [United States]; Terrance Shea [United States]; Sarah Young [United States]; Eric J. Alm [United States]

Source: Nature biotechnology, 2015

RBID : pubmed:26368049

Abstract

Analyses of metagenomic datasets that are sequenced to a depth of billions or trillions of bases can uncover hundreds of microbial genomes, but naive assembly of these data is computationally intensive, requiring hundreds of gigabytes to terabytes of RAM. We present latent strain analysis (LSA), a scalable, de novo pre-assembly method that separates reads into biologically informed partitions and thereby enables assembly of individual genomes. LSA is implemented with a streaming calculation of unobserved variables that we call eigengenomes. Eigengenomes reflect covariance in the abundance of short, fixed-length sequences, or k-mers. As the abundance of each genome in a sample is reflected in the abundance of each k-mer in that genome, eigengenome analysis can be used to partition reads from different genomes. This partitioning can be done in fixed memory using tens of gigabytes of RAM, which makes assembly and downstream analyses of terabytes of data feasible on commodity hardware. Using LSA, we assemble partial and near-complete genomes of bacterial taxa present at relative abundances as low as 0.00001%. We also show that LSA is sensitive enough to separate reads from several strains of the same species.

DOI: 10.1038/nbt.3329
PubMed: 26368049
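
The abstract's central idea is that k-mers belonging to the same genome rise and fall together across samples. The toy Python sketch below illustrates only that covariation idea. It is not the LSA implementation: the abstract describes a streaming calculation of eigengenomes, whereas this sketch substitutes a plain pairwise Pearson correlation and a greedy grouping. The two sequences, the per-sample copy numbers, the 0.9 threshold and all function names are invented for illustration.

```python
from collections import Counter
from math import sqrt

def kmers(seq, k):
    """Return every overlapping substring of length k."""
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]

def abundance_profiles(samples, k):
    """Map each k-mer to its count in every sample (one entry per sample)."""
    counts = [Counter(km for read in reads for km in kmers(read, k))
              for reads in samples]
    vocab = sorted(set().union(*counts))
    return {km: [c[km] for c in counts] for km in vocab}

def pearson(x, y):
    """Pearson correlation of two equal-length abundance profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy) if sx and sy else 0.0

def partition_kmers(profiles, threshold=0.9):
    """Greedily group k-mers whose profiles correlate with a group's seed."""
    groups = []  # each group is a list of k-mers; its first entry is the seed
    for km, prof in profiles.items():
        for g in groups:
            if pearson(profiles[g[0]], prof) >= threshold:
                g.append(km)
                break
        else:
            groups.append([km])
    return groups

def assign_read(read, groups, k):
    """Index of the group holding most of the read's k-mers, or None."""
    votes = Counter(i for km in kmers(read, k)
                    for i, g in enumerate(groups) if km in g)
    return votes.most_common(1)[0][0] if votes else None

# Hypothetical toy data: two made-up "genomes" that share no 4-mers.
GENOME_A = "ACGTACGGTC"
GENOME_B = "TTGCAATGCC"
K = 4
copies = [(5, 1), (1, 4), (3, 2)]  # (copies of A, copies of B) per sample
samples = [[GENOME_A] * a + [GENOME_B] * b for a, b in copies]

profiles = abundance_profiles(samples, K)
groups = partition_kmers(profiles)
```

Calling `assign_read(GENOME_A, groups, K)` and `assign_read(GENOME_B, groups, K)` then places the two sequences in different groups, because their k-mers follow different abundance profiles across the three samples. Real data would need sequencing-error handling, far larger k, and a scalable decomposition in place of the pairwise comparison used here.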


The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Detection of low-abundance bacterial strains in metagenomic datasets by eigengenome partitioning.</title>
<author>
<name sortKey="Cleary, Brian" sort="Cleary, Brian" uniqKey="Cleary B" first="Brian" last="Cleary">Brian Cleary</name>
<affiliation wicri:level="2">
<nlm:affiliation>Computational and Systems Biology Program, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA.</nlm:affiliation>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Computational and Systems Biology Program, Massachusetts Institute of Technology, Cambridge, Massachusetts</wicri:regionArea>
<placeName>
<region type="state">Massachusetts</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Brito, Ilana Lauren" sort="Brito, Ilana Lauren" uniqKey="Brito I" first="Ilana Lauren" last="Brito">Ilana Lauren Brito</name>
<affiliation wicri:level="2">
<nlm:affiliation>Broad Institute of Harvard and MIT, Cambridge, Massachusetts, USA.</nlm:affiliation>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Broad Institute of Harvard and MIT, Cambridge, Massachusetts</wicri:regionArea>
<placeName>
<region type="state">Massachusetts</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Huang, Katherine" sort="Huang, Katherine" uniqKey="Huang K" first="Katherine" last="Huang">Katherine Huang</name>
<affiliation wicri:level="2">
<nlm:affiliation>Broad Institute of Harvard and MIT, Cambridge, Massachusetts, USA.</nlm:affiliation>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Broad Institute of Harvard and MIT, Cambridge, Massachusetts</wicri:regionArea>
<placeName>
<region type="state">Massachusetts</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Gevers, Dirk" sort="Gevers, Dirk" uniqKey="Gevers D" first="Dirk" last="Gevers">Dirk Gevers</name>
<affiliation wicri:level="2">
<nlm:affiliation>Broad Institute of Harvard and MIT, Cambridge, Massachusetts, USA.</nlm:affiliation>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Broad Institute of Harvard and MIT, Cambridge, Massachusetts</wicri:regionArea>
<placeName>
<region type="state">Massachusetts</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Shea, Terrance" sort="Shea, Terrance" uniqKey="Shea T" first="Terrance" last="Shea">Terrance Shea</name>
<affiliation wicri:level="2">
<nlm:affiliation>Broad Institute of Harvard and MIT, Cambridge, Massachusetts, USA.</nlm:affiliation>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Broad Institute of Harvard and MIT, Cambridge, Massachusetts</wicri:regionArea>
<placeName>
<region type="state">Massachusetts</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Young, Sarah" sort="Young, Sarah" uniqKey="Young S" first="Sarah" last="Young">Sarah Young</name>
<affiliation wicri:level="2">
<nlm:affiliation>Broad Institute of Harvard and MIT, Cambridge, Massachusetts, USA.</nlm:affiliation>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Broad Institute of Harvard and MIT, Cambridge, Massachusetts</wicri:regionArea>
<placeName>
<region type="state">Massachusetts</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Alm, Eric J" sort="Alm, Eric J" uniqKey="Alm E" first="Eric J" last="Alm">Eric J. Alm</name>
<affiliation wicri:level="2">
<nlm:affiliation>Broad Institute of Harvard and MIT, Cambridge, Massachusetts, USA.</nlm:affiliation>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Broad Institute of Harvard and MIT, Cambridge, Massachusetts</wicri:regionArea>
<placeName>
<region type="state">Massachusetts</region>
</placeName>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PubMed</idno>
<date when="2015">2015</date>
<idno type="RBID">pubmed:26368049</idno>
<idno type="pmid">26368049</idno>
<idno type="doi">10.1038/nbt.3329</idno>
<idno type="wicri:Area/PubMed/Corpus">001468</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Corpus" wicri:corpus="PubMed">001468</idno>
<idno type="wicri:Area/PubMed/Curation">001468</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Curation">001468</idno>
<idno type="wicri:Area/PubMed/Checkpoint">001581</idno>
<idno type="wicri:explorRef" wicri:stream="Checkpoint" wicri:step="PubMed">001581</idno>
<idno type="wicri:Area/Ncbi/Merge">001269</idno>
<idno type="wicri:Area/Ncbi/Curation">001269</idno>
<idno type="wicri:Area/Ncbi/Checkpoint">001269</idno>
<idno type="wicri:Area/Main/Merge">001838</idno>
<idno type="wicri:Area/Main/Curation">001833</idno>
<idno type="wicri:Area/Main/Exploration">001833</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Detection of low-abundance bacterial strains in metagenomic datasets by eigengenome partitioning.</title>
<author>
<name sortKey="Cleary, Brian" sort="Cleary, Brian" uniqKey="Cleary B" first="Brian" last="Cleary">Brian Cleary</name>
<affiliation wicri:level="2">
<nlm:affiliation>Computational and Systems Biology Program, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA.</nlm:affiliation>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Computational and Systems Biology Program, Massachusetts Institute of Technology, Cambridge, Massachusetts</wicri:regionArea>
<placeName>
<region type="state">Massachusetts</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Brito, Ilana Lauren" sort="Brito, Ilana Lauren" uniqKey="Brito I" first="Ilana Lauren" last="Brito">Ilana Lauren Brito</name>
<affiliation wicri:level="2">
<nlm:affiliation>Broad Institute of Harvard and MIT, Cambridge, Massachusetts, USA.</nlm:affiliation>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Broad Institute of Harvard and MIT, Cambridge, Massachusetts</wicri:regionArea>
<placeName>
<region type="state">Massachusetts</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Huang, Katherine" sort="Huang, Katherine" uniqKey="Huang K" first="Katherine" last="Huang">Katherine Huang</name>
<affiliation wicri:level="2">
<nlm:affiliation>Broad Institute of Harvard and MIT, Cambridge, Massachusetts, USA.</nlm:affiliation>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Broad Institute of Harvard and MIT, Cambridge, Massachusetts</wicri:regionArea>
<placeName>
<region type="state">Massachusetts</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Gevers, Dirk" sort="Gevers, Dirk" uniqKey="Gevers D" first="Dirk" last="Gevers">Dirk Gevers</name>
<affiliation wicri:level="2">
<nlm:affiliation>Broad Institute of Harvard and MIT, Cambridge, Massachusetts, USA.</nlm:affiliation>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Broad Institute of Harvard and MIT, Cambridge, Massachusetts</wicri:regionArea>
<placeName>
<region type="state">Massachusetts</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Shea, Terrance" sort="Shea, Terrance" uniqKey="Shea T" first="Terrance" last="Shea">Terrance Shea</name>
<affiliation wicri:level="2">
<nlm:affiliation>Broad Institute of Harvard and MIT, Cambridge, Massachusetts, USA.</nlm:affiliation>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Broad Institute of Harvard and MIT, Cambridge, Massachusetts</wicri:regionArea>
<placeName>
<region type="state">Massachusetts</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Young, Sarah" sort="Young, Sarah" uniqKey="Young S" first="Sarah" last="Young">Sarah Young</name>
<affiliation wicri:level="2">
<nlm:affiliation>Broad Institute of Harvard and MIT, Cambridge, Massachusetts, USA.</nlm:affiliation>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Broad Institute of Harvard and MIT, Cambridge, Massachusetts</wicri:regionArea>
<placeName>
<region type="state">Massachusetts</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Alm, Eric J" sort="Alm, Eric J" uniqKey="Alm E" first="Eric J" last="Alm">Eric J. Alm</name>
<affiliation wicri:level="2">
<nlm:affiliation>Broad Institute of Harvard and MIT, Cambridge, Massachusetts, USA.</nlm:affiliation>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Broad Institute of Harvard and MIT, Cambridge, Massachusetts</wicri:regionArea>
<placeName>
<region type="state">Massachusetts</region>
</placeName>
</affiliation>
</author>
</analytic>
<series>
<title level="j">Nature biotechnology</title>
<idno type="eISSN">1546-1696</idno>
<imprint>
<date when="2015" type="published">2015</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Algorithms</term>
<term>Bacteria (classification)</term>
<term>Bacteria (genetics)</term>
<term>Chromosome Mapping (methods)</term>
<term>Databases, Genetic</term>
<term>Datasets as Topic</term>
<term>Epigenesis, Genetic (genetics)</term>
<term>Genome, Bacterial (genetics)</term>
<term>Metagenomics (methods)</term>
<term>Microbiota (genetics)</term>
<term>Sequence Analysis, DNA (methods)</term>
<term>Species Specificity</term>
</keywords>
<keywords scheme="KwdFr" xml:lang="fr">
<term>Algorithmes</term>
<term>Analyse de séquence d'ADN ()</term>
<term>Bactéries ()</term>
<term>Bactéries (génétique)</term>
<term>Bases de données génétiques</term>
<term>Cartographie chromosomique ()</term>
<term>Données de la recherche comme sujet</term>
<term>Génome bactérien (génétique)</term>
<term>Microbiote (génétique)</term>
<term>Métagénomique ()</term>
<term>Spécificité d'espèce</term>
<term>Épigenèse génétique (génétique)</term>
</keywords>
<keywords scheme="MESH" qualifier="classification" xml:lang="en">
<term>Bacteria</term>
</keywords>
<keywords scheme="MESH" qualifier="genetics" xml:lang="en">
<term>Bacteria</term>
<term>Epigenesis, Genetic</term>
<term>Genome, Bacterial</term>
<term>Microbiota</term>
</keywords>
<keywords scheme="MESH" qualifier="génétique" xml:lang="fr">
<term>Bactéries</term>
<term>Génome bactérien</term>
<term>Microbiote</term>
<term>Épigenèse génétique</term>
</keywords>
<keywords scheme="MESH" qualifier="methods" xml:lang="en">
<term>Chromosome Mapping</term>
<term>Metagenomics</term>
<term>Sequence Analysis, DNA</term>
</keywords>
<keywords scheme="MESH" xml:lang="en">
<term>Algorithms</term>
<term>Databases, Genetic</term>
<term>Datasets as Topic</term>
<term>Species Specificity</term>
</keywords>
<keywords scheme="MESH" xml:lang="fr">
<term>Algorithmes</term>
<term>Analyse de séquence d'ADN</term>
<term>Bactéries</term>
<term>Bases de données génétiques</term>
<term>Cartographie chromosomique</term>
<term>Données de la recherche comme sujet</term>
<term>Métagénomique</term>
<term>Spécificité d'espèce</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Analyses of metagenomic datasets that are sequenced to a depth of billions or trillions of bases can uncover hundreds of microbial genomes, but naive assembly of these data is computationally intensive, requiring hundreds of gigabytes to terabytes of RAM. We present latent strain analysis (LSA), a scalable, de novo pre-assembly method that separates reads into biologically informed partitions and thereby enables assembly of individual genomes. LSA is implemented with a streaming calculation of unobserved variables that we call eigengenomes. Eigengenomes reflect covariance in the abundance of short, fixed-length sequences, or k-mers. As the abundance of each genome in a sample is reflected in the abundance of each k-mer in that genome, eigengenome analysis can be used to partition reads from different genomes. This partitioning can be done in fixed memory using tens of gigabytes of RAM, which makes assembly and downstream analyses of terabytes of data feasible on commodity hardware. Using LSA, we assemble partial and near-complete genomes of bacterial taxa present at relative abundances as low as 0.00001%. We also show that LSA is sensitive enough to separate reads from several strains of the same species. </div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>États-Unis</li>
</country>
<region>
<li>Massachusetts</li>
</region>
</list>
<tree>
<country name="États-Unis">
<region name="Massachusetts">
<name sortKey="Cleary, Brian" sort="Cleary, Brian" uniqKey="Cleary B" first="Brian" last="Cleary">Brian Cleary</name>
</region>
<name sortKey="Alm, Eric J" sort="Alm, Eric J" uniqKey="Alm E" first="Eric J" last="Alm">Eric J. Alm</name>
<name sortKey="Brito, Ilana Lauren" sort="Brito, Ilana Lauren" uniqKey="Brito I" first="Ilana Lauren" last="Brito">Ilana Lauren Brito</name>
<name sortKey="Gevers, Dirk" sort="Gevers, Dirk" uniqKey="Gevers D" first="Dirk" last="Gevers">Dirk Gevers</name>
<name sortKey="Huang, Katherine" sort="Huang, Katherine" uniqKey="Huang K" first="Katherine" last="Huang">Katherine Huang</name>
<name sortKey="Shea, Terrance" sort="Shea, Terrance" uniqKey="Shea T" first="Terrance" last="Shea">Terrance Shea</name>
<name sortKey="Young, Sarah" sort="Young, Sarah" uniqKey="Young S" first="Sarah" last="Young">Sarah Young</name>
</country>
</tree>
</affiliations>
</record>
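
The record above can be read with a standard XML library, with one caveat: as displayed here it uses the `wicri:` and `nlm:` prefixes without declaring them, so a namespace-aware parser such as Python's `xml.etree.ElementTree` rejects it with an "unbound prefix" error. The sketch below works around this by binding the prefixes to placeholder URIs. The `urn:example:...` values, the function names and the trimmed `SAMPLE` string are assumptions made for illustration; they are not the namespaces Wicri actually uses.

```python
import xml.etree.ElementTree as ET

# Placeholder namespace URIs (assumption): the record leaves wicri: and nlm:
# undeclared, so some binding is needed before the parser will accept it.
NS = {"wicri": "urn:example:wicri", "nlm": "urn:example:nlm"}

def parse_record(xml_text):
    """Declare the missing prefixes on <record>, then parse."""
    decls = " ".join(f'xmlns:{p}="{u}"' for p, u in NS.items())
    patched = xml_text.replace("<record>", f"<record {decls}>", 1)
    return ET.fromstring(patched)

def summarize(root):
    """Collect title, identifiers and authors from the <analytic> block."""
    analytic = root.find(".//analytic")
    authors = [
        (a.findtext("name"),
         a.findtext("affiliation/nlm:affiliation", namespaces=NS))
        for a in analytic.findall("author")
    ]
    return {
        "title": analytic.findtext("title"),
        "doi": root.findtext(".//idno[@type='doi']"),
        "pmid": root.findtext(".//idno[@type='pmid']"),
        "authors": authors,
    }

# Trimmed copy of the record, kept short for the example.
SAMPLE = """<record>
<TEI>
<teiHeader>
<fileDesc>
<publicationStmt>
<idno type="pmid">26368049</idno>
<idno type="doi">10.1038/nbt.3329</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Detection of low-abundance bacterial strains in metagenomic datasets by eigengenome partitioning.</title>
<author>
<name sortKey="Cleary, Brian" first="Brian" last="Cleary">Brian Cleary</name>
<affiliation wicri:level="2">
<nlm:affiliation>Computational and Systems Biology Program, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA.</nlm:affiliation>
</affiliation>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
</teiHeader>
</TEI>
</record>"""

info = summarize(parse_record(SAMPLE))
```

Reading from `<analytic>` rather than `<titleStmt>` avoids counting each author twice, since the record lists the full author block in both places.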

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Sante/explor/MersV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001833 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001833 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Sante
   |area=    MersV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     pubmed:26368049
   |texte=   Detection of low-abundance bacterial strains in metagenomic datasets by eigengenome partitioning.
}}

To generate wiki pages

HfdIndexSelect -h $EXPLOR_AREA/Data/Main/Exploration/RBID.i   -Sk "pubmed:26368049" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd   \
       | NlmPubMed2Wicri -a MersV1 

This area was generated with Dilib version V0.6.33.
Data generation: Mon Apr 20 23:26:43 2020. Site generation: Sat Mar 27 09:06:09 2021