Serveur d'exploration MERS

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Lighter: fast and memory-efficient sequencing error correction without counting.

Identifieur interne : 001788 ( PubMed/Curation ); précédent : 001787; suivant : 001789

Lighter: fast and memory-efficient sequencing error correction without counting.

Auteurs : Li Song ; Liliana Florea ; Ben Langmead

Source :

RBID : pubmed:25398208

Descripteurs français

English descriptors

Abstract

Lighter is a fast, memory-efficient tool for correcting sequencing errors. Lighter avoids counting k-mers. Instead, it uses a pair of Bloom filters, one holding a sample of the input k-mers and the other holding k-mers likely to be correct. As long as the sampling fraction is adjusted in inverse proportion to the depth of sequencing, Bloom filter size can be held constant while maintaining near-constant accuracy. Lighter is parallelized, uses no secondary storage, and is both faster and more memory-efficient than competing approaches while achieving comparable accuracy.

DOI: 10.1186/s13059-014-0509-9
PubMed: 25398208

Links toward previous steps (curation, corpus...)


Links to Exploration step

pubmed:25398208

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Lighter: fast and memory-efficient sequencing error correction without counting.</title>
<author>
<name sortKey="Song, Li" sort="Song, Li" uniqKey="Song L" first="Li" last="Song">Li Song</name>
</author>
<author>
<name sortKey="Florea, Liliana" sort="Florea, Liliana" uniqKey="Florea L" first="Liliana" last="Florea">Liliana Florea</name>
</author>
<author>
<name sortKey="Langmead, Ben" sort="Langmead, Ben" uniqKey="Langmead B" first="Ben" last="Langmead">Ben Langmead</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PubMed</idno>
<date when="2014">2014</date>
<idno type="RBID">pubmed:25398208</idno>
<idno type="pmid">25398208</idno>
<idno type="doi">10.1186/s13059-014-0509-9</idno>
<idno type="wicri:Area/PubMed/Corpus">001788</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Corpus" wicri:corpus="PubMed">001788</idno>
<idno type="wicri:Area/PubMed/Curation">001788</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Curation">001788</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Lighter: fast and memory-efficient sequencing error correction without counting.</title>
<author>
<name sortKey="Song, Li" sort="Song, Li" uniqKey="Song L" first="Li" last="Song">Li Song</name>
</author>
<author>
<name sortKey="Florea, Liliana" sort="Florea, Liliana" uniqKey="Florea L" first="Liliana" last="Florea">Liliana Florea</name>
</author>
<author>
<name sortKey="Langmead, Ben" sort="Langmead, Ben" uniqKey="Langmead B" first="Ben" last="Langmead">Ben Langmead</name>
</author>
</analytic>
<series>
<title level="j">Genome biology</title>
<idno type="eISSN">1474-760X</idno>
<imprint>
<date when="2014" type="published">2014</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Computational Biology</term>
<term>High-Throughput Nucleotide Sequencing (methods)</term>
<term>Humans</term>
<term>Quality Control</term>
<term>Sequence Analysis, DNA</term>
<term>Software</term>
</keywords>
<keywords scheme="KwdFr" xml:lang="fr">
<term>Analyse de séquence d'ADN</term>
<term>Biologie informatique</term>
<term>Contrôle de qualité</term>
<term>Humains</term>
<term>Logiciel</term>
<term>Séquençage nucléotidique à haut débit ()</term>
</keywords>
<keywords scheme="MESH" qualifier="methods" xml:lang="en">
<term>High-Throughput Nucleotide Sequencing</term>
</keywords>
<keywords scheme="MESH" xml:lang="en">
<term>Computational Biology</term>
<term>Humans</term>
<term>Quality Control</term>
<term>Sequence Analysis, DNA</term>
<term>Software</term>
</keywords>
<keywords scheme="MESH" xml:lang="fr">
<term>Analyse de séquence d'ADN</term>
<term>Biologie informatique</term>
<term>Contrôle de qualité</term>
<term>Humains</term>
<term>Logiciel</term>
<term>Séquençage nucléotidique à haut débit</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Lighter is a fast, memory-efficient tool for correcting sequencing errors. Lighter avoids counting k-mers. Instead, it uses a pair of Bloom filters, one holding a sample of the input k-mers and the other holding k-mers likely to be correct. As long as the sampling fraction is adjusted in inverse proportion to the depth of sequencing, Bloom filter size can be held constant while maintaining near-constant accuracy. Lighter is parallelized, uses no secondary storage, and is both faster and more memory-efficient than competing approaches while achieving comparable accuracy.</div>
</front>
</TEI>
<pubmed>
<MedlineCitation Status="MEDLINE" Owner="NLM">
<PMID Version="1">25398208</PMID>
<DateCompleted>
<Year>2015</Year>
<Month>07</Month>
<Day>14</Day>
</DateCompleted>
<DateRevised>
<Year>2018</Year>
<Month>11</Month>
<Day>13</Day>
</DateRevised>
<Article PubModel="Print">
<Journal>
<ISSN IssnType="Electronic">1474-760X</ISSN>
<JournalIssue CitedMedium="Internet">
<Volume>15</Volume>
<Issue>11</Issue>
<PubDate>
<Year>2014</Year>
</PubDate>
</JournalIssue>
<Title>Genome biology</Title>
<ISOAbbreviation>Genome Biol.</ISOAbbreviation>
</Journal>
<ArticleTitle>Lighter: fast and memory-efficient sequencing error correction without counting.</ArticleTitle>
<Pagination>
<MedlinePgn>509</MedlinePgn>
</Pagination>
<Abstract>
<AbstractText>Lighter is a fast, memory-efficient tool for correcting sequencing errors. Lighter avoids counting k-mers. Instead, it uses a pair of Bloom filters, one holding a sample of the input k-mers and the other holding k-mers likely to be correct. As long as the sampling fraction is adjusted in inverse proportion to the depth of sequencing, Bloom filter size can be held constant while maintaining near-constant accuracy. Lighter is parallelized, uses no secondary storage, and is both faster and more memory-efficient than competing approaches while achieving comparable accuracy.</AbstractText>
</Abstract>
<AuthorList CompleteYN="Y">
<Author ValidYN="Y">
<LastName>Song</LastName>
<ForeName>Li</ForeName>
<Initials>L</Initials>
</Author>
<Author ValidYN="Y">
<LastName>Florea</LastName>
<ForeName>Liliana</ForeName>
<Initials>L</Initials>
</Author>
<Author ValidYN="Y">
<LastName>Langmead</LastName>
<ForeName>Ben</ForeName>
<Initials>B</Initials>
</Author>
</AuthorList>
<Language>eng</Language>
<PublicationTypeList>
<PublicationType UI="D016428">Journal Article</PublicationType>
<PublicationType UI="D013485">Research Support, Non-U.S. Gov't</PublicationType>
<PublicationType UI="D013486">Research Support, U.S. Gov't, Non-P.H.S.</PublicationType>
</PublicationTypeList>
</Article>
<MedlineJournalInfo>
<Country>England</Country>
<MedlineTA>Genome Biol</MedlineTA>
<NlmUniqueID>100960660</NlmUniqueID>
<ISSNLinking>1474-7596</ISSNLinking>
</MedlineJournalInfo>
<CitationSubset>IM</CitationSubset>
<MeshHeadingList>
<MeshHeading>
<DescriptorName UI="D019295" MajorTopicYN="N">Computational Biology</DescriptorName>
</MeshHeading>
<MeshHeading>
<DescriptorName UI="D059014" MajorTopicYN="N">High-Throughput Nucleotide Sequencing</DescriptorName>
<QualifierName UI="Q000379" MajorTopicYN="Y">methods</QualifierName>
</MeshHeading>
<MeshHeading>
<DescriptorName UI="D006801" MajorTopicYN="N">Humans</DescriptorName>
</MeshHeading>
<MeshHeading>
<DescriptorName UI="D011786" MajorTopicYN="Y">Quality Control</DescriptorName>
</MeshHeading>
<MeshHeading>
<DescriptorName UI="D017422" MajorTopicYN="N">Sequence Analysis, DNA</DescriptorName>
</MeshHeading>
<MeshHeading>
<DescriptorName UI="D012984" MajorTopicYN="Y">Software</DescriptorName>
</MeshHeading>
</MeshHeadingList>
</MedlineCitation>
<PubmedData>
<History>
<PubMedPubDate PubStatus="received">
<Year>2014</Year>
<Month>05</Month>
<Day>27</Day>
</PubMedPubDate>
<PubMedPubDate PubStatus="entrez">
<Year>2014</Year>
<Month>11</Month>
<Day>16</Day>
<Hour>6</Hour>
<Minute>0</Minute>
</PubMedPubDate>
<PubMedPubDate PubStatus="pubmed">
<Year>2014</Year>
<Month>11</Month>
<Day>16</Day>
<Hour>6</Hour>
<Minute>0</Minute>
</PubMedPubDate>
<PubMedPubDate PubStatus="medline">
<Year>2015</Year>
<Month>7</Month>
<Day>15</Day>
<Hour>6</Hour>
<Minute>0</Minute>
</PubMedPubDate>
</History>
<PublicationStatus>ppublish</PublicationStatus>
<ArticleIdList>
<ArticleId IdType="pubmed">25398208</ArticleId>
<ArticleId IdType="pii">s13059-014-0509-9</ArticleId>
<ArticleId IdType="doi">10.1186/s13059-014-0509-9</ArticleId>
<ArticleId IdType="pmc">PMC4248469</ArticleId>
</ArticleIdList>
<ReferenceList>
<Reference>
<Citation>Genome Biol. 2010;11(11):R116</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">21114842</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Gigascience. 2012 Dec 27;1(1):18</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">23587118</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Genome Res. 2011 Jul;21(7):1181-92</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">21482625</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Nucleic Acids Res. 2012 Dec;40(22):e171</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">22904078</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Genome Res. 2008 May;18(5):821-9</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">18349386</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Bioinformatics. 2010 Oct 15;26(20):2526-33</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">20834037</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Bioinformatics. 2011 Mar 15;27(6):764-70</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">21217122</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Bioinformatics. 2014 May 15;30(10):1354-62</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">24451628</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Genome Res. 2012 Mar;22(3):557-67</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">22147368</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Mol Ecol Resour. 2011 Sep;11(5):759-69</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">21592312</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Bioinformatics. 2009 Sep 1;25(17):2157-63</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">19542152</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Nucleic Acids Res. 2011 Jul;39(13):e90</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">21576222</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>PLoS One. 2014;9(7):e101271</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">25062443</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Bioinformatics. 2011 Jun 1;27(11):1455-61</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">21471014</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>J Comput Biol. 2010 Apr;17(4):603-15</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">20426693</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Bioinformatics. 2014 Dec 15;30(24):3541-7</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">25355787</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Bioinformatics. 2011 Feb 1;27(3):295-302</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">21115437</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Proc Natl Acad Sci U S A. 2001 Aug 14;98(17):9748-53</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">11504945</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Bioinformatics. 2013 Apr 15;29(8):1072-5</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">23422339</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Nat Methods. 2012 Apr;9(4):357-9</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">22388286</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Bioinformatics. 2011 Jul 1;27(13):i137-41</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">21685062</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Bioinformatics. 2004 Sep 1;20(13):2067-74</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">15059830</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>BMC Bioinformatics. 2011;12:333</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">21831268</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Bioinformatics. 2014 Jan 1;30(1):31-7</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">23732276</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Bioinformatics. 2013 Feb 1;29(3):308-15</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">23202746</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Proc Natl Acad Sci U S A. 2012 Aug 14;109(33):13272-7</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">22847406</ArticleId>
</ArticleIdList>
</Reference>
<Reference>
<Citation>Bioinformatics. 2012 Feb 15;28(4):593-4</Citation>
<ArticleIdList>
<ArticleId IdType="pubmed">22199392</ArticleId>
</ArticleIdList>
</Reference>
</ReferenceList>
</PubmedData>
</pubmed>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Sante/explor/MersV1/Data/PubMed/Curation
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001788 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/PubMed/Curation/biblio.hfd -nk 001788 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Sante
   |area=    MersV1
   |flux=    PubMed
   |étape=   Curation
   |type=    RBID
   |clé=     pubmed:25398208
   |texte=   Lighter: fast and memory-efficient sequencing  error correction without counting.
}}

Pour générer des pages wiki

HfdIndexSelect -h $EXPLOR_AREA/Data/PubMed/Curation/RBID.i   -Sk "pubmed:25398208" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/PubMed/Curation/biblio.hfd   \
       | NlmPubMed2Wicri -a MersV1 

Wicri

This area was generated with Dilib version V0.6.33.
Data generation: Mon Apr 20 23:26:43 2020. Site generation: Sat Mar 27 09:06:09 2021