Serveur d'exploration sur la recherche en informatique en Lorraine

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Multi-seed lossless filtration

Identifieur interne : 000666 ( Crin/Checkpoint ); précédent : 000665; suivant : 000667

Multi-seed lossless filtration

Auteurs : Gregory Kucherov ; Laurent Noé ; Mikhail Roytberg

Source :

RBID : CRIN:kucherov04a

English descriptors

Abstract

We study a method of seed-based lossless filtration for approximate string matching and related applications. The method is based on a simultaneous use of several spaced seeds rather than a single seed as studied by Burkhardt and Karkkainen. We present algorithms to compute several important parameters of seed families, study their combinatorial properties, and describe several techniques to construct efficient families. We also report a large-scale application of the proposed technique to the problem of oligonucleotide selection for an EST sequence database.

Links toward previous steps (curation, corpus...)


Links to Exploration step

CRIN:kucherov04a

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" wicri:score="5">Multi-seed lossless filtration</title>
</titleStmt>
<publicationStmt>
<idno type="RBID">CRIN:kucherov04a</idno>
<date when="2004" year="2004">2004</date>
<idno type="wicri:Area/Crin/Corpus">003F00</idno>
<idno type="wicri:Area/Crin/Curation">003F00</idno>
<idno type="wicri:explorRef" wicri:stream="Crin" wicri:step="Curation">003F00</idno>
<idno type="wicri:Area/Crin/Checkpoint">000666</idno>
<idno type="wicri:explorRef" wicri:stream="Crin" wicri:step="Checkpoint">000666</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Multi-seed lossless filtration</title>
<author>
<name sortKey="Kucherov, Gregory" sort="Kucherov, Gregory" uniqKey="Kucherov G" first="Gregory" last="Kucherov">Gregory Kucherov</name>
</author>
<author>
<name sortKey="Noe, Laurent" sort="Noe, Laurent" uniqKey="Noe L" first="Laurent" last="Noé">Laurent Noé</name>
</author>
<author>
<name sortKey="Roytberg, Mikhail" sort="Roytberg, Mikhail" uniqKey="Roytberg M" first="Mikhail" last="Roytberg">Mikhail Roytberg</name>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>est</term>
<term>filtration</term>
<term>lossless filtering</term>
<term>multiple seed</term>
<term>oligonucleotide design</term>
<term>pattern matching</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en" wicri:score="610">We study a method of seed-based lossless filtration for approximate string matching and related applications. The method is based on a simultaneous use of several spaced seeds rather than a single seed as studied by Burkhardt and Karkkainen. We present algorithms to compute several important parameters of seed families, study their combinatorial properties, and describe several techniques to construct efficient families. We also report a large-scale application of the proposed technique to the problem of oligonucleotide selection for an EST sequence database.</div>
</front>
</TEI>
<BibTex type="inproceedings">
<ref>kucherov04a</ref>
<crinnumber>A04-R-201</crinnumber>
<category>3</category>
<equipe>ADAGE</equipe>
<author>
<e>Kucherov, Gregory</e>
<e>Noé, Laurent</e>
<e>Roytberg, Mikhail</e>
</author>
<title>Multi-seed lossless filtration</title>
<booktitle>{Proceedings of the 15th Annual Symposium on Combinatorial Pattern Matching - CPM'2004, Istanbul, Turkey}</booktitle>
<year>2004</year>
<editor>Suleyman Cenk Sahinalp and S. Muthukrishnan and Ugur Dogrusoz</editor>
<volume>3109</volume>
<series>Lecture notes in computer science</series>
<pages>297-310</pages>
<month>Jul</month>
<publisher>Springer</publisher>
<keywords>
<e>multiple seed</e>
<e>lossless filtering</e>
<e>filtration</e>
<e>pattern matching</e>
<e>oligonucleotide design</e>
<e>est</e>
</keywords>
<abstract>We study a method of seed-based lossless filtration for approximate string matching and related applications. The method is based on a simultaneous use of several spaced seeds rather than a single seed as studied by Burkhardt and Karkkainen. We present algorithms to compute several important parameters of seed families, study their combinatorial properties, and describe several techniques to construct efficient families. We also report a large-scale application of the proposed technique to the problem of oligonucleotide selection for an EST sequence database.</abstract>
</BibTex>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Crin/Checkpoint
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000666 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Crin/Checkpoint/biblio.hfd -nk 000666 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Lorraine
   |area=    InforLorV4
   |flux=    Crin
   |étape=   Checkpoint
   |type=    RBID
   |clé=     CRIN:kucherov04a
   |texte=   Multi-seed lossless filtration
}}

Wicri

This area was generated with Dilib version V0.6.33.
Data generation: Mon Jun 10 21:56:28 2019. Site generation: Fri Feb 25 15:29:27 2022