Spectrum alignment: efficient resequencing by hybridization.
Identifieur interne : 002577 ( PubMed/Corpus ); précédent : 002576; suivant : 002578Spectrum alignment: efficient resequencing by hybridization.
Auteurs : I. Pe'Er ; R. ShamirSource :
- Proceedings. International Conference on Intelligent Systems for Molecular Biology [ 1553-0833 ] ; 2000.
English descriptors
- KwdEn :
- MESH :
Abstract
Recent high-density microarray technologies allow, in principle, the determination of all k-mers that appear along a DNA sequence, for k = 8 - 10 in a single experiment on a standard chip. The k-mer contents, also called the spectrum of the sequence, is not sufficient to uniquely reconstruct a sequence longer than a few hundred bases. We have devised a polynomial algorithm that reconstructs the sequence, given the spectrum and a homologous sequence. This situation occurs, for example, in the identification of single nucleotide polymorphisms (SNPs), and whenever a homologue of the target sequence is known. The algorithm is robust, can handle errors in the spectrum and assumes no knowledge of the k-mer multiplicities. Our simulations show that with realistic levels of SNPs, the algorithm correctly reconstructs a target sequence of length up to 2,000 nucleotides when a polymorphic sequence is known. The technique is generalized to handle profiles and HMMs as input instead of a single homologous sequence.
PubMed: 10977087
Links to Exploration step
pubmed:10977087Le document en format XML
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en">Spectrum alignment: efficient resequencing by hybridization.</title>
<author><name sortKey="Pe Er, I" sort="Pe Er, I" uniqKey="Pe Er I" first="I" last="Pe'Er">I. Pe'Er</name>
<affiliation><nlm:affiliation>Department of Computer Science, Tel Aviv University, Israel. izik@math.tau.ac.il</nlm:affiliation>
</affiliation>
</author>
<author><name sortKey="Shamir, R" sort="Shamir, R" uniqKey="Shamir R" first="R" last="Shamir">R. Shamir</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">PubMed</idno>
<date when="2000">2000</date>
<idno type="RBID">pubmed:10977087</idno>
<idno type="pmid">10977087</idno>
<idno type="wicri:Area/PubMed/Corpus">002577</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Corpus" wicri:corpus="PubMed">002577</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en">Spectrum alignment: efficient resequencing by hybridization.</title>
<author><name sortKey="Pe Er, I" sort="Pe Er, I" uniqKey="Pe Er I" first="I" last="Pe'Er">I. Pe'Er</name>
<affiliation><nlm:affiliation>Department of Computer Science, Tel Aviv University, Israel. izik@math.tau.ac.il</nlm:affiliation>
</affiliation>
</author>
<author><name sortKey="Shamir, R" sort="Shamir, R" uniqKey="Shamir R" first="R" last="Shamir">R. Shamir</name>
</author>
</analytic>
<series><title level="j">Proceedings. International Conference on Intelligent Systems for Molecular Biology</title>
<idno type="ISSN">1553-0833</idno>
<imprint><date when="2000" type="published">2000</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Algorithms</term>
<term>Animals</term>
<term>Genome</term>
<term>Humans</term>
<term>Markov Chains</term>
<term>Nucleic Acid Hybridization</term>
<term>Oligonucleotide Array Sequence Analysis</term>
<term>Sequence Alignment (methods)</term>
</keywords>
<keywords scheme="MESH" qualifier="methods" xml:lang="en"><term>Sequence Alignment</term>
</keywords>
<keywords scheme="MESH" xml:lang="en"><term>Algorithms</term>
<term>Animals</term>
<term>Genome</term>
<term>Humans</term>
<term>Markov Chains</term>
<term>Nucleic Acid Hybridization</term>
<term>Oligonucleotide Array Sequence Analysis</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Recent high-density microarray technologies allow, in principle, the determination of all k-mers that appear along a DNA sequence, for k = 8 - 10 in a single experiment on a standard chip. The k-mer contents, also called the spectrum of the sequence, is not sufficient to uniquely reconstruct a sequence longer than a few hundred bases. We have devised a polynomial algorithm that reconstructs the sequence, given the spectrum and a homologous sequence. This situation occurs, for example, in the identification of single nucleotide polymorphisms (SNPs), and whenever a homologue of the target sequence is known. The algorithm is robust, can handle errors in the spectrum and assumes no knowledge of the k-mer multiplicities. Our simulations show that with realistic levels of SNPs, the algorithm correctly reconstructs a target sequence of length up to 2,000 nucleotides when a polymorphic sequence is known. The technique is generalized to handle profiles and HMMs as input instead of a single homologous sequence.</div>
</front>
</TEI>
<pubmed><MedlineCitation Status="MEDLINE" Owner="NLM"><PMID Version="1">10977087</PMID>
<DateCompleted><Year>2001</Year>
<Month>01</Month>
<Day>26</Day>
</DateCompleted>
<DateRevised><Year>2016</Year>
<Month>10</Month>
<Day>21</Day>
</DateRevised>
<Article PubModel="Print"><Journal><ISSN IssnType="Print">1553-0833</ISSN>
<JournalIssue CitedMedium="Print"><Volume>8</Volume>
<PubDate><Year>2000</Year>
</PubDate>
</JournalIssue>
<Title>Proceedings. International Conference on Intelligent Systems for Molecular Biology</Title>
<ISOAbbreviation>Proc Int Conf Intell Syst Mol Biol</ISOAbbreviation>
</Journal>
<ArticleTitle>Spectrum alignment: efficient resequencing by hybridization.</ArticleTitle>
<Pagination><MedlinePgn>260-8</MedlinePgn>
</Pagination>
<Abstract><AbstractText>Recent high-density microarray technologies allow, in principle, the determination of all k-mers that appear along a DNA sequence, for k = 8 - 10 in a single experiment on a standard chip. The k-mer contents, also called the spectrum of the sequence, is not sufficient to uniquely reconstruct a sequence longer than a few hundred bases. We have devised a polynomial algorithm that reconstructs the sequence, given the spectrum and a homologous sequence. This situation occurs, for example, in the identification of single nucleotide polymorphisms (SNPs), and whenever a homologue of the target sequence is known. The algorithm is robust, can handle errors in the spectrum and assumes no knowledge of the k-mer multiplicities. Our simulations show that with realistic levels of SNPs, the algorithm correctly reconstructs a target sequence of length up to 2,000 nucleotides when a polymorphic sequence is known. The technique is generalized to handle profiles and HMMs as input instead of a single homologous sequence.</AbstractText>
</Abstract>
<AuthorList CompleteYN="Y"><Author ValidYN="Y"><LastName>Pe'er</LastName>
<ForeName>I</ForeName>
<Initials>I</Initials>
<AffiliationInfo><Affiliation>Department of Computer Science, Tel Aviv University, Israel. izik@math.tau.ac.il</Affiliation>
</AffiliationInfo>
</Author>
<Author ValidYN="Y"><LastName>Shamir</LastName>
<ForeName>R</ForeName>
<Initials>R</Initials>
</Author>
</AuthorList>
<Language>eng</Language>
<PublicationTypeList><PublicationType UI="D016428">Journal Article</PublicationType>
<PublicationType UI="D013485">Research Support, Non-U.S. Gov't</PublicationType>
</PublicationTypeList>
</Article>
<MedlineJournalInfo><Country>United States</Country>
<MedlineTA>Proc Int Conf Intell Syst Mol Biol</MedlineTA>
<NlmUniqueID>9509125</NlmUniqueID>
<ISSNLinking>1553-0833</ISSNLinking>
</MedlineJournalInfo>
<CitationSubset>IM</CitationSubset>
<MeshHeadingList><MeshHeading><DescriptorName UI="D000465" MajorTopicYN="Y">Algorithms</DescriptorName>
</MeshHeading>
<MeshHeading><DescriptorName UI="D000818" MajorTopicYN="N">Animals</DescriptorName>
</MeshHeading>
<MeshHeading><DescriptorName UI="D016678" MajorTopicYN="Y">Genome</DescriptorName>
</MeshHeading>
<MeshHeading><DescriptorName UI="D006801" MajorTopicYN="N">Humans</DescriptorName>
</MeshHeading>
<MeshHeading><DescriptorName UI="D008390" MajorTopicYN="N">Markov Chains</DescriptorName>
</MeshHeading>
<MeshHeading><DescriptorName UI="D009693" MajorTopicYN="Y">Nucleic Acid Hybridization</DescriptorName>
</MeshHeading>
<MeshHeading><DescriptorName UI="D020411" MajorTopicYN="N">Oligonucleotide Array Sequence Analysis</DescriptorName>
</MeshHeading>
<MeshHeading><DescriptorName UI="D016415" MajorTopicYN="N">Sequence Alignment</DescriptorName>
<QualifierName UI="Q000379" MajorTopicYN="Y">methods</QualifierName>
</MeshHeading>
</MeshHeadingList>
</MedlineCitation>
<PubmedData><History><PubMedPubDate PubStatus="pubmed"><Year>2000</Year>
<Month>9</Month>
<Day>8</Day>
<Hour>11</Hour>
<Minute>0</Minute>
</PubMedPubDate>
<PubMedPubDate PubStatus="medline"><Year>2001</Year>
<Month>2</Month>
<Day>28</Day>
<Hour>10</Hour>
<Minute>1</Minute>
</PubMedPubDate>
<PubMedPubDate PubStatus="entrez"><Year>2000</Year>
<Month>9</Month>
<Day>8</Day>
<Hour>11</Hour>
<Minute>0</Minute>
</PubMedPubDate>
</History>
<PublicationStatus>ppublish</PublicationStatus>
<ArticleIdList><ArticleId IdType="pubmed">10977087</ArticleId>
</ArticleIdList>
</PubmedData>
</pubmed>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Sante/explor/MersV1/Data/PubMed/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 002577 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/PubMed/Corpus/biblio.hfd -nk 002577 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Sante |area= MersV1 |flux= PubMed |étape= Corpus |type= RBID |clé= pubmed:10977087 |texte= Spectrum alignment: efficient resequencing by hybridization. }}
Pour générer des pages wiki
HfdIndexSelect -h $EXPLOR_AREA/Data/PubMed/Corpus/RBID.i -Sk "pubmed:10977087" \ | HfdSelect -Kh $EXPLOR_AREA/Data/PubMed/Corpus/biblio.hfd \ | NlmPubMed2Wicri -a MersV1
This area was generated with Dilib version V0.6.33. |