MersV1, PubMed, Checkpoint, bibRecord, 001F78

Mango: multiple alignment with N gapped oligos.

Internal identifier: 001F78 (PubMed/Checkpoint); previous: 001F77; next: 001F79

Authors: Zefeng Zhang [People's Republic of China]; Hao Lin; Ming Li

Source:

Journal of bioinformatics and computational biology [0219-7200]; 2008.

RBID : pubmed:18574861

French descriptors

KwdFr:
- Algorithmes, Alignement de séquences, Analyse de séquence d'ADN, Données de séquences moléculaires, Oligonucléotides (analyse).
MESH:
- analyse: Oligonucléotides.
- Algorithmes, Alignement de séquences, Analyse de séquence d'ADN, Données de séquences moléculaires.

English descriptors

KwdEn:
- Algorithms, Molecular Sequence Data, Oligonucleotides (analysis), Sequence Alignment (methods), Sequence Analysis, DNA (methods).
MESH:
- chemical, analysis: Oligonucleotides.
- methods: Sequence Alignment, Sequence Analysis, DNA.
- Algorithms, Molecular Sequence Data.

Abstract

Multiple sequence alignment is a classical and challenging task. The problem is NP-hard. The full dynamic programming takes too much time. The progressive alignment heuristics adopted by most state-of-the-art works suffer from the "once a gap, always a gap" phenomenon. Is there a radically new way to do multiple sequence alignment? In this paper, we introduce a novel and orthogonal multiple sequence alignment method, using both multiple optimized spaced seeds and new algorithms to handle these seeds efficiently. Our new algorithm processes information of all sequences as a whole and tries to build the alignment vertically, avoiding problems caused by the popular progressive approaches. Because the optimized spaced seeds have proved significantly more sensitive than the consecutive k-mers, the new approach promises to be more accurate and reliable. To validate our new approach, we have implemented MANGO: Multiple Alignment with N Gapped Oligos. Experiments were carried out on large 16S RNA benchmarks, showing that MANGO compares favorably, in both accuracy and speed, against state-of-the-art multiple sequence alignment methods, including ClustalW 1.83, MUSCLE 3.6, MAFFT 5.861, ProbConsRNA 1.11, Dialign 2.2.1, DIALIGN-T 0.2.1, T-Coffee 4.85, POA 2.0, and Kalign 2.0. We have further demonstrated the scalability of MANGO on very large datasets of repeat elements. MANGO can be downloaded at http://www.bioinfo.org.cn/mango/ and is free for academic usage.

DOI: 10.1142/s0219720008003527
PubMed: 18574861
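
The abstract contrasts spaced seeds with consecutive k-mers. The sketch below illustrates that general idea only and is not MANGO's implementation: the seed patterns, the toy sequences, and the function names `seed_key` and `shared_seed_hits` are all assumptions made for illustration. A seed is written as a string of `1` (position must match) and `0` (position is ignored), and a hit is a key that occurs in every input sequence.

```python
from collections import defaultdict


def seed_key(window, seed):
    """Keep only the characters under '1' positions; '0' positions are ignored."""
    return "".join(ch for ch, s in zip(window, seed) if s == "1")


def shared_seed_hits(seqs, seed):
    """Return {key: [(sequence index, offset), ...]} for keys found in every sequence."""
    span = len(seed)
    index = defaultdict(list)
    for seq_id, seq in enumerate(seqs):
        for pos in range(len(seq) - span + 1):
            index[seed_key(seq[pos:pos + span], seed)].append((seq_id, pos))
    # A key counts as a hit only if all sequences contribute at least one position.
    return {
        key: hits
        for key, hits in index.items()
        if len({seq_id for seq_id, _ in hits}) == len(seqs)
    }


# Hypothetical toy input: two sequences that differ by a single substitution.
seqs = ["GATTACA", "GATCACA"]
print(shared_seed_hits(seqs, "1111"))   # consecutive 4-mer: {}
print(shared_seed_hits(seqs, "11011"))  # spaced seed of the same weight: {'ATAC': [(0, 1), (1, 1)]}
```

Both seeds require four matching positions, but only the spaced seed tolerates the substitution at its ignored position, which is the sensitivity gain the abstract refers to.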

Links toward previous steps (curation, corpus...)

to stream PubMed, to step Corpus: 002098
to stream PubMed, to step Curation: 002098

Links to Exploration step

pubmed:18574861

The document in XML format

<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en">Mango: multiple alignment with N gapped oligos.</title>
<author><name sortKey="Zhang, Zefeng" sort="Zhang, Zefeng" uniqKey="Zhang Z" first="Zefeng" last="Zhang">Zefeng Zhang</name>
<affiliation wicri:level="3"><nlm:affiliation>Computational Biology Research Group, Division of Intelligent Software Systems, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China. zhangzf@ict.ac.cn</nlm:affiliation>
<country xml:lang="fr">République populaire de Chine</country>
<wicri:regionArea>Computational Biology Research Group, Division of Intelligent Software Systems, Institute of Computing Technology, Chinese Academy of Sciences, Beijing</wicri:regionArea>
<placeName><settlement type="city">Pékin</settlement>
</placeName>
</affiliation>
</author>
<author><name sortKey="Lin, Hao" sort="Lin, Hao" uniqKey="Lin H" first="Hao" last="Lin">Hao Lin</name>
</author>
<author><name sortKey="Li, Ming" sort="Li, Ming" uniqKey="Li M" first="Ming" last="Li">Ming Li</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">PubMed</idno>
<date when="2008">2008</date>
<idno type="RBID">pubmed:18574861</idno>
<idno type="pmid">18574861</idno>
<idno type="doi">10.1142/s0219720008003527</idno>
<idno type="wicri:Area/PubMed/Corpus">002098</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Corpus" wicri:corpus="PubMed">002098</idno>
<idno type="wicri:Area/PubMed/Curation">002098</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Curation">002098</idno>
<idno type="wicri:Area/PubMed/Checkpoint">001F78</idno>
<idno type="wicri:explorRef" wicri:stream="Checkpoint" wicri:step="PubMed">001F78</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en">Mango: multiple alignment with N gapped oligos.</title>
<author><name sortKey="Zhang, Zefeng" sort="Zhang, Zefeng" uniqKey="Zhang Z" first="Zefeng" last="Zhang">Zefeng Zhang</name>
<affiliation wicri:level="3"><nlm:affiliation>Computational Biology Research Group, Division of Intelligent Software Systems, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China. zhangzf@ict.ac.cn</nlm:affiliation>
<country xml:lang="fr">République populaire de Chine</country>
<wicri:regionArea>Computational Biology Research Group, Division of Intelligent Software Systems, Institute of Computing Technology, Chinese Academy of Sciences, Beijing</wicri:regionArea>
<placeName><settlement type="city">Pékin</settlement>
</placeName>
</affiliation>
</author>
<author><name sortKey="Lin, Hao" sort="Lin, Hao" uniqKey="Lin H" first="Hao" last="Lin">Hao Lin</name>
</author>
<author><name sortKey="Li, Ming" sort="Li, Ming" uniqKey="Li M" first="Ming" last="Li">Ming Li</name>
</author>
</analytic>
<series><title level="j">Journal of bioinformatics and computational biology</title>
<idno type="ISSN">0219-7200</idno>
<imprint><date when="2008" type="published">2008</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Algorithms</term>
<term>Molecular Sequence Data</term>
<term>Oligonucleotides (analysis)</term>
<term>Sequence Alignment (methods)</term>
<term>Sequence Analysis, DNA (methods)</term>
</keywords>
<keywords scheme="KwdFr" xml:lang="fr"><term>Algorithmes</term>
<term>Alignement de séquences ()</term>
<term>Analyse de séquence d'ADN ()</term>
<term>Données de séquences moléculaires</term>
<term>Oligonucléotides (analyse)</term>
</keywords>
<keywords scheme="MESH" type="chemical" qualifier="analysis" xml:lang="en"><term>Oligonucleotides</term>
</keywords>
<keywords scheme="MESH" qualifier="analyse" xml:lang="fr"><term>Oligonucléotides</term>
</keywords>
<keywords scheme="MESH" qualifier="methods" xml:lang="en"><term>Sequence Alignment</term>
<term>Sequence Analysis, DNA</term>
</keywords>
<keywords scheme="MESH" xml:lang="en"><term>Algorithms</term>
<term>Molecular Sequence Data</term>
</keywords>
<keywords scheme="MESH" xml:lang="fr"><term>Algorithmes</term>
<term>Alignement de séquences</term>
<term>Analyse de séquence d'ADN</term>
<term>Données de séquences moléculaires</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Multiple sequence alignment is a classical and challenging task. The problem is NP-hard. The full dynamic programming takes too much time. The progressive alignment heuristics adopted by most state-of-the-art works suffer from the "once a gap, always a gap" phenomenon. Is there a radically new way to do multiple sequence alignment? In this paper, we introduce a novel and orthogonal multiple sequence alignment method, using both multiple optimized spaced seeds and new algorithms to handle these seeds efficiently. Our new algorithm processes information of all sequences as a whole and tries to build the alignment vertically, avoiding problems caused by the popular progressive approaches. Because the optimized spaced seeds have proved significantly more sensitive than the consecutive k-mers, the new approach promises to be more accurate and reliable. To validate our new approach, we have implemented MANGO: Multiple Alignment with N Gapped Oligos. Experiments were carried out on large 16S RNA benchmarks, showing that MANGO compares favorably, in both accuracy and speed, against state-of-the-art multiple sequence alignment methods, including ClustalW 1.83, MUSCLE 3.6, MAFFT 5.861, ProbConsRNA 1.11, Dialign 2.2.1, DIALIGN-T 0.2.1, T-Coffee 4.85, POA 2.0, and Kalign 2.0. We have further demonstrated the scalability of MANGO on very large datasets of repeat elements. MANGO can be downloaded at http://www.bioinfo.org.cn/mango/ and is free for academic usage.</div>
</front>
</TEI>
<pubmed><MedlineCitation Status="MEDLINE" Owner="NLM"><PMID Version="1">18574861</PMID>
<DateCompleted><Year>2008</Year>
<Month>08</Month>
<Day>26</Day>
</DateCompleted>
<DateRevised><Year>2019</Year>
<Month>11</Month>
<Day>10</Day>
</DateRevised>
<Article PubModel="Print"><Journal><ISSN IssnType="Print">0219-7200</ISSN>
<JournalIssue CitedMedium="Print"><Volume>6</Volume>
<Issue>3</Issue>
<PubDate><Year>2008</Year>
<Month>Jun</Month>
</PubDate>
</JournalIssue>
<Title>Journal of bioinformatics and computational biology</Title>
<ISOAbbreviation>J Bioinform Comput Biol</ISOAbbreviation>
</Journal>
<ArticleTitle>Mango: multiple alignment with N gapped oligos.</ArticleTitle>
<Pagination><MedlinePgn>521-41</MedlinePgn>
</Pagination>
<Abstract><AbstractText>Multiple sequence alignment is a classical and challenging task. The problem is NP-hard. The full dynamic programming takes too much time. The progressive alignment heuristics adopted by most state-of-the-art works suffer from the "once a gap, always a gap" phenomenon. Is there a radically new way to do multiple sequence alignment? In this paper, we introduce a novel and orthogonal multiple sequence alignment method, using both multiple optimized spaced seeds and new algorithms to handle these seeds efficiently. Our new algorithm processes information of all sequences as a whole and tries to build the alignment vertically, avoiding problems caused by the popular progressive approaches. Because the optimized spaced seeds have proved significantly more sensitive than the consecutive k-mers, the new approach promises to be more accurate and reliable. To validate our new approach, we have implemented MANGO: Multiple Alignment with N Gapped Oligos. Experiments were carried out on large 16S RNA benchmarks, showing that MANGO compares favorably, in both accuracy and speed, against state-of-the-art multiple sequence alignment methods, including ClustalW 1.83, MUSCLE 3.6, MAFFT 5.861, ProbConsRNA 1.11, Dialign 2.2.1, DIALIGN-T 0.2.1, T-Coffee 4.85, POA 2.0, and Kalign 2.0. We have further demonstrated the scalability of MANGO on very large datasets of repeat elements. MANGO can be downloaded at http://www.bioinfo.org.cn/mango/ and is free for academic usage.</AbstractText>
</Abstract>
<AuthorList CompleteYN="Y"><Author ValidYN="Y"><LastName>Zhang</LastName>
<ForeName>Zefeng</ForeName>
<Initials>Z</Initials>
<AffiliationInfo><Affiliation>Computational Biology Research Group, Division of Intelligent Software Systems, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China. zhangzf@ict.ac.cn</Affiliation>
</AffiliationInfo>
</Author>
<Author ValidYN="Y"><LastName>Lin</LastName>
<ForeName>Hao</ForeName>
<Initials>H</Initials>
</Author>
<Author ValidYN="Y"><LastName>Li</LastName>
<ForeName>Ming</ForeName>
<Initials>M</Initials>
</Author>
</AuthorList>
<Language>eng</Language>
<PublicationTypeList><PublicationType UI="D016428">Journal Article</PublicationType>
<PublicationType UI="D013485">Research Support, Non-U.S. Gov't</PublicationType>
</PublicationTypeList>
</Article>
<MedlineJournalInfo><Country>Singapore</Country>
<MedlineTA>J Bioinform Comput Biol</MedlineTA>
<NlmUniqueID>101187344</NlmUniqueID>
<ISSNLinking>0219-7200</ISSNLinking>
</MedlineJournalInfo>
<ChemicalList><Chemical><RegistryNumber>0</RegistryNumber>
<NameOfSubstance UI="D009841">Oligonucleotides</NameOfSubstance>
</Chemical>
</ChemicalList>
<CitationSubset>IM</CitationSubset>
<MeshHeadingList><MeshHeading><DescriptorName UI="D000465" MajorTopicYN="Y">Algorithms</DescriptorName>
</MeshHeading>
<MeshHeading><DescriptorName UI="D008969" MajorTopicYN="N">Molecular Sequence Data</DescriptorName>
</MeshHeading>
<MeshHeading><DescriptorName UI="D009841" MajorTopicYN="N">Oligonucleotides</DescriptorName>
<QualifierName UI="Q000032" MajorTopicYN="Y">analysis</QualifierName>
</MeshHeading>
<MeshHeading><DescriptorName UI="D016415" MajorTopicYN="N">Sequence Alignment</DescriptorName>
<QualifierName UI="Q000379" MajorTopicYN="Y">methods</QualifierName>
</MeshHeading>
<MeshHeading><DescriptorName UI="D017422" MajorTopicYN="N">Sequence Analysis, DNA</DescriptorName>
<QualifierName UI="Q000379" MajorTopicYN="Y">methods</QualifierName>
</MeshHeading>
</MeshHeadingList>
</MedlineCitation>
<PubmedData><History><PubMedPubDate PubStatus="received"><Year>2007</Year>
<Month>08</Month>
<Day>01</Day>
</PubMedPubDate>
<PubMedPubDate PubStatus="revised"><Year>2007</Year>
<Month>12</Month>
<Day>01</Day>
</PubMedPubDate>
<PubMedPubDate PubStatus="accepted"><Year>2008</Year>
<Month>01</Month>
<Day>03</Day>
</PubMedPubDate>
<PubMedPubDate PubStatus="pubmed"><Year>2008</Year>
<Month>6</Month>
<Day>25</Day>
<Hour>9</Hour>
<Minute>0</Minute>
</PubMedPubDate>
<PubMedPubDate PubStatus="medline"><Year>2008</Year>
<Month>8</Month>
<Day>30</Day>
<Hour>9</Hour>
<Minute>0</Minute>
</PubMedPubDate>
<PubMedPubDate PubStatus="entrez"><Year>2008</Year>
<Month>6</Month>
<Day>25</Day>
<Hour>9</Hour>
<Minute>0</Minute>
</PubMedPubDate>
</History>
<PublicationStatus>ppublish</PublicationStatus>
<ArticleIdList><ArticleId IdType="pubmed">18574861</ArticleId>
<ArticleId IdType="pii">S0219720008003527</ArticleId>
<ArticleId IdType="doi">10.1142/s0219720008003527</ArticleId>
</ArticleIdList>
</PubmedData>
</pubmed>
<affiliations><list><country><li>République populaire de Chine</li>
</country>
<settlement><li>Pékin</li>
</settlement>
</list>
<tree><noCountry><name sortKey="Li, Ming" sort="Li, Ming" uniqKey="Li M" first="Ming" last="Li">Ming Li</name>
<name sortKey="Lin, Hao" sort="Lin, Hao" uniqKey="Lin H" first="Hao" last="Lin">Hao Lin</name>
</noCountry>
<country name="République populaire de Chine"><noRegion><name sortKey="Zhang, Zefeng" sort="Zhang, Zefeng" uniqKey="Zhang Z" first="Zefeng" last="Zhang">Zefeng Zhang</name>
</noRegion>
</country>
</tree>
</affiliations>
</record>
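
A minimal sketch of reading fields from the `<pubmed>` part of the record with Python's standard library. It is not part of Dilib. As shown here, the full record uses `wicri:` and `nlm:` prefixes without namespace declarations, so a namespace-aware parser would be expected to reject it as-is; the sketch therefore works on an abbreviated copy of the `<pubmed>` subtree only. The `FRAGMENT` constant and the `summarize` function are illustrative names, not an existing API.

```python
import xml.etree.ElementTree as ET

# Abbreviated copy of the <pubmed> subtree above (most elements omitted).
FRAGMENT = """<pubmed><MedlineCitation Status="MEDLINE" Owner="NLM"><PMID Version="1">18574861</PMID>
<Article PubModel="Print"><ArticleTitle>Mango: multiple alignment with N gapped oligos.</ArticleTitle>
<AuthorList CompleteYN="Y"><Author ValidYN="Y"><LastName>Zhang</LastName><ForeName>Zefeng</ForeName></Author>
<Author ValidYN="Y"><LastName>Lin</LastName><ForeName>Hao</ForeName></Author>
<Author ValidYN="Y"><LastName>Li</LastName><ForeName>Ming</ForeName></Author>
</AuthorList></Article></MedlineCitation></pubmed>"""


def summarize(xml_text):
    """Extract the PMID, article title, and author names from a <pubmed> element."""
    root = ET.fromstring(xml_text)
    citation = root.find("MedlineCitation")
    authors = [
        f"{a.findtext('ForeName')} {a.findtext('LastName')}"
        for a in citation.iterfind("Article/AuthorList/Author")
    ]
    return {
        "pmid": citation.findtext("PMID"),
        "title": citation.findtext("Article/ArticleTitle"),
        "authors": authors,
    }


print(summarize(FRAGMENT))
```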

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Sante/explor/MersV1/Data/PubMed/Checkpoint

HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001F78 | SxmlIndent | more

HfdSelect -h $EXPLOR_AREA/Data/PubMed/Checkpoint/biblio.hfd -nk 001F78 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Sante
   |area=    MersV1
   |flux=    PubMed
   |étape=   Checkpoint
   |type=    RBID
   |clé=     pubmed:18574861
   |texte=   Mango: multiple alignment with N gapped oligos.
}}

To generate wiki pages

HfdIndexSelect -h $EXPLOR_AREA/Data/PubMed/Checkpoint/RBID.i   -Sk "pubmed:18574861" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/PubMed/Checkpoint/biblio.hfd   \
       | NlmPubMed2Wicri -a MersV1

This area was generated with Dilib version V0.6.33.
Data generation: Mon Apr 20 23:26:43 2020. Site generation: Sat Mar 27 09:06:09 2021

MERS exploration server
Warning: this site is under development. It was generated automatically from raw corpora, so the information has not been validated.
