Serveur d'exploration MERS

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Detecting Superbubbles in Assembly Graphs

Identifieur interne : 000135 ( Istex/Corpus ); précédent : 000134; suivant : 000136

Detecting Superbubbles in Assembly Graphs

Auteurs : Taku Onodera ; Kunihiko Sadakane ; Tetsuo Shibuya

Source :

RBID : ISTEX:68BDA37CB6A189519047ED981BCC99AF97559CCC

Abstract

Abstract: We introduce a new concept of a subgraph class called a superbubble for analyzing assembly graphs, and propose an efficient algorithm for detecting it. Most assembly algorithms utilize assembly graphs like the de Bruijn graph or the overlap graph constructed from reads. From these graphs, many assembly algorithms first detect simple local graph structures (motifs), such as tips and bubbles, mainly to find sequencing errors. These motifs are easy to detect, but they are sometimes too simple to deal with more complex errors. The superbubble is an extension of the bubble, which is also important for analyzing assembly graphs. Though superbubbles are much more complex than ordinary bubbles, we show that they can be efficiently enumerated. We propose an average-case linear time algorithm (i.e., O(n + m) for a graph with n vertices and m edges) for graphs with a reasonable model, though the worst-case time complexity of our algorithm is quadratic (i.e., O(n(n + m))). Moreover, the algorithm is practically very fast: Our experiments show that our algorithm runs in reasonable time with a single CPU core even against a very large graph of a whole human genome.

Url:
DOI: 10.1007/978-3-642-40453-5_26

Links to Exploration step

ISTEX:68BDA37CB6A189519047ED981BCC99AF97559CCC

Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Detecting Superbubbles in Assembly Graphs</title>
<author>
<name sortKey="Onodera, Taku" sort="Onodera, Taku" uniqKey="Onodera T" first="Taku" last="Onodera">Taku Onodera</name>
<affiliation>
<mods:affiliation>Human Genome Center, Institute of Medical Science, University of Tokyo, 4-6-1 Shirokanedai, 108-8639, Minato-ku, Tokyo, Japan</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: tk-ono@hgc.jp</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Sadakane, Kunihiko" sort="Sadakane, Kunihiko" uniqKey="Sadakane K" first="Kunihiko" last="Sadakane">Kunihiko Sadakane</name>
<affiliation>
<mods:affiliation>National Institute of Informatics, 2-1-2 Hitotsubashi, 101-8430, Chiyoda-ku, Tokyo, Japan</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: sada@nii.ac.jp</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Shibuya, Tetsuo" sort="Shibuya, Tetsuo" uniqKey="Shibuya T" first="Tetsuo" last="Shibuya">Tetsuo Shibuya</name>
<affiliation>
<mods:affiliation>Human Genome Center, Institute of Medical Science, University of Tokyo, 4-6-1 Shirokanedai, 108-8639, Minato-ku, Tokyo, Japan</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: tshibuya@hgc.jp</mods:affiliation>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:68BDA37CB6A189519047ED981BCC99AF97559CCC</idno>
<date when="2013" year="2013">2013</date>
<idno type="doi">10.1007/978-3-642-40453-5_26</idno>
<idno type="url">https://api.istex.fr/ark:/67375/HCB-M738F4XW-6/fulltext.pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000135</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Corpus" wicri:corpus="ISTEX">000135</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Detecting Superbubbles in Assembly Graphs</title>
<author>
<name sortKey="Onodera, Taku" sort="Onodera, Taku" uniqKey="Onodera T" first="Taku" last="Onodera">Taku Onodera</name>
<affiliation>
<mods:affiliation>Human Genome Center, Institute of Medical Science, University of Tokyo, 4-6-1 Shirokanedai, 108-8639, Minato-ku, Tokyo, Japan</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: tk-ono@hgc.jp</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Sadakane, Kunihiko" sort="Sadakane, Kunihiko" uniqKey="Sadakane K" first="Kunihiko" last="Sadakane">Kunihiko Sadakane</name>
<affiliation>
<mods:affiliation>National Institute of Informatics, 2-1-2 Hitotsubashi, 101-8430, Chiyoda-ku, Tokyo, Japan</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: sada@nii.ac.jp</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Shibuya, Tetsuo" sort="Shibuya, Tetsuo" uniqKey="Shibuya T" first="Tetsuo" last="Shibuya">Tetsuo Shibuya</name>
<affiliation>
<mods:affiliation>Human Genome Center, Institute of Medical Science, University of Tokyo, 4-6-1 Shirokanedai, 108-8639, Minato-ku, Tokyo, Japan</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: tshibuya@hgc.jp</mods:affiliation>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s" type="main" xml:lang="en">Lecture Notes in Computer Science</title>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: We introduce a new concept of a subgraph class called a superbubble for analyzing assembly graphs, and propose an efficient algorithm for detecting it. Most assembly algorithms utilize assembly graphs like the de Bruijn graph or the overlap graph constructed from reads. From these graphs, many assembly algorithms first detect simple local graph structures (motifs), such as tips and bubbles, mainly to find sequencing errors. These motifs are easy to detect, but they are sometimes too simple to deal with more complex errors. The superbubble is an extension of the bubble, which is also important for analyzing assembly graphs. Though superbubbles are much more complex than ordinary bubbles, we show that they can be efficiently enumerated. We propose an average-case linear time algorithm (i.e., O(n + m) for a graph with n vertices and m edges) for graphs with a reasonable model, though the worst-case time complexity of our algorithm is quadratic (i.e., O(n(n + m))). Moreover, the algorithm is practically very fast: Our experiments show that our algorithm runs in reasonable time with a single CPU core even against a very large graph of a whole human genome.</div>
</front>
</TEI>
<istex>
<corpusName>springer-ebooks</corpusName>
<author>
<json:item>
<name>Taku Onodera</name>
<affiliations>
<json:string>Human Genome Center, Institute of Medical Science, University of Tokyo, 4-6-1 Shirokanedai, 108-8639, Minato-ku, Tokyo, Japan</json:string>
<json:string>E-mail: tk-ono@hgc.jp</json:string>
</affiliations>
</json:item>
<json:item>
<name>Kunihiko Sadakane</name>
<affiliations>
<json:string>National Institute of Informatics, 2-1-2 Hitotsubashi, 101-8430, Chiyoda-ku, Tokyo, Japan</json:string>
<json:string>E-mail: sada@nii.ac.jp</json:string>
</affiliations>
</json:item>
<json:item>
<name>Tetsuo Shibuya</name>
<affiliations>
<json:string>Human Genome Center, Institute of Medical Science, University of Tokyo, 4-6-1 Shirokanedai, 108-8639, Minato-ku, Tokyo, Japan</json:string>
<json:string>E-mail: tshibuya@hgc.jp</json:string>
</affiliations>
</json:item>
</author>
<arkIstex>ark:/67375/HCB-M738F4XW-6</arkIstex>
<language>
<json:string>eng</json:string>
</language>
<originalGenre>
<json:string>OriginalPaper</json:string>
</originalGenre>
<abstract>Abstract: We introduce a new concept of a subgraph class called a superbubble for analyzing assembly graphs, and propose an efficient algorithm for detecting it. Most assembly algorithms utilize assembly graphs like the de Bruijn graph or the overlap graph constructed from reads. From these graphs, many assembly algorithms first detect simple local graph structures (motifs), such as tips and bubbles, mainly to find sequencing errors. These motifs are easy to detect, but they are sometimes too simple to deal with more complex errors. The superbubble is an extension of the bubble, which is also important for analyzing assembly graphs. Though superbubbles are much more complex than ordinary bubbles, we show that they can be efficiently enumerated. We propose an average-case linear time algorithm (i.e., O(n + m) for a graph with n vertices and m edges) for graphs with a reasonable model, though the worst-case time complexity of our algorithm is quadratic (i.e., O(n(n + m))). Moreover, the algorithm is practically very fast: Our experiments show that our algorithm runs in reasonable time with a single CPU core even against a very large graph of a whole human genome.</abstract>
<qualityIndicators>
<refBibsNative>false</refBibsNative>
<abstractWordCount>186</abstractWordCount>
<abstractCharCount>1179</abstractCharCount>
<keywordCount>0</keywordCount>
<score>8.895</score>
<pdfWordCount>4663</pdfWordCount>
<pdfCharCount>24196</pdfCharCount>
<pdfVersion>1.6</pdfVersion>
<pdfPageCount>11</pdfPageCount>
<pdfPageSize>439.363 x 666.131 pts</pdfPageSize>
</qualityIndicators>
<title>Detecting Superbubbles in Assembly Graphs</title>
<chapterId>
<json:string>26</json:string>
<json:string>Chap26</json:string>
</chapterId>
<genre>
<json:string>conference</json:string>
</genre>
<serie>
<title>Lecture Notes in Computer Science</title>
<language>
<json:string>unknown</json:string>
</language>
<copyrightDate>2013</copyrightDate>
<issn>
<json:string>0302-9743</json:string>
</issn>
<eissn>
<json:string>1611-3349</json:string>
</eissn>
<editor>
<json:item>
<name>David Hutchison</name>
<affiliations>
<json:string>Lancaster University, Lancaster, UK</json:string>
</affiliations>
</json:item>
<json:item>
<name>Takeo Kanade</name>
<affiliations>
<json:string>Carnegie Mellon University, Pittsburgh, PA, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Josef Kittler</name>
<affiliations>
<json:string>University of Surrey, Guildford, UK</json:string>
</affiliations>
</json:item>
<json:item>
<name>Jon M. Kleinberg</name>
<affiliations>
<json:string>Cornell University, Ithaca, NY, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Friedemann Mattern</name>
<affiliations>
<json:string>ETH Zurich, Zurich, Switzerland</json:string>
</affiliations>
</json:item>
<json:item>
<name>John C. Mitchell</name>
<affiliations>
<json:string>Stanford University, Stanford, CA, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Moni Naor</name>
<affiliations>
<json:string>Weizmann Institute of Science, Rehovot, Israel</json:string>
</affiliations>
</json:item>
<json:item>
<name>Oscar Nierstrasz</name>
<affiliations>
<json:string>University of Bern, Bern, Switzerland</json:string>
</affiliations>
</json:item>
<json:item>
<name>C. Pandu Rangan</name>
<affiliations>
<json:string>Indian Institute of Technology, Madras, India</json:string>
</affiliations>
</json:item>
<json:item>
<name>Bernhard Steffen</name>
<affiliations>
<json:string>University of Dortmund, Dortmund, Germany</json:string>
</affiliations>
</json:item>
<json:item>
<name>Madhu Sudan</name>
<affiliations>
<json:string>Massachusetts Institute of Technology, MA, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Demetri Terzopoulos</name>
<affiliations>
<json:string>University of California, Los Angeles, CA, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Doug Tygar</name>
<affiliations>
<json:string>University of California, Berkeley, CA, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Moshe Y. Vardi</name>
<affiliations>
<json:string>Rice University, Houston, TX, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Gerhard Weikum</name>
<affiliations>
<json:string>Max-Planck Institute of Computer Science, Saarbrücken, Germany</json:string>
</affiliations>
</json:item>
</editor>
</serie>
<host>
<title>Algorithms in Bioinformatics</title>
<language>
<json:string>unknown</json:string>
</language>
<copyrightDate>2013</copyrightDate>
<doi>
<json:string>10.1007/978-3-642-40453-5</json:string>
</doi>
<issn>
<json:string>0302-9743</json:string>
</issn>
<eissn>
<json:string>1611-3349</json:string>
</eissn>
<eisbn>
<json:string>978-3-642-40453-5</json:string>
</eisbn>
<bookId>
<json:string>978-3-642-40453-5</json:string>
</bookId>
<isbn>
<json:string>978-3-642-40452-8</json:string>
</isbn>
<volume>8126</volume>
<pages>
<first>338</first>
<last>348</last>
</pages>
<genre>
<json:string>book-series</json:string>
</genre>
<editor>
<json:item>
<name>Aaron Darling</name>
<affiliations>
<json:string>ithree institute,, University of Technology Sydney, 2007, Ultimo, NSW, Australia</json:string>
<json:string>E-mail: aaron.darling@uts.edu.au</json:string>
</affiliations>
</json:item>
<json:item>
<name>Jens Stoye</name>
<affiliations>
<json:string>Faculty of Technology, Bielefeld University, Universitätsstraße 25, 33615, Bielefeld, Germany</json:string>
<json:string>E-mail: jens.stoye@uni-bielefeld.de</json:string>
</affiliations>
</json:item>
</editor>
<subject>
<json:item>
<value>Computer Science</value>
</json:item>
<json:item>
<value>Computer Science</value>
</json:item>
<json:item>
<value>Computational Biology/Bioinformatics</value>
</json:item>
<json:item>
<value>Algorithm Analysis and Problem Complexity</value>
</json:item>
<json:item>
<value>Discrete Mathematics in Computer Science</value>
</json:item>
<json:item>
<value>Computation by Abstract Devices</value>
</json:item>
<json:item>
<value>Data Mining and Knowledge Discovery</value>
</json:item>
<json:item>
<value>Probability and Statistics in Computer Science</value>
</json:item>
</subject>
</host>
<ark>
<json:string>ark:/67375/HCB-M738F4XW-6</json:string>
</ark>
<publicationDate>2013</publicationDate>
<copyrightDate>2013</copyrightDate>
<doi>
<json:string>10.1007/978-3-642-40453-5_26</json:string>
</doi>
<id>68BDA37CB6A189519047ED981BCC99AF97559CCC</id>
<score>1</score>
<fulltext>
<json:item>
<extension>pdf</extension>
<original>true</original>
<mimetype>application/pdf</mimetype>
<uri>https://api.istex.fr/ark:/67375/HCB-M738F4XW-6/fulltext.pdf</uri>
</json:item>
<json:item>
<extension>zip</extension>
<original>false</original>
<mimetype>application/zip</mimetype>
<uri>https://api.istex.fr/ark:/67375/HCB-M738F4XW-6/bundle.zip</uri>
</json:item>
<istex:fulltextTEI uri="https://api.istex.fr/ark:/67375/HCB-M738F4XW-6/fulltext.tei">
<teiHeader>
<fileDesc>
<titleStmt>
<title level="a" type="main" xml:lang="en">Detecting Superbubbles in Assembly Graphs</title>
</titleStmt>
<publicationStmt>
<authority>ISTEX</authority>
<availability>
<licence>Springer-Verlag Berlin Heidelberg</licence>
</availability>
<date when="2013">2013</date>
</publicationStmt>
<notesStmt>
<note type="conference" source="proceedings" scheme="https://content-type.data.istex.fr/ark:/67375/XTP-BFHXPBJJ-3">conference</note>
<note type="publication-type" subtype="book-series" scheme="https://publication-type.data.istex.fr/ark:/67375/JMC-0G6R5W5T-Z">book-series</note>
</notesStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Detecting Superbubbles in Assembly Graphs</title>
<author>
<persName>
<forename type="first">Taku</forename>
<surname>Onodera</surname>
</persName>
<email>tk-ono@hgc.jp</email>
<affiliation>
<idno type="GRID" subtype="Institution">grid.26999.3d</idno>
<idno type="ISNI" subtype="Institution">000000012151536X</idno>
<orgName type="department">Human Genome Center, Institute of Medical Science</orgName>
<orgName type="institution">University of Tokyo</orgName>
<address>
<street>4-6-1 Shirokanedai</street>
<settlement>Minato-ku</settlement>
<region>Tokyo</region>
<postCode>108-8639</postCode>
<country key="JP">JAPAN</country>
</address>
</affiliation>
</author>
<author>
<persName>
<forename type="first">Kunihiko</forename>
<surname>Sadakane</surname>
</persName>
<email>sada@nii.ac.jp</email>
<affiliation>
<idno type="GRID" subtype="Institution">grid.250343.3</idno>
<idno type="ISNI" subtype="Institution">0000000110185342</idno>
<orgName type="institution">National Institute of Informatics</orgName>
<address>
<street>2-1-2 Hitotsubashi</street>
<settlement>Chiyoda-ku</settlement>
<region>Tokyo</region>
<postCode>101-8430</postCode>
<country key="JP">JAPAN</country>
</address>
</affiliation>
</author>
<author>
<persName>
<forename type="first">Tetsuo</forename>
<surname>Shibuya</surname>
</persName>
<email>tshibuya@hgc.jp</email>
<affiliation>
<idno type="GRID" subtype="Institution">grid.26999.3d</idno>
<idno type="ISNI" subtype="Institution">000000012151536X</idno>
<orgName type="department">Human Genome Center, Institute of Medical Science</orgName>
<orgName type="institution">University of Tokyo</orgName>
<address>
<street>4-6-1 Shirokanedai</street>
<settlement>Minato-ku</settlement>
<region>Tokyo</region>
<postCode>108-8639</postCode>
<country key="JP">JAPAN</country>
</address>
</affiliation>
</author>
<idno type="istex">68BDA37CB6A189519047ED981BCC99AF97559CCC</idno>
<idno type="ark">ark:/67375/HCB-M738F4XW-6</idno>
<idno type="DOI">10.1007/978-3-642-40453-5_26</idno>
</analytic>
<monogr>
<title level="m" type="main">Algorithms in Bioinformatics</title>
<title level="m" type="sub">13th International Workshop, WABI 2013, Sophia Antipolis, France, September 2-4, 2013. Proceedings</title>
<idno type="DOI">10.1007/978-3-642-40453-5</idno>
<idno type="book-id">978-3-642-40453-5</idno>
<idno type="ISBN">978-3-642-40452-8</idno>
<idno type="eISBN">978-3-642-40453-5</idno>
<idno type="chapter-id">Chap26</idno>
<editor>
<persName>
<forename type="first">Aaron</forename>
<surname>Darling</surname>
</persName>
<email>aaron.darling@uts.edu.au</email>
<affiliation>
<idno type="GRID" subtype="Institution">grid.117476.2</idno>
<idno type="ISNI" subtype="Institution">0000000419367611</idno>
<orgName type="department">ithree institute,</orgName>
<orgName type="institution">University of Technology Sydney</orgName>
<address>
<postCode>2007</postCode>
<settlement>Ultimo</settlement>
<region>NSW</region>
<country key="AU">AUSTRALIA</country>
</address>
</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Jens</forename>
<surname>Stoye</surname>
</persName>
<email>jens.stoye@uni-bielefeld.de</email>
<affiliation>
<idno type="GRID" subtype="Institution">grid.7491.b</idno>
<idno type="ISNI" subtype="Institution">0000000109449128</idno>
<orgName type="department">Faculty of Technology</orgName>
<orgName type="institution">Bielefeld University</orgName>
<address>
<street>Universitätsstraße 25</street>
<postCode>33615</postCode>
<settlement>Bielefeld</settlement>
<country key="DE">GERMANY</country>
</address>
</affiliation>
</editor>
<meeting>
<title type="name">International Workshop on Algorithms in Bioinformatics</title>
<title type="abbr">WABI</title>
<idno type="conf-number">13</idno>
<idno type="Springer">wabi</idno>
<idno type="DBLP">wabi</idno>
<idno type="conf-ID">wabi2013</idno>
<settlement>Sophia Antipolis</settlement>
<country>France</country>
<date from="20130902" to="20130904"></date>
</meeting>
<imprint>
<biblScope unit="vol">8126</biblScope>
<biblScope unit="page" from="338">338</biblScope>
<biblScope unit="page" to="348">348</biblScope>
<biblScope unit="chapter-count">28</biblScope>
</imprint>
</monogr>
<series>
<title level="s" type="main" xml:lang="en">Lecture Notes in Computer Science</title>
<editor>
<persName>
<forename type="first">David</forename>
<surname>Hutchison</surname>
</persName>
<affiliation>
<idno type="GRID" subtype="Institution">grid.9835.7</idno>
<idno type="ISNI" subtype="Institution">0000000081906402</idno>
<orgName type="institution">Lancaster University</orgName>
<address>
<settlement>Lancaster</settlement>
<country key="GB">UNITED KINGDOM</country>
</address>
</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Takeo</forename>
<surname>Kanade</surname>
</persName>
<affiliation>
<idno type="GRID" subtype="Institution">grid.147455.6</idno>
<idno type="ISNI" subtype="Institution">0000000120970344</idno>
<orgName type="institution">Carnegie Mellon University</orgName>
<address>
<settlement>Pittsburgh</settlement>
<region>PA</region>
<country key="US">UNITED STATES</country>
</address>
</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Josef</forename>
<surname>Kittler</surname>
</persName>
<affiliation>
<idno type="GRID" subtype="Institution">grid.5475.3</idno>
<idno type="ISNI" subtype="Institution">0000000404074824</idno>
<orgName type="institution">University of Surrey</orgName>
<address>
<settlement>Guildford</settlement>
<country key="GB">UNITED KINGDOM</country>
</address>
</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Jon</forename>
<forename type="first">M.</forename>
<surname>Kleinberg</surname>
</persName>
<affiliation>
<idno type="GRID" subtype="Institution">grid.5386.8</idno>
<idno type="ISNI" subtype="Institution">000000041936877X</idno>
<orgName type="institution">Cornell University</orgName>
<address>
<settlement>Ithaca</settlement>
<region>NY</region>
<country key="US">UNITED STATES</country>
</address>
</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Friedemann</forename>
<surname>Mattern</surname>
</persName>
<affiliation>
<idno type="GRID" subtype="Institution">grid.5801.c</idno>
<idno type="ISNI" subtype="Institution">0000000121562780</idno>
<orgName type="institution">ETH Zurich</orgName>
<address>
<settlement>Zurich</settlement>
<country key="CH">SWITZERLAND</country>
</address>
</affiliation>
</editor>
<editor>
<persName>
<forename type="first">John</forename>
<forename type="first">C.</forename>
<surname>Mitchell</surname>
</persName>
<affiliation>
<idno type="GRID" subtype="Institution">grid.168010.e</idno>
<idno type="ISNI" subtype="Institution">0000000419368956</idno>
<orgName type="institution">Stanford University</orgName>
<address>
<settlement>Stanford</settlement>
<region>CA</region>
<country key="US">UNITED STATES</country>
</address>
</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Moni</forename>
<surname>Naor</surname>
</persName>
<affiliation>
<idno type="GRID" subtype="Institution">grid.13992.30</idno>
<idno type="ISNI" subtype="Institution">0000000406047563</idno>
<orgName type="institution">Weizmann Institute of Science</orgName>
<address>
<settlement>Rehovot</settlement>
<country key="IL">ISRAEL</country>
</address>
</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Oscar</forename>
<surname>Nierstrasz</surname>
</persName>
<affiliation>
<idno type="GRID" subtype="Institution">grid.5734.5</idno>
<idno type="ISNI" subtype="Institution">0000000107265157</idno>
<orgName type="institution">University of Bern</orgName>
<address>
<settlement>Bern</settlement>
<country key="CH">SWITZERLAND</country>
</address>
</affiliation>
</editor>
<editor>
<persName>
<forename type="first">C.</forename>
<surname>Pandu Rangan</surname>
</persName>
<affiliation>
<idno type="GRID" subtype="Institution">grid.417969.4</idno>
<idno type="ISNI" subtype="Institution">0000000123151926</idno>
<orgName type="institution">Indian Institute of Technology</orgName>
<address>
<settlement>Madras</settlement>
<country key="IN">INDIA</country>
</address>
</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Bernhard</forename>
<surname>Steffen</surname>
</persName>
<affiliation>
<idno type="GRID" subtype="Institution">grid.5675.1</idno>
<idno type="ISNI" subtype="Institution">0000000104169637</idno>
<orgName type="institution">University of Dortmund</orgName>
<address>
<settlement>Dortmund</settlement>
<country key="DE">GERMANY</country>
</address>
</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Madhu</forename>
<surname>Sudan</surname>
</persName>
<affiliation>
<idno type="GRID" subtype="Institution">grid.116068.8</idno>
<idno type="ISNI" subtype="Institution">0000000123412786</idno>
<orgName type="institution">Massachusetts Institute of Technology</orgName>
<address>
<region>MA</region>
<country key="US">UNITED STATES</country>
</address>
</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Demetri</forename>
<surname>Terzopoulos</surname>
</persName>
<affiliation>
<idno type="GRID" subtype="Institution">grid.19006.3e</idno>
<idno type="ISNI" subtype="Institution">0000000096326718</idno>
<orgName type="institution">University of California</orgName>
<address>
<settlement>Los Angeles</settlement>
<region>CA</region>
<country key="US">UNITED STATES</country>
</address>
</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Doug</forename>
<surname>Tygar</surname>
</persName>
<affiliation>
<idno type="GRID" subtype="Institution">grid.47840.3f</idno>
<idno type="ISNI" subtype="Institution">0000000121817878</idno>
<orgName type="institution">University of California</orgName>
<address>
<settlement>Berkeley</settlement>
<region>CA</region>
<country key="US">UNITED STATES</country>
</address>
</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Moshe</forename>
<forename type="first">Y.</forename>
<surname>Vardi</surname>
</persName>
<affiliation>
<idno type="GRID" subtype="Institution">grid.21940.3e</idno>
<idno type="ISNI" subtype="Institution"> 0000000419368278</idno>
<orgName type="institution">Rice University</orgName>
<address>
<settlement>Houston</settlement>
<region>TX</region>
<country key="US">UNITED STATES</country>
</address>
</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Gerhard</forename>
<surname>Weikum</surname>
</persName>
<affiliation>
<idno type="GRID" subtype="Institution">grid.419607.d</idno>
<idno type="ISNI" subtype="Institution">0000000120969941</idno>
<orgName type="institution">Max-Planck Institute of Computer Science</orgName>
<address>
<settlement>Saarbrücken</settlement>
<country key="DE">GERMANY</country>
</address>
</affiliation>
</editor>
<idno type="pISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="seriesID">558</idno>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<abstract xml:lang="en">
<head>Abstract</head>
<p>We introduce a new concept of a subgraph class called a superbubble for analyzing assembly graphs, and propose an efficient algorithm for detecting it. Most assembly algorithms utilize assembly graphs like the de Bruijn graph or the overlap graph constructed from reads. From these graphs, many assembly algorithms first detect simple local graph structures (motifs), such as tips and bubbles, mainly to find sequencing errors. These motifs are easy to detect, but they are sometimes too simple to deal with more complex errors. The superbubble is an extension of the bubble, which is also important for analyzing assembly graphs. Though superbubbles are much more complex than ordinary bubbles, we show that they can be efficiently enumerated. We propose an average-case linear time algorithm (
<hi rend="italic">i.e.</hi>
,
<hi rend="italic">O</hi>
(
<hi rend="italic">n</hi>
 + 
<hi rend="italic">m</hi>
) for a graph with
<hi rend="italic">n</hi>
vertices and
<hi rend="italic">m</hi>
edges) for graphs with a reasonable model, though the worst-case time complexity of our algorithm is quadratic (
<hi rend="italic">i.e.</hi>
,
<hi rend="italic">O</hi>
(
<hi rend="italic">n</hi>
(
<hi rend="italic">n</hi>
 + 
<hi rend="italic">m</hi>
))). Moreover, the algorithm is practically very fast: Our experiments show that our algorithm runs in reasonable time with a single CPU core even against a very large graph of a whole human genome.</p>
</abstract>
<textClass ana="subject">
<keywords scheme="book-subject-collection">
<list>
<label>SUCO11645</label>
<item>
<term>Computer Science</term>
</item>
</list>
</keywords>
</textClass>
<textClass ana="subject">
<keywords scheme="book-subject">
<list>
<label>SCI</label>
<item>
<term type="Primary">Computer Science</term>
</item>
<label>SCI23050</label>
<item>
<term type="Secondary" subtype="priority-1">Computational Biology/Bioinformatics</term>
</item>
<label>SCI16021</label>
<item>
<term type="Secondary" subtype="priority-2">Algorithm Analysis and Problem Complexity</term>
</item>
<label>SCI17028</label>
<item>
<term type="Secondary" subtype="priority-3">Discrete Mathematics in Computer Science</term>
</item>
<label>SCI16013</label>
<item>
<term type="Secondary" subtype="priority-4">Computation by Abstract Devices</term>
</item>
<label>SCI18030</label>
<item>
<term type="Secondary" subtype="priority-5">Data Mining and Knowledge Discovery</term>
</item>
<label>SCI17036</label>
<item>
<term type="Secondary" subtype="priority-6">Probability and Statistics in Computer Science</term>
</item>
</list>
</keywords>
</textClass>
<langUsage>
<language ident="EN"></language>
</langUsage>
</profileDesc>
</teiHeader>
</istex:fulltextTEI>
<json:item>
<extension>txt</extension>
<original>false</original>
<mimetype>text/plain</mimetype>
<uri>https://api.istex.fr/ark:/67375/HCB-M738F4XW-6/fulltext.txt</uri>
</json:item>
</fulltext>
<metadata>
<istex:metadataXml wicri:clean="corpus springer-ebooks not found" wicri:toSee="no header">
<istex:xmlDeclaration>version="1.0" encoding="UTF-8"</istex:xmlDeclaration>
<istex:docType PUBLIC="-//Springer-Verlag//DTD A++ V2.4//EN" URI="http://devel.springer.de/A++/V2.4/DTD/A++V2.4.dtd" name="istex:docType"></istex:docType>
<istex:document>
<Publisher>
<PublisherInfo>
<PublisherName>Springer Berlin Heidelberg</PublisherName>
<PublisherLocation>Berlin, Heidelberg</PublisherLocation>
<PublisherImprintName>Springer</PublisherImprintName>
</PublisherInfo>
<Series>
<SeriesInfo ID="Series558" SeriesType="Series" TocLevels="0">
<SeriesID>558</SeriesID>
<SeriesPrintISSN>0302-9743</SeriesPrintISSN>
<SeriesElectronicISSN>1611-3349</SeriesElectronicISSN>
<SeriesTitle Language="En">Lecture Notes in Computer Science</SeriesTitle>
</SeriesInfo>
<SeriesHeader>
<EditorGroup>
<Editor AffiliationIDS="Aff1">
<EditorName DisplayOrder="Western">
<GivenName>David</GivenName>
<FamilyName>Hutchison</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff2">
<EditorName DisplayOrder="Western">
<GivenName>Takeo</GivenName>
<FamilyName>Kanade</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff3">
<EditorName DisplayOrder="Western">
<GivenName>Josef</GivenName>
<FamilyName>Kittler</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff4">
<EditorName DisplayOrder="Western">
<GivenName>Jon</GivenName>
<GivenName>M.</GivenName>
<FamilyName>Kleinberg</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff5">
<EditorName DisplayOrder="Western">
<GivenName>Friedemann</GivenName>
<FamilyName>Mattern</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff6">
<EditorName DisplayOrder="Western">
<GivenName>John</GivenName>
<GivenName>C.</GivenName>
<FamilyName>Mitchell</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff7">
<EditorName DisplayOrder="Western">
<GivenName>Moni</GivenName>
<FamilyName>Naor</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff8">
<EditorName DisplayOrder="Western">
<GivenName>Oscar</GivenName>
<FamilyName>Nierstrasz</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff9">
<EditorName DisplayOrder="Western">
<GivenName>C.</GivenName>
<FamilyName>Pandu Rangan</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff10">
<EditorName DisplayOrder="Western">
<GivenName>Bernhard</GivenName>
<FamilyName>Steffen</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff11">
<EditorName DisplayOrder="Western">
<GivenName>Madhu</GivenName>
<FamilyName>Sudan</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff12">
<EditorName DisplayOrder="Western">
<GivenName>Demetri</GivenName>
<FamilyName>Terzopoulos</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff13">
<EditorName DisplayOrder="Western">
<GivenName>Doug</GivenName>
<FamilyName>Tygar</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff14">
<EditorName DisplayOrder="Western">
<GivenName>Moshe</GivenName>
<GivenName>Y.</GivenName>
<FamilyName>Vardi</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff15">
<EditorName DisplayOrder="Western">
<GivenName>Gerhard</GivenName>
<FamilyName>Weikum</FamilyName>
</EditorName>
</Editor>
<Affiliation ID="Aff1">
<OrgID Level="Institution" Type="GRID">grid.9835.7</OrgID>
<OrgID Level="Institution" Type="ISNI">0000000081906402</OrgID>
<OrgName>Lancaster University</OrgName>
<OrgAddress>
<City>Lancaster</City>
<Country>UK</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff2">
<OrgID Level="Institution" Type="GRID">grid.147455.6</OrgID>
<OrgID Level="Institution" Type="ISNI">0000000120970344</OrgID>
<OrgName>Carnegie Mellon University</OrgName>
<OrgAddress>
<City>Pittsburgh</City>
<State>PA</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff3">
<OrgID Level="Institution" Type="GRID">grid.5475.3</OrgID>
<OrgID Level="Institution" Type="ISNI">0000000404074824</OrgID>
<OrgName>University of Surrey</OrgName>
<OrgAddress>
<City>Guildford</City>
<Country>UK</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff4">
<OrgID Level="Institution" Type="GRID">grid.5386.8</OrgID>
<OrgID Level="Institution" Type="ISNI">000000041936877X</OrgID>
<OrgName>Cornell University</OrgName>
<OrgAddress>
<City>Ithaca</City>
<State>NY</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff5">
<OrgID Level="Institution" Type="GRID">grid.5801.c</OrgID>
<OrgID Level="Institution" Type="ISNI">0000000121562780</OrgID>
<OrgName>ETH Zurich</OrgName>
<OrgAddress>
<City>Zurich</City>
<Country>Switzerland</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff6">
<OrgID Level="Institution" Type="GRID">grid.168010.e</OrgID>
<OrgID Level="Institution" Type="ISNI">0000000419368956</OrgID>
<OrgName>Stanford University</OrgName>
<OrgAddress>
<City>Stanford</City>
<State>CA</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff7">
<OrgID Level="Institution" Type="GRID">grid.13992.30</OrgID>
<OrgID Level="Institution" Type="ISNI">0000000406047563</OrgID>
<OrgName>Weizmann Institute of Science</OrgName>
<OrgAddress>
<City>Rehovot</City>
<Country>Israel</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff8">
<OrgID Level="Institution" Type="GRID">grid.5734.5</OrgID>
<OrgID Level="Institution" Type="ISNI">0000000107265157</OrgID>
<OrgName>University of Bern</OrgName>
<OrgAddress>
<City>Bern</City>
<Country>Switzerland</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff9">
<OrgID Level="Institution" Type="GRID">grid.417969.4</OrgID>
<OrgID Level="Institution" Type="ISNI">0000000123151926</OrgID>
<OrgName>Indian Institute of Technology</OrgName>
<OrgAddress>
<City>Madras</City>
<Country>India</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff10">
<OrgID Level="Institution" Type="GRID">grid.5675.1</OrgID>
<OrgID Level="Institution" Type="ISNI">0000000104169637</OrgID>
<OrgName>University of Dortmund</OrgName>
<OrgAddress>
<City>Dortmund</City>
<Country>Germany</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff11">
<OrgID Level="Institution" Type="GRID">grid.116068.8</OrgID>
<OrgID Level="Institution" Type="ISNI">0000000123412786</OrgID>
<OrgName>Massachusetts Institute of Technology</OrgName>
<OrgAddress>
<State>MA</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff12">
<OrgID Level="Institution" Type="GRID">grid.19006.3e</OrgID>
<OrgID Level="Institution" Type="ISNI">0000000096326718</OrgID>
<OrgName>University of California</OrgName>
<OrgAddress>
<City>Los Angeles</City>
<State>CA</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff13">
<OrgID Level="Institution" Type="GRID">grid.47840.3f</OrgID>
<OrgID Level="Institution" Type="ISNI">0000000121817878</OrgID>
<OrgName>University of California</OrgName>
<OrgAddress>
<City>Berkeley</City>
<State>CA</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff14">
<OrgID Level="Institution" Type="GRID">grid.21940.3e</OrgID>
<OrgID Level="Institution" Type="ISNI"> 0000000419368278</OrgID>
<OrgName>Rice University</OrgName>
<OrgAddress>
<City>Houston</City>
<State>TX</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff15">
<OrgID Level="Institution" Type="GRID">grid.419607.d</OrgID>
<OrgID Level="Institution" Type="ISNI">0000000120969941</OrgID>
<OrgName>Max-Planck Institute of Computer Science</OrgName>
<OrgAddress>
<City>Saarbrücken</City>
<Country>Germany</Country>
</OrgAddress>
</Affiliation>
</EditorGroup>
</SeriesHeader>
<SubSeries>
<SubSeriesInfo>
<SubSeriesID>5381</SubSeriesID>
<SubSeriesPrintISSN>0302-9743</SubSeriesPrintISSN>
<SubSeriesElectronicISSN>1611-3349</SubSeriesElectronicISSN>
<SubSeriesTitle Language="En">Lecture Notes in Bioinformatics</SubSeriesTitle>
</SubSeriesInfo>
<SubSeriesHeader>
<EditorGroup>
<Editor AffiliationIDS="Aff16">
<EditorName DisplayOrder="Western">
<GivenName>Sorin</GivenName>
<FamilyName>Istrail</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff17">
<EditorName DisplayOrder="Western">
<GivenName>Pavel</GivenName>
<FamilyName>Pevzner</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff18">
<EditorName DisplayOrder="Western">
<GivenName>Michael</GivenName>
<GivenName>S.</GivenName>
<FamilyName>Waterman</FamilyName>
</EditorName>
</Editor>
<Affiliation ID="Aff16">
<OrgID Level="Institution" Type="GRID">grid.40263.33</OrgID>
<OrgID Level="Institution" Type="ISNI">0000000419369094</OrgID>
<OrgName>Brown University</OrgName>
<OrgAddress>
<City>Providence</City>
<State>RI</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff17">
<OrgID Level="Institution" Type="GRID">grid.266100.3</OrgID>
<OrgID Level="Institution" Type="ISNI">0000000121074242</OrgID>
<OrgName>University of California</OrgName>
<OrgAddress>
<City>San Diego</City>
<State>CA</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff18">
<OrgID Level="Institution" Type="GRID">grid.42505.36</OrgID>
<OrgID Level="Institution" Type="ISNI">0000000121566853</OrgID>
<OrgName>University of Southern California</OrgName>
<OrgAddress>
<City>Los Angeles</City>
<State>CA</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
</EditorGroup>
</SubSeriesHeader>
</SubSeries>
<Book Language="En">
<BookInfo BookProductType="Proceedings" ContainsESM="No" Language="En" MediaType="eBook" NumberingDepth="2" NumberingStyle="ContentOnly" OutputMedium="All" TocLevels="0">
<BookID>978-3-642-40453-5</BookID>
<BookTitle>Algorithms in Bioinformatics</BookTitle>
<BookSubTitle>13th International Workshop, WABI 2013, Sophia Antipolis, France, September 2-4, 2013. Proceedings</BookSubTitle>
<BookVolumeNumber>8126</BookVolumeNumber>
<BookSequenceNumber>8126</BookSequenceNumber>
<BookDOI>10.1007/978-3-642-40453-5</BookDOI>
<BookTitleID>318433</BookTitleID>
<BookPrintISBN>978-3-642-40452-8</BookPrintISBN>
<BookElectronicISBN>978-3-642-40453-5</BookElectronicISBN>
<BookChapterCount>28</BookChapterCount>
<BookCopyright>
<CopyrightHolderName>Springer-Verlag Berlin Heidelberg</CopyrightHolderName>
<CopyrightYear>2013</CopyrightYear>
</BookCopyright>
<BookSubjectGroup>
<BookSubject Code="SCI" Type="Primary">Computer Science</BookSubject>
<BookSubject Code="SCI23050" Priority="1" Type="Secondary">Computational Biology/Bioinformatics</BookSubject>
<BookSubject Code="SCI16021" Priority="2" Type="Secondary">Algorithm Analysis and Problem Complexity</BookSubject>
<BookSubject Code="SCI17028" Priority="3" Type="Secondary">Discrete Mathematics in Computer Science</BookSubject>
<BookSubject Code="SCI16013" Priority="4" Type="Secondary">Computation by Abstract Devices</BookSubject>
<BookSubject Code="SCI18030" Priority="5" Type="Secondary">Data Mining and Knowledge Discovery</BookSubject>
<BookSubject Code="SCI17036" Priority="6" Type="Secondary">Probability and Statistics in Computer Science</BookSubject>
<SubjectCollection Code="SUCO11645">Computer Science</SubjectCollection>
</BookSubjectGroup>
<BookContext>
<SeriesID>558</SeriesID>
<SubSeriesID>5381</SubSeriesID>
</BookContext>
<ConferenceInfo>
<ConfSeriesName>International Workshop on Algorithms in Bioinformatics</ConfSeriesName>
<ConfSeriesID Type="Springer">wabi</ConfSeriesID>
<ConfSeriesID Type="DBLP">wabi</ConfSeriesID>
<ConfEventID Type="Springer">wabi2013</ConfEventID>
<ConfEventAbbreviation>WABI</ConfEventAbbreviation>
<ConfNumber>13</ConfNumber>
<ConfEventLocation>
<City>Sophia Antipolis</City>
<Country>France</Country>
</ConfEventLocation>
<ConfEventDateStart>
<Year>2013</Year>
<Month>9</Month>
<Day>2</Day>
</ConfEventDateStart>
<ConfEventDateEnd>
<Year>2013</Year>
<Month>9</Month>
<Day>4</Day>
</ConfEventDateEnd>
</ConferenceInfo>
</BookInfo>
<BookHeader>
<EditorGroup>
<Editor AffiliationIDS="Aff19">
<EditorName DisplayOrder="Western">
<GivenName>Aaron</GivenName>
<FamilyName>Darling</FamilyName>
</EditorName>
<Contact>
<Email>aaron.darling@uts.edu.au</Email>
</Contact>
</Editor>
<Editor AffiliationIDS="Aff20">
<EditorName DisplayOrder="Western">
<GivenName>Jens</GivenName>
<FamilyName>Stoye</FamilyName>
</EditorName>
<Contact>
<Email>jens.stoye@uni-bielefeld.de</Email>
</Contact>
</Editor>
<Affiliation ID="Aff19">
<OrgID Level="Institution" Type="GRID">grid.117476.2</OrgID>
<OrgID Level="Institution" Type="ISNI">0000000419367611</OrgID>
<OrgDivision>ithree institute,</OrgDivision>
<OrgName>University of Technology Sydney</OrgName>
<OrgAddress>
<Postcode>2007</Postcode>
<City>Ultimo</City>
<State>NSW</State>
<Country>Australia</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff20">
<OrgID Level="Institution" Type="GRID">grid.7491.b</OrgID>
<OrgID Level="Institution" Type="ISNI">0000000109449128</OrgID>
<OrgDivision>Faculty of Technology</OrgDivision>
<OrgName>Bielefeld University</OrgName>
<OrgAddress>
<Street>Universitätsstraße 25</Street>
<Postcode>33615</Postcode>
<City>Bielefeld</City>
<Country>Germany</Country>
</OrgAddress>
</Affiliation>
</EditorGroup>
</BookHeader>
<Chapter ID="Chap26" Language="En">
<ChapterInfo ChapterType="OriginalPaper" ContainsESM="No" NumberingDepth="2" NumberingStyle="ContentOnly" TocLevels="0">
<ChapterID>26</ChapterID>
<ChapterDOI>10.1007/978-3-642-40453-5_26</ChapterDOI>
<ChapterSequenceNumber>26</ChapterSequenceNumber>
<ChapterTitle Language="En">Detecting Superbubbles in Assembly Graphs</ChapterTitle>
<ChapterFirstPage>338</ChapterFirstPage>
<ChapterLastPage>348</ChapterLastPage>
<ChapterCopyright>
<CopyrightHolderName>Springer-Verlag Berlin Heidelberg</CopyrightHolderName>
<CopyrightYear>2013</CopyrightYear>
</ChapterCopyright>
<ChapterGrants Type="Regular">
<MetadataGrant Grant="OpenAccess"></MetadataGrant>
<AbstractGrant Grant="OpenAccess"></AbstractGrant>
<BodyPDFGrant Grant="Restricted"></BodyPDFGrant>
<BodyHTMLGrant Grant="Restricted"></BodyHTMLGrant>
<BibliographyGrant Grant="Restricted"></BibliographyGrant>
<ESMGrant Grant="Restricted"></ESMGrant>
</ChapterGrants>
<ChapterContext>
<SeriesID>558</SeriesID>
<BookID>978-3-642-40453-5</BookID>
<BookTitle>Algorithms in Bioinformatics</BookTitle>
</ChapterContext>
</ChapterInfo>
<ChapterHeader>
<AuthorGroup>
<Author AffiliationIDS="Aff21">
<AuthorName DisplayOrder="Western">
<GivenName>Taku</GivenName>
<FamilyName>Onodera</FamilyName>
</AuthorName>
<Contact>
<Email>tk-ono@hgc.jp</Email>
</Contact>
</Author>
<Author AffiliationIDS="Aff22">
<AuthorName DisplayOrder="Western">
<GivenName>Kunihiko</GivenName>
<FamilyName>Sadakane</FamilyName>
</AuthorName>
<Contact>
<Email>sada@nii.ac.jp</Email>
</Contact>
</Author>
<Author AffiliationIDS="Aff21">
<AuthorName DisplayOrder="Western">
<GivenName>Tetsuo</GivenName>
<FamilyName>Shibuya</FamilyName>
</AuthorName>
<Contact>
<Email>tshibuya@hgc.jp</Email>
</Contact>
</Author>
<Affiliation ID="Aff21">
<OrgID Level="Institution" Type="GRID">grid.26999.3d</OrgID>
<OrgID Level="Institution" Type="ISNI">000000012151536X</OrgID>
<OrgDivision>Human Genome Center, Institute of Medical Science</OrgDivision>
<OrgName>University of Tokyo</OrgName>
<OrgAddress>
<Street>4-6-1 Shirokanedai</Street>
<City>Minato-ku</City>
<State>Tokyo</State>
<Postcode>108-8639</Postcode>
<Country>Japan</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff22">
<OrgID Level="Institution" Type="GRID">grid.250343.3</OrgID>
<OrgID Level="Institution" Type="ISNI">0000000110185342</OrgID>
<OrgName>National Institute of Informatics</OrgName>
<OrgAddress>
<Street>2-1-2 Hitotsubashi</Street>
<City>Chiyoda-ku</City>
<State>Tokyo</State>
<Postcode>101-8430</Postcode>
<Country>Japan</Country>
</OrgAddress>
</Affiliation>
</AuthorGroup>
<Abstract ID="Abs1" Language="En">
<Heading>Abstract</Heading>
<Para>We introduce a new concept of a subgraph class called a superbubble for analyzing assembly graphs, and propose an efficient algorithm for detecting it. Most assembly algorithms utilize assembly graphs like the de Bruijn graph or the overlap graph constructed from reads. From these graphs, many assembly algorithms first detect simple local graph structures (motifs), such as tips and bubbles, mainly to find sequencing errors. These motifs are easy to detect, but they are sometimes too simple to deal with more complex errors. The superbubble is an extension of the bubble, which is also important for analyzing assembly graphs. Though superbubbles are much more complex than ordinary bubbles, we show that they can be efficiently enumerated. We propose an average-case linear time algorithm (
<Emphasis Type="Italic">i.e.</Emphasis>
,
<Emphasis Type="Italic">O</Emphasis>
(
<Emphasis Type="Italic">n</Emphasis>
 + 
<Emphasis Type="Italic">m</Emphasis>
) for a graph with
<Emphasis Type="Italic">n</Emphasis>
vertices and
<Emphasis Type="Italic">m</Emphasis>
edges) for graphs with a reasonable model, though the worst-case time complexity of our algorithm is quadratic (
<Emphasis Type="Italic">i.e.</Emphasis>
,
<Emphasis Type="Italic">O</Emphasis>
(
<Emphasis Type="Italic">n</Emphasis>
(
<Emphasis Type="Italic">n</Emphasis>
 + 
<Emphasis Type="Italic">m</Emphasis>
))). Moreover, the algorithm is practically very fast: Our experiments show that our algorithm runs in reasonable time with a single CPU core even against a very large graph of a whole human genome.</Para>
</Abstract>
</ChapterHeader>
<NoBody></NoBody>
</Chapter>
</Book>
</Series>
</Publisher>
<!-- Converted from LaTeX with LaTeX2A++ V3.1.46 --></istex:document>
</istex:metadataXml>
<mods version="3.6">
<titleInfo lang="en">
<title>Detecting Superbubbles in Assembly Graphs</title>
</titleInfo>
<titleInfo type="alternative" contentType="CDATA">
<title>Detecting Superbubbles in Assembly Graphs</title>
</titleInfo>
<name type="personal">
<namePart type="given">Taku</namePart>
<namePart type="family">Onodera</namePart>
<affiliation>Human Genome Center, Institute of Medical Science, University of Tokyo, 4-6-1 Shirokanedai, 108-8639, Minato-ku, Tokyo, Japan</affiliation>
<affiliation>E-mail: tk-ono@hgc.jp</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Kunihiko</namePart>
<namePart type="family">Sadakane</namePart>
<affiliation>National Institute of Informatics, 2-1-2 Hitotsubashi, 101-8430, Chiyoda-ku, Tokyo, Japan</affiliation>
<affiliation>E-mail: sada@nii.ac.jp</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Tetsuo</namePart>
<namePart type="family">Shibuya</namePart>
<affiliation>Human Genome Center, Institute of Medical Science, University of Tokyo, 4-6-1 Shirokanedai, 108-8639, Minato-ku, Tokyo, Japan</affiliation>
<affiliation>E-mail: tshibuya@hgc.jp</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<typeOfResource>text</typeOfResource>
<genre displayLabel="OriginalPaper" authority="ISTEX" authorityURI="https://content-type.data.istex.fr" type="conference" valueURI="https://content-type.data.istex.fr/ark:/67375/XTP-BFHXPBJJ-3">conference</genre>
<originInfo>
<publisher>Springer Berlin Heidelberg</publisher>
<place>
<placeTerm type="text">Berlin, Heidelberg</placeTerm>
</place>
<dateIssued encoding="w3cdtf">2013</dateIssued>
<copyrightDate encoding="w3cdtf">2013</copyrightDate>
</originInfo>
<language>
<languageTerm type="code" authority="rfc3066">en</languageTerm>
<languageTerm type="code" authority="iso639-2b">eng</languageTerm>
</language>
<abstract lang="en">Abstract: We introduce a new concept of a subgraph class called a superbubble for analyzing assembly graphs, and propose an efficient algorithm for detecting it. Most assembly algorithms utilize assembly graphs like the de Bruijn graph or the overlap graph constructed from reads. From these graphs, many assembly algorithms first detect simple local graph structures (motifs), such as tips and bubbles, mainly to find sequencing errors. These motifs are easy to detect, but they are sometimes too simple to deal with more complex errors. The superbubble is an extension of the bubble, which is also important for analyzing assembly graphs. Though superbubbles are much more complex than ordinary bubbles, we show that they can be efficiently enumerated. We propose an average-case linear time algorithm (i.e., O(n + m) for a graph with n vertices and m edges) for graphs with a reasonable model, though the worst-case time complexity of our algorithm is quadratic (i.e., O(n(n + m))). Moreover, the algorithm is practically very fast: Our experiments show that our algorithm runs in reasonable time with a single CPU core even against a very large graph of a whole human genome.</abstract>
<relatedItem type="host">
<titleInfo>
<title>Algorithms in Bioinformatics</title>
<subTitle>13th International Workshop, WABI 2013, Sophia Antipolis, France, September 2-4, 2013. Proceedings</subTitle>
</titleInfo>
<name type="personal">
<namePart type="given">Aaron</namePart>
<namePart type="family">Darling</namePart>
<affiliation>ithree institute,, University of Technology Sydney, 2007, Ultimo, NSW, Australia</affiliation>
<affiliation>E-mail: aaron.darling@uts.edu.au</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Jens</namePart>
<namePart type="family">Stoye</namePart>
<affiliation>Faculty of Technology, Bielefeld University, Universitätsstraße 25, 33615, Bielefeld, Germany</affiliation>
<affiliation>E-mail: jens.stoye@uni-bielefeld.de</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<genre type="book-series" authority="ISTEX" authorityURI="https://publication-type.data.istex.fr" valueURI="https://publication-type.data.istex.fr/ark:/67375/JMC-0G6R5W5T-Z">book-series</genre>
<originInfo>
<publisher>Springer</publisher>
<copyrightDate encoding="w3cdtf">2013</copyrightDate>
<issuance>monographic</issuance>
</originInfo>
<subject>
<genre>Book-Subject-Collection</genre>
<topic authority="SpringerSubjectCodes" authorityURI="SUCO11645">Computer Science</topic>
</subject>
<subject>
<genre>Book-Subject-Group</genre>
<topic authority="SpringerSubjectCodes" authorityURI="SCI">Computer Science</topic>
<topic authority="SpringerSubjectCodes" authorityURI="SCI23050">Computational Biology/Bioinformatics</topic>
<topic authority="SpringerSubjectCodes" authorityURI="SCI16021">Algorithm Analysis and Problem Complexity</topic>
<topic authority="SpringerSubjectCodes" authorityURI="SCI17028">Discrete Mathematics in Computer Science</topic>
<topic authority="SpringerSubjectCodes" authorityURI="SCI16013">Computation by Abstract Devices</topic>
<topic authority="SpringerSubjectCodes" authorityURI="SCI18030">Data Mining and Knowledge Discovery</topic>
<topic authority="SpringerSubjectCodes" authorityURI="SCI17036">Probability and Statistics in Computer Science</topic>
</subject>
<identifier type="DOI">10.1007/978-3-642-40453-5</identifier>
<identifier type="ISBN">978-3-642-40452-8</identifier>
<identifier type="eISBN">978-3-642-40453-5</identifier>
<identifier type="ISSN">0302-9743</identifier>
<identifier type="eISSN">1611-3349</identifier>
<identifier type="BookTitleID">318433</identifier>
<identifier type="BookID">978-3-642-40453-5</identifier>
<identifier type="BookChapterCount">28</identifier>
<identifier type="BookVolumeNumber">8126</identifier>
<identifier type="BookSequenceNumber">8126</identifier>
<part>
<date>2013</date>
<detail type="volume">
<number>8126</number>
<caption>vol.</caption>
</detail>
<extent unit="pages">
<start>338</start>
<end>348</end>
</extent>
</part>
<recordInfo>
<recordOrigin>Springer-Verlag Berlin Heidelberg, 2013</recordOrigin>
</recordInfo>
</relatedItem>
<relatedItem type="series">
<titleInfo>
<title>Lecture Notes in Computer Science</title>
</titleInfo>
<name type="personal">
<namePart type="given">David</namePart>
<namePart type="family">Hutchison</namePart>
<affiliation>Lancaster University, Lancaster, UK</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Takeo</namePart>
<namePart type="family">Kanade</namePart>
<affiliation>Carnegie Mellon University, Pittsburgh, PA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Josef</namePart>
<namePart type="family">Kittler</namePart>
<affiliation>University of Surrey, Guildford, UK</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Jon</namePart>
<namePart type="given">M.</namePart>
<namePart type="family">Kleinberg</namePart>
<affiliation>Cornell University, Ithaca, NY, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Friedemann</namePart>
<namePart type="family">Mattern</namePart>
<affiliation>ETH Zurich, Zurich, Switzerland</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">John</namePart>
<namePart type="given">C.</namePart>
<namePart type="family">Mitchell</namePart>
<affiliation>Stanford University, Stanford, CA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Moni</namePart>
<namePart type="family">Naor</namePart>
<affiliation>Weizmann Institute of Science, Rehovot, Israel</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Oscar</namePart>
<namePart type="family">Nierstrasz</namePart>
<affiliation>University of Bern, Bern, Switzerland</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">C.</namePart>
<namePart type="family">Pandu Rangan</namePart>
<affiliation>Indian Institute of Technology, Madras, India</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Bernhard</namePart>
<namePart type="family">Steffen</namePart>
<affiliation>University of Dortmund, Dortmund, Germany</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Madhu</namePart>
<namePart type="family">Sudan</namePart>
<affiliation>Massachusetts Institute of Technology, MA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Demetri</namePart>
<namePart type="family">Terzopoulos</namePart>
<affiliation>University of California, Los Angeles, CA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Doug</namePart>
<namePart type="family">Tygar</namePart>
<affiliation>University of California, Berkeley, CA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Moshe</namePart>
<namePart type="given">Y.</namePart>
<namePart type="family">Vardi</namePart>
<affiliation>Rice University, Houston, TX, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Gerhard</namePart>
<namePart type="family">Weikum</namePart>
<affiliation>Max-Planck Institute of Computer Science, Saarbrücken, Germany</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<originInfo>
<publisher>Springer</publisher>
<copyrightDate encoding="w3cdtf">2013</copyrightDate>
<issuance>serial</issuance>
</originInfo>
<relatedItem type="constituent">
<titleInfo>
<title>Lecture Notes in Bioinformatics</title>
</titleInfo>
<name type="personal">
<namePart type="given">David</namePart>
<namePart type="family">Hutchison</namePart>
<affiliation>Lancaster University, Lancaster, UK</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Takeo</namePart>
<namePart type="family">Kanade</namePart>
<affiliation>Carnegie Mellon University, Pittsburgh, PA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Josef</namePart>
<namePart type="family">Kittler</namePart>
<affiliation>University of Surrey, Guildford, UK</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Jon</namePart>
<namePart type="given">M.</namePart>
<namePart type="family">Kleinberg</namePart>
<affiliation>Cornell University, Ithaca, NY, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Friedemann</namePart>
<namePart type="family">Mattern</namePart>
<affiliation>ETH Zurich, Zurich, Switzerland</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">John</namePart>
<namePart type="given">C.</namePart>
<namePart type="family">Mitchell</namePart>
<affiliation>Stanford University, Stanford, CA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Moni</namePart>
<namePart type="family">Naor</namePart>
<affiliation>Weizmann Institute of Science, Rehovot, Israel</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Oscar</namePart>
<namePart type="family">Nierstrasz</namePart>
<affiliation>University of Bern, Bern, Switzerland</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">C.</namePart>
<namePart type="family">Pandu Rangan</namePart>
<affiliation>Indian Institute of Technology, Madras, India</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Bernhard</namePart>
<namePart type="family">Steffen</namePart>
<affiliation>University of Dortmund, Dortmund, Germany</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Madhu</namePart>
<namePart type="family">Sudan</namePart>
<affiliation>Massachusetts Institute of Technology, MA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Demetri</namePart>
<namePart type="family">Terzopoulos</namePart>
<affiliation>University of California, Los Angeles, CA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Doug</namePart>
<namePart type="family">Tygar</namePart>
<affiliation>University of California, Berkeley, CA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Moshe</namePart>
<namePart type="given">Y.</namePart>
<namePart type="family">Vardi</namePart>
<affiliation>Rice University, Houston, TX, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Gerhard</namePart>
<namePart type="family">Weikum</namePart>
<affiliation>Max-Planck Institute of Computer Science, Saarbrücken, Germany</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Sorin</namePart>
<namePart type="family">Istrail</namePart>
<affiliation>Brown University, Providence, RI, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Pavel</namePart>
<namePart type="family">Pevzner</namePart>
<affiliation>University of California, San Diego, CA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Michael</namePart>
<namePart type="given">S.</namePart>
<namePart type="family">Waterman</namePart>
<affiliation>University of Southern California, Los Angeles, CA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Aaron</namePart>
<namePart type="family">Darling</namePart>
<affiliation>ithree institute,, University of Technology Sydney, 2007, Ultimo, NSW, Australia</affiliation>
<affiliation>E-mail: aaron.darling@uts.edu.au</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Jens</namePart>
<namePart type="family">Stoye</namePart>
<affiliation>Faculty of Technology, Bielefeld University, Universitätsstraße 25, 33615, Bielefeld, Germany</affiliation>
<affiliation>E-mail: jens.stoye@uni-bielefeld.de</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<genre type="sub-series"></genre>
<identifier type="ISSN">0302-9743</identifier>
<identifier type="eISSN">1611-3349</identifier>
<identifier type="SubSeriesID">5381</identifier>
</relatedItem>
<identifier type="ISSN">0302-9743</identifier>
<identifier type="eISSN">1611-3349</identifier>
<identifier type="SeriesID">558</identifier>
<recordInfo>
<recordOrigin>Springer-Verlag Berlin Heidelberg, 2013</recordOrigin>
</recordInfo>
</relatedItem>
<identifier type="istex">68BDA37CB6A189519047ED981BCC99AF97559CCC</identifier>
<identifier type="ark">ark:/67375/HCB-M738F4XW-6</identifier>
<identifier type="DOI">10.1007/978-3-642-40453-5_26</identifier>
<identifier type="ChapterID">26</identifier>
<identifier type="ChapterID">Chap26</identifier>
<accessCondition type="use and reproduction" contentType="copyright">Springer-Verlag Berlin Heidelberg, 2013</accessCondition>
<recordInfo>
<recordContentSource authority="ISTEX" authorityURI="https://loaded-corpus.data.istex.fr" valueURI="https://loaded-corpus.data.istex.fr/ark:/67375/XBH-RLRX46XW-4">springer</recordContentSource>
<recordOrigin>Springer-Verlag Berlin Heidelberg, 2013</recordOrigin>
</recordInfo>
</mods>
<json:item>
<extension>json</extension>
<original>false</original>
<mimetype>application/json</mimetype>
<uri>https://api.istex.fr/ark:/67375/HCB-M738F4XW-6/record.json</uri>
</json:item>
</metadata>
<annexes>
<json:item>
<extension>xml</extension>
<original>true</original>
<mimetype>application/xml</mimetype>
<uri>https://api.istex.fr/ark:/67375/HCB-M738F4XW-6/annexes.xml</uri>
</json:item>
</annexes>
</istex>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Sante/explor/MersV1/Data/Istex/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000135 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Istex/Corpus/biblio.hfd -nk 000135 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Sante
   |area=    MersV1
   |flux=    Istex
   |étape=   Corpus
   |type=    RBID
   |clé=     ISTEX:68BDA37CB6A189519047ED981BCC99AF97559CCC
   |texte=   Detecting Superbubbles in Assembly Graphs
}}

Wicri

This area was generated with Dilib version V0.6.33.
Data generation: Mon Apr 20 23:26:43 2020. Site generation: Sat Mar 27 09:06:09 2021