Serveur d'exploration MERS

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Konnector v2.0: pseudo-long reads from paired-end sequencing data

Identifieur interne : 000959 ( Pmc/Curation ); précédent : 000958; suivant : 000960

Konnector v2.0: pseudo-long reads from paired-end sequencing data

Auteurs : Benjamin P. Vandervalk [Canada] ; Chen Yang [Canada] ; Zhuyi Xue [Canada] ; Karthika Raghavan [Canada] ; Justin Chu [Canada] ; Hamid Mohamadi [Canada] ; Shaun D. Jackman [Canada] ; Readman Chiu [Canada] ; René L. Warren [Canada] ; Inanç Birol [Canada]

Source :

RBID : PMC:4582294

Abstract

Background

Reading the nucleotides from two ends of a DNA fragment is called paired-end tag (PET) sequencing. When the fragment length is longer than the combined read length, there remains a gap of unsequenced nucleotides between read pairs. If the target in such experiments is sequenced at a level to provide redundant coverage, it may be possible to bridge these gaps using bioinformatics methods. Konnector is a local de novo assembly tool that addresses this problem. Here we report on version 2.0 of our tool.

Results

Konnector uses a probabilistic and memory-efficient data structure called Bloom filter to represent a k-mer spectrum - all possible sequences of length k in an input file, such as the collection of reads in a PET sequencing experiment. It performs look-ups to this data structure to construct an implicit de Bruijn graph, which describes (k-1) base pair overlaps between adjacent k-mers. It traverses this graph to bridge the gap between a given pair of flanking sequences.

Conclusions

Here we report the performance of Konnector v2.0 on simulated and experimental datasets, and compare it against other tools with similar functionality. We note that, representing k-mers with 1.5 bytes of memory on average, Konnector can scale to very large genomes. With our parallel implementation, it can also process over a billion bases on commodity hardware.


Url:
DOI: 10.1186/1755-8794-8-S3-S1
PubMed: 26399504
PubMed Central: 4582294

Links toward previous steps (curation, corpus...)


Links to Exploration step

PMC:4582294

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Konnector v2.0: pseudo-long reads from paired-end sequencing data</title>
<author>
<name sortKey="Vandervalk, Benjamin P" sort="Vandervalk, Benjamin P" uniqKey="Vandervalk B" first="Benjamin P" last="Vandervalk">Benjamin P. Vandervalk</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6, Canada</nlm:aff>
<country xml:lang="fr">Canada</country>
<wicri:regionArea>Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6</wicri:regionArea>
</affiliation>
</author>
<author>
<name sortKey="Yang, Chen" sort="Yang, Chen" uniqKey="Yang C" first="Chen" last="Yang">Chen Yang</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6, Canada</nlm:aff>
<country xml:lang="fr">Canada</country>
<wicri:regionArea>Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6</wicri:regionArea>
</affiliation>
</author>
<author>
<name sortKey="Xue, Zhuyi" sort="Xue, Zhuyi" uniqKey="Xue Z" first="Zhuyi" last="Xue">Zhuyi Xue</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6, Canada</nlm:aff>
<country xml:lang="fr">Canada</country>
<wicri:regionArea>Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6</wicri:regionArea>
</affiliation>
</author>
<author>
<name sortKey="Raghavan, Karthika" sort="Raghavan, Karthika" uniqKey="Raghavan K" first="Karthika" last="Raghavan">Karthika Raghavan</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6, Canada</nlm:aff>
<country xml:lang="fr">Canada</country>
<wicri:regionArea>Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6</wicri:regionArea>
</affiliation>
</author>
<author>
<name sortKey="Chu, Justin" sort="Chu, Justin" uniqKey="Chu J" first="Justin" last="Chu">Justin Chu</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6, Canada</nlm:aff>
<country xml:lang="fr">Canada</country>
<wicri:regionArea>Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6</wicri:regionArea>
</affiliation>
</author>
<author>
<name sortKey="Mohamadi, Hamid" sort="Mohamadi, Hamid" uniqKey="Mohamadi H" first="Hamid" last="Mohamadi">Hamid Mohamadi</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6, Canada</nlm:aff>
<country xml:lang="fr">Canada</country>
<wicri:regionArea>Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6</wicri:regionArea>
</affiliation>
</author>
<author>
<name sortKey="Jackman, Shaun D" sort="Jackman, Shaun D" uniqKey="Jackman S" first="Shaun D" last="Jackman">Shaun D. Jackman</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6, Canada</nlm:aff>
<country xml:lang="fr">Canada</country>
<wicri:regionArea>Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6</wicri:regionArea>
</affiliation>
</author>
<author>
<name sortKey="Chiu, Readman" sort="Chiu, Readman" uniqKey="Chiu R" first="Readman" last="Chiu">Readman Chiu</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6, Canada</nlm:aff>
<country xml:lang="fr">Canada</country>
<wicri:regionArea>Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6</wicri:regionArea>
</affiliation>
</author>
<author>
<name sortKey="Warren, Rene L" sort="Warren, Rene L" uniqKey="Warren R" first="René L" last="Warren">René L. Warren</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6, Canada</nlm:aff>
<country xml:lang="fr">Canada</country>
<wicri:regionArea>Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6</wicri:regionArea>
</affiliation>
</author>
<author>
<name sortKey="Birol, Inanc" sort="Birol, Inanc" uniqKey="Birol I" first="Inanç" last="Birol">Inanç Birol</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6, Canada</nlm:aff>
<country xml:lang="fr">Canada</country>
<wicri:regionArea>Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6</wicri:regionArea>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PMC</idno>
<idno type="pmid">26399504</idno>
<idno type="pmc">4582294</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4582294</idno>
<idno type="RBID">PMC:4582294</idno>
<idno type="doi">10.1186/1755-8794-8-S3-S1</idno>
<date when="2015">2015</date>
<idno type="wicri:Area/Pmc/Corpus">000959</idno>
<idno type="wicri:explorRef" wicri:stream="Pmc" wicri:step="Corpus" wicri:corpus="PMC">000959</idno>
<idno type="wicri:Area/Pmc/Curation">000959</idno>
<idno type="wicri:explorRef" wicri:stream="Pmc" wicri:step="Curation">000959</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a" type="main">Konnector v2.0: pseudo-long reads from paired-end sequencing data</title>
<author>
<name sortKey="Vandervalk, Benjamin P" sort="Vandervalk, Benjamin P" uniqKey="Vandervalk B" first="Benjamin P" last="Vandervalk">Benjamin P. Vandervalk</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6, Canada</nlm:aff>
<country xml:lang="fr">Canada</country>
<wicri:regionArea>Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6</wicri:regionArea>
</affiliation>
</author>
<author>
<name sortKey="Yang, Chen" sort="Yang, Chen" uniqKey="Yang C" first="Chen" last="Yang">Chen Yang</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6, Canada</nlm:aff>
<country xml:lang="fr">Canada</country>
<wicri:regionArea>Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6</wicri:regionArea>
</affiliation>
</author>
<author>
<name sortKey="Xue, Zhuyi" sort="Xue, Zhuyi" uniqKey="Xue Z" first="Zhuyi" last="Xue">Zhuyi Xue</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6, Canada</nlm:aff>
<country xml:lang="fr">Canada</country>
<wicri:regionArea>Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6</wicri:regionArea>
</affiliation>
</author>
<author>
<name sortKey="Raghavan, Karthika" sort="Raghavan, Karthika" uniqKey="Raghavan K" first="Karthika" last="Raghavan">Karthika Raghavan</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6, Canada</nlm:aff>
<country xml:lang="fr">Canada</country>
<wicri:regionArea>Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6</wicri:regionArea>
</affiliation>
</author>
<author>
<name sortKey="Chu, Justin" sort="Chu, Justin" uniqKey="Chu J" first="Justin" last="Chu">Justin Chu</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6, Canada</nlm:aff>
<country xml:lang="fr">Canada</country>
<wicri:regionArea>Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6</wicri:regionArea>
</affiliation>
</author>
<author>
<name sortKey="Mohamadi, Hamid" sort="Mohamadi, Hamid" uniqKey="Mohamadi H" first="Hamid" last="Mohamadi">Hamid Mohamadi</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6, Canada</nlm:aff>
<country xml:lang="fr">Canada</country>
<wicri:regionArea>Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6</wicri:regionArea>
</affiliation>
</author>
<author>
<name sortKey="Jackman, Shaun D" sort="Jackman, Shaun D" uniqKey="Jackman S" first="Shaun D" last="Jackman">Shaun D. Jackman</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6, Canada</nlm:aff>
<country xml:lang="fr">Canada</country>
<wicri:regionArea>Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6</wicri:regionArea>
</affiliation>
</author>
<author>
<name sortKey="Chiu, Readman" sort="Chiu, Readman" uniqKey="Chiu R" first="Readman" last="Chiu">Readman Chiu</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6, Canada</nlm:aff>
<country xml:lang="fr">Canada</country>
<wicri:regionArea>Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6</wicri:regionArea>
</affiliation>
</author>
<author>
<name sortKey="Warren, Rene L" sort="Warren, Rene L" uniqKey="Warren R" first="René L" last="Warren">René L. Warren</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6, Canada</nlm:aff>
<country xml:lang="fr">Canada</country>
<wicri:regionArea>Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6</wicri:regionArea>
</affiliation>
</author>
<author>
<name sortKey="Birol, Inanc" sort="Birol, Inanc" uniqKey="Birol I" first="Inanç" last="Birol">Inanç Birol</name>
<affiliation wicri:level="1">
<nlm:aff id="I1">Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6, Canada</nlm:aff>
<country xml:lang="fr">Canada</country>
<wicri:regionArea>Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6</wicri:regionArea>
</affiliation>
</author>
</analytic>
<series>
<title level="j">BMC Medical Genomics</title>
<idno type="eISSN">1755-8794</idno>
<imprint>
<date when="2015">2015</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">
<sec>
<title>Background</title>
<p>Reading the nucleotides from two ends of a DNA fragment is called paired-end tag (PET) sequencing. When the fragment length is longer than the combined read length, there remains a gap of unsequenced nucleotides between read pairs. If the target in such experiments is sequenced at a level to provide redundant coverage, it may be possible to bridge these gaps using bioinformatics methods. Konnector is a local
<italic>de novo </italic>
assembly tool that addresses this problem. Here we report on version 2.0 of our tool.</p>
</sec>
<sec>
<title>Results</title>
<p>Konnector uses a probabilistic and memory-efficient data structure called Bloom filter to represent a k-mer spectrum - all possible sequences of length k in an input file, such as the collection of reads in a PET sequencing experiment. It performs look-ups to this data structure to construct an implicit de Bruijn graph, which describes (k-1) base pair overlaps between adjacent k-mers. It traverses this graph to bridge the gap between a given pair of flanking sequences.</p>
</sec>
<sec>
<title>Conclusions</title>
<p>Here we report the performance of Konnector v2.0 on simulated and experimental datasets, and compare it against other tools with similar functionality. We note that, representing k-mers with 1.5 bytes of memory on average, Konnector can scale to very large genomes. With our parallel implementation, it can also process over a billion bases on commodity hardware.</p>
</sec>
</div>
</front>
<back>
<div1 type="bibliography">
<listBibl>
<biblStruct>
<analytic>
<author>
<name sortKey="Vandervalk, Bp" uniqKey="Vandervalk B">BP Vandervalk</name>
</author>
<author>
<name sortKey="Jackman, Sd" uniqKey="Jackman S">SD Jackman</name>
</author>
<author>
<name sortKey="Raymond, A" uniqKey="Raymond A">A Raymond</name>
</author>
<author>
<name sortKey="Mohamadi, H" uniqKey="Mohamadi H">H Mohamadi</name>
</author>
<author>
<name sortKey="Yang, C" uniqKey="Yang C">C Yang</name>
</author>
<author>
<name sortKey="Attali, Da" uniqKey="Attali D">DA Attali</name>
</author>
<author>
<name sortKey="Konnector" uniqKey="Konnector">Konnector</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Pevzner, Pa" uniqKey="Pevzner P">PA Pevzner</name>
</author>
<author>
<name sortKey="Tang, H" uniqKey="Tang H">H Tang</name>
</author>
<author>
<name sortKey="Waterman, Ms" uniqKey="Waterman M">MS Waterman</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Bloom, Bh" uniqKey="Bloom B">BH Bloom</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Chaisson, Mj" uniqKey="Chaisson M">MJ Chaisson</name>
</author>
<author>
<name sortKey="Pevzner, Pa" uniqKey="Pevzner P">PA Pevzner</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Zimin, Av" uniqKey="Zimin A">AV Zimin</name>
</author>
<author>
<name sortKey="Marcais, G" uniqKey="Marcais G">G Marcais</name>
</author>
<author>
<name sortKey="Puiu, D" uniqKey="Puiu D">D Puiu</name>
</author>
<author>
<name sortKey="Roberts, M" uniqKey="Roberts M">M Roberts</name>
</author>
<author>
<name sortKey="Salzberg, Sl" uniqKey="Salzberg S">SL Salzberg</name>
</author>
<author>
<name sortKey="Yorke, Ja" uniqKey="Yorke J">JA Yorke</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Magoc, T" uniqKey="Magoc T">T Magoc</name>
</author>
<author>
<name sortKey="Salzberg, Sl" uniqKey="Salzberg S">SL Salzberg</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Liu, B" uniqKey="Liu B">B Liu</name>
</author>
<author>
<name sortKey="Yuan, J" uniqKey="Yuan J">J Yuan</name>
</author>
<author>
<name sortKey="Yiu, Sm" uniqKey="Yiu S">SM Yiu</name>
</author>
<author>
<name sortKey="Li, Z" uniqKey="Li Z">Z Li</name>
</author>
<author>
<name sortKey="Xie, Y" uniqKey="Xie Y">Y Xie</name>
</author>
<author>
<name sortKey="Chen, Y" uniqKey="Chen Y">Y Chen</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Silver, Dh" uniqKey="Silver D">DH Silver</name>
</author>
<author>
<name sortKey="Ben Elazar, S" uniqKey="Ben Elazar S">S Ben-Elazar</name>
</author>
<author>
<name sortKey="Bogoslavsky, A" uniqKey="Bogoslavsky A">A Bogoslavsky</name>
</author>
<author>
<name sortKey="Yanai, I" uniqKey="Yanai I">I Yanai</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Nadalin, F" uniqKey="Nadalin F">F Nadalin</name>
</author>
<author>
<name sortKey="Vezzi, F" uniqKey="Vezzi F">F Vezzi</name>
</author>
<author>
<name sortKey="Policriti, A" uniqKey="Policriti A">A Policriti</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Myers, Ew" uniqKey="Myers E">EW Myers</name>
</author>
<author>
<name sortKey="Sutton, Gg" uniqKey="Sutton G">GG Sutton</name>
</author>
<author>
<name sortKey="Delcher, Al" uniqKey="Delcher A">AL Delcher</name>
</author>
<author>
<name sortKey="Dew, Im" uniqKey="Dew I">IM Dew</name>
</author>
<author>
<name sortKey="Fasulo, Dp" uniqKey="Fasulo D">DP Fasulo</name>
</author>
<author>
<name sortKey="Flanigan, Mj" uniqKey="Flanigan M">MJ Flanigan</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Simpson, Jt" uniqKey="Simpson J">JT Simpson</name>
</author>
<author>
<name sortKey="Wong, K" uniqKey="Wong K">K Wong</name>
</author>
<author>
<name sortKey="Jackman, Sd" uniqKey="Jackman S">SD Jackman</name>
</author>
<author>
<name sortKey="Schein, Je" uniqKey="Schein J">JE Schein</name>
</author>
<author>
<name sortKey="Jones, Sj" uniqKey="Jones S">SJ Jones</name>
</author>
<author>
<name sortKey="Birol, I" uniqKey="Birol I">I Birol</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Boisvert, S" uniqKey="Boisvert S">S Boisvert</name>
</author>
<author>
<name sortKey="Laviolette, F" uniqKey="Laviolette F">F Laviolette</name>
</author>
<author>
<name sortKey="Corbeil, J" uniqKey="Corbeil J">J Corbeil</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Li, H" uniqKey="Li H">H Li</name>
</author>
<author>
<name sortKey="Durbin, R" uniqKey="Durbin R">R Durbin</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Langmead, B" uniqKey="Langmead B">B Langmead</name>
</author>
<author>
<name sortKey="Trapnell, C" uniqKey="Trapnell C">C Trapnell</name>
</author>
<author>
<name sortKey="Pop, M" uniqKey="Pop M">M Pop</name>
</author>
<author>
<name sortKey="Salzberg, Sl" uniqKey="Salzberg S">SL Salzberg</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Simpson, Jt" uniqKey="Simpson J">JT Simpson</name>
</author>
<author>
<name sortKey="Durbin, R" uniqKey="Durbin R">R Durbin</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Stranneheim, H" uniqKey="Stranneheim H">H Stranneheim</name>
</author>
<author>
<name sortKey="Kaller, M" uniqKey="Kaller M">M Kaller</name>
</author>
<author>
<name sortKey="Allander, T" uniqKey="Allander T">T Allander</name>
</author>
<author>
<name sortKey="Andersson, B" uniqKey="Andersson B">B Andersson</name>
</author>
<author>
<name sortKey="Arvestad, L" uniqKey="Arvestad L">L Arvestad</name>
</author>
<author>
<name sortKey="Lundeberg, J" uniqKey="Lundeberg J">J Lundeberg</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Chikhi, R" uniqKey="Chikhi R">R Chikhi</name>
</author>
<author>
<name sortKey="Rizk, G" uniqKey="Rizk G">G Rizk</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Miller, Jr" uniqKey="Miller J">JR Miller</name>
</author>
<author>
<name sortKey="Delcher, Al" uniqKey="Delcher A">AL Delcher</name>
</author>
<author>
<name sortKey="Koren, S" uniqKey="Koren S">S Koren</name>
</author>
<author>
<name sortKey="Venter, E" uniqKey="Venter E">E Venter</name>
</author>
<author>
<name sortKey="Walenz, Bp" uniqKey="Walenz B">BP Walenz</name>
</author>
<author>
<name sortKey="Brownley, A" uniqKey="Brownley A">A Brownley</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Hu, X" uniqKey="Hu X">X Hu</name>
</author>
<author>
<name sortKey="Yuan, J" uniqKey="Yuan J">J Yuan</name>
</author>
<author>
<name sortKey="Shi, Y" uniqKey="Shi Y">Y Shi</name>
</author>
<author>
<name sortKey="Lu, J" uniqKey="Lu J">J Lu</name>
</author>
<author>
<name sortKey="Liu, B" uniqKey="Liu B">B Liu</name>
</author>
<author>
<name sortKey="Li, Z" uniqKey="Li Z">Z Li</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Gurevich, A" uniqKey="Gurevich A">A Gurevich</name>
</author>
<author>
<name sortKey="Saveliev, V" uniqKey="Saveliev V">V Saveliev</name>
</author>
<author>
<name sortKey="Vyahhi, N" uniqKey="Vyahhi N">N Vyahhi</name>
</author>
<author>
<name sortKey="Tesler, G" uniqKey="Tesler G">G Tesler</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Li, H" uniqKey="Li H">H Li</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Quinlan, Ar" uniqKey="Quinlan A">AR Quinlan</name>
</author>
<author>
<name sortKey="Hall, Im" uniqKey="Hall I">IM Hall</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Paulino, D" uniqKey="Paulino D">D Paulino</name>
</author>
<author>
<name sortKey="Warren, Rl" uniqKey="Warren R">RL Warren</name>
</author>
<author>
<name sortKey="Vandervalk, Bp" uniqKey="Vandervalk B">BP Vandervalk</name>
</author>
<author>
<name sortKey="Raymond, A" uniqKey="Raymond A">A Raymond</name>
</author>
<author>
<name sortKey="Jackman, Sd" uniqKey="Jackman S">SD Jackman</name>
</author>
<author>
<name sortKey="Birol, I" uniqKey="Birol I">I Birol</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Boetzer, M" uniqKey="Boetzer M">M Boetzer</name>
</author>
<author>
<name sortKey="Pirovano, W" uniqKey="Pirovano W">W Pirovano</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Luo, R" uniqKey="Luo R">R Luo</name>
</author>
<author>
<name sortKey="Liu, B" uniqKey="Liu B">B Liu</name>
</author>
<author>
<name sortKey="Xie, Y" uniqKey="Xie Y">Y Xie</name>
</author>
<author>
<name sortKey="Li, Z" uniqKey="Li Z">Z Li</name>
</author>
<author>
<name sortKey="Huang, W" uniqKey="Huang W">W Huang</name>
</author>
<author>
<name sortKey="Yuan, J" uniqKey="Yuan J">J Yuan</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Birol, I" uniqKey="Birol I">I Birol</name>
</author>
<author>
<name sortKey="Raymond, A" uniqKey="Raymond A">A Raymond</name>
</author>
<author>
<name sortKey="Jackman, Sd" uniqKey="Jackman S">SD Jackman</name>
</author>
<author>
<name sortKey="Pleasance, S" uniqKey="Pleasance S">S Pleasance</name>
</author>
<author>
<name sortKey="Coope, R" uniqKey="Coope R">R Coope</name>
</author>
<author>
<name sortKey="Taylor, Ga" uniqKey="Taylor G">GA Taylor</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Koboldt, Dc" uniqKey="Koboldt D">DC Koboldt</name>
</author>
<author>
<name sortKey="Chen, K" uniqKey="Chen K">K Chen</name>
</author>
<author>
<name sortKey="Wylie, T" uniqKey="Wylie T">T Wylie</name>
</author>
<author>
<name sortKey="Larson, De" uniqKey="Larson D">DE Larson</name>
</author>
<author>
<name sortKey="Mclellan, Md" uniqKey="Mclellan M">MD McLellan</name>
</author>
<author>
<name sortKey="Mardis, Er" uniqKey="Mardis E">ER Mardis</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Bartenhagen, C" uniqKey="Bartenhagen C">C Bartenhagen</name>
</author>
<author>
<name sortKey="Dugas, M" uniqKey="Dugas M">M Dugas</name>
</author>
</analytic>
</biblStruct>
</listBibl>
</div1>
</back>
</TEI>
<pmc article-type="research-article" xml:lang="en">
<pmc-dir>properties open_access</pmc-dir>
<front>
<journal-meta>
<journal-id journal-id-type="nlm-ta">BMC Med Genomics</journal-id>
<journal-id journal-id-type="iso-abbrev">BMC Med Genomics</journal-id>
<journal-title-group>
<journal-title>BMC Medical Genomics</journal-title>
</journal-title-group>
<issn pub-type="epub">1755-8794</issn>
<publisher>
<publisher-name>BioMed Central</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="pmid">26399504</article-id>
<article-id pub-id-type="pmc">4582294</article-id>
<article-id pub-id-type="publisher-id">1755-8794-8-S3-S1</article-id>
<article-id pub-id-type="doi">10.1186/1755-8794-8-S3-S1</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Research</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Konnector v2.0: pseudo-long reads from paired-end sequencing data</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" id="A1">
<name>
<surname>Vandervalk</surname>
<given-names>Benjamin P</given-names>
</name>
<xref ref-type="aff" rid="I1">1</xref>
<email>benv@bcgsc.ca</email>
</contrib>
<contrib contrib-type="author" id="A2">
<name>
<surname>Yang</surname>
<given-names>Chen</given-names>
</name>
<xref ref-type="aff" rid="I1">1</xref>
<email>cheny@bcgsc.ca</email>
</contrib>
<contrib contrib-type="author" id="A3">
<name>
<surname>Xue</surname>
<given-names>Zhuyi</given-names>
</name>
<xref ref-type="aff" rid="I1">1</xref>
<email>zxue@bcgsc.ca</email>
</contrib>
<contrib contrib-type="author" id="A4">
<name>
<surname>Raghavan</surname>
<given-names>Karthika</given-names>
</name>
<xref ref-type="aff" rid="I1">1</xref>
<email>kraghavan@bcgsc.ca</email>
</contrib>
<contrib contrib-type="author" id="A5">
<name>
<surname>Chu</surname>
<given-names>Justin</given-names>
</name>
<xref ref-type="aff" rid="I1">1</xref>
<email>cjustin@bcgsc.ca</email>
</contrib>
<contrib contrib-type="author" id="A6">
<name>
<surname>Mohamadi</surname>
<given-names>Hamid</given-names>
</name>
<xref ref-type="aff" rid="I1">1</xref>
<email>hmohamadi@bcgsc.ca</email>
</contrib>
<contrib contrib-type="author" id="A7">
<name>
<surname>Jackman</surname>
<given-names>Shaun D</given-names>
</name>
<xref ref-type="aff" rid="I1">1</xref>
<email>sjackman@bcgsc.ca</email>
</contrib>
<contrib contrib-type="author" id="A8">
<name>
<surname>Chiu</surname>
<given-names>Readman</given-names>
</name>
<xref ref-type="aff" rid="I1">1</xref>
<email>rchiu@bcgsc.ca</email>
</contrib>
<contrib contrib-type="author" id="A9">
<name>
<surname>Warren</surname>
<given-names>René L</given-names>
</name>
<xref ref-type="aff" rid="I1">1</xref>
<email>rwarren@bcgsc.ca</email>
</contrib>
<contrib contrib-type="author" corresp="yes" id="A10">
<name>
<surname>Birol</surname>
<given-names>Inanç</given-names>
</name>
<xref ref-type="aff" rid="I1">1</xref>
<email>ibirol@bcgsc.ca</email>
</contrib>
</contrib-group>
<aff id="I1">
<label>1</label>
Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC V5Z 4S6, Canada</aff>
<pub-date pub-type="collection">
<year>2015</year>
</pub-date>
<pub-date pub-type="epub">
<day>23</day>
<month>9</month>
<year>2015</year>
</pub-date>
<volume>8</volume>
<issue>Suppl 3</issue>
<supplement>
<named-content content-type="supplement-title">Selected articles from the IEEE International Conference on Bioinformatics and Biomedicine (BIBM 2014): Medical Genomics</named-content>
<named-content content-type="supplement-editor">Feng Luo</named-content>
<named-content content-type="supplement-sponsor">Publication of this supplement has not been supported by sponsorship. Information about the source of funding for the publication charges can be found in the individual articles. Articles have undergone the journal's standard peer review process for supplements. The Supplement Editor declares that they have no competing interests.</named-content>
</supplement>
<fpage>S1</fpage>
<lpage>S1</lpage>
<permissions>
<copyright-statement>Copyright © 2015 Vandervalk et al.;</copyright-statement>
<copyright-year>2015</copyright-year>
<copyright-holder>Vandervalk et al.;</copyright-holder>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/4.0">
<license-p>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (
<ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/licenses/by/4.0">http://creativecommons.org/licenses/by/4.0</ext-link>
), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (
<ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/publicdomain/zero/1.0/">http://creativecommons.org/publicdomain/zero/1.0/</ext-link>
) applies to the data made available in this article, unless otherwise stated.</license-p>
</license>
</permissions>
<self-uri xlink:href="http://www.biomedcentral.com/1755-8794/8/S3/S1"></self-uri>
<abstract>
<sec>
<title>Background</title>
<p>Reading the nucleotides from two ends of a DNA fragment is called paired-end tag (PET) sequencing. When the fragment length is longer than the combined read length, there remains a gap of unsequenced nucleotides between read pairs. If the target in such experiments is sequenced at a level to provide redundant coverage, it may be possible to bridge these gaps using bioinformatics methods. Konnector is a local
<italic>de novo </italic>
assembly tool that addresses this problem. Here we report on version 2.0 of our tool.</p>
</sec>
<sec>
<title>Results</title>
<p>Konnector uses a probabilistic and memory-efficient data structure called Bloom filter to represent a k-mer spectrum - all possible sequences of length k in an input file, such as the collection of reads in a PET sequencing experiment. It performs look-ups to this data structure to construct an implicit de Bruijn graph, which describes (k-1) base pair overlaps between adjacent k-mers. It traverses this graph to bridge the gap between a given pair of flanking sequences.</p>
</sec>
<sec>
<title>Conclusions</title>
<p>Here we report the performance of Konnector v2.0 on simulated and experimental datasets, and compare it against other tools with similar functionality. We note that, representing k-mers with 1.5 bytes of memory on average, Konnector can scale to very large genomes. With our parallel implementation, it can also process over a billion bases on commodity hardware.</p>
</sec>
</abstract>
<kwd-group>
<kwd>Bloom filter</kwd>
<kwd>de Bruijn graph</kwd>
<kwd>paired-end sequencing</kwd>
<kwd>
<italic>de novo </italic>
</kwd>
<kwd>genome assembly</kwd>
</kwd-group>
<conference>
<conf-date>2-5 November 2014</conf-date>
<conf-name>IEEE International Conference on Bioinformatics and Biomedicine (BIBM 2014)</conf-name>
<conf-loc>Belfast, UK</conf-loc>
</conference>
</article-meta>
</front>
</pmc>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Sante/explor/MersV1/Data/Pmc/Curation
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000959 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Pmc/Curation/biblio.hfd -nk 000959 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Sante
   |area=    MersV1
   |flux=    Pmc
   |étape=   Curation
   |type=    RBID
   |clé=     PMC:4582294
   |texte=   Konnector v2.0: pseudo-long reads from paired-end sequencing data
}}

Pour générer des pages wiki

HfdIndexSelect -h $EXPLOR_AREA/Data/Pmc/Curation/RBID.i   -Sk "pubmed:26399504" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Pmc/Curation/biblio.hfd   \
       | NlmPubMed2Wicri -a MersV1 

Wicri

This area was generated with Dilib version V0.6.33.
Data generation: Mon Apr 20 23:26:43 2020. Site generation: Sat Mar 27 09:06:09 2021