Serveur d'exploration MERS

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Accelerating metagenomic read classification on CUDA-enabled GPUs

Identifieur interne : 000262 ( Pmc/Curation ); précédent : 000261; suivant : 000263

Accelerating metagenomic read classification on CUDA-enabled GPUs

Auteurs : Robin Kobus [Allemagne] ; Christian Hundt [Allemagne] ; André Müller [Allemagne] ; Bertil Schmidt [Allemagne]

Source :

RBID : PMC:5209836

Abstract

Background

Metagenomic sequencing studies are becoming increasingly popular with prominent examples including the sequencing of human microbiomes and diverse environments. A fundamental computational problem in this context is read classification; i.e. the assignment of each read to a taxonomic label. Due to the large number of reads produced by modern high-throughput sequencing technologies and the rapidly increasing number of available reference genomes software tools for fast and accurate metagenomic read classification are urgently needed.

Results

We present cuCLARK, a read-level classifier for CUDA-enabled GPUs, based on the fast and accurate classification of metagenomic sequences using reduced k-mers (CLARK) method. Using the processing power of a single Titan X GPU, cuCLARK can reach classification speeds of up to 50 million reads per minute. Corresponding speedups for species- (genus-)level classification range between 3.2 and 6.6 (3.7 and 6.4) compared to multi-threaded CLARK executed on a 16-core Xeon CPU workstation.

Conclusion

cuCLARK can perform metagenomic read classification at superior speeds on CUDA-enabled GPUs. It is free software licensed under GPL and can be downloaded at https://github.com/funatiq/cuclark free of charge.


Url:
DOI: 10.1186/s12859-016-1434-6
PubMed: 28049411
PubMed Central: 5209836

Links toward previous steps (curation, corpus...)


Links to Exploration step

PMC:5209836

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Accelerating metagenomic read classification on CUDA-enabled GPUs</title>
<author>
<name sortKey="Kobus, Robin" sort="Kobus, Robin" uniqKey="Kobus R" first="Robin" last="Kobus">Robin Kobus</name>
<affiliation wicri:level="1">
<nlm:aff id="Aff1">Institute of Computer Science, Johannes Gutenberg University Mainz, Staudingerweg 9, Mainz, 55435 Germany</nlm:aff>
<country xml:lang="fr">Allemagne</country>
<wicri:regionArea>Institute of Computer Science, Johannes Gutenberg University Mainz, Staudingerweg 9, Mainz</wicri:regionArea>
</affiliation>
</author>
<author>
<name sortKey="Hundt, Christian" sort="Hundt, Christian" uniqKey="Hundt C" first="Christian" last="Hundt">Christian Hundt</name>
<affiliation wicri:level="1">
<nlm:aff id="Aff1">Institute of Computer Science, Johannes Gutenberg University Mainz, Staudingerweg 9, Mainz, 55435 Germany</nlm:aff>
<country xml:lang="fr">Allemagne</country>
<wicri:regionArea>Institute of Computer Science, Johannes Gutenberg University Mainz, Staudingerweg 9, Mainz</wicri:regionArea>
</affiliation>
</author>
<author>
<name sortKey="Muller, Andre" sort="Muller, Andre" uniqKey="Muller A" first="André" last="Müller">André Müller</name>
<affiliation wicri:level="1">
<nlm:aff id="Aff1">Institute of Computer Science, Johannes Gutenberg University Mainz, Staudingerweg 9, Mainz, 55435 Germany</nlm:aff>
<country xml:lang="fr">Allemagne</country>
<wicri:regionArea>Institute of Computer Science, Johannes Gutenberg University Mainz, Staudingerweg 9, Mainz</wicri:regionArea>
</affiliation>
</author>
<author>
<name sortKey="Schmidt, Bertil" sort="Schmidt, Bertil" uniqKey="Schmidt B" first="Bertil" last="Schmidt">Bertil Schmidt</name>
<affiliation wicri:level="1">
<nlm:aff id="Aff1">Institute of Computer Science, Johannes Gutenberg University Mainz, Staudingerweg 9, Mainz, 55435 Germany</nlm:aff>
<country xml:lang="fr">Allemagne</country>
<wicri:regionArea>Institute of Computer Science, Johannes Gutenberg University Mainz, Staudingerweg 9, Mainz</wicri:regionArea>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PMC</idno>
<idno type="pmid">28049411</idno>
<idno type="pmc">5209836</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5209836</idno>
<idno type="RBID">PMC:5209836</idno>
<idno type="doi">10.1186/s12859-016-1434-6</idno>
<date when="2017">2017</date>
<idno type="wicri:Area/Pmc/Corpus">000262</idno>
<idno type="wicri:explorRef" wicri:stream="Pmc" wicri:step="Corpus" wicri:corpus="PMC">000262</idno>
<idno type="wicri:Area/Pmc/Curation">000262</idno>
<idno type="wicri:explorRef" wicri:stream="Pmc" wicri:step="Curation">000262</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a" type="main">Accelerating metagenomic read classification on CUDA-enabled GPUs</title>
<author>
<name sortKey="Kobus, Robin" sort="Kobus, Robin" uniqKey="Kobus R" first="Robin" last="Kobus">Robin Kobus</name>
<affiliation wicri:level="1">
<nlm:aff id="Aff1">Institute of Computer Science, Johannes Gutenberg University Mainz, Staudingerweg 9, Mainz, 55435 Germany</nlm:aff>
<country xml:lang="fr">Allemagne</country>
<wicri:regionArea>Institute of Computer Science, Johannes Gutenberg University Mainz, Staudingerweg 9, Mainz</wicri:regionArea>
</affiliation>
</author>
<author>
<name sortKey="Hundt, Christian" sort="Hundt, Christian" uniqKey="Hundt C" first="Christian" last="Hundt">Christian Hundt</name>
<affiliation wicri:level="1">
<nlm:aff id="Aff1">Institute of Computer Science, Johannes Gutenberg University Mainz, Staudingerweg 9, Mainz, 55435 Germany</nlm:aff>
<country xml:lang="fr">Allemagne</country>
<wicri:regionArea>Institute of Computer Science, Johannes Gutenberg University Mainz, Staudingerweg 9, Mainz</wicri:regionArea>
</affiliation>
</author>
<author>
<name sortKey="Muller, Andre" sort="Muller, Andre" uniqKey="Muller A" first="André" last="Müller">André Müller</name>
<affiliation wicri:level="1">
<nlm:aff id="Aff1">Institute of Computer Science, Johannes Gutenberg University Mainz, Staudingerweg 9, Mainz, 55435 Germany</nlm:aff>
<country xml:lang="fr">Allemagne</country>
<wicri:regionArea>Institute of Computer Science, Johannes Gutenberg University Mainz, Staudingerweg 9, Mainz</wicri:regionArea>
</affiliation>
</author>
<author>
<name sortKey="Schmidt, Bertil" sort="Schmidt, Bertil" uniqKey="Schmidt B" first="Bertil" last="Schmidt">Bertil Schmidt</name>
<affiliation wicri:level="1">
<nlm:aff id="Aff1">Institute of Computer Science, Johannes Gutenberg University Mainz, Staudingerweg 9, Mainz, 55435 Germany</nlm:aff>
<country xml:lang="fr">Allemagne</country>
<wicri:regionArea>Institute of Computer Science, Johannes Gutenberg University Mainz, Staudingerweg 9, Mainz</wicri:regionArea>
</affiliation>
</author>
</analytic>
<series>
<title level="j">BMC Bioinformatics</title>
<idno type="eISSN">1471-2105</idno>
<imprint>
<date when="2017">2017</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">
<sec>
<title>Background</title>
<p>Metagenomic sequencing studies are becoming increasingly popular with prominent examples including the sequencing of human microbiomes and diverse environments. A fundamental computational problem in this context is read classification; i.e. the assignment of each read to a taxonomic label. Due to the large number of reads produced by modern high-throughput sequencing technologies and the rapidly increasing number of available reference genomes software tools for fast and accurate metagenomic read classification are urgently needed.</p>
</sec>
<sec>
<title>Results</title>
<p>We present cuCLARK, a read-level classifier for CUDA-enabled GPUs, based on the fast and accurate classification of metagenomic sequences using reduced
<italic>k</italic>
-mers (CLARK) method. Using the processing power of a single Titan X GPU, cuCLARK can reach classification speeds of up to 50 million reads per minute. Corresponding speedups for species- (genus-)level classification range between 3.2 and 6.6 (3.7 and 6.4) compared to multi-threaded CLARK executed on a 16-core Xeon CPU workstation.</p>
</sec>
<sec>
<title>Conclusion</title>
<p>cuCLARK can perform metagenomic read classification at superior speeds on CUDA-enabled GPUs. It is free software licensed under GPL and can be downloaded at
<ext-link ext-link-type="uri" xlink:href="https://github.com/funatiq/cuclark">https://github.com/funatiq/cuclark</ext-link>
free of charge.</p>
</sec>
</div>
</front>
<back>
<div1 type="bibliography">
<listBibl>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Ounit, R" uniqKey="Ounit R">R Ounit</name>
</author>
<author>
<name sortKey="Wanamaker, S" uniqKey="Wanamaker S">S Wanamaker</name>
</author>
<author>
<name sortKey="Close, Tj" uniqKey="Close T">TJ Close</name>
</author>
<author>
<name sortKey="Lonardi, S" uniqKey="Lonardi S">S Lonardi</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Wilton, R" uniqKey="Wilton R">R Wilton</name>
</author>
<author>
<name sortKey="Budavari, T" uniqKey="Budavari T">T Budavari</name>
</author>
<author>
<name sortKey="Langmead, B" uniqKey="Langmead B">B Langmead</name>
</author>
<author>
<name sortKey="Wheelan, Sj" uniqKey="Wheelan S">SJ Wheelan</name>
</author>
<author>
<name sortKey="Salzberg, Sl" uniqKey="Salzberg S">SL Salzberg</name>
</author>
<author>
<name sortKey="Szalay, As" uniqKey="Szalay A">AS Szalay</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Liu, Y" uniqKey="Liu Y">Y Liu</name>
</author>
<author>
<name sortKey="Schmidt, B" uniqKey="Schmidt B">B Schmidt</name>
</author>
<author>
<name sortKey="Maskell, Dl" uniqKey="Maskell D">DL Maskell</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Li, D" uniqKey="Li D">D Li</name>
</author>
<author>
<name sortKey="Liu, Cm" uniqKey="Liu C">CM Liu</name>
</author>
<author>
<name sortKey="Luo, R" uniqKey="Luo R">R Luo</name>
</author>
<author>
<name sortKey="Sadakane, K" uniqKey="Sadakane K">K Sadakane</name>
</author>
<author>
<name sortKey="Lam, Tw" uniqKey="Lam T">TW Lam</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Huang, W" uniqKey="Huang W">W Huang</name>
</author>
<author>
<name sortKey="Li, L" uniqKey="Li L">L Li</name>
</author>
<author>
<name sortKey="Myers, Jr" uniqKey="Myers J">JR Myers</name>
</author>
<author>
<name sortKey="Marth, Gt" uniqKey="Marth G">GT Marth</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Shcherbina, A" uniqKey="Shcherbina A">A Shcherbina</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
</listBibl>
</div1>
</back>
</TEI>
<pmc article-type="research-article">
<pmc-dir>properties open_access</pmc-dir>
<front>
<journal-meta>
<journal-id journal-id-type="nlm-ta">BMC Bioinformatics</journal-id>
<journal-id journal-id-type="iso-abbrev">BMC Bioinformatics</journal-id>
<journal-title-group>
<journal-title>BMC Bioinformatics</journal-title>
</journal-title-group>
<issn pub-type="epub">1471-2105</issn>
<publisher>
<publisher-name>BioMed Central</publisher-name>
<publisher-loc>London</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="pmid">28049411</article-id>
<article-id pub-id-type="pmc">5209836</article-id>
<article-id pub-id-type="publisher-id">1434</article-id>
<article-id pub-id-type="doi">10.1186/s12859-016-1434-6</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Software</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Accelerating metagenomic read classification on CUDA-enabled GPUs</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Kobus</surname>
<given-names>Robin</given-names>
</name>
<address>
<email>rkobus@students.uni-mainz.de</email>
</address>
<xref ref-type="aff" rid="Aff1"></xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Hundt</surname>
<given-names>Christian</given-names>
</name>
<address>
<email>hundt@uni-mainz.de</email>
</address>
<xref ref-type="aff" rid="Aff1"></xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Müller</surname>
<given-names>André</given-names>
</name>
<address>
<email>muellan@uni-mainz.de</email>
</address>
<xref ref-type="aff" rid="Aff1"></xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Schmidt</surname>
<given-names>Bertil</given-names>
</name>
<address>
<email>bertil.schmidt@uni-mainz.de</email>
</address>
<xref ref-type="aff" rid="Aff1"></xref>
</contrib>
<aff id="Aff1">Institute of Computer Science, Johannes Gutenberg University Mainz, Staudingerweg 9, Mainz, 55435 Germany</aff>
</contrib-group>
<pub-date pub-type="epub">
<day>3</day>
<month>1</month>
<year>2017</year>
</pub-date>
<pub-date pub-type="pmc-release">
<day>3</day>
<month>1</month>
<year>2017</year>
</pub-date>
<pub-date pub-type="collection">
<year>2017</year>
</pub-date>
<volume>18</volume>
<elocation-id>11</elocation-id>
<history>
<date date-type="received">
<day>9</day>
<month>9</month>
<year>2016</year>
</date>
<date date-type="accepted">
<day>16</day>
<month>12</month>
<year>2016</year>
</date>
</history>
<permissions>
<copyright-statement>© The Author(s) 2017</copyright-statement>
<license license-type="OpenAccess">
<license-p>
<bold>Open Access</bold>
This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (
<ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/licenses/by/4.0/">http://creativecommons.org/licenses/by/4.0/</ext-link>
), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (
<ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/publicdomain/zero/1.0/">http://creativecommons.org/publicdomain/zero/1.0/</ext-link>
) applies to the data made available in this article, unless otherwise stated.</license-p>
</license>
</permissions>
<abstract id="Abs1">
<sec>
<title>Background</title>
<p>Metagenomic sequencing studies are becoming increasingly popular with prominent examples including the sequencing of human microbiomes and diverse environments. A fundamental computational problem in this context is read classification; i.e. the assignment of each read to a taxonomic label. Due to the large number of reads produced by modern high-throughput sequencing technologies and the rapidly increasing number of available reference genomes software tools for fast and accurate metagenomic read classification are urgently needed.</p>
</sec>
<sec>
<title>Results</title>
<p>We present cuCLARK, a read-level classifier for CUDA-enabled GPUs, based on the fast and accurate classification of metagenomic sequences using reduced
<italic>k</italic>
-mers (CLARK) method. Using the processing power of a single Titan X GPU, cuCLARK can reach classification speeds of up to 50 million reads per minute. Corresponding speedups for species- (genus-)level classification range between 3.2 and 6.6 (3.7 and 6.4) compared to multi-threaded CLARK executed on a 16-core Xeon CPU workstation.</p>
</sec>
<sec>
<title>Conclusion</title>
<p>cuCLARK can perform metagenomic read classification at superior speeds on CUDA-enabled GPUs. It is free software licensed under GPL and can be downloaded at
<ext-link ext-link-type="uri" xlink:href="https://github.com/funatiq/cuclark">https://github.com/funatiq/cuclark</ext-link>
free of charge.</p>
</sec>
</abstract>
<kwd-group xml:lang="en">
<title>Keywords</title>
<kwd>Metagenomics</kwd>
<kwd>Taxonomic assignment</kwd>
<kwd>Exact
<italic>k</italic>
-mer matching</kwd>
<kwd>CUDA</kwd>
<kwd>GPUs</kwd>
</kwd-group>
<custom-meta-group>
<custom-meta>
<meta-name>issue-copyright-statement</meta-name>
<meta-value>© The Author(s) 2017</meta-value>
</custom-meta>
</custom-meta-group>
</article-meta>
</front>
</pmc>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Sante/explor/MersV1/Data/Pmc/Curation
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000262 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Pmc/Curation/biblio.hfd -nk 000262 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Sante
   |area=    MersV1
   |flux=    Pmc
   |étape=   Curation
   |type=    RBID
   |clé=     PMC:5209836
   |texte=   Accelerating metagenomic read classification on CUDA-enabled GPUs
}}

Pour générer des pages wiki

HfdIndexSelect -h $EXPLOR_AREA/Data/Pmc/Curation/RBID.i   -Sk "pubmed:28049411" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Pmc/Curation/biblio.hfd   \
       | NlmPubMed2Wicri -a MersV1 

Wicri

This area was generated with Dilib version V0.6.33.
Data generation: Mon Apr 20 23:26:43 2020. Site generation: Sat Mar 27 09:06:09 2021