Accurate genotyping across variant classes and lengths using variant graphs.
Identifieur interne : 000A76 ( Main/Curation ); précédent : 000A75; suivant : 000A77Accurate genotyping across variant classes and lengths using variant graphs.
Auteurs : Jonas Andreas Sibbesen [Danemark] ; Lasse Maretty [Danemark] ; Anders Krogh [Danemark]Source :
- Nature genetics [ 1546-1718 ] ; 2018.
Descripteurs français
- KwdFr :
- MESH :
English descriptors
- KwdEn :
- MESH :
- genetics : Genetic Variation, Genome, Human.
- methods : High-Throughput Nucleotide Sequencing, Sequence Analysis, DNA.
- Genotype, Humans.
Abstract
Genotype estimates from short-read sequencing data are typically based on the alignment of reads to a linear reference, but reads originating from more complex variants (for example, structural variants) often align poorly, resulting in biased genotype estimates. This bias can be mitigated by first collecting a set of candidate variants across discovery methods, individuals and databases, and then realigning the reads to the variants and reference simultaneously. However, this realignment problem has proved computationally difficult. Here, we present a new method (BayesTyper) that uses exact alignment of read k-mers to a graph representation of the reference and variants to efficiently perform unbiased, probabilistic genotyping across the variation spectrum. We demonstrate that BayesTyper generally provides superior variant sensitivity and genotyping accuracy relative to existing methods when used to integrate variants across discovery approaches and individuals. Finally, we demonstrate that including a 'variation-prior' database containing already known variants significantly improves sensitivity.
DOI: 10.1038/s41588-018-0145-5
PubMed: 29915429
Links toward previous steps (curation, corpus...)
- to stream PubMed, to step Corpus: Pour aller vers cette notice dans l'étape Curation :000863
- to stream PubMed, to step Curation: Pour aller vers cette notice dans l'étape Curation :000863
- to stream PubMed, to step Checkpoint: Pour aller vers cette notice dans l'étape Curation :000A25
- to stream Ncbi, to step Merge: Pour aller vers cette notice dans l'étape Curation :001E70
- to stream Ncbi, to step Curation: Pour aller vers cette notice dans l'étape Curation :001E70
- to stream Ncbi, to step Checkpoint: Pour aller vers cette notice dans l'étape Curation :001E70
- to stream Main, to step Merge: Pour aller vers cette notice dans l'étape Curation :000A79
Links to Exploration step
pubmed:29915429Le document en format XML
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en">Accurate genotyping across variant classes and lengths using variant graphs.</title>
<author><name sortKey="Sibbesen, Jonas Andreas" sort="Sibbesen, Jonas Andreas" uniqKey="Sibbesen J" first="Jonas Andreas" last="Sibbesen">Jonas Andreas Sibbesen</name>
<affiliation wicri:level="3"><nlm:affiliation>The Bioinformatics Centre, Department of Biology, University of Copenhagen, Copenhagen, Denmark.</nlm:affiliation>
<country xml:lang="fr">Danemark</country>
<wicri:regionArea>The Bioinformatics Centre, Department of Biology, University of Copenhagen, Copenhagen</wicri:regionArea>
<placeName><settlement type="city">Copenhague</settlement>
<region type="région" nuts="2">Hovedstaden</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Maretty, Lasse" sort="Maretty, Lasse" uniqKey="Maretty L" first="Lasse" last="Maretty">Lasse Maretty</name>
<affiliation wicri:level="3"><nlm:affiliation>The Bioinformatics Centre, Department of Biology, University of Copenhagen, Copenhagen, Denmark.</nlm:affiliation>
<country xml:lang="fr">Danemark</country>
<wicri:regionArea>The Bioinformatics Centre, Department of Biology, University of Copenhagen, Copenhagen</wicri:regionArea>
<placeName><settlement type="city">Copenhague</settlement>
<region type="région" nuts="2">Hovedstaden</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Krogh, Anders" sort="Krogh, Anders" uniqKey="Krogh A" first="Anders" last="Krogh">Anders Krogh</name>
<affiliation wicri:level="3"><nlm:affiliation>The Bioinformatics Centre, Department of Biology, University of Copenhagen, Copenhagen, Denmark. krogh@binf.ku.dk.</nlm:affiliation>
<country xml:lang="fr">Danemark</country>
<wicri:regionArea>The Bioinformatics Centre, Department of Biology, University of Copenhagen, Copenhagen</wicri:regionArea>
<placeName><settlement type="city">Copenhague</settlement>
<region type="région" nuts="2">Hovedstaden</region>
</placeName>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">PubMed</idno>
<date when="2018">2018</date>
<idno type="RBID">pubmed:29915429</idno>
<idno type="pmid">29915429</idno>
<idno type="doi">10.1038/s41588-018-0145-5</idno>
<idno type="wicri:Area/PubMed/Corpus">000863</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Corpus" wicri:corpus="PubMed">000863</idno>
<idno type="wicri:Area/PubMed/Curation">000863</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Curation">000863</idno>
<idno type="wicri:Area/PubMed/Checkpoint">000A25</idno>
<idno type="wicri:explorRef" wicri:stream="Checkpoint" wicri:step="PubMed">000A25</idno>
<idno type="wicri:Area/Ncbi/Merge">001E70</idno>
<idno type="wicri:Area/Ncbi/Curation">001E70</idno>
<idno type="wicri:Area/Ncbi/Checkpoint">001E70</idno>
<idno type="wicri:Area/Main/Merge">000A79</idno>
<idno type="wicri:Area/Main/Curation">000A76</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en">Accurate genotyping across variant classes and lengths using variant graphs.</title>
<author><name sortKey="Sibbesen, Jonas Andreas" sort="Sibbesen, Jonas Andreas" uniqKey="Sibbesen J" first="Jonas Andreas" last="Sibbesen">Jonas Andreas Sibbesen</name>
<affiliation wicri:level="3"><nlm:affiliation>The Bioinformatics Centre, Department of Biology, University of Copenhagen, Copenhagen, Denmark.</nlm:affiliation>
<country xml:lang="fr">Danemark</country>
<wicri:regionArea>The Bioinformatics Centre, Department of Biology, University of Copenhagen, Copenhagen</wicri:regionArea>
<placeName><settlement type="city">Copenhague</settlement>
<region type="région" nuts="2">Hovedstaden</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Maretty, Lasse" sort="Maretty, Lasse" uniqKey="Maretty L" first="Lasse" last="Maretty">Lasse Maretty</name>
<affiliation wicri:level="3"><nlm:affiliation>The Bioinformatics Centre, Department of Biology, University of Copenhagen, Copenhagen, Denmark.</nlm:affiliation>
<country xml:lang="fr">Danemark</country>
<wicri:regionArea>The Bioinformatics Centre, Department of Biology, University of Copenhagen, Copenhagen</wicri:regionArea>
<placeName><settlement type="city">Copenhague</settlement>
<region type="région" nuts="2">Hovedstaden</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Krogh, Anders" sort="Krogh, Anders" uniqKey="Krogh A" first="Anders" last="Krogh">Anders Krogh</name>
<affiliation wicri:level="3"><nlm:affiliation>The Bioinformatics Centre, Department of Biology, University of Copenhagen, Copenhagen, Denmark. krogh@binf.ku.dk.</nlm:affiliation>
<country xml:lang="fr">Danemark</country>
<wicri:regionArea>The Bioinformatics Centre, Department of Biology, University of Copenhagen, Copenhagen</wicri:regionArea>
<placeName><settlement type="city">Copenhague</settlement>
<region type="région" nuts="2">Hovedstaden</region>
</placeName>
</affiliation>
</author>
</analytic>
<series><title level="j">Nature genetics</title>
<idno type="eISSN">1546-1718</idno>
<imprint><date when="2018" type="published">2018</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Genetic Variation (genetics)</term>
<term>Genome, Human (genetics)</term>
<term>Genotype</term>
<term>High-Throughput Nucleotide Sequencing (methods)</term>
<term>Humans</term>
<term>Sequence Analysis, DNA (methods)</term>
</keywords>
<keywords scheme="KwdFr" xml:lang="fr"><term>Analyse de séquence d'ADN ()</term>
<term>Génome humain (génétique)</term>
<term>Génotype</term>
<term>Humains</term>
<term>Séquençage nucléotidique à haut débit ()</term>
<term>Variation génétique (génétique)</term>
</keywords>
<keywords scheme="MESH" qualifier="genetics" xml:lang="en"><term>Genetic Variation</term>
<term>Genome, Human</term>
</keywords>
<keywords scheme="MESH" qualifier="génétique" xml:lang="fr"><term>Génome humain</term>
<term>Variation génétique</term>
</keywords>
<keywords scheme="MESH" qualifier="methods" xml:lang="en"><term>High-Throughput Nucleotide Sequencing</term>
<term>Sequence Analysis, DNA</term>
</keywords>
<keywords scheme="MESH" xml:lang="en"><term>Genotype</term>
<term>Humans</term>
</keywords>
<keywords scheme="MESH" xml:lang="fr"><term>Analyse de séquence d'ADN</term>
<term>Génotype</term>
<term>Humains</term>
<term>Séquençage nucléotidique à haut débit</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Genotype estimates from short-read sequencing data are typically based on the alignment of reads to a linear reference, but reads originating from more complex variants (for example, structural variants) often align poorly, resulting in biased genotype estimates. This bias can be mitigated by first collecting a set of candidate variants across discovery methods, individuals and databases, and then realigning the reads to the variants and reference simultaneously. However, this realignment problem has proved computationally difficult. Here, we present a new method (BayesTyper) that uses exact alignment of read k-mers to a graph representation of the reference and variants to efficiently perform unbiased, probabilistic genotyping across the variation spectrum. We demonstrate that BayesTyper generally provides superior variant sensitivity and genotyping accuracy relative to existing methods when used to integrate variants across discovery approaches and individuals. Finally, we demonstrate that including a 'variation-prior' database containing already known variants significantly improves sensitivity.</div>
</front>
</TEI>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Sante/explor/MersV1/Data/Main/Curation
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000A76 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Main/Curation/biblio.hfd -nk 000A76 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Sante |area= MersV1 |flux= Main |étape= Curation |type= RBID |clé= pubmed:29915429 |texte= Accurate genotyping across variant classes and lengths using variant graphs. }}
Pour générer des pages wiki
HfdIndexSelect -h $EXPLOR_AREA/Data/Main/Curation/RBID.i -Sk "pubmed:29915429" \ | HfdSelect -Kh $EXPLOR_AREA/Data/Main/Curation/biblio.hfd \ | NlmPubMed2Wicri -a MersV1
This area was generated with Dilib version V0.6.33. |