MERS exploration server

Warning: this site is under development.
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Scaffolding a Caenorhabditis nematode genome with RNA-seq.

Internal identifier: 002611 (Main/Exploration); previous: 002610; next: 002612

Authors: Ali Mortazavi [United States]; Erich M. Schwarz; Brian Williams; Lorian Schaeffer; Igor Antoshechkin; Barbara J. Wold; Paul W. Sternberg

Source: Genome research (eISSN 1549-5469), 2010

RBID : pubmed:20980554

French descriptors

English descriptors

Abstract

Efficient sequencing of animal and plant genomes by next-generation technology should allow many neglected organisms of biological and medical importance to be better understood. As a test case, we have assembled a draft genome of Caenorhabditis sp. 3 PS1010 through a combination of direct sequencing and scaffolding with RNA-seq. We first sequenced genomic DNA and mixed-stage cDNA using paired 75-nt reads from an Illumina GAII. A set of 230 million genomic reads yielded an 80-Mb assembly, with a supercontig N50 of 5.0 kb, covering 90% of 429 kb from previously published genomic contigs. Mixed-stage poly(A)(+) cDNA gave 47.3 million mappable 75-mers (including 5.1 million spliced reads), which separately assembled into 17.8 Mb of cDNA, with an N50 of 1.06 kb. By further scaffolding our genomic supercontigs with cDNA, we increased their N50 to 9.4 kb, nearly double the average gene size in C. elegans. We predicted 22,851 protein-coding genes, and detected expression in 78% of them. Multigenome alignment and data filtering identified 2672 DNA elements conserved between PS1010 and C. elegans that are likely to encode regulatory sequences or previously unknown ncRNAs. Genomic and cDNA sequencing followed by joint assembly is a rapid and useful strategy for biological analysis.

DOI: 10.1101/gr.111021.110
PubMed: 20980554
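
The abstract reports assembly contiguity as N50 (5.0 kb before and 9.4 kb after cDNA scaffolding). As a reminder of what that statistic measures, here is a minimal sketch of the usual definition: the length L such that sequences of length L or longer contain at least half of the total assembled bases. The function name `n50` and the toy lengths are hypothetical illustrations, not the authors' pipeline or data.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length of the shortest sequence in the smallest set of
    longest sequences that together cover at least half of the total.
    """
    if not lengths:
        raise ValueError("need at least one sequence length")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Hypothetical supercontig lengths (arbitrary units), for illustration only.
toy_lengths = [2, 3, 4, 5, 6, 7, 8, 9, 10]
print(n50(toy_lengths))
```

With these toy values the total is 54, and the three longest sequences (10, 9, 8) already reach half of it, so the N50 is 8. Joining contigs into longer scaffolds raises the statistic without changing the total assembled length much.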


Affiliations:


The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Scaffolding a Caenorhabditis nematode genome with RNA-seq.</title>
<author>
<name sortKey="Mortazavi, Ali" sort="Mortazavi, Ali" uniqKey="Mortazavi A" first="Ali" last="Mortazavi">Ali Mortazavi</name>
<affiliation wicri:level="1">
<nlm:affiliation>Division of Biology, California Institute of Technology, Pasadena, California 91125, USA.</nlm:affiliation>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Division of Biology, California Institute of Technology, Pasadena, California 91125</wicri:regionArea>
<wicri:noRegion>California 91125</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Schwarz, Erich M" sort="Schwarz, Erich M" uniqKey="Schwarz E" first="Erich M" last="Schwarz">Erich M. Schwarz</name>
</author>
<author>
<name sortKey="Williams, Brian" sort="Williams, Brian" uniqKey="Williams B" first="Brian" last="Williams">Brian Williams</name>
</author>
<author>
<name sortKey="Schaeffer, Lorian" sort="Schaeffer, Lorian" uniqKey="Schaeffer L" first="Lorian" last="Schaeffer">Lorian Schaeffer</name>
</author>
<author>
<name sortKey="Antoshechkin, Igor" sort="Antoshechkin, Igor" uniqKey="Antoshechkin I" first="Igor" last="Antoshechkin">Igor Antoshechkin</name>
</author>
<author>
<name sortKey="Wold, Barbara J" sort="Wold, Barbara J" uniqKey="Wold B" first="Barbara J" last="Wold">Barbara J. Wold</name>
</author>
<author>
<name sortKey="Sternberg, Paul W" sort="Sternberg, Paul W" uniqKey="Sternberg P" first="Paul W" last="Sternberg">Paul W. Sternberg</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PubMed</idno>
<date when="2010">2010</date>
<idno type="RBID">pubmed:20980554</idno>
<idno type="pmid">20980554</idno>
<idno type="doi">10.1101/gr.111021.110</idno>
<idno type="wicri:Area/PubMed/Corpus">001F27</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Corpus" wicri:corpus="PubMed">001F27</idno>
<idno type="wicri:Area/PubMed/Curation">001F27</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Curation">001F27</idno>
<idno type="wicri:Area/PubMed/Checkpoint">001E20</idno>
<idno type="wicri:explorRef" wicri:stream="Checkpoint" wicri:step="PubMed">001E20</idno>
<idno type="wicri:Area/Ncbi/Merge">000785</idno>
<idno type="wicri:Area/Ncbi/Curation">000785</idno>
<idno type="wicri:Area/Ncbi/Checkpoint">000785</idno>
<idno type="wicri:Area/Main/Merge">002636</idno>
<idno type="wicri:Area/Main/Curation">002611</idno>
<idno type="wicri:Area/Main/Exploration">002611</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Scaffolding a Caenorhabditis nematode genome with RNA-seq.</title>
<author>
<name sortKey="Mortazavi, Ali" sort="Mortazavi, Ali" uniqKey="Mortazavi A" first="Ali" last="Mortazavi">Ali Mortazavi</name>
<affiliation wicri:level="1">
<nlm:affiliation>Division of Biology, California Institute of Technology, Pasadena, California 91125, USA.</nlm:affiliation>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Division of Biology, California Institute of Technology, Pasadena, California 91125</wicri:regionArea>
<wicri:noRegion>California 91125</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Schwarz, Erich M" sort="Schwarz, Erich M" uniqKey="Schwarz E" first="Erich M" last="Schwarz">Erich M. Schwarz</name>
</author>
<author>
<name sortKey="Williams, Brian" sort="Williams, Brian" uniqKey="Williams B" first="Brian" last="Williams">Brian Williams</name>
</author>
<author>
<name sortKey="Schaeffer, Lorian" sort="Schaeffer, Lorian" uniqKey="Schaeffer L" first="Lorian" last="Schaeffer">Lorian Schaeffer</name>
</author>
<author>
<name sortKey="Antoshechkin, Igor" sort="Antoshechkin, Igor" uniqKey="Antoshechkin I" first="Igor" last="Antoshechkin">Igor Antoshechkin</name>
</author>
<author>
<name sortKey="Wold, Barbara J" sort="Wold, Barbara J" uniqKey="Wold B" first="Barbara J" last="Wold">Barbara J. Wold</name>
</author>
<author>
<name sortKey="Sternberg, Paul W" sort="Sternberg, Paul W" uniqKey="Sternberg P" first="Paul W" last="Sternberg">Paul W. Sternberg</name>
</author>
</analytic>
<series>
<title level="j">Genome research</title>
<idno type="eISSN">1549-5469</idno>
<imprint>
<date when="2010" type="published">2010</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Animals</term>
<term>Base Sequence</term>
<term>Caenorhabditis (genetics)</term>
<term>Conserved Sequence (genetics)</term>
<term>DNA, Complementary (genetics)</term>
<term>Genome (genetics)</term>
<term>Genomics (methods)</term>
<term>Molecular Sequence Data</term>
<term>Phylogeny</term>
<term>Sequence Alignment</term>
<term>Sequence Analysis, DNA (methods)</term>
<term>Software</term>
</keywords>
<keywords scheme="KwdFr" xml:lang="fr">
<term>ADN complémentaire (génétique)</term>
<term>Alignement de séquences</term>
<term>Analyse de séquence d'ADN (méthodes)</term>
<term>Animaux</term>
<term>Caenorhabditis (génétique)</term>
<term>Données de séquences moléculaires</term>
<term>Génome (génétique)</term>
<term>Génomique (méthodes)</term>
<term>Logiciel</term>
<term>Phylogénie</term>
<term>Séquence conservée (génétique)</term>
<term>Séquence nucléotidique</term>
</keywords>
<keywords scheme="MESH" type="chemical" qualifier="genetics" xml:lang="en">
<term>DNA, Complementary</term>
</keywords>
<keywords scheme="MESH" qualifier="genetics" xml:lang="en">
<term>Caenorhabditis</term>
<term>Conserved Sequence</term>
<term>Genome</term>
</keywords>
<keywords scheme="MESH" qualifier="génétique" xml:lang="fr">
<term>ADN complémentaire</term>
<term>Caenorhabditis</term>
<term>Génome</term>
<term>Séquence conservée</term>
</keywords>
<keywords scheme="MESH" qualifier="methods" xml:lang="en">
<term>Genomics</term>
<term>Sequence Analysis, DNA</term>
</keywords>
<keywords scheme="MESH" xml:lang="en">
<term>Animals</term>
<term>Base Sequence</term>
<term>Molecular Sequence Data</term>
<term>Phylogeny</term>
<term>Sequence Alignment</term>
<term>Software</term>
</keywords>
<keywords scheme="MESH" xml:lang="fr">
<term>Alignement de séquences</term>
<term>Analyse de séquence d'ADN</term>
<term>Animaux</term>
<term>Données de séquences moléculaires</term>
<term>Génomique</term>
<term>Logiciel</term>
<term>Phylogénie</term>
<term>Séquence nucléotidique</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Efficient sequencing of animal and plant genomes by next-generation technology should allow many neglected organisms of biological and medical importance to be better understood. As a test case, we have assembled a draft genome of Caenorhabditis sp. 3 PS1010 through a combination of direct sequencing and scaffolding with RNA-seq. We first sequenced genomic DNA and mixed-stage cDNA using paired 75-nt reads from an Illumina GAII. A set of 230 million genomic reads yielded an 80-Mb assembly, with a supercontig N50 of 5.0 kb, covering 90% of 429 kb from previously published genomic contigs. Mixed-stage poly(A)(+) cDNA gave 47.3 million mappable 75-mers (including 5.1 million spliced reads), which separately assembled into 17.8 Mb of cDNA, with an N50 of 1.06 kb. By further scaffolding our genomic supercontigs with cDNA, we increased their N50 to 9.4 kb, nearly double the average gene size in C. elegans. We predicted 22,851 protein-coding genes, and detected expression in 78% of them. Multigenome alignment and data filtering identified 2672 DNA elements conserved between PS1010 and C. elegans that are likely to encode regulatory sequences or previously unknown ncRNAs. Genomic and cDNA sequencing followed by joint assembly is a rapid and useful strategy for biological analysis.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>États-Unis</li>
</country>
</list>
<tree>
<noCountry>
<name sortKey="Antoshechkin, Igor" sort="Antoshechkin, Igor" uniqKey="Antoshechkin I" first="Igor" last="Antoshechkin">Igor Antoshechkin</name>
<name sortKey="Schaeffer, Lorian" sort="Schaeffer, Lorian" uniqKey="Schaeffer L" first="Lorian" last="Schaeffer">Lorian Schaeffer</name>
<name sortKey="Schwarz, Erich M" sort="Schwarz, Erich M" uniqKey="Schwarz E" first="Erich M" last="Schwarz">Erich M. Schwarz</name>
<name sortKey="Sternberg, Paul W" sort="Sternberg, Paul W" uniqKey="Sternberg P" first="Paul W" last="Sternberg">Paul W. Sternberg</name>
<name sortKey="Williams, Brian" sort="Williams, Brian" uniqKey="Williams B" first="Brian" last="Williams">Brian Williams</name>
<name sortKey="Wold, Barbara J" sort="Wold, Barbara J" uniqKey="Wold B" first="Barbara J" last="Wold">Barbara J. Wold</name>
</noCountry>
<country name="États-Unis">
<noRegion>
<name sortKey="Mortazavi, Ali" sort="Mortazavi, Ali" uniqKey="Mortazavi A" first="Ali" last="Mortazavi">Ali Mortazavi</name>
</noRegion>
</country>
</tree>
</affiliations>
</record>
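
Outside the Dilib toolchain, a record like the one above can also be read with a general-purpose XML parser. The sketch below uses Python's standard `xml.etree.ElementTree`. One caveat: the record uses the `wicri:` and `nlm:` prefixes without declaring them, so a strict parser rejects it as written. The placeholder namespace URIs bound here are assumptions made only to get the text parsed; they are not official Wicri or NLM identifiers. The helper names and the shortened `SAMPLE` record are likewise hypothetical.

```python
import xml.etree.ElementTree as ET

# Assumed placeholder URIs: the record does not declare these prefixes,
# so they are bound here only to satisfy the parser.
PLACEHOLDER_NS = 'xmlns:wicri="urn:example:wicri" xmlns:nlm="urn:example:nlm"'


def parse_record(xml_text):
    """Parse a <record> string after binding the undeclared prefixes."""
    patched = xml_text.replace("<record>", "<record " + PLACEHOLDER_NS + ">", 1)
    return ET.fromstring(patched)


def analytic_authors(record):
    """Return the author names listed under <analytic>, in document order."""
    return [name.text for name in record.findall(".//analytic/author/name")]


# Shortened copy of the record, kept small for illustration.
SAMPLE = """<record>
<TEI>
<teiHeader>
<fileDesc>
<publicationStmt>
<idno type="pmid">20980554</idno>
<idno type="doi">10.1101/gr.111021.110</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<author>
<name sortKey="Mortazavi, Ali" first="Ali" last="Mortazavi">Ali Mortazavi</name>
<affiliation wicri:level="1">
<country xml:lang="fr">États-Unis</country>
</affiliation>
</author>
<author>
<name sortKey="Schwarz, Erich M" first="Erich M" last="Schwarz">Erich M. Schwarz</name>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
</teiHeader>
</TEI>
</record>"""

record = parse_record(SAMPLE)
print(analytic_authors(record))
print(record.findtext(".//idno[@type='doi']"))
```

Restricting the author query to `<analytic>` avoids counting each author twice, since the full record repeats the author list under `<titleStmt>` and again in the `<affiliations>` tree.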

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Sante/explor/MersV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 002611 | SxmlIndent | more

Or

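# Assumes EXPLOR_AREA is the area root, consistent with the EXPLOR_STEP path used in the first form
EXPLOR_AREA=$WICRI_ROOT/Sante/explor/MersV1
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 002611 | SxmlIndent | more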

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Sante
   |area=    MersV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     pubmed:20980554
   |texte=   Scaffolding a Caenorhabditis nematode genome with RNA-seq.
}}

To generate wiki pages

HfdIndexSelect -h $EXPLOR_AREA/Data/Main/Exploration/RBID.i   -Sk "pubmed:20980554" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd   \
       | NlmPubMed2Wicri -a MersV1 

This area was generated with Dilib version V0.6.33.
Data generation: Mon Apr 20 23:26:43 2020. Site generation: Sat Mar 27 09:06:09 2021