Serveur d'exploration SRAS

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Genomic Characterization of a Newly Discovered Coronavirus Associated with Acute Respiratory Distress Syndrome in Humans

Identifieur interne : 001631 ( Pmc/Corpus ); précédent : 001630; suivant : 001632

Genomic Characterization of a Newly Discovered Coronavirus Associated with Acute Respiratory Distress Syndrome in Humans

Auteurs : Sander Van Boheemen ; Miranda De Graaf ; Chris Lauber ; Theo M. Bestebroer ; V. Stalin Raj ; Ali Moh Zaki ; Albert D. M. E. Osterhaus ; Bart L. Haagmans ; Alexander E. Gorbalenya ; Eric J. Snijder ; Ron A. M. Fouchier

Source :

RBID : PMC:3509437

Abstract

ABSTRACT

A novel human coronavirus (HCoV-EMC/2012) was isolated from a man with acute pneumonia and renal failure in June 2012. This report describes the complete genome sequence, genome organization, and expression strategy of HCoV-EMC/2012 and its relation with known coronaviruses. The genome contains 30,119 nucleotides and contains at least 10 predicted open reading frames, 9 of which are predicted to be expressed from a nested set of seven subgenomic mRNAs. Phylogenetic analysis of the replicase gene of coronaviruses with completely sequenced genomes showed that HCoV-EMC/2012 is most closely related to Tylonycteris bat coronavirus HKU4 (BtCoV-HKU4) and Pipistrellus bat coronavirus HKU5 (BtCoV-HKU5), which prototype two species in lineage C of the genus Betacoronavirus. In accordance with the guidelines of the International Committee on Taxonomy of Viruses, and in view of the 75% and 77% amino acid sequence identity in 7 conserved replicase domains with BtCoV-HKU4 and BtCoV-HKU5, respectively, we propose that HCoV-EMC/2012 prototypes a novel species in the genus Betacoronavirus. HCoV-EMC/2012 may be most closely related to a coronavirus detected in Pipistrellus pipistrellus in The Netherlands, but because only a short sequence from the most conserved part of the RNA-dependent RNA polymerase-encoding region of the genome was reported for this bat virus, its genetic distance from HCoV-EMC remains uncertain. HCoV-EMC/2012 is the sixth coronavirus known to infect humans and the first human virus within betacoronavirus lineage C.


Url:
DOI: 10.1128/mBio.00473-12
PubMed: 23170002
PubMed Central: 3509437

Links to Exploration step

PMC:3509437

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Genomic Characterization of a Newly Discovered Coronavirus Associated with Acute Respiratory Distress Syndrome in Humans</title>
<author>
<name sortKey="Van Boheemen, Sander" sort="Van Boheemen, Sander" uniqKey="Van Boheemen S" first="Sander" last="Van Boheemen">Sander Van Boheemen</name>
<affiliation>
<nlm:aff id="aff1"></nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="De Graaf, Miranda" sort="De Graaf, Miranda" uniqKey="De Graaf M" first="Miranda" last="De Graaf">Miranda De Graaf</name>
<affiliation>
<nlm:aff id="aff1"></nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Lauber, Chris" sort="Lauber, Chris" uniqKey="Lauber C" first="Chris" last="Lauber">Chris Lauber</name>
<affiliation>
<nlm:aff id="aff2"></nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Bestebroer, Theo M" sort="Bestebroer, Theo M" uniqKey="Bestebroer T" first="Theo M." last="Bestebroer">Theo M. Bestebroer</name>
<affiliation>
<nlm:aff id="aff1"></nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Raj, V Stalin" sort="Raj, V Stalin" uniqKey="Raj V" first="V. Stalin" last="Raj">V. Stalin Raj</name>
<affiliation>
<nlm:aff id="aff1"></nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Zaki, Ali Moh" sort="Zaki, Ali Moh" uniqKey="Zaki A" first="Ali Moh" last="Zaki">Ali Moh Zaki</name>
<affiliation>
<nlm:aff id="aff3"></nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Osterhaus, Albert D M E" sort="Osterhaus, Albert D M E" uniqKey="Osterhaus A" first="Albert D. M. E." last="Osterhaus">Albert D. M. E. Osterhaus</name>
<affiliation>
<nlm:aff id="aff1"></nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Haagmans, Bart L" sort="Haagmans, Bart L" uniqKey="Haagmans B" first="Bart L." last="Haagmans">Bart L. Haagmans</name>
<affiliation>
<nlm:aff id="aff1"></nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Gorbalenya, Alexander E" sort="Gorbalenya, Alexander E" uniqKey="Gorbalenya A" first="Alexander E." last="Gorbalenya">Alexander E. Gorbalenya</name>
<affiliation>
<nlm:aff id="aff2"></nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="aff4"></nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Snijder, Eric J" sort="Snijder, Eric J" uniqKey="Snijder E" first="Eric J." last="Snijder">Eric J. Snijder</name>
<affiliation>
<nlm:aff id="aff2"></nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Fouchier, Ron A M" sort="Fouchier, Ron A M" uniqKey="Fouchier R" first="Ron A. M." last="Fouchier">Ron A. M. Fouchier</name>
<affiliation>
<nlm:aff id="aff1"></nlm:aff>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PMC</idno>
<idno type="pmid">23170002</idno>
<idno type="pmc">3509437</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3509437</idno>
<idno type="RBID">PMC:3509437</idno>
<idno type="doi">10.1128/mBio.00473-12</idno>
<date when="2012">2012</date>
<idno type="wicri:Area/Pmc/Corpus">001631</idno>
<idno type="wicri:explorRef" wicri:stream="Pmc" wicri:step="Corpus" wicri:corpus="PMC">001631</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a" type="main">Genomic Characterization of a Newly Discovered Coronavirus Associated with Acute Respiratory Distress Syndrome in Humans</title>
<author>
<name sortKey="Van Boheemen, Sander" sort="Van Boheemen, Sander" uniqKey="Van Boheemen S" first="Sander" last="Van Boheemen">Sander Van Boheemen</name>
<affiliation>
<nlm:aff id="aff1"></nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="De Graaf, Miranda" sort="De Graaf, Miranda" uniqKey="De Graaf M" first="Miranda" last="De Graaf">Miranda De Graaf</name>
<affiliation>
<nlm:aff id="aff1"></nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Lauber, Chris" sort="Lauber, Chris" uniqKey="Lauber C" first="Chris" last="Lauber">Chris Lauber</name>
<affiliation>
<nlm:aff id="aff2"></nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Bestebroer, Theo M" sort="Bestebroer, Theo M" uniqKey="Bestebroer T" first="Theo M." last="Bestebroer">Theo M. Bestebroer</name>
<affiliation>
<nlm:aff id="aff1"></nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Raj, V Stalin" sort="Raj, V Stalin" uniqKey="Raj V" first="V. Stalin" last="Raj">V. Stalin Raj</name>
<affiliation>
<nlm:aff id="aff1"></nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Zaki, Ali Moh" sort="Zaki, Ali Moh" uniqKey="Zaki A" first="Ali Moh" last="Zaki">Ali Moh Zaki</name>
<affiliation>
<nlm:aff id="aff3"></nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Osterhaus, Albert D M E" sort="Osterhaus, Albert D M E" uniqKey="Osterhaus A" first="Albert D. M. E." last="Osterhaus">Albert D. M. E. Osterhaus</name>
<affiliation>
<nlm:aff id="aff1"></nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Haagmans, Bart L" sort="Haagmans, Bart L" uniqKey="Haagmans B" first="Bart L." last="Haagmans">Bart L. Haagmans</name>
<affiliation>
<nlm:aff id="aff1"></nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Gorbalenya, Alexander E" sort="Gorbalenya, Alexander E" uniqKey="Gorbalenya A" first="Alexander E." last="Gorbalenya">Alexander E. Gorbalenya</name>
<affiliation>
<nlm:aff id="aff2"></nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="aff4"></nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Snijder, Eric J" sort="Snijder, Eric J" uniqKey="Snijder E" first="Eric J." last="Snijder">Eric J. Snijder</name>
<affiliation>
<nlm:aff id="aff2"></nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Fouchier, Ron A M" sort="Fouchier, Ron A M" uniqKey="Fouchier R" first="Ron A. M." last="Fouchier">Ron A. M. Fouchier</name>
<affiliation>
<nlm:aff id="aff1"></nlm:aff>
</affiliation>
</author>
</analytic>
<series>
<title level="j">mBio</title>
<idno type="eISSN">2150-7511</idno>
<imprint>
<date when="2012">2012</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">
<title>ABSTRACT</title>
<p>A novel human coronavirus (HCoV-EMC/2012) was isolated from a man with acute pneumonia and renal failure in June 2012. This report describes the complete genome sequence, genome organization, and expression strategy of HCoV-EMC/2012 and its relation with known coronaviruses. The genome contains 30,119 nucleotides and contains at least 10 predicted open reading frames, 9 of which are predicted to be expressed from a nested set of seven subgenomic mRNAs. Phylogenetic analysis of the replicase gene of coronaviruses with completely sequenced genomes showed that HCoV-EMC/2012 is most closely related to
<italic>Tylonycteris</italic>
bat coronavirus HKU4 (BtCoV-HKU4) and
<italic>Pipistrellus</italic>
bat coronavirus HKU5 (BtCoV-HKU5), which prototype two species in lineage C of the genus
<italic>Betacoronavirus</italic>
. In accordance with the guidelines of the International Committee on Taxonomy of Viruses, and in view of the 75% and 77% amino acid sequence identity in 7 conserved replicase domains with BtCoV-HKU4 and BtCoV-HKU5, respectively, we propose that HCoV-EMC/2012 prototypes a novel species in the genus
<italic>Betacoronavirus</italic>
. HCoV-EMC/2012 may be most closely related to a coronavirus detected in
<italic>Pipistrellus pipistrellus</italic>
in The Netherlands, but because only a short sequence from the most conserved part of the RNA-dependent RNA polymerase-encoding region of the genome was reported for this bat virus, its genetic distance from HCoV-EMC remains uncertain. HCoV-EMC/2012 is the sixth coronavirus known to infect humans and the first human virus within betacoronavirus lineage C.</p>
</div>
</front>
<back>
<div1 type="bibliography">
<listBibl>
<biblStruct>
<analytic>
<author>
<name sortKey="De Groot, Rj" uniqKey="De Groot R">RJ de Groot</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Perlman, S" uniqKey="Perlman S">S Perlman</name>
</author>
<author>
<name sortKey="Netland, J" uniqKey="Netland J">J Netland</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Gloza Rausch, F" uniqKey="Gloza Rausch F">F Gloza-Rausch</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Lau, Sk" uniqKey="Lau S">SK Lau</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Li, W" uniqKey="Li W">W Li</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Pfefferle, S" uniqKey="Pfefferle S">S Pfefferle</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Vijaykrishna, D" uniqKey="Vijaykrishna D">D Vijaykrishna</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Woo, Pc" uniqKey="Woo P">PC Woo</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Hamre, D" uniqKey="Hamre D">D Hamre</name>
</author>
<author>
<name sortKey="Procknow, Jj" uniqKey="Procknow J">JJ Procknow</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Mcintosh, K" uniqKey="Mcintosh K">K McIntosh</name>
</author>
<author>
<name sortKey="Dees, Jh" uniqKey="Dees J">JH Dees</name>
</author>
<author>
<name sortKey="Becker, Wb" uniqKey="Becker W">WB Becker</name>
</author>
<author>
<name sortKey="Kapikian, Az" uniqKey="Kapikian A">AZ Kapikian</name>
</author>
<author>
<name sortKey="Chanock, Rm" uniqKey="Chanock R">RM Chanock</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Drosten, C" uniqKey="Drosten C">C Drosten</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Marra, Ma" uniqKey="Marra M">MA Marra</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Peiris, Js" uniqKey="Peiris J">JS Peiris</name>
</author>
<author>
<name sortKey="Guan, Y" uniqKey="Guan Y">Y Guan</name>
</author>
<author>
<name sortKey="Yuen, Ky" uniqKey="Yuen K">KY Yuen</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Rota, Pa" uniqKey="Rota P">PA Rota</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Fouchier, Ra" uniqKey="Fouchier R">RA Fouchier</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Van Der Hoek, L" uniqKey="Van Der Hoek L">L van der Hoek</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Woo, Pc" uniqKey="Woo P">PC Woo</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Zlateva, Kt" uniqKey="Zlateva K">KT Zlateva</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Gorbalenya, Ae" uniqKey="Gorbalenya A">AE Gorbalenya</name>
</author>
<author>
<name sortKey="Enjuanes, L" uniqKey="Enjuanes L">L Enjuanes</name>
</author>
<author>
<name sortKey="Ziebuhr, J" uniqKey="Ziebuhr J">J Ziebuhr</name>
</author>
<author>
<name sortKey="Snijder, Ej" uniqKey="Snijder E">EJ Snijder</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Adams, Mj" uniqKey="Adams M">MJ Adams</name>
</author>
<author>
<name sortKey="Carstens, Eb" uniqKey="Carstens E">EB Carstens</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Lauber, C" uniqKey="Lauber C">C Lauber</name>
</author>
<author>
<name sortKey="Gorbalenya, Ae" uniqKey="Gorbalenya A">AE Gorbalenya</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Masters, Ps" uniqKey="Masters P">PS Masters</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Snijder, Ej" uniqKey="Snijder E">EJ Snijder</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Zaki, Am" uniqKey="Zaki A">AM Zaki</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Bermingham, A" uniqKey="Bermingham A">A Bermingham</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Ziebuhr, J" uniqKey="Ziebuhr J">J Ziebuhr</name>
</author>
<author>
<name sortKey="Snijder, Ej" uniqKey="Snijder E">EJ Snijder</name>
</author>
<author>
<name sortKey="Gorbalenya, Ae" uniqKey="Gorbalenya A">AE Gorbalenya</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Pasternak, Ao" uniqKey="Pasternak A">AO Pasternak</name>
</author>
<author>
<name sortKey="Spaan, Wj" uniqKey="Spaan W">WJ Spaan</name>
</author>
<author>
<name sortKey="Snijder, Ej" uniqKey="Snijder E">EJ Snijder</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Sawicki, Sg" uniqKey="Sawicki S">SG Sawicki</name>
</author>
<author>
<name sortKey="Sawicki, Dl" uniqKey="Sawicki D">DL Sawicki</name>
</author>
<author>
<name sortKey="Siddell, Sg" uniqKey="Siddell S">SG Siddell</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Sola, I" uniqKey="Sola I">I Sola</name>
</author>
<author>
<name sortKey="Mateos Gomez, Pa" uniqKey="Mateos Gomez P">PA Mateos-Gomez</name>
</author>
<author>
<name sortKey="Almazan, F" uniqKey="Almazan F">F Almazan</name>
</author>
<author>
<name sortKey="Zu Iga, S" uniqKey="Zu Iga S">S Zuñiga</name>
</author>
<author>
<name sortKey="Enjuanes, L" uniqKey="Enjuanes L">L Enjuanes</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Firth, Ae" uniqKey="Firth A">AE Firth</name>
</author>
<author>
<name sortKey="Brierley, I" uniqKey="Brierley I">I Brierley</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Reusken, Cb" uniqKey="Reusken C">CB Reusken</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Holmes, Kv" uniqKey="Holmes K">KV Holmes</name>
</author>
<author>
<name sortKey="Lai, Mc" uniqKey="Lai M">MC Lai</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Mcintosh, K" uniqKey="Mcintosh K">K McIntosh</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Keng, Ct" uniqKey="Keng C">CT Keng</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Narayanan, K" uniqKey="Narayanan K">K Narayanan</name>
</author>
<author>
<name sortKey="Huang, C" uniqKey="Huang C">C Huang</name>
</author>
<author>
<name sortKey="Makino, S" uniqKey="Makino S">S Makino</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Senanayake, Sd" uniqKey="Senanayake S">SD Senanayake</name>
</author>
<author>
<name sortKey="Brian, Da" uniqKey="Brian D">DA Brian</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Mazumder, R" uniqKey="Mazumder R">R Mazumder</name>
</author>
<author>
<name sortKey="Iyer, Lm" uniqKey="Iyer L">LM Iyer</name>
</author>
<author>
<name sortKey="Vasudevan, S" uniqKey="Vasudevan S">S Vasudevan</name>
</author>
<author>
<name sortKey="Aravind, L" uniqKey="Aravind L">L Aravind</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Snijder, Ej" uniqKey="Snijder E">EJ Snijder</name>
</author>
<author>
<name sortKey="Den Boon, Ja" uniqKey="Den Boon J">JA den Boon</name>
</author>
<author>
<name sortKey="Horzinek, Mc" uniqKey="Horzinek M">MC Horzinek</name>
</author>
<author>
<name sortKey="Spaan, Wj" uniqKey="Spaan W">WJ Spaan</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Yount, B" uniqKey="Yount B">B Yount</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Cruz, Jl" uniqKey="Cruz J">JL Cruz</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="De Haan, Ca" uniqKey="De Haan C">CA de Haan</name>
</author>
<author>
<name sortKey="Masters, Ps" uniqKey="Masters P">PS Masters</name>
</author>
<author>
<name sortKey="Shen, X" uniqKey="Shen X">X Shen</name>
</author>
<author>
<name sortKey="Weiss, S" uniqKey="Weiss S">S Weiss</name>
</author>
<author>
<name sortKey="Rottier, Pj" uniqKey="Rottier P">PJ Rottier</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Pewe, L" uniqKey="Pewe L">L Pewe</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Frieman, M" uniqKey="Frieman M">M Frieman</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Oostra, M" uniqKey="Oostra M">M Oostra</name>
</author>
<author>
<name sortKey="De Haan, Ca" uniqKey="De Haan C">CA de Haan</name>
</author>
<author>
<name sortKey="Rottier, Pj" uniqKey="Rottier P">PJ Rottier</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Cornman, Vm" uniqKey="Cornman V">VM Cornman</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Welsh, J" uniqKey="Welsh J">J Welsh</name>
</author>
<author>
<name sortKey="Mcclelland, M" uniqKey="Mcclelland M">M McClelland</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Hall, Ta" uniqKey="Hall T">TA Hall</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Gorbalenya, Ae" uniqKey="Gorbalenya A">AE Gorbalenya</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Castresana, J" uniqKey="Castresana J">J Castresana</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Antonov, Iv" uniqKey="Antonov I">IV Antonov</name>
</author>
<author>
<name sortKey="Leontovich, Am" uniqKey="Leontovich A">AM Leontovich</name>
</author>
<author>
<name sortKey="Gorbalenya, Ae" uniqKey="Gorbalenya A">AE Gorbalenya</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Darriba, D" uniqKey="Darriba D">D Darriba</name>
</author>
<author>
<name sortKey="Taboada, Gl" uniqKey="Taboada G">GL Taboada</name>
</author>
<author>
<name sortKey="Doallo, R" uniqKey="Doallo R">R Doallo</name>
</author>
<author>
<name sortKey="Posada, D" uniqKey="Posada D">D Posada</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Guindon, S" uniqKey="Guindon S">S Guindon</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Katoh, K" uniqKey="Katoh K">K Katoh</name>
</author>
<author>
<name sortKey="Misawa, K" uniqKey="Misawa K">K Misawa</name>
</author>
<author>
<name sortKey="Kuma, K" uniqKey="Kuma K">K Kuma</name>
</author>
<author>
<name sortKey="Miyata, T" uniqKey="Miyata T">T Miyata</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Posada, D" uniqKey="Posada D">D Posada</name>
</author>
<author>
<name sortKey="Crandall, Ka" uniqKey="Crandall K">KA Crandall</name>
</author>
</analytic>
</biblStruct>
</listBibl>
</div1>
</back>
</TEI>
<pmc article-type="research-article">
<pmc-dir>properties open_access</pmc-dir>
<front>
<journal-meta>
<journal-id journal-id-type="nlm-ta">mBio</journal-id>
<journal-id journal-id-type="iso-abbrev">MBio</journal-id>
<journal-id journal-id-type="hwp">mbio</journal-id>
<journal-id journal-id-type="pmc">mbio</journal-id>
<journal-id journal-id-type="publisher-id">mBio</journal-id>
<journal-title-group>
<journal-title>mBio</journal-title>
</journal-title-group>
<issn pub-type="epub">2150-7511</issn>
<publisher>
<publisher-name>American Society of Microbiology</publisher-name>
<publisher-loc>1752 N St., N.W., Washington, DC</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="pmid">23170002</article-id>
<article-id pub-id-type="pmc">3509437</article-id>
<article-id pub-id-type="publisher-id">mBio00473-12</article-id>
<article-id pub-id-type="doi">10.1128/mBio.00473-12</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Research Article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Genomic Characterization of a Newly Discovered Coronavirus Associated with Acute Respiratory Distress Syndrome in Humans</article-title>
<alt-title>Genome and Phylogeny of Human Coronavirus EMC/2012</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>van Boheemen</surname>
<given-names>Sander</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>a</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>de Graaf</surname>
<given-names>Miranda</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>a</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Lauber</surname>
<given-names>Chris</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>b</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Bestebroer</surname>
<given-names>Theo M.</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>a</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Raj</surname>
<given-names>V. Stalin</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>a</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Zaki</surname>
<given-names>Ali Moh</given-names>
</name>
<xref ref-type="aff" rid="aff3">
<sup>c</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Osterhaus</surname>
<given-names>Albert D. M. E.</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>a</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Haagmans</surname>
<given-names>Bart L.</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>a</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Gorbalenya</surname>
<given-names>Alexander E.</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>b</sup>
</xref>
<xref ref-type="aff" rid="aff4">
<sup>d</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Snijder</surname>
<given-names>Eric J.</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>b</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Fouchier</surname>
<given-names>Ron A. M.</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>a</sup>
</xref>
</contrib>
<aff id="aff1">Viroscience Lab, Erasmus MC, Rotterdam, The Netherlands
<label>
<sup>a</sup>
</label>
;</aff>
<aff id="aff2">Molecular Virology Laboratory, Department of Medical Microbiology, Center of Infectious Diseases, Leiden University Medical Center, Leiden, The Netherlands
<label>
<sup>b</sup>
</label>
;</aff>
<aff id="aff3">Dr. Soliman Fakeeh Hospital, Jeddah, Saudi Arabia
<label>
<sup>c</sup>
</label>
; and</aff>
<aff id="aff4">Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow, Russia
<label>
<sup>d</sup>
</label>
</aff>
</contrib-group>
<author-notes>
<corresp id="cor1">Address correspondence to Ron A. M. Fouchier,
<email>r.fouchier@erasmusmc.nl</email>
.</corresp>
<fn fn-type="edited-by">
<p>
<bold>Editor</bold>
Michael Buchmeier, University of California, Irvine</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>20</day>
<month>11</month>
<year>2012</year>
</pub-date>
<pub-date pub-type="collection">
<season>Nov-Dec</season>
<year>2012</year>
</pub-date>
<volume>3</volume>
<issue>6</issue>
<elocation-id>e00473-12</elocation-id>
<history>
<date date-type="received">
<day>24</day>
<month>10</month>
<year>2012</year>
</date>
<date date-type="accepted">
<day>1</day>
<month>11</month>
<year>2012</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright © 2012 van Boheemen et al.</copyright-statement>
<copyright-year>2012</copyright-year>
<copyright-holder>van Boheemen et al.</copyright-holder>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by-nc-sa/3.0/">
<license-p>This is an open-access article distributed under the terms of the
<ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/licenses/by-nc-sa/3.0/">Creative Commons Attribution-Noncommercial-ShareAlike 3.0 Unported</ext-link>
license, which permits unrestricted noncommercial use, distribution, and reproduction in any medium, provided the original author and source are credited.</license-p>
</license>
</permissions>
<abstract>
<title>ABSTRACT</title>
<p>A novel human coronavirus (HCoV-EMC/2012) was isolated from a man with acute pneumonia and renal failure in June 2012. This report describes the complete genome sequence, genome organization, and expression strategy of HCoV-EMC/2012 and its relation with known coronaviruses. The genome contains 30,119 nucleotides and contains at least 10 predicted open reading frames, 9 of which are predicted to be expressed from a nested set of seven subgenomic mRNAs. Phylogenetic analysis of the replicase gene of coronaviruses with completely sequenced genomes showed that HCoV-EMC/2012 is most closely related to
<italic>Tylonycteris</italic>
bat coronavirus HKU4 (BtCoV-HKU4) and
<italic>Pipistrellus</italic>
bat coronavirus HKU5 (BtCoV-HKU5), which prototype two species in lineage C of the genus
<italic>Betacoronavirus</italic>
. In accordance with the guidelines of the International Committee on Taxonomy of Viruses, and in view of the 75% and 77% amino acid sequence identity in 7 conserved replicase domains with BtCoV-HKU4 and BtCoV-HKU5, respectively, we propose that HCoV-EMC/2012 prototypes a novel species in the genus
<italic>Betacoronavirus</italic>
. HCoV-EMC/2012 may be most closely related to a coronavirus detected in
<italic>Pipistrellus pipistrellus</italic>
in The Netherlands, but because only a short sequence from the most conserved part of the RNA-dependent RNA polymerase-encoding region of the genome was reported for this bat virus, its genetic distance from HCoV-EMC remains uncertain. HCoV-EMC/2012 is the sixth coronavirus known to infect humans and the first human virus within betacoronavirus lineage C.</p>
</abstract>
<abstract abstract-type="executive-summary">
<title>IMPORTANCE</title>
<p>Coronaviruses are capable of infecting humans and many animal species. Most infections caused by human coronaviruses are relatively mild. However, the outbreak of severe acute respiratory syndrome (SARS) caused by SARS-CoV in 2002 to 2003 and the fatal infection of a human by HCoV-EMC/2012 in 2012 show that coronaviruses are able to cause severe, sometimes fatal disease in humans. We have determined the complete genome of HCoV-EMC/2012 using an unbiased virus discovery approach involving next-generation sequencing techniques, which enabled subsequent state-of-the-art bioinformatics, phylogenetics, and taxonomic analyses. By establishing its complete genome sequence, HCoV-EMC/2012 was characterized as a new genotype which is closely related to bat coronaviruses that are distant from SARS-CoV. We expect that this information will be vital to rapid advancement of both clinical and vital research on this emerging pathogen.</p>
</abstract>
<counts>
<page-count count="9"></page-count>
</counts>
<custom-meta-group>
<custom-meta>
<meta-name>cover-date</meta-name>
<meta-value>November/December 2012</meta-value>
</custom-meta>
</custom-meta-group>
</article-meta>
</front>
<body>
<sec id="h0.0">
<title>Introduction</title>
<p>Coronaviruses (CoVs) infect and cause disease in a wide variety of species, including bats, birds, cats, dogs, pigs, mice, horses, whales, and humans (
<xref ref-type="bibr" rid="B1">1</xref>
,
<xref ref-type="bibr" rid="B2">2</xref>
). Recent studies suggest that bats act as a natural reservoir for coronaviruses (
<xref ref-type="bibr" rid="B3">3</xref>
<xref ref-type="bibr" rid="B8">8</xref>
). Coronaviruses may cause respiratory, enteric, hepatic, or neurological diseases with highly variable severity in their hosts. Until 2003, only two coronaviruses were known to infect humans. Human coronaviruses (HCoVs) HCoV-229E and HCoV-OC43 were identified in the 1960s as the causative agents of—generally mild—respiratory illnesses (
<xref ref-type="bibr" rid="B9">9</xref>
,
<xref ref-type="bibr" rid="B10">10</xref>
). In 2002 to 2003, a previously unknown coronavirus—severe acute respiratory syndrome coronavirus (SARS-CoV)—caused a widespread outbreak of respiratory disease in humans, resulting in approximately 800 deaths and affecting around 30 countries (
<xref ref-type="bibr" rid="B11">11</xref>
<xref ref-type="bibr" rid="B14">14</xref>
). As a consequence of the renewed interest in coronaviruses after the SARS outbreak, two additional human coronaviruses were discovered after 2003: HCoV-NL63 in 2004 (
<xref ref-type="bibr" rid="B15">15</xref>
,
<xref ref-type="bibr" rid="B16">16</xref>
) and HCoV-HKU1 in 2005 (
<xref ref-type="bibr" rid="B17">17</xref>
). A recent analysis of a large collection of human nasopharyngeal specimens using a
<italic>Coronaviridae</italic>
-wide primer set suggested that HCoV-229E, -OC43, -NL63, and -HKU1 are the only coronaviruses circulating in the human population (
<xref ref-type="bibr" rid="B18">18</xref>
).</p>
<p>Coronaviruses are enveloped single-stranded positive-sense RNA viruses with genomes of 25 to 32 kb, and the group includes the largest known genomes among the RNA viruses (
<xref ref-type="bibr" rid="B1">1</xref>
,
<xref ref-type="bibr" rid="B19">19</xref>
). The coronaviruses form a subfamily (
<italic>Coronavirinae</italic>
) within the family
<italic>Coronaviridae</italic>
of the order
<italic>Nidovirales</italic>
. The International Committee on Taxonomy of Viruses (ICTV) has recognized four genera within the
<italic>Coronavirinae</italic>
subfamily:
<italic>Alphacoronavirus</italic>
,
<italic>Betacoronavirus</italic>
, and
<italic>Gammacoronavirus</italic>
, which were previously referred to as coronavirus groups 1, 2, and 3, and
<italic>Deltacoronavirus</italic>
(
<xref ref-type="bibr" rid="B20">20</xref>
). Coronaviruses are assigned to a genus on the basis of rooted phylogeny and calculation of pairwise evolutionary distances for seven highly conserved domains in the replicase polyprotein (
<xref ref-type="bibr" rid="B1">1</xref>
,
<xref ref-type="bibr" rid="B21">21</xref>
) (C. Lauber and A. E. Gorbalenya, unpublished data). HCoV-229E and HCoV-NL63 are viruses belonging to the genus
<italic>Alphacoronavirus</italic>
(
<xref ref-type="bibr" rid="B1">1</xref>
). Four monophyletic lineages (A through D) with no formal taxonomic standing, some of them encompassing multiple virus species, are commonly recognized within the genus
<italic>Betacoronavirus</italic>
. Lineage A includes HCoV-OC43 and HCoV-HKU1 and lineage B SARS-CoV, all of which belong to different species. Lineages C and D include viruses detected only in bats, such as
<italic>Rousettus</italic>
bat coronavirus HKU9 (BtCoV-HKU9) (lineage D),
<italic>Tylonycteris</italic>
bat coronavirus HKU4 (BtCoV-HKU4), and
<italic>Pipistrellus</italic>
bat coronavirus HKU5 (BtCoV-HKU5) (both lineage C) (
<xref ref-type="bibr" rid="B1">1</xref>
). The genetic diversity of coronaviruses is likely facilitated by a high frequency of RNA recombination and the ability of their unusually large RNA genomes to both gain and lose domains (
<xref ref-type="bibr" rid="B1">1</xref>
,
<xref ref-type="bibr" rid="B22">22</xref>
,
<xref ref-type="bibr" rid="B23">23</xref>
). These factors are believed to have promoted the emergence of viruses with novel traits that are able to adapt to new hosts and ecological niches, sometimes causing zoonotic events.</p>
<p>For the present study, we report and analyze the complete genome sequence of the recently identified HCoV-EMC/2012, which was isolated from the sputum of a 60-year-old man who died in a hospital in Jeddah, Saudi Arabia, after developing acute respiratory distress syndrome (ARDS) and multiple organ dysfunction syndrome (MODS) in June 2012 (
<xref ref-type="bibr" rid="B24">24</xref>
). This virus appears to be closely related to the HCoV detected in a second patient who was transported from a hospital in Qatar to a hospital in London, 3 months after hospitalization of the first patient (
<xref ref-type="bibr" rid="B25">25</xref>
). These two cases of human infection with very similar or identical coronaviruses alarmed health authorities globally, as it was a reminder of the potential threat of coronaviruses to human health that was first highlighted by the SARS outbreak of 2003 (
<xref ref-type="bibr" rid="B25">25</xref>
). The sequence analysis of a small reverse transcription-PCR (RT-PCR) fragment that was first amplified from the HCoV-EMC/2012 genome revealed the highest similarity to two betacoronaviruses circulating in bats, BtCoV-HKU4 and -HKU5 (
<xref ref-type="bibr" rid="B24">24</xref>
). Here we present the complete genome sequence of the newly isolated HCoV-EMC/2012, accompanied by a detailed annotation of its genome organization and expression strategy. Furthermore, comparative genomic analysis and state-of-the-art classification and phylogenetic analyses were applied to determine the position of the novel agent with respect to previously characterized coronaviruses. We conclude that the HCoV-EMC/2012 genome organization and expression indeed most closely resemble those of BtCoV-HKU4 and -HKU5. However, based on our analysis and in line with the ICTV guidelines for the demarcation of coronavirus species, HCoV-EMC/2012 clearly qualifies to be recognized as the prototype of a novel species, which would thus constitute the first human coronavirus in lineage C of the genus
<italic>Betacoronavirus</italic>
.</p>
</sec>
<sec sec-type="results" id="h1">
<title>RESULTS</title>
<sec id="h1.1">
<title>Sequencing of the HCoV-EMC/2012 genome.</title>
<p>Using a combination of approaches, including deep sequencing, cycle sequencing on a more traditional capillary sequencer, and determination of the genomic termini by rapid amplification of cDNA ends (RACE), the complete genome sequence of HCoV-EMC/2012 was determined from material that had been subjected to passage in cell culture 6 times. The data from the Roche 454 GS Junior deep-sequencing run yielded a total of 90,808 sequence reads, of which 87,256 were specific for HCoV-EMC/2012. Genome coverage ranged from 1 to 5,697 reads at single nucleotide positions, with an average of 1,006 reads in the deep-sequencing run. Based on the contigs assembled from these initial data, primers approximately 800 nucleotides (nt) apart were designed to amplify PCR fragments with 100-nt overlaps covering the entire virus genome (see
<xref ref-type="supplementary-material" rid="tabS1">Table S1</xref>
in the supplemental material for primer sequences). These amplicons were sequenced using Sanger sequencing, and a total of 104 sequence runs were assembled—along with the original 90,808 deep-sequencing reads—into a single contiguous sequence of 30,119 nt, including the first 12 nt of the 3′ poly(A) tail. Although 454 sequencing resulted in a higher single-read error rate than Sanger sequencing, the high coverage in the first data set largely corrected for these errors. Occasionally, the correct number of bases in homopolymer stretches was difficult to determine, which is a typical problem in 454 sequencing. Nevertheless, there was excellent agreement between the deep-sequencing data and the confirmatory Sanger sequencing. The final consensus sequence was submitted to GenBank (see below). This sequence contains only two ambiguous positions, nt 11623 and 27162. The variation at position 11623 translates into a Val or Gly uncertainty at amino acid (aa) 3782 of pp1a/pp1ab. Position 27162 was either a G or an A, with the A creating a premature stop codon for translation of open reading frame 5 (ORF5) (see Discussion). The verification of our consensus sequence awaits the availability of a second HCoV-EMC/2012 virus isolate or original specimen. The overall content of G and C residues in the HCoV-EMC/2012 genome was 41%, which is similar to values reported for other coronaviruses (37% to 42%) (
<xref ref-type="bibr" rid="B14">14</xref>
).</p>
</sec>
<sec id="h1.2">
<title>Genome organization and expression strategy.</title>
<p>Coronavirus genomes are polycistronic positive-stranded RNAs (
<xref ref-type="fig" rid="fig1">Fig. 1A</xref>
), of which the 5′-proximal three-fourths are occupied by the large replicase open reading frames ORF1a and ORF1b. These are translated from the genomic mRNA to produce polyproteins pp1a and, following −1 ribosomal frameshifting, pp1ab, which are subsequently cleaved into 15 or 16 nonstructural proteins (nsps) (
<xref ref-type="bibr" rid="B19">19</xref>
,
<xref ref-type="bibr" rid="B23">23</xref>
,
<xref ref-type="bibr" rid="B26">26</xref>
). The region downstream of ORF1b is characterized by containing a variable number of smaller genes, always including those encoding the spike (S), envelope (E), membrane (M), and nucleocapsid (N) structural proteins. These genes are translated from subgenomic (sg) mRNAs that form a 5′- and 3′-coterminal nested set with the viral genome. Subgenomic mRNAs are composed of a common 5′ leader sequence that is identical to the genomic 5′ region and a variable part of the 3′ quarter of the genome, with different sg mRNAs making different ORFs available for translation. The complement of the leader and “body” segments of the sg mRNAs are assumed to be joined during discontinuous negative-strand RNA synthesis. This step produces the templates for sg mRNA synthesis and is directed by a base-pairing interaction between conserved transcription-regulatory sequences (TRSs) (
<xref ref-type="bibr" rid="B27">27</xref>
<xref ref-type="bibr" rid="B29">29</xref>
). Such TRSs are found at the 3′ end of the leader sequence (leader TRS) and at different positions upstream of genes in the genomic 3′-proximal domain (body TRSs). The synthesis of subgenome-length negative-stranded RNAs is directed by the complement of a body TRS at the 3′ end of a nascent minus-strand base pairing with the leader TRS, with the extent of sequence complementarity being an important determinant of the level at which a given sg mRNA is produced.</p>
<fig id="fig1" position="float">
<label>FIG 1</label>
<caption>
<p>Genome organization and expression of HCoV-EMC/2012. (A) The coding part of the genome and terminal untranslated regions are depicted, respectively, by a gray background and horizontal lines. Rectangles indicate ORFs and their locations in three reading frames. The dashed lines in ORF1a and ORF5 indicate base ambiguities observed during sequencing. Triangles represent sites in the replicase polyproteins pp1a and pp1ab that are predicted to be cleaved by papain-like proteinases (gray) or the 3C-like cysteine proteinase (black). Cleavage products are numbered nsp1 to nsp16, according to the convention established for other coronaviruses (
<xref ref-type="bibr" rid="B23">23</xref>
). The −1 ribosomal frameshift site (RFS) in the ORF1a/ORF1b overlap region is indicated. The location of the leader TRS (transcription-regulatory sequences) (L) and seven body TRSs (numbered) are highlighted by black dots. All coordinates correspond to the scale shown at the bottom. (B) Sequence comparison of leader TRS region and seven body TRSs. The fully conserved TRS core sequence
<named-content content-type="gene">AACGAA</named-content>
is highlighted. Nucleotides in the body TRSs are written in uppercase letters if the complementary nucleotide can base pair with the corresponding residue in the leader TRS region (including G-U base pairs). TRS starting coordinates in the HCoV-EMC/2012 genome are shown at the left; for the body TRSs, the numbers of (potential) base pairs with the leader TRS region are shown at the right.</p>
</caption>
<graphic xlink:href="mbo0061213860001"></graphic>
</fig>
<p>Inspection of the genome sequence of HCoV-EMC/2012 revealed the two large, partially overlapping replicase open reading frames ORF1a and ORF1b, as well as (at least) nine downstream ORFs (
<xref ref-type="fig" rid="fig1">Fig. 1A</xref>
). The ORF1a sequence encodes the two protease domains conserved in all other coronaviruses, a papain-like protease (PL2pro) in nsp3 and a 3C-like protease (3CLpro; also known as the “main protease”) in nsp5. Sequence comparison with other coronaviruses allowed us to predict the putative pp1a/pp1ab cleavage sites and annotate the resulting nsp1 through -16 (
<xref ref-type="table" rid="tab1">Table 1</xref>
). According to sequence conservation analyses performed with other coronaviruses, open reading frames ORF2, -6, -7, and -8a are predicted to encode the four canonical structural proteins of coronaviruses, the envelope proteins S, E, and M and the N protein, respectively (
<xref ref-type="fig" rid="fig1">Fig. 1A</xref>
; see also
<xref ref-type="supplementary-material" rid="figS1">Fig. S1</xref>
in the supplemental material). A leader TRS and seven putative body TRSs could be readily identified, with the sequence
<named-content content-type="gene">5′ AACGAA 3′</named-content>
forming the conserved TRS core and potential TRS duplexes during leader-body joining ranging from 14 to 19 matches over a 22-nt window that includes the core of the leader TRS (
<xref ref-type="fig" rid="fig1">Fig. 1B</xref>
). From this analysis, it can be predicted that seven subgenomic mRNAs carrying a 67-nt common leader sequence would be produced in HCoV-EMC/2012-infected cells, with sizes ranging from ~4.7 kb for mRNA2 to ~1.7 kb for mRNA8. Experimental studies are needed to confirm the correct identification of the TRSs in the genomes of HCoV-EMC/2012 and related lineage C betacoronaviruses.</p>
<table-wrap id="tab1" position="float">
<label>TABLE 1 </label>
<caption>
<p>Cleavage products of the replicase polyproteins of HCoV-EMC/2012</p>
</caption>
<table frame="hsides" rules="groups">
<thead valign="bottom">
<tr>
<th align="left" rowspan="1" colspan="1">Cleavage product</th>
<th align="left" rowspan="1" colspan="1">Position in polyprotein
<break></break>
pp1a/pp1ab
<italic>
<sup>
<xref rid="ngtab1.1">a</xref>
</sup>
</italic>
</th>
<th align="left" rowspan="1" colspan="1">Protein size
<break></break>
(no. of amino acids)</th>
<th align="left" rowspan="1" colspan="1">Putative functional domain(s)
<italic>
<sup>
<xref rid="ngtab1.2">b</xref>
</sup>
</italic>
</th>
</tr>
</thead>
<tbody valign="bottom">
<tr>
<td align="left" rowspan="1" colspan="1">nsp1</td>
<td align="left" rowspan="1" colspan="1">1Met-Gly193</td>
<td align="left" rowspan="1" colspan="1">193</td>
<td rowspan="1" colspan="1"></td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">nsp2</td>
<td align="left" rowspan="1" colspan="1">194Asp-Gly853</td>
<td align="left" rowspan="1" colspan="1">660</td>
<td rowspan="1" colspan="1"></td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">nsp3</td>
<td align="left" rowspan="1" colspan="1">854Ala-Gly2740</td>
<td align="left" rowspan="1" colspan="1">1887</td>
<td align="left" rowspan="1" colspan="1">ADRP, PL2pro, TM1</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">nsp4</td>
<td align="left" rowspan="1" colspan="1">2741Ala-Gln3247</td>
<td align="left" rowspan="1" colspan="1">507</td>
<td align="left" rowspan="1" colspan="1">TM-2</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">nsp5</td>
<td align="left" rowspan="1" colspan="1">3248Ser-Gln3553</td>
<td align="left" rowspan="1" colspan="1">306</td>
<td align="left" rowspan="1" colspan="1">3CLpro</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">nsp6</td>
<td align="left" rowspan="1" colspan="1">3554Ser-Gln3845</td>
<td align="left" rowspan="1" colspan="1">292</td>
<td align="left" rowspan="1" colspan="1">TM-3</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">nsp7</td>
<td align="left" rowspan="1" colspan="1">3846Ser-Gln3928</td>
<td align="left" rowspan="1" colspan="1">83</td>
<td rowspan="1" colspan="1"></td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">nsp8</td>
<td align="left" rowspan="1" colspan="1">3929Ala-Gln4127</td>
<td align="left" rowspan="1" colspan="1">199</td>
<td align="left" rowspan="1" colspan="1">Putative primase</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">nsp9</td>
<td align="left" rowspan="1" colspan="1">4128Asn-Gln4237</td>
<td align="left" rowspan="1" colspan="1">110</td>
<td rowspan="1" colspan="1"></td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">nsp10</td>
<td align="left" rowspan="1" colspan="1">4238Ala-Gln4377</td>
<td align="left" rowspan="1" colspan="1">140</td>
<td rowspan="1" colspan="1"></td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">nsp11</td>
<td align="left" rowspan="1" colspan="1">4378Ser-Leu4391</td>
<td align="left" rowspan="1" colspan="1">14</td>
<td rowspan="1" colspan="1"></td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">nsp12</td>
<td align="left" rowspan="1" colspan="1">4378Ser-Gln5310</td>
<td align="left" rowspan="1" colspan="1">933</td>
<td align="left" rowspan="1" colspan="1">RdRp</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">nsp13</td>
<td align="left" rowspan="1" colspan="1">5311Ala-Gln5908</td>
<td align="left" rowspan="1" colspan="1">598</td>
<td align="left" rowspan="1" colspan="1">ZD, HEL1</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">nsp14</td>
<td align="left" rowspan="1" colspan="1">5909Ser-Gln6432</td>
<td align="left" rowspan="1" colspan="1">524</td>
<td align="left" rowspan="1" colspan="1">ExoN, NMT</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">nsp15</td>
<td align="left" rowspan="1" colspan="1">6433Gly-Gln6775</td>
<td align="left" rowspan="1" colspan="1">343</td>
<td align="left" rowspan="1" colspan="1">NendoU</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">nsp16</td>
<td align="left" rowspan="1" colspan="1">6776Ala-Arg7078</td>
<td align="left" rowspan="1" colspan="1">303</td>
<td align="left" rowspan="1" colspan="1">OMT</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="ngtab1.1">
<label>
<sup>a</sup>
</label>
<p> Amino acids of the replicase proteins pp1a and pp1ab were numbered with the assumption that a −1 ribosomal frameshift occurs to express ORF1b, as in other coronaviruses (see text); the use of the slippery sequence
<named-content content-type="gene">UUUAAAC</named-content>
is predicted to result in a peptide bond between Asn4385 and Arg4386 in pp1ab.</p>
</fn>
<fn id="ngtab1.2">
<label>
<sup>b</sup>
</label>
<p> The major transmembrane domains and a selection of the most conserved domains with enzymatic activities that have been characterized functionally and/or structurally in coronaviruses are listed. Abbreviations: PL2pro, papain-like proteinase 2; ADRP, ADP-ribose 1″-phosphatase; TM, transmembrane domain; 3CLpro, 3C-like cysteine proteinase; RdRp, RNA-dependent RNA polymerase; ZD, putative zinc-binding domain; HEL1, superfamily 1 helicase; ExoN, 3′-to-5′ exonuclease; NMT, N7-methyltransferase; NendoU, nidoviral endoribonuclease specific for U; OMT, S-adenosylmethionine-dependent ribose 2′-
<italic>O-</italic>
methyltransferase.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<p>Furthermore, mRNA4 and -8 are predicted to be functionally bicistronic, with ribosomal leaky scanning being the likely translation initiation mechanism for both ORF4b and ORF8b. The ORF4b AUG codon is not preceded by a separate body TRS, and the 241-nt sequence separating the 5′ ends of ORF4a and ORF4b is entirely devoid of AUG codons. The AUG codon of the current ORF8b, an internal ORF that is overlapped by the N protein gene (ORF8a) and is present in all betacoronaviruses, is the third AUG codon on mRNA8, but sequence analysis and comparison with the BtCov-HKU4 and -HKU5 sequences (
<xref ref-type="bibr" rid="B8">8</xref>
) suggests that the 5′ end of ORF8b may have become truncated relatively recently (see Discussion).</p>
<p>Twenty-two additional putative ORFs of 150 to 432 nt in length were detected throughout the genome of HCoV-EMC/2012, overlapping the major ORFs. In contrast to the ORFs shown in
<xref ref-type="fig" rid="fig1">Fig. 1A</xref>
, these 22 additional ORFs are not positioned (immediately) downstream of a body TRS, and hence it is unlikely that they are expressed. The synthesis of the replicase pp1ab polyprotein of HCoV-EMC/2012 involves −1 programmed ribosomal frameshifting, with nt 13427 to 13433 predicted to form the conserved “slippery sequence” (
<named-content content-type="gene">5′ UUUAAAC 3′</named-content>
) in the ORF1a/ORF1b overlap region that is typical for coronaviruses (
<xref ref-type="bibr" rid="B30">30</xref>
). The frameshift region is followed by a predicted RNA hairpin, formed by nucleotides at positions 13439 to 13450 base pairing with those at 13462 to 13473, with potential RNA pseudoknot formation occurring by base pairing of the loop of the hairpin (nt 13452 to 13460) with a downstream complementary sequence (nt 13506 to 13514). As is common in coronavirus genomes, nontranslated sequences are found only at the genomic termini, with the 5′ and 3′ untranslated regions (278 and 300 nt, respectively) having sizes similar to those found in other family members. The only other apparently untranslated region in the genome that is larger than 50 nucleotides concerns the intergenic region between ORF5 and -6 (nt 27515 to 27589). This region appears to be conserved between HCoV-EMC/2012, BtCoV-HKU4, and BtCoV-HKU5, with sequence identities ranging from 63% to 84%, but we have no explanation for this observation thus far.</p>
</sec>
<sec id="h1.3">
<title>Phylogenetic relations and taxonomic position of HCoV-EMC/2012.</title>
<p>Phylogenetic trees were inferred using nucleotide sequences for ORF1ab (
<xref ref-type="fig" rid="fig2">Fig. 2A</xref>
) and a 332-nt fragment from ORF1b (
<xref ref-type="fig" rid="fig2">Fig. 2B</xref>
) encoding the most conserved part of the RNA-dependent RNA polymerase (RdRp) domain, which is commonly targeted in virus discovery studies. The first tree was produced for a representative set of coronaviruses for which complete genome sequences are available. In the second tree, we also included coronaviruses for which only partial genome sequences are known, particularly that of P.pipi/VM314/2008/NLD (
<xref ref-type="bibr" rid="B31">31</xref>
) which produced the best match with HCoV-EMC/2012. In both trees, HCoV-EMC/2012 clearly groups within lineage C of the genus
<italic>Betacoronavirus</italic>
, relatively close to BtCoV-HKU4 and BtCoV-HKU5. However, based on the 332-nt fragment from ORF1b, HCoV-EMC/2012 is more closely related to bat-derived isolate VM314/2008 (GenBank accession number GQ259977), which was isolated from
<italic>Pipistrellus</italic>
bats in The Netherlands 4 years ago. Phylogenetic trees were also constructed based on amino acid sequences, using coronavirus-wide conserved domains of replicative proteins in pp1ab (
<xref ref-type="fig" rid="fig3">Fig. 3A</xref>
) as well as using conserved parts of structural proteins (
<xref ref-type="fig" rid="fig3">Fig. 3B</xref>
). In both trees, HCoV-EMC/2012 clusters with betacoronaviruses, supporting its classification as a member of the genus
<italic>Betacoronavirus</italic>
.</p>
<fig id="fig2" position="float">
<label>FIG 2</label>
<caption>
<p>Phylogenetic trees for HCoV-EMC/2012 and selected other coronaviruses. Unrooted maximum likelihood phylogenies inferred from the nucleotide sequences of full-length ORF1ab (A) or a 332-nt fragment from the RdRp-encoding domain of ORF1b (B) are shown. HCoV-EMC/2012 and 20 viruses representing the recognized species diversity of coronaviruses were included, with bat-derived isolate VM314/2008 also included in the analysis presented in panel B (
<xref ref-type="bibr" rid="B31">31</xref>
). The viruses and corresponding species used are
<italic>Alphacoronavirus 1</italic>
(Alpha-CoV1),
<italic>Human coronavirus 229E</italic>
(HCoV-229E),
<italic>Human coronavirus NL63</italic>
(HCoV-NL63),
<italic>Miniopterus bat coronavirus 1</italic>
(BtCoV-1AB),
<italic>Miniopterus bat coronavirus HKU8</italic>
(BtCoV-HKU8),
<italic>Porcine epidemic diarrhea virus</italic>
(PED),
<italic>Rhinolophus bat coronavirus HKU2</italic>
(BtCoV-HKU2),
<italic>Scotophilus bat coronavirus 512</italic>
(BtCoV-512),
<italic>Betacoronavirus 1</italic>
(Beta-CoV1),
<italic>Human coronavirus HKU1</italic>
(HCoV-HKU1),
<italic>Murine coronavirus</italic>
(MHV),
<italic>Tylonycteris bat coronavirus HKU4</italic>
(BtCoV-HKU4),
<italic>Pipistrellus bat coronavirus HKU5</italic>
(BtCoV-HKU5),
<italic>Rousettus bat coronavirus HKU9</italic>
(BtCoV-HKU9),
<italic>Severe acute respiratory syndrome-related coronavirus</italic>
(SARS-CoV),
<italic>Avian coronavirus</italic>
(IBV),
<italic>Beluga whale coronavirus SW1</italic>
(BWCoV-SW1),
<italic>Bulbul coronavirus HKU11</italic>
(ACoV-HKU11),
<italic>Thrush coronavirus HKU12</italic>
(ACoV-HKU12), and
<italic>Munia coronavirus HKU13</italic>
(ACoV-HKU13). Bootstrap values above 50 are shown. Arcs and symbols indicate the four coronavirus genera. The scale bar represents the number of nucleotide substitutions per site.</p>
</caption>
<graphic xlink:href="mbo0061213860002"></graphic>
</fig>
<fig id="fig3" position="float">
<label>FIG 3</label>
<caption>
<p>Phylogenetic trees for HCoV-EMC/2012 and selected other coronaviruses. Unrooted maximum likelihood phylogenies based on coronavirus-wide conserved protein domains in replicase pp1ab (A) or on the conserved parts of structural proteins S2, E, M, and N (B) for HCoV-EMC/2012 and 20 viruses representing the recognized species diversity of coronaviruses are shown (see
<xref ref-type="fig" rid="fig2">Fig. 2</xref>
legend for names and abbreviations). Branch support values are based on the Shimodaira-Hasegawa-like procedure and are in the range of zero to one; only nonoptimal values smaller than one are shown. Arcs and symbols indicate the four coronavirus genera. The scale bars represent average numbers of substitutions per amino acid position.</p>
</caption>
<graphic xlink:href="mbo0061213860003"></graphic>
</fig>
<p>ICTV assigns newly identified members of the family
<italic>Coronaviridae</italic>
to a subfamily and genus on the basis of rooted phylogeny and calculation of pairwise evolutionary distances for seven replicase domains (
<xref ref-type="bibr" rid="B1">1</xref>
,
<xref ref-type="bibr" rid="B21">21</xref>
). To establish whether HCoV-EMC/2012 indeed prototypes a new species, amino acid sequence alignments were generated for each of these domains and concatenated, after which the sequence identity of HCoV-EMC/2012 with closely related strains was calculated. For this purpose, the full genomes of 9 strains, derived from 3 species, belonging to
<italic>Betacoronavirus</italic>
lineage C were available (
<xref ref-type="table" rid="tab2">Table 2</xref>
). Amino acid sequence identity between conserved replicase domains of HCoV-EMC/2012 and those of other lineage C viruses ranged from 57% (ADP-ribose 1″-phosphatase [ADRP]) to 94% (helicase [Hel]). Overall amino acid sequence identities to BtCoV-HKU4 and BtCoV-HKU5 strains across the conserved domains were around 75% and 76.7%, respectively. These percentages are well below the threshold of 90% amino acid sequence identity that is used for coronavirus species identification by the ICTV. The distance between HCoV-EMC/2012 and members of these two species is as large as that observed upon interspecies comparison of other species pairs, for example,
<italic>Murine coronavirus</italic>
versus
<italic>Human coronavirus HKU1</italic>
or
<italic>Porcine epidemic diarrhea virus</italic>
versus
<italic>Scotophilus bat coronavirus 512</italic>
(
<xref ref-type="fig" rid="fig3">Fig. 3</xref>
). Consequently, we propose that HCoV-EMC/2012 prototypes a novel species of lineage C of the genus
<italic>Betacoronavirus</italic>
.</p>
<table-wrap id="tab2" position="float">
<label>TABLE 2 </label>
<caption>
<p>Percent amino acid sequence identity between conserved domains of the replicase polyprotein of HCoV-EMC/2012 and established betacoronaviruses
<italic>
<sup>
<xref rid="ngtab2.1">a</xref>
</sup>
</italic>
</p>
</caption>
<table frame="hsides" rules="groups">
<thead valign="bottom">
<tr>
<th align="left" rowspan="2" colspan="1">Virus strain</th>
<th align="left" colspan="8" rowspan="1">% amino acid sequence identity with conserved domain of the indicated HCoV-EMC/2012 replicase polyprotein
<italic>
<sup>
<xref rid="ngtab2.2">b</xref>
</sup>
</italic>
<hr></hr>
</th>
</tr>
<tr>
<th align="left" rowspan="1" colspan="1">ADRP</th>
<th align="left" rowspan="1" colspan="1">3CLpro</th>
<th align="left" rowspan="1" colspan="1">RdRp</th>
<th align="left" rowspan="1" colspan="1">Hel</th>
<th align="left" rowspan="1" colspan="1">ExoN</th>
<th align="left" rowspan="1" colspan="1">NendoU</th>
<th align="left" rowspan="1" colspan="1">O-MT</th>
<th align="left" rowspan="1" colspan="1">All domains</th>
</tr>
</thead>
<tbody valign="bottom">
<tr>
<td align="left" rowspan="1" colspan="1">BtCoV-HKU4.1</td>
<td align="left" rowspan="1" colspan="1">57.4</td>
<td align="left" rowspan="1" colspan="1">81.0</td>
<td align="left" rowspan="1" colspan="1">90.0</td>
<td align="left" rowspan="1" colspan="1">92.1</td>
<td align="left" rowspan="1" colspan="1">85.4</td>
<td align="left" rowspan="1" colspan="1">72.6</td>
<td align="left" rowspan="1" colspan="1">83.4</td>
<td align="left" rowspan="1" colspan="1">75.1</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">BtCoV-HKU4.2</td>
<td align="left" rowspan="1" colspan="1">57.5</td>
<td align="left" rowspan="1" colspan="1">81.0</td>
<td align="left" rowspan="1" colspan="1">90.0</td>
<td align="left" rowspan="1" colspan="1">92.1</td>
<td align="left" rowspan="1" colspan="1">85.4</td>
<td align="left" rowspan="1" colspan="1">72.6</td>
<td align="left" rowspan="1" colspan="1">83.4</td>
<td align="left" rowspan="1" colspan="1">75.1</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">BtCoV-HKU4.3</td>
<td align="left" rowspan="1" colspan="1">57.4</td>
<td align="left" rowspan="1" colspan="1">81.0</td>
<td align="left" rowspan="1" colspan="1">90.0</td>
<td align="left" rowspan="1" colspan="1">92.1</td>
<td align="left" rowspan="1" colspan="1">85.4</td>
<td align="left" rowspan="1" colspan="1">72.6</td>
<td align="left" rowspan="1" colspan="1">83.4</td>
<td align="left" rowspan="1" colspan="1">75.1</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">BtCoV-HKU4.4</td>
<td align="left" rowspan="1" colspan="1">57.5</td>
<td align="left" rowspan="1" colspan="1">81.0</td>
<td align="left" rowspan="1" colspan="1">89.9</td>
<td align="left" rowspan="1" colspan="1">92.1</td>
<td align="left" rowspan="1" colspan="1">85.4</td>
<td align="left" rowspan="1" colspan="1">72.6</td>
<td align="left" rowspan="1" colspan="1">83.4</td>
<td align="left" rowspan="1" colspan="1">74.9</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">BtCoV/133/2005</td>
<td align="left" rowspan="1" colspan="1">57.6</td>
<td align="left" rowspan="1" colspan="1">80.7</td>
<td align="left" rowspan="1" colspan="1">89.9</td>
<td align="left" rowspan="1" colspan="1">91.6</td>
<td align="left" rowspan="1" colspan="1">86.4</td>
<td align="left" rowspan="1" colspan="1">72.0</td>
<td align="left" rowspan="1" colspan="1">83.4</td>
<td align="left" rowspan="1" colspan="1">74.9</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">BtCoV-HKU5.1</td>
<td align="left" rowspan="1" colspan="1">57.6</td>
<td align="left" rowspan="1" colspan="1">82.6</td>
<td align="left" rowspan="1" colspan="1">92.1</td>
<td align="left" rowspan="1" colspan="1">93.8</td>
<td align="left" rowspan="1" colspan="1">91.7</td>
<td align="left" rowspan="1" colspan="1">79.7</td>
<td align="left" rowspan="1" colspan="1">85.3</td>
<td align="left" rowspan="1" colspan="1">76.7</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">BtCoV-HKU5.2</td>
<td align="left" rowspan="1" colspan="1">57.6</td>
<td align="left" rowspan="1" colspan="1">82.0</td>
<td align="left" rowspan="1" colspan="1">92.2</td>
<td align="left" rowspan="1" colspan="1">93.8</td>
<td align="left" rowspan="1" colspan="1">91.7</td>
<td align="left" rowspan="1" colspan="1">80.0</td>
<td align="left" rowspan="1" colspan="1">85.3</td>
<td align="left" rowspan="1" colspan="1">76.7</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">BtCoV-HKU5.3</td>
<td align="left" rowspan="1" colspan="1">57.2</td>
<td align="left" rowspan="1" colspan="1">82.0</td>
<td align="left" rowspan="1" colspan="1">92.2</td>
<td align="left" rowspan="1" colspan="1">93.8</td>
<td align="left" rowspan="1" colspan="1">91.7</td>
<td align="left" rowspan="1" colspan="1">80.0</td>
<td align="left" rowspan="1" colspan="1">85.3</td>
<td align="left" rowspan="1" colspan="1">76.7</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">BtCoV-HKU5.5</td>
<td align="left" rowspan="1" colspan="1">57.3</td>
<td align="left" rowspan="1" colspan="1">82.0</td>
<td align="left" rowspan="1" colspan="1">92.2</td>
<td align="left" rowspan="1" colspan="1">93.8</td>
<td align="left" rowspan="1" colspan="1">91.7</td>
<td align="left" rowspan="1" colspan="1">80.0</td>
<td align="left" rowspan="1" colspan="1">85.3</td>
<td align="left" rowspan="1" colspan="1">76.7</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="ngtab2.1">
<label>
<sup>a</sup>
</label>
<p> Accession numbers used are as follows: for BtCoV-HKU4 strains, EF065505, EF065506, EF065507, and EF065508; for BtCoV/133/2005, DQ648794; and for BtCoV-HKU5 strains, EF065509, EF065510, EF065511, and EF065512.</p>
</fn>
<fn id="ngtab2.2">
<label>
<sup>b</sup>
</label>
<p> For abbreviations, see
<xref ref-type="table" rid="tab1">Table 1</xref>
.</p>
</fn>
</table-wrap-foot>
</table-wrap>
</sec>
<sec id="h1.4">
<title>Genome similarities between coronavirus HCoV-EMC/2012 and coronaviruses BtCoV-HKU4 and BtCoV-HKU5.</title>
<p>BtCoV-HKU4 and BtCoV-HKU5 (
<xref ref-type="bibr" rid="B8">8</xref>
) are the closest relatives of HCoV-EMC/2012 for which full-length genome sequences are available (see above). Accordingly, comparison of the genomes of these three viruses revealed important similarities, including the organization of the “accessory protein genes,” ORF3a through ORF5, residing between the S protein gene and those encoding the E, M, and N proteins. Upon annotating this region of the BtCoV-HKU4 and BtCoV-HKU5 genomes, Woo et al. (
<xref ref-type="bibr" rid="B8">8</xref>
) identified the body TRSs for sg mRNA3, mRNA4, and mRNA5 but unfortunately did not follow standard coronavirus nomenclature, naming the downstream open reading frames ORF3a through ORF3d (encoding ns3a through ns3d) rather than ORF3, ORF4a, ORF4b, and ORF5 (
<xref ref-type="fig" rid="fig1">Fig. 1A</xref>
). The similarities of all ORFs and proteins of HCoV-EMC/2012, BtCoV-HKU4, and BtCoV-HKU5 were calculated, and percentages of sequence identity are summarized in
<xref ref-type="table" rid="tab3">Table 3</xref>
. The lowest percentages of sequence identity to BtCoV-HKU4 and BtCoV-HKU5 were observed for ORF3 at the nucleotide level (46.4% and 46.0%, respectively) and for ORF4b at the amino acid level (23.5% and 25.9%, respectively). The highest percentages of sequence identity to BtCoV-HKU4 and BtCoV-HKU5 were observed for the E ORF at the nucleotide level (74.6% and 75.1%, respectively) and for the M ORF at the amino acid level (82.6% and 82.2%, respectively). These data further supported the characterization of HCoV-EMC/2012 as a close relative of BtCoV-HKU4 and BtCoV-HKU5.</p>
<table-wrap id="tab3" position="float">
<label>TABLE 3 </label>
<caption>
<p>Percent identity between open reading frames of coronavirus HCoV-EMC/2012 and coronaviruses BtCoV-HKU4 and BtCoV-HKU5 at the nucleotide and amino acid levels</p>
</caption>
<table frame="hsides" rules="groups">
<thead valign="bottom">
<tr>
<th align="left" rowspan="2" colspan="1">Annotation
<break></break>
in HCoV-EMC/2012</th>
<th align="left" rowspan="2" colspan="1">Annotation in
<break></break>
BtCoV-HKU4
<break></break>
and BtCoV-HKU5
<italic>
<sup>
<xref rid="ngtab3.1">a</xref>
</sup>
</italic>
</th>
<th align="left" colspan="2" rowspan="1">% identity
<break></break>
to BtCoV-HKU4
<italic>
<sup>
<xref rid="ngtab3.2">b</xref>
</sup>
</italic>
<hr></hr>
</th>
<th align="left" colspan="2" rowspan="1">% identity
<break></break>
to BtCoV-HKU5
<italic>
<sup>
<xref rid="ngtab3.2">b</xref>
</sup>
</italic>
<hr></hr>
</th>
</tr>
<tr>
<th align="left" rowspan="1" colspan="1">nt</th>
<th align="left" rowspan="1" colspan="1">aa</th>
<th align="left" rowspan="1" colspan="1">nt</th>
<th align="left" rowspan="1" colspan="1">aa</th>
</tr>
</thead>
<tbody valign="bottom">
<tr>
<td align="left" rowspan="1" colspan="1">ORF1ab</td>
<td align="left" rowspan="1" colspan="1">ORF1ab</td>
<td align="left" rowspan="1" colspan="1">70.6</td>
<td align="left" rowspan="1" colspan="1">72.2</td>
<td align="left" rowspan="1" colspan="1">70.7</td>
<td align="left" rowspan="1" colspan="1">73.8</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">S</td>
<td align="left" rowspan="1" colspan="1">S</td>
<td align="left" rowspan="1" colspan="1">66.3</td>
<td align="left" rowspan="1" colspan="1">66.1</td>
<td align="left" rowspan="1" colspan="1">63.8</td>
<td align="left" rowspan="1" colspan="1">63.5</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">ORF3</td>
<td align="left" rowspan="1" colspan="1">NS3a</td>
<td align="left" rowspan="1" colspan="1">46.4</td>
<td align="left" rowspan="1" colspan="1">34.9</td>
<td align="left" rowspan="1" colspan="1">46.0</td>
<td align="left" rowspan="1" colspan="1">31.4</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">ORF4a</td>
<td align="left" rowspan="1" colspan="1">NS3b</td>
<td align="left" rowspan="1" colspan="1">51.5</td>
<td align="left" rowspan="1" colspan="1">37.5</td>
<td align="left" rowspan="1" colspan="1">47.8</td>
<td align="left" rowspan="1" colspan="1">38.0</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">ORF4b</td>
<td align="left" rowspan="1" colspan="1">NS3c</td>
<td align="left" rowspan="1" colspan="1">35.1</td>
<td align="left" rowspan="1" colspan="1">23.5</td>
<td align="left" rowspan="1" colspan="1">45.2</td>
<td align="left" rowspan="1" colspan="1">25.9</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">ORF5</td>
<td align="left" rowspan="1" colspan="1">NS3d</td>
<td align="left" rowspan="1" colspan="1">56.6</td>
<td align="left" rowspan="1" colspan="1">46.9</td>
<td align="left" rowspan="1" colspan="1">58.1</td>
<td align="left" rowspan="1" colspan="1">54.2</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">E</td>
<td align="left" rowspan="1" colspan="1">E</td>
<td align="left" rowspan="1" colspan="1">74.6</td>
<td align="left" rowspan="1" colspan="1">69.5</td>
<td align="left" rowspan="1" colspan="1">75.1</td>
<td align="left" rowspan="1" colspan="1">68.2</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">M</td>
<td align="left" rowspan="1" colspan="1">M</td>
<td align="left" rowspan="1" colspan="1">72.8</td>
<td align="left" rowspan="1" colspan="1">82.6</td>
<td align="left" rowspan="1" colspan="1">73.0</td>
<td align="left" rowspan="1" colspan="1">82.2</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">N</td>
<td align="left" rowspan="1" colspan="1">N</td>
<td align="left" rowspan="1" colspan="1">67.2</td>
<td align="left" rowspan="1" colspan="1">71.8</td>
<td align="left" rowspan="1" colspan="1">66.7</td>
<td align="left" rowspan="1" colspan="1">67.8</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">ORF8b</td>
<td align="left" rowspan="1" colspan="1">Undescribed</td>
<td align="left" rowspan="1" colspan="1">45.3</td>
<td align="left" rowspan="1" colspan="1">32.1</td>
<td align="left" rowspan="1" colspan="1">48.0</td>
<td align="left" rowspan="1" colspan="1">33.8</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="ngtab3.1">
<label>
<sup>a</sup>
</label>
<p>Annotations used for HCoV-EMC/2012 differ from those used for BtCoV-HKU4 and BtCoV-HKU5 (10).</p>
</fn>
<fn id="ngtab3.2">
<label>
<sup>b</sup>
</label>
<p> Accession numbers used for BtCoV-HKU4 and BtCoV-HKU5 were
<ext-link ext-link-type="gen" xlink:href="EF065505">EF065505</ext-link>
and
<ext-link ext-link-type="gen" xlink:href="EF065509">EF065509</ext-link>
.</p>
</fn>
</table-wrap-foot>
</table-wrap>
</sec>
</sec>
<sec sec-type="discussion" id="h2">
<title>DISCUSSION</title>
<p>Coronaviruses have been known for quite some time as viruses that cause a variety of diseases in humans and animals (
<xref ref-type="bibr" rid="B32">32</xref>
,
<xref ref-type="bibr" rid="B33">33</xref>
). The discovery of a coronavirus as the causative agent of SARS revived the interest in coronaviruses and resulted in a rapid increase of the number of identified coronaviruses, as well as of the number of full coronavirus genome sequences. Until this study, lineage C of the genus
<italic>Betacoronavirus</italic>
(formerly known as subgroup 2c) included virus isolates from bats. Here, we determined and analyzed the complete genome sequence of a previously unknown lineage C betacoronavirus that was isolated from the sputum of a 60-year-old male suffering from acute pneumonia and renal failure in the Kingdom of Saudi Arabia whose death was probably a consequence of this infection (
<xref ref-type="bibr" rid="B24">24</xref>
).</p>
<p>The sequencing of the full HCoV-EMC/2012 genome was greatly facilitated by the advent of high-throughput techniques. Using an optimized random amplification deep-sequencing approach, approximately 90% of the virus genome was covered with high accuracy in a single run. Using the data from this first run, primers could be designed to perform conventional Sanger sequencing for confirmation. This combination of techniques allowed the determination of the complete virus genome within a few days, without a requirement for prior knowledge of the virus genome under investigation. The error rate in 454 deep sequencing was generally higher than in Sanger sequencing, but the high coverage across the HCoV-EMC/2012 virus genome (up to 5,697 reads per nucleotide position) corrected for most of the incorrect base callings. The sequence obtained using the 454 platform aligned almost perfectly with that obtained by Sanger sequencing, with the exception of two nucleotide positions. The deep-sequencing data revealed variation at position 11623 (U or G), with G occurring in 44% of the reads, suggesting that ORF1a-encoded residue 3782 can be either valine (codon GUC) or glycine (codon GGC). The valine codon was the more abundant codon at this position in HCoV-EMC/2012, and valine is also present in most other betacoronaviruses. At position 27162, both G and A were detected in different runs, with an A in 45% of the reads. This G-to-A substitution introduces a premature stop codon (UGG to UAG) in ORF5. The virus stock that we sequenced was derived from passage of the virus from a sputum specimen six times in Vero cell culture. Hence, the observed sequence variants may reflect either natural heterogeneity or emerging genetic changes that occurred during virus passage in cell culture. Additional HCoV-EMC/2012 virus isolates or patient materials are currently not available to verify these genome sequence ambiguities at positions 11623 and 27162.</p>
<p>Adaptation to cell culture leading to a loss of functionality of genes, and in particular in relation to the so-called “accessory protein genes,” has previously been described for a variety of coronaviruses, including SARS-CoV (
<xref ref-type="bibr" rid="B2">2</xref>
,
<xref ref-type="bibr" rid="B34">34</xref>
). These genes, like ORF3 through ORF5 of HCoV-EMC/2012, are dispersed between the structural protein genes (
<xref ref-type="bibr" rid="B35">35</xref>
) and in some cases may even overlap such a gene, as in the case of the ORF overlapping the N protein gene in betacoronaviruses (
<xref ref-type="fig" rid="fig1">Fig. 1A</xref>
) (23, 36). The origin of most accessory protein genes remains unclear, although for some, acquisition by recombination with cellular or heterologous viral sequences seems plausible (
<xref ref-type="bibr" rid="B37">37</xref>
,
<xref ref-type="bibr" rid="B38">38</xref>
). Accessory gene functions have been probed by reverse genetics (knockout mutants) for a variety of coronaviruses, including SARS coronavirus (
<xref ref-type="bibr" rid="B39">39</xref>
), which established that they are not essential for replication in cell culture systems. In animal models, on the other hand, profound effects on pathogenesis after the inactivation (or transfer to a heterologous coronavirus) of accessory protein genes have been previously described (
<xref ref-type="bibr" rid="B40">40</xref>
<xref ref-type="bibr" rid="B42">42</xref>
). In some cases, accessory gene products have been implicated in immune evasion, e.g., by interfering with cellular innate immune signaling (
<xref ref-type="bibr" rid="B43">43</xref>
).</p>
<p>The apparent absence of selection pressure on coronavirus accessory protein genes during cell culture passage may explain the relatively high frequency with which loss of functionality appears to occur. The detection of an internal termination codon in part of the HCoV-EMC/2012 ORF5 sequences (45% of the reads) may constitute another example of such an event, which would lead to the truncation of the ORF5 protein after 107 amino acids. This would resemble a 29-nt deletion that occurred in the SARS-CoV genome, which resulted in the truncation of ORF8 (
<xref ref-type="bibr" rid="B34">34</xref>
,
<xref ref-type="bibr" rid="B44">44</xref>
), and a 45-nt in-frame deletion in ORF7b of the same virus that emerged upon cell culture passage (
<xref ref-type="bibr" rid="B23">23</xref>
).</p>
<p>Our analysis identified a potential ORF underlying the N protein gene (ORF8a), which is a common feature in betacoronaviruses. This ORF was not previously described for BtCoV-HKU4 and BtCoV-HKU5 (
<xref ref-type="bibr" rid="B8">8</xref>
) but is conserved in the genome sequences of both viruses (see
<xref ref-type="supplementary-material" rid="figS1">Fig. S1J</xref>
in the supplemental material). Remarkably, in HCoV-EMC/2012, both the 5′ and 3′ parts of the ORF appear to have been truncated. In BtCoV-HKU4 and BtCoV-HKU5, the ORF8b AUG codon would be the second AUG on sg mRNA8, making leaky ribosomal scanning a likely mechanism for translation initiation. In HCoV-EMC/2012, however, this AUG codon (positions 28606 to 28608) seems to have been mutated to AUA. Conservation of the sequence immediately downstream of this position, which is now formally upstream of ORF8b in HCoV-EMC/2012, was observed with BtCoV-HKU4 and BtCoV-HKU5, suggesting that the putative loss of this AUG codon may also have been a relatively recent event. In the 3′ part of ORF8b, sequence alignment of HCoV-EMC/2012 with BtCoV-HKU4 and BtCoV-HKU5 suggests that the former acquired a premature termination codon at positions 29099 to 29101 (UAA). Although we cannot at present assess the timing of these events in HCoV-EMC/2012 evolution, due to the lack of alternative samples for this species, the presumed loss of ORF8b functionality may also be a consequence of virus passage in cell culture.</p>
<p>To classify newly identified coronaviruses as the prototype of a novel virus species, it is required that the amino acid sequence identity in the conserved replicase domains in all intervirus pairwise comparisons is below the 90% threshold (
<xref ref-type="bibr" rid="B1">1</xref>
). Here, we propose HCoV-EMC/2012 to represent a novel species of the betacoronavirus genus, since the amino acid sequence identities between HCoV-EMC/2012 and its closest relatives BtCoV-HKU4 and BtCoV-HKU5 in the seven conserved domains of ORF1ab were 75% and 77%, respectively. These viruses were originally detected in Asia in lesser bamboo bats (
<italic>Tylonycteris pachypus</italic>
) and Japanese house bats (
<italic>Pipistrellus abramus</italic>
), respectively (
<xref ref-type="bibr" rid="B8">8</xref>
). This proposed classification will remain provisional until approved by ICTV.</p>
<p>The ICTV guidelines for coronavirus species demarcation require the availability of a (nearly) complete genome sequence prior to virus classification. However, there is considerable correlation between the results based on full-genome sequence analysis and those determined using the most conserved part of the ORF1b-encoded RdRp domain, which is commonly used in screening for new coronaviruses. In 2010, this partial sequence was reported for a betacoronavirus (VM314/2008) that was isolated 2 years earlier from a
<italic>Pipistrellus pipistrellus</italic>
bat in The Netherlands. This virus was provisionally classified a betacoronavirus based on a 332-nt fragment from the RdRp-encoding domain of ORF1b (
<xref ref-type="bibr" rid="B31">31</xref>
), which shares 88% nucleotide sequence identity with HCoV-EMC/2012, the highest identity with any coronavirus sequence available in the public domain. Although this high similarity is not sufficient to resolve the taxonomic relation between HCoV-EMC/2012 and isolate VM314/2008, it suggests that they may both belong to the same coronavirus species. Establishing the genome sequence of VM314/2008, or closely related viruses, is urgently required to verify this hypothesis. Based on the genetic relation between HCoV-EMC/2012 and bat coronaviruses, it is tempting to speculate that HCoV-EMC/2012 emerged from bats—either directly or via an intermediate animal host, possibly
<italic>Pipistrellus</italic>
bats. This bat species is known to be present in the Kingdom of Saudi Arabia and neighboring countries.</p>
<p>Although most infections of human coronaviruses are relatively mild, the infection by HCoV-EMC/2012 with fatal outcome, and a similar severe case of an infection with a closely related coronavirus in London (
<xref ref-type="bibr" rid="B25">25</xref>
), is a reminder that certain coronaviruses may cause severe and sometimes fatal infections in humans. It is important to develop an animal model that can be used to fulfill Koch’s postulates for the novel virus, by demonstrating that the isolated virus can indeed cause the observed disease. The availability of the HCoV-EMC/2012 genome sequence will facilitate the development of a variety of diagnostic assays that can be used to study the prevalence and clinical impact of HCoV-EMC/2012 infections in humans. The first generation of assays for this purpose has recently been described (
<xref ref-type="bibr" rid="B45">45</xref>
). We anticipate that the availability of this full-length virus genome sequence will be valuable for the development of additional applied and fundamental research.</p>
</sec>
<sec sec-type="materials|methods" id="h3">
<title>MATERIALS AND METHODS</title>
<sec id="h3.1">
<title>Virus propagation.</title>
<p>Patient material had been subjected to passage in Vero cells four times in the Dr. Soliman Fakeeh Hospital, Jeddah, Saudi Arabia. Subsequently, in the Erasmus Medical Center, Rotterdam, The Netherlands, LLC-MK2 cells were inoculated with HCoV-EMC/2012 in minimal essential medium (MEM-Eagle) with Earle’s salts (BioWhittaker, Verviers, Belgium), supplemented with 2% serum, 100 U/ml penicillin, 100 mg/ml streptomycin, and 2 mM glutamine. Vero cells were inoculated with virus in Dulbecco’s modified Eagle medium (BioWhittaker) supplemented with 1% serum, 100 U/ml penicillin, 100 mg/ml streptomycin, and 2 mM glutamine. After inoculation, the cultures were incubated at 37°C in a CO
<sub>2</sub>
incubator and checked daily for cytopathic changes. Three days after inoculation, supernatant from Vero cells was collected and used for virus genome characterization.</p>
</sec>
<sec id="h3.2">
<title>Arbitrarily primed PCR and virus genome sequencing.</title>
<p>To characterize the viral genome, we used a random amplification deep-sequencing approach as the first step. Virus-containing supernatant was centrifuged for 10 min at 3,000 rpm to remove cellular debris. This supernatant was then filtered through a 0.45-µm-pore-size centrifugal filter unit (Millipore, Amsterdam, The Netherlands) to minimize bacterial contamination. Omnicleave endonuclease (Epicenter Biotechnologies, Madison, WI) was used to remove any free DNA and RNA, according to the manufacturer’s protocol. Subsequently, viral RNA was extracted from the purified, infected cell culture supernatant using a High Pure RNA isolation kit (Roche Diagnostics, Almere, The Netherlands). To remove contaminating mammalian rRNA, a Ribo-Zero RZH110424 rRNA removal kit (Epicenter Biotechnologies, Madison, WI) was used according to the manufacturer’s protocol. RNA was reverse transcribed using circular permuted primers (
<xref ref-type="bibr" rid="B46">46</xref>
) that were extended with random hexamer sequences, namely,
<named-content content-type="gene">CCCACCACCAGAGAGAAAN</named-content>
(6),
<named-content content-type="gene">ACCAGAGAGAAACCCACCN</named-content>
(6),
<named-content content-type="gene">GAGAAACCCACCACCAGAN</named-content>
(6),
<named-content content-type="gene">GGAGGCAAGCGAACGCAAN</named-content>
(6),
<named-content content-type="gene">AAGCGAACGCAAGGAGGCN</named-content>
(6), and
<named-content content-type="gene">ACGCAAGGAGGCAAGCGAN</named-content>
(6). Per reaction, reverse transcription mixtures contained 6 µl RNA, 1 µl primer (20 pmol), 0.5 µl (20 U) RNase inhibitor (Promega, Leiden, The Netherlands), 1 µl (10 mM each) deoxynucleoside triphosphates (Roche), and 5 µl water. After a 5 min incubation at 65°C for optimal primer hybridization to the template, 4 µl (10×) first-strand buffer, 1 µl (200 U/µl) SuperScript III reverse transcriptase (Invitrogen, Bleiswijk, The Netherlands), 1 µl (0.1 M) dithiothreitol (DTT), and 0.5 µl (20 U) RNase inhibitor (Promega) were added to the mixture in a 20-µl volume. To obtain cDNA, the reverse transcription mixture was sequentially incubated at 25°C for 5 min and at 42°C for 1 h. After 3 min at 95°C and 2 min on ice, 1 µl Klenow DNA polymerase (5 U) (New England BioLabs Inc., Ipswich, MA) was added and the mixture was sequentially incubated at 25°C for 5 min, 37°C for 1 h, and 75°C for 20 min to obtain double-stranded cDNA. The cDNA was purified using a MinElute PCR purification kit (Qiagen, Venlo, The Netherlands) according to the instructions of the manufacturer. To amplify the purified cDNA, a PCR with the individual circular permuted primers without the random hexamer was performed. The PCR mixture contained 2 µl primer (40 pmol), 2 µl purified cDNA, 1.25 µl (10 mM each) deoxynucleoside triphosphate (Roche), 5 µl (10×) PfuUltra II Rxn buffer, and 1 µl (2.5 U) PfuUltra II DNA polymerase (Stratagene, Amsterdam, The Netherlands). Water was added to reach a final volume of 50 µl. The PCR mixture was incubated at 95°C for 2 min and then for 40 cycles of 95°C for 20 s, 56°C for 1 min, and 72°C for 2 min, followed by a final extension at 72°C for 10 min. Fragments were purified using a MinElute PCR purification kit (Qiagen) according to the instructions of the manufacturer.</p>
<p>Amplified fragments were sequenced using a 454/Roche GS Junior sequencing platform. A fragment library was created according to the manufacturer’s protocol without DNA fragmentation (GS FLX Titanium rapid library preparation; Roche), selecting for fragments larger than 100 bp. The emulsion-based PCR (emPCR) (amplification method Lib-L) and GS Junior sequencing run were performed according to the instructions of the manufacturer (Roche). The sequence reads were trimmed at 30 nt from the 3′ and 5′ ends to remove all primer sequences. Sequence reads were assembled into contigs using CLC Genomics 5.5.1 software (CLC Bio, Aarhus, Denmark). Using this deep-sequencing approach, approximately 90% of the virus genome sequence was obtained.</p>
<p>As a second step, specific primers were designed to amplify overlapping fragments of approximately 800 bp by RT-PCR. These PCR products were purified from agarose gels and sequenced using a BigDye Terminator v3.1 cycle sequencing kit (Applied Biosystems, Nieuwerkerk a/d IJssel, The Netherlands) and a 3130XL genetic analyzer (Applied Biosystems), according to the instructions of the manufacturers. The genomic 5′- and 3′-terminal sequences were determined using a FirstChoice RLM-RACE kit (Ambion, Bleiswijk, The Netherlands).</p>
</sec>
<sec id="h3.3">
<title>Virus classification.</title>
<p>Newly identified members of the family
<italic>Coronaviridae</italic>
are generally assigned by the ICTV to a subfamily and genus on the basis of rooted phylogeny and calculation of pairwise evolutionary distances for seven replicase polyprotein domains (
<xref ref-type="bibr" rid="B1">1</xref>
): the ADP-ribose 1″-phosphatase (ADRP) in nsp3, the coronavirus 3C-like (3 CL) protease (3CLpro, or “main protease”) in nsp5, the RNA-dependent RNA polymerase (RdRp) in nsp12, the helicase (Hel) in nsp13, the exoribonuclease (ExoN) in nsp14, the nidoviral endoribonuclease specific for U (NendoU) in nsp15, and the ribose-2′-
<italic>O</italic>
-methyltransferase (O-MT) in nsp16. Amino acid sequence alignments were generated for each of these domains using ClustalW within the BioEdit (version 7.0.5.3) (
<xref ref-type="bibr" rid="B47">47</xref>
) program and concatenated, after which the sequence identity of HCoV-EMC/2012 with closely related strains was calculated. For this purpose, the full genomes of 9 strains, derived from 3 species, belonging to
<italic>Betacoronavirus</italic>
lineage C were available.</p>
<p>To support virus classification, protein-based phylogenetic trees were generated. Multiple amino acid alignments, including sequences of HCoV-EMC/2012 and one representative of each of the 20 recognized species of the subfamily
<italic>Coronavirinae</italic>
, were produced for the following proteins, using the Viralis platform (
<xref ref-type="bibr" rid="B48">48</xref>
) followed by manual correction: ADRP, the N-terminal part of PLP2, TM1, Y domain, nsp4 to nsp16, and the C-terminal part of the spike (S) protein (S2), envelope (E) protein, membrane (M) protein, and nucleocapsid (N) protein. From each protein alignment, the most informative blocks (
<xref ref-type="bibr" rid="B49">49</xref>
) were extracted using the BAGG program (
<xref ref-type="bibr" rid="B50">50</xref>
), and only these strongly conserved alignment regions were used for further analyses. Two concatenated alignments were used. The first included replicase pp1ab protein regions (4,110 aa positions, gap content of 0.9%), and the second included regions in the C-terminal domain of the S protein (S2) and the E, M, and N proteins (1,127 aa positions, gap content of 3.9%). ProtTest version 3.2 (
<xref ref-type="bibr" rid="B51">51</xref>
) was used to select the best-fitting model of protein evolution. For both datasets, the LG model with rate heterogeneity (4 categories) ranked top among 112 models tested, with a relative weight of 0.98 under the Bayesian information criterion (BIC) and 0.74 under the corrected Akaike information criterion (AICc) for the first data set and 0.97/0.75 (BIC/AICc) for the second data set. Hence, this model was applied for maximum likelihood phylogeny reconstruction using PhyML version 3.0 (
<xref ref-type="bibr" rid="B52">52</xref>
).</p>
</sec>
<sec id="h3.4">
<title>Phylogenetic reconstruction.</title>
<p>Nucleotide sequences were aligned using the ClustalW software running within the BioEdit (version 7.0.5.3) (
<xref ref-type="bibr" rid="B47">47</xref>
) program and MAFFT version 6 (
<xref ref-type="bibr" rid="B53">53</xref>
). Maximum likelihood phylogenetic trees with 100 bootstrap replicates were estimated under the general time-reversible model (GTR) + I + Γ4 and the transversion model (TVM) + I + Γ4 (determined by ModelTest [54]), using PhyML 3.0 software (
<xref ref-type="bibr" rid="B52">52</xref>
). For both the 332-nt ORF1ab alignment that included isolate VM314/2008 and the alignment of the complete ORF1ab, the GTR + I + Γ4 model ranked top among 65 models tested, with relative weights of 0.8185 and 1.000 under AIC, respectively.</p>
</sec>
<sec id="h3.5">
<title>Nucleotide sequence accession number.</title>
<p>The final HCoV-EMC/2012 consensus sequence was submitted to GenBank under accession number
<ext-link ext-link-type="gen" xlink:href="JX869059">JX869059</ext-link>
.</p>
</sec>
</sec>
<sec sec-type="supplementary-material" id="h4">
<title>SUPPLEMENTAL MATERIAL</title>
<supplementary-material id="figS1">
<label>Figure S1</label>
<caption>
<p>Alignments of (poly)proteins encoded by the genome of HCoV-EMC/2012 and its two closest relatives, BtCoV-HKU4 and BtCoV-HKU5. Expert-curated multiple alignments for pp1ab (A), S protein (B), ORF3 protein (C), ORF4a protein (D), ORF4b protein (E), ORF5 protein (F), E protein (G), M protein (H), N protein (I), and the ORF8b product (J) are shown. (A) Amino acid residues in pp1ab between which cleavage by viral proteinases is predicted to occur are highlighted in gray. Download
<ext-link ext-link-type="uri" xlink:href="/lookup/suppl/doi:10.1128/mBio.00473-12/-/DCSupplemental/mbo006121386sf01.doc">Figure S1, DOC file, 0.1 MB</ext-link>
.</p>
</caption>
<media xlink:href="mbo006121386sf01.doc" mimetype="application" mime-subtype="msword">
<caption>
<p>Figure S1, DOC file, 0.1 MB</p>
</caption>
</media>
</supplementary-material>
<supplementary-material id="tabS1">
<label>Table S1</label>
<caption>
<p>Primers designed to amplify ~800-nucleotide PCR fragments with 100-nucleotide overlaps covering the entire HCoV-EMC/2012 genome.</p>
</caption>
<media xlink:href="mbo006121386st1.docx" mimetype="application" mime-subtype="msword">
<caption>
<p>Table S1, DOCX file, 0.1 MB.</p>
</caption>
</media>
</supplementary-material>
</sec>
</body>
<back>
<fn-group>
<fn fn-type="other">
<p>
<bold>Citation</bold>
van Boheemen S, et al. 2012. Genomic characterization of a newly discovered coronavirus associated with acute respiratory distress syndrome in humans. mBio 3(6):e00473-12. doi:10.1128/mBio.00473-12.</p>
</fn>
</fn-group>
<ack>
<title>ACKNOWLEDGMENTS</title>
<p>We thank Igor Sidorov, Dmitry Samborskiy, and Alexander Kravchenko (partially supported through the MoBiLe program) for help and administration of the Viralis.</p>
<p>The research leading to these results has received funding from the
<funding-source>European Union Seventh Framework Programme (FP7) under EMPERIE</funding-source>
grant agreement no.
<award-id>223498</award-id>
and SILVER grant agreement no.
<award-id>260644</award-id>
.</p>
</ack>
<ref-list>
<title>REFERENCES</title>
<ref id="B1">
<label>1.</label>
<mixed-citation publication-type="book">
<person-group person-group-type="author">
<name>
<surname>de Groot</surname>
<given-names>RJ</given-names>
</name>
<etal></etal>
</person-group>
<year>2012</year>
<article-title>Family
<italic>Coronaviridae</italic>
</article-title>
, p.
<fpage>806</fpage>
<lpage>828</lpage>
<italic>In</italic>
<person-group person-group-type="editor">
<name>
<surname>King</surname>
<given-names>AMQ</given-names>
</name>
<name>
<surname>Adams</surname>
<given-names>MJ</given-names>
</name>
<name>
<surname>Cartens</surname>
<given-names>EB</given-names>
</name>
<name>
<surname>Lefkowitz</surname>
<given-names>EJ</given-names>
</name>
</person-group>
,
<source>Virus taxonomy, the 9th report of the international committee on taxonomy of viruses</source>
.
<publisher-name>Academic Press</publisher-name>
,
<publisher-loc>San Diego, CA.</publisher-loc>
</mixed-citation>
</ref>
<ref id="B2">
<label>2.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Perlman</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Netland</surname>
<given-names>J</given-names>
</name>
</person-group>
<year>2009</year>
<article-title>Coronaviruses post-SARS: update on replication and pathogenesis</article-title>
.
<source>Nat. Rev. Microbiol</source>
.
<volume>7</volume>
:
<fpage>439</fpage>
<lpage>450</lpage>
<pub-id pub-id-type="pmid">19430490</pub-id>
</mixed-citation>
</ref>
<ref id="B3">
<label>3.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gloza-Rausch</surname>
<given-names>F</given-names>
</name>
<etal></etal>
</person-group>
<year>2008</year>
<article-title>Detection and prevalence patterns of group I coronaviruses in bats, northern Germany</article-title>
.
<source>Emerg. Infect. Dis</source>
.
<volume>14</volume>
:
<fpage>626</fpage>
<lpage>631</lpage>
<pub-id pub-id-type="pmid">18400147</pub-id>
</mixed-citation>
</ref>
<ref id="B4">
<label>4.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lau</surname>
<given-names>SK</given-names>
</name>
<etal></etal>
</person-group>
<year>2005</year>
<article-title>Severe acute respiratory syndrome coronavirus-like virus in Chinese horseshoe bats</article-title>
.
<source>Proc. Natl. Acad. Sci. U. S. A</source>
.
<volume>102</volume>
:
<fpage>14040</fpage>
<lpage>14045</lpage>
<pub-id pub-id-type="pmid">16169905</pub-id>
</mixed-citation>
</ref>
<ref id="B5">
<label>5.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>W</given-names>
</name>
<etal></etal>
</person-group>
<year>2005</year>
<article-title>Bats are natural reservoirs of SARS-like coronaviruses</article-title>
.
<source>Science</source>
<volume>310</volume>
:
<fpage>676</fpage>
<lpage>679</lpage>
<pub-id pub-id-type="pmid">16195424</pub-id>
</mixed-citation>
</ref>
<ref id="B6">
<label>6.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pfefferle</surname>
<given-names>S</given-names>
</name>
<etal></etal>
</person-group>
<year>2009</year>
<article-title>Distant relatives of severe acute respiratory syndrome coronavirus and close relatives of human coronavirus 229E in bats, Ghana</article-title>
.
<source>Emerg. Infect. Dis</source>
.
<volume>15</volume>
:
<fpage>1377</fpage>
<lpage>1384</lpage>
<pub-id pub-id-type="pmid">19788804</pub-id>
</mixed-citation>
</ref>
<ref id="B7">
<label>7.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Vijaykrishna</surname>
<given-names>D</given-names>
</name>
<etal></etal>
</person-group>
<year>2007</year>
<article-title>Evolutionary insights into the ecology of coronaviruses</article-title>
.
<source>J. Virol</source>
.
<volume>81</volume>
:
<fpage>4012</fpage>
<lpage>4020</lpage>
<pub-id pub-id-type="pmid">17267506</pub-id>
</mixed-citation>
</ref>
<ref id="B8">
<label>8.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Woo</surname>
<given-names>PC</given-names>
</name>
<etal></etal>
</person-group>
<year>2007</year>
<article-title>Comparative analysis of twelve genomes of three novel group 2c and group 2d coronaviruses reveals unique group and subgroup features</article-title>
.
<source>J. Virol</source>
.
<volume>81</volume>
:
<fpage>1574</fpage>
<lpage>1585</lpage>
<pub-id pub-id-type="pmid">17121802</pub-id>
</mixed-citation>
</ref>
<ref id="B9">
<label>9.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hamre</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Procknow</surname>
<given-names>JJ</given-names>
</name>
</person-group>
<year>1966</year>
<article-title>A new virus isolated from the human respiratory tract</article-title>
.
<source>Proc. Soc. Exp. Biol. Med</source>
.
<volume>121</volume>
:
<fpage>190</fpage>
<lpage>193</lpage>
<pub-id pub-id-type="pmid">4285768</pub-id>
</mixed-citation>
</ref>
<ref id="B10">
<label>10.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>McIntosh</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Dees</surname>
<given-names>JH</given-names>
</name>
<name>
<surname>Becker</surname>
<given-names>WB</given-names>
</name>
<name>
<surname>Kapikian</surname>
<given-names>AZ</given-names>
</name>
<name>
<surname>Chanock</surname>
<given-names>RM</given-names>
</name>
</person-group>
<year>1967</year>
<article-title>Recovery in tracheal organ cultures of novel viruses from patients with respiratory disease</article-title>
.
<source>Proc. Natl. Acad. Sci. U. S. A</source>
.
<volume>57</volume>
:
<fpage>933</fpage>
<lpage>940</lpage>
<pub-id pub-id-type="pmid">5231356</pub-id>
</mixed-citation>
</ref>
<ref id="B11">
<label>11.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Drosten</surname>
<given-names>C</given-names>
</name>
<etal></etal>
</person-group>
<year>2003</year>
<article-title>Identification of a novel coronavirus in patients with severe acute respiratory syndrome</article-title>
.
<source>N Engl J. Med</source>
.
<volume>348</volume>
:
<fpage>1967</fpage>
<lpage>1976</lpage>
<pub-id pub-id-type="pmid">12690091</pub-id>
</mixed-citation>
</ref>
<ref id="B12">
<label>12.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Marra</surname>
<given-names>MA</given-names>
</name>
<etal></etal>
</person-group>
<year>2003</year>
<article-title>The genome sequence of the SARS-associated coronavirus</article-title>
.
<source>Science</source>
<volume>300</volume>
:
<fpage>1399</fpage>
<lpage>1404</lpage>
<pub-id pub-id-type="pmid">12730501</pub-id>
</mixed-citation>
</ref>
<ref id="B13">
<label>13.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Peiris</surname>
<given-names>JS</given-names>
</name>
<name>
<surname>Guan</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Yuen</surname>
<given-names>KY</given-names>
</name>
</person-group>
<year>2004</year>
<article-title>Severe acute respiratory syndrome</article-title>
.
<source>Nat. Med</source>
.
<volume>10</volume>
:
<fpage>S88</fpage>
<lpage>S97</lpage>
<pub-id pub-id-type="pmid">15577937</pub-id>
</mixed-citation>
</ref>
<ref id="B14">
<label>14.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rota</surname>
<given-names>PA</given-names>
</name>
<etal></etal>
</person-group>
<year>2003</year>
<article-title>Characterization of a novel coronavirus associated with severe acute respiratory syndrome</article-title>
.
<source>Science</source>
<volume>300</volume>
:
<fpage>1394</fpage>
<lpage>1399</lpage>
<pub-id pub-id-type="pmid">12730500</pub-id>
</mixed-citation>
</ref>
<ref id="B15">
<label>15.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Fouchier</surname>
<given-names>RA</given-names>
</name>
<etal></etal>
</person-group>
<year>2004</year>
<article-title>A previously undescribed coronavirus associated with respiratory disease in humans</article-title>
.
<source>Proc. Natl. Acad. Sci. U. S. A</source>
.
<volume>101</volume>
:
<fpage>6212</fpage>
<lpage>6216</lpage>
<pub-id pub-id-type="pmid">15073334</pub-id>
</mixed-citation>
</ref>
<ref id="B16">
<label>16.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>van der Hoek</surname>
<given-names>L</given-names>
</name>
<etal></etal>
</person-group>
<year>2004</year>
<article-title>Identification of a new human coronavirus</article-title>
.
<source>Nat. Med</source>
.
<volume>10</volume>
:
<fpage>368</fpage>
<lpage>373</lpage>
<pub-id pub-id-type="pmid">15034574</pub-id>
</mixed-citation>
</ref>
<ref id="B17">
<label>17.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Woo</surname>
<given-names>PC</given-names>
</name>
<etal></etal>
</person-group>
<year>2005</year>
<article-title>Characterization and complete genome sequence of a novel coronavirus, coronavirus HKU1, from patients with pneumonia</article-title>
.
<source>J. Virol</source>
.
<volume>79</volume>
:
<fpage>884</fpage>
<lpage>895</lpage>
<pub-id pub-id-type="pmid">15613317</pub-id>
</mixed-citation>
</ref>
<ref id="B18">
<label>18.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zlateva</surname>
<given-names>KT</given-names>
</name>
<etal></etal>
</person-group>
<year>2012</year>
<article-title>No novel coronaviruses identified in a large collection of human nasopharyngeal specimens using family-wide CODEHOP-based primers</article-title>
.
<source>Arch. Virol</source>
.,
<comment>in press</comment>
</mixed-citation>
</ref>
<ref id="B19">
<label>19.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gorbalenya</surname>
<given-names>AE</given-names>
</name>
<name>
<surname>Enjuanes</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Ziebuhr</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Snijder</surname>
<given-names>EJ</given-names>
</name>
</person-group>
<year>2006</year>
<article-title>Nidovirales: evolving the largest RNA virus genome</article-title>
.
<source>Virus Res</source>
.
<volume>117</volume>
:
<fpage>17</fpage>
<lpage>37</lpage>
<pub-id pub-id-type="pmid">16503362</pub-id>
</mixed-citation>
</ref>
<ref id="B20">
<label>20.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Adams</surname>
<given-names>MJ</given-names>
</name>
<name>
<surname>Carstens</surname>
<given-names>EB</given-names>
</name>
</person-group>
<year>2012</year>
<article-title>Ratification vote on taxonomic proposals to the International Committee on Taxonomy of Viruses (2012)</article-title>
.
<source>Arch. Virol</source>
.
<volume>157</volume>
:
<fpage>1411</fpage>
<lpage>1422</lpage>
<pub-id pub-id-type="pmid">22481600</pub-id>
</mixed-citation>
</ref>
<ref id="B21">
<label>21.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lauber</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Gorbalenya</surname>
<given-names>AE</given-names>
</name>
</person-group>
<year>2012</year>
<article-title>Partitioning the genetic diversity of a virus family: approach and evaluation through a case study of picornaviruses</article-title>
.
<source>J. Virol</source>
.
<volume>86</volume>
:
<fpage>3890</fpage>
<lpage>3904</lpage>
<pub-id pub-id-type="pmid">22278230</pub-id>
</mixed-citation>
</ref>
<ref id="B22">
<label>22.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Masters</surname>
<given-names>PS</given-names>
</name>
</person-group>
<year>2006</year>
<article-title>The molecular biology of coronaviruses</article-title>
.
<source>Adv. Virus Res</source>
.
<volume>66</volume>
:
<fpage>193</fpage>
<lpage>292</lpage>
<pub-id pub-id-type="pmid">16877062</pub-id>
</mixed-citation>
</ref>
<ref id="B23">
<label>23.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Snijder</surname>
<given-names>EJ</given-names>
</name>
<etal></etal>
</person-group>
<year>2003</year>
<article-title>Unique and conserved features of genome and proteome of SARS-coronavirus, an early split-off from the coronavirus group 2 lineage</article-title>
.
<source>J. Mol. Biol</source>
.
<volume>331</volume>
:
<fpage>991</fpage>
<lpage>1004</lpage>
<pub-id pub-id-type="pmid">12927536</pub-id>
</mixed-citation>
</ref>
<ref id="B24">
<label>24.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zaki</surname>
<given-names>AM</given-names>
</name>
<etal></etal>
</person-group>
<year>2012</year>
<article-title>Isolation of a novel coronavirus from a man with pneumonia in Saudi Arabia</article-title>
.
<source>N Engl J. Med</source>
., in press. </mixed-citation>
</ref>
<ref id="B25">
<label>25.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bermingham</surname>
<given-names>A</given-names>
</name>
<etal></etal>
</person-group>
<year>2012</year>
<article-title>Severe respiratory illness caused by a novel coronavirus, in a patient transferred to the United Kingdom from the Middle East, September 2012</article-title>
.
<source>Euro Surveill</source>
.
<volume>17</volume>
:
<fpage>pii=10290</fpage>
<uri xlink:type="simple" xlink:href="http://www.eurosurveillance.org/viewarticle.aspx?articleid=20290">http://www.eurosurveillance.org/ViewArticle.aspx?ArticleId=20290</uri>
</mixed-citation>
</ref>
<ref id="B26">
<label>26.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ziebuhr</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Snijder</surname>
<given-names>EJ</given-names>
</name>
<name>
<surname>Gorbalenya</surname>
<given-names>AE</given-names>
</name>
</person-group>
<year>2000</year>
<article-title>Virus-encoded proteinases and proteolytic processing in the nidovirales</article-title>
.
<source>J. Gen. Virol</source>
.
<volume>81</volume>
:
<fpage>853</fpage>
<lpage>879</lpage>
<pub-id pub-id-type="pmid">10725411</pub-id>
</mixed-citation>
</ref>
<ref id="B27">
<label>27.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pasternak</surname>
<given-names>AO</given-names>
</name>
<name>
<surname>Spaan</surname>
<given-names>WJ</given-names>
</name>
<name>
<surname>Snijder</surname>
<given-names>EJ</given-names>
</name>
</person-group>
<year>2006</year>
<article-title>Nidovirus transcription: how to make sense…?</article-title>
<source>J. Gen. Virol</source>
.
<volume>87</volume>
:
<fpage>1403</fpage>
<lpage>1421</lpage>
<pub-id pub-id-type="pmid">16690906</pub-id>
</mixed-citation>
</ref>
<ref id="B28">
<label>28.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sawicki</surname>
<given-names>SG</given-names>
</name>
<name>
<surname>Sawicki</surname>
<given-names>DL</given-names>
</name>
<name>
<surname>Siddell</surname>
<given-names>SG</given-names>
</name>
</person-group>
<year>2007</year>
<article-title>A contemporary view of coronavirus transcription</article-title>
.
<source>J. Virol</source>
.
<volume>81</volume>
:
<fpage>20</fpage>
<lpage>29</lpage>
<pub-id pub-id-type="pmid">16928755</pub-id>
</mixed-citation>
</ref>
<ref id="B29">
<label>29.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sola</surname>
<given-names>I</given-names>
</name>
<name>
<surname>Mateos-Gomez</surname>
<given-names>PA</given-names>
</name>
<name>
<surname>Almazan</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Zuñiga</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Enjuanes</surname>
<given-names>L</given-names>
</name>
</person-group>
<year>2011</year>
<article-title>RNA-RNA and RNA-protein interactions in coronavirus replication and transcription</article-title>
.
<source>RNA Biol</source>
.
<volume>8</volume>
:
<fpage>237</fpage>
<lpage>248</lpage>
<pub-id pub-id-type="pmid">21378501</pub-id>
</mixed-citation>
</ref>
<ref id="B30">
<label>30.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Firth</surname>
<given-names>AE</given-names>
</name>
<name>
<surname>Brierley</surname>
<given-names>I</given-names>
</name>
</person-group>
<year>2012</year>
<article-title>Non-canonical translation in RNA viruses</article-title>
.
<source>J. Gen. Virol</source>
.
<volume>93</volume>
:
<fpage>1385</fpage>
<lpage>1409</lpage>
<pub-id pub-id-type="pmid">22535777</pub-id>
</mixed-citation>
</ref>
<ref id="B31">
<label>31.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Reusken</surname>
<given-names>CB</given-names>
</name>
<etal></etal>
</person-group>
<year>2010</year>
<article-title>Circulation of group 2 coronaviruses in a bat species common to urban areas in Western Europe</article-title>
.
<source>Vector Borne Zoonotic Dis</source>
.
<volume>10</volume>
:
<fpage>785</fpage>
<lpage>791</lpage>
<pub-id pub-id-type="pmid">20055576</pub-id>
</mixed-citation>
</ref>
<ref id="B32">
<label>32.</label>
<mixed-citation publication-type="book">
<person-group person-group-type="author">
<name>
<surname>Holmes</surname>
<given-names>KV</given-names>
</name>
<name>
<surname>Lai</surname>
<given-names>MC</given-names>
</name>
</person-group>
<year>1996</year>
<article-title>Coronaviridae: the viruses and their replication</article-title>
, p.
<fpage>1075</fpage>
<lpage>1093</lpage>
<italic>In</italic>
<person-group person-group-type="editor">
<name>
<surname>Fields</surname>
<given-names>BN</given-names>
</name>
<name>
<surname>Knipe</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Howley</surname>
<given-names>PM</given-names>
</name>
</person-group>
,
<source>Fields virology</source>
,
<edition>3rd</edition>
ed.
<publisher-name>Lippincott-Raven</publisher-name>
,
<publisher-loc>Philadelphia, PA.</publisher-loc>
</mixed-citation>
</ref>
<ref id="B33">
<label>33.</label>
<mixed-citation publication-type="book">
<person-group person-group-type="author">
<name>
<surname>McIntosh</surname>
<given-names>K</given-names>
</name>
</person-group>
<year>1996</year>
<article-title>Coronaviruses</article-title>
, p.
<fpage>1095</fpage>
<lpage>1103</lpage>
<italic>In</italic>
<person-group person-group-type="editor">
<name>
<surname>Fields</surname>
<given-names>BN</given-names>
</name>
<name>
<surname>Knipe</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Howley</surname>
<given-names>PM</given-names>
</name>
</person-group>
,
<source>Fields virology</source>
,
<edition>3rd</edition>
ed.
<publisher-name>Lippincott-Raven</publisher-name>
,
<publisher-loc>Philadelphia, PA.</publisher-loc>
</mixed-citation>
</ref>
<ref id="B34">
<label>34.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Keng</surname>
<given-names>CT</given-names>
</name>
<etal></etal>
</person-group>
<year>2011</year>
<article-title>SARS coronavirus 8b reduces viral replication by down-regulating E via an ubiquitin-independent proteasome pathway</article-title>
.
<source>Microbes Infect</source>
.
<volume>13</volume>
:
<fpage>179</fpage>
<lpage>188</lpage>
<pub-id pub-id-type="pmid">21035562</pub-id>
</mixed-citation>
</ref>
<ref id="B35">
<label>35.</label>
<mixed-citation publication-type="book">
<person-group person-group-type="author">
<name>
<surname>Narayanan</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Huang</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Makino</surname>
<given-names>S</given-names>
</name>
</person-group>
<year>2008</year>
<article-title>Coronavirus accessory proteins</article-title>
, p.
<fpage>235</fpage>
<lpage>244</lpage>
<italic>In</italic>
<person-group person-group-type="editor">
<name>
<surname>Perlman</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Gallagher</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Snijder</surname>
<given-names>EJ</given-names>
</name>
</person-group>
,
<source>Nidoviruses</source>
.
<publisher-name>ASM Press</publisher-name>
,
<publisher-loc>Washington, DC</publisher-loc>
</mixed-citation>
</ref>
<ref id="B36">
<label>36.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Senanayake</surname>
<given-names>SD</given-names>
</name>
<name>
<surname>Brian</surname>
<given-names>DA</given-names>
</name>
</person-group>
<year>1997</year>
<article-title>Bovine coronavirus I protein synthesis follows ribosomal scanning on the bicistronic N mRNA</article-title>
.
<source>Virus Res</source>
.
<volume>48</volume>
:
<fpage>101</fpage>
<lpage>105</lpage>
<pub-id pub-id-type="pmid">9140198</pub-id>
</mixed-citation>
</ref>
<ref id="B37">
<label>37.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mazumder</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Iyer</surname>
<given-names>LM</given-names>
</name>
<name>
<surname>Vasudevan</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Aravind</surname>
<given-names>L</given-names>
</name>
</person-group>
<year>2002</year>
<article-title>Detection of novel members, structure-function analysis and evolutionary classification of the 2H phosphoesterase superfamily</article-title>
.
<source>Nucleic Acids Res</source>
.
<volume>30</volume>
:
<fpage>5229</fpage>
<lpage>5243</lpage>
<pub-id pub-id-type="pmid">12466548</pub-id>
</mixed-citation>
</ref>
<ref id="B38">
<label>38.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Snijder</surname>
<given-names>EJ</given-names>
</name>
<name>
<surname>den Boon</surname>
<given-names>JA</given-names>
</name>
<name>
<surname>Horzinek</surname>
<given-names>MC</given-names>
</name>
<name>
<surname>Spaan</surname>
<given-names>WJ</given-names>
</name>
</person-group>
<year>1991</year>
<article-title>Comparison of the genome organization of toro- and coronaviruses: evidence for two nonhomologous RNA recombination events during Berne virus evolution</article-title>
.
<source>Virology</source>
<volume>180</volume>
:
<fpage>448</fpage>
<lpage>452</lpage>
<pub-id pub-id-type="pmid">1984666</pub-id>
</mixed-citation>
</ref>
<ref id="B39">
<label>39.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yount</surname>
<given-names>B</given-names>
</name>
<etal></etal>
</person-group>
<year>2005</year>
<article-title>Severe acute respiratory syndrome coronavirus group-specific open reading frames encode nonessential functions for replication in cell cultures and mice</article-title>
.
<source>J. Virol</source>
.
<volume>79</volume>
:
<fpage>14909</fpage>
<lpage>14922</lpage>
<pub-id pub-id-type="pmid">16282490</pub-id>
</mixed-citation>
</ref>
<ref id="B40">
<label>40.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cruz</surname>
<given-names>JL</given-names>
</name>
<etal></etal>
</person-group>
<year>2011</year>
<article-title>Coronavirus gene 7 counteracts host defenses and modulates virus virulence</article-title>
.
<source>PLoS Pathog</source>
.
<volume>7</volume>
:
<fpage>e1002090</fpage>
<pub-id pub-id-type="pmid">21695242</pub-id>
</mixed-citation>
</ref>
<ref id="B41">
<label>41.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>de Haan</surname>
<given-names>CA</given-names>
</name>
<name>
<surname>Masters</surname>
<given-names>PS</given-names>
</name>
<name>
<surname>Shen</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Weiss</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Rottier</surname>
<given-names>PJ</given-names>
</name>
</person-group>
<year>2002</year>
<article-title>The group-specific murine coronavirus genes are not essential, but their deletion, by reverse genetics, is attenuating in the natural host</article-title>
.
<source>Virology</source>
<volume>296</volume>
:
<fpage>177</fpage>
<lpage>189</lpage>
<pub-id pub-id-type="pmid">12036329</pub-id>
</mixed-citation>
</ref>
<ref id="B42">
<label>42.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pewe</surname>
<given-names>L</given-names>
</name>
<etal></etal>
</person-group>
<year>2005</year>
<article-title>A severe acute respiratory syndrome-associated coronavirus-specific protein enhances virulence of an attenuated murine coronavirus</article-title>
.
<source>J. Virol</source>
.
<volume>79</volume>
:
<fpage>11335</fpage>
<lpage>11342</lpage>
<pub-id pub-id-type="pmid">16103185</pub-id>
</mixed-citation>
</ref>
<ref id="B43">
<label>43.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Frieman</surname>
<given-names>M</given-names>
</name>
<etal></etal>
</person-group>
<year>2007</year>
<article-title>Severe acute respiratory syndrome coronavirus ORF6 antagonizes STAT1 function by sequestering nuclear import factors on the rough endoplasmic reticulum/Golgi membrane</article-title>
.
<source>J. Virol</source>
.
<volume>81</volume>
:
<fpage>9812</fpage>
<lpage>9824</lpage>
<pub-id pub-id-type="pmid">17596301</pub-id>
</mixed-citation>
</ref>
<ref id="B44">
<label>44.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Oostra</surname>
<given-names>M</given-names>
</name>
<name>
<surname>de Haan</surname>
<given-names>CA</given-names>
</name>
<name>
<surname>Rottier</surname>
<given-names>PJ</given-names>
</name>
</person-group>
<year>2007</year>
<article-title>The 29-nucleotide deletion present in human but not in animal severe acute respiratory syndrome coronaviruses disrupts the functional expression of open reading frame 8</article-title>
.
<source>J. Virol</source>
.
<volume>81</volume>
:
<fpage>13876</fpage>
<lpage>13888</lpage>
<pub-id pub-id-type="pmid">17928347</pub-id>
</mixed-citation>
</ref>
<ref id="B45">
<label>45.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cornman</surname>
<given-names>VM</given-names>
</name>
<etal></etal>
</person-group>
<year>2012</year>
<article-title>Detection of a novel human coronavirus by real-time reverse-transcription polymerase chain reaction</article-title>
.
<source>Euro Surveill</source>
.
<volume>17</volume>
:
<fpage>pii=20285</fpage>
<uri xlink:type="simple" xlink:href="http://www.eurosurveillance.org/viewarticle.aspx?articleid=20285">http://www.eurosurveillance.org/ViewArticle.aspx?ArticleId=20285</uri>
</mixed-citation>
</ref>
<ref id="B46">
<label>46.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Welsh</surname>
<given-names>J</given-names>
</name>
<name>
<surname>McClelland</surname>
<given-names>M</given-names>
</name>
</person-group>
<year>1990</year>
<article-title>Fingerprinting genomes using PCR with arbitrary primers</article-title>
.
<source>Nucleic Acids Res</source>
.
<volume>18</volume>
:
<fpage>7213</fpage>
<lpage>7218</lpage>
<pub-id pub-id-type="pmid">2259619</pub-id>
</mixed-citation>
</ref>
<ref id="B47">
<label>47.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hall</surname>
<given-names>TA</given-names>
</name>
</person-group>
<year>1999</year>
<article-title>BioEdit: a user-friendly biological sequence alignment Editor and analysis program for Windows 95/98/NT</article-title>
.
<source>Nucleic Acids Symp. Ser</source>
.
<volume>41</volume>
:
<fpage>95</fpage>
<lpage>98</lpage>
</mixed-citation>
</ref>
<ref id="B48">
<label>48.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gorbalenya</surname>
<given-names>AE</given-names>
</name>
<etal></etal>
</person-group>
<year>2010</year>
<article-title>Practical application of bioinformatics by the multidisciplinary VIZIER consortium</article-title>
.
<source>Antiviral. Res</source>
.
<volume>87</volume>
:
<fpage>95</fpage>
<lpage>110</lpage>
<pub-id pub-id-type="pmid">20153379</pub-id>
</mixed-citation>
</ref>
<ref id="B49">
<label>49.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Castresana</surname>
<given-names>J</given-names>
</name>
</person-group>
<year>2000</year>
<article-title>Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis</article-title>
.
<source>Mol. Biol. Evol</source>
.
<volume>17</volume>
:
<fpage>540</fpage>
<lpage>552</lpage>
<pub-id pub-id-type="pmid">10742046</pub-id>
</mixed-citation>
</ref>
<ref id="B50">
<label>50.</label>
<mixed-citation publication-type="webpage">
<person-group person-group-type="author">
<name>
<surname>Antonov</surname>
<given-names>IV</given-names>
</name>
<name>
<surname>Leontovich</surname>
<given-names>AM</given-names>
</name>
<name>
<surname>Gorbalenya</surname>
<given-names>AE</given-names>
</name>
</person-group>
<year>2008</year>
<source>BAGG—Blocks accepting gaps generator</source>
,
<comment>version 1.0</comment>
<uri xlink:type="simple" xlink:href="http://www.genebee.msu.su/~antonov/bagg/cgi/bagg.cgi">http://www.genebee.msu.su/~antonov/bagg/cgi/bagg.cgi</uri>
</mixed-citation>
</ref>
<ref id="B51">
<label>51.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Darriba</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Taboada</surname>
<given-names>GL</given-names>
</name>
<name>
<surname>Doallo</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Posada</surname>
<given-names>D</given-names>
</name>
</person-group>
<year>2011</year>
<article-title>ProtTest 3: fast selection of best-fit models of protein evolution</article-title>
.
<source>Bioinformatics</source>
<volume>27</volume>
:
<fpage>1164</fpage>
<lpage>1165</lpage>
<pub-id pub-id-type="pmid">21335321</pub-id>
</mixed-citation>
</ref>
<ref id="B52">
<label>52.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Guindon</surname>
<given-names>S</given-names>
</name>
<etal></etal>
</person-group>
<year>2010</year>
<article-title>New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0</article-title>
.
<source>Syst. Biol</source>
.
<volume>59</volume>
:
<fpage>307</fpage>
<lpage>321</lpage>
<pub-id pub-id-type="pmid">20525638</pub-id>
</mixed-citation>
</ref>
<ref id="B53">
<label>53.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Katoh</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Misawa</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Kuma</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Miyata</surname>
<given-names>T</given-names>
</name>
</person-group>
<year>2002</year>
<article-title>MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform</article-title>
.
<source>Nucleic Acids Res</source>
.
<volume>30</volume>
:
<fpage>3059</fpage>
<lpage>3066</lpage>
<pub-id pub-id-type="pmid">12136088</pub-id>
</mixed-citation>
</ref>
<ref id="B54">
<label>54.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Posada</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Crandall</surname>
<given-names>KA</given-names>
</name>
</person-group>
<year>1998</year>
<article-title>ModelTest: testing the model of DNA substitution</article-title>
.
<source>Bioinformatics</source>
<volume>14</volume>
:
<fpage>817</fpage>
<lpage>818</lpage>
<pub-id pub-id-type="pmid">9918953</pub-id>
</mixed-citation>
</ref>
</ref-list>
</back>
</pmc>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Sante/explor/SrasV1/Data/Pmc/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001631 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Pmc/Corpus/biblio.hfd -nk 001631 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Sante
   |area=    SrasV1
   |flux=    Pmc
   |étape=   Corpus
   |type=    RBID
   |clé=     PMC:3509437
   |texte=   Genomic Characterization of a Newly Discovered Coronavirus Associated with Acute Respiratory Distress Syndrome in Humans
}}

Pour générer des pages wiki

HfdIndexSelect -h $EXPLOR_AREA/Data/Pmc/Corpus/RBID.i   -Sk "pubmed:23170002" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Pmc/Corpus/biblio.hfd   \
       | NlmPubMed2Wicri -a SrasV1 

Wicri

This area was generated with Dilib version V0.6.33.
Data generation: Tue Apr 28 14:49:16 2020. Site generation: Sat Mar 27 22:06:49 2021