MersV1, Pmc, Corpus, bibRecord, 001198

***** Acces problem to record *****\

Identifieur interne : 001198 ( Pmc/Corpus ); précédent : 0011979; suivant : 0011990 ***** probable Xml problem with record *****

Links to Exploration step

Le document en format XML

<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en">On the biased nucleotide composition of the human coronavirus RNA genome</title>
<author><name sortKey="Berkhout, Ben" sort="Berkhout, Ben" uniqKey="Berkhout B" first="Ben" last="Berkhout">Ben Berkhout</name>
</author>
<author><name sortKey="Van Hemert, Formijn" sort="Van Hemert, Formijn" uniqKey="Van Hemert F" first="Formijn" last="Van Hemert">Formijn Van Hemert</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">PMC</idno>
<idno type="pmid">25656063</idno>
<idno type="pmc">7114406</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7114406</idno>
<idno type="RBID">PMC:7114406</idno>
<idno type="doi">10.1016/j.virusres.2014.11.031</idno>
<date when="2015">2015</date>
<idno type="wicri:Area/Pmc/Corpus">001198</idno>
<idno type="wicri:explorRef" wicri:stream="Pmc" wicri:step="Corpus" wicri:corpus="PMC">001198</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a" type="main">On the biased nucleotide composition of the human coronavirus RNA genome</title>
<author><name sortKey="Berkhout, Ben" sort="Berkhout, Ben" uniqKey="Berkhout B" first="Ben" last="Berkhout">Ben Berkhout</name>
</author>
<author><name sortKey="Van Hemert, Formijn" sort="Van Hemert, Formijn" uniqKey="Van Hemert F" first="Formijn" last="Van Hemert">Formijn Van Hemert</name>
</author>
</analytic>
<series><title level="j">Virus Research</title>
<idno type="ISSN">0168-1702</idno>
<idno type="eISSN">1872-7492</idno>
<imprint><date when="2015">2015</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass></textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en"><title>Highlights</title>
<p><list list-type="simple" id="lis0005"><list-item id="lsti0005"><label>•</label>
<p id="par0005">The nucleotide composition of a coronaviral RNA genome is biased (high U, low C).</p>
</list-item>
<list-item id="lsti0010"><label>•</label>
<p id="par0010">This bias is a relatively stable property along the viral genome, but less prominent in the last 1/3 of the genome.</p>
</list-item>
<list-item id="lsti0015"><label>•</label>
<p id="par0015">This bias is even more pronounced in the single-stranded, unpaired RNA domains.</p>
</list-item>
<list-item id="lsti0020"><label>•</label>
<p id="par0020">The bias dictates the atypical codon usage of the coronaviruses.</p>
</list-item>
<list-item id="lsti0025"><label>•</label>
<p id="par0025">The RNA genome of the zoonotic viruses MERS and SARS is extremely biased.</p>
</list-item>
</list>
</p>
</div>
</front>
<back><div1 type="bibliography"><listBibl><biblStruct><analytic><author><name sortKey="Bennetzen, J L" uniqKey="Bennetzen J">J.L. Bennetzen</name>
</author>
<author><name sortKey="Hall, B D" uniqKey="Hall B">B.D. Hall</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Berkhout, B" uniqKey="Berkhout B">B. Berkhout</name>
</author>
<author><name sortKey="Van Hemert, F J" uniqKey="Van Hemert F">F.J. van Hemert</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Chen, Y" uniqKey="Chen Y">Y. Chen</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Fouchier, R A" uniqKey="Fouchier R">R.A. Fouchier</name>
</author>
<author><name sortKey="Kuiken, T" uniqKey="Kuiken T">T. Kuiken</name>
</author>
<author><name sortKey="Schutten, M" uniqKey="Schutten M">M. Schutten</name>
</author>
<author><name sortKey="Van Amerongen, G" uniqKey="Van Amerongen G">G. Van Amerongen</name>
</author>
<author><name sortKey="Van Doornum, G J" uniqKey="Van Doornum G">G.J. van Doornum</name>
</author>
<author><name sortKey="Van Den Hoogen, B G" uniqKey="Van Den Hoogen B">B.G. van den Hoogen</name>
</author>
<author><name sortKey="Peiris, M" uniqKey="Peiris M">M. Peiris</name>
</author>
<author><name sortKey="Lim, W" uniqKey="Lim W">W. Lim</name>
</author>
<author><name sortKey="Stohr, K" uniqKey="Stohr K">K. Stohr</name>
</author>
<author><name sortKey="Osterhaus, A D" uniqKey="Osterhaus A">A.D. Osterhaus</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Grigoriev, A" uniqKey="Grigoriev A">A. Grigoriev</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Grigoriev, A" uniqKey="Grigoriev A">A. Grigoriev</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Gu, W" uniqKey="Gu W">W. Gu</name>
</author>
<author><name sortKey="Zhou, T" uniqKey="Zhou T">T. Zhou</name>
</author>
<author><name sortKey="Ma, J" uniqKey="Ma J">J. Ma</name>
</author>
<author><name sortKey="Sun, X" uniqKey="Sun X">X. Sun</name>
</author>
<author><name sortKey="Lu, Z" uniqKey="Lu Z">Z. Lu</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Haagmans, B L" uniqKey="Haagmans B">B.L. Haagmans</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Jenkins, G M" uniqKey="Jenkins G">G.M. Jenkins</name>
</author>
<author><name sortKey="Holmes, E C" uniqKey="Holmes E">E.C. Holmes</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Kerr, P J" uniqKey="Kerr P">P.J. Kerr</name>
</author>
<author><name sortKey="Ghedin, E" uniqKey="Ghedin E">E. Ghedin</name>
</author>
<author><name sortKey="Depasse, J V" uniqKey="Depasse J">J.V. DePasse</name>
</author>
<author><name sortKey="Fitch, A" uniqKey="Fitch A">A. Fitch</name>
</author>
<author><name sortKey="Cattadori, I M" uniqKey="Cattadori I">I.M. Cattadori</name>
</author>
<author><name sortKey="Hudson, P J" uniqKey="Hudson P">P.J. Hudson</name>
</author>
<author><name sortKey="Tscharke, D C" uniqKey="Tscharke D">D.C. Tscharke</name>
</author>
<author><name sortKey="Read, A F" uniqKey="Read A">A.F. Read</name>
</author>
<author><name sortKey="Holmes, E C" uniqKey="Holmes E">E.C. Holmes</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Kindler, E" uniqKey="Kindler E">E. Kindler</name>
</author>
<author><name sortKey="Thiel, V" uniqKey="Thiel V">V. Thiel</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Marra, M A" uniqKey="Marra M">M.A. Marra</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Nakamura, Y" uniqKey="Nakamura Y">Y. Nakamura</name>
</author>
<author><name sortKey="Gojobori, T" uniqKey="Gojobori T">T. Gojobori</name>
</author>
<author><name sortKey="Ikemura, T" uniqKey="Ikemura T">T. Ikemura</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Pyrc, K" uniqKey="Pyrc K">K. Pyrc</name>
</author>
<author><name sortKey="Berkhout, B" uniqKey="Berkhout B">B. Berkhout</name>
</author>
<author><name sortKey="Van Der Hoek, L" uniqKey="Van Der Hoek L">L. van der Hoek</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Pyrc, K" uniqKey="Pyrc K">K. Pyrc</name>
</author>
<author><name sortKey="Jebbink, M F" uniqKey="Jebbink M">M.F. Jebbink</name>
</author>
<author><name sortKey="Berkhout, B" uniqKey="Berkhout B">B. Berkhout</name>
</author>
<author><name sortKey="Van Der Hoek, L" uniqKey="Van Der Hoek L">L. van der Hoek</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Sharp, P M" uniqKey="Sharp P">P.M. Sharp</name>
</author>
<author><name sortKey="Li, W H" uniqKey="Li W">W.H. Li</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Shi, S L" uniqKey="Shi S">S.L. Shi</name>
</author>
<author><name sortKey="Jiang, Y R" uniqKey="Jiang Y">Y.R. Jiang</name>
</author>
<author><name sortKey="Liu, Y Q" uniqKey="Liu Y">Y.Q. Liu</name>
</author>
<author><name sortKey="Xia, R X" uniqKey="Xia R">R.X. Xia</name>
</author>
<author><name sortKey="Qin, L" uniqKey="Qin L">L. Qin</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Tamura, K" uniqKey="Tamura K">K. Tamura</name>
</author>
<author><name sortKey="Peterson, D" uniqKey="Peterson D">D. Peterson</name>
</author>
<author><name sortKey="Peterson, N" uniqKey="Peterson N">N. Peterson</name>
</author>
<author><name sortKey="Stecher, G" uniqKey="Stecher G">G. Stecher</name>
</author>
<author><name sortKey="Nei, M" uniqKey="Nei M">M. Nei</name>
</author>
<author><name sortKey="Kumar, S" uniqKey="Kumar S">S. Kumar</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Vabret, N" uniqKey="Vabret N">N. Vabret</name>
</author>
<author><name sortKey="Bailly Bechet, M" uniqKey="Bailly Bechet M">M. Bailly-Bechet</name>
</author>
<author><name sortKey="Najburg, V" uniqKey="Najburg V">V. Najburg</name>
</author>
<author><name sortKey="Muller Trutwin, M" uniqKey="Muller Trutwin M">M. Muller-Trutwin</name>
</author>
<author><name sortKey="Verrier, B" uniqKey="Verrier B">B. Verrier</name>
</author>
<author><name sortKey="Tangy, F" uniqKey="Tangy F">F. Tangy</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Van Boheemen, S" uniqKey="Van Boheemen S">S. van Boheemen</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Van Der Hoek, L" uniqKey="Van Der Hoek L">L. van der Hoek</name>
</author>
<author><name sortKey="Pyrc, K" uniqKey="Pyrc K">K. Pyrc</name>
</author>
<author><name sortKey="Jebbink, M F" uniqKey="Jebbink M">M.F. Jebbink</name>
</author>
<author><name sortKey="Vermeulen Oost, W" uniqKey="Vermeulen Oost W">W. Vermeulen-Oost</name>
</author>
<author><name sortKey="Berkhout, R J" uniqKey="Berkhout R">R.J. Berkhout</name>
</author>
<author><name sortKey="Wolthers, K C" uniqKey="Wolthers K">K.C. Wolthers</name>
</author>
<author><name sortKey="Wertheim Van Dillen, P M" uniqKey="Wertheim Van Dillen P">P.M. Wertheim-van Dillen</name>
</author>
<author><name sortKey="Kaandorp, J" uniqKey="Kaandorp J">J. Kaandorp</name>
</author>
<author><name sortKey="Spaargaren, J" uniqKey="Spaargaren J">J. Spaargaren</name>
</author>
<author><name sortKey="Berkhout, B" uniqKey="Berkhout B">B. Berkhout</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Van Der Kuyl, A C" uniqKey="Van Der Kuyl A">A.C. van der Kuyl</name>
</author>
<author><name sortKey="Berkhout, B" uniqKey="Berkhout B">B. Berkhout</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Van Hemert, F" uniqKey="Van Hemert F">F. Van Hemert</name>
</author>
<author><name sortKey="Van Der Kuyl, A C" uniqKey="Van Der Kuyl A">A.C. van der Kuyl</name>
</author>
<author><name sortKey="Berkhout, B" uniqKey="Berkhout B">B. Berkhout</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Van Hemert, F J" uniqKey="Van Hemert F">F.J. van Hemert</name>
</author>
<author><name sortKey="Berkhout, B" uniqKey="Berkhout B">B. Berkhout</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Van Hemert, F J" uniqKey="Van Hemert F">F.J. van Hemert</name>
</author>
<author><name sortKey="Van Der Kuyl, A C" uniqKey="Van Der Kuyl A">A.C. van der Kuyl</name>
</author>
<author><name sortKey="Berkhout, B" uniqKey="Berkhout B">B. Berkhout</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Wang, M" uniqKey="Wang M">M. Wang</name>
</author>
<author><name sortKey="Zhang, J" uniqKey="Zhang J">J. Zhang</name>
</author>
<author><name sortKey="Zhou, J H" uniqKey="Zhou J">J.H. Zhou</name>
</author>
<author><name sortKey="Chen, H T" uniqKey="Chen H">H.T. Chen</name>
</author>
<author><name sortKey="Ma, L N" uniqKey="Ma L">L.N. Ma</name>
</author>
<author><name sortKey="Ding, Y Z" uniqKey="Ding Y">Y.Z. Ding</name>
</author>
<author><name sortKey="Liu, W Q" uniqKey="Liu W">W.Q. Liu</name>
</author>
<author><name sortKey="Liu, Y S" uniqKey="Liu Y">Y.S. Liu</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Watts, J M" uniqKey="Watts J">J.M. Watts</name>
</author>
<author><name sortKey="Dang, K K" uniqKey="Dang K">K.K. Dang</name>
</author>
<author><name sortKey="Gorelick, R J" uniqKey="Gorelick R">R.J. Gorelick</name>
</author>
<author><name sortKey="Leonard, C W" uniqKey="Leonard C">C.W. Leonard</name>
</author>
<author><name sortKey="Bess, J W" uniqKey="Bess J">J.W. Bess</name>
</author>
<author><name sortKey="Swanstrom, R" uniqKey="Swanstrom R">R. Swanstrom</name>
</author>
<author><name sortKey="Burch, C L" uniqKey="Burch C">C.L. Burch</name>
</author>
<author><name sortKey="Weeks, K M" uniqKey="Weeks K">K.M. Weeks</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Wong, E H" uniqKey="Wong E">E.H. Wong</name>
</author>
<author><name sortKey="Smith, D K" uniqKey="Smith D">D.K. Smith</name>
</author>
<author><name sortKey="Rabadan, R" uniqKey="Rabadan R">R. Rabadan</name>
</author>
<author><name sortKey="Peiris, M" uniqKey="Peiris M">M. Peiris</name>
</author>
<author><name sortKey="Poon, L L" uniqKey="Poon L">L.L. Poon</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Woo, P C" uniqKey="Woo P">P.C. Woo</name>
</author>
<author><name sortKey="Huang, Y" uniqKey="Huang Y">Y. Huang</name>
</author>
<author><name sortKey="Lau, S K" uniqKey="Lau S">S.K. Lau</name>
</author>
<author><name sortKey="Yuen, K Y" uniqKey="Yuen K">K.Y. Yuen</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Woo, P C" uniqKey="Woo P">P.C. Woo</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Woo, P C" uniqKey="Woo P">P.C. Woo</name>
</author>
<author><name sortKey="Wong, B H" uniqKey="Wong B">B.H. Wong</name>
</author>
<author><name sortKey="Huang, Y" uniqKey="Huang Y">Y. Huang</name>
</author>
<author><name sortKey="Lau, S K" uniqKey="Lau S">S.K. Lau</name>
</author>
<author><name sortKey="Yuen, K Y" uniqKey="Yuen K">K.Y. Yuen</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Wright, F" uniqKey="Wright F">F. Wright</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Zaki, A M" uniqKey="Zaki A">A.M. Zaki</name>
</author>
<author><name sortKey="Van, B S" uniqKey="Van B">B.S. van</name>
</author>
<author><name sortKey="Bestebroer, T M" uniqKey="Bestebroer T">T.M. Bestebroer</name>
</author>
<author><name sortKey="Osterhaus, A D" uniqKey="Osterhaus A">A.D. Osterhaus</name>
</author>
<author><name sortKey="Fouchier, R A" uniqKey="Fouchier R">R.A. Fouchier</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Zhang, Z" uniqKey="Zhang Z">Z. Zhang</name>
</author>
<author><name sortKey="Dai, W" uniqKey="Dai W">W. Dai</name>
</author>
<author><name sortKey="Dai, D" uniqKey="Dai D">D. Dai</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Zuker, M" uniqKey="Zuker M">M. Zuker</name>
</author>
<author><name sortKey="Turner, D H" uniqKey="Turner D">D.H. Turner</name>
</author>
</analytic>
</biblStruct>
</listBibl>
</div1>
</back>
</TEI>
<pmc article-type="research-article"><pmc-dir>properties open_access</pmc-dir>
  <front><journal-meta><journal-id journal-id-type="nlm-ta">Virus Res</journal-id>
<journal-id journal-id-type="iso-abbrev">Virus Res</journal-id>
<journal-title-group><journal-title>Virus Research</journal-title>
</journal-title-group>
<issn pub-type="ppub">0168-1702</issn>
<issn pub-type="epub">1872-7492</issn>
<publisher><publisher-name>Elsevier B.V.</publisher-name>
</publisher>
</journal-meta>
<article-meta><article-id pub-id-type="pmid">25656063</article-id>
<article-id pub-id-type="pmc">7114406</article-id>
<article-id pub-id-type="publisher-id">S0168-1702(15)00044-1</article-id>
<article-id pub-id-type="doi">10.1016/j.virusres.2014.11.031</article-id>
<article-categories><subj-group subj-group-type="heading"><subject>Article</subject>
</subj-group>
</article-categories>
<title-group><article-title>On the biased nucleotide composition of the human coronavirus RNA genome</article-title>
</title-group>
<contrib-group><contrib contrib-type="author" id="aut0005"><name><surname>Berkhout</surname>
<given-names>Ben</given-names>
</name>
<email>b.berkhout@amc.uva.nl</email>
<xref rid="cor0005" ref-type="corresp">⁎</xref>
</contrib>
<contrib contrib-type="author" id="aut0010"><name><surname>van Hemert</surname>
<given-names>Formijn</given-names>
</name>
</contrib>
</contrib-group>
<aff id="aff0005">Laboratory of Experimental Virology, Department of Medical Microbiology, Center for Infection and Immunity Amsterdam (CINIMA), Academic Medical Center, University of Amsterdam, The Netherlands</aff>
<author-notes><corresp id="cor0005"><label>⁎</label>
Corresponding author. Tel.: +31 205664822. <email>b.berkhout@amc.uva.nl</email>
</corresp>
</author-notes>
<pub-date pub-type="pmc-release"><day>2</day>
<month>2</month>
<year>2015</year>
</pub-date>
<pmc-comment> PMC Release delay is 0 months and 0 days and was based on .</pmc-comment>
      <pub-date pub-type="ppub"><day>16</day>
<month>4</month>
<year>2015</year>
</pub-date>
<pub-date pub-type="epub"><day>2</day>
<month>2</month>
<year>2015</year>
</pub-date>
<volume>202</volume>
<fpage>41</fpage>
<lpage>47</lpage>
<permissions><copyright-statement>Copyright © 2015 Elsevier B.V. All rights reserved.</copyright-statement>
<copyright-year>2015</copyright-year>
<copyright-holder>Elsevier B.V.</copyright-holder>
<license><license-p>Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.</license-p>
</license>
</permissions>
<abstract abstract-type="author-highlights" id="abs0005"><title>Highlights</title>
<p><list list-type="simple" id="lis0005"><list-item id="lsti0005"><label>•</label>
<p id="par0005">The nucleotide composition of a coronaviral RNA genome is biased (high U, low C).</p>
</list-item>
<list-item id="lsti0010"><label>•</label>
<p id="par0010">This bias is a relatively stable property along the viral genome, but less prominent in the last 1/3 of the genome.</p>
</list-item>
<list-item id="lsti0015"><label>•</label>
<p id="par0015">This bias is even more pronounced in the single-stranded, unpaired RNA domains.</p>
</list-item>
<list-item id="lsti0020"><label>•</label>
<p id="par0020">The bias dictates the atypical codon usage of the coronaviruses.</p>
</list-item>
<list-item id="lsti0025"><label>•</label>
<p id="par0025">The RNA genome of the zoonotic viruses MERS and SARS is extremely biased.</p>
</list-item>
</list>
</p>
</abstract>
<abstract id="abs0010"><p>We investigated the nucleotide composition of the RNA genome of the six human coronaviruses. Some general coronavirus characteristics were apparent (e.g. high U, low C count), but we also detected species-specific signatures. Most strikingly, the high U and low C proportions are quite variable and act like communicating vessels, C goes down when U goes up and vice versa. U ranges among virus isolates from 30.7% to 40.3%, and C makes the opposite movement from 20.0% to 12.9%, respectively. The nucleotide biases are more pronounced in the unpaired regions of the structured RNA genome, which may suggest a certain biological function for these distinctive sequence signatures. Coronaviruses have an atypical codon usage that has been linked to mutational events operating on the viral RNA genome on an evolutionary time scale. We suggest that the atypical nucleotide bias may serve a distinct biological function and that it is the direct cause of the characteristic codon usage in these viruses. The relevance for evolution of the novel human pathogens MERS and SARS is discussed.</p>
</abstract>
<kwd-group id="kwd0005"><title>Keywords</title>
<kwd>Coronavirus</kwd>
<kwd>RNA genome</kwd>
<kwd>Nucleotide signature</kwd>
<kwd>MERS</kwd>
<kwd>SARS</kwd>
</kwd-group>
</article-meta>
</front>
<body><sec id="sec0005"><label>1</label>
<title>Introduction</title>
<p id="par0030">Coronaviruses are positive sense, single-stranded RNA viruses that infect a wide range of animals. The coronaviridae can cause a spectrum of diseases, ranging from respiratory, enteric, hepatic and neurological diseases of varying severity. Two human coronaviruses that cause relatively mild respiratory symptoms are known since the 1960s: HCoV-229E and HCoV-OC43. SARS-CoV was identified in 2003 and causes a more severe respiratory syndrome (<xref rid="bib0020" ref-type="bibr">Fouchier et al., 2003</xref>
). The fourth member of the coronaviridae family that was identified in 2004 to cause respiratory symptoms in humans is HCoV-NL63 (<xref rid="bib0110" ref-type="bibr">van der Hoek et al., 2004</xref>
). The fifth member HCoV-HKU was described the next year (<xref rid="bib0155" ref-type="bibr">Woo et al., 2005</xref>
). More recently, the pathogenic MERS coronavirus was identified in the Middle East as the sixth human coronavirus (<xref rid="bib0170" ref-type="bibr">Zaki et al., 2012</xref>
). Both SARS and MERS represent recent zoonotic transfers into the human population. The full-length sequences of the RNA genomes of all these human coronaviruses have been analyzed to some extent (<xref rid="bib0105" ref-type="bibr">van Boheemen et al., 2012</xref>
, <xref rid="bib0160" ref-type="bibr">Woo et al., 2007</xref>
, <xref rid="bib0080" ref-type="bibr">Pyrc et al., 2004</xref>
, <xref rid="bib0075" ref-type="bibr">Pyrc et al., 2007</xref>
, <xref rid="bib0065" ref-type="bibr">Marra et al., 2003</xref>
). The coronavirus RNA genome is around 30 kb and thereby the largest among the profoundly diverse group of RNA viruses. In global terms, a similar genome organization is apparent for all coronaviruses and a similar set of proteins is encoded. Besides encoding for viral proteins, the viral RNA genome usually contains multiple molecular signals, either RNA sequence elements or secondary and higher order RNA structures that interact with viral or cellular components – proteins or RNAs – to facilitate certain steps of the viral replication cycle. This is also true for the coronaviridae, which for instance encode the Transcription Regulation Sequence (TRS) that is involved in the induction of a discontinuous transcription mechanism to generate subgenomic mRNAs that encode the different viral proteins. In this study, we want to focus on a more basic property of the coronavirus RNA genomes: their biased nucleotide composition.</p>
<p id="par0035">There have been previous reports on the biased nucleotide composition of coronavirus RNA genomes. Grigoriev performed a cumulative skew analysis to analyze mutational patterns and reported an excess of G compared with C, suggestive of excessive C-to-U deamination among the coronaviruses, but significantly less for SARS (<xref rid="bib0030" ref-type="bibr">Grigoriev, 2004</xref>
). A subsequent analysis based on the NL63 genome indicated a very low C count and a high U count (<xref rid="bib0080" ref-type="bibr">Pyrc et al., 2004</xref>
). Both studies reported a clear difference in the magnitude of the nucleotide bias between the first two-third and last one-third of the coronavirus genome, which likely relates to the mechanism of subgenomic mRNA synthesis and exposure of single-stranded RNA domains. Cytosine deamination and discrimination against CpG dinucleotides were proposed as the driving forces that shaped the coronavirus RNA genomes over evolutionary times (<xref rid="bib0160" ref-type="bibr">Woo et al., 2007</xref>
, <xref rid="bib0150" ref-type="bibr">Woo et al., 2010</xref>
). Related to this deviant nucleotide count, the codon usage of the coronaviruses is also particularly unusual (<xref rid="bib0035" ref-type="bibr">Gu et al., 2004</xref>
).</p>
<p id="par0040">Several new findings urged us to revisit this topic. First, the MERS coronavirus as novel human pathogen follows some general coronavirus trends, but is also characterized by some unique and rather extreme features of nucleotide usage. Second, we can present a simple, but striking classification of the human coronaviruses based on their nucleotide composition. Coronaviral RNA genomes have a rather stable G and A count, but vary significantly in the U and C distribution, with as two extremes MERS (32.5% U, 20.3% C) and HKU (40.3% U, 12.9% C). Third, we performed for the first time a nucleotide usage analysis in the context of the structured coronavirus RNA genomes. All this information provides new mechanistic insight. Specifically, some of the previously proposed mutational scenario's (e.g. CpG discrimination) become less likely and we propose alternatives for the C deamination scenario that involves a cellular cytosine deaminase enzyme. Specifically, the differential U versus C bias among the human coronaviridae may suggest a mutational property of the diverse viral polymerases. Alternatively, the specific nucleotide composition of the viral RNA genomes may have been selected to execute a certain biological function. Finally, this new insight also strongly influences the way we should look at the atypical codons used by these viruses.</p>
</sec>
<sec id="sec0010"><label>2</label>
<title>Materials and methods</title>
<p id="par0045">Nucleotide sequences of the coronaviruses were taken from Genbank (MERS: <ext-link ext-link-type="uri" xlink:href="ncbi-n:JX869059" id="intr0050">JX869059</ext-link>
, SARS: NC004718, CoV 229E: KF514433, CoV OC43: NC005147, CoV NL63: <ext-link ext-link-type="uri" xlink:href="ncbi-n:JX504050" id="intr0055">JX504050</ext-link>
, HKU-1A: <ext-link ext-link-type="uri" xlink:href="ncbi-n:DQ415914" id="intr0060">DQ415914</ext-link>
, HKU-1B: <ext-link ext-link-type="uri" xlink:href="ncbi-n:DQ415911" id="intr0065">DQ415911</ext-link>
, HKU-1C: <ext-link ext-link-type="uri" xlink:href="ncbi-n:DQ415912" id="intr0070">DQ415912</ext-link>
). MFold (<xref rid="bib0180" ref-type="bibr">Zuker and Turner, 1999</xref>
) was used with default settings for RNA secondary structure prediction. The single-stranded or ss-count file of an MFold output supplied the number of folded structures (50 maximally), including a frequency value for each individual nucleotide of being unpaired in this collection of structures. We scored a nucleotide as unpaired (single-stranded, “ss”) if half or more than half of the structure models reported its position as “ss”. Nucleotides with a ss-count value below this criterion were scored as being paired (double-stranded, “ds”). Discrimination between ss and ds nucleotides was performed in Excel and fasta files were created in order to determine the composition of ss and ds nucleotides, separately, by means of MEGAv5 (<xref rid="bib0095" ref-type="bibr">Tamura et al., 2011</xref>
).</p>
<p id="par0050">The size limit for submission to the MFold server is 9000 nucleotides (nts). The 30,000 nts coronaviral RNA genomes were therefore partitioned into four portions (3 × 8500 nts and rest) with 500 nts overlaps. This obviously ignores long-distance interactions that may occur in a coronavirus RNA genome. The ss-count data of the submission output files were arithmetically averaged at the region of overlap before the ss/ds discrimination was performed. We used the MFold ct file of the top 1 structure model to score the basepair usage in coronaviral RNA (regular Watson–Crick and G-U/U-G pairs). Partial sequence files were reconstituted at a site near the center of the overlap to minimize folding artifacts near the borders of the submitted sequences.</p>
<p id="par0055">Base composition analysis along the RNA genome length and the accompanying ss and ds fasta files was performed by the method of cumulative skew diagrams in overlapping windows (<xref rid="bib0025" ref-type="bibr">Grigoriev, 1998</xref>
). For normalization purposes, windows were defined around 1% of the sequence length with a step size of 20% of the window size.</p>
<p id="par0060">Codon usage was characterized by means of plotting the effective number of codons (ENC-values) of coronavirus genes versus their GC-content at the 3rd synonymous codon positions (GC3-values): the “Nc-plot” (<xref rid="bib0165" ref-type="bibr">Wright, 1990</xref>
). This analysis excludes the codons AUG (Met) and UGG (Trp). A continuous line indicates theoretical ENC values with random codon usage as a function of GC3. Deviation from this line in the direction of lower ENC-values points to translational selection acting in favor of a preferred set of codons, as has been described for highly expressed genes in yeast (<xref rid="bib0005" ref-type="bibr">Bennetzen and Hall, 1982</xref>
) and Escherichia coli (<xref rid="bib0085" ref-type="bibr">Sharp and Li, 1987</xref>
). Codon usage data for the nuclear genes of the host species were obtained from the Codon Usage Database (<xref rid="bib0070" ref-type="bibr">Nakamura et al., 2000</xref>
). The data were available for the following numbers of codons (number of available coding sequences or CDS in parentheses): human: 40,662,582 (93,487), dromedary: 6414 (21) and bats (3 species): 3522 (10).</p>
<p id="par0065">Calculations were performed in Excel.</p>
</sec>
<sec id="sec0015"><label>3</label>
<title>Results</title>
<sec id="sec0020"><label>3.1</label>
<title>The nucleotide count of the RNA genome differs per coronavirus</title>
<p id="par0070"><xref rid="tbl0005" ref-type="table">Table 1</xref>
lists the nucleotide count of the human coronavirus RNA genomes, ranked from highest C-count (MERS, 20.3%) to the lowest (HKU, 12.9%). We included 3 HKU isolates (1A, 1B, 1C) to demonstrate conservation of these nucleotide characteristics among virus isolates of a particular coronavirus. Strain-specific trends are also conserved for different isolates of the other coronaviruses (not shown). MERS and SARS, which represent two recent zoonotic transmissions into the human population, are both present on the same end of the spectrum. SARS is quite extreme with a C-count of 20.0% and in fact the lowest U-count of 30.7%. The highest count of 40.3% U is apparent for the 1B isolate of HKU. These numbers seem quite dramatic. For instance, the extremely biased HIV-1 RNA genome reaches a maximal A-count of 36.7% (<xref rid="bib0115" ref-type="bibr">van der Kuyl and Berkhout, 2012</xref>
).<table-wrap position="float" id="tbl0005"><label>Table 1</label>
<caption><p>Differential nucleotide composition among coronaviruses.</p>
</caption>
<table frame="hsides" rules="groups"><thead><tr><th align="left">Coronavirus</th>
<th align="left">ID</th>
<th align="left">A</th>
<th align="left">U</th>
<th align="left">C</th>
<th align="left">G</th>
</tr>
</thead>
<tbody><tr><td align="left">MERS</td>
<td align="left"><ext-link ext-link-type="uri" xlink:href="ncbi-n:JX869059" id="intr0005">JX869059</ext-link>
</td>
<td align="char">26.2</td>
<td align="char">32.5</td>
<td align="char">20.3</td>
<td align="char">20.9</td>
</tr>
<tr><td align="left">SARS</td>
<td align="left"><ext-link ext-link-type="uri" xlink:href="ncbi-n:NC_004718" id="intr0010">NC_004718</ext-link>
</td>
<td align="char">28.5</td>
<td align="char">30.7</td>
<td align="char">20.0</td>
<td align="char">20.8</td>
</tr>
<tr><td align="left">229E</td>
<td align="left">KF514433</td>
<td align="char">27.1</td>
<td align="char">34.7</td>
<td align="char">16.6</td>
<td align="char">21.6</td>
</tr>
<tr><td align="left">OC43</td>
<td align="left"><ext-link ext-link-type="uri" xlink:href="ncbi-n:NC_005147" id="intr0015">NC_005147</ext-link>
</td>
<td align="char">27.6</td>
<td align="char">35.6</td>
<td align="char">15.2</td>
<td align="char">21.7</td>
</tr>
<tr><td align="left">NL63</td>
<td align="left"><ext-link ext-link-type="uri" xlink:href="ncbi-n:JX504050" id="intr0020">JX504050</ext-link>
</td>
<td align="char">26.3</td>
<td align="char">39.2</td>
<td align="char">14.4</td>
<td align="char">20.1</td>
</tr>
<tr><td align="left">HKU-1C</td>
<td align="left"><ext-link ext-link-type="uri" xlink:href="ncbi-n:DQ415912" id="intr0025">DQ415912</ext-link>
</td>
<td align="char">27.8</td>
<td align="char">40.1</td>
<td align="char">13.0</td>
<td align="char">19.1</td>
</tr>
<tr><td align="left">HKU-1A</td>
<td align="left"><ext-link ext-link-type="uri" xlink:href="ncbi-n:DQ415914" id="intr0030">DQ415914</ext-link>
</td>
<td align="char">27.9</td>
<td align="char">40.2</td>
<td align="char">13.0</td>
<td align="char">19.0</td>
</tr>
<tr><td align="left">HKU-1B</td>
<td align="left"><ext-link ext-link-type="uri" xlink:href="ncbi-n:DQ415911" id="intr0035">DQ415911</ext-link>
</td>
<td align="char">27.7</td>
<td align="char">40.3</td>
<td align="char">12.9</td>
<td align="char">19.1</td>
</tr>
</tbody>
</table>
</table-wrap>
</p>
<p id="par0075">Some intriguing patterns become apparent by inspection of the nucleotide counts of the human coronaviruses. A few general coronavirus rules are observed. The first relates to the pyrimidines: the U-count is above average and the C-count is below average. The second rule applies to the purines and is less prominent, but A is preferred over G. There are also species-specific trends. In particular, the C/U ratio differs profoundly per coronavirus type, and these nucleotides seem to behave as communicating vessels. To illustrate that C and U seem to be competing for sequence space, we plotted the nucleotide composition per coronavirus (color coded) in <xref rid="fig0005" ref-type="fig">Fig. 1</xref>
. This picture also nicely illustrates that most variation occurs in the C/U and not the A/G section. The A/G ratio is rather stable among different coronaviruses, with minor fluctuations in the A-count (ranging from 26.2% for MERS to 28.5% for SARS) and the G-count (ranging from 19.0% for HKU-1A to 21.7% for OC43). For the more detailed follow-up analyses, we selected the two extremes MERS (32.5% U) and HKU (the 1B isolate with 40.3% U, simply called HKU hereafter).<fig id="fig0005"><label>Fig. 1</label>
<caption><p>Summary of the nucleotide composition of coronavirus RNA genomes. A and G proportions are relatively invariant among coronaviruses, while in contrast U and C are highly variable and represent communicating vessels.</p>
</caption>
<graphic xlink:href="gr1"></graphic>
</fig>
</p>
</sec>
<sec id="sec0025"><label>3.2</label>
<title>Nucleotide distribution in structured coronavirus RNA</title>
<p id="par0080">For HIV-1 RNA, we recently analyzed the nucleotide composition in the context of the structured RNA genome because an experimentally probed RNA secondary structure model was available (<xref rid="bib0130" ref-type="bibr">van Hemert et al., 2013</xref>
, <xref rid="bib0140" ref-type="bibr">Watts et al., 2009</xref>
). In particular, we were interested to map the distribution of the different nucleotides in the single-stranded and double-stranded parts across the genome. We described that the HIV-specific nucleotide bias is even more extreme in the unpaired regions of HIV-1 RNA (<xref rid="bib0130" ref-type="bibr">van Hemert et al., 2013</xref>
) and subsequently demonstrated that the same trend is apparent for MFold-predicted RNA structures (<xref rid="bib0120" ref-type="bibr">Van Hemert et al., 2014</xref>
). As no experimentally probed RNA structure models are available for the complete coronavirus genomes, we relied on computer-generated RNA structures in a first attempt to analyze the structural presentation of the different nucleotides.</p>
<p id="par0085">The RNA genomes of the two extremes, MERS (relatively low U) and HKU (relatively high U), were folded with the MFold program using default settings to yield 30–50 structures. We subsequently investigated the predicted structures of the RNA genomes of other coronaviruses. <xref rid="tbl0010" ref-type="table">Table 2</xref>
lists the nucleotide composition of the unpaired (single-stranded or ss) and basepaired (double-stranded or ds) nucleotides. The trend first described for HIV-1 RNA that the extremes become more extreme in the ss domains and consequently less extreme in the ds domains is also apparent for the HKU RNA genome. The high U-count of HKU (40.3%) goes up to 48.3% for ss and down to 36.1% for ds nucleotides. These values are extreme, but HIV-1 reaches up to 50.3% A in ss regions of its RNA genome. The C-count makes the contrasting movement: from a mere 12.9% further down to 11.0% in ss and up to 13.9% in ds domains. Thus, the nucleotide composition is put to the extreme for ss regions and approaches more neutral values in the ds regions.<table-wrap position="float" id="tbl0010"><label>Table 2</label>
<caption><p>Nucleotide composition in RNA structure models of MERS and HKU genomes.</p>
</caption>
<table frame="hsides" rules="groups"><thead><tr><th align="left">Coronavirus</th>
<th align="left">ID</th>
<th></th>
<th align="left">A</th>
<th align="left">U</th>
<th align="left">C</th>
<th align="left">G</th>
<th align="left">nts</th>
</tr>
</thead>
<tbody><tr><td rowspan="3" align="left" valign="middle">HKU-1B</td>
<td rowspan="3" align="left" valign="middle"><ext-link ext-link-type="uri" xlink:href="ncbi-n:DQ415911" id="intr0040">DQ415911</ext-link>
</td>
<td align="left">All nts</td>
<td align="char">27.7</td>
<td align="char">40.3</td>
<td align="char">12.9</td>
<td align="char">19.1</td>
<td align="char">29,904</td>
</tr>
<tr><td align="left">ss nts</td>
<td align="char">28.1</td>
<td align="char">48.3</td>
<td align="char">11.0</td>
<td align="char">12.5</td>
<td align="char">10,154</td>
</tr>
<tr><td align="left">ds nts</td>
<td align="char">27.5</td>
<td align="char">36.1</td>
<td align="char">13.9</td>
<td align="char">22.5</td>
<td align="char">19,748</td>
</tr>
<tr><td colspan="8" align="left">  </td>
</tr>
<tr><td rowspan="3" align="left" valign="middle">MERS</td>
<td rowspan="3" align="left" valign="middle"><ext-link ext-link-type="uri" xlink:href="ncbi-n:JX869059" id="intr0045">JX869059</ext-link>
</td>
<td align="left">All nts</td>
<td align="char">26.2</td>
<td align="char">32.5</td>
<td align="char">20.3</td>
<td align="char">20.9</td>
<td align="char">30,119</td>
</tr>
<tr><td align="left">ss nts</td>
<td align="char">31.2</td>
<td align="char">38.6</td>
<td align="char">19.4</td>
<td align="char">10.8</td>
<td align="char">10,948</td>
</tr>
<tr><td align="left">ds nts</td>
<td align="char">23.3</td>
<td align="char">29.1</td>
<td align="char">20.8</td>
<td align="char">26.7</td>
<td align="char">19,173</td>
</tr>
</tbody>
</table>
<table-wrap-foot><fn><p>Coronavirus RNA (30,000 nts) was divided into 4 parts (3 × 8500 + rest) with a 500 nts overlap allowing reconstitution before analysis. The ss-count output file of Mfold (based on max 50 structures) was used for data calculation.</p>
</fn>
</table-wrap-foot>
</table-wrap>
</p>
<p id="par0090">The situation is more complex for MERS, where the relatively suppressed U-count (only 32.5% compared to 40.3% for HKU) is neither suppressed more severely in the ss domains (38.6%) nor overrepresented in the ds domains (29.1%). The C-count is relatively stable in both compartments. The general value of 20.3% becomes 19.4% for ss and 20.8% for ds nucleotides.</p>
</sec>
<sec id="sec0030"><label>3.3</label>
<title>The nucleotide composition influences the basepair usage in coronavirus RNA</title>
<p id="par0095">We next probed whether the atypical nucleotide composition does influence the type of basepairs used in the structured RNA genomes. The previous HIV-1 study noticed a trend toward the more frequent usage of the more stable basepairs (GC and CG > AU and UA > GU and UG), which correlated with the G and C preference in the ds parts of the HIV-1 RNA genome (<xref rid="bib0130" ref-type="bibr">van Hemert et al., 2013</xref>
). We now analyzed the coronavirus RNA genomes. As said, we focused on the most extreme MERS and HKU genomes and we used the A-rich HIV-1 RNA as outgroup for comparison.</p>
<p id="par0100">The major conclusion is that the basepair composition correlates with the biased nucleotide counts (<xref rid="fig0010" ref-type="fig">Fig. 2</xref>
). The general coronavirus pattern (U up, C down) is visible in the basepairs: G–U/U–G and U–A/A–U are relatively up and G–C/C–G are down, which is especially true for the most extreme HKU genome. The HKU genome is low in ds C (13.9%), but remains fairly high in ds G (22.5%), which means that more G–U/U–G basepairs must have been accomodated. U still prefers to pair with A (53.9%), but we scored a significant number of pairings with G (19.5%). Overall, a rather distinct basepair composition is apparent for HKU coronavirus compared to HIV-1. The biggest difference is apparent for G–C pairs: 26.5% for HKU and 45.1% for HIV-1. But we also observed notable differences between MERS and HKU that relate to the different nucleotide composition of their RNA genomes. The basepair composition of MERS coronavirus resembles that of HIV-1 RNA much more than that of HKU.<fig id="fig0010"><label>Fig. 2</label>
<caption><p>Basepair composition in RNA structure model of MERS, HKU and HIV-1. Only the top MFold predictions were analyzed. Coronavirus RNA genomes (30,000 nts) were split in four fragments (3 × 8500 + rest) with 500-nts overlaps.</p>
</caption>
<graphic xlink:href="gr2"></graphic>
</fig>
</p>
</sec>
<sec id="sec0035"><label>3.4</label>
<title>Skew analysis along the coronavirus RNA genome</title>
<p id="par0105">We performed a nucleotide skew analysis along the MERS and HKU genomes to reveal the fine structure of the ss and ds segments (<xref rid="fig0015" ref-type="fig">Fig. 3</xref>
). An advantage of a skew analysis is that it allows one to score global trends across genomes by minimizing local fluctuations. Skew values were plotted for all six nucleotide comparisons (G vs A, C vs A, U vs A, U vs C, C vs G, U vs G) along the 30 kb viral genome in overlapping windows of 1% of the sequence size with a step size of 20% of the window size. It should be noted that GA in skew language does not represent a basepair, but a comparison of the number of G-nucleotides with the number of A-nucleotides. As a result, about 500 data points were obtained comprising each <italic>X</italic>
-axis irrespective of the length of the nucleotide sequence involved. We specifically used the same scale on the <italic>Y</italic>
-axis to allow a direct comparison of the virus-specific signatures.<fig id="fig0015"><label>Fig. 3</label>
<caption><p>Skew analysis of RNA genomes of MERS and HKU. Skew values (<italic>N</italic>
1 − <italic>N</italic>
2)/(<italic>N</italic>
1 + <italic>N</italic>
2) have been calculated in overlapping windows along the sequence (“all nts”, “ds nts” and “ss nts”). Window size was set at 1% of the length of the sequence with a step size of 20% of the window size resulting in approximately 500 datapoints comprising the <italic>X</italic>
-axis. We used the same <italic>Y</italic>
-axis for the cumulative skew values to allow a direct comparison of the compositional signatures of different coronavirus RNA genomes. It should be noted that the labels next to each line do not indicate a basepair, but refer to the two nucleotides for which the skew values were calculated.</p>
</caption>
<graphic xlink:href="gr3"></graphic>
</fig>
</p>
<p id="par0110">The general skew analysis in <xref rid="fig0015" ref-type="fig">Fig. 3</xref>
 (left panels) is in line with the general coronavirus pattern (U up, C down), with trends that pertain across the genome. U wins from the other three nucleotides as evidenced by the ascending lines, most pronounced from C, than G and A. This holds true even for the more moderately biased MERS RNA. As expected, the HKU skew is much more extreme (steeper lines) than that of MERS, but the patterns are similar. The skew patterns confirm that there is relatively much variation in the U/C values and little variation in the G/A ratio.</p>
<p id="par0115">We next performed a skew analysis for the ds and ss nucleotides (middle and right panels, respectively). The results confirm that the bias is more extreme (steeper skew lines) for the unpaired (ss) nucleotides. Skew values are relatively large in the ss domains, relatively small in the ds domains and, as expected, intermediate for all nucleotides. All skews are in agreement with the individual nucleotide composition of these viruses.</p>
<p id="par0120">The coronavirus skew values, as soon as they differ significantly from zero, are represented by straight lines along the genome length, indicating that they represent an intrinsic property of the viral genome that is not restricted to specific parts of the viral genome. In other words, the observed biases represent a stable property, but one noticeable exception seems to be present. A shift is apparent at two-third of the MERS and HKU genome length. This shift occurs in a region between the 1A/1B and S genes on the viral RNA genome. The shift is visible in all nucleotide analyses of MERS, but appears more extreme in the ss skews. A more dramatic switch of the direction of the CG skew line is apparent for HKU. Two possible mechanisms have been proposed in literature to explain this shift or switch in the nucleotide bias (<xref rid="bib0030" ref-type="bibr">Grigoriev, 2004</xref>
, <xref rid="bib0080" ref-type="bibr">Pyrc et al., 2004</xref>
).</p>
</sec>
<sec id="sec0040"><label>3.5</label>
<title>Codon usage is dictated by the nucleotide composition of coronaviruses</title>
<p id="par0125">We previously described that the A-rich HIV-1 RNA dictates the exotic codon usage of this virus. We therefore wanted to know whether the particular codon usage of coronaviruses, as described in literature, correlates with the nucleotide biases. We performed an analysis of the effective number of codons (ENC) used by the two extreme genomes of MERS and HKU. We compared the G and C content of the synonymous third codon positions (GC3) in the coronavirus gene 1 (ORF1, occupying the first two-third of the viral genome) vs the S and N genes in the last one-third, thus downstream of the observed shift/switch in the skew analysis (<xref rid="fig0020" ref-type="fig">Fig. 4</xref>
). The 3′-located S and N genes do not behave significantly different from the 5′-located gene 1. The three genes cluster together for each coronavirus species, but HKU is more extreme than MERS toward low GC3 values, as expected. The codons do not deviate much from the bell-shaped line that represents ENC values of random codon usage expected for a given GC3 composition. The characteristic distinction between MERS and HKU is the difference of GC content at the 3rd synonymous codon position. Thus, the codons follow the nucleotide count and the overall codon usage is not biased in other ways.<fig id="fig0020"><label>Fig. 4</label>
<caption><p>Codon ENC analysis of MERS and HKU. The effective number of codons (ENC-values, <italic>Y</italic>
-axis) of coronavirus genes was plotted against the GC-content at the 3rd synonymous codon positions (GC3-values, <italic>X</italic>
-axis). The continuous line indicates theoretical ENC values with random codon usage as a function of GC3. Deviation from this line in the direction of lower ENC-values points to translational selection acting in favor of a preferred set of codons. Codon usage data for the nuclear genes of the host species were obtained from the following numbers of codons (number of genes in parentheses): human: 40,662,582 (93,487), dromedary: 6414 (21) and bats (3 species): 3522 (10).</p>
</caption>
<graphic xlink:href="gr4"></graphic>
</fig>
</p>
<p id="par0130">A gross difference is apparent for codon usage in a few possible hosts: human, bat and dromedary as candidate host for MERS (<xref rid="bib0040" ref-type="bibr">Haagmans et al., 2014</xref>
). We performed additional analyses with all coronaviruses listed in <xref rid="tbl0005" ref-type="table">Table 1</xref>
. GA is stable among the different coronaviruses and UG was used to visualize U-preference (results not shown), but none of the coronaviruses cluster with any of the hosts. There is variation among coronaviruses due to the magnitude of the U/C-bias: MERS is least extreme and HKU most extreme. All other human coronaviruses occupy in between spots in the CG3 plot (not shown), consistent with their intermediate nucleotide count. These relationships hold perfectly for the longest ORF1. A few exceptions were apparent for the much smaller and thus less reliable ORFs encoding the S protein (229E a bit more extreme than MERS) and the N protein (SARS and OC43 more extreme than MERS). Most importantly, all coronavirus genes are positioned rather close to the bell-shaped curve, which indicates the absence of selection of certain “preferred” codons, and relatively far away from the GC3 values of any of the candidate host species. We conclude that the particular codon usage is determined largely by the biased nucleotide count of the coronavirus genomes.</p>
</sec>
<sec id="sec0045"><label>3.6</label>
<title>A more detailed codon analysis</title>
<p id="par0135">We also inspected the detailed codon tables that have been presented by others (<xref rid="bib0035" ref-type="bibr">Gu et al., 2004</xref>
, <xref rid="bib0160" ref-type="bibr">Woo et al., 2007</xref>
) for trends that could support our analyses. We generated a detailed survey of the codons used in the largest ORF1 in HKU vs MERS (Supplementary Table S1) and calculated the nt-count per codon position (Supplementary Table S2). Indeed, the first general coronavirus rule about pyrimidines (U up, C down) dictates codon usage in all codon groups and at the three codon positions without any exceptions. This rule holds for the broad collection of human and animal coronaviruses that were analyzed by <xref rid="bib0160" ref-type="bibr">Woo et al. (2007)</xref>
, and among the human coronaviruses the effect ranges from MERS (most modest) to HKU (most extreme). This rule is perhaps most dramatically visualized for the 2-codon groups where the choice is C or U. Phenylalanine is encoded by UUU or UUC, but a 0.94/0.06 = 15.7-fold bias for U is present in HKU, contrasting with a modest 1.7-fold bias in MERS. All other coronaviruses take an intermediate position. Another example is provided by the 4-codon groups. Valine is encoded by the four codons GUN, which are used with profound different frequencies in HKU: 0.68 GUU, 0.05 GUC, 0.20 GUA and 0.06 GUG, thus yielding a 13.6-fold bias for U over C, but a much more modest 2.2-fold bias in MERS. Also, the coronaviral rule (U up, C down) is more prominent in HKU than in MERS and hence, different proportions of ORF1 encoded amino acids with the C-nucleotide at the 2nd codon position can be expected (Supplementary Table S2). Indeed, HKU/MERS ratios of 0.86 (Ser), 0.88 (Pro), 0.79 (Thr) and 0.76 (Ala) are apparent (Supplementary Table S1). Nucleotide preferences of coronaviruses (U up, C down) affect the amino acid composition of the viral proteins.</p>
<p>Supplementary Table S1 related to this article can be found, in the online version, at <ext-link ext-link-type="doi" xlink:href="10.1016/j.virusres.2014.11.031" id="intr0075">http://dx.doi.org/10.1016/j.virusres.2014.11.031</ext-link>
.</p>
<p id="par0185"><supplementary-material content-type="local-data" id="upi0005"><caption><title>Supplementary Table S1</title>
<p>Codon usage table of HKU and MERS ORF1.</p>
</caption>
<media xlink:href="mmc1.xlsx"></media>
</supplementary-material>
</p>
<p>Supplementary Table S2 related to this article can be found, in the online version, at <ext-link ext-link-type="doi" xlink:href="10.1016/j.virusres.2014.11.031" id="intr0080">http://dx.doi.org/10.1016/j.virusres.2014.11.031</ext-link>
.</p>
<p id="par0195"><supplementary-material content-type="local-data" id="upi0010"><caption><title>Supplementary Table S2</title>
<p>Nucleotide composition at the different codon positions of HKU and MERS ORF1.</p>
</caption>
<media xlink:href="mmc2.xlsx"></media>
</supplementary-material>
</p>
<p id="par0140">The second general coronavirus rule about purines (A over G) is also well supported, but with less dramatic numbers. For A/G choices, A always wins in the coronaviruses. For instance, lysine is encoded by AAA or AAG that are used in quite different fraction of codons (0.68 vs 0.32) in HKU, yielding a 2.1-fold bias of A over G. More neutral values are apparent for MERS (1.1-fold bias of G over A). Very similar values are observed for A/G choices. All these findings follow the basic nucleotide count in these genomes as illustrated in <xref rid="fig0005" ref-type="fig">Fig. 1</xref>
.</p>
</sec>
</sec>
<sec id="sec0050"><label>4</label>
<title>Discussion</title>
<p id="par0145">We analyzed the nucleotide composition of the RNA genome of human coronaviruses and arrive at some general and species-specific rules. Two general coronavirus rules are apparent that relate to the usage of pyrimidines (C over U) and purines (A over G). The A/G bias is a relatively stable property among the coronaviruses. In contrast, the C/U bias differs significantly per virus type and we scored U-counts from 30.7% (SARS) to 40.3% (HKU) and C-counts from 20.3% (MERS) to 12.9% (HKU). The C- and U-counts behave as communicating vessels. Although this study was restricted to the human coronaviruses, these basic properties apply to all known animal and human coronas (results not shown). A quick survey revealed a new record number for the Bat-SARS-CoV with 30.5% U (<xref rid="bib0160" ref-type="bibr">Woo et al., 2007</xref>
). Perhaps surprisingly, we think that these basic nucleotide trends have not been reported previously. Although they may seem useless numbers for some, we think that these basic properties can encode important biological functions and one may start wondering about the evolutionary history of these sometimes striking nucleotide features. The biased nucleotide composition can also have a major influence on derived parameters. For instance, our analysis clearly suggests that the nucleotide composition largely dictates the codons that are used by these viruses for the translation of the RNA genome and sub-genomic mRNAs.</p>
<p id="par0150">Previous studies on codon usage in different viruses have highlighted mutational pressure as the major factor in shaping codon usage patterns compared with natural selection (<xref rid="bib0035" ref-type="bibr">Gu et al., 2004</xref>
, <xref rid="bib0045" ref-type="bibr">Jenkins and Holmes, 2003a</xref>
, <xref rid="bib0135" ref-type="bibr">Wang et al., 2011</xref>
, <xref rid="bib0145" ref-type="bibr">Wong et al., 2010</xref>
, <xref rid="bib0160" ref-type="bibr">Woo et al., 2007</xref>
, <xref rid="bib0035" ref-type="bibr">Gu et al., 2004</xref>
). For coronaviruses, cytosine deamination and the selection against CpG motifs have been proposed as the mutational forces that shaped the viral genome and its codons (<xref rid="bib0030" ref-type="bibr">Grigoriev, 2004</xref>
, <xref rid="bib0160" ref-type="bibr">Woo et al., 2007</xref>
, <xref rid="bib0080" ref-type="bibr">Pyrc et al., 2004</xref>
). However, as our understanding of codon usage increases, it appears that although mutational pressure is still a major driving force, it is certainly not the only force when considering different types of RNA and DNA viruses (<xref rid="bib0010" ref-type="bibr">Berkhout and van Hemert, 1994</xref>
, <xref rid="bib0125" ref-type="bibr">van Hemert and Berkhout, 1995</xref>
, <xref rid="bib0015" ref-type="bibr">Chen, 2013</xref>
, <xref rid="bib0090" ref-type="bibr">Shi et al., 2013</xref>
, <xref rid="bib0175" ref-type="bibr">Zhang et al., 2013</xref>
). For HIV-1 with its A-rich RNA genome, we initially proposed two evolutionary scenarios. Mutations could be introduced on an evolutionary time scale by the error-prone reverse transcriptase or cellular enzymes like Apobec, but we also stressed that these atypical RNA molecules may have been selected to exert a certain biological function. For instance, the distinctive nucleotide composition influences the overall folding of the RNA molecule and may thus affect specific replication steps like packaging of the RNA genome in virion particles. The genome composition may also relate to the intensive virus-host interaction, e.g. by avoiding recognition by the innate immune system. For HIV-1 with its extremely A-rich genome it was recently proposed that this property helps to avoid recognition by the innate immune system (<xref rid="bib0100" ref-type="bibr">Vabret et al., 2012</xref>
), which could provide strong selective pressure on retroviruses and many RNA viruses including coronaviruses (<xref rid="bib0115" ref-type="bibr">van der Kuyl and Berkhout, 2012</xref>
, <xref rid="bib0120" ref-type="bibr">Van Hemert et al., 2014</xref>
, <xref rid="bib0060" ref-type="bibr">Kindler and Thiel, 2014</xref>
). There could also be a more passive function for the biased genome composition. For HIV-1, A-rich sequences may restrict the number of sites that can be mutated and inactivated by the cellular Apobec restriction factor. For coronaviruses, the C-rich genome may highlight some important replication signals such as the A-rich TRS element (e.g. AACUAAA in NL63 (<xref rid="bib0080" ref-type="bibr">Pyrc et al., 2004</xref>
)) that is positioned at several locations within the coronaviral genome.</p>
<p id="par0155">Our finding that these nucleotide signatures are even more pronounced in the single-stranded domains of the coronaviral genomes perhaps support this latter selection theory. The nucleotide trends are certainly not restricted to the coding regions of these genomes as they are also apparent in the non-coding 5′UTR, indicating that these signatures are not invented to create a certain codon bias and that the effect is executed at the level of translation, but rather that it serves another biological purpose. In fact, this nucleotide bias also directly influences many other parameters such as the dinucleotide composition and possibly even the amino acid composition of the encoded viral proteins (<xref rid="bib0010" ref-type="bibr">Berkhout and van Hemert, 1994</xref>
). We previously indicated that serious nucleotide skews may even trigger phylogenetic artifacts: viruses with a similar nucleotide preference tend to cluster, but are not necessarily related by descent (<xref rid="bib0125" ref-type="bibr">van Hemert and Berkhout, 1995</xref>
). We also cannot formally exclude that the bias operates at the level of the genomic minus-strand RNA, which obviously has the opposite characteristics (C over U becomes G over A).</p>
<p id="par0160">Do these results tell us something about virus pathogenicity and evolutionary events during zoonotic transmissions? Although the two serious pathogens SARS and MERS are present on one side of the C/U spectrum with a relatively low U-count and high C-count, it seems dangerous to propose a correlation between the nucleotide signature and pathogenicity as this may just be a coincidence. Most viral nucleotide/codon characteristics appear not to depend on the host organism as we observed similar properties for murine, avian, bat and human coronaviruses (results not shown). The codon analysis presented in <xref rid="fig0020" ref-type="fig">Fig. 4</xref>
 also clearly indicates that there is no virus adaptation to the host, at least in this respect. One evolutionary scenario that links nucleotide usage to pathogenicity remains possible. The new coronaviruses that arrived in humans via zoonotic transfer are pathogenic (MERS, SARS). The ones that are circulating in humans for a much longer period may have adapted to become less pathogenic. This idea is similar to a natural attenuation scenario that has been proposed for other viruses like myxoma virus in a new Australian epidemic among the introduced European rabbits (<xref rid="bib0055" ref-type="bibr">Kerr et al., 2012</xref>
). It is beneficial for the virus not to kill the host too quickly as this increases the chance of viral spread. We may see this adaptation as a gradual increase of U and decrease of C, which may attenuate viral gene expression (e.g. sub-optimal codon usage, visible as gradual deviation from the ENC curve, or by another mechanism: e.g. reduced RNA packaging capacity or increased recognition by the innate immune system). This hypothesis predicts that MERS and SARS will evolve to become more U-rich and C-poor in humans, but that obviously requires the viral presence in the human population over evolutionary times.</p>
<p id="par0165">Finally, knowledge of the nucleotide and codon usage in viruses can not only reveal information about molecular evolution, but also improve our understanding of the regulation of viral gene expression and aid vaccine design, e.g. by providing novel ways for stable virus attenuation.</p>
</sec>
</body>
<back><ref-list id="bibl0005"><title>References</title>
<ref id="bib0005"><element-citation publication-type="journal" id="sbref0005"><person-group person-group-type="author"><name><surname>Bennetzen</surname>
<given-names>J.L.</given-names>
</name>
<name><surname>Hall</surname>
<given-names>B.D.</given-names>
</name>
</person-group>
<article-title>Codon selection in yeast</article-title>
<source>J. Biol. Chem.</source>
<volume>257</volume>
<year>1982</year>
<fpage>3026</fpage>
<lpage>3031</lpage>
<pub-id pub-id-type="pmid">7037777</pub-id>
</element-citation>
</ref>
<ref id="bib0010"><element-citation publication-type="journal" id="sbref0010"><person-group person-group-type="author"><name><surname>Berkhout</surname>
<given-names>B.</given-names>
</name>
<name><surname>van Hemert</surname>
<given-names>F.J.</given-names>
</name>
</person-group>
<article-title>The unusual nucleotide content of the HIV RNA genome results in a biased amino acid composition of HIV proteins</article-title>
<source>Nucl. Acids Res.</source>
<volume>22</volume>
<year>1994</year>
<fpage>1705</fpage>
<lpage>1711</lpage>
<pub-id pub-id-type="pmid">8202375</pub-id>
</element-citation>
</ref>
<ref id="bib0015"><element-citation publication-type="journal" id="sbref0015"><person-group person-group-type="author"><name><surname>Chen</surname>
<given-names>Y.</given-names>
</name>
</person-group>
<article-title>A comparison of synonymous codon usage bias patterns in DNA and RNA virus genomes: quantifying the relative importance of mutational pressure and natural selection</article-title>
<source>Biomed. Res. Int.</source>
<volume>2013</volume>
<year>2013</year>
<fpage>406342</fpage>
<pub-id pub-id-type="pmid">24199191</pub-id>
</element-citation>
</ref>
<ref id="bib0020"><element-citation publication-type="journal" id="sbref0020"><person-group person-group-type="author"><name><surname>Fouchier</surname>
<given-names>R.A.</given-names>
</name>
<name><surname>Kuiken</surname>
<given-names>T.</given-names>
</name>
<name><surname>Schutten</surname>
<given-names>M.</given-names>
</name>
<name><surname>Van Amerongen</surname>
<given-names>G.</given-names>
</name>
<name><surname>van Doornum</surname>
<given-names>G.J.</given-names>
</name>
<name><surname>van den Hoogen</surname>
<given-names>B.G.</given-names>
</name>
<name><surname>Peiris</surname>
<given-names>M.</given-names>
</name>
<name><surname>Lim</surname>
<given-names>W.</given-names>
</name>
<name><surname>Stohr</surname>
<given-names>K.</given-names>
</name>
<name><surname>Osterhaus</surname>
<given-names>A.D.</given-names>
</name>
</person-group>
<article-title>Aetiology: Koch's postulates fulfilled for SARS virus</article-title>
<source>Nature</source>
<volume>423</volume>
<year>2003</year>
<fpage>240</fpage>
<pub-id pub-id-type="pmid">12748632</pub-id>
</element-citation>
</ref>
<ref id="bib0025"><element-citation publication-type="journal" id="sbref0025"><person-group person-group-type="author"><name><surname>Grigoriev</surname>
<given-names>A.</given-names>
</name>
</person-group>
<article-title>Analyzing genomes with cumulative skew diagrams</article-title>
<source>Nucl. Acids Res.</source>
<volume>26</volume>
<year>1998</year>
<fpage>2286</fpage>
<lpage>2290</lpage>
<pub-id pub-id-type="pmid">9580676</pub-id>
</element-citation>
</ref>
<ref id="bib0030"><element-citation publication-type="journal" id="sbref0030"><person-group person-group-type="author"><name><surname>Grigoriev</surname>
<given-names>A.</given-names>
</name>
</person-group>
<article-title>Mutational patterns correlate with genome organization in SARS and other coronaviruses</article-title>
<source>Trends Genet.</source>
<volume>20</volume>
<year>2004</year>
<fpage>131</fpage>
<lpage>135</lpage>
<pub-id pub-id-type="pmid">15049309</pub-id>
</element-citation>
</ref>
<ref id="bib0035"><element-citation publication-type="journal" id="sbref0035"><person-group person-group-type="author"><name><surname>Gu</surname>
<given-names>W.</given-names>
</name>
<name><surname>Zhou</surname>
<given-names>T.</given-names>
</name>
<name><surname>Ma</surname>
<given-names>J.</given-names>
</name>
<name><surname>Sun</surname>
<given-names>X.</given-names>
</name>
<name><surname>Lu</surname>
<given-names>Z.</given-names>
</name>
</person-group>
<article-title>Analysis of synonymous codon usage in SARS coronavirus and other viruses in the Nidovirales</article-title>
<source>Virus Res.</source>
<volume>101</volume>
<year>2004</year>
<fpage>155</fpage>
<lpage>161</lpage>
<pub-id pub-id-type="pmid">15041183</pub-id>
</element-citation>
</ref>
<ref id="bib0040"><element-citation publication-type="journal" id="sbref0040"><person-group person-group-type="author"><name><surname>Haagmans</surname>
<given-names>B.L.</given-names>
</name>
</person-group>
<article-title>Middle East respiratory syndrome coronavirus in dromedary camels: an outbreak investigation</article-title>
<source>Lancet Infect. Dis.</source>
<volume>14</volume>
<year>2014</year>
<fpage>140</fpage>
<lpage>145</lpage>
<pub-id pub-id-type="pmid">24355866</pub-id>
</element-citation>
</ref>
<ref id="bib0045"><element-citation publication-type="journal" id="sbref0045"><person-group person-group-type="author"><name><surname>Jenkins</surname>
<given-names>G.M.</given-names>
</name>
<name><surname>Holmes</surname>
<given-names>E.C.</given-names>
</name>
</person-group>
<article-title>The extent of codon usage bias in human RNA viruses and its evolutionary origin</article-title>
<source>Virus Res.</source>
<volume>92</volume>
<year>2003</year>
<fpage>1</fpage>
<lpage>7</lpage>
<pub-id pub-id-type="pmid">12606071</pub-id>
</element-citation>
</ref>
<ref id="bib0055"><element-citation publication-type="journal" id="sbref0055"><person-group person-group-type="author"><name><surname>Kerr</surname>
<given-names>P.J.</given-names>
</name>
<name><surname>Ghedin</surname>
<given-names>E.</given-names>
</name>
<name><surname>DePasse</surname>
<given-names>J.V.</given-names>
</name>
<name><surname>Fitch</surname>
<given-names>A.</given-names>
</name>
<name><surname>Cattadori</surname>
<given-names>I.M.</given-names>
</name>
<name><surname>Hudson</surname>
<given-names>P.J.</given-names>
</name>
<name><surname>Tscharke</surname>
<given-names>D.C.</given-names>
</name>
<name><surname>Read</surname>
<given-names>A.F.</given-names>
</name>
<name><surname>Holmes</surname>
<given-names>E.C.</given-names>
</name>
</person-group>
<article-title>Evolutionary history and attenuation of myxoma virus on two continents</article-title>
<source>PLoS Pathog.</source>
<volume>8</volume>
<year>2012</year>
<fpage>e1002950</fpage>
<pub-id pub-id-type="pmid">23055928</pub-id>
</element-citation>
</ref>
<ref id="bib0060"><element-citation publication-type="journal" id="sbref0060"><person-group person-group-type="author"><name><surname>Kindler</surname>
<given-names>E.</given-names>
</name>
<name><surname>Thiel</surname>
<given-names>V.</given-names>
</name>
</person-group>
<article-title>To sense or not to sense viral RNA-essentials of coronavirus innate immune evasion</article-title>
<source>Curr. Opin. Microbiol.</source>
<volume>20C</volume>
<year>2014</year>
<fpage>69</fpage>
<lpage>75</lpage>
</element-citation>
</ref>
<ref id="bib0065"><element-citation publication-type="journal" id="sbref0065"><person-group person-group-type="author"><name><surname>Marra</surname>
<given-names>M.A.</given-names>
</name>
</person-group>
<article-title>The genome sequence of the SARS-associated coronavirus</article-title>
<source>Science</source>
<volume>300</volume>
<year>2003</year>
<fpage>1399</fpage>
<lpage>1404</lpage>
<pub-id pub-id-type="pmid">12730501</pub-id>
</element-citation>
</ref>
<ref id="bib0070"><element-citation publication-type="journal" id="sbref0070"><person-group person-group-type="author"><name><surname>Nakamura</surname>
<given-names>Y.</given-names>
</name>
<name><surname>Gojobori</surname>
<given-names>T.</given-names>
</name>
<name><surname>Ikemura</surname>
<given-names>T.</given-names>
</name>
</person-group>
<article-title>Codon usage tabulated from international DNA sequence databases: status for the year 2000</article-title>
<source>Nucl. Acids Res.</source>
<volume>28</volume>
<year>2000</year>
<fpage>292</fpage>
<pub-id pub-id-type="pmid">10592250</pub-id>
</element-citation>
</ref>
<ref id="bib0075"><element-citation publication-type="journal" id="sbref0075"><person-group person-group-type="author"><name><surname>Pyrc</surname>
<given-names>K.</given-names>
</name>
<name><surname>Berkhout</surname>
<given-names>B.</given-names>
</name>
<name><surname>van der Hoek</surname>
<given-names>L.</given-names>
</name>
</person-group>
<article-title>The novel human coronaviruses NL63 and HKU1</article-title>
<source>J. Virol.</source>
<volume>81</volume>
<year>2007</year>
<fpage>3051</fpage>
<lpage>3057</lpage>
<pub-id pub-id-type="pmid">17079323</pub-id>
</element-citation>
</ref>
<ref id="bib0080"><element-citation publication-type="journal" id="sbref0080"><person-group person-group-type="author"><name><surname>Pyrc</surname>
<given-names>K.</given-names>
</name>
<name><surname>Jebbink</surname>
<given-names>M.F.</given-names>
</name>
<name><surname>Berkhout</surname>
<given-names>B.</given-names>
</name>
<name><surname>van der Hoek</surname>
<given-names>L.</given-names>
</name>
</person-group>
<article-title>Genome structure and transcriptional regulation of human coronavirus NL63</article-title>
<source>Virol. J.</source>
<volume>1</volume>
<year>2004</year>
<fpage>7</fpage>
<pub-id pub-id-type="pmid">15548333</pub-id>
</element-citation>
</ref>
<ref id="bib0085"><element-citation publication-type="journal" id="sbref0085"><person-group person-group-type="author"><name><surname>Sharp</surname>
<given-names>P.M.</given-names>
</name>
<name><surname>Li</surname>
<given-names>W.H.</given-names>
</name>
</person-group>
<article-title>The codon Adaptation Index – a measure of directional synonymous codon usage bias, and its potential applications</article-title>
<source>Nucl. Acids Res.</source>
<volume>15</volume>
<year>1987</year>
<fpage>1281</fpage>
<lpage>1295</lpage>
<pub-id pub-id-type="pmid">3547335</pub-id>
</element-citation>
</ref>
<ref id="bib0090"><element-citation publication-type="journal" id="sbref0090"><person-group person-group-type="author"><name><surname>Shi</surname>
<given-names>S.L.</given-names>
</name>
<name><surname>Jiang</surname>
<given-names>Y.R.</given-names>
</name>
<name><surname>Liu</surname>
<given-names>Y.Q.</given-names>
</name>
<name><surname>Xia</surname>
<given-names>R.X.</given-names>
</name>
<name><surname>Qin</surname>
<given-names>L.</given-names>
</name>
</person-group>
<article-title>Selective pressure dominates the synonymous codon usage in parvoviridae</article-title>
<source>Virus Gen.</source>
<volume>46</volume>
<year>2013</year>
<fpage>10</fpage>
<lpage>19</lpage>
</element-citation>
</ref>
<ref id="bib0095"><element-citation publication-type="journal" id="sbref0095"><person-group person-group-type="author"><name><surname>Tamura</surname>
<given-names>K.</given-names>
</name>
<name><surname>Peterson</surname>
<given-names>D.</given-names>
</name>
<name><surname>Peterson</surname>
<given-names>N.</given-names>
</name>
<name><surname>Stecher</surname>
<given-names>G.</given-names>
</name>
<name><surname>Nei</surname>
<given-names>M.</given-names>
</name>
<name><surname>Kumar</surname>
<given-names>S.</given-names>
</name>
</person-group>
<article-title>MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods</article-title>
<source>Mol. Biol. Evol.</source>
<volume>28</volume>
<year>2011</year>
<fpage>2731</fpage>
<lpage>2739</lpage>
<pub-id pub-id-type="pmid">21546353</pub-id>
</element-citation>
</ref>
<ref id="bib0100"><element-citation publication-type="journal" id="sbref0100"><person-group person-group-type="author"><name><surname>Vabret</surname>
<given-names>N.</given-names>
</name>
<name><surname>Bailly-Bechet</surname>
<given-names>M.</given-names>
</name>
<name><surname>Najburg</surname>
<given-names>V.</given-names>
</name>
<name><surname>Muller-Trutwin</surname>
<given-names>M.</given-names>
</name>
<name><surname>Verrier</surname>
<given-names>B.</given-names>
</name>
<name><surname>Tangy</surname>
<given-names>F.</given-names>
</name>
</person-group>
<article-title>The biased nucleotide composition of HIV-1 triggers type I interferon response and correlates with subtype D increased pathogenicity</article-title>
<source>PLoS ONE</source>
<volume>7</volume>
<year>2012</year>
<fpage>e33502</fpage>
<pub-id pub-id-type="pmid">22529893</pub-id>
</element-citation>
</ref>
<ref id="bib0105"><element-citation publication-type="journal" id="sbref0105"><person-group person-group-type="author"><name><surname>van Boheemen</surname>
<given-names>S.</given-names>
</name>
</person-group>
<article-title>Genomic characterization of a newly discovered coronavirus associated with acute respiratory distress syndrome in humans</article-title>
<source>mBio</source>
<volume>3</volume>
<year>2012</year>
<comment>pii:mBio.00473-12</comment>
</element-citation>
</ref>
<ref id="bib0110"><element-citation publication-type="journal" id="sbref0110"><person-group person-group-type="author"><name><surname>van der Hoek</surname>
<given-names>L.</given-names>
</name>
<name><surname>Pyrc</surname>
<given-names>K.</given-names>
</name>
<name><surname>Jebbink</surname>
<given-names>M.F.</given-names>
</name>
<name><surname>Vermeulen-Oost</surname>
<given-names>W.</given-names>
</name>
<name><surname>Berkhout</surname>
<given-names>R.J.</given-names>
</name>
<name><surname>Wolthers</surname>
<given-names>K.C.</given-names>
</name>
<name><surname>Wertheim-van Dillen</surname>
<given-names>P.M.</given-names>
</name>
<name><surname>Kaandorp</surname>
<given-names>J.</given-names>
</name>
<name><surname>Spaargaren</surname>
<given-names>J.</given-names>
</name>
<name><surname>Berkhout</surname>
<given-names>B.</given-names>
</name>
</person-group>
<article-title>Identification of a new human coronavirus</article-title>
<source>Nat. Med.</source>
<volume>10</volume>
<year>2004</year>
<fpage>368</fpage>
<lpage>373</lpage>
<pub-id pub-id-type="pmid">15034574</pub-id>
</element-citation>
</ref>
<ref id="bib0115"><element-citation publication-type="journal" id="sbref0115"><person-group person-group-type="author"><name><surname>van der Kuyl</surname>
<given-names>A.C.</given-names>
</name>
<name><surname>Berkhout</surname>
<given-names>B.</given-names>
</name>
</person-group>
<article-title>The biased nucleotide composition of the HIV genome: a constant factor in a highly variable virus</article-title>
<source>Retrovirology</source>
<volume>9</volume>
<year>2012</year>
<fpage>92</fpage>
<pub-id pub-id-type="pmid">23131071</pub-id>
</element-citation>
</ref>
<ref id="bib0120"><element-citation publication-type="journal" id="sbref0120"><person-group person-group-type="author"><name><surname>Van Hemert</surname>
<given-names>F.</given-names>
</name>
<name><surname>van der Kuyl</surname>
<given-names>A.C.</given-names>
</name>
<name><surname>Berkhout</surname>
<given-names>B.</given-names>
</name>
</person-group>
<article-title>On the nucleotide composition and structure of retroviral RNA genomes</article-title>
<source>Virus Res.</source>
<volume>193</volume>
<year>2014</year>
<fpage>16</fpage>
<lpage>23</lpage>
<pub-id pub-id-type="pmid">24675274</pub-id>
</element-citation>
</ref>
<ref id="bib0125"><element-citation publication-type="journal" id="sbref0125"><person-group person-group-type="author"><name><surname>van Hemert</surname>
<given-names>F.J.</given-names>
</name>
<name><surname>Berkhout</surname>
<given-names>B.</given-names>
</name>
</person-group>
<article-title>The tendency of lentiviral open reading frames to become A-rich: constraints imposed by viral genome organization and cellular tRNA availability</article-title>
<source>J. Mol. Evol.</source>
<volume>41</volume>
<year>1995</year>
<fpage>132</fpage>
<lpage>140</lpage>
<pub-id pub-id-type="pmid">7666442</pub-id>
</element-citation>
</ref>
<ref id="bib0130"><element-citation publication-type="journal" id="sbref0130"><person-group person-group-type="author"><name><surname>van Hemert</surname>
<given-names>F.J.</given-names>
</name>
<name><surname>van der Kuyl</surname>
<given-names>A.C.</given-names>
</name>
<name><surname>Berkhout</surname>
<given-names>B.</given-names>
</name>
</person-group>
<article-title>The A-nucleotide preference of HIV-1 in the context of its structured RNA genome</article-title>
<source>RNA Biol.</source>
<volume>10</volume>
<year>2013</year>
<fpage>211</fpage>
<lpage>215</lpage>
<pub-id pub-id-type="pmid">23235488</pub-id>
</element-citation>
</ref>
<ref id="bib0135"><element-citation publication-type="journal" id="sbref0135"><person-group person-group-type="author"><name><surname>Wang</surname>
<given-names>M.</given-names>
</name>
<name><surname>Zhang</surname>
<given-names>J.</given-names>
</name>
<name><surname>Zhou</surname>
<given-names>J.H.</given-names>
</name>
<name><surname>Chen</surname>
<given-names>H.T.</given-names>
</name>
<name><surname>Ma</surname>
<given-names>L.N.</given-names>
</name>
<name><surname>Ding</surname>
<given-names>Y.Z.</given-names>
</name>
<name><surname>Liu</surname>
<given-names>W.Q.</given-names>
</name>
<name><surname>Liu</surname>
<given-names>Y.S.</given-names>
</name>
</person-group>
<article-title>Analysis of codon usage in bovine viral diarrhea virus</article-title>
<source>Arch. Virol.</source>
<volume>156</volume>
<year>2011</year>
<fpage>153</fpage>
<lpage>160</lpage>
<pub-id pub-id-type="pmid">21069395</pub-id>
</element-citation>
</ref>
<ref id="bib0140"><element-citation publication-type="journal" id="sbref0140"><person-group person-group-type="author"><name><surname>Watts</surname>
<given-names>J.M.</given-names>
</name>
<name><surname>Dang</surname>
<given-names>K.K.</given-names>
</name>
<name><surname>Gorelick</surname>
<given-names>R.J.</given-names>
</name>
<name><surname>Leonard</surname>
<given-names>C.W.</given-names>
</name>
<name><surname>Bess</surname>
<given-names>J.W.</given-names>
<suffix>Jr.</suffix>
</name>
<name><surname>Swanstrom</surname>
<given-names>R.</given-names>
</name>
<name><surname>Burch</surname>
<given-names>C.L.</given-names>
</name>
<name><surname>Weeks</surname>
<given-names>K.M.</given-names>
</name>
</person-group>
<article-title>Architecture and secondary structure of an entire HIV-1 RNA genome</article-title>
<source>Nature</source>
<volume>460</volume>
<year>2009</year>
<fpage>711</fpage>
<lpage>716</lpage>
<pub-id pub-id-type="pmid">19661910</pub-id>
</element-citation>
</ref>
<ref id="bib0145"><element-citation publication-type="journal" id="sbref0145"><person-group person-group-type="author"><name><surname>Wong</surname>
<given-names>E.H.</given-names>
</name>
<name><surname>Smith</surname>
<given-names>D.K.</given-names>
</name>
<name><surname>Rabadan</surname>
<given-names>R.</given-names>
</name>
<name><surname>Peiris</surname>
<given-names>M.</given-names>
</name>
<name><surname>Poon</surname>
<given-names>L.L.</given-names>
</name>
</person-group>
<article-title>Codon usage bias and the evolution of influenza A viruses. Codon usage biases of influenza virus</article-title>
<source>BMC. Evol. Biol.</source>
<volume>10</volume>
<year>2010</year>
<fpage>253</fpage>
<pub-id pub-id-type="pmid">20723216</pub-id>
</element-citation>
</ref>
<ref id="bib0150"><element-citation publication-type="journal" id="sbref0150"><person-group person-group-type="author"><name><surname>Woo</surname>
<given-names>P.C.</given-names>
</name>
<name><surname>Huang</surname>
<given-names>Y.</given-names>
</name>
<name><surname>Lau</surname>
<given-names>S.K.</given-names>
</name>
<name><surname>Yuen</surname>
<given-names>K.Y.</given-names>
</name>
</person-group>
<article-title>Coronavirus genomics and bioinformatics analysis</article-title>
<source>Viruses</source>
<volume>2</volume>
<year>2010</year>
<fpage>1804</fpage>
<lpage>1820</lpage>
<pub-id pub-id-type="pmid">21994708</pub-id>
</element-citation>
</ref>
<ref id="bib0155"><element-citation publication-type="journal" id="sbref0155"><person-group person-group-type="author"><name><surname>Woo</surname>
<given-names>P.C.</given-names>
</name>
</person-group>
<article-title>Characterization and complete genome sequence of a novel coronavirus, coronavirus HKU1, from patients with pneumonia</article-title>
<source>J. Virol.</source>
<volume>79</volume>
<year>2005</year>
<fpage>884</fpage>
<lpage>895</lpage>
<pub-id pub-id-type="pmid">15613317</pub-id>
</element-citation>
</ref>
<ref id="bib0160"><element-citation publication-type="journal" id="sbref0160"><person-group person-group-type="author"><name><surname>Woo</surname>
<given-names>P.C.</given-names>
</name>
<name><surname>Wong</surname>
<given-names>B.H.</given-names>
</name>
<name><surname>Huang</surname>
<given-names>Y.</given-names>
</name>
<name><surname>Lau</surname>
<given-names>S.K.</given-names>
</name>
<name><surname>Yuen</surname>
<given-names>K.Y.</given-names>
</name>
</person-group>
<article-title>Cytosine deamination and selection of CpG suppressed clones are the two major independent biological forces that shape codon usage bias in coronaviruses</article-title>
<source>Virology</source>
<volume>369</volume>
<year>2007</year>
<fpage>431</fpage>
<lpage>442</lpage>
<pub-id pub-id-type="pmid">17881030</pub-id>
</element-citation>
</ref>
<ref id="bib0165"><element-citation publication-type="journal" id="sbref0165"><person-group person-group-type="author"><name><surname>Wright</surname>
<given-names>F.</given-names>
</name>
</person-group>
<article-title>The ‘effective number of codons’ used in a gene</article-title>
<source>Gene</source>
<volume>87</volume>
<year>1990</year>
<fpage>23</fpage>
<lpage>29</lpage>
<pub-id pub-id-type="pmid">2110097</pub-id>
</element-citation>
</ref>
<ref id="bib0170"><element-citation publication-type="journal" id="sbref0170"><person-group person-group-type="author"><name><surname>Zaki</surname>
<given-names>A.M.</given-names>
</name>
<name><surname>van</surname>
<given-names>B.S.</given-names>
</name>
<name><surname>Bestebroer</surname>
<given-names>T.M.</given-names>
</name>
<name><surname>Osterhaus</surname>
<given-names>A.D.</given-names>
</name>
<name><surname>Fouchier</surname>
<given-names>R.A.</given-names>
</name>
</person-group>
<article-title>Isolation of a novel coronavirus from a man with pneumonia in Saudi Arabia</article-title>
<source>N. Engl. J. Med.</source>
<volume>367</volume>
<year>2012</year>
<fpage>1814</fpage>
<lpage>1820</lpage>
<pub-id pub-id-type="pmid">23075143</pub-id>
</element-citation>
</ref>
<ref id="bib0175"><element-citation publication-type="journal" id="sbref0175"><person-group person-group-type="author"><name><surname>Zhang</surname>
<given-names>Z.</given-names>
</name>
<name><surname>Dai</surname>
<given-names>W.</given-names>
</name>
<name><surname>Dai</surname>
<given-names>D.</given-names>
</name>
</person-group>
<article-title>Synonymous codon usage in TTSuV2: analysis and comparison with TTSuV1</article-title>
<source>PLOS ONE</source>
<volume>8</volume>
<year>2013</year>
<fpage>e81469</fpage>
<pub-id pub-id-type="pmid">24303050</pub-id>
</element-citation>
</ref>
<ref id="bib0180"><element-citation publication-type="book" id="sbref0180"><person-group person-group-type="author"><name><surname>Zuker</surname>
<given-names>M.</given-names>
</name>
<name><surname>Turner</surname>
<given-names>D.H.</given-names>
</name>
</person-group>
<chapter-title>Algorithms and thermodynamics for RNA secondary structure prediction: a practical guide</chapter-title>
<person-group person-group-type="editor"><name><surname>Barciszewski</surname>
<given-names>J.</given-names>
</name>
<name><surname>Clark</surname>
<given-names>B.F.C.</given-names>
</name>
</person-group>
<source>RNA Biochemistry and Biotechnology</source>
<year>1999</year>
<publisher-name>Kluwer Academic Publishers</publisher-name>
<publisher-loc>Dordrecht/Boston/London</publisher-loc>
<fpage>11</fpage>
<lpage>43</lpage>
</element-citation>
</ref>
</ref-list>
<ack id="ack0005"><title>Acknowledgement</title>
<p>This research is sponsored by <funding-source id="gs0005">NWO</funding-source>
 (700.59.301) (TOP grant to BB).</p>
</ack>
</back>
</pmc>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Sante/explor/MersV1/Data/Pmc/Corpus

HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001198  | SxmlIndent | more

HfdSelect -h $EXPLOR_AREA/Data/Pmc/Corpus/biblio.hfd -nk 001198  | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Sante
   |area=    MersV1
   |flux=    Pmc
   |étape=   Corpus
   |type=    RBID
   |clé=     
   |texte=   
}}

This area was generated with Dilib version V0.6.33.
Data generation: Mon Apr 20 23:26:43 2020. Site generation: Sat Mar 27 09:06:09 2021

	Serveur d'exploration MERS
	Attention, ce site est en cours de développement ! Attention, site généré par des moyens informatiques à partir de corpus bruts. Les informations ne sont donc pas validées.

Serveur d'exploration MERS

Links to Exploration step

Le document en format XML

Pour manipuler ce document sous Unix (Dilib)

Pour mettre un lien sur cette page dans le réseau Wicri