Whole genome de novo assemblies of three divergent strains of rice, Oryza sativa, document novel gene space of aus and indica

Internal identifier: 000182 (Pmc/Corpus); previous: 000181; next: 000183

Authors: Michael C. Schatz; Lyza G. Maron; Joshua C. Stein; Alejandro Hernandez Wences; James Gurtowski; Eric Biggers; Hayan Lee; Melissa Kramer; Eric Antoniou; Elena Ghiban; Mark H. Wright; Jer-Ming Chia; Doreen Ware; Susan R. McCouch; W. Richard McCombie

Source:

Genome Biology [1465-6906]; 2014.

RBID: PMC:4268812

Abstract

Background

The use of high throughput genome-sequencing technologies has uncovered a large extent of structural variation in eukaryotic genomes that makes important contributions to genomic diversity and phenotypic variation. When the genomes of different strains of a given organism are compared, whole genome resequencing data are typically aligned to an established reference sequence. However, when the reference differs in significant structural ways from the individuals under study, the analysis is often incomplete or inaccurate.

Results

Here, we use rice as a model to demonstrate how improvements in sequencing and assembly technology allow rapid and inexpensive de novo assembly of next generation sequence data into high-quality assemblies that can be directly compared using whole genome alignment to provide an unbiased assessment. Using this approach, we are able to accurately assess the ‘pan-genome’ of three divergent rice varieties and document several megabases of each genome absent in the other two.

Conclusions

Many of the genome-specific loci are annotated to contain genes, reflecting the potential for new biological properties that would be missed by standard reference-mapping approaches. We further provide a detailed analysis of several loci associated with agriculturally important traits, including the S5 hybrid sterility locus, the Sub1 submergence tolerance locus, the LRK gene cluster associated with improved yield, and the Pup1 cluster associated with phosphorus deficiency, illustrating the utility of our approach for biological discovery. All of the data and software are openly available to support further breeding and functional studies of rice and other species.

Electronic supplementary material

The online version of this article (doi:10.1186/s13059-014-0506-z) contains supplementary material, which is available to authorized users.

URL:

http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4268812

DOI: 10.1186/s13059-014-0506-z
PubMed: 25468217
PubMed Central: 4268812

The document in XML format

<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en">Whole genome <italic>de novo</italic>
 assemblies of three divergent strains of rice, <italic>Oryza sativa</italic>
, document novel gene space of <italic>aus</italic>
 and <italic>indica</italic>
</title>
<author><name sortKey="Schatz, Michael C" sort="Schatz, Michael C" uniqKey="Schatz M" first="Michael C" last="Schatz">Michael C. Schatz</name>
<affiliation><nlm:aff id="Aff1">Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724 USA</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Maron, Lyza G" sort="Maron, Lyza G" uniqKey="Maron L" first="Lyza G" last="Maron">Lyza G. Maron</name>
<affiliation><nlm:aff id="Aff2">Department of Plant Breeding and Genetics, Cornell University, Ithaca, NY 14853 USA</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Stein, Joshua C" sort="Stein, Joshua C" uniqKey="Stein J" first="Joshua C" last="Stein">Joshua C. Stein</name>
<affiliation><nlm:aff id="Aff1">Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724 USA</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Wences, Alejandro Hernandez" sort="Wences, Alejandro Hernandez" uniqKey="Wences A" first="Alejandro Hernandez" last="Wences">Alejandro Hernandez Wences</name>
<affiliation><nlm:aff id="Aff1">Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724 USA</nlm:aff>
</affiliation>
<affiliation><nlm:aff id="Aff3">Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, 62210 Morelos Mexico</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Gurtowski, James" sort="Gurtowski, James" uniqKey="Gurtowski J" first="James" last="Gurtowski">James Gurtowski</name>
<affiliation><nlm:aff id="Aff1">Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724 USA</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Biggers, Eric" sort="Biggers, Eric" uniqKey="Biggers E" first="Eric" last="Biggers">Eric Biggers</name>
<affiliation><nlm:aff id="Aff1">Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724 USA</nlm:aff>
</affiliation>
<affiliation><nlm:aff id="Aff4">Macalester College, St Paul, MN 55105 USA</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Lee, Hayan" sort="Lee, Hayan" uniqKey="Lee H" first="Hayan" last="Lee">Hayan Lee</name>
<affiliation><nlm:aff id="Aff1">Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724 USA</nlm:aff>
</affiliation>
<affiliation><nlm:aff id="Aff5">Stony Brook University, Stony Brook, NY 11794 USA</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Kramer, Melissa" sort="Kramer, Melissa" uniqKey="Kramer M" first="Melissa" last="Kramer">Melissa Kramer</name>
<affiliation><nlm:aff id="Aff1">Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724 USA</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Antoniou, Eric" sort="Antoniou, Eric" uniqKey="Antoniou E" first="Eric" last="Antoniou">Eric Antoniou</name>
<affiliation><nlm:aff id="Aff1">Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724 USA</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Ghiban, Elena" sort="Ghiban, Elena" uniqKey="Ghiban E" first="Elena" last="Ghiban">Elena Ghiban</name>
<affiliation><nlm:aff id="Aff1">Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724 USA</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Wright, Mark H" sort="Wright, Mark H" uniqKey="Wright M" first="Mark H" last="Wright">Mark H. Wright</name>
<affiliation><nlm:aff id="Aff2">Department of Plant Breeding and Genetics, Cornell University, Ithaca, NY 14853 USA</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Chia, Jer Ming" sort="Chia, Jer Ming" uniqKey="Chia J" first="Jer-Ming" last="Chia">Jer-Ming Chia</name>
<affiliation><nlm:aff id="Aff1">Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724 USA</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Ware, Doreen" sort="Ware, Doreen" uniqKey="Ware D" first="Doreen" last="Ware">Doreen Ware</name>
<affiliation><nlm:aff id="Aff1">Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724 USA</nlm:aff>
</affiliation>
<affiliation><nlm:aff id="Aff6">USDA-ARS NAA Plant, Soil and Nutrition Laboratory Research Unit, Cornell University, Ithaca, NY 14853 USA</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Mccouch, Susan R" sort="Mccouch, Susan R" uniqKey="Mccouch S" first="Susan R" last="Mccouch">Susan R. McCouch</name>
<affiliation><nlm:aff id="Aff2">Department of Plant Breeding and Genetics, Cornell University, Ithaca, NY 14853 USA</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Mccombie, W Richard" sort="Mccombie, W Richard" uniqKey="Mccombie W" first="W Richard" last="Mccombie">W. Richard McCombie</name>
<affiliation><nlm:aff id="Aff1">Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724 USA</nlm:aff>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">PMC</idno>
<idno type="pmid">25468217</idno>
<idno type="pmc">4268812</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4268812</idno>
<idno type="RBID">PMC:4268812</idno>
<idno type="doi">10.1186/s13059-014-0506-z</idno>
<date when="2014">2014</date>
<idno type="wicri:Area/Pmc/Corpus">000182</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a" type="main">Whole genome <italic>de novo</italic>
 assemblies of three divergent strains of rice, <italic>Oryza sativa</italic>
, document novel gene space of <italic>aus</italic>
 and <italic>indica</italic>
</title>
<author><name sortKey="Schatz, Michael C" sort="Schatz, Michael C" uniqKey="Schatz M" first="Michael C" last="Schatz">Michael C. Schatz</name>
<affiliation><nlm:aff id="Aff1">Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724 USA</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Maron, Lyza G" sort="Maron, Lyza G" uniqKey="Maron L" first="Lyza G" last="Maron">Lyza G. Maron</name>
<affiliation><nlm:aff id="Aff2">Department of Plant Breeding and Genetics, Cornell University, Ithaca, NY 14853 USA</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Stein, Joshua C" sort="Stein, Joshua C" uniqKey="Stein J" first="Joshua C" last="Stein">Joshua C. Stein</name>
<affiliation><nlm:aff id="Aff1">Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724 USA</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Wences, Alejandro Hernandez" sort="Wences, Alejandro Hernandez" uniqKey="Wences A" first="Alejandro Hernandez" last="Wences">Alejandro Hernandez Wences</name>
<affiliation><nlm:aff id="Aff1">Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724 USA</nlm:aff>
</affiliation>
<affiliation><nlm:aff id="Aff3">Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, 62210 Morelos Mexico</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Gurtowski, James" sort="Gurtowski, James" uniqKey="Gurtowski J" first="James" last="Gurtowski">James Gurtowski</name>
<affiliation><nlm:aff id="Aff1">Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724 USA</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Biggers, Eric" sort="Biggers, Eric" uniqKey="Biggers E" first="Eric" last="Biggers">Eric Biggers</name>
<affiliation><nlm:aff id="Aff1">Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724 USA</nlm:aff>
</affiliation>
<affiliation><nlm:aff id="Aff4">Macalester College, St Paul, MN 55105 USA</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Lee, Hayan" sort="Lee, Hayan" uniqKey="Lee H" first="Hayan" last="Lee">Hayan Lee</name>
<affiliation><nlm:aff id="Aff1">Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724 USA</nlm:aff>
</affiliation>
<affiliation><nlm:aff id="Aff5">Stony Brook University, Stony Brook, NY 11794 USA</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Kramer, Melissa" sort="Kramer, Melissa" uniqKey="Kramer M" first="Melissa" last="Kramer">Melissa Kramer</name>
<affiliation><nlm:aff id="Aff1">Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724 USA</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Antoniou, Eric" sort="Antoniou, Eric" uniqKey="Antoniou E" first="Eric" last="Antoniou">Eric Antoniou</name>
<affiliation><nlm:aff id="Aff1">Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724 USA</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Ghiban, Elena" sort="Ghiban, Elena" uniqKey="Ghiban E" first="Elena" last="Ghiban">Elena Ghiban</name>
<affiliation><nlm:aff id="Aff1">Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724 USA</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Wright, Mark H" sort="Wright, Mark H" uniqKey="Wright M" first="Mark H" last="Wright">Mark H. Wright</name>
<affiliation><nlm:aff id="Aff2">Department of Plant Breeding and Genetics, Cornell University, Ithaca, NY 14853 USA</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Chia, Jer Ming" sort="Chia, Jer Ming" uniqKey="Chia J" first="Jer-Ming" last="Chia">Jer-Ming Chia</name>
<affiliation><nlm:aff id="Aff1">Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724 USA</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Ware, Doreen" sort="Ware, Doreen" uniqKey="Ware D" first="Doreen" last="Ware">Doreen Ware</name>
<affiliation><nlm:aff id="Aff1">Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724 USA</nlm:aff>
</affiliation>
<affiliation><nlm:aff id="Aff6">USDA-ARS NAA Plant, Soil and Nutrition Laboratory Research Unit, Cornell University, Ithaca, NY 14853 USA</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Mccouch, Susan R" sort="Mccouch, Susan R" uniqKey="Mccouch S" first="Susan R" last="Mccouch">Susan R. McCouch</name>
<affiliation><nlm:aff id="Aff2">Department of Plant Breeding and Genetics, Cornell University, Ithaca, NY 14853 USA</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Mccombie, W Richard" sort="Mccombie, W Richard" uniqKey="Mccombie W" first="W Richard" last="Mccombie">W. Richard McCombie</name>
<affiliation><nlm:aff id="Aff1">Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724 USA</nlm:aff>
</affiliation>
</author>
</analytic>
<series><title level="j">Genome Biology</title>
<idno type="ISSN">1465-6906</idno>
<idno type="eISSN">1465-6914</idno>
<imprint><date when="2014">2014</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass></textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en"><sec><title>Background</title>
<p>The use of high throughput genome-sequencing technologies has uncovered a large extent of structural variation in eukaryotic genomes that makes important contributions to genomic diversity and phenotypic variation. When the genomes of different strains of a given organism are compared, whole genome resequencing data are typically aligned to an established reference sequence. However, when the reference differs in significant structural ways from the individuals under study, the analysis is often incomplete or inaccurate.</p>
</sec>
<sec><title>Results</title>
<p>Here, we use rice as a model to demonstrate how improvements in sequencing and assembly technology allow rapid and inexpensive <italic>de novo</italic>
 assembly of next generation sequence data into high-quality assemblies that can be directly compared using whole genome alignment to provide an unbiased assessment. Using this approach, we are able to accurately assess the ‘pan-genome’ of three divergent rice varieties and document several megabases of each genome absent in the other two.</p>
</sec>
<sec><title>Conclusions</title>
<p>Many of the genome-specific loci are annotated to contain genes, reflecting the potential for new biological properties that would be missed by standard reference-mapping approaches. We further provide a detailed analysis of several loci associated with agriculturally important traits, including the <italic>S5</italic>
 hybrid sterility locus, the <italic>Sub1</italic>
 submergence tolerance locus, the <italic>LRK</italic>
 gene cluster associated with improved yield, and the <italic>Pup1</italic>
 cluster associated with phosphorus deficiency, illustrating the utility of our approach for biological discovery. All of the data and software are openly available to support further breeding and functional studies of rice and other species.</p>
</sec>
<sec><title>Electronic supplementary material</title>
<p>The online version of this article (doi:10.1186/s13059-014-0506-z) contains supplementary material, which is available to authorized users.</p>
</sec>
</div>
</front>
<back><div1 type="bibliography"><listBibl><biblStruct><analytic><author><name sortKey="Garris, Aj" uniqKey="Garris A">AJ Garris</name>
</author>
<author><name sortKey="Tai, Th" uniqKey="Tai T">TH Tai</name>
</author>
<author><name sortKey="Coburn, J" uniqKey="Coburn J">J Coburn</name>
</author>
<author><name sortKey="Kresovich, S" uniqKey="Kresovich S">S Kresovich</name>
</author>
<author><name sortKey="Mccouch, S" uniqKey="Mccouch S">S McCouch</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Huang, X" uniqKey="Huang X">X Huang</name>
</author>
<author><name sortKey="Kurata, N" uniqKey="Kurata N">N Kurata</name>
</author>
<author><name sortKey="Wei, X" uniqKey="Wei X">X Wei</name>
</author>
<author><name sortKey="Wang, Zx" uniqKey="Wang Z">ZX Wang</name>
</author>
<author><name sortKey="Wang, A" uniqKey="Wang A">A Wang</name>
</author>
<author><name sortKey="Zhao, Q" uniqKey="Zhao Q">Q Zhao</name>
</author>
<author><name sortKey="Zhao, Y" uniqKey="Zhao Y">Y Zhao</name>
</author>
<author><name sortKey="Liu, K" uniqKey="Liu K">K Liu</name>
</author>
<author><name sortKey="Lu, H" uniqKey="Lu H">H Lu</name>
</author>
<author><name sortKey="Li, W" uniqKey="Li W">W Li</name>
</author>
<author><name sortKey="Guo, Y" uniqKey="Guo Y">Y Guo</name>
</author>
<author><name sortKey="Lu, Y" uniqKey="Lu Y">Y Lu</name>
</author>
<author><name sortKey="Zhou, C" uniqKey="Zhou C">C Zhou</name>
</author>
<author><name sortKey="Fan, D" uniqKey="Fan D">D Fan</name>
</author>
<author><name sortKey="Weng, Q" uniqKey="Weng Q">Q Weng</name>
</author>
<author><name sortKey="Zhu, C" uniqKey="Zhu C">C Zhu</name>
</author>
<author><name sortKey="Huang, T" uniqKey="Huang T">T Huang</name>
</author>
<author><name sortKey="Zhang, L" uniqKey="Zhang L">L Zhang</name>
</author>
<author><name sortKey="Wang, Y" uniqKey="Wang Y">Y Wang</name>
</author>
<author><name sortKey="Feng, L" uniqKey="Feng L">L Feng</name>
</author>
<author><name sortKey="Furuumi, H" uniqKey="Furuumi H">H Furuumi</name>
</author>
<author><name sortKey="Kubo, T" uniqKey="Kubo T">T Kubo</name>
</author>
<author><name sortKey="Miyabayashi, T" uniqKey="Miyabayashi T">T Miyabayashi</name>
</author>
<author><name sortKey="Yuan, X" uniqKey="Yuan X">X Yuan</name>
</author>
<author><name sortKey="Xu, Q" uniqKey="Xu Q">Q Xu</name>
</author>
<author><name sortKey="Dong, G" uniqKey="Dong G">G Dong</name>
</author>
<author><name sortKey="Zhan, Q" uniqKey="Zhan Q">Q Zhan</name>
</author>
<author><name sortKey="Li, C" uniqKey="Li C">C Li</name>
</author>
<author><name sortKey="Fujiyama, A" uniqKey="Fujiyama A">A Fujiyama</name>
</author>
<author><name sortKey="Toyoda, A" uniqKey="Toyoda A">A Toyoda</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Zhao, Ky" uniqKey="Zhao K">KY Zhao</name>
</author>
<author><name sortKey="Wright, M" uniqKey="Wright M">M Wright</name>
</author>
<author><name sortKey="Kimball, J" uniqKey="Kimball J">J Kimball</name>
</author>
<author><name sortKey="Eizenga, G" uniqKey="Eizenga G">G Eizenga</name>
</author>
<author><name sortKey="Mcclung, A" uniqKey="Mcclung A">A McClung</name>
</author>
<author><name sortKey="Kovach, M" uniqKey="Kovach M">M Kovach</name>
</author>
<author><name sortKey="Tyagi, W" uniqKey="Tyagi W">W Tyagi</name>
</author>
<author><name sortKey="Ali, Ml" uniqKey="Ali M">ML Ali</name>
</author>
<author><name sortKey="Tung, Cw" uniqKey="Tung C">CW Tung</name>
</author>
<author><name sortKey="Reynolds, A" uniqKey="Reynolds A">A Reynolds</name>
</author>
<author><name sortKey="Bustamante, Cd" uniqKey="Bustamante C">CD Bustamante</name>
</author>
<author><name sortKey="Mccouch, Sr" uniqKey="Mccouch S">SR McCouch</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Boyko, Ar" uniqKey="Boyko A">AR Boyko</name>
</author>
<author><name sortKey="Quignon, P" uniqKey="Quignon P">P Quignon</name>
</author>
<author><name sortKey="Li, L" uniqKey="Li L">L Li</name>
</author>
<author><name sortKey="Schoenebeck, Jj" uniqKey="Schoenebeck J">JJ Schoenebeck</name>
</author>
<author><name sortKey="Degenhardt, Jd" uniqKey="Degenhardt J">JD Degenhardt</name>
</author>
<author><name sortKey="Lohmueller, Ke" uniqKey="Lohmueller K">KE Lohmueller</name>
</author>
<author><name sortKey="Zhao, K" uniqKey="Zhao K">K Zhao</name>
</author>
<author><name sortKey="Brisbin, A" uniqKey="Brisbin A">A Brisbin</name>
</author>
<author><name sortKey="Parker, Hg" uniqKey="Parker H">HG Parker</name>
</author>
<author><name sortKey="Vonholdt, Bm" uniqKey="Vonholdt B">BM vonHoldt</name>
</author>
<author><name sortKey="Cargill, M" uniqKey="Cargill M">M Cargill</name>
</author>
<author><name sortKey="Auton, A" uniqKey="Auton A">A Auton</name>
</author>
<author><name sortKey="Reynolds, A" uniqKey="Reynolds A">A Reynolds</name>
</author>
<author><name sortKey="Elkahloun, Ag" uniqKey="Elkahloun A">AG Elkahloun</name>
</author>
<author><name sortKey="Castelhano, M" uniqKey="Castelhano M">M Castelhano</name>
</author>
<author><name sortKey="Mosher, Ds" uniqKey="Mosher D">DS Mosher</name>
</author>
<author><name sortKey="Sutter, Nb" uniqKey="Sutter N">NB Sutter</name>
</author>
<author><name sortKey="Johnson, Gs" uniqKey="Johnson G">GS Johnson</name>
</author>
<author><name sortKey="Novembre, J" uniqKey="Novembre J">J Novembre</name>
</author>
<author><name sortKey="Hubisz, Mj" uniqKey="Hubisz M">MJ Hubisz</name>
</author>
<author><name sortKey="Siepel, A" uniqKey="Siepel A">A Siepel</name>
</author>
<author><name sortKey="Wayne, Rk" uniqKey="Wayne R">RK Wayne</name>
</author>
<author><name sortKey="Bustamante, Cd" uniqKey="Bustamante C">CD Bustamante</name>
</author>
<author><name sortKey="Ostrander, Ea" uniqKey="Ostrander E">EA Ostrander</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Weir, Bs" uniqKey="Weir B">BS Weir</name>
</author>
<author><name sortKey="Cardon, Lr" uniqKey="Cardon L">LR Cardon</name>
</author>
<author><name sortKey="Anderson, Ad" uniqKey="Anderson A">AD Anderson</name>
</author>
<author><name sortKey="Nielsen, Dm" uniqKey="Nielsen D">DM Nielsen</name>
</author>
<author><name sortKey="Hill, Wg" uniqKey="Hill W">WG Hill</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Matsuoka, Y" uniqKey="Matsuoka Y">Y Matsuoka</name>
</author>
<author><name sortKey="Vigouroux, Y" uniqKey="Vigouroux Y">Y Vigouroux</name>
</author>
<author><name sortKey="Goodman, Mm" uniqKey="Goodman M">MM Goodman</name>
</author>
<author><name sortKey="Sanchez, Gj" uniqKey="Sanchez G">GJ Sanchez</name>
</author>
<author><name sortKey="Buckler, E" uniqKey="Buckler E">E Buckler</name>
</author>
<author><name sortKey="Doebley, J" uniqKey="Doebley J">J Doebley</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Zhao, K" uniqKey="Zhao K">K Zhao</name>
</author>
<author><name sortKey="Tung, Cw" uniqKey="Tung C">CW Tung</name>
</author>
<author><name sortKey="Eizenga, Gc" uniqKey="Eizenga G">GC Eizenga</name>
</author>
<author><name sortKey="Wright, Mh" uniqKey="Wright M">MH Wright</name>
</author>
<author><name sortKey="Ali, Ml" uniqKey="Ali M">ML Ali</name>
</author>
<author><name sortKey="Price, Ah" uniqKey="Price A">AH Price</name>
</author>
<author><name sortKey="Norton, Gj" uniqKey="Norton G">GJ Norton</name>
</author>
<author><name sortKey="Islam, Mr" uniqKey="Islam M">MR Islam</name>
</author>
<author><name sortKey="Reynolds, A" uniqKey="Reynolds A">A Reynolds</name>
</author>
<author><name sortKey="Mezey, J" uniqKey="Mezey J">J Mezey</name>
</author>
<author><name sortKey="Mcclung, Am" uniqKey="Mcclung A">AM McClung</name>
</author>
<author><name sortKey="Bustamante, Cd" uniqKey="Bustamante C">CD Bustamante</name>
</author>
<author><name sortKey="Mccouch, Sr" uniqKey="Mccouch S">SR McCouch</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Ma, J" uniqKey="Ma J">J Ma</name>
</author>
<author><name sortKey="Bennetzen, Jl" uniqKey="Bennetzen J">JL Bennetzen</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Cheng, Cy" uniqKey="Cheng C">CY Cheng</name>
</author>
<author><name sortKey="Motohashi, R" uniqKey="Motohashi R">R Motohashi</name>
</author>
<author><name sortKey="Tsuchimoto, S" uniqKey="Tsuchimoto S">S Tsuchimoto</name>
</author>
<author><name sortKey="Fukuta, Y" uniqKey="Fukuta Y">Y Fukuta</name>
</author>
<author><name sortKey="Ohtsubo, H" uniqKey="Ohtsubo H">H Ohtsubo</name>
</author>
<author><name sortKey="Ohtsubo, E" uniqKey="Ohtsubo E">E Ohtsubo</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Kovach, Mj" uniqKey="Kovach M">MJ Kovach</name>
</author>
<author><name sortKey="Sweeney, Mt" uniqKey="Sweeney M">MT Sweeney</name>
</author>
<author><name sortKey="Mccouch, Sr" uniqKey="Mccouch S">SR McCouch</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Roy, Sc" uniqKey="Roy S">SC Roy</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Second, G" uniqKey="Second G">G Second</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Second, G" uniqKey="Second G">G Second</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Ding, J" uniqKey="Ding J">J Ding</name>
</author>
<author><name sortKey="Araki, H" uniqKey="Araki H">H Araki</name>
</author>
<author><name sortKey="Wang, Q" uniqKey="Wang Q">Q Wang</name>
</author>
<author><name sortKey="Zhang, P" uniqKey="Zhang P">P Zhang</name>
</author>
<author><name sortKey="Yang, S" uniqKey="Yang S">S Yang</name>
</author>
<author><name sortKey="Chen, Jq" uniqKey="Chen J">JQ Chen</name>
</author>
<author><name sortKey="Tian, D" uniqKey="Tian D">D Tian</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Liu, Xh" uniqKey="Liu X">XH Liu</name>
</author>
<author><name sortKey="Lu, Tt" uniqKey="Lu T">TT Lu</name>
</author>
<author><name sortKey="Yu, Sl" uniqKey="Yu S">SL Yu</name>
</author>
<author><name sortKey="Li, Y" uniqKey="Li Y">Y Li</name>
</author>
<author><name sortKey="Huang, Yc" uniqKey="Huang Y">YC Huang</name>
</author>
<author><name sortKey="Huang, T" uniqKey="Huang T">T Huang</name>
</author>
<author><name sortKey="Zhang, L" uniqKey="Zhang L">L Zhang</name>
</author>
<author><name sortKey="Zhu, Jj" uniqKey="Zhu J">JJ Zhu</name>
</author>
<author><name sortKey="Zhao, Q" uniqKey="Zhao Q">Q Zhao</name>
</author>
<author><name sortKey="Fan, Dl" uniqKey="Fan D">DL Fan</name>
</author>
<author><name sortKey="Mu, J" uniqKey="Mu J">J Mu</name>
</author>
<author><name sortKey="Shangguan, Yy" uniqKey="Shangguan Y">YY Shangguan</name>
</author>
<author><name sortKey="Feng, Q" uniqKey="Feng Q">Q Feng</name>
</author>
<author><name sortKey="Guan, Jp" uniqKey="Guan J">JP Guan</name>
</author>
<author><name sortKey="Ying, K" uniqKey="Ying K">K Ying</name>
</author>
<author><name sortKey="Zhang, Y" uniqKey="Zhang Y">Y Zhang</name>
</author>
<author><name sortKey="Lin, Zx" uniqKey="Lin Z">ZX Lin</name>
</author>
<author><name sortKey="Sun, Zx" uniqKey="Sun Z">ZX Sun</name>
</author>
<author><name sortKey="Qian, Q" uniqKey="Qian Q">Q Qian</name>
</author>
<author><name sortKey="Lu, Yp" uniqKey="Lu Y">YP Lu</name>
</author>
<author><name sortKey="Han, B" uniqKey="Han B">B Han</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Feltus, Fa" uniqKey="Feltus F">FA Feltus</name>
</author>
<author><name sortKey="Wan, J" uniqKey="Wan J">J Wan</name>
</author>
<author><name sortKey="Schulze, Sr" uniqKey="Schulze S">SR Schulze</name>
</author>
<author><name sortKey="Estill, Jc" uniqKey="Estill J">JC Estill</name>
</author>
<author><name sortKey="Jiang, N" uniqKey="Jiang N">N Jiang</name>
</author>
<author><name sortKey="Paterson, Ah" uniqKey="Paterson A">AH Paterson</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Huang, Xh" uniqKey="Huang X">XH Huang</name>
</author>
<author><name sortKey="Lu, Gj" uniqKey="Lu G">GJ Lu</name>
</author>
<author><name sortKey="Zhao, Q" uniqKey="Zhao Q">Q Zhao</name>
</author>
<author><name sortKey="Liu, Xh" uniqKey="Liu X">XH Liu</name>
</author>
<author><name sortKey="Han, B" uniqKey="Han B">B Han</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Shomura, A" uniqKey="Shomura A">A Shomura</name>
</author>
<author><name sortKey="Izawa, T" uniqKey="Izawa T">T Izawa</name>
</author>
<author><name sortKey="Ebana, K" uniqKey="Ebana K">K Ebana</name>
</author>
<author><name sortKey="Ebitani, T" uniqKey="Ebitani T">T Ebitani</name>
</author>
<author><name sortKey="Kanegae, H" uniqKey="Kanegae H">H Kanegae</name>
</author>
<author><name sortKey="Konishi, S" uniqKey="Konishi S">S Konishi</name>
</author>
<author><name sortKey="Yano, M" uniqKey="Yano M">M Yano</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Takano Kai, N" uniqKey="Takano Kai N">N Takano-Kai</name>
</author>
<author><name sortKey="Jiang, H" uniqKey="Jiang H">H Jiang</name>
</author>
<author><name sortKey="Kubo, T" uniqKey="Kubo T">T Kubo</name>
</author>
<author><name sortKey="Sweeney, M" uniqKey="Sweeney M">M Sweeney</name>
</author>
<author><name sortKey="Matsumoto, T" uniqKey="Matsumoto T">T Matsumoto</name>
</author>
<author><name sortKey="Kanamori, H" uniqKey="Kanamori H">H Kanamori</name>
</author>
<author><name sortKey="Padhukasahasram, B" uniqKey="Padhukasahasram B">B Padhukasahasram</name>
</author>
<author><name sortKey="Bustamante, C" uniqKey="Bustamante C">C Bustamante</name>
</author>
<author><name sortKey="Yoshimura, A" uniqKey="Yoshimura A">A Yoshimura</name>
</author>
<author><name sortKey="Doi, K" uniqKey="Doi K">K Doi</name>
</author>
<author><name sortKey="Mccouch, S" uniqKey="Mccouch S">S McCouch</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Takano Kai, N" uniqKey="Takano Kai N">N Takano-Kai</name>
</author>
<author><name sortKey="Jiang, H" uniqKey="Jiang H">H Jiang</name>
</author>
<author><name sortKey="Kubo, T" uniqKey="Kubo T">T Kubo</name>
</author>
<author><name sortKey="Sweeney, M" uniqKey="Sweeney M">M Sweeney</name>
</author>
<author><name sortKey="Matsumoto, T" uniqKey="Matsumoto T">T Matsumoto</name>
</author>
<author><name sortKey="Kanamori, H" uniqKey="Kanamori H">H Kanamori</name>
</author>
<author><name sortKey="Padhukasahasram, B" uniqKey="Padhukasahasram B">B Padhukasahasram</name>
</author>
<author><name sortKey="Bustamante, C" uniqKey="Bustamante C">C Bustamante</name>
</author>
<author><name sortKey="Yoshimura, A" uniqKey="Yoshimura A">A Yoshimura</name>
</author>
<author><name sortKey="Doi, K" uniqKey="Doi K">K Doi</name>
</author>
<author><name sortKey="Mccouch, S" uniqKey="Mccouch S">S McCouch</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Tan, L" uniqKey="Tan L">L Tan</name>
</author>
<author><name sortKey="Li, X" uniqKey="Li X">X Li</name>
</author>
<author><name sortKey="Liu, F" uniqKey="Liu F">F Liu</name>
</author>
<author><name sortKey="Sun, X" uniqKey="Sun X">X Sun</name>
</author>
<author><name sortKey="Li, C" uniqKey="Li C">C Li</name>
</author>
<author><name sortKey="Zhu, Z" uniqKey="Zhu Z">Z Zhu</name>
</author>
<author><name sortKey="Fu, Y" uniqKey="Fu Y">Y Fu</name>
</author>
<author><name sortKey="Cai, H" uniqKey="Cai H">H Cai</name>
</author>
<author><name sortKey="Wang, X" uniqKey="Wang X">X Wang</name>
</author>
<author><name sortKey="Xie, D" uniqKey="Xie D">D Xie</name>
</author>
<author><name sortKey="Sun, C" uniqKey="Sun C">C Sun</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Harushima, Y" uniqKey="Harushima Y">Y Harushima</name>
</author>
<author><name sortKey="Nakagahra, M" uniqKey="Nakagahra M">M Nakagahra</name>
</author>
<author><name sortKey="Yano, M" uniqKey="Yano M">M Yano</name>
</author>
<author><name sortKey="Sasaki, T" uniqKey="Sasaki T">T Sasaki</name>
</author>
<author><name sortKey="Kurata, N" uniqKey="Kurata N">N Kurata</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Lin, Sy" uniqKey="Lin S">SY Lin</name>
</author>
<author><name sortKey="Ikehashi, H" uniqKey="Ikehashi H">H Ikehashi</name>
</author>
<author><name sortKey="Yanagihara, S" uniqKey="Yanagihara S">S Yanagihara</name>
</author>
<author><name sortKey="Kawashima, A" uniqKey="Kawashima A">A Kawashima</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Oka, Hi" uniqKey="Oka H">HI Oka</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Sano, Y" uniqKey="Sano Y">Y Sano</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Ammiraju, Jss" uniqKey="Ammiraju J">JSS Ammiraju</name>
</author>
<author><name sortKey="Song, Xa" uniqKey="Song X">XA Song</name>
</author>
<author><name sortKey="Luo, Mz" uniqKey="Luo M">MZ Luo</name>
</author>
<author><name sortKey="Sisneros, N" uniqKey="Sisneros N">N Sisneros</name>
</author>
<author><name sortKey="Angelova, A" uniqKey="Angelova A">A Angelova</name>
</author>
<author><name sortKey="Kudrna, D" uniqKey="Kudrna D">D Kudrna</name>
</author>
<author><name sortKey="Kim, H" uniqKey="Kim H">H Kim</name>
</author>
<author><name sortKey="Yu, Y" uniqKey="Yu Y">Y Yu</name>
</author>
<author><name sortKey="Goicoechea, Jl" uniqKey="Goicoechea J">JL Goicoechea</name>
</author>
<author><name sortKey="Lorieux, M" uniqKey="Lorieux M">M Lorieux</name>
</author>
<author><name sortKey="Kurata, N" uniqKey="Kurata N">N Kurata</name>
</author>
<author><name sortKey="Brar, D" uniqKey="Brar D">D Brar</name>
</author>
<author><name sortKey="Ware, D" uniqKey="Ware D">D Ware</name>
</author>
<author><name sortKey="Jackson, S" uniqKey="Jackson S">S Jackson</name>
</author>
<author><name sortKey="Wing, Ra" uniqKey="Wing R">RA Wing</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct><analytic><author><name sortKey="Gao, Zy" uniqKey="Gao Z">ZY Gao</name>
</author>
<author><name sortKey="Zhao, Sc" uniqKey="Zhao S">SC Zhao</name>
</author>
<author><name sortKey="He, Wm" uniqKey="He W">WM He</name>
</author>
<author><name sortKey="Guo, Lb" uniqKey="Guo L">LB Guo</name>
</author>
<author><name sortKey="Peng, Yl" uniqKey="Peng Y">YL Peng</name>
</author>
<author><name sortKey="Wang, Jj" uniqKey="Wang J">JJ Wang</name>
</author>
<author><name sortKey="Guo, Xs" uniqKey="Guo X">XS Guo</name>
</author>
<author><name sortKey="Zhang, Xm" uniqKey="Zhang X">XM Zhang</name>
</author>
<author><name sortKey="Rao, Yc" uniqKey="Rao Y">YC Rao</name>
</author>
<author><name sortKey="Zhang, C" uniqKey="Zhang C">C Zhang</name>
</author>
<author><name sortKey="Dong, Gj" uniqKey="Dong G">GJ Dong</name>
</author>
<author><name sortKey="Zheng, Fy" uniqKey="Zheng F">FY Zheng</name>
</author>
<author><name sortKey="Lu, Cx" uniqKey="Lu C">CX Lu</name>
</author>
<author><name sortKey="Hu, J" uniqKey="Hu J">J Hu</name>
</author>
<author><name sortKey="Zhou, Q" uniqKey="Zhou Q">Q Zhou</name>
</author>
<author><name sortKey="Liu, Hj" uniqKey="Liu H">HJ Liu</name>
</author>
<author><name sortKey="Wu, Hy" uniqKey="Wu H">HY Wu</name>
</author>
<author><name sortKey="Xu, J" uniqKey="Xu J">J Xu</name>
</author>
<author><name sortKey="Ni, Px" uniqKey="Ni P">PX Ni</name>
</author>
<author><name sortKey="Zeng, Dl" uniqKey="Zeng D">DL Zeng</name>
</author>
<author><name sortKey="Liu, Dh" uniqKey="Liu D">DH Liu</name>
</author>
<author><name sortKey="Tian, P" uniqKey="Tian P">P Tian</name>
</author>
<author><name sortKey="Gong, Lh" uniqKey="Gong L">LH Gong</name>
</author>
<author><name sortKey="Ye, C" uniqKey="Ye C">C Ye</name>
</author>
<author><name sortKey="Zhang, Gh" uniqKey="Zhang G">GH Zhang</name>
</author>
<author><name sortKey="Wang, J" uniqKey="Wang J">J Wang</name>
</author>
<author><name sortKey="Tian, Fk" uniqKey="Tian F">FK Tian</name>
</author>
<author><name sortKey="Xue, Dw" uniqKey="Xue D">DW Xue</name>
</author>
<author><name sortKey="Liao, Y" uniqKey="Liao Y">Y Liao</name>
</author>
<author><name sortKey="Zhu, L" uniqKey="Zhu L">L Zhu</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Yu, J" uniqKey="Yu J">J Yu</name>
</author>
<author><name sortKey="Wang, J" uniqKey="Wang J">J Wang</name>
</author>
<author><name sortKey="Lin, W" uniqKey="Lin W">W Lin</name>
</author>
<author><name sortKey="Li, Sg" uniqKey="Li S">SG Li</name>
</author>
<author><name sortKey="Li, H" uniqKey="Li H">H Li</name>
</author>
<author><name sortKey="Zhou, J" uniqKey="Zhou J">J Zhou</name>
</author>
<author><name sortKey="Ni, Px" uniqKey="Ni P">PX Ni</name>
</author>
<author><name sortKey="Dong, W" uniqKey="Dong W">W Dong</name>
</author>
<author><name sortKey="Hu, Sn" uniqKey="Hu S">SN Hu</name>
</author>
<author><name sortKey="Zeng, Cq" uniqKey="Zeng C">CQ Zeng</name>
</author>
<author><name sortKey="Zhang, Jg" uniqKey="Zhang J">JG Zhang</name>
</author>
<author><name sortKey="Zhang, Y" uniqKey="Zhang Y">Y Zhang</name>
</author>
<author><name sortKey="Li, Rq" uniqKey="Li R">RQ Li</name>
</author>
<author><name sortKey="Xu, Zy" uniqKey="Xu Z">ZY Xu</name>
</author>
<author><name sortKey="Li, St" uniqKey="Li S">ST Li</name>
</author>
<author><name sortKey="Li, Xr" uniqKey="Li X">XR Li</name>
</author>
<author><name sortKey="Zheng, Hk" uniqKey="Zheng H">HK Zheng</name>
</author>
<author><name sortKey="Cong, Lj" uniqKey="Cong L">LJ Cong</name>
</author>
<author><name sortKey="Lin, L" uniqKey="Lin L">L Lin</name>
</author>
<author><name sortKey="Yin, Jn" uniqKey="Yin J">JN Yin</name>
</author>
<author><name sortKey="Geng, Jn" uniqKey="Geng J">JN Geng</name>
</author>
<author><name sortKey="Li, Gy" uniqKey="Li G">GY Li</name>
</author>
<author><name sortKey="Shi, Jp" uniqKey="Shi J">JP Shi</name>
</author>
<author><name sortKey="Liu, J" uniqKey="Liu J">J Liu</name>
</author>
<author><name sortKey="Lv, H" uniqKey="Lv H">H Lv</name>
</author>
<author><name sortKey="Li, J" uniqKey="Li J">J Li</name>
</author>
<author><name sortKey="Wang, J" uniqKey="Wang J">J Wang</name>
</author>
<author><name sortKey="Deng, Yj" uniqKey="Deng Y">YJ Deng</name>
</author>
<author><name sortKey="Ran, Lh" uniqKey="Ran L">LH Ran</name>
</author>
<author><name sortKey="Shi, Xl" uniqKey="Shi X">XL Shi</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Huang, Xh" uniqKey="Huang X">XH Huang</name>
</author>
<author><name sortKey="Wei, Xh" uniqKey="Wei X">XH Wei</name>
</author>
<author><name sortKey="Sang, T" uniqKey="Sang T">T Sang</name>
</author>
<author><name sortKey="Zhao, Qa" uniqKey="Zhao Q">QA Zhao</name>
</author>
<author><name sortKey="Feng, Q" uniqKey="Feng Q">Q Feng</name>
</author>
<author><name sortKey="Zhao, Y" uniqKey="Zhao Y">Y Zhao</name>
</author>
<author><name sortKey="Li, Cy" uniqKey="Li C">CY Li</name>
</author>
<author><name sortKey="Zhu, Cr" uniqKey="Zhu C">CR Zhu</name>
</author>
<author><name sortKey="Lu, Tt" uniqKey="Lu T">TT Lu</name>
</author>
<author><name sortKey="Zhang, Zw" uniqKey="Zhang Z">ZW Zhang</name>
</author>
<author><name sortKey="Li, M" uniqKey="Li M">M Li</name>
</author>
<author><name sortKey="Fan, Dl" uniqKey="Fan D">DL Fan</name>
</author>
<author><name sortKey="Guo, Yl" uniqKey="Guo Y">YL Guo</name>
</author>
<author><name sortKey="Wang, A" uniqKey="Wang A">A Wang</name>
</author>
<author><name sortKey="Wang, L" uniqKey="Wang L">L Wang</name>
</author>
<author><name sortKey="Deng, Lw" uniqKey="Deng L">LW Deng</name>
</author>
<author><name sortKey="Li, Wj" uniqKey="Li W">WJ Li</name>
</author>
<author><name sortKey="Lu, Yq" uniqKey="Lu Y">YQ Lu</name>
</author>
<author><name sortKey="Weng, Qj" uniqKey="Weng Q">QJ Weng</name>
</author>
<author><name sortKey="Liu, Ky" uniqKey="Liu K">KY Liu</name>
</author>
<author><name sortKey="Huang, T" uniqKey="Huang T">T Huang</name>
</author>
<author><name sortKey="Zhou, Ty" uniqKey="Zhou T">TY Zhou</name>
</author>
<author><name sortKey="Jing, Yf" uniqKey="Jing Y">YF Jing</name>
</author>
<author><name sortKey="Li, W" uniqKey="Li W">W Li</name>
</author>
<author><name sortKey="Lin, Z" uniqKey="Lin Z">Z Lin</name>
</author>
<author><name sortKey="Buckler, Es" uniqKey="Buckler E">ES Buckler</name>
</author>
<author><name sortKey="Qian, Qa" uniqKey="Qian Q">QA Qian</name>
</author>
<author><name sortKey="Zhang, Qf" uniqKey="Zhang Q">QF Zhang</name>
</author>
<author><name sortKey="Li, Jy" uniqKey="Li J">JY Li</name>
</author>
<author><name sortKey="Han, B" uniqKey="Han B">B Han</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Mccouch, Sr" uniqKey="Mccouch S">SR McCouch</name>
</author>
<author><name sortKey="Zhao, Ky" uniqKey="Zhao K">KY Zhao</name>
</author>
<author><name sortKey="Wright, M" uniqKey="Wright M">M Wright</name>
</author>
<author><name sortKey="Tung, Cw" uniqKey="Tung C">CW Tung</name>
</author>
<author><name sortKey="Ebana, K" uniqKey="Ebana K">K Ebana</name>
</author>
<author><name sortKey="Thomson, M" uniqKey="Thomson M">M Thomson</name>
</author>
<author><name sortKey="Reynolds, A" uniqKey="Reynolds A">A Reynolds</name>
</author>
<author><name sortKey="Wang, D" uniqKey="Wang D">D Wang</name>
</author>
<author><name sortKey="Declerck, G" uniqKey="Declerck G">G DeClerck</name>
</author>
<author><name sortKey="Ali, Ml" uniqKey="Ali M">ML Ali</name>
</author>
<author><name sortKey="Mcclung, A" uniqKey="Mcclung A">A McClung</name>
</author>
<author><name sortKey="Eizenga, G" uniqKey="Eizenga G">G Eizenga</name>
</author>
<author><name sortKey="Bustamante, C" uniqKey="Bustamante C">C Bustamante</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Mcnally, Kl" uniqKey="Mcnally K">KL McNally</name>
</author>
<author><name sortKey="Childs, Kl" uniqKey="Childs K">KL Childs</name>
</author>
<author><name sortKey="Bohnert, R" uniqKey="Bohnert R">R Bohnert</name>
</author>
<author><name sortKey="Davidson, Rm" uniqKey="Davidson R">RM Davidson</name>
</author>
<author><name sortKey="Zhao, K" uniqKey="Zhao K">K Zhao</name>
</author>
<author><name sortKey="Ulat, Vj" uniqKey="Ulat V">VJ Ulat</name>
</author>
<author><name sortKey="Zeller, G" uniqKey="Zeller G">G Zeller</name>
</author>
<author><name sortKey="Clark, Rm" uniqKey="Clark R">RM Clark</name>
</author>
<author><name sortKey="Hoen, Dr" uniqKey="Hoen D">DR Hoen</name>
</author>
<author><name sortKey="Bureau, Te" uniqKey="Bureau T">TE Bureau</name>
</author>
<author><name sortKey="Stokowski, R" uniqKey="Stokowski R">R Stokowski</name>
</author>
<author><name sortKey="Ballinger, Dg" uniqKey="Ballinger D">DG Ballinger</name>
</author>
<author><name sortKey="Frazer, Ka" uniqKey="Frazer K">KA Frazer</name>
</author>
<author><name sortKey="Cox, Dr" uniqKey="Cox D">DR Cox</name>
</author>
<author><name sortKey="Padhukasahasram, B" uniqKey="Padhukasahasram B">B Padhukasahasram</name>
</author>
<author><name sortKey="Bustamante, Cd" uniqKey="Bustamante C">CD Bustamante</name>
</author>
<author><name sortKey="Weigel, D" uniqKey="Weigel D">D Weigel</name>
</author>
<author><name sortKey="Mackill, Dj" uniqKey="Mackill D">DJ Mackill</name>
</author>
<author><name sortKey="Bruskiewich, Rm" uniqKey="Bruskiewich R">RM Bruskiewich</name>
</author>
<author><name sortKey="Ratsch, G" uniqKey="Ratsch G">G Ratsch</name>
</author>
<author><name sortKey="Buell, Cr" uniqKey="Buell C">CR Buell</name>
</author>
<author><name sortKey="Leung, H" uniqKey="Leung H">H Leung</name>
</author>
<author><name sortKey="Leach, Je" uniqKey="Leach J">JE Leach</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Xu, K" uniqKey="Xu K">K Xu</name>
</author>
<author><name sortKey="Xu, X" uniqKey="Xu X">X Xu</name>
</author>
<author><name sortKey="Fukao, T" uniqKey="Fukao T">T Fukao</name>
</author>
<author><name sortKey="Canlas, P" uniqKey="Canlas P">P Canlas</name>
</author>
<author><name sortKey="Maghirang Rodriguez, R" uniqKey="Maghirang Rodriguez R">R Maghirang-Rodriguez</name>
</author>
<author><name sortKey="Heuer, S" uniqKey="Heuer S">S Heuer</name>
</author>
<author><name sortKey="Ismail, Am" uniqKey="Ismail A">AM Ismail</name>
</author>
<author><name sortKey="Bailey Serres, J" uniqKey="Bailey Serres J">J Bailey-Serres</name>
</author>
<author><name sortKey="Ronald, Pc" uniqKey="Ronald P">PC Ronald</name>
</author>
<author><name sortKey="Mackill, Dj" uniqKey="Mackill D">DJ Mackill</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Huang, Xh" uniqKey="Huang X">XH Huang</name>
</author>
<author><name sortKey="Feng, Q" uniqKey="Feng Q">Q Feng</name>
</author>
<author><name sortKey="Qian, Q" uniqKey="Qian Q">Q Qian</name>
</author>
<author><name sortKey="Zhao, Q" uniqKey="Zhao Q">Q Zhao</name>
</author>
<author><name sortKey="Wang, L" uniqKey="Wang L">L Wang</name>
</author>
<author><name sortKey="Wang, Ah" uniqKey="Wang A">AH Wang</name>
</author>
<author><name sortKey="Guan, Jp" uniqKey="Guan J">JP Guan</name>
</author>
<author><name sortKey="Fan, Dl" uniqKey="Fan D">DL Fan</name>
</author>
<author><name sortKey="Weng, Qj" uniqKey="Weng Q">QJ Weng</name>
</author>
<author><name sortKey="Huang, T" uniqKey="Huang T">T Huang</name>
</author>
<author><name sortKey="Dong, Gj" uniqKey="Dong G">GJ Dong</name>
</author>
<author><name sortKey="Sang, T" uniqKey="Sang T">T Sang</name>
</author>
<author><name sortKey="Han, B" uniqKey="Han B">B Han</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Xu, X" uniqKey="Xu X">X Xu</name>
</author>
<author><name sortKey="Liu, X" uniqKey="Liu X">X Liu</name>
</author>
<author><name sortKey="Ge, S" uniqKey="Ge S">S Ge</name>
</author>
<author><name sortKey="Jensen, Jd" uniqKey="Jensen J">JD Jensen</name>
</author>
<author><name sortKey="Hu, Fy" uniqKey="Hu F">FY Hu</name>
</author>
<author><name sortKey="Li, X" uniqKey="Li X">X Li</name>
</author>
<author><name sortKey="Dong, Y" uniqKey="Dong Y">Y Dong</name>
</author>
<author><name sortKey="Gutenkunst, Rn" uniqKey="Gutenkunst R">RN Gutenkunst</name>
</author>
<author><name sortKey="Fang, L" uniqKey="Fang L">L Fang</name>
</author>
<author><name sortKey="Huang, L" uniqKey="Huang L">L Huang</name>
</author>
<author><name sortKey="Li, Jx" uniqKey="Li J">JX Li</name>
</author>
<author><name sortKey="He, Wm" uniqKey="He W">WM He</name>
</author>
<author><name sortKey="Zhang, Gj" uniqKey="Zhang G">GJ Zhang</name>
</author>
<author><name sortKey="Zheng, Xm" uniqKey="Zheng X">XM Zheng</name>
</author>
<author><name sortKey="Zhang, Fm" uniqKey="Zhang F">FM Zhang</name>
</author>
<author><name sortKey="Li, Yr" uniqKey="Li Y">YR Li</name>
</author>
<author><name sortKey="Yu, C" uniqKey="Yu C">C Yu</name>
</author>
<author><name sortKey="Kristiansen, K" uniqKey="Kristiansen K">K Kristiansen</name>
</author>
<author><name sortKey="Zhang, Xq" uniqKey="Zhang X">XQ Zhang</name>
</author>
<author><name sortKey="Wang, J" uniqKey="Wang J">J Wang</name>
</author>
<author><name sortKey="Wright, M" uniqKey="Wright M">M Wright</name>
</author>
<author><name sortKey="Mccouch, S" uniqKey="Mccouch S">S McCouch</name>
</author>
<author><name sortKey="Nielsen, R" uniqKey="Nielsen R">R Nielsen</name>
</author>
<author><name sortKey="Wang, J" uniqKey="Wang J">J Wang</name>
</author>
<author><name sortKey="Wang, W" uniqKey="Wang W">W Wang</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Li, Jy" uniqKey="Li J">JY Li</name>
</author>
<author><name sortKey="Wang, J" uniqKey="Wang J">J Wang</name>
</author>
<author><name sortKey="Zeigler, Rs" uniqKey="Zeigler R">RS Zeigler</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Han, B" uniqKey="Han B">B Han</name>
</author>
<author><name sortKey="Xue, Yb" uniqKey="Xue Y">YB Xue</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Zuccolo, A" uniqKey="Zuccolo A">A Zuccolo</name>
</author>
<author><name sortKey="Sebastian, A" uniqKey="Sebastian A">A Sebastian</name>
</author>
<author><name sortKey="Talag, J" uniqKey="Talag J">J Talag</name>
</author>
<author><name sortKey="Yu, Y" uniqKey="Yu Y">Y Yu</name>
</author>
<author><name sortKey="Kim, H" uniqKey="Kim H">H Kim</name>
</author>
<author><name sortKey="Collura, K" uniqKey="Collura K">K Collura</name>
</author>
<author><name sortKey="Kudrna, D" uniqKey="Kudrna D">D Kudrna</name>
</author>
<author><name sortKey="Wing, Ra" uniqKey="Wing R">RA Wing</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Yu, P" uniqKey="Yu P">P Yu</name>
</author>
<author><name sortKey="Wang, Ch" uniqKey="Wang C">CH Wang</name>
</author>
<author><name sortKey="Xu, Q" uniqKey="Xu Q">Q Xu</name>
</author>
<author><name sortKey="Feng, Y" uniqKey="Feng Y">Y Feng</name>
</author>
<author><name sortKey="Yuan, Xp" uniqKey="Yuan X">XP Yuan</name>
</author>
<author><name sortKey="Yu, Hy" uniqKey="Yu H">HY Yu</name>
</author>
<author><name sortKey="Wang, Yp" uniqKey="Wang Y">YP Wang</name>
</author>
<author><name sortKey="Tang, Sx" uniqKey="Tang S">SX Tang</name>
</author>
<author><name sortKey="Wei, Xh" uniqKey="Wei X">XH Wei</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Famoso, An" uniqKey="Famoso A">AN Famoso</name>
</author>
<author><name sortKey="Zhao, K" uniqKey="Zhao K">K Zhao</name>
</author>
<author><name sortKey="Clark, Rt" uniqKey="Clark R">RT Clark</name>
</author>
<author><name sortKey="Tung, Cw" uniqKey="Tung C">CW Tung</name>
</author>
<author><name sortKey="Wright, Mh" uniqKey="Wright M">MH Wright</name>
</author>
<author><name sortKey="Bustamante, C" uniqKey="Bustamante C">C Bustamante</name>
</author>
<author><name sortKey="Kochian, Lv" uniqKey="Kochian L">LV Kochian</name>
</author>
<author><name sortKey="Mccouch, Sr" uniqKey="Mccouch S">SR McCouch</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Gamuyao, R" uniqKey="Gamuyao R">R Gamuyao</name>
</author>
<author><name sortKey="Chin, Jh" uniqKey="Chin J">JH Chin</name>
</author>
<author><name sortKey="Pariasca Tanaka, J" uniqKey="Pariasca Tanaka J">J Pariasca-Tanaka</name>
</author>
<author><name sortKey="Pesaresi, P" uniqKey="Pesaresi P">P Pesaresi</name>
</author>
<author><name sortKey="Catausan, S" uniqKey="Catausan S">S Catausan</name>
</author>
<author><name sortKey="Dalid, C" uniqKey="Dalid C">C Dalid</name>
</author>
<author><name sortKey="Slamet Loedin, I" uniqKey="Slamet Loedin I">I Slamet-Loedin</name>
</author>
<author><name sortKey="Tecson Mendoza, Em" uniqKey="Tecson Mendoza E">EM Tecson-Mendoza</name>
</author>
<author><name sortKey="Wissuwa, M" uniqKey="Wissuwa M">M Wissuwa</name>
</author>
<author><name sortKey="Heuer, S" uniqKey="Heuer S">S Heuer</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Uga, Y" uniqKey="Uga Y">Y Uga</name>
</author>
<author><name sortKey="Sugimoto, K" uniqKey="Sugimoto K">K Sugimoto</name>
</author>
<author><name sortKey="Ogawa, S" uniqKey="Ogawa S">S Ogawa</name>
</author>
<author><name sortKey="Rane, J" uniqKey="Rane J">J Rane</name>
</author>
<author><name sortKey="Ishitani, M" uniqKey="Ishitani M">M Ishitani</name>
</author>
<author><name sortKey="Hara, N" uniqKey="Hara N">N Hara</name>
</author>
<author><name sortKey="Kitomi, Y" uniqKey="Kitomi Y">Y Kitomi</name>
</author>
<author><name sortKey="Inukai, Y" uniqKey="Inukai Y">Y Inukai</name>
</author>
<author><name sortKey="Ono, K" uniqKey="Ono K">K Ono</name>
</author>
<author><name sortKey="Kanno, N" uniqKey="Kanno N">N Kanno</name>
</author>
<author><name sortKey="Inoue, H" uniqKey="Inoue H">H Inoue</name>
</author>
<author><name sortKey="Takehisa, H" uniqKey="Takehisa H">H Takehisa</name>
</author>
<author><name sortKey="Motoyama, R" uniqKey="Motoyama R">R Motoyama</name>
</author>
<author><name sortKey="Nagamura, Y" uniqKey="Nagamura Y">Y Nagamura</name>
</author>
<author><name sortKey="Wu, J" uniqKey="Wu J">J Wu</name>
</author>
<author><name sortKey="Matsumoto, T" uniqKey="Matsumoto T">T Matsumoto</name>
</author>
<author><name sortKey="Takai, T" uniqKey="Takai T">T Takai</name>
</author>
<author><name sortKey="Okuno, K" uniqKey="Okuno K">K Okuno</name>
</author>
<author><name sortKey="Yano, M" uniqKey="Yano M">M Yano</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Liakat Ali, M" uniqKey="Liakat Ali M">M Liakat Ali</name>
</author>
<author><name sortKey="Mcclung, Am" uniqKey="Mcclung A">AM McClung</name>
</author>
<author><name sortKey="Jia, Mh" uniqKey="Jia M">MH Jia</name>
</author>
<author><name sortKey="Kimball, Ja" uniqKey="Kimball J">JA Kimball</name>
</author>
<author><name sortKey="Mccouch, Sr" uniqKey="Mccouch S">SR McCouch</name>
</author>
<author><name sortKey="Eizenga, Gc" uniqKey="Eizenga G">GC Eizenga</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Garris, Aj" uniqKey="Garris A">AJ Garris</name>
</author>
<author><name sortKey="Mccouch, Sr" uniqKey="Mccouch S">SR McCouch</name>
</author>
<author><name sortKey="Kresovich, S" uniqKey="Kresovich S">S Kresovich</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Hattori, Y" uniqKey="Hattori Y">Y Hattori</name>
</author>
<author><name sortKey="Nagai, K" uniqKey="Nagai K">K Nagai</name>
</author>
<author><name sortKey="Furukawa, S" uniqKey="Furukawa S">S Furukawa</name>
</author>
<author><name sortKey="Song, Xj" uniqKey="Song X">XJ Song</name>
</author>
<author><name sortKey="Kawano, R" uniqKey="Kawano R">R Kawano</name>
</author>
<author><name sortKey="Sakakibara, H" uniqKey="Sakakibara H">H Sakakibara</name>
</author>
<author><name sortKey="Wu, J" uniqKey="Wu J">J Wu</name>
</author>
<author><name sortKey="Matsumoto, T" uniqKey="Matsumoto T">T Matsumoto</name>
</author>
<author><name sortKey="Yoshimura, A" uniqKey="Yoshimura A">A Yoshimura</name>
</author>
<author><name sortKey="Kitano, H" uniqKey="Kitano H">H Kitano</name>
</author>
<author><name sortKey="Matsuoka, M" uniqKey="Matsuoka M">M Matsuoka</name>
</author>
<author><name sortKey="Mori, H" uniqKey="Mori H">H Mori</name>
</author>
<author><name sortKey="Ashikari, M" uniqKey="Ashikari M">M Ashikari</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Bernier, J" uniqKey="Bernier J">J Bernier</name>
</author>
<author><name sortKey="Kumar, A" uniqKey="Kumar A">A Kumar</name>
</author>
<author><name sortKey="Venuprasad, R" uniqKey="Venuprasad R">R Venuprasad</name>
</author>
<author><name sortKey="Spaner, D" uniqKey="Spaner D">D Spaner</name>
</author>
<author><name sortKey="Verulkar, S" uniqKey="Verulkar S">S Verulkar</name>
</author>
<author><name sortKey="Mandal, N" uniqKey="Mandal N">N Mandal</name>
</author>
<author><name sortKey="Sinha, P" uniqKey="Sinha P">P Sinha</name>
</author>
<author><name sortKey="Peeraju, P" uniqKey="Peeraju P">P Peeraju</name>
</author>
<author><name sortKey="Dongre, P" uniqKey="Dongre P">P Dongre</name>
</author>
<author><name sortKey="Mahto, Rn" uniqKey="Mahto R">RN Mahto</name>
</author>
<author><name sortKey="Atlin, G" uniqKey="Atlin G">G Atlin</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Gnerre, S" uniqKey="Gnerre S">S Gnerre</name>
</author>
<author><name sortKey="Maccallum, I" uniqKey="Maccallum I">I MacCallum</name>
</author>
<author><name sortKey="Przybylski, D" uniqKey="Przybylski D">D Przybylski</name>
</author>
<author><name sortKey="Ribeiro, Fj" uniqKey="Ribeiro F">FJ Ribeiro</name>
</author>
<author><name sortKey="Burton, Jn" uniqKey="Burton J">JN Burton</name>
</author>
<author><name sortKey="Walker, Bj" uniqKey="Walker B">BJ Walker</name>
</author>
<author><name sortKey="Sharpe, T" uniqKey="Sharpe T">T Sharpe</name>
</author>
<author><name sortKey="Hall, G" uniqKey="Hall G">G Hall</name>
</author>
<author><name sortKey="Shea, Tp" uniqKey="Shea T">TP Shea</name>
</author>
<author><name sortKey="Sykes, S" uniqKey="Sykes S">S Sykes</name>
</author>
<author><name sortKey="Berlin, Am" uniqKey="Berlin A">AM Berlin</name>
</author>
<author><name sortKey="Aird, D" uniqKey="Aird D">D Aird</name>
</author>
<author><name sortKey="Costello, M" uniqKey="Costello M">M Costello</name>
</author>
<author><name sortKey="Daza, R" uniqKey="Daza R">R Daza</name>
</author>
<author><name sortKey="Williams, L" uniqKey="Williams L">L Williams</name>
</author>
<author><name sortKey="Nicol, R" uniqKey="Nicol R">R Nicol</name>
</author>
<author><name sortKey="Gnirke, A" uniqKey="Gnirke A">A Gnirke</name>
</author>
<author><name sortKey="Nusbaum, C" uniqKey="Nusbaum C">C Nusbaum</name>
</author>
<author><name sortKey="Lander, Es" uniqKey="Lander E">ES Lander</name>
</author>
<author><name sortKey="Jaffe, Db" uniqKey="Jaffe D">DB Jaffe</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Bradnam, Kr" uniqKey="Bradnam K">KR Bradnam</name>
</author>
<author><name sortKey="Fass, Jn" uniqKey="Fass J">JN Fass</name>
</author>
<author><name sortKey="Alexandrov, A" uniqKey="Alexandrov A">A Alexandrov</name>
</author>
<author><name sortKey="Baranay, P" uniqKey="Baranay P">P Baranay</name>
</author>
<author><name sortKey="Bechner, M" uniqKey="Bechner M">M Bechner</name>
</author>
<author><name sortKey="Birol, I" uniqKey="Birol I">I Birol</name>
</author>
<author><name sortKey="Boisvert, S" uniqKey="Boisvert S">S Boisvert</name>
</author>
<author><name sortKey="Chapman, Ja" uniqKey="Chapman J">JA Chapman</name>
</author>
<author><name sortKey="Chapuis, G" uniqKey="Chapuis G">G Chapuis</name>
</author>
<author><name sortKey="Chikhi, R" uniqKey="Chikhi R">R Chikhi</name>
</author>
<author><name sortKey="Chitsaz, H" uniqKey="Chitsaz H">H Chitsaz</name>
</author>
<author><name sortKey="Chou, Wc" uniqKey="Chou W">WC Chou</name>
</author>
<author><name sortKey="Corbeil, J" uniqKey="Corbeil J">J Corbeil</name>
</author>
<author><name sortKey="Del Fabbro, C" uniqKey="Del Fabbro C">C Del Fabbro</name>
</author>
<author><name sortKey="Docking, Tr" uniqKey="Docking T">TR Docking</name>
</author>
<author><name sortKey="Durbin, R" uniqKey="Durbin R">R Durbin</name>
</author>
<author><name sortKey="Earl, D" uniqKey="Earl D">D Earl</name>
</author>
<author><name sortKey="Emrich, S" uniqKey="Emrich S">S Emrich</name>
</author>
<author><name sortKey="Fedotov, P" uniqKey="Fedotov P">P Fedotov</name>
</author>
<author><name sortKey="Fonseca, Na" uniqKey="Fonseca N">NA Fonseca</name>
</author>
<author><name sortKey="Ganapathy, G" uniqKey="Ganapathy G">G Ganapathy</name>
</author>
<author><name sortKey="Gibbs, Ra" uniqKey="Gibbs R">RA Gibbs</name>
</author>
<author><name sortKey="Gnerre, S" uniqKey="Gnerre S">S Gnerre</name>
</author>
<author><name sortKey="Godzaridis, E" uniqKey="Godzaridis E">E Godzaridis</name>
</author>
<author><name sortKey="Goldstein, S" uniqKey="Goldstein S">S Goldstein</name>
</author>
<author><name sortKey="Haimel, M" uniqKey="Haimel M">M Haimel</name>
</author>
<author><name sortKey="Hall, G" uniqKey="Hall G">G Hall</name>
</author>
<author><name sortKey="Haussler, D" uniqKey="Haussler D">D Haussler</name>
</author>
<author><name sortKey="Hiatt, Jb" uniqKey="Hiatt J">JB Hiatt</name>
</author>
<author><name sortKey="Ho, Iy" uniqKey="Ho I">IY Ho</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Earl, D" uniqKey="Earl D">D Earl</name>
</author>
<author><name sortKey="Bradnam, K" uniqKey="Bradnam K">K Bradnam</name>
</author>
<author><name sortKey="St John, J" uniqKey="St John J">J St John</name>
</author>
<author><name sortKey="Darling, A" uniqKey="Darling A">A Darling</name>
</author>
<author><name sortKey="Lin, D" uniqKey="Lin D">D Lin</name>
</author>
<author><name sortKey="Fass, J" uniqKey="Fass J">J Fass</name>
</author>
<author><name sortKey="Yu, Ho" uniqKey="Yu H">HO Yu</name>
</author>
<author><name sortKey="Buffalo, V" uniqKey="Buffalo V">V Buffalo</name>
</author>
<author><name sortKey="Zerbino, Dr" uniqKey="Zerbino D">DR Zerbino</name>
</author>
<author><name sortKey="Diekhans, M" uniqKey="Diekhans M">M Diekhans</name>
</author>
<author><name sortKey="Nguyen, N" uniqKey="Nguyen N">N Nguyen</name>
</author>
<author><name sortKey="Ariyaratne, Pn" uniqKey="Ariyaratne P">PN Ariyaratne</name>
</author>
<author><name sortKey="Sung, Wk" uniqKey="Sung W">WK Sung</name>
</author>
<author><name sortKey="Ning, Z" uniqKey="Ning Z">Z Ning</name>
</author>
<author><name sortKey="Haimel, M" uniqKey="Haimel M">M Haimel</name>
</author>
<author><name sortKey="Simpson, Jt" uniqKey="Simpson J">JT Simpson</name>
</author>
<author><name sortKey="Fonseca, Na" uniqKey="Fonseca N">NA Fonseca</name>
</author>
<author><name sortKey="Birol, I" uniqKey="Birol I">I Birol</name>
</author>
<author><name sortKey="Docking, Tr" uniqKey="Docking T">TR Docking</name>
</author>
<author><name sortKey="Ho, Iy" uniqKey="Ho I">IY Ho</name>
</author>
<author><name sortKey="Rokhsar, Ds" uniqKey="Rokhsar D">DS Rokhsar</name>
</author>
<author><name sortKey="Chikhi, R" uniqKey="Chikhi R">R Chikhi</name>
</author>
<author><name sortKey="Lavenier, D" uniqKey="Lavenier D">D Lavenier</name>
</author>
<author><name sortKey="Chapuis, G" uniqKey="Chapuis G">G Chapuis</name>
</author>
<author><name sortKey="Naquin, D" uniqKey="Naquin D">D Naquin</name>
</author>
<author><name sortKey="Maillet, N" uniqKey="Maillet N">N Maillet</name>
</author>
<author><name sortKey="Schatz, Mc" uniqKey="Schatz M">MC Schatz</name>
</author>
<author><name sortKey="Kelley, Dr" uniqKey="Kelley D">DR Kelley</name>
</author>
<author><name sortKey="Phillippy, Am" uniqKey="Phillippy A">AM Phillippy</name>
</author>
<author><name sortKey="Koren, S" uniqKey="Koren S">S Koren</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Salzberg, Sl" uniqKey="Salzberg S">SL Salzberg</name>
</author>
<author><name sortKey="Phillippy, Am" uniqKey="Phillippy A">AM Phillippy</name>
</author>
<author><name sortKey="Zimin, A" uniqKey="Zimin A">A Zimin</name>
</author>
<author><name sortKey="Puiu, D" uniqKey="Puiu D">D Puiu</name>
</author>
<author><name sortKey="Magoc, T" uniqKey="Magoc T">T Magoc</name>
</author>
<author><name sortKey="Koren, S" uniqKey="Koren S">S Koren</name>
</author>
<author><name sortKey="Treangen, Tj" uniqKey="Treangen T">TJ Treangen</name>
</author>
<author><name sortKey="Schatz, Mc" uniqKey="Schatz M">MC Schatz</name>
</author>
<author><name sortKey="Delcher, Al" uniqKey="Delcher A">AL Delcher</name>
</author>
<author><name sortKey="Roberts, M" uniqKey="Roberts M">M Roberts</name>
</author>
<author><name sortKey="Marcais, G" uniqKey="Marcais G">G Marcais</name>
</author>
<author><name sortKey="Pop, M" uniqKey="Pop M">M Pop</name>
</author>
<author><name sortKey="Yorke, Ja" uniqKey="Yorke J">JA Yorke</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Kawahara, Y" uniqKey="Kawahara Y">Y Kawahara</name>
</author>
<author><name sortKey="De La Bastide, M" uniqKey="De La Bastide M">M de la Bastide</name>
</author>
<author><name sortKey="Hamilton, Jp" uniqKey="Hamilton J">JP Hamilton</name>
</author>
<author><name sortKey="Kanamori, H" uniqKey="Kanamori H">H Kanamori</name>
</author>
<author><name sortKey="Mccombie, Wr" uniqKey="Mccombie W">WR McCombie</name>
</author>
<author><name sortKey="Ouyang, S" uniqKey="Ouyang S">S Ouyang</name>
</author>
<author><name sortKey="Schwartz, Dc" uniqKey="Schwartz D">DC Schwartz</name>
</author>
<author><name sortKey="Tanaka, T" uniqKey="Tanaka T">T Tanaka</name>
</author>
<author><name sortKey="Wu, J" uniqKey="Wu J">J Wu</name>
</author>
<author><name sortKey="Zhou, S" uniqKey="Zhou S">S Zhou</name>
</author>
<author><name sortKey="Childs, Kl" uniqKey="Childs K">KL Childs</name>
</author>
<author><name sortKey="Davidson, Rm" uniqKey="Davidson R">RM Davidson</name>
</author>
<author><name sortKey="Lin, H" uniqKey="Lin H">H Lin</name>
</author>
<author><name sortKey="Quesada Ocampo, L" uniqKey="Quesada Ocampo L">L Quesada-Ocampo</name>
</author>
<author><name sortKey="Vaillancourt, B" uniqKey="Vaillancourt B">B Vaillancourt</name>
</author>
<author><name sortKey="Sakai, H" uniqKey="Sakai H">H Sakai</name>
</author>
<author><name sortKey="Lee, Ss" uniqKey="Lee S">SS Lee</name>
</author>
<author><name sortKey="Kim, J" uniqKey="Kim J">J Kim</name>
</author>
<author><name sortKey="Numa, H" uniqKey="Numa H">H Numa</name>
</author>
<author><name sortKey="Itoh, T" uniqKey="Itoh T">T Itoh</name>
</author>
<author><name sortKey="Buell, Cr" uniqKey="Buell C">CR Buell</name>
</author>
<author><name sortKey="Matsumoto, T" uniqKey="Matsumoto T">T Matsumoto</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Campbell, Ms" uniqKey="Campbell M">MS Campbell</name>
</author>
<author><name sortKey="Law, M" uniqKey="Law M">M Law</name>
</author>
<author><name sortKey="Holt, C" uniqKey="Holt C">C Holt</name>
</author>
<author><name sortKey="Stein, Jc" uniqKey="Stein J">JC Stein</name>
</author>
<author><name sortKey="Moghe, Gd" uniqKey="Moghe G">GD Moghe</name>
</author>
<author><name sortKey="Hufnagel, De" uniqKey="Hufnagel D">DE Hufnagel</name>
</author>
<author><name sortKey="Lei, J" uniqKey="Lei J">J Lei</name>
</author>
<author><name sortKey="Achawanantakun, R" uniqKey="Achawanantakun R">R Achawanantakun</name>
</author>
<author><name sortKey="Jiao, D" uniqKey="Jiao D">D Jiao</name>
</author>
<author><name sortKey="Lawrence, Cj" uniqKey="Lawrence C">CJ Lawrence</name>
</author>
<author><name sortKey="Ware, D" uniqKey="Ware D">D Ware</name>
</author>
<author><name sortKey="Shiu, Sh" uniqKey="Shiu S">SH Shiu</name>
</author>
<author><name sortKey="Childs, Kl" uniqKey="Childs K">KL Childs</name>
</author>
<author><name sortKey="Sun, Y" uniqKey="Sun Y">Y Sun</name>
</author>
<author><name sortKey="Jiang, N" uniqKey="Jiang N">N Jiang</name>
</author>
<author><name sortKey="Yandell, M" uniqKey="Yandell M">M Yandell</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Lipman, Dj" uniqKey="Lipman D">DJ Lipman</name>
</author>
<author><name sortKey="Souvorov, A" uniqKey="Souvorov A">A Souvorov</name>
</author>
<author><name sortKey="Koonin, Ev" uniqKey="Koonin E">EV Koonin</name>
</author>
<author><name sortKey="Panchenko, Ar" uniqKey="Panchenko A">AR Panchenko</name>
</author>
<author><name sortKey="Tatusova, Ta" uniqKey="Tatusova T">TA Tatusova</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Capra, Ja" uniqKey="Capra J">JA Capra</name>
</author>
<author><name sortKey="Pollard, Ks" uniqKey="Pollard K">KS Pollard</name>
</author>
<author><name sortKey="Singh, M" uniqKey="Singh M">M Singh</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Cai, Jj" uniqKey="Cai J">JJ Cai</name>
</author>
<author><name sortKey="Petrov, Da" uniqKey="Petrov D">DA Petrov</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Yanagihara, S" uniqKey="Yanagihara S">S Yanagihara</name>
</author>
<author><name sortKey="Mccouch, Sr" uniqKey="Mccouch S">SR McCouch</name>
</author>
<author><name sortKey="Ishikawa, K" uniqKey="Ishikawa K">K Ishikawa</name>
</author>
<author><name sortKey="Ogi, Y" uniqKey="Ogi Y">Y Ogi</name>
</author>
<author><name sortKey="Maruyama, K" uniqKey="Maruyama K">K Maruyama</name>
</author>
<author><name sortKey="Ikehashi, H" uniqKey="Ikehashi H">H Ikehashi</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Chen, Jj" uniqKey="Chen J">JJ Chen</name>
</author>
<author><name sortKey="Ding, Jh" uniqKey="Ding J">JH Ding</name>
</author>
<author><name sortKey="Ouyang, Yd" uniqKey="Ouyang Y">YD Ouyang</name>
</author>
<author><name sortKey="Du, Hy" uniqKey="Du H">HY Du</name>
</author>
<author><name sortKey="Yang, Jy" uniqKey="Yang J">JY Yang</name>
</author>
<author><name sortKey="Cheng, K" uniqKey="Cheng K">K Cheng</name>
</author>
<author><name sortKey="Zhao, J" uniqKey="Zhao J">J Zhao</name>
</author>
<author><name sortKey="Qiu, Sq" uniqKey="Qiu S">SQ Qiu</name>
</author>
<author><name sortKey="Zhang, Xl" uniqKey="Zhang X">XL Zhang</name>
</author>
<author><name sortKey="Yao, Jl" uniqKey="Yao J">JL Yao</name>
</author>
<author><name sortKey="Liu, Kd" uniqKey="Liu K">KD Liu</name>
</author>
<author><name sortKey="Wang, L" uniqKey="Wang L">L Wang</name>
</author>
<author><name sortKey="Xu, Cg" uniqKey="Xu C">CG Xu</name>
</author>
<author><name sortKey="Li, Xh" uniqKey="Li X">XH Li</name>
</author>
<author><name sortKey="Xue, Yb" uniqKey="Xue Y">YB Xue</name>
</author>
<author><name sortKey="Xia, M" uniqKey="Xia M">M Xia</name>
</author>
<author><name sortKey="Ji, Q" uniqKey="Ji Q">Q Ji</name>
</author>
<author><name sortKey="Lu, Jf" uniqKey="Lu J">JF Lu</name>
</author>
<author><name sortKey="Xu, Ml" uniqKey="Xu M">ML Xu</name>
</author>
<author><name sortKey="Zhang, Qf" uniqKey="Zhang Q">QF Zhang</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Yang, J" uniqKey="Yang J">J Yang</name>
</author>
<author><name sortKey="Zhao, X" uniqKey="Zhao X">X Zhao</name>
</author>
<author><name sortKey="Cheng, K" uniqKey="Cheng K">K Cheng</name>
</author>
<author><name sortKey="Du, H" uniqKey="Du H">H Du</name>
</author>
<author><name sortKey="Ouyang, Y" uniqKey="Ouyang Y">Y Ouyang</name>
</author>
<author><name sortKey="Chen, J" uniqKey="Chen J">J Chen</name>
</author>
<author><name sortKey="Qiu, S" uniqKey="Qiu S">S Qiu</name>
</author>
<author><name sortKey="Huang, J" uniqKey="Huang J">J Huang</name>
</author>
<author><name sortKey="Jiang, Y" uniqKey="Jiang Y">Y Jiang</name>
</author>
<author><name sortKey="Jiang, L" uniqKey="Jiang L">L Jiang</name>
</author>
<author><name sortKey="Ding, J" uniqKey="Ding J">J Ding</name>
</author>
<author><name sortKey="Wang, J" uniqKey="Wang J">J Wang</name>
</author>
<author><name sortKey="Xu, C" uniqKey="Xu C">C Xu</name>
</author>
<author><name sortKey="Li, X" uniqKey="Li X">X Li</name>
</author>
<author><name sortKey="Zhang, Q" uniqKey="Zhang Q">Q Zhang</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="He, Gm" uniqKey="He G">GM He</name>
</author>
<author><name sortKey="Luo, Xj" uniqKey="Luo X">XJ Luo</name>
</author>
<author><name sortKey="Tian, F" uniqKey="Tian F">F Tian</name>
</author>
<author><name sortKey="Li, Kg" uniqKey="Li K">KG Li</name>
</author>
<author><name sortKey="Zhu, Zf" uniqKey="Zhu Z">ZF Zhu</name>
</author>
<author><name sortKey="Su, W" uniqKey="Su W">W Su</name>
</author>
<author><name sortKey="Qian, Xy" uniqKey="Qian X">XY Qian</name>
</author>
<author><name sortKey="Fu, Yc" uniqKey="Fu Y">YC Fu</name>
</author>
<author><name sortKey="Wang, Xk" uniqKey="Wang X">XK Wang</name>
</author>
<author><name sortKey="Sun, Cq" uniqKey="Sun C">CQ Sun</name>
</author>
<author><name sortKey="Yang, Js" uniqKey="Yang J">JS Yang</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Wissuwa, M" uniqKey="Wissuwa M">M Wissuwa</name>
</author>
<author><name sortKey="Wegner, J" uniqKey="Wegner J">J Wegner</name>
</author>
<author><name sortKey="Ae, N" uniqKey="Ae N">N Ae</name>
</author>
<author><name sortKey="Yano, M" uniqKey="Yano M">M Yano</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Wissuwa, M" uniqKey="Wissuwa M">M Wissuwa</name>
</author>
<author><name sortKey="Yano, M" uniqKey="Yano M">M Yano</name>
</author>
<author><name sortKey="Ae, N" uniqKey="Ae N">N Ae</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Chin, Jh" uniqKey="Chin J">JH Chin</name>
</author>
<author><name sortKey="Gamuyao, R" uniqKey="Gamuyao R">R Gamuyao</name>
</author>
<author><name sortKey="Dalid, C" uniqKey="Dalid C">C Dalid</name>
</author>
<author><name sortKey="Bustamam, M" uniqKey="Bustamam M">M Bustamam</name>
</author>
<author><name sortKey="Prasetiyono, J" uniqKey="Prasetiyono J">J Prasetiyono</name>
</author>
<author><name sortKey="Moeljopawiro, S" uniqKey="Moeljopawiro S">S Moeljopawiro</name>
</author>
<author><name sortKey="Wissuwa, M" uniqKey="Wissuwa M">M Wissuwa</name>
</author>
<author><name sortKey="Heuer, S" uniqKey="Heuer S">S Heuer</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Eizenga, Gcam" uniqKey="Eizenga G">GCAM Eizenga</name>
</author>
<author><name sortKey="Bryant, Rj" uniqKey="Bryant R">RJ Bryant</name>
</author>
<author><name sortKey="Yeater, Km" uniqKey="Yeater K">KM Yeater</name>
</author>
<author><name sortKey="Mcclung, Am" uniqKey="Mcclung A">AM McClung</name>
</author>
<author><name sortKey="Mccouch, Sr" uniqKey="Mccouch S">SR McCouch</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Bin Rahman, An" uniqKey="Bin Rahman A">AN Bin Rahman</name>
</author>
<author><name sortKey="Zhang, J" uniqKey="Zhang J">J Zhang</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Roberts, Rj" uniqKey="Roberts R">RJ Roberts</name>
</author>
<author><name sortKey="Carneiro, Mo" uniqKey="Carneiro M">MO Carneiro</name>
</author>
<author><name sortKey="Schatz, Mc" uniqKey="Schatz M">MC Schatz</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Luo, R" uniqKey="Luo R">R Luo</name>
</author>
<author><name sortKey="Liu, B" uniqKey="Liu B">B Liu</name>
</author>
<author><name sortKey="Xie, Y" uniqKey="Xie Y">Y Xie</name>
</author>
<author><name sortKey="Li, Z" uniqKey="Li Z">Z Li</name>
</author>
<author><name sortKey="Huang, W" uniqKey="Huang W">W Huang</name>
</author>
<author><name sortKey="Yuan, J" uniqKey="Yuan J">J Yuan</name>
</author>
<author><name sortKey="He, G" uniqKey="He G">G He</name>
</author>
<author><name sortKey="Chen, Y" uniqKey="Chen Y">Y Chen</name>
</author>
<author><name sortKey="Pan, Q" uniqKey="Pan Q">Q Pan</name>
</author>
<author><name sortKey="Liu, Y" uniqKey="Liu Y">Y Liu</name>
</author>
<author><name sortKey="Tang, J" uniqKey="Tang J">J Tang</name>
</author>
<author><name sortKey="Wu, G" uniqKey="Wu G">G Wu</name>
</author>
<author><name sortKey="Zhang, H" uniqKey="Zhang H">H Zhang</name>
</author>
<author><name sortKey="Shi, Y" uniqKey="Shi Y">Y Shi</name>
</author>
<author><name sortKey="Yu, C" uniqKey="Yu C">C Yu</name>
</author>
<author><name sortKey="Wang, B" uniqKey="Wang B">B Wang</name>
</author>
<author><name sortKey="Lu, Y" uniqKey="Lu Y">Y Lu</name>
</author>
<author><name sortKey="Han, C" uniqKey="Han C">C Han</name>
</author>
<author><name sortKey="Cheung, Dw" uniqKey="Cheung D">DW Cheung</name>
</author>
<author><name sortKey="Yiu, Sm" uniqKey="Yiu S">SM Yiu</name>
</author>
<author><name sortKey="Peng, S" uniqKey="Peng S">S Peng</name>
</author>
<author><name sortKey="Xiaoqian, Z" uniqKey="Xiaoqian Z">Z Xiaoqian</name>
</author>
<author><name sortKey="Liu, G" uniqKey="Liu G">G Liu</name>
</author>
<author><name sortKey="Liao, X" uniqKey="Liao X">X Liao</name>
</author>
<author><name sortKey="Li, Y" uniqKey="Li Y">Y Li</name>
</author>
<author><name sortKey="Yang, H" uniqKey="Yang H">H Yang</name>
</author>
<author><name sortKey="Wang, J" uniqKey="Wang J">J Wang</name>
</author>
<author><name sortKey="Lam, Tw" uniqKey="Lam T">TW Lam</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Simpson, Jt" uniqKey="Simpson J">JT Simpson</name>
</author>
<author><name sortKey="Durbin, R" uniqKey="Durbin R">R Durbin</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Kelley, Dr" uniqKey="Kelley D">DR Kelley</name>
</author>
<author><name sortKey="Schatz, Mc" uniqKey="Schatz M">MC Schatz</name>
</author>
<author><name sortKey="Salzberg, Sl" uniqKey="Salzberg S">SL Salzberg</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct><analytic><author><name sortKey="Cantarel, Bl" uniqKey="Cantarel B">BL Cantarel</name>
</author>
<author><name sortKey="Korf, I" uniqKey="Korf I">I Korf</name>
</author>
<author><name sortKey="Robb, Sm" uniqKey="Robb S">SM Robb</name>
</author>
<author><name sortKey="Parra, G" uniqKey="Parra G">G Parra</name>
</author>
<author><name sortKey="Ross, E" uniqKey="Ross E">E Ross</name>
</author>
<author><name sortKey="Moore, B" uniqKey="Moore B">B Moore</name>
</author>
<author><name sortKey="Holt, C" uniqKey="Holt C">C Holt</name>
</author>
<author><name sortKey="Sanchez Alvarado, A" uniqKey="Sanchez Alvarado A">A Sanchez Alvarado</name>
</author>
<author><name sortKey="Yandell, M" uniqKey="Yandell M">M Yandell</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Goff, Sa" uniqKey="Goff S">SA Goff</name>
</author>
<author><name sortKey="Vaughn, M" uniqKey="Vaughn M">M Vaughn</name>
</author>
<author><name sortKey="Mckay, S" uniqKey="Mckay S">S McKay</name>
</author>
<author><name sortKey="Lyons, E" uniqKey="Lyons E">E Lyons</name>
</author>
<author><name sortKey="Stapleton, Ae" uniqKey="Stapleton A">AE Stapleton</name>
</author>
<author><name sortKey="Gessler, D" uniqKey="Gessler D">D Gessler</name>
</author>
<author><name sortKey="Matasci, N" uniqKey="Matasci N">N Matasci</name>
</author>
<author><name sortKey="Wang, L" uniqKey="Wang L">L Wang</name>
</author>
<author><name sortKey="Hanlon, M" uniqKey="Hanlon M">M Hanlon</name>
</author>
<author><name sortKey="Lenards, A" uniqKey="Lenards A">A Lenards</name>
</author>
<author><name sortKey="Muir, A" uniqKey="Muir A">A Muir</name>
</author>
<author><name sortKey="Merchant, N" uniqKey="Merchant N">N Merchant</name>
</author>
<author><name sortKey="Lowry, S" uniqKey="Lowry S">S Lowry</name>
</author>
<author><name sortKey="Mock, S" uniqKey="Mock S">S Mock</name>
</author>
<author><name sortKey="Helmke, M" uniqKey="Helmke M">M Helmke</name>
</author>
<author><name sortKey="Kubach, A" uniqKey="Kubach A">A Kubach</name>
</author>
<author><name sortKey="Narro, M" uniqKey="Narro M">M Narro</name>
</author>
<author><name sortKey="Hopkins, N" uniqKey="Hopkins N">N Hopkins</name>
</author>
<author><name sortKey="Micklos, D" uniqKey="Micklos D">D Micklos</name>
</author>
<author><name sortKey="Hilgert, U" uniqKey="Hilgert U">U Hilgert</name>
</author>
<author><name sortKey="Gonzales, M" uniqKey="Gonzales M">M Gonzales</name>
</author>
<author><name sortKey="Jordan, C" uniqKey="Jordan C">C Jordan</name>
</author>
<author><name sortKey="Skidmore, E" uniqKey="Skidmore E">E Skidmore</name>
</author>
<author><name sortKey="Dooley, R" uniqKey="Dooley R">R Dooley</name>
</author>
<author><name sortKey="Cazes, J" uniqKey="Cazes J">J Cazes</name>
</author>
<author><name sortKey="Mclay, R" uniqKey="Mclay R">R McLay</name>
</author>
<author><name sortKey="Lu, Z" uniqKey="Lu Z">Z Lu</name>
</author>
<author><name sortKey="Pasternak, S" uniqKey="Pasternak S">S Pasternak</name>
</author>
<author><name sortKey="Koesterke, L" uniqKey="Koesterke L">L Koesterke</name>
</author>
<author><name sortKey="Piel, Wh" uniqKey="Piel W">WH Piel</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Holt, C" uniqKey="Holt C">C Holt</name>
</author>
<author><name sortKey="Yandell, M" uniqKey="Yandell M">M Yandell</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Salamov, Aa" uniqKey="Salamov A">AA Salamov</name>
</author>
<author><name sortKey="Solovyev, Vv" uniqKey="Solovyev V">VV Solovyev</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Korf, I" uniqKey="Korf I">I Korf</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Jones, P" uniqKey="Jones P">P Jones</name>
</author>
<author><name sortKey="Binns, D" uniqKey="Binns D">D Binns</name>
</author>
<author><name sortKey="Chang, Hy" uniqKey="Chang H">HY Chang</name>
</author>
<author><name sortKey="Fraser, M" uniqKey="Fraser M">M Fraser</name>
</author>
<author><name sortKey="Li, W" uniqKey="Li W">W Li</name>
</author>
<author><name sortKey="Mcanulla, C" uniqKey="Mcanulla C">C McAnulla</name>
</author>
<author><name sortKey="Mcwilliam, H" uniqKey="Mcwilliam H">H McWilliam</name>
</author>
<author><name sortKey="Maslen, J" uniqKey="Maslen J">J Maslen</name>
</author>
<author><name sortKey="Mitchell, A" uniqKey="Mitchell A">A Mitchell</name>
</author>
<author><name sortKey="Nuka, G" uniqKey="Nuka G">G Nuka</name>
</author>
<author><name sortKey="Pesseat, S" uniqKey="Pesseat S">S Pesseat</name>
</author>
<author><name sortKey="Quinn, Af" uniqKey="Quinn A">AF Quinn</name>
</author>
<author><name sortKey="Sangrador Vegas, A" uniqKey="Sangrador Vegas A">A Sangrador-Vegas</name>
</author>
<author><name sortKey="Scheremetjew, M" uniqKey="Scheremetjew M">M Scheremetjew</name>
</author>
<author><name sortKey="Yong, Sy" uniqKey="Yong S">SY Yong</name>
</author>
<author><name sortKey="Lopez, R" uniqKey="Lopez R">R Lopez</name>
</author>
<author><name sortKey="Hunter, S" uniqKey="Hunter S">S Hunter</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Oliver, Sl" uniqKey="Oliver S">SL Oliver</name>
</author>
<author><name sortKey="Lenards, Aj" uniqKey="Lenards A">AJ Lenards</name>
</author>
<author><name sortKey="Barthelson, Ra" uniqKey="Barthelson R">RA Barthelson</name>
</author>
<author><name sortKey="Merchant, N" uniqKey="Merchant N">N Merchant</name>
</author>
<author><name sortKey="Mckay, Sj" uniqKey="Mckay S">SJ McKay</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Kurtz, S" uniqKey="Kurtz S">S Kurtz</name>
</author>
<author><name sortKey="Phillippy, A" uniqKey="Phillippy A">A Phillippy</name>
</author>
<author><name sortKey="Delcher, Al" uniqKey="Delcher A">AL Delcher</name>
</author>
<author><name sortKey="Smoot, M" uniqKey="Smoot M">M Smoot</name>
</author>
<author><name sortKey="Shumway, M" uniqKey="Shumway M">M Shumway</name>
</author>
<author><name sortKey="Antonescu, C" uniqKey="Antonescu C">C Antonescu</name>
</author>
<author><name sortKey="Salzberg, Sl" uniqKey="Salzberg S">SL Salzberg</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Quinlan, Ar" uniqKey="Quinlan A">AR Quinlan</name>
</author>
<author><name sortKey="Hall, Im" uniqKey="Hall I">IM Hall</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Schatz, Mc" uniqKey="Schatz M">MC Schatz</name>
</author>
<author><name sortKey="Phillippy, Am" uniqKey="Phillippy A">AM Phillippy</name>
</author>
<author><name sortKey="Sommer, Dd" uniqKey="Sommer D">DD Sommer</name>
</author>
<author><name sortKey="Delcher, Al" uniqKey="Delcher A">AL Delcher</name>
</author>
<author><name sortKey="Puiu, D" uniqKey="Puiu D">D Puiu</name>
</author>
<author><name sortKey="Narzisi, G" uniqKey="Narzisi G">G Narzisi</name>
</author>
<author><name sortKey="Salzberg, Sl" uniqKey="Salzberg S">SL Salzberg</name>
</author>
<author><name sortKey="Pop, M" uniqKey="Pop M">M Pop</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Marcais, G" uniqKey="Marcais G">G Marcais</name>
</author>
<author><name sortKey="Kingsford, C" uniqKey="Kingsford C">C Kingsford</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Kurtz, S" uniqKey="Kurtz S">S Kurtz</name>
</author>
<author><name sortKey="Narechania, A" uniqKey="Narechania A">A Narechania</name>
</author>
<author><name sortKey="Stein, Jc" uniqKey="Stein J">JC Stein</name>
</author>
<author><name sortKey="Ware, D" uniqKey="Ware D">D Ware</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Phillippy, Am" uniqKey="Phillippy A">AM Phillippy</name>
</author>
<author><name sortKey="Schatz, Mc" uniqKey="Schatz M">MC Schatz</name>
</author>
<author><name sortKey="Pop, M" uniqKey="Pop M">M Pop</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Reyes, J" uniqKey="Reyes J">J Reyes</name>
</author>
<author><name sortKey="Gomez Romero, L" uniqKey="Gomez Romero L">L Gomez-Romero</name>
</author>
<author><name sortKey="Ibarra Soria, X" uniqKey="Ibarra Soria X">X Ibarra-Soria</name>
</author>
<author><name sortKey="Palacios Flores, K" uniqKey="Palacios Flores K">K Palacios-Flores</name>
</author>
<author><name sortKey="Arriola, Lr" uniqKey="Arriola L">LR Arriola</name>
</author>
<author><name sortKey="Wences, A" uniqKey="Wences A">A Wences</name>
</author>
<author><name sortKey="Garcia, D" uniqKey="Garcia D">D Garcia</name>
</author>
<author><name sortKey="Boege, M" uniqKey="Boege M">M Boege</name>
</author>
<author><name sortKey="Davila, G" uniqKey="Davila G">G Davila</name>
</author>
<author><name sortKey="Flores, M" uniqKey="Flores M">M Flores</name>
</author>
<author><name sortKey="Palacios, R" uniqKey="Palacios R">R Palacios</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
</listBibl>
</div1>
</back>
</TEI>
<pmc article-type="research-article"><pmc-dir>properties open_access</pmc-dir>
  <front><journal-meta><journal-id journal-id-type="nlm-ta">Genome Biol</journal-id>
<journal-title-group><journal-title>Genome Biology</journal-title>
</journal-title-group>
<issn pub-type="ppub">1465-6906</issn>
<issn pub-type="epub">1465-6914</issn>
<publisher><publisher-name>BioMed Central</publisher-name>
<publisher-loc>London</publisher-loc>
</publisher>
</journal-meta>
<article-meta><article-id pub-id-type="pmid">25468217</article-id>
<article-id pub-id-type="pmc">4268812</article-id>
<article-id pub-id-type="publisher-id">506</article-id>
<article-id pub-id-type="doi">10.1186/s13059-014-0506-z</article-id>
<article-categories><subj-group subj-group-type="heading"><subject>Research</subject>
</subj-group>
</article-categories>
<title-group><article-title>Whole genome <italic>de novo</italic>
 assemblies of three divergent strains of rice, <italic>Oryza sativa</italic>
, document novel gene space of <italic>aus</italic>
 and <italic>indica</italic>
</article-title>
</title-group>
<contrib-group><contrib contrib-type="author" equal-contrib="yes"><name><surname>Schatz</surname>
<given-names>Michael C</given-names>
</name>
<address><email>mschatz@cshl.edu</email>
</address>
<xref ref-type="aff" rid="Aff1"></xref>
</contrib>
<contrib contrib-type="author" equal-contrib="yes"><name><surname>Maron</surname>
<given-names>Lyza G</given-names>
</name>
<address><email>lyza.maron@cornell.edu</email>
</address>
<xref ref-type="aff" rid="Aff2"></xref>
</contrib>
<contrib contrib-type="author" equal-contrib="yes"><name><surname>Stein</surname>
<given-names>Joshua C</given-names>
</name>
<address><email>steinj@cshl.edu</email>
</address>
<xref ref-type="aff" rid="Aff1"></xref>
</contrib>
<contrib contrib-type="author"><name><surname>Wences</surname>
<given-names>Alejandro Hernandez</given-names>
</name>
<address><email>alhernan@cshl.edu</email>
</address>
<xref ref-type="aff" rid="Aff1"></xref>
<xref ref-type="aff" rid="Aff3"></xref>
</contrib>
<contrib contrib-type="author"><name><surname>Gurtowski</surname>
<given-names>James</given-names>
</name>
<address><email>gurtowsk@cshl.edu</email>
</address>
<xref ref-type="aff" rid="Aff1"></xref>
</contrib>
<contrib contrib-type="author"><name><surname>Biggers</surname>
<given-names>Eric</given-names>
</name>
<address><email>ebiggers@macalester.edu</email>
</address>
<xref ref-type="aff" rid="Aff1"></xref>
<xref ref-type="aff" rid="Aff4"></xref>
</contrib>
<contrib contrib-type="author"><name><surname>Lee</surname>
<given-names>Hayan</given-names>
</name>
<address><email>hlee@cshl.edu</email>
</address>
<xref ref-type="aff" rid="Aff1"></xref>
<xref ref-type="aff" rid="Aff5"></xref>
</contrib>
<contrib contrib-type="author"><name><surname>Kramer</surname>
<given-names>Melissa</given-names>
</name>
<address><email>delabast@cshl.edu</email>
</address>
<xref ref-type="aff" rid="Aff1"></xref>
</contrib>
<contrib contrib-type="author"><name><surname>Antoniou</surname>
<given-names>Eric</given-names>
</name>
<address><email>eantonio@cshl.edu</email>
</address>
<xref ref-type="aff" rid="Aff1"></xref>
</contrib>
<contrib contrib-type="author"><name><surname>Ghiban</surname>
<given-names>Elena</given-names>
</name>
<address><email>ghibane@cshl.edu</email>
</address>
<xref ref-type="aff" rid="Aff1"></xref>
</contrib>
<contrib contrib-type="author"><name><surname>Wright</surname>
<given-names>Mark H</given-names>
</name>
<address><email>mhw6@cornell.edu</email>
</address>
<xref ref-type="aff" rid="Aff2"></xref>
</contrib>
<contrib contrib-type="author"><name><surname>Chia</surname>
<given-names>Jer-Ming</given-names>
</name>
<address><email>jermth@gmail.com</email>
</address>
<xref ref-type="aff" rid="Aff1"></xref>
</contrib>
<contrib contrib-type="author"><name><surname>Ware</surname>
<given-names>Doreen</given-names>
</name>
<address><email>ware@cshl.edu</email>
</address>
<xref ref-type="aff" rid="Aff1"></xref>
<xref ref-type="aff" rid="Aff6"></xref>
</contrib>
<contrib contrib-type="author" corresp="yes"><name><surname>McCouch</surname>
<given-names>Susan R</given-names>
</name>
<address><email>srm4@cornell.edu</email>
</address>
<xref ref-type="aff" rid="Aff2"></xref>
</contrib>
<contrib contrib-type="author" corresp="yes"><name><surname>McCombie</surname>
<given-names>W Richard</given-names>
</name>
<address><email>mccombie@cshl.edu</email>
</address>
<xref ref-type="aff" rid="Aff1"></xref>
</contrib>
<aff id="Aff1"><label></label>
Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724 USA</aff>
<aff id="Aff2"><label></label>
Department of Plant Breeding and Genetics, Cornell University, Ithaca, NY 14853 USA</aff>
<aff id="Aff3"><label></label>
Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, 62210 Morelos Mexico</aff>
<aff id="Aff4"><label></label>
Macalester College, St Paul, MN 55105 USA</aff>
<aff id="Aff5"><label></label>
Stony Brook University, Stony Brook, NY 11794 USA</aff>
<aff id="Aff6"><label></label>
USDA-ARS NAA Plant, Soil and Nutrition Laboratory Research Unit, Cornell University, Ithaca, NY 14853 USA</aff>
</contrib-group>
<pub-date pub-type="epub"><day>3</day>
<month>12</month>
<year>2014</year>
</pub-date>
<pub-date pub-type="pmc-release"><day>3</day>
<month>12</month>
<year>2014</year>
</pub-date>
<pub-date pub-type="ppub"><year>2014</year>
</pub-date>
<volume>15</volume>
<issue>11</issue>
<elocation-id>506</elocation-id>
<history><date date-type="received"><day>24</day>
<month>4</month>
<year>2014</year>
</date>
<date date-type="accepted"><day>21</day>
<month>10</month>
<year>2014</year>
</date>
</history>
<permissions><copyright-statement>© Schatz et al.; licensee BioMed Central Ltd. 2014</copyright-statement>
<license license-type="open-access"><license-p>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/licenses/by/4.0">http://creativecommons.org/licenses/by/4.0</ext-link>
), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (<ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/publicdomain/zero/1.0/">http://creativecommons.org/publicdomain/zero/1.0/</ext-link>
) applies to the data made available in this article, unless otherwise stated.</license-p>
</license>
</permissions>
<abstract id="Abs1"><sec><title>Background</title>
<p>The use of high throughput genome-sequencing technologies has uncovered a large extent of structural variation in eukaryotic genomes that makes important contributions to genomic diversity and phenotypic variation. When the genomes of different strains of a given organism are compared, whole genome resequencing data are typically aligned to an established reference sequence. However, when the reference differs in significant structural ways from the individuals under study, the analysis is often incomplete or inaccurate.</p>
</sec>
<sec><title>Results</title>
<p>Here, we use rice as a model to demonstrate how improvements in sequencing and assembly technology allow rapid and inexpensive <italic>de novo</italic>
 assembly of next generation sequence data into high-quality assemblies that can be directly compared using whole genome alignment to provide an unbiased assessment. Using this approach, we are able to accurately assess the ‘pan-genome’ of three divergent rice varieties and document several megabases of each genome absent in the other two.</p>
</sec>
<sec><title>Conclusions</title>
<p>Many of the genome-specific loci are annotated to contain genes, reflecting the potential for new biological properties that would be missed by standard reference-mapping approaches. We further provide a detailed analysis of several loci associated with agriculturally important traits, including the <italic>S5</italic>
 hybrid sterility locus, the <italic>Sub1</italic>
 submergence tolerance locus, the <italic>LRK</italic>
 gene cluster associated with improved yield, and the <italic>Pup1</italic>
 cluster associated with phosphorus deficiency, illustrating the utility of our approach for biological discovery. All of the data and software are openly available to support further breeding and functional studies of rice and other species.</p>
</sec>
<sec><title>Electronic supplementary material</title>
<p>The online version of this article (doi:10.1186/s13059-014-0506-z) contains supplementary material, which is available to authorized users.</p>
</sec>
</abstract>
<custom-meta-group><custom-meta><meta-name>issue-copyright-statement</meta-name>
<meta-value>© The Author(s) 2014</meta-value>
</custom-meta>
</custom-meta-group>
</article-meta>
</front>
<body><sec id="Sec1" sec-type="introduction"><title>Background</title>
<p>Rice (<italic>Oryza sativa</italic>
) provides 20% of the world’s dietary energy supply and is the predominant staple food for 17 countries in Asia, 9 countries in North and South America and 8 countries in Africa. Within <italic>O. sativa</italic>
, there are two major varietal groups, <italic>Indica</italic>
 and <italic>Japonica</italic>
, that can be further subdivided into five major subpopulations: <italic>indica</italic>
 and <italic>aus</italic>
 share ancestry within the <italic>Indica</italic>
 varietal group, and <italic>tropical japonica</italic>
, <italic>temperate japonica</italic>
 and <italic>aromatic</italic>
 (<italic>Group V</italic>
) share ancestry within the <italic>Japonica</italic>
 varietal group (Figure <xref rid="Fig1" ref-type="fig">1</xref>
) [<xref ref-type="bibr" rid="CR1">1</xref>
-<xref ref-type="bibr" rid="CR3">3</xref>
]. The subpopulation structure of <italic>O. sativa</italic>
 is deep and ancient, with estimates of divergence showing average pairwise Fst values of 0.375 to 0.45 [<xref ref-type="bibr" rid="CR1">1</xref>
-<xref ref-type="bibr" rid="CR3">3</xref>
], compared with Fst values of 0.25 for dogs [<xref ref-type="bibr" rid="CR4">4</xref>
], around 0.10 to 0.12 across human populations [<xref ref-type="bibr" rid="CR5">5</xref>
], or 0.08 to 0.09 for heterotic groups in maize [<xref ref-type="bibr" rid="CR6">6</xref>
].<fig id="Fig1"><label>Figure 1</label>
<caption><p><bold>Population structure in</bold>
<bold><italic>O. sativa</italic>
</bold>
<bold>.</bold>
 A principal component analysis (PCA) based on 40,000 SNPs shows the deep subpopulation structure of a rice diversity panel (400 <italic>O. sativa</italic>
 accessions). The top two principal components (PC1 and PC2) explain 44.1% of the genetic variation. Accessions are color-coded based on subpopulation: red, <italic>indica</italic>
; dark blue, <italic>temperate japonica</italic>
; light blue, <italic>tropical japonica</italic>
; yellow, <italic>aus</italic>
; purple, <italic>aromatic</italic>
; black, admixed. Figure reproduced with permission from [<xref ref-type="bibr" rid="CR7">7</xref>
].</p>
</caption>
<graphic xlink:href="13059_2014_506_Fig1_HTML" id="MO1"></graphic>
</fig>
</p>
<p>The time since divergence of the ancestral <italic>Indica</italic>
 and <italic>Japonica</italic>
 gene pools is estimated at 0.44 million years, based on sequence comparisons between cv Nipponbare (<italic>Japonica</italic>
) and cv 93-11 (<italic>Indica</italic>
) [<xref ref-type="bibr" rid="CR8">8</xref>
]. This time estimate pre-dates the domestication of <italic>O. sativa</italic>
 by several hundred thousand years, suggesting that rice cultivation proceeded from multiple, pre-differentiated ancestral pools [<xref ref-type="bibr" rid="CR1">1</xref>
,<xref ref-type="bibr" rid="CR9">9</xref>
-<xref ref-type="bibr" rid="CR13">13</xref>
]. This is consistent with genome-wide estimates of divergence based on gene content [<xref ref-type="bibr" rid="CR14">14</xref>
], transcript levels [<xref ref-type="bibr" rid="CR15">15</xref>
], single nucleotide polymorphisms (SNPs) [<xref ref-type="bibr" rid="CR3">3</xref>
,<xref ref-type="bibr" rid="CR16">16</xref>
], and transposable elements [<xref ref-type="bibr" rid="CR17">17</xref>
]. This is also consistent with evidence from the cloning of dozens of genes underlying diverse quantitative trait loci (QTLs) [<xref ref-type="bibr" rid="CR2">2</xref>
,<xref ref-type="bibr" rid="CR10">10</xref>
,<xref ref-type="bibr" rid="CR18">18</xref>
-<xref ref-type="bibr" rid="CR21">21</xref>
]. Despite ongoing debate about the precise moment and location of the first domestication 'event' in rice, these studies all demonstrate that natural variation in the rice genome is deeply partitioned and that divergent haplotypes can be readily associated with major varietal groups and subpopulations. The course of domestication, as rice transitioned from its ancestral state as a tropical, outcrossing, aquatic, perennial species to a predominantly inbreeding, annual species adapted to a wide range of ecologies, was punctuated by persistent episodes of intermating among the different subpopulations. This resulted in both natural and human-directed gene flow between the different gene pools, but the essential differentiation that distinguishes the <italic>Indica</italic>
 and <italic>Japonica</italic>
 genomes was maintained and reinforced over time as a result of numerous partial sterility barriers scattered throughout the genome [<xref ref-type="bibr" rid="CR22">22</xref>
-<xref ref-type="bibr" rid="CR25">25</xref>
].</p>
<p>A better understanding of the nature and extent of genome variation within the <italic>Oryza</italic>
 clade is critical for both practical and scientific reasons. While the OMAP project [<xref ref-type="bibr" rid="CR26">26</xref>
] is focused on documenting structural variation across 21 wild species of <italic>Oryza</italic>
, relatively little effort has been made to explore the nature of structural variation within and between subpopulations of <italic>O. sativa.</italic>
 The high quality, bacterial artificial chromosome (BAC)-by-BAC sequence of the <italic>temperate japonica</italic>
 rice variety Nipponbare, generated by the International Rice Genome Sequencing Program (IRGSP) [<xref ref-type="bibr" rid="CR27">27</xref>
], and the shotgun assembly of an <italic>indica</italic>
 rice genome, cv 93-11, by Chinese scientists in 2005 [<xref ref-type="bibr" rid="CR28">28</xref>
,<xref ref-type="bibr" rid="CR29">29</xref>
] have served as ‘reference genomes’ for the rice research community. The availability of these reference genomes helped catalyze and unify rice research efforts for over a decade, and they continue to serve as the backbone for resequencing efforts today [<xref ref-type="bibr" rid="CR2">2</xref>
,<xref ref-type="bibr" rid="CR30">30</xref>
-<xref ref-type="bibr" rid="CR33">33</xref>
].</p>
<p>Recently, the resequencing of hundreds of wild and cultivated rice genomes using next generation sequencing (NGS) and various complexity-reduction and genotyping-by-sequencing strategies has enriched the pool of sequence information available for rice [<xref ref-type="bibr" rid="CR30">30</xref>
,<xref ref-type="bibr" rid="CR34">34</xref>
,<xref ref-type="bibr" rid="CR35">35</xref>
]. However, the vast majority of resequenced genomes are aligned to and compared with the Nipponbare reference rather than being assembled <italic>de novo</italic>
, including in our own previous work [<xref ref-type="bibr" rid="CR35">35</xref>
] and in the current 3,000 rice genomes project [<xref ref-type="bibr" rid="CR36">36</xref>
]. This introduces a potential bias due to significant differences in genome size [<xref ref-type="bibr" rid="CR37">37</xref>
,<xref ref-type="bibr" rid="CR38">38</xref>
] and structure [<xref ref-type="bibr" rid="CR14">14</xref>
,<xref ref-type="bibr" rid="CR17">17</xref>
,<xref ref-type="bibr" rid="CR29">29</xref>
,<xref ref-type="bibr" rid="CR39">39</xref>
] that characterize the different subpopulations and varieties of rice. Alignment to a single reference is particularly problematic when NGS data from <italic>indica</italic>
, <italic>aus</italic>
 or divergent wild species genomes from the center of diversity of <italic>Oryza</italic>
 are aligned to the genetically and geographically divergent Nipponbare (<italic>temperate japonica</italic>
) reference because of the potential for misalignment, and for elimination of critical sequences that cannot be aligned with confidence.</p>
<p>The type and distribution of structural variation that distinguishes one rice genome from another, both within and between the five subpopulations of <italic>O. sativa</italic>
, remain largely unknown. Yet this knowledge is essential to understanding the genetic basis of heterosis and to identifying the genes underlying many of the most significant phenotypic differences that are critical to global food security, including a plant’s ability to grow in stressful environments afflicted by drought, submergence, low phosphorus and/or disease. The only practical way to fully understand the genomic diversity of rice is to carry out whole genome shotgun sequencing and <italic>de novo</italic>
 assembly. This has been problematic until recently due to the difficulties in assembling the short reads initially provided by NGS. However, recent advances in NGS chemistry and in computational approaches to sequence assembly have significantly improved the power and reliability of <italic>de novo</italic>
 assembly of NGS data.</p>
<p>In this study we use these advances to <italic>de novo</italic>
 assemble three divergent rice genomes representing the <italic>indica</italic>
 (IR64), <italic>aus</italic>
 (DJ123) and <italic>temperate japonica</italic>
 (Nipponbare) subpopulations and to determine the extent and distribution of structural variation among them. These varieties were chosen for both biological interest and to facilitate evaluation of assemblies. On the biological side, different subpopulations of rice are adapted to different ecologies and geographies, and harbor different alleles and traits of interest for plant improvement [<xref ref-type="bibr" rid="CR3">3</xref>
,<xref ref-type="bibr" rid="CR19">19</xref>
,<xref ref-type="bibr" rid="CR20">20</xref>
,<xref ref-type="bibr" rid="CR40">40</xref>
-<xref ref-type="bibr" rid="CR43">43</xref>
]. The <italic>aus</italic>
 subpopulation is of particular interest because it is the source of important alleles conferring disease resistance [<xref ref-type="bibr" rid="CR44">44</xref>
], tolerance to submergence [<xref ref-type="bibr" rid="CR33">33</xref>
], deep water [<xref ref-type="bibr" rid="CR45">45</xref>
], low-phosphorus soils [<xref ref-type="bibr" rid="CR41">41</xref>
], and drought [<xref ref-type="bibr" rid="CR46">46</xref>
]. <italic>Indica</italic>
 rice harbors the greatest amount of genetic variation [<xref ref-type="bibr" rid="CR1">1</xref>
,<xref ref-type="bibr" rid="CR30">30</xref>
] and accounts for the largest contribution to rice production globally. We chose to sequence Nipponbare because its high quality BAC-by-BAC sequence assembly [<xref ref-type="bibr" rid="CR27">27</xref>
] serves as a solid benchmark for assessing the quality of our three NGS assemblies and provides a context for understanding the impact of the varying data sets and parameters used in the assemblies.</p>
</sec>
<sec id="Sec2" sec-type="results"><title>Results and discussion</title>
<sec id="Sec3"><title><italic>De novo</italic>
 genome assemblies and functional annotation</title>
<p>The three rice varieties were assembled using the ALLPATHS-LG whole genome assembler [<xref ref-type="bibr" rid="CR47">47</xref>
] with approximately 50× coverage of a 180 bp fragment library, approximately 30× coverage of a 2 kbp jumping library, and approximately 30× coverage of a 5 kbp jumping library (see <xref rid="Sec13" ref-type="sec">Materials and methods</xref>
). We selected this assembler based on its performance with these data compared with other assemblers and its high ranking in the Assemblathon I and II and GAGE evaluations [<xref ref-type="bibr" rid="CR48">48</xref>
-<xref ref-type="bibr" rid="CR50">50</xref>
]. The three assemblies were named Os-Nipponbare-Draft-CSHL-1.0, Os-IR64-Draft-CSHL-1.0, and Os-DJ123-Draft-CSHL-1.0, following nomenclature proposed by [<xref ref-type="bibr" rid="CR51">51</xref>
].</p>
<p>All three assemblies were of high quality: approximately 90% of each genome was assembled into scaffolds at least 1 kbp long, with scaffold N50 sizes ranging from 213 kbp to 323 kbp, and contig N50 sizes ranging from 21.9 kbp to 25.5 kbp (Table <xref rid="Tab1" ref-type="table">1</xref>
). It is notable that an earlier assembly of the Nipponbare genome prior to sequencing the 5 kbp jumping library achieved a similar contig N50 size (21.2 kbp versus 21.9 kbp), but a substantially smaller scaffold N50 size (99 kbp versus 213 kbp) (also see <xref rid="Sec13" ref-type="sec">Materials and methods</xref>
). Improved scaffold sizes from including the larger library were expected, although the magnitude depends on the specific genome characteristics. Because the Nipponbare scaffolds were more than twice as large with the larger library, we sequenced the 5 kbp jumping library for all three genomes to maximize our ability to identify genes and other features, as well as to structurally compare the genomes.<table-wrap id="Tab1"><label>Table 1</label>
<caption><p><bold>Assembly and annotation statistics of the three</bold>
<bold><italic>de novo</italic>
</bold>
<bold>assemblies used in this study</bold>
</p>
</caption>
<table frame="hsides" rules="groups"><thead><tr valign="top"><th></th>
<th><bold>Nipponbare</bold>
</th>
<th><bold>IR64</bold>
</th>
<th><bold>DJ123</bold>
</th>
</tr>
</thead>
<tbody><tr valign="top"><td>Total span</td>
<td align="center">355.6 Mbp</td>
<td align="center">345.2 Mbp</td>
<td align="center">345.9 Mbp</td>
</tr>
<tr valign="top"><td>Total bases</td>
<td align="center">318.2 Mbp</td>
<td align="center">316.3 Mbp</td>
<td align="center">321.2 Mbp</td>
</tr>
<tr valign="top"><td>Genome coverage<sup>a</sup>
 (span/bases)</td>
<td align="center">91.2%/81.8%</td>
<td align="center">88.5%/81.3%</td>
<td align="center">88.6%/82.5%</td>
</tr>
<tr valign="top"><td>Number of scaffolds</td>
<td align="center">4,110</td>
<td align="center">2,919</td>
<td align="center">2,819</td>
</tr>
<tr valign="top"><td>N50 scaffold span</td>
<td align="center">213 kbp</td>
<td align="center">293 kbp</td>
<td align="center">323 kbp</td>
</tr>
<tr valign="top"><td>Max scaffold span</td>
<td align="center">1.37 Mbp</td>
<td align="center">2.85 Mbp</td>
<td align="center">2.38 Mbp</td>
</tr>
<tr valign="top"><td>Number of contigs (>1,000 bp)</td>
<td align="center">27,486</td>
<td align="center">26,160</td>
<td align="center">23,902</td>
</tr>
<tr valign="top"><td>N50 contig size</td>
<td align="center">21.9 kbp</td>
<td align="center">22.2 kbp</td>
<td align="center">25.5 kbp</td>
</tr>
<tr valign="top"><td>Max contig size</td>
<td align="center">133 kbp</td>
<td align="center">160 kbp</td>
<td align="center">252 kbp</td>
</tr>
<tr valign="top"><td>Total genes</td>
<td align="center">39,083</td>
<td align="center">37,758</td>
<td align="center">37,812</td>
</tr>
<tr valign="top"><td>Median gene length</td>
<td align="center">2,224</td>
<td align="center">2,275</td>
<td align="center">2,285</td>
</tr>
<tr valign="top"><td>Mean exons per transcript</td>
<td align="center">4.8</td>
<td align="center">4.8</td>
<td align="center">4.9</td>
</tr>
</tbody>
</table>
<table-wrap-foot><p><sup>a</sup>
Assumes a total genome size of 389 Mbp, according to the IRGSP [<xref ref-type="bibr" rid="CR27">27</xref>
].</p>
</table-wrap-foot>
</table-wrap>
</p>
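<p>The N50 and genome coverage statistics reported in Table <xref rid="Tab1" ref-type="table">1</xref> can be illustrated with a minimal sketch. It assumes the common definition of N50 (the length at which sequences of that size or longer hold at least half of the total bases) and the 389 Mbp genome size from the table footnote. The function names and the toy scaffold lengths are hypothetical and are not part of the assembly pipeline used in this study.</p>
<preformat>
```python
def n50(lengths):
    """Return the N50: the length L such that sequences of length >= L
    contain at least half of the total bases."""
    ordered = sorted(lengths, reverse=True)
    half = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half:
            return length
    raise ValueError("lengths must be non-empty")


def genome_coverage(assembled_bp, genome_size_bp=389_000_000):
    """Fraction of an assumed genome size represented by the assembly."""
    return assembled_bp / genome_size_bp


# Hypothetical toy scaffold lengths (not taken from the assemblies).
toy = [100, 80, 60, 40, 20]
print(n50(toy))  # 80

# Total bases of the Nipponbare assembly from Table 1 (318.2 Mbp).
print(round(100 * genome_coverage(318_200_000), 1))  # 81.8
```
</preformat>
<p>Computing the same statistic on contig lengths rather than scaffold lengths would give the contig N50.</p>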
<p>The assemblies were repeat-masked and annotated for protein-coding genes using the MAKER-P automated pipeline [<xref ref-type="bibr" rid="CR52">52</xref>
], combining both evidence-based and <italic>ab initio</italic>
 methods (Table S1 in Additional file <xref rid="MOESM1" ref-type="media">1</xref>
). In addition to EST and full-length cDNA, we included as evidence the two published annotations of Nipponbare [<xref ref-type="bibr" rid="CR51">51</xref>
], and the published annotations of strains 93-11 and PA64s [<xref ref-type="bibr" rid="CR28">28</xref>
], thereby maximizing consistency and reducing bias of annotation across the three assemblies. Putative transposon-encoded genes were screened following analysis of InterPro domains (see <xref rid="Sec13" ref-type="sec">Materials and methods</xref>
), which flagged approximately 1% of initial gene calls in each of the three assemblies. Summary statistics for remaining genes are provided in Table <xref rid="Tab1" ref-type="table">1</xref>
 and in Table S2 in Additional file <xref rid="MOESM1" ref-type="media">1</xref>
. Gene counts ranged from 37,758 (IR64) to 39,083 (Nipponbare), similar to the numbers reported by the Michigan State University (MSU) Rice Genome Annotation Project and Rice Annotation Project for the Os-Nipponbare-Reference-IRGSP-1.0 (39,102 and 35,681 respectively) [<xref ref-type="bibr" rid="CR51">51</xref>
]. Overall statistics for structural features, such as exons, introns, and coding regions were highly consistent between the three assemblies and with published annotations. For instance, average translated protein lengths compared across MSU, Rice Annotation Project, and the three <italic>de novo</italic>
 assemblies ranged from 280 to 288 amino acids (median values: 268 to 291 amino acids), suggesting that contiguity of the <italic>de novo</italic>
 assemblies did not limit our ability to identify protein-coding genes. For each assembly, 61 to 62% of annotated loci possessed one or more InterPro domains and 77% showed homology to plant NCBI RefSeq genes.</p>
</sec>
<sec id="Sec4"><title>Whole genome comparison to Nipponbare reference genomes</title>
<p>We evaluated the agreement between our <italic>de novo</italic>
 assemblies and the Nipponbare reference sequence using the GAGE assembly evaluation algorithm [<xref ref-type="bibr" rid="CR50">50</xref>
]. As expected, the <italic>de novo</italic>
 Nipponbare assembly very closely matches the reference Nipponbare sequence, with a 99.94% average identity and only 0.31% of the assembly not aligning to the reference (Tables <xref rid="Tab2" ref-type="table">2</xref>
, <xref rid="Tab3" ref-type="table">3</xref>
 and <xref rid="Tab4" ref-type="table">4</xref>
). Even at this very high level of agreement, there are several tens of thousands of small variations, and several hundred larger variations. These are a combination of true differences between our sample and the reference genome, of which we expect few, and errors introduced by ALLPATHS-LG when used with these libraries and coverage levels. Consequently, given the 99.94% average identity, the upper bound on the error rate of sequencing and assembling with ALLPATHS-LG is 0.06%.</p>
<p>The portions of the reference genome without any alignments from our Nipponbare assembly are scattered throughout the genome in 57,821 segments averaging 203 bp long. Of these unaligned bases, only 301,525 bp are annotated as coding sequence (CDS; 0.72% of the total CDS), and another 12,344 bp are annotated as non-coding exons. We further evaluated the unaligned regions by computing their read k-mer coverage from a sample of 400 million unassembled Nipponbare reads, and found that the mean k-mer coverage of these regions exceeds 12,000×, while the mode k-mer coverage of the set is less than 100× (Figure S1 in Additional file <xref rid="MOESM1" ref-type="media">1</xref>
). A full two-thirds (38,373/57,821) of these regions exceed 1,000× k-mer coverage, more than 10 times higher than unique segments of the genome. This implies that the unassembled/unaligned regions are highly enriched for high copy repeats too complex to be assembled. In contrast, the genic regions are very well represented, suggesting that a detailed analysis of the 'gene space' of the accessions is possible from these assemblies.</p>
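<p>The k-mer coverage evaluation described above can be sketched as follows. This is a simplified illustration, not the software used in this study: it counts k-mers on one strand only, whereas practical k-mer counters typically merge each k-mer with its reverse complement, and the value of k, the function names and the toy sequences are assumptions made for the example. The 100× default mirrors the repeat threshold used in the tables.</p>
<preformat>
```python
from collections import Counter


def count_kmers(reads, k):
    """Count every k-mer occurring in a collection of read strings."""
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return counts


def mean_kmer_coverage(region, counts, k):
    """Mean read k-mer count over all k-mers of a reference region."""
    n = len(region) - k + 1
    if n >= 1:
        return sum(counts[region[i:i + k]] for i in range(n)) / n
    return 0.0


def is_repetitive(region, counts, k, threshold=100):
    """Flag a region whose mean k-mer coverage exceeds the threshold."""
    return mean_kmer_coverage(region, counts, k) > threshold


# Hypothetical toy data: a low-copy read and a high-copy repeat read.
reads = ["ACGT"] * 2 + ["TTTT"] * 50
counts = count_kmers(reads, k=3)
print(mean_kmer_coverage("ACGT", counts, k=3))   # 2.0
print(mean_kmer_coverage("TTTTT", counts, k=3))  # 100.0
print(is_repetitive("TTTTT", counts, k=3, threshold=10))  # True
```
</preformat>
<p>Evaluating read-derived counts along the reference, rather than along the assembly, is what allows unassembled repeats to be recognized by their elevated coverage.</p>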
<p>Tables <xref rid="Tab2" ref-type="table">2</xref>
, <xref rid="Tab3" ref-type="table">3</xref>
 and <xref rid="Tab4" ref-type="table">4</xref>
 summarize the alignments of the three <italic>de novo</italic>
 assemblies relative to the reference IRGSP-1.0 Nipponbare assembly. As expected, the IR64 and DJ123 assemblies show noticeably lower overall identity, and have considerably more unaligned bases. The average k-mer coverage of the unaligned bases indicates most regions are unassembled high copy repeats, although there are 11.8 Mbp and 12.3 Mbp unaligned reference bases in IR64 and DJ123, respectively, that are not repetitive based on the k-mer analysis. This suggests there may be megabases of sequence specific to each of the three genomes.<table-wrap id="Tab2"><label>Table 2</label>
<caption><p><bold>Comparison of the three</bold>
<bold><italic>de novo</italic>
</bold>
<bold>assemblies to the Nipponbare reference (IRGSP-1.0)</bold>
</p>
</caption>
<table frame="hsides" rules="groups"><thead><tr valign="top"><th></th>
<th><bold>Unaligned reference bases</bold>
</th>
<th><bold>Unaligned assembly bases</bold>
</th>
<th><bold>Average ID</bold>
</th>
<th><bold>SNPs and small indels</bold>
</th>
<th><bold>Indels > 5 bp</bold>
</th>
<th><bold>Inversions</bold>
</th>
<th><bold>Relocations</bold>
</th>
<th><bold>Translocations</bold>
</th>
</tr>
</thead>
<tbody><tr valign="top"><td>Nipponbare</td>
<td align="center">3.14%</td>
<td align="center">0.31%</td>
<td align="center">99.94%</td>
<td align="center">57,459</td>
<td align="center">3,445</td>
<td align="center">131</td>
<td align="center">252</td>
<td align="center">617</td>
</tr>
<tr valign="top"><td>IR64</td>
<td align="center">11.08%</td>
<td align="center">8.94%</td>
<td align="center">98.91%</td>
<td align="center">2,917,780</td>
<td align="center">80,631</td>
<td align="center">1,004</td>
<td align="center">1,721</td>
<td align="center">7,060</td>
</tr>
<tr valign="top"><td>DJ123</td>
<td align="center">9.85%</td>
<td align="center">8.55%</td>
<td align="center">98.93%</td>
<td align="center">2,933,257</td>
<td align="center">80,346</td>
<td align="center">1,007</td>
<td align="center">1,615</td>
<td align="center">6,683</td>
</tr>
</tbody>
</table>
</table-wrap>
<table-wrap id="Tab3"><label>Table 3</label>
<caption><p><bold>Summary of unaligned reference regions</bold>
</p>
</caption>
<table frame="hsides" rules="groups"><thead><tr valign="top"><th></th>
<th><bold>Total bp</bold>
</th>
<th><bold>Regions</bold>
</th>
<th><bold>Average size</bold>
</th>
<th><bold>Maximum size</bold>
</th>
<th><bold>Mean k-mer coverage</bold>
</th>
</tr>
</thead>
<tbody><tr valign="top"><td>Nipponbare</td>
<td align="center">11,750,969</td>
<td align="center">57,821</td>
<td align="center">203 ± 350</td>
<td align="center">12,773</td>
<td align="center">12,210×</td>
</tr>
<tr valign="top"><td>IR64</td>
<td align="center">41,639,095</td>
<td align="center">133,536</td>
<td align="center">311 ± 921</td>
<td align="center">27,087</td>
<td align="center">7,938×</td>
</tr>
<tr valign="top"><td>DJ123</td>
<td align="center">37,010,281</td>
<td align="center">122,589</td>
<td align="center">302 ± 931</td>
<td align="center">26,236</td>
<td align="center">6,121×</td>
</tr>
</tbody>
</table>
<table-wrap-foot><p>Mean k-mer coverage was evaluated by counting the k-mers in a sample of 400 M unassembled reads in each of the three genomes, and evaluating those counts along the reference sequence.</p>
</table-wrap-foot>
</table-wrap>
<table-wrap id="Tab4"><label>Table 4</label>
<caption><p><bold>Summary of unaligned bases by reference annotation</bold>
</p>
</caption>
<table frame="hsides" rules="groups"><thead><tr valign="top"><th></th>
<th><bold>Non-coding exon</bold>
</th>
<th><bold>5</bold>
′ <bold>UTR</bold>
</th>
<th><bold>3</bold>
′ <bold>UTR</bold>
</th>
<th><bold>mRNA</bold>
</th>
<th><bold>Coding sequence</bold>
</th>
<th><bold>Repetitive bp (>100× k-mer coverage)</bold>
</th>
</tr>
</thead>
<tbody><tr valign="top"><td>Nipponbare</td>
<td align="center">12,344</td>
<td align="center">75,443</td>
<td align="center">80,518</td>
<td align="center">984,489</td>
<td align="center">301,525</td>
<td align="center">10,863,120</td>
</tr>
<tr valign="top"><td>IR64</td>
<td align="center">129,428</td>
<td align="center">902,520</td>
<td align="center">638,856</td>
<td align="center">6,827,679</td>
<td align="center">1,706,454</td>
<td align="center">29,685,919</td>
</tr>
<tr valign="top"><td>DJ123</td>
<td align="center">114,735</td>
<td align="center">865,470</td>
<td align="center">601,928</td>
<td align="center">6,418,865</td>
<td align="center">1,519,686</td>
<td align="center">24,505,588</td>
</tr>
</tbody>
</table>
<table-wrap-foot><p>Repeats were evaluated by analyzing their k-mer coverage using the same method as above.</p>
</table-wrap-foot>
</table-wrap>
</p>
</sec>
<sec id="Sec5"><title>Whole genome comparison to <italic>indica</italic>
 reference genomes</title>
<p>Using the same methods used for comparing to the reference Nipponbare genome, we also evaluated the three genomes relative to the reference <italic>indica</italic>
 genome (cv 93-11) [<xref ref-type="bibr" rid="CR27">27</xref>
] (Tables <xref rid="Tab5" ref-type="table">5</xref>
, <xref rid="Tab6" ref-type="table">6</xref>
 and <xref rid="Tab7" ref-type="table">7</xref>
). The agreement between the <italic>de novo</italic>
 IR64 assembly and the reference <italic>indica</italic>
 sequence is appreciably less than the Nipponbare-Nipponbare alignment; 4.31% of the IR64 assembly does not align to the 93-11 reference and the aligned regions have only 99.52% identity between these two <italic>indica</italic>
 varieties. Since the assemblies and alignments were computed with the same sample preparation and analysis algorithms, this suggests there are more true biological variations between IR64 and 93-11 (as would be expected from two different varieties), and/or that the 93-11 reference assembly is not as complete or as accurate as the reference Nipponbare assembly. The latter explanation is quite likely to be a contributing factor, given that the 93-11 genome represents a whole genome shotgun assembly, while the Nipponbare genome utilized a combination of BACs and whole genome shotgun sequencing. For example, the 93-11 assembly has 14.1 million unresolved ('N') bases, while the Nipponbare reference has only 118,200. As seen with Nipponbare, most of the unassembled/unaligned bases between the 93-11 reference and our IR64 assembly are repetitive, with mean k-mer coverage over 14,000×. A quarter of the unaligned reference bases (7.75 Mbp/31 Mbp) are non-repetitive according to the k-mer analysis, whereas less than 900 kbp of the unaligned Nipponbare reference bases are non-repetitive. This underscores the fact that there are substantially more true biological differences between IR64 and the reference 93-11 <italic>indica</italic>
 assembly than between our Nipponbare sample and its reference.<table-wrap id="Tab5"><label>Table 5</label>
<caption><p><bold>Comparison of the three</bold>
<bold><italic>de novo</italic>
</bold>
<bold>assemblies to the</bold>
<bold><italic>Indica</italic>
</bold>
<bold>reference (93-11 from</bold>
 [<xref ref-type="bibr" rid="CR28">28</xref>
]<bold>)</bold>
</p>
</caption>
<table frame="hsides" rules="groups"><thead><tr valign="top"><th></th>
<th><bold>Unaligned reference bases</bold>
</th>
<th><bold>Unaligned assembly bases</bold>
</th>
<th><bold>Average ID</bold>
</th>
<th><bold>SNPs and small indels</bold>
</th>
<th><bold>Indels > 5 bp</bold>
</th>
<th><bold>Inversions</bold>
</th>
<th><bold>Relocations</bold>
</th>
<th><bold>Translocations</bold>
</th>
</tr>
</thead>
<tbody><tr valign="top"><td>Nipponbare</td>
<td align="center">16.95%</td>
<td align="center">8.90%</td>
<td align="center">98.91%</td>
<td align="center">2,813,076</td>
<td align="center">75,944</td>
<td align="center">1,162</td>
<td align="center">11,627</td>
<td align="center">11,030</td>
</tr>
<tr valign="top"><td>IR64</td>
<td align="center">13.29%</td>
<td align="center">4.31%</td>
<td align="center">99.52%</td>
<td align="center">1,228,732</td>
<td align="center">35,762</td>
<td align="center">644</td>
<td align="center">6,985</td>
<td align="center">7,101</td>
</tr>
<tr valign="top"><td>DJ123</td>
<td align="center">14.37%</td>
<td align="center">6.78%</td>
<td align="center">99.16%</td>
<td align="center">2,264,541</td>
<td align="center">62,191</td>
<td align="center">974</td>
<td align="center">8,582</td>
<td align="center">10,170</td>
</tr>
</tbody>
</table>
</table-wrap>
<table-wrap id="Tab6"><label>Table 6</label>
<caption><p><bold>Summary of unaligned reference regions relative to the</bold>
<bold><italic>Indica</italic>
</bold>
<bold>reference</bold>
</p>
</caption>
<table frame="hsides" rules="groups"><thead><tr valign="top"><th></th>
<th><bold>Total bp</bold>
</th>
<th><bold>Regions</bold>
</th>
<th><bold>Average size</bold>
</th>
<th><bold>Maximum size</bold>
</th>
<th><bold>Mean k-mer coverage</bold>
</th>
</tr>
</thead>
<tbody><tr valign="top"><td>Nipponbare</td>
<td align="center">46,101,370</td>
<td align="center">176,504</td>
<td align="center">261 ± 796</td>
<td align="center">53,066</td>
<td align="center">6,945×</td>
</tr>
<tr valign="top"><td>IR64</td>
<td align="center">31,006,053</td>
<td align="center">139,336</td>
<td align="center">264 ± 718</td>
<td align="center">54,056</td>
<td align="center">14,215×</td>
</tr>
<tr valign="top"><td>DJ123</td>
<td align="center">49,562,877</td>
<td align="center">152,378</td>
<td align="center">325 ± 845</td>
<td align="center">54,307</td>
<td align="center">9,605×</td>
</tr>
</tbody>
</table>
</table-wrap>
<table-wrap id="Tab7"><label>Table 7</label>
<caption><p><bold>Summary of unaligned bases by reference annotation relative to the</bold>
<bold><italic>Indica</italic>
</bold>
<bold>reference</bold>
</p>
</caption>
<table frame="hsides" rules="groups"><thead><tr valign="top"><th></th>
<th><bold>mRNA</bold>
</th>
<th><bold>CDS</bold>
</th>
<th><bold>Repetitive bp (>100× k-mer coverage)</bold>
</th>
</tr>
</thead>
<tbody><tr valign="top"><td>Nipponbare</td>
<td>9,768,022</td>
<td>4,809,510</td>
<td align="center">33,934,310</td>
</tr>
<tr valign="top"><td>IR64</td>
<td>7,629,670</td>
<td>3,993,153</td>
<td align="center">23,255,924</td>
</tr>
<tr valign="top"><td>DJ123</td>
<td>5,999,753</td>
<td>1,947,112</td>
<td align="center">23,805,398</td>
</tr>
</tbody>
</table>
<table-wrap-foot><p>Note that only CDS and mRNA annotations are available for the reference <italic>Indica</italic>
 assembly.</p>
</table-wrap-foot>
</table-wrap>
</p>
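<p>The non-repetitive fractions quoted above follow directly from the tabulated values: the non-repetitive unaligned bases are the total unaligned reference bases minus those exceeding 100× k-mer coverage. A short worked check, using only figures copied from Tables <xref rid="Tab3" ref-type="table">3</xref>, <xref rid="Tab4" ref-type="table">4</xref>, <xref rid="Tab6" ref-type="table">6</xref> and <xref rid="Tab7" ref-type="table">7</xref>, is given below; the helper name is hypothetical.</p>
<preformat>
```python
# Values copied from Tables 3, 4, 6 and 7:
# (total unaligned reference bp, repetitive bp at >100x k-mer coverage)
unaligned = {
    "Nipponbare vs IRGSP-1.0": (11_750_969, 10_863_120),
    "IR64 vs 93-11": (31_006_053, 23_255_924),
}


def non_repetitive(total_bp, repetitive_bp):
    """Return non-repetitive unaligned bp and their share of the total."""
    remaining = total_bp - repetitive_bp
    return remaining, remaining / total_bp


for label, (total, rep) in unaligned.items():
    bp, frac = non_repetitive(total, rep)
    print(f"{label}: {bp:,} bp non-repetitive ({frac:.1%} of unaligned)")
# Nipponbare vs IRGSP-1.0: 887,849 bp non-repetitive (7.6% of unaligned)
# IR64 vs 93-11: 7,750,129 bp non-repetitive (25.0% of unaligned)
```
</preformat>
<p>These values correspond to the 'less than 900 kbp' and 'a quarter' (7.75 Mbp/31 Mbp) figures given in the text.</p>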
<p>Finally, the comparison between the IR64 and DJ123 assemblies shows that they differ from each other nearly as much as either one differs from the Nipponbare reference sequence. These results suggest that the <italic>aus</italic>
 genome harbors a greater amount of novel variation than previously recognized. It also highlights the value of taking an unbiased, <italic>de novo</italic>
 assembly approach when evaluating genomic variation among varieties and subpopulations to capture genome-specific variations.</p>
</sec>
<sec id="Sec6"><title>Pan-genome analysis</title>
<p>We next evaluated the 'pan-genome' of the three <italic>de novo</italic>
 assemblies to identify sequences that were conserved across the genomes as well as sequences specific to just one genome (see <xref rid="Sec13" ref-type="sec">Materials and methods</xref>
). Using the whole genome alignment information, we classified each base of each genome as either specific to that genome (unaligned to both of the other genomes) or shared with one or both of the other genomes. The majority of the assembled sequences (approximately 302 Mbp per genome) and exonic sequences (approximately 55.5 Mbp per genome) were shared among the three genomes, although 4.8 Mbp to 8.2 Mbp (423 kbp to 930 kbp exonic) were found to be genome-specific (Figure <xref rid="Fig2" ref-type="fig">2</xref>
A). Since a gene sequence may be partially shared or partially genome-specific, we assigned each gene to the sector on the Venn diagram for which the majority of the exonic bases were assigned over all transcripts associated with each gene. For example, if 90% of a gene is shared among all three genomes, but 10% is genome-specific, we would assign it to the center (fully shared) sector under the majority rule. This will not necessarily characterize changes in gene function if critical protein domains are shared/unshared, but highlights the major trends between the lineages and discovers 297 to 786 genome-specific loci.<fig id="Fig2"><label>Figure 2</label>
<caption><p><bold>Venn diagrams of the shared sequence content between Nipponbare (</bold>
<bold><italic>temperate japonica</italic>
</bold>
<bold>), IR64 (</bold>
<bold><italic>indica</italic>
</bold>
<bold>) and DJ123 (</bold>
<bold><italic>aus</italic>
</bold>
<bold>). (A)</bold>
 Overall sequence content. In each sector, the top number is the total number of base pairs, the middle number is the number of exonic bases, and the bottom is the gene count. If a gene is partially shared, it is assigned to the sector with the most exonic bases. <bold>(B)</bold>
 Genic content. In each sector, the top number is the median CDS length, the middle number is the average number of exons per gene, and the bottom is the percentage InterPro/homology.</p>
</caption>
<graphic xlink:href="13059_2014_506_Fig2_HTML" id="MO2"></graphic>
</fig>
</p>
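<p>The majority rule used to place genes on the Venn diagram can be sketched as below. This is an illustrative reading of the rule, not the code used in this study: it assumes each exonic base has already been given a sector label from the whole genome alignments, pools the labels over all transcripts of a gene, and picks the sector with the most bases. The sector labels are hypothetical, and ties are resolved in favor of the label encountered first, which is an assumption.</p>
<preformat>
```python
from collections import Counter


def assign_gene_sector(exonic_base_sectors):
    """Assign a gene to the Venn sector holding most of its exonic bases.

    exonic_base_sectors: iterable of sector labels, one per exonic base,
    pooled over all transcripts of the gene.
    """
    tally = Counter(exonic_base_sectors)
    if not tally:
        raise ValueError("gene has no exonic bases")
    sector, _ = tally.most_common(1)[0]
    return sector


# Hypothetical gene: 90% of exonic bases shared by all three genomes,
# 10% specific to one genome.
gene = ["shared_all"] * 900 + ["IR64_only"] * 100
print(assign_gene_sector(gene))  # shared_all
```
</preformat>
<p>As noted in the text, a gene assigned to the fully shared sector under this rule may still carry a genome-specific segment, so the assignment summarizes overall trends rather than functional differences.</p>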
<p>Using the same k-mer analysis techniques we applied for the reference analysis, we further classified the genome-specific bases as being unique or repetitive, using a threshold of 100× average k-mer coverage to classify unique sequences. From this, we identified only 1.2 Mbp to 1.5 Mbp of non-repetitive sequence specific to each genome, meaning that most of the genome-specific bases were actually repetitive (Table <xref rid="Tab8" ref-type="table">8</xref>
). Since repetitive sequences are also the most likely to be unassembled, as observed in our comparison to the reference genomes, we further examined the genome-specific exonic bases and refined our initial estimates to 555 kbp to 760 kbp of non-repetitive, genome-specific sequences intersecting annotated genes by at least 100 bp (Table <xref rid="Tab9" ref-type="table">9</xref>
). Note these segments may include flanking promoter and other regulatory regions in addition to the exons themselves. From this catalog, we selected 10 of the largest regions in each of the genomes for PCR validation, and were able to confirm the computational analysis with 100% success (Figure <xref rid="Fig3" ref-type="fig">3</xref>
; Table S4a-c in Additional file <xref rid="MOESM1" ref-type="media">1</xref>
).<table-wrap id="Tab8"><label>Table 8</label>
<caption><p><bold>Genome-specific non-repetitive bases</bold>
</p>
</caption>
<table frame="hsides" rules="groups"><thead><tr valign="top"><th><bold>Genome</bold>
</th>
<th><bold>Genome-specific</bold>
</th>
<th><bold>Regions</bold>
</th>
<th><bold>Mean ± standard deviation (bp)</bold>
</th>
<th><bold>Maximum size (bp)</bold>
</th>
</tr>
</thead>
<tbody><tr valign="top"><td>Nipponbare</td>
<td align="center">1,574,801 bp</td>
<td align="center">2,250</td>
<td align="center">699 ± 904</td>
<td align="center">8,463</td>
</tr>
<tr valign="top"><td>IR64</td>
<td align="center">1,336,650 bp</td>
<td align="center">1,702</td>
<td align="center">785 ± 1,066</td>
<td align="center">10,400</td>
</tr>
<tr valign="top"><td>DJ123</td>
<td align="center">1,263,681 bp</td>
<td align="center">1,569</td>
<td align="center">805 ± 1,058</td>
<td align="center">8,960</td>
</tr>
</tbody>
</table>
<table-wrap-foot><p>Identified sequences must be at least 100 bp long with no alignments to the other genomes, average between 10× and 100× k-mer coverage in that genome, and average below 10× k-mer coverage using the reads from the other two genomes.</p>
</table-wrap-foot>
</table-wrap>
<table-wrap id="Tab9"><label>Table 9</label>
<caption><p><bold>Genome-specific non-repetitive gene sequences</bold>
</p>
</caption>
<table frame="hsides" rules="groups"><thead><tr valign="top"><th><bold>Genome</bold>
</th>
<th><bold>Genome-specific</bold>
</th>
<th><bold>Regions</bold>
</th>
<th><bold>Mean ± standard deviation (bp)</bold>
</th>
<th><bold>Maximum size (bp)</bold>
</th>
</tr>
</thead>
<tbody><tr valign="top"><td>Nipponbare</td>
<td align="center">760,064 bp</td>
<td align="center">779</td>
<td align="center">975 ± 1,197</td>
<td align="center">8,463</td>
</tr>
<tr valign="top"><td>IR64</td>
<td align="center">637,470 bp</td>
<td align="center">583</td>
<td align="center">1,093 ± 1,430</td>
<td align="center">10,400</td>
</tr>
<tr valign="top"><td>DJ123</td>
<td align="center">555,507 bp</td>
<td align="center">492</td>
<td align="center">1,129 ± 1,409</td>
<td align="center">8,960</td>
</tr>
</tbody>
</table>
<table-wrap-foot><p>Regions identified to be specific to a given accession using the criterion used in Table <xref rid="Tab8" ref-type="table">8</xref>
 and that intersect an annotated gene region by at least 100 bp.</p>
</table-wrap-foot>
</table-wrap>
<fig id="Fig3"><label>Figure 3</label>
<caption><p><bold>PCR validation of genome-specific regions.</bold>
 Regions identified as unique to each genome assembly were amplified from genomic DNA of all three genomes and visualized on 1% agarose gels. <bold>(A)</bold>
 Nipponbare-specific sequences. <bold>(B)</bold>
 IR64-specific sequences. <bold>(C)</bold>
 DJ123-specific sequences.</p>
</caption>
<graphic xlink:href="13059_2014_506_Fig3_HTML" id="MO3"></graphic>
</fig>
</p>
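<p>The selection criteria listed in the footnote of Table 8 can be expressed as a simple filter. The following Python sketch is an illustration only and is not the pipeline used in this study; the Region record, its field names, the use of a single coverage value for the other two genomes, and the treatment of the boundary values are all assumptions.</p>
<preformat>
```python
from dataclasses import dataclass


@dataclass
class Region:
    """Hypothetical summary of one candidate genome-specific region."""
    length_bp: int            # length of the candidate region
    aligned_to_others: bool   # True if it aligns to either of the other genomes
    own_cov: float            # mean k-mer coverage from this genome's own reads
    other_cov: float          # highest mean k-mer coverage from the other genomes' reads


# Thresholds taken from the Table 8 footnote.
MIN_LENGTH_BP = 100
MIN_OWN_COV = 10.0
MAX_OWN_COV = 100.0
MAX_OTHER_COV = 10.0


def is_specific_non_repetitive(region: Region) -> bool:
    """Return True if the region passes all four footnote criteria.

    Treating the 10x and 100x own-coverage bounds as inclusive is an assumption.
    """
    return (
        region.length_bp >= MIN_LENGTH_BP
        and not region.aligned_to_others
        and MAX_OWN_COV >= region.own_cov >= MIN_OWN_COV
        and MAX_OTHER_COV > region.other_cov
    )


# Invented example values: a single-copy region and a high-copy repeat.
single_copy = Region(length_bp=699, aligned_to_others=False, own_cov=55.0, other_cov=0.5)
repeat = Region(length_bp=699, aligned_to_others=False, own_cov=450.0, other_cov=0.5)
print(is_specific_non_repetitive(single_copy), is_specific_non_repetitive(repeat))
```
</preformat>
<p>In this sketch, a region with very high coverage in its own genome is rejected as repetitive even though it has no counterpart in the other two genomes.</p>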
<p>For Nipponbare and IR64, we determined the positions of the non-repetitive segments along the different reference chromosomes, and found the segments were broadly distributed. For Nipponbare, we could localize 2,208 of the genome-specific regions, and found that one region occurred, on average, every 162 ± 362 kbp, following an approximately exponential distribution (data not shown). For IR64, we could localize 1,074 of the genome-specific regions, and found one region occurred, on average, every 338 ± 752 kbp, also following an approximately exponential distribution. These distributions suggest that the genome-specific bases are not highly localized, because exponentially distributed spacing is what is expected when a site is equally likely to occur at any position in the genome.</p>
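<p>The link between uniformly placed sites and approximately exponential spacing can be illustrated with a small simulation. The Python sketch below is a minimal example under stated assumptions: the function name is hypothetical, the genome is treated as a single linear sequence, and the genome length of 373 Mbp is a round illustrative figure rather than a value taken from this analysis.</p>
<preformat>
```python
import random
import statistics


def simulate_spacing(n_sites, genome_length_bp, seed=0):
    """Place n_sites uniformly at random and summarize the gaps between neighbors."""
    rng = random.Random(seed)
    positions = sorted(rng.randrange(genome_length_bp) for _ in range(n_sites))
    gaps = [right - left for left, right in zip(positions, positions[1:])]
    return statistics.mean(gaps), statistics.stdev(gaps)


# 2,208 sites as localized for Nipponbare; the genome length is an assumption.
mean_gap, sd_gap = simulate_spacing(n_sites=2208, genome_length_bp=373_000_000)
print(f"mean spacing: {mean_gap / 1000:.0f} kbp, sd: {sd_gap / 1000:.0f} kbp")
```
</preformat>
<p>Under this idealized model, the standard deviation of the spacing is close to its mean, which is the signature of an exponential distribution.</p>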
<p>Genome-specific loci, as well as those shared between two genomes but not the third, exhibited shorter CDSs and greater novelty compared with genes shared among all three genomes (Figure <xref rid="Fig2" ref-type="fig">2</xref>
B). For example, loci common to all genomes had a median coding length of 888 bp compared with median values ranging from 483 to 654 bp for the genome-specific gene sets. Likewise, the core, fully shared set of genes averaged 4.9 exons per transcript compared with a range of 2.9 to 3.0 exons per transcript amongst genome-specific genes. A smaller fraction of genome-specific loci contained InterPro domains compared with the core set (40% versus 63%), and fewer showed homology to plant RefSeq proteins (57% versus 79%). However, artifacts of inaccurate annotation may contribute to this trend [<xref ref-type="bibr" rid="CR53">53</xref>
], so we investigated whether these differences were influenced by assembly quality, in particular whether genome-specific genes tended to terminate in scaffold gaps more frequently than core genes. We observed a modest effect: genes shared by all three strains have a median distance of 12 to 14 kbp (5′ and 3′ flanking distances), whereas genome-specific genes have a median distance of 8 to 10 kbp (Table S7 in Additional file <xref rid="MOESM1" ref-type="media">1</xref>
). Only 7 to 12% of the genes have a flanking distance of less than 1 kbp, suggesting this is not a major factor in our analysis. Indeed, our results are similar to studies in yeast, <italic>Drosophila</italic>
, and vertebrates that have found that novel and recently evolved genes tend to encode smaller proteins than conserved or ancient genes [<xref ref-type="bibr" rid="CR53">53</xref>
-<xref ref-type="bibr" rid="CR55">55</xref>
].</p>
<p>To characterize potential function of genome-specific genes we further examined genes with annotated InterPro domains. Notably, genes with domains related to disease resistance were the most prevalent type among genome-specific genes. For example, 12% of genes specific to IR64 possessed the NB-ARC motif (IPR002182), the central nucleotide-binding domain of plant R-genes. This domain, and others associated with R-genes, also prevailed among the DJ123-specific and Nipponbare-specific gene sets, accounting for 9% and 5% of genes, respectively. In contrast, only 0.35% of genes shared among all three genomes encode the NB-ARC domain. Genes shared between just two genomes showed intermediate frequencies of disease resistance genes (1.5 to 2.5%). Similar distributions were seen for genes classified with the Gene Ontology (GO) term 'defense response' (GO:0006952). These results are consistent with Ding <italic>et al</italic>
. [<xref ref-type="bibr" rid="CR14">14</xref>
], who showed high levels of 'genome asymmetry' among R genes when comparing the Nipponbare and 93-11 reference assemblies. A large diversity of other protein domain classes, such as those associated with receptor and non-receptor protein kinases, transcription factors, metabolic enzymes, proteases, and transporters, were also found in the genome-specific gene sets. A complete listing of putative strain-specific genes, with their InterPro domains, GO terms, and a summary of homology search results, is provided in Additional file 2. We anticipate these findings will greatly enhance the ongoing 3,000 rice genomes project [<xref ref-type="bibr" rid="CR36">36</xref>
] and other resequencing projects that had previously focused on single nucleotide variations relative to the Nipponbare reference.</p>
</sec>
<sec id="Sec7"><title>Detailed regions</title>
<p>We chose four agronomically relevant regions of the rice genome that were previously reported to harbor differences among the three varieties or subpopulations to illustrate the utility of these high quality whole genome assemblies for understanding the variation in genome structure underlying salient phenotypic variants.</p>
<sec id="Sec8"><title>S5 <italic>hybrid sterility locus</italic>
</title>
<p><italic>S5</italic>
 is a major locus for hybrid sterility in rice that affects embryo sac fertility. Genetic analysis of the <italic>S5</italic>
 locus documented three alleles: an <italic>indica</italic>
 (<italic>S5</italic>
-i), a <italic>japonica</italic>
 (<italic>S5</italic>
-j), and a neutral allele (<italic>S5</italic>
-n) [<xref ref-type="bibr" rid="CR23">23</xref>
,<xref ref-type="bibr" rid="CR56">56</xref>
]. Hybrids of genotype <italic>S5</italic>
-i/<italic>S5</italic>
-j are mostly sterile, whereas hybrids of genotypes consisting of <italic>S5</italic>
-n with either <italic>S5</italic>
-i or <italic>S5</italic>
-j are mostly fertile. The <italic>S5</italic>
 locus contains three tightly linked genes that work together in a ‘killer-protector’-type system [<xref ref-type="bibr" rid="CR57">57</xref>
,<xref ref-type="bibr" rid="CR58">58</xref>
]. During female sporogenesis, ORF5+ (killer) and ORF4+ (partner) cause endoplasmic reticulum stress. ORF3+ prevents endoplasmic reticulum stress and allows the production of normal gametes, whereas the ORF3- allele cannot prevent it, resulting in embryo sac abortion. The <italic>ORF3</italic>
- allele has a 13-bp deletion; the <italic>ORF4</italic>
- allele carries an 11-bp deletion that causes a premature stop codon [<xref ref-type="bibr" rid="CR58">58</xref>
]. The ORF5 <italic>indica</italic>
 (<italic>ORF5</italic>
+) and <italic>japonica</italic>
 (<italic>ORF5</italic>
-) alleles differ by only two nucleotides, whereas the wide compatibility allele <italic>S5</italic>
-n (<italic>ORF5</italic>
n) has a large deletion in the amino terminus of the predicted protein, rendering it presumably non-functional [<xref ref-type="bibr" rid="CR57">57</xref>
]. The typical <italic>indica</italic>
 haplotype is ORF3+/ORF4-/ORF5+, while the typical <italic>japonica</italic>
 haplotype is ORF3-/ORF4+/ORF5-.</p>
<p>In each of the three <italic>de novo</italic>
 assemblies reported here the <italic>S5</italic>
 locus containing the three genes lies within a single scaffold and haplotypes can be easily identified (Table S5 and Figure S2 in Additional file <xref rid="MOESM1" ref-type="media">1</xref>
). The identities of the <italic>ORF5</italic>
 alleles in Nipponbare, IR64 and DJ123 were also verified by Sanger sequencing of genomic DNA, which agreed perfectly with the assembly results (Figures S3 and S4 in Additional file <xref rid="MOESM1" ref-type="media">1</xref>
). The Nipponbare assembly is in agreement with the Nipponbare IRGSP-1.0 reference sequence for the region and shows that it carries the typical <italic>japonica</italic>
 haplotype ORF3-/ORF4+/ORF5-. The IR64 assembly shows that this accession carries the typical <italic>indica</italic>
 haplotype ORF3+/ORF4-/ORF5+. In the case of DJ123, our <italic>de novo</italic>
 assembly revealed that this <italic>aus</italic>
 accession carries the 136-bp deletion characteristic of the neutral allele, <italic>ORF5</italic>
n. However, the DJ123 <italic>ORF5</italic>
n allele is novel, as it differs from the reported <italic>ORF5</italic>
n allele by two SNPs and one 10-bp deletion within the coding region of the gene (also confirmed by Sanger sequencing). The DJ123 haplotype for the locus is ORF3-/ORF4-/ORF5n, a haplotype previously identified by Yang <italic>et al.</italic>
 [<xref ref-type="bibr" rid="CR58">58</xref>
] in four accessions from Bangladesh. Although the accessions bearing this haplotype were referred to as <italic>indica</italic>
 in that study, they almost certainly belonged to the <italic>aus</italic>
 subpopulation.</p>
</sec>
<sec id="Sec9"><title>Sub1 <italic>locus</italic>
</title>
<p>The <italic>Submergence 1</italic>
 (<italic>Sub1</italic>
) locus on chromosome 9 is a major QTL determining submergence tolerance in rice [<xref ref-type="bibr" rid="CR33">33</xref>
]. The <italic>Sub1</italic>
 locus is a cluster of three genes encoding putative ethylene response factors. <italic>Sub1B</italic>
 and <italic>Sub1C</italic>
 are present in all rice accessions tested to date, while <italic>Sub1A</italic>
 may be present or absent. Originally identified in the <italic>aus</italic>
 accession FR13A, <italic>Sub1A</italic>
 appears to be found only within the <italic>Indica</italic>
 varietal group [<xref ref-type="bibr" rid="CR33">33</xref>
]. <italic>Sub1A</italic>
 has two alleles: <italic>Sub1A-1</italic>
 is found in submergence-tolerant varieties, while <italic>Sub1A-2</italic>
 is found in intolerant varieties. A haplotype survey in <italic>O. sativa</italic>
 varieties also identified nine <italic>Sub1B</italic>
 and seven <italic>Sub1C</italic>
 alleles [<xref ref-type="bibr" rid="CR33">33</xref>
].</p>
<p>In the IR64 and DJ123 <italic>de novo</italic>
 assemblies reported here the <italic>Sub1</italic>
 locus lies within a single scaffold and haplotypes can be easily identified (Table S5 in Additional file <xref rid="MOESM1" ref-type="media">1</xref>
). In the IR64 assembly the <italic>Sub1A</italic>
 gene is present as the <italic>Sub1A-2</italic>
 allele, previously identified in submergence-intolerant accessions including IR64 [<xref ref-type="bibr" rid="CR33">33</xref>
]. For the <italic>Sub1B</italic>
 and <italic>C</italic>
 genes, IR64 carries the alleles <italic>Sub1B</italic>
-1 and <italic>Sub1C</italic>
-3, as reported [<xref ref-type="bibr" rid="CR33">33</xref>
]. <italic>Sub1A</italic>
 is absent from the DJ123 assembly, suggesting that this <italic>aus</italic>
 variety is not submergence tolerant. DJ123 carries a novel <italic>Sub1B</italic>
 allele (<italic>Sub1B</italic>
-10), and the previously identified <italic>Sub1C</italic>
-<italic>6</italic>
 allele. In the Nipponbare assembly, <italic>Sub1B</italic>
 and <italic>Sub1C</italic>
 lie within a single scaffold, and the alleles identified are in agreement with published results [<xref ref-type="bibr" rid="CR33">33</xref>
]. Nipponbare is not submergence tolerant and the <italic>Sub1A</italic>
 gene is absent in Nipponbare according to previous reports. Our <italic>de novo</italic>
 assembly is unresolved in the region that corresponds to the <italic>Sub1A</italic>
 gene, but a k-mer analysis using the methods and data applied above clearly shows a lack of coverage in the DJ123 and Nipponbare sequencing reads across the locus except for high copy repeats dispersed in the sequence (Figure <xref rid="Fig4" ref-type="fig">4</xref>
, top and bottom). Conversely, the k-mer coverage of the IR64 assembly is uniformly at the single-copy coverage level (approximately 100×), except for a small number of localized gaps in coverage, corresponding to SNPs in IR64 relative to the reference <italic>Sub1A</italic>
 sequence, and the high frequency repeats (Figure <xref rid="Fig4" ref-type="fig">4</xref>
, middle). In contrast, the k-mer coverage across <italic>Sub1B</italic>
 and <italic>Sub1C</italic>
 is consistently approximately 100×, except for isolated sharp gaps corresponding to variations relative to the reference sequences (Figures S5 and S6 in Additional file <xref rid="MOESM1" ref-type="media">1</xref>
).<fig id="Fig4"><label>Figure 4</label>
<caption><p><bold>K-mer coverage in the three assemblies across the</bold>
<bold><italic>Sub1A</italic>
</bold>
<bold>gene.</bold>
 In each panel, the k-mer coverage from the sequence reads of the respective genome is plotted along the sequence of the <italic>Sub1A-2</italic> allele. Only IR64 has consistent coverage across the gene, while the other two genomes have sparse coverage of a few repetitive k-mer sequences. For clarity, the k-mer coverage range 1× to 50,000× (log scale) is displayed in all the plots.</p>
</caption>
<graphic xlink:href="13059_2014_506_Fig4_HTML" id="MO4"></graphic>
</fig>
</p>
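<p>The presence/absence reasoning used for the k-mer coverage plots can be sketched in a few lines of Python. This is a toy illustration under stated assumptions, not the software used in this study: the function names, the short made-up sequences and the choice of k = 4 are hypothetical, and strand handling and sequencing errors are ignored.</p>
<preformat>
```python
from collections import Counter


def count_kmers(reads, k):
    """Count every k-mer occurring in a collection of reads."""
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return counts


def coverage_profile(reference, counts, k):
    """For each k-mer along the reference, report how often it occurs in the reads."""
    return [counts[reference[i:i + k]] for i in range(len(reference) - k + 1)]


K = 4
reference = "ACGTTGCAAGGCT"                 # made-up locus
present_reads = [reference] * 3             # locus present, identical to the reference
variant_reads = ["ACGTTGTAAGGCT"] * 3       # locus present, with one SNP (C to T)
absent_reads = ["GGGGGGGGGGGGG"] * 3        # locus absent from the sample

present = coverage_profile(reference, count_kmers(present_reads, K), K)
variant = coverage_profile(reference, count_kmers(variant_reads, K), K)
absent = coverage_profile(reference, count_kmers(absent_reads, K), K)
print(present)
print(variant)
print(absent)
```
</preformat>
<p>In this toy example, a single SNP removes coverage from the k consecutive reference k-mers that span it, producing a sharp, localized gap, whereas an absent locus shows no coverage along its whole length.</p>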
</sec>
<sec id="Sec10"><title>LRK <italic>gene cluster</italic>
</title>
<p>Fine-mapping of a yield-improving QTL on rice chromosome 2 identified a cluster of leucine-rich repeat receptor kinase genes [<xref ref-type="bibr" rid="CR59">59</xref>
], consisting of seven or eight intronless gene copies contained within a 40 to 50 kb genomic region. The QTL, originally introgressed from a wild rice accession (Dongxiang), was shown to increase grain yield of the recurrent parent Guichao2 (<italic>indica</italic>
) by about 25%. The <italic>LRK</italic>
 locus in Dongxiang carries an extra gene, <italic>LRK1</italic>
, absent from Guichao2. A survey of haplotype divergence in 13 rice accessions showed that <italic>LRK1</italic>
 is absent in only three <italic>indica</italic>
 accessions, suggesting that these haplotypes may have originated via gene loss.</p>
<p>In each of the three <italic>de novo</italic>
 assemblies reported here the <italic>LRK</italic>
 locus lies within a single scaffold and haplotypes can be easily identified (Table S5 in Additional file <xref rid="MOESM1" ref-type="media">1</xref>
). The Nipponbare assembly is in agreement with the reference sequence, with the exception of regions that the <italic>de novo</italic>
 assembly was not able to resolve because of high copy repeats (Figure S7 in Additional file <xref rid="MOESM1" ref-type="media">1</xref>
). <italic>LRK1</italic>
 is absent in the IR64 assembly as evident in the k-mer plot (Figure S8 in Additional file <xref rid="MOESM1" ref-type="media">1</xref>
), indicating that IR64 carries the seven-gene haplotype identified in other <italic>indica</italic>
 accessions [<xref ref-type="bibr" rid="CR59">59</xref>
]. According to our assembly and the corresponding k-mer analysis, the <italic>aus</italic>
 accession DJ123 carries <italic>LRK1.</italic>
 Based on sequence variation on the 5′ upstream region of <italic>LRK4</italic>
 and <italic>LRK6</italic>
, we can predict that the DJ123 haplotype for the LRK gene cluster is closest to the haplotypes identified in <italic>indica</italic>
 accessions in which <italic>LRK1</italic>
 is present (haplotypes A, B and C in Figure 3
 of [<xref ref-type="bibr" rid="CR59">59</xref>
]).</p>
</sec>
<sec id="Sec11"><title>Pup1 <italic>region</italic>
</title>
<p><italic>Phosphorus uptake1</italic>
 (<italic>Pup1</italic>
) is a major rice QTL associated with tolerance to phosphorus deficiency in soils [<xref ref-type="bibr" rid="CR60">60</xref>
,<xref ref-type="bibr" rid="CR61">61</xref>
]. The <italic>Pup1</italic>
 locus is a large, 90 kb region originally identified in Kasalath, an <italic>aus</italic>
 variety that is tolerant to phosphorus deficiency, but is absent in phosphorus starvation-intolerant varieties, including Nipponbare [<xref ref-type="bibr" rid="CR62">62</xref>
]. A gene encoding a protein kinase, <italic>Pstol1</italic>
, located within the 90 kb indel, is responsible for the P-uptake efficiency phenotype [<xref ref-type="bibr" rid="CR41">41</xref>
].</p>
<p>Of the three <italic>de novo</italic>
 assemblies reported here, the 90 kb indel is absent from both Nipponbare and IR64, but a large portion of it, including the <italic>Pstol1</italic>
 gene, is present in the <italic>aus</italic>
 variety DJ123 (Figure <xref rid="Fig5" ref-type="fig">5</xref>
; Table S5 in Additional file <xref rid="MOESM1" ref-type="media">1</xref>
). Although it is at least partially present, the region of the 90 kb indel described in Kasalath could not be fully resolved in our DJ123 assembly. This suggests that the 90 kb indel may be truncated and/or rearranged in some <italic>aus</italic>
 varieties. Interestingly, as shown in Figure <xref rid="Fig5" ref-type="fig">5</xref>
, the Kasalath reference sequence contains unresolved gaps flanking regions of very high k-mer coverage; therefore, longer reads may be necessary to assemble this region with confidence. The <italic>Pstol1</italic>
 gene sequence is complete in DJ123, and shows six SNPs relative to the Kasalath sequence (also apparent as abrupt drops in coverage in the k-mer coverage plot; Figures S9 and S10 in Additional file <xref rid="MOESM1" ref-type="media">1</xref>
). These SNPs were confirmed via Sanger sequencing on genomic DNA (Figure S11 in Additional file <xref rid="MOESM1" ref-type="media">1</xref>
). One of these SNPs introduces a premature stop codon, resulting in a protein that is only 136 amino acids long (the intact PSTOL1 protein is 324 amino acids) and, therefore, presumably non-functional.<fig id="Fig5"><label>Figure 5</label>
<caption><p><bold>K-mer coverage across the Kasalath/</bold>
<bold><italic>Pstol1</italic>
</bold>
<bold>gene in the three genomes, with 30 kbp of upstream and downstream flanking sequence.</bold>
 The k-mer coverage is plotted with respect to the reference Kasalath sequence (AB458444.1). The position of the <italic>Pstol1</italic>
 gene is indicated with green vertical bars. Also see Figure S9 in Additional file <xref rid="MOESM1" ref-type="media">1</xref>
 for a detailed view of the <italic>Pstol1</italic>
 coverage, and Figure S10 in Additional file <xref rid="MOESM1" ref-type="media">1</xref>
 for a plot of the entire Kasalath sequence. Unresolved gaps in the reference sequence are indicated with black vertical bars. Only DJ123 has consistent coverage across this region, especially upstream of the gene, while the other two genomes show complete gaps in coverage.</p>
</caption>
<graphic xlink:href="13059_2014_506_Fig5_HTML" id="MO5"></graphic>
</fig>
</p>
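<p>The consequence of a SNP that introduces a premature stop codon can be illustrated by translating a coding sequence up to its first stop. The Python sketch below uses the standard genetic code and a short made-up sequence (not the <italic>Pstol1</italic> sequence); the function name is hypothetical.</p>
<preformat>
```python
# Standard genetic code, with codons ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_to_first_stop(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


intact = "ATGGCTCAAGGTTAA"      # made-up CDS: ATG GCT CAA GGT TAA
truncated = "ATGGCTTAAGGTTAA"   # same CDS with a C-to-T SNP turning CAA into TAA
print(translate_to_first_stop(intact), translate_to_first_stop(truncated))
```
</preformat>
<p>A single nucleotide change is therefore sufficient to shorten the predicted protein, as observed here on a much larger scale for the DJ123 allele (136 versus 324 amino acids).</p>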
</sec>
</sec>
</sec>
<sec id="Sec12" sec-type="conclusion"><title>Conclusions</title>
<p>In this study, we sought to overcome the limitations of resequencing against a single reference genome by instead analyzing high quality <italic>de novo</italic>
 assemblies of multiple rice genomes to observe biologically significant changes between them. The rice accessions sequenced were selected to represent the <italic>indica</italic>
 (cv IR64), <italic>aus</italic>
 (DJ123) and <italic>temperate japonica</italic>
 (Nipponbare) subpopulations (Figure <xref rid="Fig1" ref-type="fig">1</xref>
). The inclusion of the high quality, BAC-by-BAC assembly of Nipponbare and the shotgun assembly of 93–11 provided a control that allowed us to assess the quality of the different datasets and <italic>de novo</italic>
 assembly strategies. It is apparent from comparing different assembly software that ALLPATHS-LG gave the best results in our hands (Table S3 in Additional file <xref rid="MOESM1" ref-type="media">1</xref>
). It is also apparent that the use of k-mer frequencies is a robust technique for characterizing repetitive regions, and enabled us to correctly characterize and validate genome-specific regions.</p>
<p>The three-way comparison among the different genomes was informative in identifying major shared and structurally variable regions of the rice genome. We were particularly interested in regions that were structurally unique to either the <italic>indica</italic>
 and/or the <italic>aus</italic>
 genome because they would likely have been discarded in previous re-sequencing efforts due to difficulties aligning their sequencing reads to the Nipponbare reference genome. This would be particularly true for longer genome-specific sequences, which would be completely absent in the alignments to the reference. We anticipate the ongoing 3,000 rice genomes project [<xref ref-type="bibr" rid="CR36">36</xref>
] will benefit greatly from having our assemblies available, especially because variations can then be mapped within regions that are not present in the Nipponbare reference used to date. We also anticipate that future work will systematically follow up on genome-specific gene loci with functional studies, as these loci are likely candidates for the phenotypic differences observed between the genomes.</p>
<p>Our analysis clearly demonstrates that the <italic>indica</italic>
 and the <italic>aus</italic>
 genomes are more distantly related than previously known. Because the <italic>aus</italic>
 subpopulation is phenotypically so similar to <italic>indica</italic>
, the degree of genetic differentiation has been underappreciated by breeders and geneticists alike [<xref ref-type="bibr" rid="CR43">43</xref>
,<xref ref-type="bibr" rid="CR63">63</xref>
,<xref ref-type="bibr" rid="CR64">64</xref>
]. The unusual characteristics of the <italic>aus</italic>
 subpopulation, combined with evidence of unique <italic>aus</italic>
 alleles at loci such as <italic>Rc</italic>
, conferring white versus colored pericarp [<xref ref-type="bibr" rid="CR19">19</xref>
], the <italic>Snorkel</italic>
 locus conferring deep water ability [<xref ref-type="bibr" rid="CR45">45</xref>
], the <italic>Pstol1</italic>
 locus conferring phosphorus-uptake efficiency [<xref ref-type="bibr" rid="CR41">41</xref>
], or the <italic>Sub1</italic>
 locus conferring submergence tolerance [<xref ref-type="bibr" rid="CR33">33</xref>
], all support the hypothesis that <italic>aus</italic>
 may have a unique domestication history compared to <italic>japonica</italic>
 and <italic>indica</italic>
. These findings underscore the importance of recognizing genetic subpopulation structure to guide plant breeders in identifying novel sources of variation for traits of interest. In recent years, many key biotic and abiotic stress tolerance genes have been discovered in <italic>aus</italic>
 varieties [<xref ref-type="bibr" rid="CR33">33</xref>
,<xref ref-type="bibr" rid="CR41">41</xref>
,<xref ref-type="bibr" rid="CR44">44</xref>
-<xref ref-type="bibr" rid="CR46">46</xref>
]. It is interesting to note that in several cases, the donor <italic>aus</italic>
 germplasm is referred to as <italic>indica</italic>
, underscoring how <italic>indica</italic>
 and <italic>aus</italic>
 are often confused, as noted for the DJ123 haplotype of the S5 hybrid sterility locus (see above).</p>
<p>The overall annotation of our Nipponbare assembly is quite close to that of the reference Nipponbare genome. This illustrates that the approach we describe here provides a genome sequence of considerable utility for further research. However, our contigs (as reflected in contig N50, as opposed to scaffold N50) are still fragmented by repeats that are too long and too complex to be fully resolved in short-read assemblies. This somewhat limits their application to the study of large structural rearrangements, as exemplified by the <italic>Pup1</italic>
 region that remains partially unassembled in the <italic>aus</italic>
 variety DJ123 (Figure <xref rid="Fig5" ref-type="fig">5</xref>
), and the modest differences we observed in annotation quality between core genes and genome-specific genes. We anticipate that some combination of short-read sequencing and newly emerging long-read sequencing, such as Pacific Biosciences Single Molecule Real Time sequencing [<xref ref-type="bibr" rid="CR65">65</xref>
], which can now produce reads approaching 100 kbp long, will soon overcome this limitation and provide assemblies approaching, or perhaps even surpassing, those provided by the vastly more expensive and time consuming BAC-by-BAC approach. Once this occurs it should spark an outburst of genomics studies of agronomically important plant genomes, greatly enriching our potential to understand their many unique qualities and characteristics and paving the way for enhanced utilization of natural variation in plant improvement.</p>
</sec>
<sec id="Sec13" sec-type="materials|methods"><title>Materials and methods</title>
<sec id="Sec14"><title>Plant material</title>
<p>Three rice (<italic>Oryza sativa</italic>
) accessions (Nipponbare, IR64, DJ123) were used in the study. Accession information (that is, Genetic Stocks <italic>Oryza</italic>
 (GSOR) identifier, accession name, country of origin, subpopulation) is summarized in Table <xref rid="Tab10" ref-type="table">10</xref>
 [<xref ref-type="bibr" rid="CR63">63</xref>
]. The plants were grown in the Guterman greenhouse facility at Cornell University. Leaf tissue was harvested from one-month-old seedlings and ground with a mortar and pestle, and DNA was extracted using the Qiagen Plant DNeasy kit (Qiagen, Valencia, CA, USA).<table-wrap id="Tab10"><label>Table 10</label>
<caption><p><bold>Accession information for the three rice genomes in the Genetic Stocks</bold>
<bold><italic>Oryza</italic>
</bold>
<bold>(GSOR) stock center</bold>
</p>
</caption>
<table frame="hsides" rules="groups"><thead><tr valign="top"><th><bold>GSOR ID</bold>
</th>
<th><bold>Accession name</bold>
</th>
<th><bold>Country of origin</bold>
</th>
<th><bold>Subpopulation</bold>
</th>
</tr>
</thead>
<tbody><tr valign="top"><td>301164</td>
<td>Nipponbare</td>
<td>Japan</td>
<td><italic>temperate japonica</italic>
</td>
</tr>
<tr valign="top"><td>312010</td>
<td>IR64</td>
<td>Philippines</td>
<td><italic>indica</italic>
</td>
</tr>
<tr valign="top"><td>301307</td>
<td>DJ123</td>
<td>Bangladesh</td>
<td><italic>aus</italic>
</td>
</tr>
</tbody>
</table>
</table-wrap>
</p>
</sec>
<sec id="Sec15"><title>DNA sequencing</title>
<p>The DNA sequencing was performed in the Cold Spring Harbor Laboratory Genome Center using Illumina HiSeq 2000 instruments. For each of the three varieties, three libraries were sequenced following the requirements and recommendations of the ALLPATHS-LG whole genome assembler: (1) a 180 bp fragment library sequenced as 2 × 100 bp reads; (2) an approximately 2 kbp jumping library sequenced as 2 × 50 bp reads; and (3) an approximately 5 kbp jumping library sequenced as 2 × 50 bp reads.</p>
<p>For the 180-bp overlap library the sample was mechanically fragmented by using the Covaris S2 System and then prepared based on the New England Biolabs NEBNext Illumina library protocol and ligated to standard Illumina paired-end adapters. To maximize sample throughput the samples were size-selected in 50-bp windows between 290 and 310 bp using the Caliper XT instrument. Each library was PCR enriched for 12 cycles and quantified using the Bioanalyzer.</p>
<p>For the jumping libraries, the Illumina mate-pair library protocol was used. The DNA was fragmented into 2 kbp and 5 kbp segments, again using the Covaris S2 System with programs that we developed in the lab. The fragmented DNA was then end-repaired with biotin-labeled dNTPs. The labeled fragments were circularized and fragmented again into 400 bp pieces. Fragments carrying the biotin labels were enriched, end-repaired, and ligated with adapters used for downstream processes. Each library was PCR enriched for 18 cycles and size-selected for 350 to 650 bp fragments. The final library consists of fragments made up of two DNA segments that were originally separated by approximately 2 kbp or approximately 5 kbp. Each of the libraries was sequenced to 30× to 80× sequence coverage, as recommended for the assembler.</p>
<p>Libraries were sequenced on one or more lanes of an Illumina HiSeq 2000 using paired-end 50- or 100-bp runs. Image processing and base calling were performed as the runs progressed with Illumina’s Real Time Analysis (RTA) software. The binary base call files were streamed to a shared Linux server for further processing. The Illumina Casava pipeline (v1.8) was used to process the binary files to fastq files containing the base-called reads and per base quality scores. Only reads passing the standard Illumina quality filter were included in the output files.</p>
</sec>
<sec id="Sec16"><title>Genome assembly</title>
<p>The ALLPATHS-LG version R41348 assembly algorithm was used for the assemblies. It consists of five major phases: (1) pre-assembly error correction, (2) merging of the overlapping fragment reads into extended reads, (3) constructing the unipath graph from the k-mers present in the reads, (4) scaffolding the unipaths with the jumping libraries, and (5) gap closing. To complete the five phases, the algorithm requires an overlapping pair fragment library and at least one jumping library, although the authors recommend at least two jumping libraries of approximately 2 kbp and approximately 5 kbp or larger. We assembled each of the genomes using approximately 50× coverage of the fragment library and approximately 30× coverage of each of the two jumping libraries using the recommended parameters, except we lowered the MIN_CONTIG size to 300 bp from the default 1,000 bp. This parameter controls the minimum contig size to be used for scaffolding, and our previous testing determined this change leads to (modestly) improved contig and scaffold statistics.</p>
<p>We also evaluated using SOAPdenovo2 [<xref ref-type="bibr" rid="CR66">66</xref>
] and SGA [<xref ref-type="bibr" rid="CR67">67</xref>
] for the assemblies (Table S3 in Additional file <xref rid="MOESM1" ref-type="media">1</xref>
), using the same fragment, 2 kbp, and 5 kbp libraries, but both assemblers produced substantially worse contiguity statistics under a variety of parameter settings. For SOAPdenovo2, we corrected the reads using the Quake error correction algorithm [<xref ref-type="bibr" rid="CR68">68</xref>
], and then ran seven assemblies with the de Bruijn graph k-mer size set to k = 31 through k = 45 (odd values only, as required). In every attempt the scaffold N50 size was below 10 kbp compared with >200 kbp for our best ALLPATHS-LG assembly. For SGA, we evaluated four assemblies with the string graph minimum overlap length of k = 71 through k = 77 (odd values only, as required), but the scaffold N50 size was below 15 kbp in every attempt. We hypothesize that ALLPATHS-LG achieved superior results because the algorithm automatically measures many of the properties of the sequencing data, and could therefore self-adjust the various cutoffs used by the algorithm for error correction, contigging, and scaffolding.</p>
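<p>The assemblers above are compared by scaffold N50, the length L such that scaffolds of length at least L contain at least half of the assembled bases. A minimal Python sketch of that statistic follows. The scaffold lengths are invented for illustration and do not come from any of the assemblies.</p>

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    Lengths are visited from longest to shortest, and the N50 is the length
    at which the running total first reaches half of all bases.
    """
    if not lengths:
        raise ValueError("lengths must not be empty")
    half_total = sum(lengths) / 2
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if running >= half_total:
            return length


# Hypothetical scaffold lengths in bp, for illustration only.
toy_scaffolds = [400_000, 250_000, 200_000, 100_000, 50_000]
print("toy scaffold N50:", n50(toy_scaffolds))
```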
<p>Applying nomenclature proposed by [<xref ref-type="bibr" rid="CR51">51</xref>
], we have named these assemblies to convey accession, quality, origin, and iteration as follows: Os-Nipponbare-Draft-CSHL-1.0, Os-IR64-Draft-CSHL-1.0, Os-DJ123-Draft-CSHL-1.0.</p>
</sec>
<sec id="Sec17"><title>Genome annotation</title>
<p>Repeat elements were masked using RepeatMasker [<xref ref-type="bibr" rid="CR69">69</xref>
] with a rice repeat library available from the Arizona Genomics Institute. Protein-coding genes were annotated using MAKER-P version 2.30, installed on the Texas Advanced Computing Center Lonestar cluster and provisioned through an iPlant Collaborative allocation [<xref ref-type="bibr" rid="CR52">52</xref>
,<xref ref-type="bibr" rid="CR70">70</xref>
-<xref ref-type="bibr" rid="CR72">72</xref>
]. Sequence evidence used as input for MAKER-P included <italic>Oryza</italic>
 expressed sequences (EST, cDNA, and mRNA) downloaded from the National Center for Biotechnology Information (NCBI), and annotated coding and protein sequences available for Nipponbare (IRGSP1.0 and MSU release 7) [<xref ref-type="bibr" rid="CR51">51</xref>
], 93-11 [<xref ref-type="bibr" rid="CR28">28</xref>
], and PA64s (Table S1 in Additional file <xref rid="MOESM1" ref-type="media">1</xref>
). <italic>Ab initio</italic>
 gene predictions made using FGENESH [<xref ref-type="bibr" rid="CR73">73</xref>
] were incorporated exogenously into the MAKER-P pipeline using the pred_gff parameter. The SNAP [<xref ref-type="bibr" rid="CR74">74</xref>
] <italic>ab initio</italic>
 predictor was run within MAKER-P using the O.sativa.hmm parameter provided with SNAP. To annotate protein domain structure and assign GO terms we used InterProScan 5 software [<xref ref-type="bibr" rid="CR75">75</xref>
], available within the iPlant Discovery Environment [<xref ref-type="bibr" rid="CR76">76</xref>
]. Among resulting InterPro domains we curated 21 as being associated with transposon-encoded genes and screened out MAKER-P annotations with these domains (IPR000477, IPR001207, IPR001584, IPR002559, IPR004242, IPR004252, IPR004264, IPR004330, IPR004332, IPR005063, IPR005162, IPR006912, IPR007321, IPR013103, IPR013242, IPR014736, IPR015401, IPR018289, IPR026103, IPR026960, IPR027806). To identify homologies we conducted BLASTP alignment to the plants subsection of NCBI RefSeq (release 63), using an e-value threshold of 1e-10.</p>
</sec>
<sec id="Sec18"><title>Whole genome comparisons</title>
<p>We used the MUMmer [<xref ref-type="bibr" rid="CR77">77</xref>
] whole genome alignment package and the GAGE assembly comparison scripts to compare the <italic>de novo</italic>
 assemblies to the reference Nipponbare and <italic>Indica</italic>
 genomes. Briefly, we aligned the assemblies to the genomes using <italic>nucmer</italic>
 with sensitive alignment settings (-c 65 -l 30 -banded -D 5). For base-level accuracy evaluations, we used the GAGE assembly comparison script, which further refines the alignments by computing the best set of one-to-one alignments between the two genomes using the dynamic programming algorithm <italic>delta-filter</italic>.
 This algorithm weighs the length of the alignments and their percentage identity to select one-to-one non-redundant alignments. This effectively discards spurious repetitive alignments from consideration, allowing us to focus on the meaningful differences between the genomes. Finally, the evaluation algorithm uses <italic>dnadiff</italic>
 to scan the remaining, non-repetitive alignments to summarize the agreement between the sequences, including characterizing the nature of any non-aligning bases as substitutions, small indels, or other larger structural variations. To characterize the unaligned regions of the reference genome, we converted the whole genome alignments into BED format. For this we did not exclude repetitive alignments, so that we could focus on novel sequence instead of copy number differences. We used BEDTools [<xref ref-type="bibr" rid="CR78">78</xref>
] to intersect the unaligned segments with the reference annotation, and summarized the size distributions of the unaligned segments using AMOS [<xref ref-type="bibr" rid="CR79">79</xref>
].</p>
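<p>The characterization of unaligned regions can be pictured as interval arithmetic: merge the aligned intervals, take their complement along the reference sequence, and measure how much of each annotated feature falls in the complement. The following Python sketch is a pure-Python stand-in for these operations under that assumption. It does not use or reproduce the BEDTools interface, and all coordinates and gene names are invented. Intervals are 0-based and half-open, as in BED format.</p>

```python
def merge_intervals(intervals):
    """Merge overlapping or adjacent half-open intervals (start, end)."""
    merged = []
    for start, end in sorted(intervals):
        if merged and merged[-1][1] >= start:
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(interval) for interval in merged]


def unaligned_segments(seq_length, aligned):
    """Return the parts of [0, seq_length) not covered by any alignment."""
    gaps = []
    cursor = 0
    for start, end in merge_intervals(aligned):
        if start > cursor:
            gaps.append((cursor, start))
        cursor = max(cursor, end)
    if seq_length > cursor:
        gaps.append((cursor, seq_length))
    return gaps


def overlap_bp(a, b):
    """Number of bases shared by two half-open intervals."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]))


# Hypothetical alignments and gene coordinates on a 1,000 bp sequence.
toy_aligned = [(0, 200), (150, 400), (600, 900)]
toy_genes = {"geneA": (450, 550), "geneB": (100, 300)}

gaps = unaligned_segments(1000, toy_aligned)
unaligned_per_gene = {
    name: sum(overlap_bp(span, gap) for gap in gaps)
    for name, span in toy_genes.items()
}
print("unaligned segments:", gaps)
print("unaligned bases per gene:", unaligned_per_gene)
```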
</sec>
<sec id="Sec19"><title>K-mer analysis</title>
<p>To evaluate the repeat composition, we selected a random sample of 400 million unassembled reads from each of the three genomes and used <italic>Jellyfish</italic>
 [<xref ref-type="bibr" rid="CR80">80</xref>
] to count the number of occurrences of all k-mers of length 21 in each read set. Length 21 was selected to be sufficiently long that the expected number of occurrences of a random k-mer was below 1, but short enough to be robust to sequencing errors. The modes of the three k-mer frequency distributions, excluding erroneous k-mers that occurred fewer than 10 times, were 60× (Nipponbare), 64× (DJ123), and 73× (IR64); each distribution is approximately negative binomial (Figure S1 in Additional file <xref rid="MOESM1" ref-type="media">1</xref>
). These values correspond to the average k-mer coverage for single-copy, non-repetitive regions of the genome. See Kelley <italic>et al</italic>
. [<xref ref-type="bibr" rid="CR68">68</xref>
] for a discussion of k-mer frequencies. We then used the AMOS program <italic>kmer-cov-plot</italic>
 [<xref ref-type="bibr" rid="CR79">79</xref>
] to report the k-mer coverage along the two reference genomes using the three databases of read k-mer frequencies. Unlike read alignments, which may be sensitive to repeats and variations, k-mer coverage provides a robust measure of repetitive content [<xref ref-type="bibr" rid="CR81">81</xref>
,<xref ref-type="bibr" rid="CR82">82</xref>
]. Single nucleotide variants are also readily apparent in these plots as abrupt gaps in coverage k base pairs long, while indels appear as longer gaps in coverage [<xref ref-type="bibr" rid="CR83">83</xref>
].</p>
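<p>The k-mer analysis can be sketched in a few lines of Python. This is a toy illustration rather than the Jellyfish or AMOS implementation: it counts forward-strand k-mers only, uses k = 5 on an invented 24 bp sequence, and takes 390 Mbp as an assumed genome size that is not stated in this section. It first checks that a random 21-mer is expected to occur far less than once in a genome of that size (approximately the genome size divided by 4 to the power 21), then shows that, in this toy case, a single substitution removes coverage at five consecutive k-mer start positions.</p>

```python
from collections import Counter


def count_kmers(reads, k):
    """Count every length-k substring across a collection of reads."""
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return counts


def kmer_coverage(reference, counts, k):
    """For each k-mer start position in the reference, report how many
    times that k-mer was observed in the reads."""
    return [counts[reference[i:i + k]] for i in range(len(reference) - k + 1)]


# Expected occurrences of a random 21-mer, assuming a 390 Mbp genome.
ASSUMED_GENOME_SIZE_BP = 390_000_000
expected_random_hits = ASSUMED_GENOME_SIZE_BP / 4 ** 21
print(f"expected occurrences of a random 21-mer: {expected_random_hits:.1e}")

# Toy example: a reference and a sample that differs by one substitution.
K = 5
reference = "ACGTTGCAAGCTTAGGCTACCGAT"
sample = reference[:10] + "A" + reference[11:]  # C to A at position 10

# Error-free 8 bp reads tiled across the sample at every offset.
reads = [sample[i:i + 8] for i in range(len(sample) - 8 + 1)]
coverage = kmer_coverage(reference, count_kmers(reads, K), K)
zero_positions = [i for i, depth in enumerate(coverage) if depth == 0]

print("k-mer coverage along the reference:", coverage)
print("start positions with zero coverage:", zero_positions)
```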
</sec>
<sec id="Sec20"><title>Pan-genome analysis</title>
<p>The pan-genome analysis followed the reference-based analysis above, using <italic>nucmer</italic>
 to align the genomes to each other, <italic>BEDTools</italic>
 to find the genome-specific and shared regions of the genomes, and the <italic>jellyfish/AMOS</italic>
 k-mer analysis as described above to classify unique and repetitive sequences. We also used <italic>BEDTools</italic>
 to intersect the genome-specific/shared regions against their respective annotations to determine how the exonic bases were shared across the genomes. We summarized the genome-specific/shared exonic bases into gene counts by counting the total number of shared or specific exonic bases across all possible transcripts for a gene, and assigned the gene to the sector of the Venn diagram with the most bases associated with it. For the purposes of the Venn diagram (Figure <xref rid="Fig2" ref-type="fig">2</xref>
A), wherever possible, the Nipponbare base or gene counts were used, followed by the values from IR64, and then followed by the DJ123 specific values, although the values were all largely consistent.</p>
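<p>The gene-level summary described above can be sketched as follows: exonic bases are pooled per gene and per Venn sector across all transcripts, and each gene is assigned to the sector holding the most bases. The Python code below is a sketch under that reading. The record layout, sector labels, gene identifiers, and the tie-breaking rule (the sector encountered first wins) are assumptions made for illustration.</p>

```python
from collections import Counter, defaultdict


def assign_genes_to_sectors(exon_records):
    """Assign each gene to the Venn sector containing most of its exonic bases.

    exon_records is an iterable of (gene_id, sector, n_bases) tuples pooled
    across all transcripts of each gene (a hypothetical input layout).
    """
    totals = defaultdict(Counter)
    for gene_id, sector, n_bases in exon_records:
        totals[gene_id][sector] += n_bases
    assignment = {}
    for gene_id, per_sector in totals.items():
        # most_common(1) yields the sector with the largest base count;
        # on a tie, the sector encountered first is returned.
        assignment[gene_id] = per_sector.most_common(1)[0][0]
    return assignment


# Hypothetical exonic base counts, for illustration only.
toy_records = [
    ("gene1", "shared_all", 900),
    ("gene1", "Nipponbare_only", 100),
    ("gene2", "IR64_only", 300),
    ("gene2", "shared_all", 120),
    ("gene2", "IR64_only", 50),
    ("gene3", "DJ123_only", 400),
]
assignment = assign_genes_to_sectors(toy_records)
genes_per_sector = Counter(assignment.values())
print(assignment)
print(dict(genes_per_sector))
```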
</sec>
<sec id="Sec21"><title>PCR and sequencing validation of specific regions</title>
<p>The same algorithms and parameters as the pan-genome analysis were also used to characterize the specific regions identified in the paper. PCR and/or sequencing validation was performed on genomic DNA extracted from tissue collected from independently grown plants obtained from the same seed source used for Illumina sequencing. Genomic DNA was extracted from young leaf tissue using the Qiagen DNeasy Plant Mini Kit. Primers used for validation of 10 of the longest genome-specific sequences from each rice line, and of the <italic>S5</italic>
 and <italic>Pup1</italic>
 loci, are listed in Table S4a-c in Additional file <xref rid="MOESM1" ref-type="media">1</xref>
. Sanger sequencing was performed at the Biotechnology Resource Center at Cornell University.</p>
</sec>
<sec id="Sec22"><title>Data access</title>
<p>The read data, assemblies, annotations, and pan-genome alignments are posted on the CSHL website at [<xref ref-type="bibr" rid="CR84">84</xref>
]. The NCBI Sequence Read Archive (SRA) accession numbers for the short read data used in this study are listed in Table <xref rid="Tab11" ref-type="table">11</xref>
. Analysis software packages are available open source from the websites for ALLPATHS-LG [<xref ref-type="bibr" rid="CR85">85</xref>
], MUMmer [<xref ref-type="bibr" rid="CR86">86</xref>
], AMOS [<xref ref-type="bibr" rid="CR87">87</xref>
], Jellyfish [<xref ref-type="bibr" rid="CR88">88</xref>
], and BEDTools [<xref ref-type="bibr" rid="CR89">89</xref>
].<table-wrap id="Tab11"><label>Table 11</label>
<caption><p><bold>NCBI Sequence Read Archive accession codes for sequencing data used in this study</bold>
</p>
</caption>
<table frame="hsides" rules="groups"><thead><tr valign="top"><th><bold>Genome</bold>
</th>
<th><bold>Library type</bold>
</th>
<th><bold>Read length</bold>
</th>
<th><bold>SRA accession</bold>
</th>
</tr>
</thead>
<tbody><tr valign="top"><td>Nipponbare</td>
<td>180 bp fragment</td>
<td align="center">2 × 101</td>
<td>SRX734432</td>
</tr>
<tr valign="top"><td></td>
<td>2 kbp jump</td>
<td align="center">2 × 50</td>
<td>SRX179260</td>
</tr>
<tr valign="top"><td></td>
<td>5 kbp jump</td>
<td align="center">2 × 50</td>
<td>SRX179265</td>
</tr>
<tr valign="top"><td>IR64</td>
<td>180 bp fragment</td>
<td align="center">2 × 101</td>
<td>SRX180537</td>
</tr>
<tr valign="top"><td></td>
<td>2 kbp jump</td>
<td align="center">2 × 50</td>
<td>SRX180555</td>
</tr>
<tr valign="top"><td></td>
<td>5 kbp jump</td>
<td align="center">2 × 50</td>
<td>SRX180597</td>
</tr>
<tr valign="top"><td>DJ123</td>
<td>180 bp fragment</td>
<td align="center">2 × 101</td>
<td>SRX180718</td>
</tr>
<tr valign="top"><td></td>
<td>2 kbp jump</td>
<td align="center">2 × 50</td>
<td>SRX180822</td>
</tr>
<tr valign="top"><td></td>
<td>5 kbp jump</td>
<td align="center">2 × 50</td>
<td>SRX180892</td>
</tr>
</tbody>
</table>
</table-wrap>
</p>
</sec>
</sec>
</body>
<back><app-group><app id="App1"><sec id="Sec23"><title>Additional files</title>
<p><media position="anchor" xlink:href="13059_2014_506_MOESM1_ESM.pdf" id="MOESM1"><label>Additional file 1:</label>
<caption><p><bold>Contains the supplementary tables and figures described in the manuscript.</bold>
</p>
</caption>
</media>
<media position="anchor" xlink:href="13059_2014_506_MOESM2_ESM.xlsx" id="MOESM2"><label>Additional file 2:</label>
<caption><p><bold>An Excel spreadsheet listing the putative strain-specific genes.</bold>
</p>
</caption>
</media>
</p>
</sec>
</app>
</app-group>
<glossary><title>Abbreviations</title>
<def-list><def-list><def-item><term>BAC</term>
<def><p>bacterial artificial chromosome</p>
</def>
</def-item>
<def-item><term>bp</term>
<def><p>base pair</p>
</def>
</def-item>
<def-item><term>CDS</term>
<def><p>coding sequence</p>
</def>
</def-item>
<def-item><term>EST</term>
<def><p>expressed sequence tag</p>
</def>
</def-item>
<def-item><term>GO</term>
<def><p>Gene Ontology</p>
</def>
</def-item>
<def-item><term>IRGSP</term>
<def><p>International Rice Genome Sequencing Project</p>
</def>
</def-item>
<def-item><term>MSU</term>
<def><p>Michigan State University</p>
</def>
</def-item>
<def-item><term>NCBI</term>
<def><p>National Center for Biotechnology Information</p>
</def>
</def-item>
<def-item><term>NGS</term>
<def><p>next generation sequencing</p>
</def>
</def-item>
<def-item><term>ORF</term>
<def><p>open reading frame</p>
</def>
</def-item>
<def-item><term>PCR</term>
<def><p>polymerase chain reaction</p>
</def>
</def-item>
<def-item><term>QTL</term>
<def><p>quantitative trait locus</p>
</def>
</def-item>
<def-item><term>SNP</term>
<def><p>single nucleotide polymorphism</p>
</def>
</def-item>
<def-item><term>UTR</term>
<def><p>untranslated region</p>
</def>
</def-item>
</def-list>
</def-list>
</glossary>
<fn-group><fn><p>Michael C Schatz, Lyza G Maron, and Joshua C Stein are equal contributors.</p>
</fn>
<fn><p><bold>Competing interests</bold>
</p>
<p>WRM has participated in Illumina-sponsored meetings over the past four years and received travel reimbursement and an honorarium for presenting at these events. Illumina had no role in decisions relating to the study/work to be published, data collection, data analysis, or the decision to publish. WRM has participated in Pacific Biosciences-sponsored meetings over the past three years and received travel reimbursement for presenting at these events. WRM is a founder and shareholder of Orion Genomics, which focuses on plant genomics and cancer genetics.</p>
</fn>
<fn><p><bold>Authors’ contributions</bold>
</p>
<p>SRM, WRM, and DW designed the study. MCS, LGM, JCS, AHW, JG, EB, HL, MK, and JMC performed the computational analysis. LGM, MK, EA, EG, MHW, and SRM performed the experimental analysis. MCS, LGM, JCS, JMC, DW, SRM, and WRM wrote the manuscript. All authors read and approved the final manuscript.</p>
</fn>
</fn-group>
<ack><title>Acknowledgements</title>
<p>This project was supported in part by National Science Foundation awards PGRP-1026555 to SMc, DBI-126383 to DW and MCS, DBI-1350041 to MCS, IOS-1032105 to WRM and DW, and DBI-0933128 to WRM. It was also supported in part by National Institutes of Health award R01-HG006677 to MCS. We would like to thank Adam Phillippy and Sergey Koren for their helpful discussions with the GAGE assembly validation software and pan-genome alignments; Aaron Quinlan for his helpful discussions with BEDTools; and David Jaffe, Iain MacCallum, Ted Sharpe, Filipe Joao Ribeiro, and all the ALLPATHS-LG developers and support staff for the assistance debugging and troubleshooting the assemblies.</p>
</ack>
<ref-list id="Bib1"><title>References</title>
<ref id="CR1"><label>1.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Garris</surname>
<given-names>AJ</given-names>
</name>
<name><surname>Tai</surname>
<given-names>TH</given-names>
</name>
<name><surname>Coburn</surname>
<given-names>J</given-names>
</name>
<name><surname>Kresovich</surname>
<given-names>S</given-names>
</name>
<name><surname>McCouch</surname>
<given-names>S</given-names>
</name>
</person-group>
<article-title>Genetic structure and diversity in Oryza sativa L</article-title>
<source>Genetics</source>
<year>2005</year>
<volume>169</volume>
<fpage>1631</fpage>
<lpage>1638</lpage>
<pub-id pub-id-type="doi">10.1534/genetics.104.035642</pub-id>
<pub-id pub-id-type="pmid">15654106</pub-id>
</element-citation>
</ref>
<ref id="CR2"><label>2.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname>
<given-names>X</given-names>
</name>
<name><surname>Kurata</surname>
<given-names>N</given-names>
</name>
<name><surname>Wei</surname>
<given-names>X</given-names>
</name>
<name><surname>Wang</surname>
<given-names>ZX</given-names>
</name>
<name><surname>Wang</surname>
<given-names>A</given-names>
</name>
<name><surname>Zhao</surname>
<given-names>Q</given-names>
</name>
<name><surname>Zhao</surname>
<given-names>Y</given-names>
</name>
<name><surname>Liu</surname>
<given-names>K</given-names>
</name>
<name><surname>Lu</surname>
<given-names>H</given-names>
</name>
<name><surname>Li</surname>
<given-names>W</given-names>
</name>
<name><surname>Guo</surname>
<given-names>Y</given-names>
</name>
<name><surname>Lu</surname>
<given-names>Y</given-names>
</name>
<name><surname>Zhou</surname>
<given-names>C</given-names>
</name>
<name><surname>Fan</surname>
<given-names>D</given-names>
</name>
<name><surname>Weng</surname>
<given-names>Q</given-names>
</name>
<name><surname>Zhu</surname>
<given-names>C</given-names>
</name>
<name><surname>Huang</surname>
<given-names>T</given-names>
</name>
<name><surname>Zhang</surname>
<given-names>L</given-names>
</name>
<name><surname>Wang</surname>
<given-names>Y</given-names>
</name>
<name><surname>Feng</surname>
<given-names>L</given-names>
</name>
<name><surname>Furuumi</surname>
<given-names>H</given-names>
</name>
<name><surname>Kubo</surname>
<given-names>T</given-names>
</name>
<name><surname>Miyabayashi</surname>
<given-names>T</given-names>
</name>
<name><surname>Yuan</surname>
<given-names>X</given-names>
</name>
<name><surname>Xu</surname>
<given-names>Q</given-names>
</name>
<name><surname>Dong</surname>
<given-names>G</given-names>
</name>
<name><surname>Zhan</surname>
<given-names>Q</given-names>
</name>
<name><surname>Li</surname>
<given-names>C</given-names>
</name>
<name><surname>Fujiyama</surname>
<given-names>A</given-names>
</name>
<name><surname>Toyoda</surname>
<given-names>A</given-names>
</name>
<etal></etal>
</person-group>
<article-title>A map of rice genome variation reveals the origin of cultivated rice</article-title>
<source>Nature</source>
<year>2012</year>
<volume>490</volume>
<fpage>497</fpage>
<lpage>501</lpage>
<pub-id pub-id-type="doi">10.1038/nature11532</pub-id>
<pub-id pub-id-type="pmid">23034647</pub-id>
</element-citation>
</ref>
<ref id="CR3"><label>3.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Zhao</surname>
<given-names>KY</given-names>
</name>
<name><surname>Wright</surname>
<given-names>M</given-names>
</name>
<name><surname>Kimball</surname>
<given-names>J</given-names>
</name>
<name><surname>Eizenga</surname>
<given-names>G</given-names>
</name>
<name><surname>McClung</surname>
<given-names>A</given-names>
</name>
<name><surname>Kovach</surname>
<given-names>M</given-names>
</name>
<name><surname>Tyagi</surname>
<given-names>W</given-names>
</name>
<name><surname>Ali</surname>
<given-names>ML</given-names>
</name>
<name><surname>Tung</surname>
<given-names>CW</given-names>
</name>
<name><surname>Reynolds</surname>
<given-names>A</given-names>
</name>
<name><surname>Bustamante</surname>
<given-names>CD</given-names>
</name>
<name><surname>McCouch</surname>
<given-names>SR</given-names>
</name>
</person-group>
<article-title>Genomic diversity and introgression in O. sativa reveal the impact of domestication and breeding on the rice genome</article-title>
<source>PLoS One</source>
<year>2010</year>
<volume>5</volume>
<fpage>e10780</fpage>
<pub-id pub-id-type="doi">10.1371/journal.pone.0010780</pub-id>
<pub-id pub-id-type="pmid">20520727</pub-id>
</element-citation>
</ref>
<ref id="CR4"><label>4.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Boyko</surname>
<given-names>AR</given-names>
</name>
<name><surname>Quignon</surname>
<given-names>P</given-names>
</name>
<name><surname>Li</surname>
<given-names>L</given-names>
</name>
<name><surname>Schoenebeck</surname>
<given-names>JJ</given-names>
</name>
<name><surname>Degenhardt</surname>
<given-names>JD</given-names>
</name>
<name><surname>Lohmueller</surname>
<given-names>KE</given-names>
</name>
<name><surname>Zhao</surname>
<given-names>K</given-names>
</name>
<name><surname>Brisbin</surname>
<given-names>A</given-names>
</name>
<name><surname>Parker</surname>
<given-names>HG</given-names>
</name>
<name><surname>vonHoldt</surname>
<given-names>BM</given-names>
</name>
<name><surname>Cargill</surname>
<given-names>M</given-names>
</name>
<name><surname>Auton</surname>
<given-names>A</given-names>
</name>
<name><surname>Reynolds</surname>
<given-names>A</given-names>
</name>
<name><surname>Elkahloun</surname>
<given-names>AG</given-names>
</name>
<name><surname>Castelhano</surname>
<given-names>M</given-names>
</name>
<name><surname>Mosher</surname>
<given-names>DS</given-names>
</name>
<name><surname>Sutter</surname>
<given-names>NB</given-names>
</name>
<name><surname>Johnson</surname>
<given-names>GS</given-names>
</name>
<name><surname>Novembre</surname>
<given-names>J</given-names>
</name>
<name><surname>Hubisz</surname>
<given-names>MJ</given-names>
</name>
<name><surname>Siepel</surname>
<given-names>A</given-names>
</name>
<name><surname>Wayne</surname>
<given-names>RK</given-names>
</name>
<name><surname>Bustamante</surname>
<given-names>CD</given-names>
</name>
<name><surname>Ostrander</surname>
<given-names>EA</given-names>
</name>
</person-group>
<article-title>A simple genetic architecture underlies morphological variation in dogs</article-title>
<source>PLoS Biol</source>
<year>2010</year>
<volume>8</volume>
<fpage>e1000451</fpage>
<pub-id pub-id-type="doi">10.1371/journal.pbio.1000451</pub-id>
<pub-id pub-id-type="pmid">20711490</pub-id>
</element-citation>
</ref>
<ref id="CR5"><label>5.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Weir</surname>
<given-names>BS</given-names>
</name>
<name><surname>Cardon</surname>
<given-names>LR</given-names>
</name>
<name><surname>Anderson</surname>
<given-names>AD</given-names>
</name>
<name><surname>Nielsen</surname>
<given-names>DM</given-names>
</name>
<name><surname>Hill</surname>
<given-names>WG</given-names>
</name>
</person-group>
<article-title>Measures of human population structure show heterogeneity among genomic regions</article-title>
<source>Genome Res</source>
<year>2005</year>
<volume>15</volume>
<fpage>1468</fpage>
<lpage>1476</lpage>
<pub-id pub-id-type="doi">10.1101/gr.4398405</pub-id>
<pub-id pub-id-type="pmid">16251456</pub-id>
</element-citation>
</ref>
<ref id="CR6"><label>6.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Matsuoka</surname>
<given-names>Y</given-names>
</name>
<name><surname>Vigouroux</surname>
<given-names>Y</given-names>
</name>
<name><surname>Goodman</surname>
<given-names>MM</given-names>
</name>
<name><surname>Sanchez</surname>
<given-names>GJ</given-names>
</name>
<name><surname>Buckler</surname>
<given-names>E</given-names>
</name>
<name><surname>Doebley</surname>
<given-names>J</given-names>
</name>
</person-group>
<article-title>A single domestication for maize shown by multilocus microsatellite genotyping</article-title>
<source>Proc Natl Acad Sci U S A</source>
<year>2002</year>
<volume>99</volume>
<fpage>6080</fpage>
<lpage>6084</lpage>
<pub-id pub-id-type="doi">10.1073/pnas.052125199</pub-id>
<pub-id pub-id-type="pmid">11983901</pub-id>
</element-citation>
</ref>
<ref id="CR7"><label>7.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Zhao</surname>
<given-names>K</given-names>
</name>
<name><surname>Tung</surname>
<given-names>CW</given-names>
</name>
<name><surname>Eizenga</surname>
<given-names>GC</given-names>
</name>
<name><surname>Wright</surname>
<given-names>MH</given-names>
</name>
<name><surname>Ali</surname>
<given-names>ML</given-names>
</name>
<name><surname>Price</surname>
<given-names>AH</given-names>
</name>
<name><surname>Norton</surname>
<given-names>GJ</given-names>
</name>
<name><surname>Islam</surname>
<given-names>MR</given-names>
</name>
<name><surname>Reynolds</surname>
<given-names>A</given-names>
</name>
<name><surname>Mezey</surname>
<given-names>J</given-names>
</name>
<name><surname>McClung</surname>
<given-names>AM</given-names>
</name>
<name><surname>Bustamante</surname>
<given-names>CD</given-names>
</name>
<name><surname>McCouch</surname>
<given-names>SR</given-names>
</name>
</person-group>
<article-title>Genome-wide association mapping reveals a rich genetic architecture of complex traits in Oryza sativa</article-title>
<source>Nat Commun</source>
<year>2011</year>
<volume>2</volume>
<fpage>467</fpage>
<pub-id pub-id-type="doi">10.1038/ncomms1467</pub-id>
<pub-id pub-id-type="pmid">21915109</pub-id>
</element-citation>
</ref>
<ref id="CR8"><label>8.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ma</surname>
<given-names>J</given-names>
</name>
<name><surname>Bennetzen</surname>
<given-names>JL</given-names>
</name>
</person-group>
<article-title>Rapid recent growth and divergence of rice nuclear genomes</article-title>
<source>Proc Natl Acad Sci U S A</source>
<year>2004</year>
<volume>101</volume>
<fpage>12404</fpage>
<lpage>12410</lpage>
<pub-id pub-id-type="doi">10.1073/pnas.0403715101</pub-id>
<pub-id pub-id-type="pmid">15240870</pub-id>
</element-citation>
</ref>
<ref id="CR9"><label>9.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Cheng</surname>
<given-names>CY</given-names>
</name>
<name><surname>Motohashi</surname>
<given-names>R</given-names>
</name>
<name><surname>Tsuchimoto</surname>
<given-names>S</given-names>
</name>
<name><surname>Fukuta</surname>
<given-names>Y</given-names>
</name>
<name><surname>Ohtsubo</surname>
<given-names>H</given-names>
</name>
<name><surname>Ohtsubo</surname>
<given-names>E</given-names>
</name>
</person-group>
<article-title>Polyphyletic origin of cultivated rice: Based on the interspersion pattern of SINEs</article-title>
<source>Mol Biol Evol</source>
<year>2003</year>
<volume>20</volume>
<fpage>67</fpage>
<lpage>75</lpage>
<pub-id pub-id-type="doi">10.1093/molbev/msg004</pub-id>
<pub-id pub-id-type="pmid">12519908</pub-id>
</element-citation>
</ref>
<ref id="CR10"><label>10.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Kovach</surname>
<given-names>MJ</given-names>
</name>
<name><surname>Sweeney</surname>
<given-names>MT</given-names>
</name>
<name><surname>McCouch</surname>
<given-names>SR</given-names>
</name>
</person-group>
<article-title>New insights into the history of rice domestication</article-title>
<source>Trends Genet</source>
<year>2007</year>
<volume>23</volume>
<fpage>578</fpage>
<lpage>587</lpage>
<pub-id pub-id-type="doi">10.1016/j.tig.2007.08.012</pub-id>
<pub-id pub-id-type="pmid">17963977</pub-id>
</element-citation>
</ref>
<ref id="CR11"><label>11.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Roy</surname>
<given-names>SC</given-names>
</name>
</person-group>
<article-title>A preliminary classification of the wild rices of the Central Province and Berar</article-title>
<source>Agric J India</source>
<year>1921</year>
<volume>16</volume>
<fpage>365</fpage>
<lpage>380</lpage>
</element-citation>
</ref>
<ref id="CR12"><label>12.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Second</surname>
<given-names>G</given-names>
</name>
</person-group>
<article-title>Origin of the genic diversity of cultivated rice (Oryza-spp) - study of the polymorphism scored at 40 isoenzyme loci</article-title>
<source>Jpn J Genet</source>
<year>1982</year>
<volume>57</volume>
<fpage>25</fpage>
<lpage>57</lpage>
<pub-id pub-id-type="doi">10.1266/jjg.57.25</pub-id>
</element-citation>
</ref>
<ref id="CR13"><label>13.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Second</surname>
<given-names>G</given-names>
</name>
</person-group>
<article-title>Molecular markers in rice systematics and the evaluation of genetic resources</article-title>
<source>Biotechnol Agric For</source>
<year>1991</year>
<volume>14</volume>
<fpage>468</fpage>
<lpage>494</lpage>
</element-citation>
</ref>
<ref id="CR14"><label>14.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ding</surname>
<given-names>J</given-names>
</name>
<name><surname>Araki</surname>
<given-names>H</given-names>
</name>
<name><surname>Wang</surname>
<given-names>Q</given-names>
</name>
<name><surname>Zhang</surname>
<given-names>P</given-names>
</name>
<name><surname>Yang</surname>
<given-names>S</given-names>
</name>
<name><surname>Chen</surname>
<given-names>JQ</given-names>
</name>
<name><surname>Tian</surname>
<given-names>D</given-names>
</name>
</person-group>
<article-title>Highly asymmetric rice genomes</article-title>
<source>BMC Genomics</source>
<year>2007</year>
<volume>8</volume>
<fpage>154</fpage>
<pub-id pub-id-type="doi">10.1186/1471-2164-8-154</pub-id>
<pub-id pub-id-type="pmid">17555605</pub-id>
</element-citation>
</ref>
<ref id="CR15"><label>15.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname>
<given-names>XH</given-names>
</name>
<name><surname>Lu</surname>
<given-names>TT</given-names>
</name>
<name><surname>Yu</surname>
<given-names>SL</given-names>
</name>
<name><surname>Li</surname>
<given-names>Y</given-names>
</name>
<name><surname>Huang</surname>
<given-names>YC</given-names>
</name>
<name><surname>Huang</surname>
<given-names>T</given-names>
</name>
<name><surname>Zhang</surname>
<given-names>L</given-names>
</name>
<name><surname>Zhu</surname>
<given-names>JJ</given-names>
</name>
<name><surname>Zhao</surname>
<given-names>Q</given-names>
</name>
<name><surname>Fan</surname>
<given-names>DL</given-names>
</name>
<name><surname>Mu</surname>
<given-names>J</given-names>
</name>
<name><surname>Shangguan</surname>
<given-names>YY</given-names>
</name>
<name><surname>Feng</surname>
<given-names>Q</given-names>
</name>
<name><surname>Guan</surname>
<given-names>JP</given-names>
</name>
<name><surname>Ying</surname>
<given-names>K</given-names>
</name>
<name><surname>Zhang</surname>
<given-names>Y</given-names>
</name>
<name><surname>Lin</surname>
<given-names>ZX</given-names>
</name>
<name><surname>Sun</surname>
<given-names>ZX</given-names>
</name>
<name><surname>Qian</surname>
<given-names>Q</given-names>
</name>
<name><surname>Lu</surname>
<given-names>YP</given-names>
</name>
<name><surname>Han</surname>
<given-names>B</given-names>
</name>
</person-group>
<article-title>A collection of 10,096 indica rice full-length cDNAs reveals highly expressed sequence divergence between Oryza sativa indica and japonica subspecies</article-title>
<source>Plant Mol Biol</source>
<year>2007</year>
<volume>65</volume>
<fpage>403</fpage>
<lpage>415</lpage>
<pub-id pub-id-type="doi">10.1007/s11103-007-9174-7</pub-id>
<pub-id pub-id-type="pmid">17522955</pub-id>
</element-citation>
</ref>
<ref id="CR16"><label>16.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Feltus</surname>
<given-names>FA</given-names>
</name>
<name><surname>Wan</surname>
<given-names>J</given-names>
</name>
<name><surname>Schulze</surname>
<given-names>SR</given-names>
</name>
<name><surname>Estill</surname>
<given-names>JC</given-names>
</name>
<name><surname>Jiang</surname>
<given-names>N</given-names>
</name>
<name><surname>Paterson</surname>
<given-names>AH</given-names>
</name>
</person-group>
<article-title>An SNP resource for rice genetics and breeding based on subspecies Indica and Japonica genome alignments</article-title>
<source>Genome Res</source>
<year>2004</year>
<volume>14</volume>
<fpage>1812</fpage>
<lpage>1819</lpage>
<pub-id pub-id-type="doi">10.1101/gr.2479404</pub-id>
<pub-id pub-id-type="pmid">15342564</pub-id>
</element-citation>
</ref>
<ref id="CR17"><label>17.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname>
<given-names>XH</given-names>
</name>
<name><surname>Lu</surname>
<given-names>GJ</given-names>
</name>
<name><surname>Zhao</surname>
<given-names>Q</given-names>
</name>
<name><surname>Liu</surname>
<given-names>XH</given-names>
</name>
<name><surname>Han</surname>
<given-names>B</given-names>
</name>
</person-group>
<article-title>Genome-wide analysis of transposon insertion polymorphisms reveals intraspecific variation in cultivated rice</article-title>
<source>Plant Physiol</source>
<year>2008</year>
<volume>148</volume>
<fpage>25</fpage>
<lpage>40</lpage>
<pub-id pub-id-type="doi">10.1104/pp.108.121491</pub-id>
<pub-id pub-id-type="pmid">18650402</pub-id>
</element-citation>
</ref>
<ref id="CR18"><label>18.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Shomura</surname>
<given-names>A</given-names>
</name>
<name><surname>Izawa</surname>
<given-names>T</given-names>
</name>
<name><surname>Ebana</surname>
<given-names>K</given-names>
</name>
<name><surname>Ebitani</surname>
<given-names>T</given-names>
</name>
<name><surname>Kanegae</surname>
<given-names>H</given-names>
</name>
<name><surname>Konishi</surname>
<given-names>S</given-names>
</name>
<name><surname>Yano</surname>
<given-names>M</given-names>
</name>
</person-group>
<article-title>Deletion in a gene associated with grain size increased yields during rice domestication</article-title>
<source>Nat Genet</source>
<year>2008</year>
<volume>40</volume>
<fpage>1023</fpage>
<lpage>1028</lpage>
<pub-id pub-id-type="doi">10.1038/ng.169</pub-id>
<pub-id pub-id-type="pmid">18604208</pub-id>
</element-citation>
</ref>
<ref id="CR19"><label>19.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Sweeney</surname>
<given-names>MT</given-names>
</name>
<name><surname>Thomson</surname>
<given-names>MJ</given-names>
</name>
<name><surname>Cho</surname>
<given-names>YG</given-names>
</name>
<name><surname>Park</surname>
<given-names>YJ</given-names>
</name>
<name><surname>Williamson</surname>
<given-names>SH</given-names>
</name>
<name><surname>Bustamante</surname>
<given-names>CD</given-names>
</name>
<name><surname>McCouch</surname>
<given-names>SR</given-names>
</name>
</person-group>
<article-title>Global dissemination of a single mutation conferring white pericarp in rice</article-title>
<source>PLoS Genet</source>
<year>2007</year>
<volume>3</volume>
<fpage>e133</fpage>
<pub-id pub-id-type="doi">10.1371/journal.pgen.0030133</pub-id>
<pub-id pub-id-type="pmid">17696613</pub-id>
</element-citation>
</ref>
<ref id="CR20"><label>20.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Takano-Kai</surname>
<given-names>N</given-names>
</name>
<name><surname>Jiang</surname>
<given-names>H</given-names>
</name>
<name><surname>Kubo</surname>
<given-names>T</given-names>
</name>
<name><surname>Sweeney</surname>
<given-names>M</given-names>
</name>
<name><surname>Matsumoto</surname>
<given-names>T</given-names>
</name>
<name><surname>Kanamori</surname>
<given-names>H</given-names>
</name>
<name><surname>Padhukasahasram</surname>
<given-names>B</given-names>
</name>
<name><surname>Bustamante</surname>
<given-names>C</given-names>
</name>
<name><surname>Yoshimura</surname>
<given-names>A</given-names>
</name>
<name><surname>Doi</surname>
<given-names>K</given-names>
</name>
<name><surname>McCouch</surname>
<given-names>S</given-names>
</name>
</person-group>
<article-title>Evolutionary history of GS3, a gene conferring grain length in rice</article-title>
<source>Genetics</source>
<year>2009</year>
<volume>182</volume>
<fpage>1323</fpage>
<lpage>1334</lpage>
<pub-id pub-id-type="doi">10.1534/genetics.109.103002</pub-id>
<pub-id pub-id-type="pmid">19506305</pub-id>
</element-citation>
</ref>
<ref id="CR21"><label>21.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Tan</surname>
<given-names>L</given-names>
</name>
<name><surname>Li</surname>
<given-names>X</given-names>
</name>
<name><surname>Liu</surname>
<given-names>F</given-names>
</name>
<name><surname>Sun</surname>
<given-names>X</given-names>
</name>
<name><surname>Li</surname>
<given-names>C</given-names>
</name>
<name><surname>Zhu</surname>
<given-names>Z</given-names>
</name>
<name><surname>Fu</surname>
<given-names>Y</given-names>
</name>
<name><surname>Cai</surname>
<given-names>H</given-names>
</name>
<name><surname>Wang</surname>
<given-names>X</given-names>
</name>
<name><surname>Xie</surname>
<given-names>D</given-names>
</name>
<name><surname>Sun</surname>
<given-names>C</given-names>
</name>
</person-group>
<article-title>Control of a key transition from prostrate to erect growth in rice domestication</article-title>
<source>Nat Genet</source>
<year>2008</year>
<volume>40</volume>
<fpage>1360</fpage>
<lpage>1364</lpage>
<pub-id pub-id-type="doi">10.1038/ng.197</pub-id>
<pub-id pub-id-type="pmid">18820699</pub-id>
</element-citation>
</ref>
<ref id="CR22"><label>22.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Harushima</surname>
<given-names>Y</given-names>
</name>
<name><surname>Nakagahra</surname>
<given-names>M</given-names>
</name>
<name><surname>Yano</surname>
<given-names>M</given-names>
</name>
<name><surname>Sasaki</surname>
<given-names>T</given-names>
</name>
<name><surname>Kurata</surname>
<given-names>N</given-names>
</name>
</person-group>
<article-title>Diverse variation of reproductive barriers in three intraspecific rice crosses</article-title>
<source>Genetics</source>
<year>2002</year>
<volume>160</volume>
<fpage>313</fpage>
<lpage>322</lpage>
<pub-id pub-id-type="pmid">11805066</pub-id>
</element-citation>
</ref>
<ref id="CR23"><label>23.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Lin</surname>
<given-names>SY</given-names>
</name>
<name><surname>Ikehashi</surname>
<given-names>H</given-names>
</name>
<name><surname>Yanagihara</surname>
<given-names>S</given-names>
</name>
<name><surname>Kawashima</surname>
<given-names>A</given-names>
</name>
</person-group>
<article-title>Segregation distortion via male gametes in hybrids between Indica and Japonica or wide-compatibility varieties of rice (Oryza sativa L.)</article-title>
<source>Theor Appl Genet</source>
<year>1992</year>
<volume>84</volume>
<fpage>812</fpage>
<lpage>818</lpage>
<pub-id pub-id-type="pmid">24201479</pub-id>
</element-citation>
</ref>
<ref id="CR24"><label>24.</label>
<element-citation publication-type="book"><person-group person-group-type="author"><name><surname>Oka</surname>
<given-names>HI</given-names>
</name>
</person-group>
<article-title>Functions and genetic base of reproductive barriers</article-title>
<source>Origin of Cultivated Rice</source>
<year>1988</year>
<publisher-loc>Amsterdam/Tokyo</publisher-loc>
<publisher-name>Elsevier Science/Japan Scientific Societies Press</publisher-name>
<fpage>181</fpage>
<lpage>209</lpage>
</element-citation>
</ref>
<ref id="CR25"><label>25.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Sano</surname>
<given-names>Y</given-names>
</name>
</person-group>
<article-title>Constraints in using wild relatives in breeding: lack of basic knowledge on crop gene pools</article-title>
<source>Int Crop Sci</source>
<year>1993</year>
<volume>1</volume>
<fpage>437</fpage>
<lpage>443</lpage>
</element-citation>
</ref>
<ref id="CR26"><label>26.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ammiraju</surname>
<given-names>JSS</given-names>
</name>
<name><surname>Song</surname>
<given-names>X</given-names>
</name>
<name><surname>Luo</surname>
<given-names>MZ</given-names>
</name>
<name><surname>Sisneros</surname>
<given-names>N</given-names>
</name>
<name><surname>Angelova</surname>
<given-names>A</given-names>
</name>
<name><surname>Kudrna</surname>
<given-names>D</given-names>
</name>
<name><surname>Kim</surname>
<given-names>H</given-names>
</name>
<name><surname>Yu</surname>
<given-names>Y</given-names>
</name>
<name><surname>Goicoechea</surname>
<given-names>JL</given-names>
</name>
<name><surname>Lorieux</surname>
<given-names>M</given-names>
</name>
<name><surname>Kurata</surname>
<given-names>N</given-names>
</name>
<name><surname>Brar</surname>
<given-names>D</given-names>
</name>
<name><surname>Ware</surname>
<given-names>D</given-names>
</name>
<name><surname>Jackson</surname>
<given-names>S</given-names>
</name>
<name><surname>Wing</surname>
<given-names>RA</given-names>
</name>
</person-group>
<article-title>The Oryza BAC resource: a genus-wide and genome scale tool for exploring rice genome evolution and leveraging useful genetic diversity from wild relatives</article-title>
<source>Breeding Sci</source>
<year>2010</year>
<volume>60</volume>
<fpage>536</fpage>
<lpage>543</lpage>
<pub-id pub-id-type="doi">10.1270/jsbbs.60.536</pub-id>
</element-citation>
</ref>
<ref id="CR27"><label>27.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><collab>International Rice Genome Sequencing Project</collab>
</person-group>
<article-title>The map-based sequence of the rice genome</article-title>
<source>Nature</source>
<year>2005</year>
<volume>436</volume>
<fpage>793</fpage>
<lpage>800</lpage>
<pub-id pub-id-type="doi">10.1038/nature03895</pub-id>
<pub-id pub-id-type="pmid">16100779</pub-id>
</element-citation>
</ref>
<ref id="CR28"><label>28.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Gao</surname>
<given-names>ZY</given-names>
</name>
<name><surname>Zhao</surname>
<given-names>SC</given-names>
</name>
<name><surname>He</surname>
<given-names>WM</given-names>
</name>
<name><surname>Guo</surname>
<given-names>LB</given-names>
</name>
<name><surname>Peng</surname>
<given-names>YL</given-names>
</name>
<name><surname>Wang</surname>
<given-names>JJ</given-names>
</name>
<name><surname>Guo</surname>
<given-names>XS</given-names>
</name>
<name><surname>Zhang</surname>
<given-names>XM</given-names>
</name>
<name><surname>Rao</surname>
<given-names>YC</given-names>
</name>
<name><surname>Zhang</surname>
<given-names>C</given-names>
</name>
<name><surname>Dong</surname>
<given-names>GJ</given-names>
</name>
<name><surname>Zheng</surname>
<given-names>FY</given-names>
</name>
<name><surname>Lu</surname>
<given-names>CX</given-names>
</name>
<name><surname>Hu</surname>
<given-names>J</given-names>
</name>
<name><surname>Zhou</surname>
<given-names>Q</given-names>
</name>
<name><surname>Liu</surname>
<given-names>HJ</given-names>
</name>
<name><surname>Wu</surname>
<given-names>HY</given-names>
</name>
<name><surname>Xu</surname>
<given-names>J</given-names>
</name>
<name><surname>Ni</surname>
<given-names>PX</given-names>
</name>
<name><surname>Zeng</surname>
<given-names>DL</given-names>
</name>
<name><surname>Liu</surname>
<given-names>DH</given-names>
</name>
<name><surname>Tian</surname>
<given-names>P</given-names>
</name>
<name><surname>Gong</surname>
<given-names>LH</given-names>
</name>
<name><surname>Ye</surname>
<given-names>C</given-names>
</name>
<name><surname>Zhang</surname>
<given-names>GH</given-names>
</name>
<name><surname>Wang</surname>
<given-names>J</given-names>
</name>
<name><surname>Tian</surname>
<given-names>FK</given-names>
</name>
<name><surname>Xue</surname>
<given-names>DW</given-names>
</name>
<name><surname>Liao</surname>
<given-names>Y</given-names>
</name>
<name><surname>Zhu</surname>
<given-names>L</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Dissecting yield-associated loci in super hybrid rice by resequencing recombinant inbred lines and improving parental genome sequences</article-title>
<source>Proc Natl Acad Sci U S A</source>
<year>2013</year>
<volume>110</volume>
<fpage>14492</fpage>
<lpage>14497</lpage>
<pub-id pub-id-type="doi">10.1073/pnas.1306579110</pub-id>
<pub-id pub-id-type="pmid">23940322</pub-id>
</element-citation>
</ref>
<ref id="CR29"><label>29.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Yu</surname>
<given-names>J</given-names>
</name>
<name><surname>Wang</surname>
<given-names>J</given-names>
</name>
<name><surname>Lin</surname>
<given-names>W</given-names>
</name>
<name><surname>Li</surname>
<given-names>SG</given-names>
</name>
<name><surname>Li</surname>
<given-names>H</given-names>
</name>
<name><surname>Zhou</surname>
<given-names>J</given-names>
</name>
<name><surname>Ni</surname>
<given-names>PX</given-names>
</name>
<name><surname>Dong</surname>
<given-names>W</given-names>
</name>
<name><surname>Hu</surname>
<given-names>SN</given-names>
</name>
<name><surname>Zeng</surname>
<given-names>CQ</given-names>
</name>
<name><surname>Zhang</surname>
<given-names>JG</given-names>
</name>
<name><surname>Zhang</surname>
<given-names>Y</given-names>
</name>
<name><surname>Li</surname>
<given-names>RQ</given-names>
</name>
<name><surname>Xu</surname>
<given-names>ZY</given-names>
</name>
<name><surname>Li</surname>
<given-names>ST</given-names>
</name>
<name><surname>Li</surname>
<given-names>XR</given-names>
</name>
<name><surname>Zheng</surname>
<given-names>HK</given-names>
</name>
<name><surname>Cong</surname>
<given-names>LJ</given-names>
</name>
<name><surname>Lin</surname>
<given-names>L</given-names>
</name>
<name><surname>Yin</surname>
<given-names>JN</given-names>
</name>
<name><surname>Geng</surname>
<given-names>JN</given-names>
</name>
<name><surname>Li</surname>
<given-names>GY</given-names>
</name>
<name><surname>Shi</surname>
<given-names>JP</given-names>
</name>
<name><surname>Liu</surname>
<given-names>J</given-names>
</name>
<name><surname>Lv</surname>
<given-names>H</given-names>
</name>
<name><surname>Li</surname>
<given-names>J</given-names>
</name>
<name><surname>Wang</surname>
<given-names>J</given-names>
</name>
<name><surname>Deng</surname>
<given-names>YJ</given-names>
</name>
<name><surname>Ran</surname>
<given-names>LH</given-names>
</name>
<name><surname>Shi</surname>
<given-names>XL</given-names>
</name>
<etal></etal>
</person-group>
<article-title>The genomes of Oryza sativa: a history of duplications</article-title>
<source>PLoS Biol</source>
<year>2005</year>
<volume>3</volume>
<fpage>266</fpage>
<lpage>281</lpage>
<pub-id pub-id-type="doi">10.1371/journal.pbio.0030038</pub-id>
</element-citation>
</ref>
<ref id="CR30"><label>30.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname>
<given-names>XH</given-names>
</name>
<name><surname>Wei</surname>
<given-names>XH</given-names>
</name>
<name><surname>Sang</surname>
<given-names>T</given-names>
</name>
<name><surname>Zhao</surname>
<given-names>Q</given-names>
</name>
<name><surname>Feng</surname>
<given-names>Q</given-names>
</name>
<name><surname>Zhao</surname>
<given-names>Y</given-names>
</name>
<name><surname>Li</surname>
<given-names>CY</given-names>
</name>
<name><surname>Zhu</surname>
<given-names>CR</given-names>
</name>
<name><surname>Lu</surname>
<given-names>TT</given-names>
</name>
<name><surname>Zhang</surname>
<given-names>ZW</given-names>
</name>
<name><surname>Li</surname>
<given-names>M</given-names>
</name>
<name><surname>Fan</surname>
<given-names>DL</given-names>
</name>
<name><surname>Guo</surname>
<given-names>YL</given-names>
</name>
<name><surname>Wang</surname>
<given-names>A</given-names>
</name>
<name><surname>Wang</surname>
<given-names>L</given-names>
</name>
<name><surname>Deng</surname>
<given-names>LW</given-names>
</name>
<name><surname>Li</surname>
<given-names>WJ</given-names>
</name>
<name><surname>Lu</surname>
<given-names>YQ</given-names>
</name>
<name><surname>Weng</surname>
<given-names>QJ</given-names>
</name>
<name><surname>Liu</surname>
<given-names>KY</given-names>
</name>
<name><surname>Huang</surname>
<given-names>T</given-names>
</name>
<name><surname>Zhou</surname>
<given-names>TY</given-names>
</name>
<name><surname>Jing</surname>
<given-names>YF</given-names>
</name>
<name><surname>Li</surname>
<given-names>W</given-names>
</name>
<name><surname>Lin</surname>
<given-names>Z</given-names>
</name>
<name><surname>Buckler</surname>
<given-names>ES</given-names>
</name>
<name><surname>Qian</surname>
<given-names>Q</given-names>
</name>
<name><surname>Zhang</surname>
<given-names>QF</given-names>
</name>
<name><surname>Li</surname>
<given-names>JY</given-names>
</name>
<name><surname>Han</surname>
<given-names>B</given-names>
</name>
</person-group>
<article-title>Genome-wide association studies of 14 agronomic traits in rice landraces</article-title>
<source>Nat Genet</source>
<year>2010</year>
<volume>42</volume>
<fpage>961</fpage>
<lpage>967</lpage>
<pub-id pub-id-type="doi">10.1038/ng.695</pub-id>
<pub-id pub-id-type="pmid">20972439</pub-id>
</element-citation>
</ref>
<ref id="CR31"><label>31.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>McCouch</surname>
<given-names>SR</given-names>
</name>
<name><surname>Zhao</surname>
<given-names>KY</given-names>
</name>
<name><surname>Wright</surname>
<given-names>M</given-names>
</name>
<name><surname>Tung</surname>
<given-names>CW</given-names>
</name>
<name><surname>Ebana</surname>
<given-names>K</given-names>
</name>
<name><surname>Thomson</surname>
<given-names>M</given-names>
</name>
<name><surname>Reynolds</surname>
<given-names>A</given-names>
</name>
<name><surname>Wang</surname>
<given-names>D</given-names>
</name>
<name><surname>DeClerck</surname>
<given-names>G</given-names>
</name>
<name><surname>Ali</surname>
<given-names>ML</given-names>
</name>
<name><surname>McClung</surname>
<given-names>A</given-names>
</name>
<name><surname>Eizenga</surname>
<given-names>G</given-names>
</name>
<name><surname>Bustamante</surname>
<given-names>C</given-names>
</name>
</person-group>
<article-title>Development of genome-wide SNP assays for rice</article-title>
<source>Breeding Sci</source>
<year>2010</year>
<volume>60</volume>
<fpage>524</fpage>
<lpage>535</lpage>
<pub-id pub-id-type="doi">10.1270/jsbbs.60.524</pub-id>
</element-citation>
</ref>
<ref id="CR32"><label>32.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>McNally</surname>
<given-names>KL</given-names>
</name>
<name><surname>Childs</surname>
<given-names>KL</given-names>
</name>
<name><surname>Bohnert</surname>
<given-names>R</given-names>
</name>
<name><surname>Davidson</surname>
<given-names>RM</given-names>
</name>
<name><surname>Zhao</surname>
<given-names>K</given-names>
</name>
<name><surname>Ulat</surname>
<given-names>VJ</given-names>
</name>
<name><surname>Zeller</surname>
<given-names>G</given-names>
</name>
<name><surname>Clark</surname>
<given-names>RM</given-names>
</name>
<name><surname>Hoen</surname>
<given-names>DR</given-names>
</name>
<name><surname>Bureau</surname>
<given-names>TE</given-names>
</name>
<name><surname>Stokowski</surname>
<given-names>R</given-names>
</name>
<name><surname>Ballinger</surname>
<given-names>DG</given-names>
</name>
<name><surname>Frazer</surname>
<given-names>KA</given-names>
</name>
<name><surname>Cox</surname>
<given-names>DR</given-names>
</name>
<name><surname>Padhukasahasram</surname>
<given-names>B</given-names>
</name>
<name><surname>Bustamante</surname>
<given-names>CD</given-names>
</name>
<name><surname>Weigel</surname>
<given-names>D</given-names>
</name>
<name><surname>Mackill</surname>
<given-names>DJ</given-names>
</name>
<name><surname>Bruskiewich</surname>
<given-names>RM</given-names>
</name>
<name><surname>Ratsch</surname>
<given-names>G</given-names>
</name>
<name><surname>Buell</surname>
<given-names>CR</given-names>
</name>
<name><surname>Leung</surname>
<given-names>H</given-names>
</name>
<name><surname>Leach</surname>
<given-names>JE</given-names>
</name>
</person-group>
<article-title>Genomewide SNP variation reveals relationships among landraces and modern varieties of rice</article-title>
<source>Proc Natl Acad Sci U S A</source>
<year>2009</year>
<volume>106</volume>
<fpage>12273</fpage>
<lpage>12278</lpage>
<pub-id pub-id-type="doi">10.1073/pnas.0900992106</pub-id>
<pub-id pub-id-type="pmid">19597147</pub-id>
</element-citation>
</ref>
<ref id="CR33"><label>33.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Xu</surname>
<given-names>K</given-names>
</name>
<name><surname>Xu</surname>
<given-names>X</given-names>
</name>
<name><surname>Fukao</surname>
<given-names>T</given-names>
</name>
<name><surname>Canlas</surname>
<given-names>P</given-names>
</name>
<name><surname>Maghirang-Rodriguez</surname>
<given-names>R</given-names>
</name>
<name><surname>Heuer</surname>
<given-names>S</given-names>
</name>
<name><surname>Ismail</surname>
<given-names>AM</given-names>
</name>
<name><surname>Bailey-Serres</surname>
<given-names>J</given-names>
</name>
<name><surname>Ronald</surname>
<given-names>PC</given-names>
</name>
<name><surname>Mackill</surname>
<given-names>DJ</given-names>
</name>
</person-group>
<article-title>Sub1A is an ethylene-response-factor-like gene that confers submergence tolerance to rice</article-title>
<source>Nature</source>
<year>2006</year>
<volume>442</volume>
<fpage>705</fpage>
<lpage>708</lpage>
<pub-id pub-id-type="doi">10.1038/nature04920</pub-id>
<pub-id pub-id-type="pmid">16900200</pub-id>
</element-citation>
</ref>
<ref id="CR34"><label>34.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname>
<given-names>XH</given-names>
</name>
<name><surname>Feng</surname>
<given-names>Q</given-names>
</name>
<name><surname>Qian</surname>
<given-names>Q</given-names>
</name>
<name><surname>Zhao</surname>
<given-names>Q</given-names>
</name>
<name><surname>Wang</surname>
<given-names>L</given-names>
</name>
<name><surname>Wang</surname>
<given-names>AH</given-names>
</name>
<name><surname>Guan</surname>
<given-names>JP</given-names>
</name>
<name><surname>Fan</surname>
<given-names>DL</given-names>
</name>
<name><surname>Weng</surname>
<given-names>QJ</given-names>
</name>
<name><surname>Huang</surname>
<given-names>T</given-names>
</name>
<name><surname>Dong</surname>
<given-names>GJ</given-names>
</name>
<name><surname>Sang</surname>
<given-names>T</given-names>
</name>
<name><surname>Han</surname>
<given-names>B</given-names>
</name>
</person-group>
<article-title>High-throughput genotyping by whole-genome resequencing</article-title>
<source>Genome Res</source>
<year>2009</year>
<volume>19</volume>
<fpage>1068</fpage>
<lpage>1076</lpage>
<pub-id pub-id-type="doi">10.1101/gr.089516.108</pub-id>
<pub-id pub-id-type="pmid">19420380</pub-id>
</element-citation>
</ref>
<ref id="CR35"><label>35.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Xu</surname>
<given-names>X</given-names>
</name>
<name><surname>Liu</surname>
<given-names>X</given-names>
</name>
<name><surname>Ge</surname>
<given-names>S</given-names>
</name>
<name><surname>Jensen</surname>
<given-names>JD</given-names>
</name>
<name><surname>Hu</surname>
<given-names>FY</given-names>
</name>
<name><surname>Li</surname>
<given-names>X</given-names>
</name>
<name><surname>Dong</surname>
<given-names>Y</given-names>
</name>
<name><surname>Gutenkunst</surname>
<given-names>RN</given-names>
</name>
<name><surname>Fang</surname>
<given-names>L</given-names>
</name>
<name><surname>Huang</surname>
<given-names>L</given-names>
</name>
<name><surname>Li</surname>
<given-names>JX</given-names>
</name>
<name><surname>He</surname>
<given-names>WM</given-names>
</name>
<name><surname>Zhang</surname>
<given-names>GJ</given-names>
</name>
<name><surname>Zheng</surname>
<given-names>XM</given-names>
</name>
<name><surname>Zhang</surname>
<given-names>FM</given-names>
</name>
<name><surname>Li</surname>
<given-names>YR</given-names>
</name>
<name><surname>Yu</surname>
<given-names>C</given-names>
</name>
<name><surname>Kristiansen</surname>
<given-names>K</given-names>
</name>
<name><surname>Zhang</surname>
<given-names>XQ</given-names>
</name>
<name><surname>Wang</surname>
<given-names>J</given-names>
</name>
<name><surname>Wright</surname>
<given-names>M</given-names>
</name>
<name><surname>McCouch</surname>
<given-names>S</given-names>
</name>
<name><surname>Nielsen</surname>
<given-names>R</given-names>
</name>
<name><surname>Wang</surname>
<given-names>J</given-names>
</name>
<name><surname>Wang</surname>
<given-names>W</given-names>
</name>
</person-group>
<article-title>Resequencing 50 accessions of cultivated and wild rice yields markers for identifying agronomically important genes</article-title>
<source>Nat Biotechnol</source>
<year>2012</year>
<volume>30</volume>
<fpage>105</fpage>
<lpage>111</lpage>
<pub-id pub-id-type="pmid">22158310</pub-id>
</element-citation>
</ref>
<ref id="CR36"><label>36.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Li</surname>
<given-names>JY</given-names>
</name>
<name><surname>Wang</surname>
<given-names>J</given-names>
</name>
<name><surname>Zeigler</surname>
<given-names>RS</given-names>
</name>
</person-group>
<article-title>The 3,000 rice genomes project: new opportunities and challenges for future rice research</article-title>
<source>Gigascience</source>
<year>2014</year>
<volume>3</volume>
<fpage>8</fpage>
<pub-id pub-id-type="doi">10.1186/2047-217X-3-8</pub-id>
<pub-id pub-id-type="pmid">24872878</pub-id>
</element-citation>
</ref>
<ref id="CR37"><label>37.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Han</surname>
<given-names>B</given-names>
</name>
<name><surname>Xue</surname>
<given-names>YB</given-names>
</name>
</person-group>
<article-title>Genome-wide intraspecific DNA-sequence variations in rice</article-title>
<source>Curr Opin Plant Biol</source>
<year>2003</year>
<volume>6</volume>
<fpage>134</fpage>
<lpage>138</lpage>
<pub-id pub-id-type="doi">10.1016/S1369-5266(03)00004-9</pub-id>
<pub-id pub-id-type="pmid">12667869</pub-id>
</element-citation>
</ref>
<ref id="CR38"><label>38.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Zuccolo</surname>
<given-names>A</given-names>
</name>
<name><surname>Sebastian</surname>
<given-names>A</given-names>
</name>
<name><surname>Talag</surname>
<given-names>J</given-names>
</name>
<name><surname>Yu</surname>
<given-names>Y</given-names>
</name>
<name><surname>Kim</surname>
<given-names>H</given-names>
</name>
<name><surname>Collura</surname>
<given-names>K</given-names>
</name>
<name><surname>Kudrna</surname>
<given-names>D</given-names>
</name>
<name><surname>Wing</surname>
<given-names>RA</given-names>
</name>
</person-group>
<article-title>Transposable element distribution, abundance and role in genome size variation in the genus Oryza</article-title>
<source>BMC Evol Biol</source>
<year>2007</year>
<volume>7</volume>
<fpage>152</fpage>
<pub-id pub-id-type="doi">10.1186/1471-2148-7-152</pub-id>
<pub-id pub-id-type="pmid">17727727</pub-id>
</element-citation>
</ref>
<ref id="CR39"><label>39.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Yu</surname>
<given-names>P</given-names>
</name>
<name><surname>Wang</surname>
<given-names>CH</given-names>
</name>
<name><surname>Xu</surname>
<given-names>Q</given-names>
</name>
<name><surname>Feng</surname>
<given-names>Y</given-names>
</name>
<name><surname>Yuan</surname>
<given-names>XP</given-names>
</name>
<name><surname>Yu</surname>
<given-names>HY</given-names>
</name>
<name><surname>Wang</surname>
<given-names>YP</given-names>
</name>
<name><surname>Tang</surname>
<given-names>SX</given-names>
</name>
<name><surname>Wei</surname>
<given-names>XH</given-names>
</name>
</person-group>
<article-title>Detection of copy number variations in rice using array-based comparative genomic hybridization</article-title>
<source>BMC Genomics</source>
<year>2011</year>
<volume>12</volume>
<fpage>372</fpage>
<pub-id pub-id-type="doi">10.1186/1471-2164-12-372</pub-id>
<pub-id pub-id-type="pmid">21771342</pub-id>
</element-citation>
</ref>
<ref id="CR40"><label>40.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Famoso</surname>
<given-names>AN</given-names>
</name>
<name><surname>Zhao</surname>
<given-names>K</given-names>
</name>
<name><surname>Clark</surname>
<given-names>RT</given-names>
</name>
<name><surname>Tung</surname>
<given-names>CW</given-names>
</name>
<name><surname>Wright</surname>
<given-names>MH</given-names>
</name>
<name><surname>Bustamante</surname>
<given-names>C</given-names>
</name>
<name><surname>Kochian</surname>
<given-names>LV</given-names>
</name>
<name><surname>McCouch</surname>
<given-names>SR</given-names>
</name>
</person-group>
<article-title>Genetic architecture of aluminum tolerance in rice (Oryza sativa) determined through genome-wide association analysis and QTL mapping</article-title>
<source>PLoS Genet</source>
<year>2011</year>
<volume>7</volume>
<fpage>e1002221</fpage>
<pub-id pub-id-type="doi">10.1371/journal.pgen.1002221</pub-id>
<pub-id pub-id-type="pmid">21829395</pub-id>
</element-citation>
</ref>
<ref id="CR41"><label>41.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Gamuyao</surname>
<given-names>R</given-names>
</name>
<name><surname>Chin</surname>
<given-names>JH</given-names>
</name>
<name><surname>Pariasca-Tanaka</surname>
<given-names>J</given-names>
</name>
<name><surname>Pesaresi</surname>
<given-names>P</given-names>
</name>
<name><surname>Catausan</surname>
<given-names>S</given-names>
</name>
<name><surname>Dalid</surname>
<given-names>C</given-names>
</name>
<name><surname>Slamet-Loedin</surname>
<given-names>I</given-names>
</name>
<name><surname>Tecson-Mendoza</surname>
<given-names>EM</given-names>
</name>
<name><surname>Wissuwa</surname>
<given-names>M</given-names>
</name>
<name><surname>Heuer</surname>
<given-names>S</given-names>
</name>
</person-group>
<article-title>The protein kinase Pstol1 from traditional rice confers tolerance of phosphorus deficiency</article-title>
<source>Nature</source>
<year>2012</year>
<volume>488</volume>
<fpage>535</fpage>
<pub-id pub-id-type="doi">10.1038/nature11346</pub-id>
<pub-id pub-id-type="pmid">22914168</pub-id>
</element-citation>
</ref>
<ref id="CR42"><label>42.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Uga</surname>
<given-names>Y</given-names>
</name>
<name><surname>Sugimoto</surname>
<given-names>K</given-names>
</name>
<name><surname>Ogawa</surname>
<given-names>S</given-names>
</name>
<name><surname>Rane</surname>
<given-names>J</given-names>
</name>
<name><surname>Ishitani</surname>
<given-names>M</given-names>
</name>
<name><surname>Hara</surname>
<given-names>N</given-names>
</name>
<name><surname>Kitomi</surname>
<given-names>Y</given-names>
</name>
<name><surname>Inukai</surname>
<given-names>Y</given-names>
</name>
<name><surname>Ono</surname>
<given-names>K</given-names>
</name>
<name><surname>Kanno</surname>
<given-names>N</given-names>
</name>
<name><surname>Inoue</surname>
<given-names>H</given-names>
</name>
<name><surname>Takehisa</surname>
<given-names>H</given-names>
</name>
<name><surname>Motoyama</surname>
<given-names>R</given-names>
</name>
<name><surname>Nagamura</surname>
<given-names>Y</given-names>
</name>
<name><surname>Wu</surname>
<given-names>J</given-names>
</name>
<name><surname>Matsumoto</surname>
<given-names>T</given-names>
</name>
<name><surname>Takai</surname>
<given-names>T</given-names>
</name>
<name><surname>Okuno</surname>
<given-names>K</given-names>
</name>
<name><surname>Yano</surname>
<given-names>M</given-names>
</name>
</person-group>
<article-title>Control of root system architecture by DEEPER ROOTING 1 increases rice yield under drought conditions</article-title>
<source>Nat Genet</source>
<year>2013</year>
<volume>45</volume>
<fpage>1097</fpage>
<lpage>1102</lpage>
<pub-id pub-id-type="doi">10.1038/ng.2725</pub-id>
<pub-id pub-id-type="pmid">23913002</pub-id>
</element-citation>
</ref>
<ref id="CR43"><label>43.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Liakat Ali</surname>
<given-names>M</given-names>
</name>
<name><surname>McClung</surname>
<given-names>AM</given-names>
</name>
<name><surname>Jia</surname>
<given-names>MH</given-names>
</name>
<name><surname>Kimball</surname>
<given-names>JA</given-names>
</name>
<name><surname>McCouch</surname>
<given-names>SR</given-names>
</name>
<name><surname>Eizenga</surname>
<given-names>GC</given-names>
</name>
</person-group>
<article-title>A rice diversity panel evaluated for genetic and agro-morphological diversity between subpopulations and its geographic distribution</article-title>
<source>Crop Sci</source>
<year>2011</year>
<volume>51</volume>
<fpage>2021</fpage>
<lpage>2035</lpage>
<pub-id pub-id-type="doi">10.2135/cropsci2010.11.0641</pub-id>
</element-citation>
</ref>
<ref id="CR44"><label>44.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Garris</surname>
<given-names>AJ</given-names>
</name>
<name><surname>McCouch</surname>
<given-names>SR</given-names>
</name>
<name><surname>Kresovich</surname>
<given-names>S</given-names>
</name>
</person-group>
<article-title>Population structure and its effect on haplotype diversity and linkage disequilibrium surrounding the xa5 locus of rice (Oryza sativa L.)</article-title>
<source>Genetics</source>
<year>2003</year>
<volume>165</volume>
<fpage>759</fpage>
<lpage>769</lpage>
<pub-id pub-id-type="pmid">14573486</pub-id>
</element-citation>
</ref>
<ref id="CR45"><label>45.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hattori</surname>
<given-names>Y</given-names>
</name>
<name><surname>Nagai</surname>
<given-names>K</given-names>
</name>
<name><surname>Furukawa</surname>
<given-names>S</given-names>
</name>
<name><surname>Song</surname>
<given-names>XJ</given-names>
</name>
<name><surname>Kawano</surname>
<given-names>R</given-names>
</name>
<name><surname>Sakakibara</surname>
<given-names>H</given-names>
</name>
<name><surname>Wu</surname>
<given-names>J</given-names>
</name>
<name><surname>Matsumoto</surname>
<given-names>T</given-names>
</name>
<name><surname>Yoshimura</surname>
<given-names>A</given-names>
</name>
<name><surname>Kitano</surname>
<given-names>H</given-names>
</name>
<name><surname>Matsuoka</surname>
<given-names>M</given-names>
</name>
<name><surname>Mori</surname>
<given-names>H</given-names>
</name>
<name><surname>Ashikari</surname>
<given-names>M</given-names>
</name>
</person-group>
<article-title>The ethylene response factors SNORKEL1 and SNORKEL2 allow rice to adapt to deep water</article-title>
<source>Nature</source>
<year>2009</year>
<volume>460</volume>
<fpage>1026</fpage>
<lpage>1030</lpage>
<pub-id pub-id-type="doi">10.1038/nature08258</pub-id>
<pub-id pub-id-type="pmid">19693083</pub-id>
</element-citation>
</ref>
<ref id="CR46"><label>46.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Bernier</surname>
<given-names>J</given-names>
</name>
<name><surname>Kumar</surname>
<given-names>A</given-names>
</name>
<name><surname>Venuprasad</surname>
<given-names>R</given-names>
</name>
<name><surname>Spaner</surname>
<given-names>D</given-names>
</name>
<name><surname>Verulkar</surname>
<given-names>S</given-names>
</name>
<name><surname>Mandal</surname>
<given-names>N</given-names>
</name>
<name><surname>Sinha</surname>
<given-names>P</given-names>
</name>
<name><surname>Peeraju</surname>
<given-names>P</given-names>
</name>
<name><surname>Dongre</surname>
<given-names>P</given-names>
</name>
<name><surname>Mahto</surname>
<given-names>RN</given-names>
</name>
<name><surname>Atlin</surname>
<given-names>G</given-names>
</name>
</person-group>
<article-title>Characterization of the effect of a QTL for drought resistance in rice, qtl12.1, over a range of environments in the Philippines and eastern India</article-title>
<source>Euphytica</source>
<year>2009</year>
<volume>166</volume>
<fpage>207</fpage>
<lpage>217</lpage>
<pub-id pub-id-type="doi">10.1007/s10681-008-9826-y</pub-id>
</element-citation>
</ref>
<ref id="CR47"><label>47.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Gnerre</surname>
<given-names>S</given-names>
</name>
<name><surname>MacCallum</surname>
<given-names>I</given-names>
</name>
<name><surname>Przybylski</surname>
<given-names>D</given-names>
</name>
<name><surname>Ribeiro</surname>
<given-names>FJ</given-names>
</name>
<name><surname>Burton</surname>
<given-names>JN</given-names>
</name>
<name><surname>Walker</surname>
<given-names>BJ</given-names>
</name>
<name><surname>Sharpe</surname>
<given-names>T</given-names>
</name>
<name><surname>Hall</surname>
<given-names>G</given-names>
</name>
<name><surname>Shea</surname>
<given-names>TP</given-names>
</name>
<name><surname>Sykes</surname>
<given-names>S</given-names>
</name>
<name><surname>Berlin</surname>
<given-names>AM</given-names>
</name>
<name><surname>Aird</surname>
<given-names>D</given-names>
</name>
<name><surname>Costello</surname>
<given-names>M</given-names>
</name>
<name><surname>Daza</surname>
<given-names>R</given-names>
</name>
<name><surname>Williams</surname>
<given-names>L</given-names>
</name>
<name><surname>Nicol</surname>
<given-names>R</given-names>
</name>
<name><surname>Gnirke</surname>
<given-names>A</given-names>
</name>
<name><surname>Nusbaum</surname>
<given-names>C</given-names>
</name>
<name><surname>Lander</surname>
<given-names>ES</given-names>
</name>
<name><surname>Jaffe</surname>
<given-names>DB</given-names>
</name>
</person-group>
<article-title>High-quality draft assemblies of mammalian genomes from massively parallel sequence data</article-title>
<source>Proc Natl Acad Sci U S A</source>
<year>2011</year>
<volume>108</volume>
<fpage>1513</fpage>
<lpage>1518</lpage>
<pub-id pub-id-type="doi">10.1073/pnas.1017351108</pub-id>
<pub-id pub-id-type="pmid">21187386</pub-id>
</element-citation>
</ref>
<ref id="CR48"><label>48.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Bradnam</surname>
<given-names>KR</given-names>
</name>
<name><surname>Fass</surname>
<given-names>JN</given-names>
</name>
<name><surname>Alexandrov</surname>
<given-names>A</given-names>
</name>
<name><surname>Baranay</surname>
<given-names>P</given-names>
</name>
<name><surname>Bechner</surname>
<given-names>M</given-names>
</name>
<name><surname>Birol</surname>
<given-names>I</given-names>
</name>
<name><surname>Boisvert</surname>
<given-names>S</given-names>
</name>
<name><surname>Chapman</surname>
<given-names>JA</given-names>
</name>
<name><surname>Chapuis</surname>
<given-names>G</given-names>
</name>
<name><surname>Chikhi</surname>
<given-names>R</given-names>
</name>
<name><surname>Chitsaz</surname>
<given-names>H</given-names>
</name>
<name><surname>Chou</surname>
<given-names>WC</given-names>
</name>
<name><surname>Corbeil</surname>
<given-names>J</given-names>
</name>
<name><surname>Del Fabbro</surname>
<given-names>C</given-names>
</name>
<name><surname>Docking</surname>
<given-names>TR</given-names>
</name>
<name><surname>Durbin</surname>
<given-names>R</given-names>
</name>
<name><surname>Earl</surname>
<given-names>D</given-names>
</name>
<name><surname>Emrich</surname>
<given-names>S</given-names>
</name>
<name><surname>Fedotov</surname>
<given-names>P</given-names>
</name>
<name><surname>Fonseca</surname>
<given-names>NA</given-names>
</name>
<name><surname>Ganapathy</surname>
<given-names>G</given-names>
</name>
<name><surname>Gibbs</surname>
<given-names>RA</given-names>
</name>
<name><surname>Gnerre</surname>
<given-names>S</given-names>
</name>
<name><surname>Godzaridis</surname>
<given-names>E</given-names>
</name>
<name><surname>Goldstein</surname>
<given-names>S</given-names>
</name>
<name><surname>Haimel</surname>
<given-names>M</given-names>
</name>
<name><surname>Hall</surname>
<given-names>G</given-names>
</name>
<name><surname>Haussler</surname>
<given-names>D</given-names>
</name>
<name><surname>Hiatt</surname>
<given-names>JB</given-names>
</name>
<name><surname>Ho</surname>
<given-names>IY</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Assemblathon 2: evaluating de novo methods of genome assembly in three vertebrate species</article-title>
<source>Gigascience</source>
<year>2013</year>
<volume>2</volume>
<fpage>10</fpage>
<pub-id pub-id-type="doi">10.1186/2047-217X-2-10</pub-id>
<pub-id pub-id-type="pmid">23870653</pub-id>
</element-citation>
</ref>
<ref id="CR49"><label>49.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Earl</surname>
<given-names>D</given-names>
</name>
<name><surname>Bradnam</surname>
<given-names>K</given-names>
</name>
<name><surname>St John</surname>
<given-names>J</given-names>
</name>
<name><surname>Darling</surname>
<given-names>A</given-names>
</name>
<name><surname>Lin</surname>
<given-names>D</given-names>
</name>
<name><surname>Fass</surname>
<given-names>J</given-names>
</name>
<name><surname>Yu</surname>
<given-names>HO</given-names>
</name>
<name><surname>Buffalo</surname>
<given-names>V</given-names>
</name>
<name><surname>Zerbino</surname>
<given-names>DR</given-names>
</name>
<name><surname>Diekhans</surname>
<given-names>M</given-names>
</name>
<name><surname>Nguyen</surname>
<given-names>N</given-names>
</name>
<name><surname>Ariyaratne</surname>
<given-names>PN</given-names>
</name>
<name><surname>Sung</surname>
<given-names>WK</given-names>
</name>
<name><surname>Ning</surname>
<given-names>Z</given-names>
</name>
<name><surname>Haimel</surname>
<given-names>M</given-names>
</name>
<name><surname>Simpson</surname>
<given-names>JT</given-names>
</name>
<name><surname>Fonseca</surname>
<given-names>NA</given-names>
</name>
<name><surname>Birol</surname>
<given-names>I</given-names>
</name>
<name><surname>Docking</surname>
<given-names>TR</given-names>
</name>
<name><surname>Ho</surname>
<given-names>IY</given-names>
</name>
<name><surname>Rokhsar</surname>
<given-names>DS</given-names>
</name>
<name><surname>Chikhi</surname>
<given-names>R</given-names>
</name>
<name><surname>Lavenier</surname>
<given-names>D</given-names>
</name>
<name><surname>Chapuis</surname>
<given-names>G</given-names>
</name>
<name><surname>Naquin</surname>
<given-names>D</given-names>
</name>
<name><surname>Maillet</surname>
<given-names>N</given-names>
</name>
<name><surname>Schatz</surname>
<given-names>MC</given-names>
</name>
<name><surname>Kelley</surname>
<given-names>DR</given-names>
</name>
<name><surname>Phillippy</surname>
<given-names>AM</given-names>
</name>
<name><surname>Koren</surname>
<given-names>S</given-names>
</name>
</person-group>
<article-title>Assemblathon 1: a competitive assessment of de novo short read assembly methods</article-title>
<source>Genome Res</source>
<year>2011</year>
<volume>21</volume>
<fpage>2224</fpage>
<lpage>2241</lpage>
<pub-id pub-id-type="doi">10.1101/gr.126599.111</pub-id>
<pub-id pub-id-type="pmid">21926179</pub-id>
</element-citation>
</ref>
<ref id="CR50"><label>50.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Salzberg</surname>
<given-names>SL</given-names>
</name>
<name><surname>Phillippy</surname>
<given-names>AM</given-names>
</name>
<name><surname>Zimin</surname>
<given-names>A</given-names>
</name>
<name><surname>Puiu</surname>
<given-names>D</given-names>
</name>
<name><surname>Magoc</surname>
<given-names>T</given-names>
</name>
<name><surname>Koren</surname>
<given-names>S</given-names>
</name>
<name><surname>Treangen</surname>
<given-names>TJ</given-names>
</name>
<name><surname>Schatz</surname>
<given-names>MC</given-names>
</name>
<name><surname>Delcher</surname>
<given-names>AL</given-names>
</name>
<name><surname>Roberts</surname>
<given-names>M</given-names>
</name>
<name><surname>Marcais</surname>
<given-names>G</given-names>
</name>
<name><surname>Pop</surname>
<given-names>M</given-names>
</name>
<name><surname>Yorke</surname>
<given-names>JA</given-names>
</name>
</person-group>
<article-title>GAGE: A critical evaluation of genome assemblies and assembly algorithms</article-title>
<source>Genome Res</source>
<year>2012</year>
<volume>22</volume>
<fpage>557</fpage>
<lpage>567</lpage>
<pub-id pub-id-type="doi">10.1101/gr.131383.111</pub-id>
<pub-id pub-id-type="pmid">22147368</pub-id>
</element-citation>
</ref>
<ref id="CR51"><label>51.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Kawahara</surname>
<given-names>Y</given-names>
</name>
<name><surname>de la Bastide</surname>
<given-names>M</given-names>
</name>
<name><surname>Hamilton</surname>
<given-names>JP</given-names>
</name>
<name><surname>Kanamori</surname>
<given-names>H</given-names>
</name>
<name><surname>McCombie</surname>
<given-names>WR</given-names>
</name>
<name><surname>Ouyang</surname>
<given-names>S</given-names>
</name>
<name><surname>Schwartz</surname>
<given-names>DC</given-names>
</name>
<name><surname>Tanaka</surname>
<given-names>T</given-names>
</name>
<name><surname>Wu</surname>
<given-names>J</given-names>
</name>
<name><surname>Zhou</surname>
<given-names>S</given-names>
</name>
<name><surname>Childs</surname>
<given-names>KL</given-names>
</name>
<name><surname>Davidson</surname>
<given-names>RM</given-names>
</name>
<name><surname>Lin</surname>
<given-names>H</given-names>
</name>
<name><surname>Quesada-Ocampo</surname>
<given-names>L</given-names>
</name>
<name><surname>Vaillancourt</surname>
<given-names>B</given-names>
</name>
<name><surname>Sakai</surname>
<given-names>H</given-names>
</name>
<name><surname>Lee</surname>
<given-names>SS</given-names>
</name>
<name><surname>Kim</surname>
<given-names>J</given-names>
</name>
<name><surname>Numa</surname>
<given-names>H</given-names>
</name>
<name><surname>Itoh</surname>
<given-names>T</given-names>
</name>
<name><surname>Buell</surname>
<given-names>CR</given-names>
</name>
<name><surname>Matsumoto</surname>
<given-names>T</given-names>
</name>
</person-group>
<article-title>Improvement of the Oryza sativa Nipponbare reference genome using next generation sequence and optical map data</article-title>
<source>Rice</source>
<year>2013</year>
<volume>6</volume>
<fpage>4</fpage>
<pub-id pub-id-type="doi">10.1186/1939-8433-6-4</pub-id>
<pub-id pub-id-type="pmid">24280374</pub-id>
</element-citation>
</ref>
<ref id="CR52"><label>52.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Campbell</surname>
<given-names>MS</given-names>
</name>
<name><surname>Law</surname>
<given-names>M</given-names>
</name>
<name><surname>Holt</surname>
<given-names>C</given-names>
</name>
<name><surname>Stein</surname>
<given-names>JC</given-names>
</name>
<name><surname>Moghe</surname>
<given-names>GD</given-names>
</name>
<name><surname>Hufnagel</surname>
<given-names>DE</given-names>
</name>
<name><surname>Lei</surname>
<given-names>J</given-names>
</name>
<name><surname>Achawanantakun</surname>
<given-names>R</given-names>
</name>
<name><surname>Jiao</surname>
<given-names>D</given-names>
</name>
<name><surname>Lawrence</surname>
<given-names>CJ</given-names>
</name>
<name><surname>Ware</surname>
<given-names>D</given-names>
</name>
<name><surname>Shiu</surname>
<given-names>SH</given-names>
</name>
<name><surname>Childs</surname>
<given-names>KL</given-names>
</name>
<name><surname>Sun</surname>
<given-names>Y</given-names>
</name>
<name><surname>Jiang</surname>
<given-names>N</given-names>
</name>
<name><surname>Yandell</surname>
<given-names>M</given-names>
</name>
</person-group>
<article-title>MAKER-P: A tool kit for the rapid creation, management, and quality control of plant genome annotations</article-title>
<source>Plant Physiol</source>
<year>2014</year>
<volume>164</volume>
<fpage>513</fpage>
<lpage>524</lpage>
<pub-id pub-id-type="doi">10.1104/pp.113.230144</pub-id>
<pub-id pub-id-type="pmid">24306534</pub-id>
</element-citation>
</ref>
<ref id="CR53"><label>53.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Lipman</surname>
<given-names>DJ</given-names>
</name>
<name><surname>Souvorov</surname>
<given-names>A</given-names>
</name>
<name><surname>Koonin</surname>
<given-names>EV</given-names>
</name>
<name><surname>Panchenko</surname>
<given-names>AR</given-names>
</name>
<name><surname>Tatusova</surname>
<given-names>TA</given-names>
</name>
</person-group>
<article-title>The relationship of protein conservation and sequence length</article-title>
<source>BMC Evol Biol</source>
<year>2002</year>
<volume>2</volume>
<fpage>20</fpage>
<pub-id pub-id-type="doi">10.1186/1471-2148-2-20</pub-id>
<pub-id pub-id-type="pmid">12410938</pub-id>
</element-citation>
</ref>
<ref id="CR54"><label>54.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Capra</surname>
<given-names>JA</given-names>
</name>
<name><surname>Pollard</surname>
<given-names>KS</given-names>
</name>
<name><surname>Singh</surname>
<given-names>M</given-names>
</name>
</person-group>
<article-title>Novel genes exhibit distinct patterns of function acquisition and network integration</article-title>
<source>Genome Biol</source>
<year>2010</year>
<volume>11</volume>
<fpage>R127</fpage>
<pub-id pub-id-type="doi">10.1186/gb-2010-11-12-r127</pub-id>
<pub-id pub-id-type="pmid">21187012</pub-id>
</element-citation>
</ref>
<ref id="CR55"><label>55.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Cai</surname>
<given-names>JJ</given-names>
</name>
<name><surname>Petrov</surname>
<given-names>DA</given-names>
</name>
</person-group>
<article-title>Relaxed purifying selection and possibly high rate of adaptation in primate lineage-specific genes</article-title>
<source>Genome Biol Evol</source>
<year>2010</year>
<volume>2</volume>
<fpage>393</fpage>
<lpage>409</lpage>
<pub-id pub-id-type="doi">10.1093/gbe/evq019</pub-id>
<pub-id pub-id-type="pmid">20624743</pub-id>
</element-citation>
</ref>
<ref id="CR56"><label>56.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Yanagihara</surname>
<given-names>S</given-names>
</name>
<name><surname>McCouch</surname>
<given-names>SR</given-names>
</name>
<name><surname>Ishikawa</surname>
<given-names>K</given-names>
</name>
<name><surname>Ogi</surname>
<given-names>Y</given-names>
</name>
<name><surname>Maruyama</surname>
<given-names>K</given-names>
</name>
<name><surname>Ikehashi</surname>
<given-names>H</given-names>
</name>
</person-group>
<article-title>Molecular analysis of the inheritance of the S-5 locus, conferring wide compatibility in Indica-Japonica hybrids of rice (Oryza sativa L.)</article-title>
<source>Theor Appl Genet</source>
<year>1995</year>
<volume>90</volume>
<fpage>182</fpage>
<lpage>188</lpage>
<pub-id pub-id-type="doi">10.1007/BF00222200</pub-id>
<pub-id pub-id-type="pmid">24173889</pub-id>
</element-citation>
</ref>
<ref id="CR57"><label>57.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname>
<given-names>JJ</given-names>
</name>
<name><surname>Ding</surname>
<given-names>JH</given-names>
</name>
<name><surname>Ouyang</surname>
<given-names>YD</given-names>
</name>
<name><surname>Du</surname>
<given-names>HY</given-names>
</name>
<name><surname>Yang</surname>
<given-names>JY</given-names>
</name>
<name><surname>Cheng</surname>
<given-names>K</given-names>
</name>
<name><surname>Zhao</surname>
<given-names>J</given-names>
</name>
<name><surname>Qiu</surname>
<given-names>SQ</given-names>
</name>
<name><surname>Zhang</surname>
<given-names>XL</given-names>
</name>
<name><surname>Yao</surname>
<given-names>JL</given-names>
</name>
<name><surname>Liu</surname>
<given-names>KD</given-names>
</name>
<name><surname>Wang</surname>
<given-names>L</given-names>
</name>
<name><surname>Xu</surname>
<given-names>CG</given-names>
</name>
<name><surname>Li</surname>
<given-names>XH</given-names>
</name>
<name><surname>Xue</surname>
<given-names>YB</given-names>
</name>
<name><surname>Xia</surname>
<given-names>M</given-names>
</name>
<name><surname>Ji</surname>
<given-names>Q</given-names>
</name>
<name><surname>Lu</surname>
<given-names>JF</given-names>
</name>
<name><surname>Xu</surname>
<given-names>ML</given-names>
</name>
<name><surname>Zhang</surname>
<given-names>QF</given-names>
</name>
</person-group>
<article-title>A triallelic system of S5 is a major regulator of the reproductive barrier and compatibility of indica-japonica hybrids in rice</article-title>
<source>Proc Natl Acad Sci U S A</source>
<year>2008</year>
<volume>105</volume>
<fpage>11436</fpage>
<lpage>11441</lpage>
<pub-id pub-id-type="doi">10.1073/pnas.0804761105</pub-id>
<pub-id pub-id-type="pmid">18678896</pub-id>
</element-citation>
</ref>
<ref id="CR58"><label>58.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname>
<given-names>J</given-names>
</name>
<name><surname>Zhao</surname>
<given-names>X</given-names>
</name>
<name><surname>Cheng</surname>
<given-names>K</given-names>
</name>
<name><surname>Du</surname>
<given-names>H</given-names>
</name>
<name><surname>Ouyang</surname>
<given-names>Y</given-names>
</name>
<name><surname>Chen</surname>
<given-names>J</given-names>
</name>
<name><surname>Qiu</surname>
<given-names>S</given-names>
</name>
<name><surname>Huang</surname>
<given-names>J</given-names>
</name>
<name><surname>Jiang</surname>
<given-names>Y</given-names>
</name>
<name><surname>Jiang</surname>
<given-names>L</given-names>
</name>
<name><surname>Ding</surname>
<given-names>J</given-names>
</name>
<name><surname>Wang</surname>
<given-names>J</given-names>
</name>
<name><surname>Xu</surname>
<given-names>C</given-names>
</name>
<name><surname>Li</surname>
<given-names>X</given-names>
</name>
<name><surname>Zhang</surname>
<given-names>Q</given-names>
</name>
</person-group>
<article-title>A killer-protector system regulates both hybrid sterility and segregation distortion in rice</article-title>
<source>Science</source>
<year>2012</year>
<volume>337</volume>
<fpage>1336</fpage>
<lpage>1340</lpage>
<pub-id pub-id-type="doi">10.1126/science.1223702</pub-id>
<pub-id pub-id-type="pmid">22984070</pub-id>
</element-citation>
</ref>
<ref id="CR59"><label>59.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>He</surname>
<given-names>GM</given-names>
</name>
<name><surname>Luo</surname>
<given-names>XJ</given-names>
</name>
<name><surname>Tian</surname>
<given-names>F</given-names>
</name>
<name><surname>Li</surname>
<given-names>KG</given-names>
</name>
<name><surname>Zhu</surname>
<given-names>ZF</given-names>
</name>
<name><surname>Su</surname>
<given-names>W</given-names>
</name>
<name><surname>Qian</surname>
<given-names>XY</given-names>
</name>
<name><surname>Fu</surname>
<given-names>YC</given-names>
</name>
<name><surname>Wang</surname>
<given-names>XK</given-names>
</name>
<name><surname>Sun</surname>
<given-names>CQ</given-names>
</name>
<name><surname>Yang</surname>
<given-names>JS</given-names>
</name>
</person-group>
<article-title>Haplotype variation in structure and expression of a gene cluster associated with a quantitative trait locus for improved yield in rice</article-title>
<source>Genome Res</source>
<year>2006</year>
<volume>16</volume>
<fpage>618</fpage>
<lpage>626</lpage>
<pub-id pub-id-type="doi">10.1101/gr.4814006</pub-id>
<pub-id pub-id-type="pmid">16606701</pub-id>
</element-citation>
</ref>
<ref id="CR60"><label>60.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Wissuwa</surname>
<given-names>M</given-names>
</name>
<name><surname>Wegner</surname>
<given-names>J</given-names>
</name>
<name><surname>Ae</surname>
<given-names>N</given-names>
</name>
<name><surname>Yano</surname>
<given-names>M</given-names>
</name>
</person-group>
<article-title>Substitution mapping of Pup1: a major QTL increasing phosphorus uptake of rice from a phosphorus-deficient soil</article-title>
<source>Theor Appl Genet</source>
<year>2002</year>
<volume>105</volume>
<fpage>890</fpage>
<lpage>897</lpage>
<pub-id pub-id-type="doi">10.1007/s00122-002-1051-9</pub-id>
<pub-id pub-id-type="pmid">12582914</pub-id>
</element-citation>
</ref>
<ref id="CR61"><label>61.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Wissuwa</surname>
<given-names>M</given-names>
</name>
<name><surname>Yano</surname>
<given-names>M</given-names>
</name>
<name><surname>Ae</surname>
<given-names>N</given-names>
</name>
</person-group>
<article-title>Mapping of QTLs for phosphorus-deficiency tolerance in rice (Oryza sativa L.)</article-title>
<source>Theor Appl Genet</source>
<year>1998</year>
<volume>97</volume>
<fpage>777</fpage>
<lpage>783</lpage>
<pub-id pub-id-type="doi">10.1007/s001220050955</pub-id>
</element-citation>
</ref>
<ref id="CR62"><label>62.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Chin</surname>
<given-names>JH</given-names>
</name>
<name><surname>Gamuyao</surname>
<given-names>R</given-names>
</name>
<name><surname>Dalid</surname>
<given-names>C</given-names>
</name>
<name><surname>Bustamam</surname>
<given-names>M</given-names>
</name>
<name><surname>Prasetiyono</surname>
<given-names>J</given-names>
</name>
<name><surname>Moeljopawiro</surname>
<given-names>S</given-names>
</name>
<name><surname>Wissuwa</surname>
<given-names>M</given-names>
</name>
<name><surname>Heuer</surname>
<given-names>S</given-names>
</name>
</person-group>
<article-title>Developing rice with high yield under phosphorus deficiency: Pup1 sequence to application</article-title>
<source>Plant Physiol</source>
<year>2011</year>
<volume>156</volume>
<fpage>1202</fpage>
<lpage>1216</lpage>
<pub-id pub-id-type="doi">10.1104/pp.111.175471</pub-id>
<pub-id pub-id-type="pmid">21602323</pub-id>
</element-citation>
</ref>
<ref id="CR63"><label>63.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Eizenga</surname>
<given-names>GC</given-names>
</name>
<name><surname>Ali</surname>
<given-names>M</given-names>
</name>
<name><surname>Bryant</surname>
<given-names>RJ</given-names>
</name>
<name><surname>Yeater</surname>
<given-names>KM</given-names>
</name>
<name><surname>McClung</surname>
<given-names>AM</given-names>
</name>
<name><surname>McCouch</surname>
<given-names>SR</given-names>
</name>
</person-group>
<article-title>Registration of the rice diversity panel 1 for genomewide association studies</article-title>
<source>J Plant Reg</source>
<year>2013</year>
<volume>8</volume>
<fpage>109</fpage>
<lpage>116</lpage>
<pub-id pub-id-type="doi">10.3198/jpr2013.03.0013crmp</pub-id>
</element-citation>
</ref>
<ref id="CR64"><label>64.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Bin Rahman</surname>
<given-names>AN</given-names>
</name>
<name><surname>Zhang</surname>
<given-names>J</given-names>
</name>
</person-group>
<article-title>Rayada specialty: the forgotten resource of elite features of rice</article-title>
<source>Rice</source>
<year>2013</year>
<volume>6</volume>
<fpage>41</fpage>
<pub-id pub-id-type="doi">10.1186/1939-8433-6-41</pub-id>
<pub-id pub-id-type="pmid">24359642</pub-id>
</element-citation>
</ref>
<ref id="CR65"><label>65.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Roberts</surname>
<given-names>RJ</given-names>
</name>
<name><surname>Carneiro</surname>
<given-names>MO</given-names>
</name>
<name><surname>Schatz</surname>
<given-names>MC</given-names>
</name>
</person-group>
<article-title>The advantages of SMRT sequencing</article-title>
<source>Genome Biol</source>
<year>2013</year>
<volume>14</volume>
<fpage>405</fpage>
<pub-id pub-id-type="doi">10.1186/gb-2013-14-6-405</pub-id>
<pub-id pub-id-type="pmid">23822731</pub-id>
</element-citation>
</ref>
<ref id="CR66"><label>66.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Luo</surname>
<given-names>R</given-names>
</name>
<name><surname>Liu</surname>
<given-names>B</given-names>
</name>
<name><surname>Xie</surname>
<given-names>Y</given-names>
</name>
<name><surname>Li</surname>
<given-names>Z</given-names>
</name>
<name><surname>Huang</surname>
<given-names>W</given-names>
</name>
<name><surname>Yuan</surname>
<given-names>J</given-names>
</name>
<name><surname>He</surname>
<given-names>G</given-names>
</name>
<name><surname>Chen</surname>
<given-names>Y</given-names>
</name>
<name><surname>Pan</surname>
<given-names>Q</given-names>
</name>
<name><surname>Liu</surname>
<given-names>Y</given-names>
</name>
<name><surname>Tang</surname>
<given-names>J</given-names>
</name>
<name><surname>Wu</surname>
<given-names>G</given-names>
</name>
<name><surname>Zhang</surname>
<given-names>H</given-names>
</name>
<name><surname>Shi</surname>
<given-names>Y</given-names>
</name>
<name><surname>Yu</surname>
<given-names>C</given-names>
</name>
<name><surname>Wang</surname>
<given-names>B</given-names>
</name>
<name><surname>Lu</surname>
<given-names>Y</given-names>
</name>
<name><surname>Han</surname>
<given-names>C</given-names>
</name>
<name><surname>Cheung</surname>
<given-names>DW</given-names>
</name>
<name><surname>Yiu</surname>
<given-names>SM</given-names>
</name>
<name><surname>Peng</surname>
<given-names>S</given-names>
</name>
<name><surname>Xiaoqian</surname>
<given-names>Z</given-names>
</name>
<name><surname>Liu</surname>
<given-names>G</given-names>
</name>
<name><surname>Liao</surname>
<given-names>X</given-names>
</name>
<name><surname>Li</surname>
<given-names>Y</given-names>
</name>
<name><surname>Yang</surname>
<given-names>H</given-names>
</name>
<name><surname>Wang</surname>
<given-names>J</given-names>
</name>
<name><surname>Lam</surname>
<given-names>TW</given-names>
</name>
</person-group>
<article-title>SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler</article-title>
<source>Gigascience</source>
<year>2012</year>
<volume>1</volume>
<fpage>18</fpage>
<pub-id pub-id-type="doi">10.1186/2047-217X-1-18</pub-id>
<pub-id pub-id-type="pmid">23587118</pub-id>
</element-citation>
</ref>
<ref id="CR67"><label>67.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Simpson</surname>
<given-names>JT</given-names>
</name>
<name><surname>Durbin</surname>
<given-names>R</given-names>
</name>
</person-group>
<article-title>Efficient de novo assembly of large genomes using compressed data structures</article-title>
<source>Genome Res</source>
<year>2012</year>
<volume>22</volume>
<fpage>549</fpage>
<lpage>556</lpage>
<pub-id pub-id-type="doi">10.1101/gr.126953.111</pub-id>
<pub-id pub-id-type="pmid">22156294</pub-id>
</element-citation>
</ref>
<ref id="CR68"><label>68.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Kelley</surname>
<given-names>DR</given-names>
</name>
<name><surname>Schatz</surname>
<given-names>MC</given-names>
</name>
<name><surname>Salzberg</surname>
<given-names>SL</given-names>
</name>
</person-group>
<article-title>Quake: quality-aware detection and correction of sequencing errors</article-title>
<source>Genome Biol</source>
<year>2010</year>
<volume>11</volume>
<fpage>R116</fpage>
<pub-id pub-id-type="doi">10.1186/gb-2010-11-11-r116</pub-id>
<pub-id pub-id-type="pmid">21114842</pub-id>
</element-citation>
</ref>
<ref id="CR69"><label>69.</label>
<mixed-citation publication-type="other">Smit AFA, Hubley R, Green P: RepeatMasker Open-3.0. 1996–2010. <ext-link ext-link-type="uri" xlink:href="http://www.repeatmasker.org">http://www.repeatmasker.org</ext-link>
</mixed-citation>
</ref>
<ref id="CR70"><label>70.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Cantarel</surname>
<given-names>BL</given-names>
</name>
<name><surname>Korf</surname>
<given-names>I</given-names>
</name>
<name><surname>Robb</surname>
<given-names>SM</given-names>
</name>
<name><surname>Parra</surname>
<given-names>G</given-names>
</name>
<name><surname>Ross</surname>
<given-names>E</given-names>
</name>
<name><surname>Moore</surname>
<given-names>B</given-names>
</name>
<name><surname>Holt</surname>
<given-names>C</given-names>
</name>
<name><surname>Sanchez Alvarado</surname>
<given-names>A</given-names>
</name>
<name><surname>Yandell</surname>
<given-names>M</given-names>
</name>
</person-group>
<article-title>MAKER: an easy-to-use annotation pipeline designed for emerging model organism genomes</article-title>
<source>Genome Res</source>
<year>2008</year>
<volume>18</volume>
<fpage>188</fpage>
<lpage>196</lpage>
<pub-id pub-id-type="doi">10.1101/gr.6743907</pub-id>
<pub-id pub-id-type="pmid">18025269</pub-id>
</element-citation>
</ref>
<ref id="CR71"><label>71.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Goff</surname>
<given-names>SA</given-names>
</name>
<name><surname>Vaughn</surname>
<given-names>M</given-names>
</name>
<name><surname>McKay</surname>
<given-names>S</given-names>
</name>
<name><surname>Lyons</surname>
<given-names>E</given-names>
</name>
<name><surname>Stapleton</surname>
<given-names>AE</given-names>
</name>
<name><surname>Gessler</surname>
<given-names>D</given-names>
</name>
<name><surname>Matasci</surname>
<given-names>N</given-names>
</name>
<name><surname>Wang</surname>
<given-names>L</given-names>
</name>
<name><surname>Hanlon</surname>
<given-names>M</given-names>
</name>
<name><surname>Lenards</surname>
<given-names>A</given-names>
</name>
<name><surname>Muir</surname>
<given-names>A</given-names>
</name>
<name><surname>Merchant</surname>
<given-names>N</given-names>
</name>
<name><surname>Lowry</surname>
<given-names>S</given-names>
</name>
<name><surname>Mock</surname>
<given-names>S</given-names>
</name>
<name><surname>Helmke</surname>
<given-names>M</given-names>
</name>
<name><surname>Kubach</surname>
<given-names>A</given-names>
</name>
<name><surname>Narro</surname>
<given-names>M</given-names>
</name>
<name><surname>Hopkins</surname>
<given-names>N</given-names>
</name>
<name><surname>Micklos</surname>
<given-names>D</given-names>
</name>
<name><surname>Hilgert</surname>
<given-names>U</given-names>
</name>
<name><surname>Gonzales</surname>
<given-names>M</given-names>
</name>
<name><surname>Jordan</surname>
<given-names>C</given-names>
</name>
<name><surname>Skidmore</surname>
<given-names>E</given-names>
</name>
<name><surname>Dooley</surname>
<given-names>R</given-names>
</name>
<name><surname>Cazes</surname>
<given-names>J</given-names>
</name>
<name><surname>McLay</surname>
<given-names>R</given-names>
</name>
<name><surname>Lu</surname>
<given-names>Z</given-names>
</name>
<name><surname>Pasternak</surname>
<given-names>S</given-names>
</name>
<name><surname>Koesterke</surname>
<given-names>L</given-names>
</name>
<name><surname>Piel</surname>
<given-names>WH</given-names>
</name>
<etal></etal>
</person-group>
<article-title>The iPlant Collaborative: cyberinfrastructure for plant biology</article-title>
<source>Front Plant Sci</source>
<year>2011</year>
<volume>2</volume>
<fpage>34</fpage>
<pub-id pub-id-type="doi">10.3389/fpls.2011.00034</pub-id>
<pub-id pub-id-type="pmid">22645531</pub-id>
</element-citation>
</ref>
<ref id="CR72"><label>72.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Holt</surname>
<given-names>C</given-names>
</name>
<name><surname>Yandell</surname>
<given-names>M</given-names>
</name>
</person-group>
<article-title>MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects</article-title>
<source>BMC Bioinformatics</source>
<year>2011</year>
<volume>12</volume>
<fpage>491</fpage>
<pub-id pub-id-type="doi">10.1186/1471-2105-12-491</pub-id>
<pub-id pub-id-type="pmid">22192575</pub-id>
</element-citation>
</ref>
<ref id="CR73"><label>73.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Salamov</surname>
<given-names>AA</given-names>
</name>
<name><surname>Solovyev</surname>
<given-names>VV</given-names>
</name>
</person-group>
<article-title>Ab initio gene finding in Drosophila genomic DNA</article-title>
<source>Genome Res</source>
<year>2000</year>
<volume>10</volume>
<fpage>516</fpage>
<lpage>522</lpage>
<pub-id pub-id-type="doi">10.1101/gr.10.4.516</pub-id>
<pub-id pub-id-type="pmid">10779491</pub-id>
</element-citation>
</ref>
<ref id="CR74"><label>74.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Korf</surname>
<given-names>I</given-names>
</name>
</person-group>
<article-title>Gene finding in novel genomes</article-title>
<source>BMC Bioinformatics</source>
<year>2004</year>
<volume>5</volume>
<fpage>59</fpage>
<pub-id pub-id-type="doi">10.1186/1471-2105-5-59</pub-id>
<pub-id pub-id-type="pmid">15144565</pub-id>
</element-citation>
</ref>
<ref id="CR75"><label>75.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Jones</surname>
<given-names>P</given-names>
</name>
<name><surname>Binns</surname>
<given-names>D</given-names>
</name>
<name><surname>Chang</surname>
<given-names>HY</given-names>
</name>
<name><surname>Fraser</surname>
<given-names>M</given-names>
</name>
<name><surname>Li</surname>
<given-names>W</given-names>
</name>
<name><surname>McAnulla</surname>
<given-names>C</given-names>
</name>
<name><surname>McWilliam</surname>
<given-names>H</given-names>
</name>
<name><surname>Maslen</surname>
<given-names>J</given-names>
</name>
<name><surname>Mitchell</surname>
<given-names>A</given-names>
</name>
<name><surname>Nuka</surname>
<given-names>G</given-names>
</name>
<name><surname>Pesseat</surname>
<given-names>S</given-names>
</name>
<name><surname>Quinn</surname>
<given-names>AF</given-names>
</name>
<name><surname>Sangrador-Vegas</surname>
<given-names>A</given-names>
</name>
<name><surname>Scheremetjew</surname>
<given-names>M</given-names>
</name>
<name><surname>Yong</surname>
<given-names>SY</given-names>
</name>
<name><surname>Lopez</surname>
<given-names>R</given-names>
</name>
<name><surname>Hunter</surname>
<given-names>S</given-names>
</name>
</person-group>
<article-title>InterProScan 5: genome-scale protein function classification</article-title>
<source>Bioinformatics</source>
<year>2014</year>
<volume>30</volume>
<fpage>1236</fpage>
<lpage>1240</lpage>
<pub-id pub-id-type="doi">10.1093/bioinformatics/btu031</pub-id>
<pub-id pub-id-type="pmid">24451626</pub-id>
</element-citation>
</ref>
<ref id="CR76"><label>76.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Oliver</surname>
<given-names>SL</given-names>
</name>
<name><surname>Lenards</surname>
<given-names>AJ</given-names>
</name>
<name><surname>Barthelson</surname>
<given-names>RA</given-names>
</name>
<name><surname>Merchant</surname>
<given-names>N</given-names>
</name>
<name><surname>McKay</surname>
<given-names>SJ</given-names>
</name>
</person-group>
<article-title>Using the iPlant collaborative discovery environment</article-title>
<source>Curr Protoc Bioinformatics</source>
<year>2013</year>
<volume>Chapter 1</volume>
<fpage>Unit 1.22</fpage>
<pub-id pub-id-type="pmid">23749752</pub-id>
</element-citation>
</ref>
<ref id="CR77"><label>77.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Kurtz</surname>
<given-names>S</given-names>
</name>
<name><surname>Phillippy</surname>
<given-names>A</given-names>
</name>
<name><surname>Delcher</surname>
<given-names>AL</given-names>
</name>
<name><surname>Smoot</surname>
<given-names>M</given-names>
</name>
<name><surname>Shumway</surname>
<given-names>M</given-names>
</name>
<name><surname>Antonescu</surname>
<given-names>C</given-names>
</name>
<name><surname>Salzberg</surname>
<given-names>SL</given-names>
</name>
</person-group>
<article-title>Versatile and open software for comparing large genomes</article-title>
<source>Genome Biol</source>
<year>2004</year>
<volume>5</volume>
<fpage>R12</fpage>
<pub-id pub-id-type="doi">10.1186/gb-2004-5-2-r12</pub-id>
<pub-id pub-id-type="pmid">14759262</pub-id>
</element-citation>
</ref>
<ref id="CR78"><label>78.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Quinlan</surname>
<given-names>AR</given-names>
</name>
<name><surname>Hall</surname>
<given-names>IM</given-names>
</name>
</person-group>
<article-title>BEDTools: a flexible suite of utilities for comparing genomic features</article-title>
<source>Bioinformatics</source>
<year>2010</year>
<volume>26</volume>
<fpage>841</fpage>
<lpage>842</lpage>
<pub-id pub-id-type="doi">10.1093/bioinformatics/btq033</pub-id>
<pub-id pub-id-type="pmid">20110278</pub-id>
</element-citation>
</ref>
<ref id="CR79"><label>79.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Schatz</surname>
<given-names>MC</given-names>
</name>
<name><surname>Phillippy</surname>
<given-names>AM</given-names>
</name>
<name><surname>Sommer</surname>
<given-names>DD</given-names>
</name>
<name><surname>Delcher</surname>
<given-names>AL</given-names>
</name>
<name><surname>Puiu</surname>
<given-names>D</given-names>
</name>
<name><surname>Narzisi</surname>
<given-names>G</given-names>
</name>
<name><surname>Salzberg</surname>
<given-names>SL</given-names>
</name>
<name><surname>Pop</surname>
<given-names>M</given-names>
</name>
</person-group>
<article-title>Hawkeye and AMOS: visualizing and assessing the quality of genome assemblies</article-title>
<source>Brief Bioinform</source>
<year>2013</year>
<volume>14</volume>
<fpage>213</fpage>
<lpage>224</lpage>
<pub-id pub-id-type="doi">10.1093/bib/bbr074</pub-id>
<pub-id pub-id-type="pmid">22199379</pub-id>
</element-citation>
</ref>
<ref id="CR80"><label>80.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Marcais</surname>
<given-names>G</given-names>
</name>
<name><surname>Kingsford</surname>
<given-names>C</given-names>
</name>
</person-group>
<article-title>A fast, lock-free approach for efficient parallel counting of occurrences of k-mers</article-title>
<source>Bioinformatics</source>
<year>2011</year>
<volume>27</volume>
<fpage>764</fpage>
<lpage>770</lpage>
<pub-id pub-id-type="doi">10.1093/bioinformatics/btr011</pub-id>
<pub-id pub-id-type="pmid">21217122</pub-id>
</element-citation>
</ref>
<ref id="CR81"><label>81.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Kurtz</surname>
<given-names>S</given-names>
</name>
<name><surname>Narechania</surname>
<given-names>A</given-names>
</name>
<name><surname>Stein</surname>
<given-names>JC</given-names>
</name>
<name><surname>Ware</surname>
<given-names>D</given-names>
</name>
</person-group>
<article-title>A new method to compute K-mer frequencies and its application to annotate large repetitive plant genomes</article-title>
<source>BMC Genomics</source>
<year>2008</year>
<volume>9</volume>
<fpage>517</fpage>
<pub-id pub-id-type="doi">10.1186/1471-2164-9-517</pub-id>
<pub-id pub-id-type="pmid">18976482</pub-id>
</element-citation>
</ref>
<ref id="CR82"><label>82.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Phillippy</surname>
<given-names>AM</given-names>
</name>
<name><surname>Schatz</surname>
<given-names>MC</given-names>
</name>
<name><surname>Pop</surname>
<given-names>M</given-names>
</name>
</person-group>
<article-title>Genome assembly forensics: finding the elusive mis-assembly</article-title>
<source>Genome Biol</source>
<year>2008</year>
<volume>9</volume>
<fpage>R55</fpage>
<pub-id pub-id-type="doi">10.1186/gb-2008-9-3-r55</pub-id>
<pub-id pub-id-type="pmid">18341692</pub-id>
</element-citation>
</ref>
<ref id="CR83"><label>83.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Reyes</surname>
<given-names>J</given-names>
</name>
<name><surname>Gomez-Romero</surname>
<given-names>L</given-names>
</name>
<name><surname>Ibarra-Soria</surname>
<given-names>X</given-names>
</name>
<name><surname>Palacios-Flores</surname>
<given-names>K</given-names>
</name>
<name><surname>Arriola</surname>
<given-names>LR</given-names>
</name>
<name><surname>Wences</surname>
<given-names>A</given-names>
</name>
<name><surname>Garcia</surname>
<given-names>D</given-names>
</name>
<name><surname>Boege</surname>
<given-names>M</given-names>
</name>
<name><surname>Davila</surname>
<given-names>G</given-names>
</name>
<name><surname>Flores</surname>
<given-names>M</given-names>
</name>
<name><surname>Palacios</surname>
<given-names>R</given-names>
</name>
</person-group>
<article-title>Context-dependent individualization of nucleotides and virtual genomic hybridization allow the precise location of human SNPs</article-title>
<source>Proc Natl Acad Sci U S A</source>
<year>2011</year>
<volume>108</volume>
<fpage>15294</fpage>
<lpage>15299</lpage>
<pub-id pub-id-type="doi">10.1073/pnas.1112567108</pub-id>
<pub-id pub-id-type="pmid">21876154</pub-id>
</element-citation>
</ref>
<ref id="CR84"><label>84.</label>
<mixed-citation publication-type="other"><bold>New whole genome <italic>de novo</italic> assemblies of three divergent strains of rice (O. sativa) documents novel gene space of aus and indica.</bold>
 [<ext-link ext-link-type="uri" xlink:href="http://schatzlab.cshl.edu/data/rice">http://schatzlab.cshl.edu/data/rice</ext-link>
]</mixed-citation>
</ref>
<ref id="CR85"><label>85.</label>
<mixed-citation publication-type="other"><bold>ALLPATHS-LG.</bold>
 [<ext-link ext-link-type="uri" xlink:href="http://www.broadinstitute.org/software/allpaths-lg/blog/?page_id=12">http://www.broadinstitute.org/software/allpaths-lg/blog/?page_id=12</ext-link>
]</mixed-citation>
</ref>
<ref id="CR86"><label>86.</label>
<mixed-citation publication-type="other"><bold>MUMmer.</bold>
 [<ext-link ext-link-type="uri" xlink:href="http://mummer.sourceforge.net">http://mummer.sourceforge.net</ext-link>
]</mixed-citation>
</ref>
<ref id="CR87"><label>87.</label>
<mixed-citation publication-type="other"><bold>AMOS.</bold>
 [<ext-link ext-link-type="uri" xlink:href="http://amos.sourceforge.net">http://amos.sourceforge.net</ext-link>
]</mixed-citation>
</ref>
<ref id="CR88"><label>88.</label>
<mixed-citation publication-type="other"><bold>Jellyfish.</bold>
 [<ext-link ext-link-type="uri" xlink:href="http://www.genome.umd.edu/jellyfish.html">http://www.genome.umd.edu/jellyfish.html</ext-link>
]</mixed-citation>
</ref>
<ref id="CR89"><label>89.</label>
<mixed-citation publication-type="other"><bold>BEDTools.</bold>
 [<ext-link ext-link-type="uri" xlink:href="https://github.com/arq5x/bedtools2">https://github.com/arq5x/bedtools2</ext-link>
]</mixed-citation>
</ref>
</ref-list>
</back>
</pmc>
</record>

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/CyberinfraV1/Data/Pmc/Corpus

HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000182 | SxmlIndent | more

HfdSelect -h $EXPLOR_AREA/Data/Pmc/Corpus/biblio.hfd -nk 000182 | SxmlIndent | more
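
Once the record has been extracted and indented with the commands above, its reference list can be summarised with a standard XML parser. The following is a minimal sketch, not part of Dilib: the function name `summarize_refs` and the inline `SAMPLE` string are illustrative assumptions, and the sample holds a single `<ref>` copied from this record. It only handles structured `<element-citation>` entries and skips free-text `<mixed-citation>` entries. Parsing the full record would probably also require the `xlink` namespace to be declared, since the `<ext-link>` elements use `xlink:href`.

```python
import xml.etree.ElementTree as ET

# Illustrative sample: one <ref> copied from this record's reference list.
SAMPLE = """<ref-list>
<ref id="CR78"><label>78.</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Quinlan</surname>
<given-names>AR</given-names>
</name>
<name><surname>Hall</surname>
<given-names>IM</given-names>
</name>
</person-group>
<article-title>BEDTools: a flexible suite of utilities for comparing genomic features</article-title>
<source>Bioinformatics</source>
<year>2010</year>
<volume>26</volume>
<fpage>841</fpage>
<lpage>842</lpage>
<pub-id pub-id-type="doi">10.1093/bioinformatics/btq033</pub-id>
<pub-id pub-id-type="pmid">20110278</pub-id>
</element-citation>
</ref>
</ref-list>"""


def summarize_refs(xml_text):
    """Return one dict per <ref> that holds a structured <element-citation>."""
    root = ET.fromstring(xml_text)
    rows = []
    for ref in root.iter("ref"):
        cit = ref.find("element-citation")
        if cit is None:  # free-text <mixed-citation> entries are skipped
            continue
        # Build "Surname Initials" strings in document order.
        authors = [
            f"{n.findtext('surname', '').strip()} {n.findtext('given-names', '').strip()}"
            for n in cit.iter("name")
        ]
        # Map each identifier type (doi, pmid) to its value.
        ids = {p.get("pub-id-type"): (p.text or "").strip() for p in cit.iter("pub-id")}
        rows.append({
            "id": ref.get("id"),
            "authors": authors,
            "title": (cit.findtext("article-title") or "").strip(),
            "source": (cit.findtext("source") or "").strip(),
            "year": (cit.findtext("year") or "").strip(),
            "doi": ids.get("doi"),
            "pmid": ids.get("pmid"),
        })
    return rows


refs = summarize_refs(SAMPLE)
for r in refs:
    print(r["id"], "; ".join(r["authors"]), r["year"], r["doi"])
```

To apply the sketch to the real record, the output of the `HfdSelect ... | SxmlIndent` pipeline could be redirected to a file and its contents passed to `summarize_refs` in place of `SAMPLE`.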

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    CyberinfraV1
   |flux=    Pmc
   |étape=   Corpus
   |type=    RBID
   |clé=     PMC:4268812
   |texte=   Whole genome de novo assemblies of three divergent strains of rice, Oryza sativa, document novel gene space of aus and indica
}}

To generate wiki pages

HfdIndexSelect -h $EXPLOR_AREA/Data/Pmc/Corpus/RBID.i   -Sk "pubmed:25468217" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Pmc/Corpus/biblio.hfd   \
       | NlmPubMed2Wicri -a CyberinfraV1

This area was generated with Dilib version V0.6.25.
Data generation: Thu Oct 27 09:30:58 2016. Site generation: Sun Mar 10 23:08:40 2024
