OrangerV1, Pmc, Corpus, bibRecord, 000233

Comprehensive analyses of genomes, transcriptomes and metabolites of neem tree

Identifieur interne : 000233 ( Pmc/Corpus ); précédent : 000232; suivant : 000234

Comprehensive analyses of genomes, transcriptomes and metabolites of neem tree

Auteurs : Nagesh A. Kuravadi ; Vijay Yenagi ; Kannan Rangiah ; Hb Mahesh ; Anantharamanan Rajamani ; Meghana D. Shirke ; Heikham Russiachand ; Ramya Malarini Loganathan ; Chandana Shankara Lingu ; Shilpa Siddappa ; Aishwarya Ramamurthy ; Bn Sathyanarayana ; Malali Gowda

Source :

PeerJ [ 2167-8359 ] ; 2015.

RBID : PMC:4540028

Abstract

Neem (Azadirachta indica A. Juss) is one of the most versatile tropical evergreen tree species known in India since the Vedic period (1500 BC–600 BC). Neem tree is a rich source of limonoids, having a wide spectrum of activity against insect pests and microbial pathogens. Complex tetranortriterpenoids such as azadirachtin, salanin and nimbin are the major active principles isolated from neem seed. Absolutely nothing is known about the biochemical pathways of these metabolites in neem tree. To identify genes and pathways in neem, we sequenced neem genomes and transcriptomes using next generation sequencing technologies. Assembly of Illumina and 454 sequencing reads resulted in 267 Mb, which accounts for 70% of estimated size of neem genome. We predicted 44,495 genes in the neem genome, of which 32,278 genes were expressed in neem tissues. Neem genome consists about 32.5% (87 Mb) of repetitive DNA elements. Neem tree is phylogenetically related to citrus, Citrus sinensis. Comparative analysis anchored 62% (161 Mb) of assembled neem genomic contigs onto citrus chromomes. Ultrahigh performance liquid chromatography-mass spectrometry-selected reaction monitoring (UHPLC-MS/SRM) method was used to quantify azadirachtin, nimbin, and salanin from neem tissues. Weighted Correlation Network Analysis (WCGNA) of expressed genes and metabolites resulted in identification of possible candidate genes involved in azadirachtin biosynthesis pathway. This study provides genomic, transcriptomic and quantity of top three neem metabolites resource, which will accelerate basic research in neem to understand biochemical pathways.

Url:

http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4540028

DOI: 10.7717/peerj.1066
PubMed: 26290780
PubMed Central: 4540028

Links to Exploration step

PMC:4540028

Le document en format XML

<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en">Comprehensive analyses of genomes, transcriptomes and metabolites of neem tree</title>
<author><name sortKey="Kuravadi, Nagesh A" sort="Kuravadi, Nagesh A" uniqKey="Kuravadi N" first="Nagesh A." last="Kuravadi">Nagesh A. Kuravadi</name>
<affiliation><nlm:aff id="aff-1"><institution>Genomics Laboratory, Centre for Cellular and Molecular Platforms, National Centre for Biological Sciences</institution>
,<addr-line>Bangalore, Karnataka</addr-line>
,<country>India</country>
</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Yenagi, Vijay" sort="Yenagi, Vijay" uniqKey="Yenagi V" first="Vijay" last="Yenagi">Vijay Yenagi</name>
<affiliation><nlm:aff id="aff-1"><institution>Genomics Laboratory, Centre for Cellular and Molecular Platforms, National Centre for Biological Sciences</institution>
,<addr-line>Bangalore, Karnataka</addr-line>
,<country>India</country>
</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Rangiah, Kannan" sort="Rangiah, Kannan" uniqKey="Rangiah K" first="Kannan" last="Rangiah">Kannan Rangiah</name>
<affiliation><nlm:aff id="aff-2"><institution>Metabolomics Facility, Centre for Cellular and Molecular Platforms, National Centre for Biological Sciences</institution>
,<addr-line>Bangalore, Karnataka</addr-line>
,<country>India</country>
</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Mahesh, Hb" sort="Mahesh, Hb" uniqKey="Mahesh H" first="Hb" last="Mahesh">Hb Mahesh</name>
<affiliation><nlm:aff id="aff-1"><institution>Genomics Laboratory, Centre for Cellular and Molecular Platforms, National Centre for Biological Sciences</institution>
,<addr-line>Bangalore, Karnataka</addr-line>
,<country>India</country>
</nlm:aff>
</affiliation>
<affiliation><nlm:aff id="aff-3"><institution>Marker Assisted Selection Laboratory, Department of Genetics and Plant Breeding, University of Agricultural Sciences, GKVK</institution>
,<addr-line>Bangalore, Karnataka</addr-line>
,<country>India</country>
</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Rajamani, Anantharamanan" sort="Rajamani, Anantharamanan" uniqKey="Rajamani A" first="Anantharamanan" last="Rajamani">Anantharamanan Rajamani</name>
<affiliation><nlm:aff id="aff-1"><institution>Genomics Laboratory, Centre for Cellular and Molecular Platforms, National Centre for Biological Sciences</institution>
,<addr-line>Bangalore, Karnataka</addr-line>
,<country>India</country>
</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Shirke, Meghana D" sort="Shirke, Meghana D" uniqKey="Shirke M" first="Meghana D." last="Shirke">Meghana D. Shirke</name>
<affiliation><nlm:aff id="aff-1"><institution>Genomics Laboratory, Centre for Cellular and Molecular Platforms, National Centre for Biological Sciences</institution>
,<addr-line>Bangalore, Karnataka</addr-line>
,<country>India</country>
</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Russiachand, Heikham" sort="Russiachand, Heikham" uniqKey="Russiachand H" first="Heikham" last="Russiachand">Heikham Russiachand</name>
<affiliation><nlm:aff id="aff-1"><institution>Genomics Laboratory, Centre for Cellular and Molecular Platforms, National Centre for Biological Sciences</institution>
,<addr-line>Bangalore, Karnataka</addr-line>
,<country>India</country>
</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Loganathan, Ramya Malarini" sort="Loganathan, Ramya Malarini" uniqKey="Loganathan R" first="Ramya Malarini" last="Loganathan">Ramya Malarini Loganathan</name>
<affiliation><nlm:aff id="aff-1"><institution>Genomics Laboratory, Centre for Cellular and Molecular Platforms, National Centre for Biological Sciences</institution>
,<addr-line>Bangalore, Karnataka</addr-line>
,<country>India</country>
</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Shankara Lingu, Chandana" sort="Shankara Lingu, Chandana" uniqKey="Shankara Lingu C" first="Chandana" last="Shankara Lingu">Chandana Shankara Lingu</name>
<affiliation><nlm:aff id="aff-1"><institution>Genomics Laboratory, Centre for Cellular and Molecular Platforms, National Centre for Biological Sciences</institution>
,<addr-line>Bangalore, Karnataka</addr-line>
,<country>India</country>
</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Siddappa, Shilpa" sort="Siddappa, Shilpa" uniqKey="Siddappa S" first="Shilpa" last="Siddappa">Shilpa Siddappa</name>
<affiliation><nlm:aff id="aff-1"><institution>Genomics Laboratory, Centre for Cellular and Molecular Platforms, National Centre for Biological Sciences</institution>
,<addr-line>Bangalore, Karnataka</addr-line>
,<country>India</country>
</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Ramamurthy, Aishwarya" sort="Ramamurthy, Aishwarya" uniqKey="Ramamurthy A" first="Aishwarya" last="Ramamurthy">Aishwarya Ramamurthy</name>
<affiliation><nlm:aff id="aff-1"><institution>Genomics Laboratory, Centre for Cellular and Molecular Platforms, National Centre for Biological Sciences</institution>
,<addr-line>Bangalore, Karnataka</addr-line>
,<country>India</country>
</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Sathyanarayana, Bn" sort="Sathyanarayana, Bn" uniqKey="Sathyanarayana B" first="Bn" last="Sathyanarayana">Bn Sathyanarayana</name>
<affiliation><nlm:aff id="aff-4"><institution>Plant Tissue Culture Laboratory, University of Agricultural Sciences, GKVK</institution>
,<addr-line>Bangalore, Karnataka</addr-line>
,<country>India</country>
</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Gowda, Malali" sort="Gowda, Malali" uniqKey="Gowda M" first="Malali" last="Gowda">Malali Gowda</name>
<affiliation><nlm:aff id="aff-1"><institution>Genomics Laboratory, Centre for Cellular and Molecular Platforms, National Centre for Biological Sciences</institution>
,<addr-line>Bangalore, Karnataka</addr-line>
,<country>India</country>
</nlm:aff>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">PMC</idno>
<idno type="pmid">26290780</idno>
<idno type="pmc">4540028</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4540028</idno>
<idno type="RBID">PMC:4540028</idno>
<idno type="doi">10.7717/peerj.1066</idno>
<date when="2015">2015</date>
<idno type="wicri:Area/Pmc/Corpus">000233</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a" type="main">Comprehensive analyses of genomes, transcriptomes and metabolites of neem tree</title>
<author><name sortKey="Kuravadi, Nagesh A" sort="Kuravadi, Nagesh A" uniqKey="Kuravadi N" first="Nagesh A." last="Kuravadi">Nagesh A. Kuravadi</name>
<affiliation><nlm:aff id="aff-1"><institution>Genomics Laboratory, Centre for Cellular and Molecular Platforms, National Centre for Biological Sciences</institution>
,<addr-line>Bangalore, Karnataka</addr-line>
,<country>India</country>
</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Yenagi, Vijay" sort="Yenagi, Vijay" uniqKey="Yenagi V" first="Vijay" last="Yenagi">Vijay Yenagi</name>
<affiliation><nlm:aff id="aff-1"><institution>Genomics Laboratory, Centre for Cellular and Molecular Platforms, National Centre for Biological Sciences</institution>
,<addr-line>Bangalore, Karnataka</addr-line>
,<country>India</country>
</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Rangiah, Kannan" sort="Rangiah, Kannan" uniqKey="Rangiah K" first="Kannan" last="Rangiah">Kannan Rangiah</name>
<affiliation><nlm:aff id="aff-2"><institution>Metabolomics Facility, Centre for Cellular and Molecular Platforms, National Centre for Biological Sciences</institution>
,<addr-line>Bangalore, Karnataka</addr-line>
,<country>India</country>
</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Mahesh, Hb" sort="Mahesh, Hb" uniqKey="Mahesh H" first="Hb" last="Mahesh">Hb Mahesh</name>
<affiliation><nlm:aff id="aff-1"><institution>Genomics Laboratory, Centre for Cellular and Molecular Platforms, National Centre for Biological Sciences</institution>
,<addr-line>Bangalore, Karnataka</addr-line>
,<country>India</country>
</nlm:aff>
</affiliation>
<affiliation><nlm:aff id="aff-3"><institution>Marker Assisted Selection Laboratory, Department of Genetics and Plant Breeding, University of Agricultural Sciences, GKVK</institution>
,<addr-line>Bangalore, Karnataka</addr-line>
,<country>India</country>
</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Rajamani, Anantharamanan" sort="Rajamani, Anantharamanan" uniqKey="Rajamani A" first="Anantharamanan" last="Rajamani">Anantharamanan Rajamani</name>
<affiliation><nlm:aff id="aff-1"><institution>Genomics Laboratory, Centre for Cellular and Molecular Platforms, National Centre for Biological Sciences</institution>
,<addr-line>Bangalore, Karnataka</addr-line>
,<country>India</country>
</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Shirke, Meghana D" sort="Shirke, Meghana D" uniqKey="Shirke M" first="Meghana D." last="Shirke">Meghana D. Shirke</name>
<affiliation><nlm:aff id="aff-1"><institution>Genomics Laboratory, Centre for Cellular and Molecular Platforms, National Centre for Biological Sciences</institution>
,<addr-line>Bangalore, Karnataka</addr-line>
,<country>India</country>
</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Russiachand, Heikham" sort="Russiachand, Heikham" uniqKey="Russiachand H" first="Heikham" last="Russiachand">Heikham Russiachand</name>
<affiliation><nlm:aff id="aff-1"><institution>Genomics Laboratory, Centre for Cellular and Molecular Platforms, National Centre for Biological Sciences</institution>
,<addr-line>Bangalore, Karnataka</addr-line>
,<country>India</country>
</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Loganathan, Ramya Malarini" sort="Loganathan, Ramya Malarini" uniqKey="Loganathan R" first="Ramya Malarini" last="Loganathan">Ramya Malarini Loganathan</name>
<affiliation><nlm:aff id="aff-1"><institution>Genomics Laboratory, Centre for Cellular and Molecular Platforms, National Centre for Biological Sciences</institution>
,<addr-line>Bangalore, Karnataka</addr-line>
,<country>India</country>
</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Shankara Lingu, Chandana" sort="Shankara Lingu, Chandana" uniqKey="Shankara Lingu C" first="Chandana" last="Shankara Lingu">Chandana Shankara Lingu</name>
<affiliation><nlm:aff id="aff-1"><institution>Genomics Laboratory, Centre for Cellular and Molecular Platforms, National Centre for Biological Sciences</institution>
,<addr-line>Bangalore, Karnataka</addr-line>
,<country>India</country>
</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Siddappa, Shilpa" sort="Siddappa, Shilpa" uniqKey="Siddappa S" first="Shilpa" last="Siddappa">Shilpa Siddappa</name>
<affiliation><nlm:aff id="aff-1"><institution>Genomics Laboratory, Centre for Cellular and Molecular Platforms, National Centre for Biological Sciences</institution>
,<addr-line>Bangalore, Karnataka</addr-line>
,<country>India</country>
</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Ramamurthy, Aishwarya" sort="Ramamurthy, Aishwarya" uniqKey="Ramamurthy A" first="Aishwarya" last="Ramamurthy">Aishwarya Ramamurthy</name>
<affiliation><nlm:aff id="aff-1"><institution>Genomics Laboratory, Centre for Cellular and Molecular Platforms, National Centre for Biological Sciences</institution>
,<addr-line>Bangalore, Karnataka</addr-line>
,<country>India</country>
</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Sathyanarayana, Bn" sort="Sathyanarayana, Bn" uniqKey="Sathyanarayana B" first="Bn" last="Sathyanarayana">Bn Sathyanarayana</name>
<affiliation><nlm:aff id="aff-4"><institution>Plant Tissue Culture Laboratory, University of Agricultural Sciences, GKVK</institution>
,<addr-line>Bangalore, Karnataka</addr-line>
,<country>India</country>
</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Gowda, Malali" sort="Gowda, Malali" uniqKey="Gowda M" first="Malali" last="Gowda">Malali Gowda</name>
<affiliation><nlm:aff id="aff-1"><institution>Genomics Laboratory, Centre for Cellular and Molecular Platforms, National Centre for Biological Sciences</institution>
,<addr-line>Bangalore, Karnataka</addr-line>
,<country>India</country>
</nlm:aff>
</affiliation>
</author>
</analytic>
<series><title level="j">PeerJ</title>
<idno type="eISSN">2167-8359</idno>
<imprint><date when="2015">2015</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass></textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en"><p>Neem (<italic>Azadirachta indica</italic>
 A. Juss) is one of the most versatile tropical evergreen tree species known in India since the Vedic period (1500 BC–600 BC). Neem tree is a rich source of limonoids, having a wide spectrum of activity against insect pests and microbial pathogens. Complex tetranortriterpenoids such as azadirachtin, salanin and nimbin are the major active principles isolated from neem seed. Absolutely nothing is known about the biochemical pathways of these metabolites in neem tree. To identify genes and pathways in neem, we sequenced neem genomes and transcriptomes using next generation sequencing technologies. Assembly of Illumina and 454 sequencing reads resulted in 267 Mb, which accounts for 70% of estimated size of neem genome. We predicted 44,495 genes in the neem genome, of which 32,278 genes were expressed in neem tissues. Neem genome consists about 32.5% (87 Mb) of repetitive DNA elements. Neem tree is phylogenetically related to citrus, <italic>Citrus sinensis</italic>
. Comparative analysis anchored 62% (161 Mb) of assembled neem genomic contigs onto citrus chromomes. Ultrahigh performance liquid chromatography-mass spectrometry-selected reaction monitoring (UHPLC-MS/SRM) method was used to quantify azadirachtin, nimbin, and salanin from neem tissues. Weighted Correlation Network Analysis (WCGNA) of expressed genes and metabolites resulted in identification of possible candidate genes involved in azadirachtin biosynthesis pathway. This study provides genomic, transcriptomic and quantity of top three neem metabolites resource, which will accelerate basic research in neem to understand biochemical pathways.</p>
</div>
</front>
<back><div1 type="bibliography"><listBibl><biblStruct><analytic><author><name sortKey="Aboyoun, P" uniqKey="Aboyoun P">P Aboyoun</name>
</author>
<author><name sortKey="Pages, H" uniqKey="Pages H">H Pages</name>
</author>
<author><name sortKey="Lawrence, M" uniqKey="Lawrence M">M Lawrence</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Aerts, Rj" uniqKey="Aerts R">RJ Aerts</name>
</author>
<author><name sortKey="Mordue, Aj" uniqKey="Mordue A">AJ Mordue</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Altschul, Sf" uniqKey="Altschul S">SF Altschul</name>
</author>
<author><name sortKey="Gish, W" uniqKey="Gish W">W Gish</name>
</author>
<author><name sortKey="Miller, W" uniqKey="Miller W">W Miller</name>
</author>
<author><name sortKey="Myers, Ew" uniqKey="Myers E">EW Myers</name>
</author>
<author><name sortKey="Lipman, Dj" uniqKey="Lipman D">DJ Lipman</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Alverson, Aj" uniqKey="Alverson A">AJ Alverson</name>
</author>
<author><name sortKey="Wei, X" uniqKey="Wei X">X Wei</name>
</author>
<author><name sortKey="Rice, Dw" uniqKey="Rice D">DW Rice</name>
</author>
<author><name sortKey="Stern, Db" uniqKey="Stern D">DB Stern</name>
</author>
<author><name sortKey="Barry, K" uniqKey="Barry K">K Barry</name>
</author>
<author><name sortKey="Palmer, Jd" uniqKey="Palmer J">JD Palmer</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Boontong, C" uniqKey="Boontong C">C Boontong</name>
</author>
<author><name sortKey="Pandey, M" uniqKey="Pandey M">M Pandey</name>
</author>
<author><name sortKey="Changtragoon, S" uniqKey="Changtragoon S">S Changtragoon</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Brahmachari, G" uniqKey="Brahmachari G">G Brahmachari</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Broughton, Hb" uniqKey="Broughton H">HB Broughton</name>
</author>
<author><name sortKey="Ley, Sv" uniqKey="Ley S">SV Ley</name>
</author>
<author><name sortKey="Slawin, Amz" uniqKey="Slawin A">AMZ Slawin</name>
</author>
<author><name sortKey="Williams, Djj" uniqKey="Williams D">DJJ Williams</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Burge, C" uniqKey="Burge C">C Burge</name>
</author>
<author><name sortKey="Karlin, S" uniqKey="Karlin S">S Karlin</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Butterworth, Jh" uniqKey="Butterworth J">JH Butterworth</name>
</author>
<author><name sortKey="Morgan, Ed" uniqKey="Morgan E">ED Morgan</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Chevreux, B" uniqKey="Chevreux B">B Chevreux</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Cingolani, P" uniqKey="Cingolani P">P Cingolani</name>
</author>
<author><name sortKey="Platts, A" uniqKey="Platts A">A Platts</name>
</author>
<author><name sortKey="Coon, M" uniqKey="Coon M">M Coon</name>
</author>
<author><name sortKey="Nguyen, T" uniqKey="Nguyen T">T Nguyen</name>
</author>
<author><name sortKey="Wang, L" uniqKey="Wang L">L Wang</name>
</author>
<author><name sortKey="Land, Sj" uniqKey="Land S">SJ Land</name>
</author>
<author><name sortKey="Lu, X" uniqKey="Lu X">X Lu</name>
</author>
<author><name sortKey="Ruden, Dm" uniqKey="Ruden D">DM Ruden</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Cs S, M" uniqKey="Cs S M">M Csűös</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Delcher, Al" uniqKey="Delcher A">AL Delcher</name>
</author>
<author><name sortKey="Harmon, D" uniqKey="Harmon D">D Harmon</name>
</author>
<author><name sortKey="Kasif, S" uniqKey="Kasif S">S Kasif</name>
</author>
<author><name sortKey="White, O" uniqKey="White O">O White</name>
</author>
<author><name sortKey="Salzberg, Sl" uniqKey="Salzberg S">SL Salzberg</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Dewick, Pm" uniqKey="Dewick P">PM Dewick</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Doyle, Jj" uniqKey="Doyle J">JJ Doyle</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Dro D Y Ski, D" uniqKey="Dro D Y Ski D">D Drożdżyński</name>
</author>
<author><name sortKey="Kowalska, J" uniqKey="Kowalska J">J Kowalska</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Grabherr, Mg" uniqKey="Grabherr M">MG Grabherr</name>
</author>
<author><name sortKey="Haas, Bj" uniqKey="Haas B">BJ Haas</name>
</author>
<author><name sortKey="Yassour, M" uniqKey="Yassour M">M Yassour</name>
</author>
<author><name sortKey="Levin, Jz" uniqKey="Levin J">JZ Levin</name>
</author>
<author><name sortKey="Thompson, Da" uniqKey="Thompson D">DA Thompson</name>
</author>
<author><name sortKey="Amit, I" uniqKey="Amit I">I Amit</name>
</author>
<author><name sortKey="Adiconis, X" uniqKey="Adiconis X">X Adiconis</name>
</author>
<author><name sortKey="Fan, L" uniqKey="Fan L">L Fan</name>
</author>
<author><name sortKey="Raychowdhury, R" uniqKey="Raychowdhury R">R Raychowdhury</name>
</author>
<author><name sortKey="Zeng, Q" uniqKey="Zeng Q">Q Zeng</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Grimalt, S" uniqKey="Grimalt S">S Grimalt</name>
</author>
<author><name sortKey="Thompson, Dg" uniqKey="Thompson D">DG Thompson</name>
</author>
<author><name sortKey="Coppens, M" uniqKey="Coppens M">M Coppens</name>
</author>
<author><name sortKey="Chartrand, Dt" uniqKey="Chartrand D">DT Chartrand</name>
</author>
<author><name sortKey="Shorney, T" uniqKey="Shorney T">T Shorney</name>
</author>
<author><name sortKey="Meating, J" uniqKey="Meating J">J Meating</name>
</author>
<author><name sortKey="Scarr, T" uniqKey="Scarr T">T Scarr</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Heasley, B" uniqKey="Heasley B">B Heasley</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Hosfelt, J" uniqKey="Hosfelt J">J Hosfelt</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Jiang, H" uniqKey="Jiang H">H Jiang</name>
</author>
<author><name sortKey="Wong, Wh" uniqKey="Wong W">WH Wong</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Jiang, H" uniqKey="Jiang H">H Jiang</name>
</author>
<author><name sortKey="Wong, Wh" uniqKey="Wong W">WH Wong</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Johnson, S" uniqKey="Johnson S">S Johnson</name>
</author>
<author><name sortKey="Morgan, Ed" uniqKey="Morgan E">ED Morgan</name>
</author>
<author><name sortKey="Peiris, Cn" uniqKey="Peiris C">CN Peiris</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Kawahara, Y" uniqKey="Kawahara Y">Y Kawahara</name>
</author>
<author><name sortKey="De La Bastide, M" uniqKey="De La Bastide M">M de la Bastide</name>
</author>
<author><name sortKey="Hamilton, Jp" uniqKey="Hamilton J">JP Hamilton</name>
</author>
<author><name sortKey="Kanamori, H" uniqKey="Kanamori H">H Kanamori</name>
</author>
<author><name sortKey="Mccombie, Wr" uniqKey="Mccombie W">WR McCombie</name>
</author>
<author><name sortKey="Ouyang, S" uniqKey="Ouyang S">S Ouyang</name>
</author>
<author><name sortKey="Schwartz, Dc" uniqKey="Schwartz D">DC Schwartz</name>
</author>
<author><name sortKey="Tanaka, T" uniqKey="Tanaka T">T Tanaka</name>
</author>
<author><name sortKey="Wu, J" uniqKey="Wu J">J Wu</name>
</author>
<author><name sortKey="Zhou, S" uniqKey="Zhou S">S Zhou</name>
</author>
<author><name sortKey="Childs, Kl" uniqKey="Childs K">KL Childs</name>
</author>
<author><name sortKey="Davidson, Rm" uniqKey="Davidson R">RM Davidson</name>
</author>
<author><name sortKey="Lin, H" uniqKey="Lin H">H Lin</name>
</author>
<author><name sortKey="Quesada Ocampo, L" uniqKey="Quesada Ocampo L">L Quesada-Ocampo</name>
</author>
<author><name sortKey="Vaillancourt, B" uniqKey="Vaillancourt B">B Vaillancourt</name>
</author>
<author><name sortKey="Sakai, H" uniqKey="Sakai H">H Sakai</name>
</author>
<author><name sortKey="Lee, Ss" uniqKey="Lee S">SS Lee</name>
</author>
<author><name sortKey="Kim, J" uniqKey="Kim J">J Kim</name>
</author>
<author><name sortKey="Numa, H" uniqKey="Numa H">H Numa</name>
</author>
<author><name sortKey="Itoh, T" uniqKey="Itoh T">T Itoh</name>
</author>
<author><name sortKey="Buell, Cr" uniqKey="Buell C">CR Buell</name>
</author>
<author><name sortKey="Matsumoto, T" uniqKey="Matsumoto T">T Matsumoto</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Kent, Wj" uniqKey="Kent W">WJ Kent</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Krishnan, N" uniqKey="Krishnan N">N Krishnan</name>
</author>
<author><name sortKey="Pattnaik, S" uniqKey="Pattnaik S">S Pattnaik</name>
</author>
<author><name sortKey="Jain, P" uniqKey="Jain P">P Jain</name>
</author>
<author><name sortKey="Gaur, P" uniqKey="Gaur P">P Gaur</name>
</author>
<author><name sortKey="Choudhary, R" uniqKey="Choudhary R">R Choudhary</name>
</author>
<author><name sortKey="Vaidyanathan, S" uniqKey="Vaidyanathan S">S Vaidyanathan</name>
</author>
<author><name sortKey="Deepak, S" uniqKey="Deepak S">S Deepak</name>
</author>
<author><name sortKey="Hariharan, A" uniqKey="Hariharan A">A Hariharan</name>
</author>
<author><name sortKey="Krishna, Pg" uniqKey="Krishna P">PG Krishna</name>
</author>
<author><name sortKey="Nair, J" uniqKey="Nair J">J Nair</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Kurtz, S" uniqKey="Kurtz S">S Kurtz</name>
</author>
<author><name sortKey="Phillippy, A" uniqKey="Phillippy A">A Phillippy</name>
</author>
<author><name sortKey="Delcher, A" uniqKey="Delcher A">A Delcher</name>
</author>
<author><name sortKey="Smoot, M" uniqKey="Smoot M">M Smoot</name>
</author>
<author><name sortKey="Shumway, M" uniqKey="Shumway M">M Shumway</name>
</author>
<author><name sortKey="Antonescu, C" uniqKey="Antonescu C">C Antonescu</name>
</author>
<author><name sortKey="Salzberg, S" uniqKey="Salzberg S">S Salzberg</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Langfelder, P" uniqKey="Langfelder P">P Langfelder</name>
</author>
<author><name sortKey="Horvath, S" uniqKey="Horvath S">S Horvath</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Langmead, B" uniqKey="Langmead B">B Langmead</name>
</author>
<author><name sortKey="Salzberg, Sl" uniqKey="Salzberg S">SL Salzberg</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Lechner, M" uniqKey="Lechner M">M Lechner</name>
</author>
<author><name sortKey="Findeia, S" uniqKey="Findeia S">S FindeiÃŸ</name>
</author>
<author><name sortKey="Steiner, L" uniqKey="Steiner L">L Steiner</name>
</author>
<author><name sortKey="Marz, M" uniqKey="Marz M">M Marz</name>
</author>
<author><name sortKey="Stadler, Pf" uniqKey="Stadler P">PF Stadler</name>
</author>
<author><name sortKey="Prohaska, Sj" uniqKey="Prohaska S">SJ Prohaska</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Ley, Sv" uniqKey="Ley S">SV Ley</name>
</author>
<author><name sortKey="Denholm, Aa" uniqKey="Denholm A">AA Denholm</name>
</author>
<author><name sortKey="Wood, A" uniqKey="Wood A">A Wood</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Li, H" uniqKey="Li H">H Li</name>
</author>
<author><name sortKey="Handsaker, B" uniqKey="Handsaker B">B Handsaker</name>
</author>
<author><name sortKey="Wysoker, A" uniqKey="Wysoker A">A Wysoker</name>
</author>
<author><name sortKey="Fennell, T" uniqKey="Fennell T">T Fennell</name>
</author>
<author><name sortKey="Ruan, J" uniqKey="Ruan J">J Ruan</name>
</author>
<author><name sortKey="Homer, N" uniqKey="Homer N">N Homer</name>
</author>
<author><name sortKey="Marth, G" uniqKey="Marth G">G Marth</name>
</author>
<author><name sortKey="Abecasis, G" uniqKey="Abecasis G">G Abecasis</name>
</author>
<author><name sortKey="Durbin, R" uniqKey="Durbin R">R Durbin</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Li, W" uniqKey="Li W">W Li</name>
</author>
<author><name sortKey="Godzik, A" uniqKey="Godzik A">A Godzik</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Metzker, Ml" uniqKey="Metzker M">ML Metzker</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Ming, R" uniqKey="Ming R">R Ming</name>
</author>
<author><name sortKey="Vanburen, R" uniqKey="Vanburen R">R VanBuren</name>
</author>
<author><name sortKey="Liu, Y" uniqKey="Liu Y">Y Liu</name>
</author>
<author><name sortKey="Yang, M" uniqKey="Yang M">M Yang</name>
</author>
<author><name sortKey="Han, Y" uniqKey="Han Y">Y Han</name>
</author>
<author><name sortKey="Li, L T" uniqKey="Li L">L-T Li</name>
</author>
<author><name sortKey="Zhang, Q" uniqKey="Zhang Q">Q Zhang</name>
</author>
<author><name sortKey="Kim, M J" uniqKey="Kim M">M-J Kim</name>
</author>
<author><name sortKey="Schatz, Mc" uniqKey="Schatz M">MC Schatz</name>
</author>
<author><name sortKey="Campbell, M" uniqKey="Campbell M">M Campbell</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Narnoliya, Lk1" uniqKey="Narnoliya L">LK1 Narnoliya</name>
</author>
<author><name sortKey="Rajakani, R" uniqKey="Rajakani R">R Rajakani</name>
</author>
<author><name sortKey="Sangwan, Ns" uniqKey="Sangwan N">NS Sangwan</name>
</author>
<author><name sortKey="Gupta, V" uniqKey="Gupta V">V Gupta</name>
</author>
<author><name sortKey="Sangwan, Rs" uniqKey="Sangwan R">RS Sangwan</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Ohri, D" uniqKey="Ohri D">D Ohri</name>
</author>
<author><name sortKey="Bhargava, A" uniqKey="Bhargava A">A Bhargava</name>
</author>
<author><name sortKey="Chatterjee, A" uniqKey="Chatterjee A">A Chatterjee</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Parra, G" uniqKey="Parra G">G Parra</name>
</author>
<author><name sortKey="Bradnam, K" uniqKey="Bradnam K">K Bradnam</name>
</author>
<author><name sortKey="Korf, I" uniqKey="Korf I">I Korf</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Puri, Hs" uniqKey="Puri H">HS Puri</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Puri, Hs" uniqKey="Puri H">HS Puri</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Rajakani, R" uniqKey="Rajakani R">R Rajakani</name>
</author>
<author><name sortKey="Narnoliya, L" uniqKey="Narnoliya L">L Narnoliya</name>
</author>
<author><name sortKey="Sangwan, Ns" uniqKey="Sangwan N">NS Sangwan</name>
</author>
<author><name sortKey="Sangwan, Rs" uniqKey="Sangwan R">RS Sangwan</name>
</author>
<author><name sortKey="Gupta, V" uniqKey="Gupta V">V Gupta</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Ray, S" uniqKey="Ray S">S Ray</name>
</author>
<author><name sortKey="Satya, P" uniqKey="Satya P">P Satya</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Saxena, Rc" uniqKey="Saxena R">RC Saxena</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Schmid, R" uniqKey="Schmid R">R Schmid</name>
</author>
<author><name sortKey="Blaxter, Ml" uniqKey="Blaxter M">ML Blaxter</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Siddiqui, S" uniqKey="Siddiqui S">S Siddiqui</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Sidhu, O" uniqKey="Sidhu O">O Sidhu</name>
</author>
<author><name sortKey="Kumar, V" uniqKey="Kumar V">V Kumar</name>
</author>
<author><name sortKey="Behl, H" uniqKey="Behl H">H Behl</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Smit, A" uniqKey="Smit A">A Smit</name>
</author>
<author><name sortKey="Hubley, R" uniqKey="Hubley R">R Hubley</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Soderlund, C" uniqKey="Soderlund C">C Soderlund</name>
</author>
<author><name sortKey="Bomhoff, M" uniqKey="Bomhoff M">M Bomhoff</name>
</author>
<author><name sortKey="Nelson, Wm" uniqKey="Nelson W">WM Nelson</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Stanke, M" uniqKey="Stanke M">M Stanke</name>
</author>
<author><name sortKey="Keller, O" uniqKey="Keller O">O Keller</name>
</author>
<author><name sortKey="Gunduz, I" uniqKey="Gunduz I">I Gunduz</name>
</author>
<author><name sortKey="Hayes, A" uniqKey="Hayes A">A Hayes</name>
</author>
<author><name sortKey="Waack, S" uniqKey="Waack S">S Waack</name>
</author>
<author><name sortKey="Morgenstern, B" uniqKey="Morgenstern B">B Morgenstern</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Tan, Q G" uniqKey="Tan Q">Q-G Tan</name>
</author>
<author><name sortKey="Luo, X D" uniqKey="Luo X">X-D Luo</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Varshney, Rk" uniqKey="Varshney R">RK Varshney</name>
</author>
<author><name sortKey="Chen, W" uniqKey="Chen W">W Chen</name>
</author>
<author><name sortKey="Li, Y" uniqKey="Li Y">Y Li</name>
</author>
<author><name sortKey="Bharti, Ak" uniqKey="Bharti A">AK Bharti</name>
</author>
<author><name sortKey="Saxena, Rk" uniqKey="Saxena R">RK Saxena</name>
</author>
<author><name sortKey="Schlueter, Ja" uniqKey="Schlueter J">JA Schlueter</name>
</author>
<author><name sortKey="Donoghue, Mt" uniqKey="Donoghue M">MT Donoghue</name>
</author>
<author><name sortKey="Azam, S" uniqKey="Azam S">S Azam</name>
</author>
<author><name sortKey="Fan, G" uniqKey="Fan G">G Fan</name>
</author>
<author><name sortKey="Whaley, Am" uniqKey="Whaley A">AM Whaley</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Veitch, Ge" uniqKey="Veitch G">GE Veitch</name>
</author>
<author><name sortKey="Boyer, A" uniqKey="Boyer A">A Boyer</name>
</author>
<author><name sortKey="Ley, Sv" uniqKey="Ley S">SV Ley</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Warnes, Gr" uniqKey="Warnes G">GR Warnes</name>
</author>
<author><name sortKey="Bolker, B" uniqKey="Bolker B">B Bolker</name>
</author>
<author><name sortKey="Lumley, T" uniqKey="Lumley T">T Lumley</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Wickham, H" uniqKey="Wickham H">H Wickham</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Wu, Ga" uniqKey="Wu G">GA Wu</name>
</author>
<author><name sortKey="Prochnik, S" uniqKey="Prochnik S">S Prochnik</name>
</author>
<author><name sortKey="Jenkins, J" uniqKey="Jenkins J">J Jenkins</name>
</author>
<author><name sortKey="Salse, J" uniqKey="Salse J">J Salse</name>
</author>
<author><name sortKey="Hellsten, U" uniqKey="Hellsten U">U Hellsten</name>
</author>
<author><name sortKey="Murat, F" uniqKey="Murat F">F Murat</name>
</author>
<author><name sortKey="Perrier, X" uniqKey="Perrier X">X Perrier</name>
</author>
<author><name sortKey="Ruiz, M" uniqKey="Ruiz M">M Ruiz</name>
</author>
<author><name sortKey="Scalabrin, S" uniqKey="Scalabrin S">S Scalabrin</name>
</author>
<author><name sortKey="Terol, J" uniqKey="Terol J">J Terol</name>
</author>
<author><name sortKey="Takita, Ma" uniqKey="Takita M">MA Takita</name>
</author>
<author><name sortKey="Labadie, K" uniqKey="Labadie K">K Labadie</name>
</author>
<author><name sortKey="Poulain, J" uniqKey="Poulain J">J Poulain</name>
</author>
<author><name sortKey="Couloux, A" uniqKey="Couloux A">A Couloux</name>
</author>
<author><name sortKey="Jabbari, K" uniqKey="Jabbari K">K Jabbari</name>
</author>
<author><name sortKey="Cattonaro, F" uniqKey="Cattonaro F">F Cattonaro</name>
</author>
<author><name sortKey="Del Fabbro, C" uniqKey="Del Fabbro C">C Del Fabbro</name>
</author>
<author><name sortKey="Pinosio, S" uniqKey="Pinosio S">S Pinosio</name>
</author>
<author><name sortKey="Zuccolo, A" uniqKey="Zuccolo A">A Zuccolo</name>
</author>
<author><name sortKey="Chapman, J" uniqKey="Chapman J">J Chapman</name>
</author>
<author><name sortKey="Grimwood, J" uniqKey="Grimwood J">J Grimwood</name>
</author>
<author><name sortKey="Tadeo, Fr" uniqKey="Tadeo F">FR Tadeo</name>
</author>
<author><name sortKey="Estornell, Lh" uniqKey="Estornell L">LH Estornell</name>
</author>
<author><name sortKey="Mu Oz Sanz, J" uniqKey="Mu Oz Sanz J">J Muñoz-Sanz</name>
</author>
<author><name sortKey="Ibanez, V" uniqKey="Ibanez V">V Ibanez</name>
</author>
<author><name sortKey="Herrero Ortega, A" uniqKey="Herrero Ortega A">A Herrero-Ortega</name>
</author>
<author><name sortKey="Aleza, P" uniqKey="Aleza P">P Aleza</name>
</author>
<author><name sortKey="Perez Perez, J" uniqKey="Perez Perez J">J Pérez-Pérez</name>
</author>
<author><name sortKey="Ram N, D" uniqKey="Ram N D">D Ramón</name>
</author>
<author><name sortKey="Brunel, D" uniqKey="Brunel D">D Brunel</name>
</author>
<author><name sortKey="Luro, F" uniqKey="Luro F">F Luro</name>
</author>
<author><name sortKey="Chen, C" uniqKey="Chen C">C Chen</name>
</author>
<author><name sortKey="Farmerie, Wg" uniqKey="Farmerie W">WG Farmerie</name>
</author>
<author><name sortKey="Desany, B" uniqKey="Desany B">B Desany</name>
</author>
<author><name sortKey="Kodira, C" uniqKey="Kodira C">C Kodira</name>
</author>
<author><name sortKey="Mohiuddin, M" uniqKey="Mohiuddin M">M Mohiuddin</name>
</author>
<author><name sortKey="Harkins, T" uniqKey="Harkins T">T Harkins</name>
</author>
<author><name sortKey="Fredrikson, K" uniqKey="Fredrikson K">K Fredrikson</name>
</author>
<author><name sortKey="Burns, P" uniqKey="Burns P">P Burns</name>
</author>
<author><name sortKey="Lomsadze, A" uniqKey="Lomsadze A">A Lomsadze</name>
</author>
<author><name sortKey="Borodovsky, M" uniqKey="Borodovsky M">M Borodovsky</name>
</author>
<author><name sortKey="Reforgiato, G" uniqKey="Reforgiato G">G Reforgiato</name>
</author>
<author><name sortKey="Freitas Astua, J" uniqKey="Freitas Astua J">J Freitas-Astúa</name>
</author>
<author><name sortKey="Quetier, F" uniqKey="Quetier F">F Quetier</name>
</author>
<author><name sortKey="Navarro, L" uniqKey="Navarro L">L Navarro</name>
</author>
<author><name sortKey="Roose, M" uniqKey="Roose M">M Roose</name>
</author>
<author><name sortKey="Wincker, P" uniqKey="Wincker P">P Wincker</name>
</author>
<author><name sortKey="Schmutz, J" uniqKey="Schmutz J">J Schmutz</name>
</author>
<author><name sortKey="Morgante, M" uniqKey="Morgante M">M Morgante</name>
</author>
<author><name sortKey="Machado, Ma" uniqKey="Machado M">MA Machado</name>
</author>
<author><name sortKey="Talon, M" uniqKey="Talon M">M Talon</name>
</author>
<author><name sortKey="Jaillon, O" uniqKey="Jaillon O">O Jaillon</name>
</author>
<author><name sortKey="Ollitrault, P" uniqKey="Ollitrault P">P Ollitrault</name>
</author>
<author><name sortKey="Gmitter, F" uniqKey="Gmitter F">F Gmitter</name>
</author>
<author><name sortKey="Rokhsar, D" uniqKey="Rokhsar D">D Rokhsar</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Wyman, Sk" uniqKey="Wyman S">SK Wyman</name>
</author>
<author><name sortKey="Jansen, Rk" uniqKey="Jansen R">RK Jansen</name>
</author>
<author><name sortKey="Boore, Jl" uniqKey="Boore J">JL Boore</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Xu, Q" uniqKey="Xu Q">Q Xu</name>
</author>
<author><name sortKey="Chen, L L" uniqKey="Chen L">L-L Chen</name>
</author>
<author><name sortKey="Ruan, X" uniqKey="Ruan X">X Ruan</name>
</author>
<author><name sortKey="Chen, D" uniqKey="Chen D">D Chen</name>
</author>
<author><name sortKey="Zhu, A" uniqKey="Zhu A">A Zhu</name>
</author>
<author><name sortKey="Chen, C" uniqKey="Chen C">C Chen</name>
</author>
<author><name sortKey="Bertrand, D" uniqKey="Bertrand D">D Bertrand</name>
</author>
<author><name sortKey="Jiao, W B" uniqKey="Jiao W">W-B Jiao</name>
</author>
<author><name sortKey="Hao, B H" uniqKey="Hao B">B-H Hao</name>
</author>
<author><name sortKey="Lyon, Mp" uniqKey="Lyon M">MP Lyon</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Zerbino, Dr" uniqKey="Zerbino D">DR Zerbino</name>
</author>
<author><name sortKey="Birney, E" uniqKey="Birney E">E Birney</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Zhang, X" uniqKey="Zhang X">X Zhang</name>
</author>
<author><name sortKey="Wessler, Sr" uniqKey="Wessler S">SR Wessler</name>
</author>
</analytic>
</biblStruct>
</listBibl>
</div1>
</back>
</TEI>
<pmc article-type="research-article"><pmc-dir>properties open_access</pmc-dir>
  <front><journal-meta><journal-id journal-id-type="nlm-ta">PeerJ</journal-id>
<journal-id journal-id-type="iso-abbrev">PeerJ</journal-id>
<journal-id journal-id-type="pmc">PeerJ</journal-id>
<journal-id journal-id-type="publisher-id">PeerJ</journal-id>
<journal-title-group><journal-title>PeerJ</journal-title>
</journal-title-group>
<issn pub-type="epub">2167-8359</issn>
<publisher><publisher-name>PeerJ Inc.</publisher-name>
<publisher-loc>San Francisco, USA</publisher-loc>
</publisher>
</journal-meta>
<article-meta><article-id pub-id-type="pmid">26290780</article-id>
<article-id pub-id-type="pmc">4540028</article-id>
<article-id pub-id-type="publisher-id">1066</article-id>
<article-id pub-id-type="doi">10.7717/peerj.1066</article-id>
<article-categories><subj-group subj-group-type="heading"><subject>Genomics</subject>
</subj-group>
</article-categories>
<title-group><article-title>Comprehensive analyses of genomes, transcriptomes and metabolites of neem tree</article-title>
</title-group>
<contrib-group><contrib id="author-1" contrib-type="author" equal-contrib="yes"><name><surname>Kuravadi</surname>
<given-names>Nagesh A.</given-names>
</name>
<xref ref-type="aff" rid="aff-1">1</xref>
</contrib>
<contrib id="author-2" contrib-type="author" equal-contrib="yes"><name><surname>Yenagi</surname>
<given-names>Vijay</given-names>
</name>
<xref ref-type="aff" rid="aff-1">1</xref>
</contrib>
<contrib id="author-3" contrib-type="author"><name><surname>Rangiah</surname>
<given-names>Kannan</given-names>
</name>
<xref ref-type="aff" rid="aff-2">2</xref>
</contrib>
<contrib id="author-4" contrib-type="author"><name><surname>Mahesh</surname>
<given-names>HB</given-names>
</name>
<xref ref-type="aff" rid="aff-1">1</xref>
<xref ref-type="aff" rid="aff-3">3</xref>
</contrib>
<contrib id="author-5" contrib-type="author"><name><surname>Rajamani</surname>
<given-names>Anantharamanan</given-names>
</name>
<xref ref-type="aff" rid="aff-1">1</xref>
</contrib>
<contrib id="author-6" contrib-type="author"><name><surname>Shirke</surname>
<given-names>Meghana D.</given-names>
</name>
<xref ref-type="aff" rid="aff-1">1</xref>
</contrib>
<contrib id="author-7" contrib-type="author"><name><surname>Russiachand</surname>
<given-names>Heikham</given-names>
</name>
<xref ref-type="aff" rid="aff-1">1</xref>
</contrib>
<contrib id="author-8" contrib-type="author"><name><surname>Loganathan</surname>
<given-names>Ramya Malarini</given-names>
</name>
<xref ref-type="aff" rid="aff-1">1</xref>
</contrib>
<contrib id="author-9" contrib-type="author"><name><surname>Shankara Lingu</surname>
<given-names>Chandana</given-names>
</name>
<xref ref-type="aff" rid="aff-1">1</xref>
</contrib>
<contrib id="author-10" contrib-type="author"><name><surname>Siddappa</surname>
<given-names>Shilpa</given-names>
</name>
<xref ref-type="aff" rid="aff-1">1</xref>
</contrib>
<contrib id="author-11" contrib-type="author"><name><surname>Ramamurthy</surname>
<given-names>Aishwarya</given-names>
</name>
<xref ref-type="aff" rid="aff-1">1</xref>
</contrib>
<contrib id="author-12" contrib-type="author"><name><surname>Sathyanarayana</surname>
<given-names>BN</given-names>
</name>
<xref ref-type="aff" rid="aff-4">4</xref>
</contrib>
<contrib id="author-13" contrib-type="author" corresp="yes"><name><surname>Gowda</surname>
<given-names>Malali</given-names>
</name>
<xref ref-type="aff" rid="aff-1">1</xref>
<email>malalig@ncbs.res.in</email>
<email>malalig@ccamp.res.in</email>
</contrib>
<aff id="aff-1"><label>1</label>
<institution>Genomics Laboratory, Centre for Cellular and Molecular Platforms, National Centre for Biological Sciences</institution>
,<addr-line>Bangalore, Karnataka</addr-line>
,<country>India</country>
</aff>
<aff id="aff-2"><label>2</label>
<institution>Metabolomics Facility, Centre for Cellular and Molecular Platforms, National Centre for Biological Sciences</institution>
,<addr-line>Bangalore, Karnataka</addr-line>
,<country>India</country>
</aff>
<aff id="aff-3"><label>3</label>
<institution>Marker Assisted Selection Laboratory, Department of Genetics and Plant Breeding, University of Agricultural Sciences, GKVK</institution>
,<addr-line>Bangalore, Karnataka</addr-line>
,<country>India</country>
</aff>
<aff id="aff-4"><label>4</label>
<institution>Plant Tissue Culture Laboratory, University of Agricultural Sciences, GKVK</institution>
,<addr-line>Bangalore, Karnataka</addr-line>
,<country>India</country>
</aff>
</contrib-group>
<contrib-group><contrib id="editor-1" contrib-type="editor"><name><surname>Kumar</surname>
<given-names>Abhishek</given-names>
</name>
</contrib>
</contrib-group>
<pub-date pub-type="epub" date-type="pub" iso-8601-date="2015-08-06"><day>6</day>
<month>8</month>
<year iso-8601-date="2015">2015</year>
</pub-date>
<pub-date pub-type="collection"><year>2015</year>
</pub-date>
<volume>3</volume>
<elocation-id>e1066</elocation-id>
<history><date date-type="received" iso-8601-date="2015-01-25"><day>25</day>
<month>1</month>
<year iso-8601-date="2015">2015</year>
</date>
<date date-type="accepted" iso-8601-date="2015-06-09"><day>9</day>
<month>6</month>
<year iso-8601-date="2015">2015</year>
</date>
</history>
<permissions><copyright-statement>© 2015 Kuravadi et al.</copyright-statement>
<copyright-year>2015</copyright-year>
<copyright-holder>Kuravadi et al.</copyright-holder>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/4.0/"><license-p>This is an open access article distributed under the terms of the <ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/licenses/by/4.0/">Creative Commons Attribution License</ext-link>
, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.</license-p>
</license>
</permissions>
<self-uri xlink:href="https://peerj.com/articles/1066"></self-uri>
<abstract><p>Neem (<italic>Azadirachta indica</italic>
 A. Juss) is one of the most versatile tropical evergreen tree species known in India since the Vedic period (1500 BC–600 BC). Neem tree is a rich source of limonoids, having a wide spectrum of activity against insect pests and microbial pathogens. Complex tetranortriterpenoids such as azadirachtin, salanin and nimbin are the major active principles isolated from neem seed. Absolutely nothing is known about the biochemical pathways of these metabolites in neem tree. To identify genes and pathways in neem, we sequenced neem genomes and transcriptomes using next generation sequencing technologies. Assembly of Illumina and 454 sequencing reads resulted in 267 Mb, which accounts for 70% of estimated size of neem genome. We predicted 44,495 genes in the neem genome, of which 32,278 genes were expressed in neem tissues. Neem genome consists about 32.5% (87 Mb) of repetitive DNA elements. Neem tree is phylogenetically related to citrus, <italic>Citrus sinensis</italic>
. Comparative analysis anchored 62% (161 Mb) of assembled neem genomic contigs onto citrus chromomes. Ultrahigh performance liquid chromatography-mass spectrometry-selected reaction monitoring (UHPLC-MS/SRM) method was used to quantify azadirachtin, nimbin, and salanin from neem tissues. Weighted Correlation Network Analysis (WCGNA) of expressed genes and metabolites resulted in identification of possible candidate genes involved in azadirachtin biosynthesis pathway. This study provides genomic, transcriptomic and quantity of top three neem metabolites resource, which will accelerate basic research in neem to understand biochemical pathways.</p>
</abstract>
<kwd-group kwd-group-type="author"><kwd>Next generation sequencing</kwd>
<kwd>Annotation</kwd>
<kwd><italic>A. indica</italic>
</kwd>
<kwd>Azadirachtin</kwd>
</kwd-group>
<funding-group><award-group id="fund-1"><funding-source>Department of Biotechnology, Government of India (Ramalingaswami Fellowship Grant)</funding-source>
<award-id>BT/HRD/35/02/2006</award-id>
</award-group>
<funding-statement>This work was supported by Department of Biotechnology, Government of India to Malali Gowda (Ramalingaswami Fellowship Grant; BT/HRD/35/02/2006). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.</funding-statement>
</funding-group>
</article-meta>
</front>
<body><sec sec-type="intro"><title>Introduction</title>
<p>Neem, <italic>Azadirachta indica</italic>
 is an evergreen tree, native to the Indian subcontinent. It belongs to Meliaceae family plants, which are the major source for diverse limonoids (<xref rid="ref-50" ref-type="bibr">Tan & Luo, 2011</xref>
). Neem has been used in Ayurveda, Siddha, Unani and other Indian local health traditions. Over 700 herbal preparations and over 160 local practices containing neem ingredients are known in India, which claim to prevent various ailments or disorders in humans (<xref rid="ref-6" ref-type="bibr">Brahmachari, 2004</xref>
). Neem based pesticidal formulations are widely regarded as organic, and are found to have low toxicity against non-target beneficial organisms as compared to synthetic pesticides. Nimbin was the first chemical limonoid isolated from neem tree (<xref rid="ref-45" ref-type="bibr">Siddiqui, 1942</xref>
). Subsequently, more than 150 bioactive chemical compounds have been isolated from various neem tissues (<xref rid="ref-6" ref-type="bibr">Brahmachari, 2004</xref>
).</p>
<p>Azadirachtin is the major tetranortriterpenoid in neem seeds (<xref rid="ref-9" ref-type="bibr">Butterworth & Morgan, 1968</xref>
) and its molecular structure elucidation took more than 20 years of research (<xref rid="ref-7" ref-type="bibr">Broughton et al., 1986</xref>
). Azadirachtin is one of the highly successful biopesticides in the world, which is isolated from neem seeds and is non-persistent in the environment. Its content is highly variable in trees in various locations due to genetic variability and environmental factors (<xref rid="ref-46" ref-type="bibr">Sidhu, Kumar & Behl, 2003</xref>
). Chemical <italic>in-vitro</italic>
 synthesis of azadirachtin has been tried in the laboratory. However, long synthesis process and molecular complexity was realized and that chemical synthesis of azadirachtin is not a viable method for commercial production (<xref rid="ref-52" ref-type="bibr">Veitch, Boyer & Ley, 2008</xref>
). Hence, there is an increased interest to understand the <italic>in-vivo</italic>
 biosynthesis of azadirachtin pathway in neem. Genes and proteins involved in biochemical azadirachtin pathways have not been researched in neem. Recently, a few studies were attempted to generate genomic resources for neem tree (<xref rid="ref-41" ref-type="bibr">Rajakani et al., 2014</xref>
; <xref rid="ref-36" ref-type="bibr">Narnoliya et al., 2014</xref>
; <xref rid="ref-26" ref-type="bibr">Krishnan et al., 2012</xref>
). However, they generated limited ESTs (<xref rid="ref-41" ref-type="bibr">Rajakani et al., 2014</xref>
; <xref rid="ref-36" ref-type="bibr">Narnoliya et al., 2014</xref>
) and non availability of neem genome annotations (<xref rid="ref-26" ref-type="bibr">Krishnan et al., 2012</xref>
). In this study, we have sequenced three neem tree genomes and transcriptomes using next generation sequencing technologies. In addition, we quantified neem transcripts and metabolites using RNAseq and UHPLC-MS/SRM methods, respectively. This work will accelerate research to dissect biochemical limonoid pathways in neem.</p>
</sec>
<sec sec-type="materials|methods"><title>Materials and Methods</title>
<sec><title>Neem genotypes</title>
<p>Individual neem tree was identified from three varied geographical regions of southern India including Karnataka (GKVK, Bangalore, India (abbreviated as Genotype 1), Anuganalu, Hassan (Genotype 2)) and Tamil Nadu (Erode; Genotype 3) for this study (<xref ref-type="supplementary-material" rid="supp-21">Fig. S1</xref>
).</p>
</sec>
<sec><title>Isolation of genomic DNA and total RNA from neem tissues</title>
<p>Mature neem leaves were collected from neem trees for DNA isolation. CTAB method (6% CTAB, 1.4 M NaCl, 20 mM EDTA, 10 mM Tris Base pH8) was used for isolation of neem genomic DNA (<xref rid="ref-15" ref-type="bibr">Doyle, 1990</xref>
), with few modifications. Neem DNA isolation is a difficult process due to the presence of highly oxidized complex compounds in neem leaves. Instead of precipitating the DNA using iso-propanol as in CTAB method, we used the supernatant to purify genomic DNA using the Sigma Genelute plant DNA isolation kit (G2N70; Sigma, Seelze, Germany). RNA isolation was carried out using 100 mg of the neem tissues using Sigma Spectrum Plant Total RNA Kit (STRN50; Sigma, Seelze, Germany). DNase treatment was done using DNase digestion kit (DNASE 70; Sigma, Seelze, Germany). DNA free RNA was dissolved in DEPC water. The RNA integrity (RIN) was determined using Agilent Bioanalyzer.</p>
</sec>
<sec><title>Induction of callus tissue</title>
<p>Neem endosperm explants, of 4–5 mm length pieces were cultured on Murashige and Skoog solid medium (pH 5.6) containing 0.45 mg BAP, 0.80 mg NAA and 150 mg Casein hydrolysate. Cultures were maintained in growth chamber at 25 ± 2 °C with 55 ± 5% of relative humidity, for 16 h photoperiod and allowed for formation of callus tissue. After 2 weeks, mature callus was sub-cultured to fresh media every fourth or sixth weeks.</p>
</sec>
<sec><title>NGS library construction and sequencing</title>
<p>Whole genome shotgun DNA library was prepared using Illumina TrueSeq DNA sample preparation kit (FC-121-2001). The paired-end (PE) (2 × 100 nts) sequencing was carried out using Illumina HiSeq 1000 at the Next Generation Genomics Facility at Centre for Cellular and Molecular Platforms (C-CAMP). We prepared whole genome shotgun 454 library for Genotype 1 (GKVK, Bangalore, India) using the rapid library preparation kit from Roche (Cat. No. 05608228001y; version 4.0.12). The 454 sequencing was carried out using GS FLX+ chemistry as per Roche/454 manual instructions (<uri xlink:href="http://454.com">http://454.com</uri>
).</p>
</sec>
<sec><title>Transcriptome library preparation and sequencing</title>
<p>Total RNA isolation was carried out for 100 mg of the neem tissues using the Sigma Spectrum Plant Total RNA Kit (STRN50; Sigma, Seelze, Germany). The RNA-seq library was prepared using 1 µg of total RNA according to Illumina’s TruSeq RNA sample preparation kit (RS-122-2001).</p>
</sec>
<sec><title><italic>De novo</italic>
 whole genome assembly</title>
<p>Illumina PE reads were pre-processed using FASTX-Toolkit (v 0.0.13). The read quality score cut-off (<italic>q</italic>
) and percentage (<italic>p</italic>
) value was assigned as 20 and 100, respectively (i.e., <italic>q</italic>
/20 and <italic>p</italic>
/100). After quality filtering, we obtained a total 75–192 millions of paired-end reads and 61–71 millions of singleton reads (<xref ref-type="supplementary-material" rid="supp-1">Table S1</xref>
). <italic>De novo</italic>
 assembly of neem genome was performed using Velvet program (<xref rid="ref-58" ref-type="bibr">Zerbino & Birney, 2008</xref>
). We optimized the Velvet assembly through iteratively process for various k-mers (27 to 67 nts). The Velvet assemblies with best k-mers in the size of 45, 45 and 33 were used for Genotypes 1, 2 and 3, respectively. The parameter used for deciding the best k-mer for theoretical coverage were N50, maximum contigs length, totals contigs, assembled genome size and total number of reads used (<xref ref-type="supplementary-material" rid="supp-1">Table S1</xref>
).</p>
<table-wrap id="table-1" orientation="portrait" position="float"><object-id pub-id-type="doi">10.7717/peerj.1066/table-1</object-id>
<label>Table 1</label>
<caption><title>Neem genome assembly statistics.</title>
</caption>
<alternatives><graphic xlink:href="peerj-03-1066-g008"></graphic>
<table frame="hsides" rules="groups"><colgroup span="1"><col span="1"></col>
<col span="1"></col>
<col span="1"></col>
<col span="1"></col>
</colgroup>
<thead><tr><th rowspan="1" colspan="1">Assembly parameters</th>
<th rowspan="1" colspan="1">Velvet assembly from Illumina reads</th>
<th rowspan="1" colspan="1">MIRA assembly from 454 reads</th>
<th rowspan="1" colspan="1">Hybrid assembly of Genotype 1</th>
</tr>
</thead>
<tbody><tr><td rowspan="1" colspan="1">Total high quality reads</td>
<td rowspan="1" colspan="1">168,895,379</td>
<td rowspan="1" colspan="1">2,762,254</td>
<td rowspan="1" colspan="1">–</td>
</tr>
<tr><td rowspan="1" colspan="1">k-mer</td>
<td rowspan="1" colspan="1">45</td>
<td rowspan="1" colspan="1">–</td>
<td rowspan="1" colspan="1">–</td>
</tr>
<tr><td rowspan="1" colspan="1">Assembled genome size (Mb)</td>
<td rowspan="1" colspan="1">216</td>
<td rowspan="1" colspan="1">157</td>
<td rowspan="1" colspan="1">268</td>
</tr>
<tr><td rowspan="1" colspan="1">Total number of contigs</td>
<td rowspan="1" colspan="1">94,780</td>
<td rowspan="1" colspan="1">1,21,184</td>
<td rowspan="1" colspan="1">68,604</td>
</tr>
<tr><td rowspan="1" colspan="1">N50 (bp)</td>
<td rowspan="1" colspan="1">22,263</td>
<td rowspan="1" colspan="1">1,463</td>
<td rowspan="1" colspan="1">15,948</td>
</tr>
<tr><td rowspan="1" colspan="1">Maximum contig length (bp)</td>
<td rowspan="1" colspan="1">2,41,126</td>
<td rowspan="1" colspan="1">43,859</td>
<td rowspan="1" colspan="1">241,170</td>
</tr>
<tr><td rowspan="1" colspan="1">Mininum contig length (bp)</td>
<td rowspan="1" colspan="1">89</td>
<td rowspan="1" colspan="1">52</td>
<td rowspan="1" colspan="1">89</td>
</tr>
<tr><td rowspan="1" colspan="1">% of bases in contigs ≥ 1,000 bp</td>
<td rowspan="1" colspan="1">93.31</td>
<td rowspan="1" colspan="1">74.54</td>
<td rowspan="1" colspan="1">94.65</td>
</tr>
<tr><td rowspan="1" colspan="1">Total repeat size in Mb (%)</td>
<td rowspan="1" colspan="1">59.81 (27.41)</td>
<td rowspan="1" colspan="1">48.84 (31.05)</td>
<td rowspan="1" colspan="1">86.90 (32.44)</td>
</tr>
<tr><td rowspan="1" colspan="1">Number of predicted genes in Augustus</td>
<td rowspan="1" colspan="1">27,556</td>
<td rowspan="1" colspan="1">41,169</td>
<td rowspan="1" colspan="1">40,130</td>
</tr>
<tr><td rowspan="1" colspan="1">Number of predicted genes in GeneScan</td>
<td rowspan="1" colspan="1">35,501</td>
<td rowspan="1" colspan="1">57,356</td>
<td rowspan="1" colspan="1">52,617</td>
</tr>
<tr><td rowspan="1" colspan="1">Number of Genes clustered from GeneScan and Augustus</td>
<td rowspan="1" colspan="1">37,161</td>
<td rowspan="1" colspan="1">61,901</td>
<td rowspan="1" colspan="1">48,032</td>
</tr>
<tr><td rowspan="1" colspan="1">No of genes with >100 bp</td>
<td rowspan="1" colspan="1">34,992</td>
<td rowspan="1" colspan="1">52,957</td>
<td rowspan="1" colspan="1">44,495</td>
</tr>
<tr><td rowspan="1" colspan="1">Genes with RNA seq evidence</td>
<td rowspan="1" colspan="1">27,087</td>
<td rowspan="1" colspan="1">43,383</td>
<td rowspan="1" colspan="1">32,278</td>
</tr>
<tr><td rowspan="1" colspan="1">Non-TE genes</td>
<td rowspan="1" colspan="1">19,547</td>
<td rowspan="1" colspan="1">41,373</td>
<td rowspan="1" colspan="1">29,050</td>
</tr>
</tbody>
</table>
</alternatives>
</table-wrap>
<p>A total of 454 reads were assembled using MIRA software (<xref rid="ref-10" ref-type="bibr">Chevreux, 2005</xref>
). The first step of the assembly is to compare every read with every other read (and its reversed complement) to detect potential overlaps. These potential overlaps were examined with Smith–Waterman-based algorithm for local alignment of overlaps. If overlaps were found, then they were verified using Smith-Waterman methods and are assembled into contigs. The software used default parameters (minimum read length of 40 nts and minimum base quality of q10) with single end read format.</p>
</sec>
<sec><title>Hybrid neem genome assembly generation using Illumina and 454 contigs</title>
<p>Hybrid assembly of neem genome was carried out by merging contigs from Illumina and 454 sequence reads. The Genotype 1 (GKVK, Bangalore, India) assembled contigs, obtained from Velvet (<xref rid="ref-58" ref-type="bibr">Zerbino & Birney, 2008</xref>
) and MIRA (<xref rid="ref-10" ref-type="bibr">Chevreux, 2005</xref>
) assemblers were merged using clustering program, CD-HIT-est by keeping minimum similarity cut-off of 90% (<xref rid="ref-33" ref-type="bibr">Li & Godzik, 2006</xref>
). This clustering approach allowed both Illumina and 454 contigs to merge and build longer contigs of the genome. A total of 94,780 Illumina contigs were clustered with a total of 121,184,454 contigs, resulting in a total of 68,604 unique contig sequences (<xref ref-type="table" rid="table-1">Table 1</xref>
).</p>
</sec>
<sec><title>Eukaryotic core gene mapping</title>
<p>The completeness of the assembled genome was checked by using eukaryotic core gene-mapping approach (CEGMA) (<xref rid="ref-38" ref-type="bibr">Parra, Bradnam & Korf, 2007</xref>
). CEGMA used 248 core eukaryotic genes that are highly conserved and single-copy genes in eukaryotic genomes.</p>
</sec>
<sec><title>Nuclear gene prediction and annotation</title>
<p>Genes were predicted from hybrid genome assembly using Augustus (<xref rid="ref-49" ref-type="bibr">Stanke et al., 2006</xref>
) and GenScan (<xref rid="ref-8" ref-type="bibr">Burge & Karlin, 1997</xref>
) (<xref ref-type="supplementary-material" rid="supp-2">Table S2</xref>
). Then we clustered genes with similarity cut-off 90% using CD-HIT-est program (<xref rid="ref-33" ref-type="bibr">Li & Godzik, 2006</xref>
). This method gave the overall representation of genes, by merging similar genes predicted by both the programs. Then we discarded genes that were less than 100 bp in size (<xref rid="ref-13" ref-type="bibr">Delcher et al., 1999</xref>
). Then BLAT was used to obtain the number of unique and common genes from the CD-HIT out file (<xref rid="ref-25" ref-type="bibr">Kent, 2002</xref>
). Genes originated from repeat elements were identified using Repeat Modeler program (<uri xlink:href="http://www.repeatmasker.org/RepeatModeler.html">http://www.repeatmasker.org/RepeatModeler.html</uri>
). Gene expression was quantified by mapping RNA-Seq reads (<xref ref-type="table" rid="table-2">Table 2</xref>
). We used all the genes with and without RNA-seq evidence to search gene functions using UniProt database, Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG) and Enzyme Commission number (EC). Schematic representations of GO classes in neem genome are summarized in <xref ref-type="supplementary-material" rid="supp-22">Fig. S2</xref>
. Above analyses was done using BLASTX (<xref rid="ref-3" ref-type="bibr">Altschul et al., 1990</xref>
) in annot8r (<xref rid="ref-44" ref-type="bibr">Schmid & Blaxter, 2008</xref>
) with E-value cut-off of 10<sup>−3</sup>
. Genes with multiple hits were filtered based on E-value; the annotation details for each gene are listed in the <xref ref-type="supplementary-material" rid="supp-3">Table S3</xref>
.</p>
<table-wrap id="table-2" orientation="portrait" position="float"><object-id pub-id-type="doi">10.7717/peerj.1066/table-2</object-id>
<label>Table 2</label>
<caption><title>RNAseq analysis from various explants of neem.</title>
</caption>
<alternatives><graphic xlink:href="peerj-03-1066-g009"></graphic>
<table frame="hsides" rules="groups"><colgroup span="1"><col span="1"></col>
<col span="1"></col>
<col span="1"></col>
<col span="1"></col>
<col span="1"></col>
</colgroup>
<thead><tr><th rowspan="1" colspan="1">Tissue</th>
<th rowspan="1" colspan="1">No. of reads<xref ref-type="fn" rid="table-2fn1"><sup>a</sup>
</xref>
</th>
<th rowspan="1" colspan="1">No of genes/RPKM > 1</th>
<th rowspan="1" colspan="1">No of genes/RPKM > 5</th>
<th rowspan="1" colspan="1">No of genes/RPKM > 10</th>
</tr>
</thead>
<tbody><tr><td rowspan="1" colspan="1">Mature leaf</td>
<td rowspan="1" colspan="1">5,401,910</td>
<td rowspan="1" colspan="1">19,308</td>
<td rowspan="1" colspan="1">14,807</td>
<td rowspan="1" colspan="1">11,763</td>
</tr>
<tr><td rowspan="1" colspan="1">Flower and bud</td>
<td rowspan="1" colspan="1">22,654,982</td>
<td rowspan="1" colspan="1">21,927</td>
<td rowspan="1" colspan="1">16,716</td>
<td rowspan="1" colspan="1">13,632</td>
</tr>
<tr><td rowspan="1" colspan="1">Fruit coat and pulp</td>
<td rowspan="1" colspan="1">55,627,021</td>
<td rowspan="1" colspan="1">21,537</td>
<td rowspan="1" colspan="1">16,693</td>
<td rowspan="1" colspan="1">13,888</td>
</tr>
<tr><td rowspan="1" colspan="1">Developing endosperm</td>
<td rowspan="1" colspan="1">31,340,522</td>
<td rowspan="1" colspan="1">19,480</td>
<td rowspan="1" colspan="1">15,262</td>
<td rowspan="1" colspan="1">12,614</td>
</tr>
<tr><td rowspan="1" colspan="1">Mature fruit</td>
<td rowspan="1" colspan="1">23,321,657</td>
<td rowspan="1" colspan="1">17,366</td>
<td rowspan="1" colspan="1">12,407</td>
<td rowspan="1" colspan="1">9,741</td>
</tr>
<tr><td rowspan="1" colspan="1">Seedling root</td>
<td rowspan="1" colspan="1">4,427,659</td>
<td rowspan="1" colspan="1">20,798</td>
<td rowspan="1" colspan="1">17,015</td>
<td rowspan="1" colspan="1">14,312</td>
</tr>
<tr><td rowspan="1" colspan="1">Seedling shoot</td>
<td rowspan="1" colspan="1">5,894,621</td>
<td rowspan="1" colspan="1">20,199</td>
<td rowspan="1" colspan="1">15,926</td>
<td rowspan="1" colspan="1">13,018</td>
</tr>
<tr><td rowspan="1" colspan="1">Drought root</td>
<td rowspan="1" colspan="1">7,255,199</td>
<td rowspan="1" colspan="1">21,371</td>
<td rowspan="1" colspan="1">17,015</td>
<td rowspan="1" colspan="1">14,236</td>
</tr>
<tr><td rowspan="1" colspan="1">Drought shoot</td>
<td rowspan="1" colspan="1">22,600,138</td>
<td rowspan="1" colspan="1">20,763</td>
<td rowspan="1" colspan="1">16,077</td>
<td rowspan="1" colspan="1">13,431</td>
</tr>
<tr><td rowspan="1" colspan="1">Albino root</td>
<td rowspan="1" colspan="1">1,267,4871</td>
<td rowspan="1" colspan="1">21,710</td>
<td rowspan="1" colspan="1">17,201</td>
<td rowspan="1" colspan="1">1,4273</td>
</tr>
<tr><td rowspan="1" colspan="1">Albino shoot</td>
<td rowspan="1" colspan="1">23,115,676</td>
<td rowspan="1" colspan="1">21,226</td>
<td rowspan="1" colspan="1">16,874</td>
<td rowspan="1" colspan="1">14,066</td>
</tr>
<tr><td rowspan="1" colspan="1">Leaf callus</td>
<td rowspan="1" colspan="1">2,150,935</td>
<td rowspan="1" colspan="1">18,615</td>
<td rowspan="1" colspan="1">15,356</td>
<td rowspan="1" colspan="1">12,727</td>
</tr>
</tbody>
</table>
</alternatives>
<table-wrap-foot><fn id="table-2fn"><p><bold>Notes.</bold>
</p>
</fn>
<fn id="table-2fn1"><label>a</label>
<p>read quality = q20 and read length = 100 nts.</p>
</fn>
</table-wrap-foot>
</table-wrap>
</sec>
<sec><title>Gene expression analysis</title>
<p>The combined transcriptome of twelve tissues of neem tree from Genotype 1 were assembled using Trinity software (<xref rid="ref-17" ref-type="bibr">Grabherr et al., 2011</xref>
). The transcripts were clustered to remove the over represented short fragments using CD-HIT-est program (<xref rid="ref-33" ref-type="bibr">Li & Godzik, 2006</xref>
) with a minimum similarity cut-off of 90%. The 44,495 genes from CD-HIT-est were mapped with RNA-seq data from individual tissues using SeqMap (<xref rid="ref-21" ref-type="bibr">Jiang & Wong, 2008</xref>
). The mapping of RNA-seq reads from each tissue were used to measure the expression value in RPKM (reads per kilo base per million) using rSeq tool (<xref rid="ref-22" ref-type="bibr">Jiang & Wong, 2009</xref>
). The gene expression was estimated using RPKM value minimum ≥1 for further analysis. The RPKM value was used to cluster the genes according to their expression pattern using WCGNA package in R tool (<xref rid="ref-28" ref-type="bibr">Langfelder & Horvath, 2008</xref>
). The expression value was also determined for assembled transcripts to verify expression of genes predicted from gene models.</p>
</sec>
<sec><title>Repeat prediction from neem genomes</title>
<p><italic>De novo</italic>
 repeat identification was done using RepeatModeler (<uri xlink:href="http://www.repeatmasker.org/RepeatModeler.html">http://www.repeatmasker.org/RepeatModeler.html</uri>
). The program was run with RM-BLAST (NCBI) database as an input for the repeat modelling. We trained Repeat Modeler using genomes of other published plant species including <italic>P. trichocarpa</italic>
 (39.40%), <italic>R. communis</italic>
 (51.74%) and <italic>A. thaliana</italic>
 (15.29%) for pipeline validation (<xref ref-type="supplementary-material" rid="supp-4">Table S4</xref>
). We downloaded NCBI data (<ext-link ext-link-type="NCBI:sra" xlink:href="SRA1085705">SRA1085705</ext-link>
) from <xref rid="ref-26" ref-type="bibr">Krishnan et al. (2012)</xref>
 and re-built the neem genome assembly using SOAPdenovo2 program with different k-mers (<xref ref-type="supplementary-material" rid="supp-5">Table S5</xref>
).</p>
</sec>
<sec><title>Quantitative PCR (qPCR) analysis</title>
<p>Total RNA from leaf, callus and developing endosperm (S1 =10 days post seed setting, S2 = 20 days post seed setting, S3 = 30 days post seed setting, S4 = 40 days post seed setting) was isolated using TRIzol reagent method and quantified using a Qubit Fluorometer. The cDNA synthesis was performed using total RNA (1 µg) with oligo(dT) random primers (50 µM) and SuperScript<sup>®</sup>
 III RT enzyme (200 u/µl) (Cat # 18080044; Life Technologies, Carlsbad, California, USA). The qPCR was performed on an Applied Biosystems, 7900HT Fast Real-Time PCR system machine. Real-time PCR was performed in a 384-wells optical reaction plate (Applied Biosystems, Foster City, California, USA) using SYBR green PCR mastermix (Life Technologies, Cat #4344463), which contains AmpliTaq Gold<sup>®</sup>
 DNA polymerase and ROX as a passive reference dye. Cycling conditions were 95 °C for 15 s, 60 °C for 30 s and 72 °C for 30 s with 40 cycles. We compared the fold change in qPCR experiments for selected genes from azadirachtin biosynthesis with conserved eukaryotic (rice) elongation factor 1<italic>α</italic>
 (eEF-1<italic>α</italic>
) gene (AK061464.1). All reactions were performed in triplicate with elongation factor primers and water as an internal control. The primer sequences of selected genes are listed in the <xref ref-type="supplementary-material" rid="supp-6">Table S6</xref>
.</p>
</sec>
<sec><title>SSRs, SNPs and InDels analysis</title>
<p>Identification of simple sequence repeats (SSRs) or microsatellite was done using MIcroSAtellite tool (MISA) (<uri xlink:href="http://pgrc.ipk-gatersleben.de/misa/">http://pgrc.ipk-gatersleben.de/misa/</uri>
) with assembled neem genome sequences. The SSRs containing contigs were extracted for SSRs motif variability prediction. The previously published SSR marker regions (<xref rid="ref-5" ref-type="bibr">Boontong, Pandey & Changtragoon, 2009</xref>
) for neem were compared to the newly identified SSRs from the genome. The SSR based polymorphism among the neem accessions was done using an in-house software pipeline. This pipeline uses the identified SSR region along with 100 bp upstream and downstream from each SSR loci. The SSR regions were aligned to each other for a pair of genomes using Bowtie2 alignment. The resulting .SAM files were parsed using the libraries function of genomic ranges (<xref rid="ref-1" ref-type="bibr">Aboyoun, Pages & Lawrence, 2010</xref>
), Gtools (<xref rid="ref-53" ref-type="bibr">Warnes, Bolker & Lumley, 2008</xref>
) and Stringr (<xref rid="ref-54" ref-type="bibr">Wickham, 2010</xref>
) in R program to obtain polymorphic SSR regions between neem genomes. The concordance was taken with neem tree Genotype 1 as a reference to shortlist the most polymorphic SSRs.</p>
<p>Single nucleotide polymorphism (SNP) and Insertion Deletion (InDels) markers were identified by mapping Illumina short reads from Genotype 2 and Genotype 3 to reference Genotype 1 assembly using Bowtie2 (<xref rid="ref-29" ref-type="bibr">Langmead & Salzberg, 2012</xref>
). The alignment results were converted into .BAM format using Samtools v1.19 (<xref rid="ref-32" ref-type="bibr">Li et al., 2009</xref>
). All .BAM files were merged into a combined .BAM file. The .BAM file was sorted, indexed and duplicates removed using Samtools v1.19 for further analyses. VCF file was generated for each genome to obtain SNPs and InDels from neem genome. The SNPs and InDels were further filtered to obtain aligned reads with quality >30, and minimum sequence depth of 10 reads. The SNPs were annotated using snpEff tool (<xref rid="ref-11" ref-type="bibr">Cingolani et al., 2012</xref>
).</p>
</sec>
<sec><title>Neem chloroplast and mitochondrial genome assembly and annotation</title>
<p>The reads for chloroplast and mitochondria were extracted separately by mapping genome reads to chloroplast and mitochondrial genomes of known plants; <italic>A. thaliana, B. napus, C. papaya, C. sinensis, N. tabacum, P. dactylifera, P. trichocarpa, R. communis, S. bicolour</italic>
 and <italic>V. vinifera</italic>
 using Bowtie2 (<xref rid="ref-29" ref-type="bibr">Langmead & Salzberg, 2012</xref>
). The mapped reads were extracted using Samtools (<xref rid="ref-32" ref-type="bibr">Li et al., 2009</xref>
) and assembled separately using Velvet (<xref rid="ref-58" ref-type="bibr">Zerbino & Birney, 2008</xref>
). Gene prediction and assembly of chloroplast genome was done using DOGMA (<xref rid="ref-56" ref-type="bibr">Wyman, Jansen & Boore, 2004</xref>
). Gene prediction and annotation of the mitochondrial assembly was done using Mitofy (<xref rid="ref-4" ref-type="bibr">Alverson et al., 2010</xref>
).</p>
</sec>
<sec><title>Synteny analysis</title>
<p>We aligned neem contigs on citrus (<italic>Citrus sinensis</italic>
) chromosomes using MUMMER program (<xref rid="ref-27" ref-type="bibr">Kurtz et al., 2004</xref>
). Synteny was computed among neem and citrus using Symap4.0 (<xref rid="ref-48" ref-type="bibr">Soderlund, Bomhoff & Nelson, 2011</xref>
). The synteny information was visualized using the Perl script provided with Symap4.0.</p>
</sec>
<sec><title>Classification of gene families and phylogenetic analysis</title>
<p>The proteome of 23 sequenced plant species along with neem were used to search for homologues and unique genes. All-vs-all BLAST-P (E-value e-10) was done using Proteinortho program (<xref rid="ref-30" ref-type="bibr">Lechner et al., 2011</xref>
). Comparative data from BLAST analysis was further classified into list of potential orthologs, co-orthologs and paralogs. Besides classifying, the program has also grouped the proteins into specific groups by clustering the gene-pairs. The gene content at the ancestral nodes along with the branches was reconstructed by using Wagner Parsimony and Likelihood-based approaches in the program count (<xref rid="ref-12" ref-type="bibr">Csűös, 2010</xref>
).</p>
</sec>
<sec><title>Quantification of azadirachtin, salanin and nimbin using UHPLC-MS/SRM method</title>
<p>We purchased neem standard metabolites such as azadirachtin (Cat. No. A7430; Sigma-Aldrich, Madhya Pradesh, India), salanin (Cat. No. ASB-00019028-005; Chromadex, Irvine, California, USA) and nimbin (Cat. No. N476280; Toronto Research Chemicals, Toronto, Canada). High purity MS grade solvents (methanol, acetonitrile and water) were obtained from Merck Millipore (Merck Millipore India Pvt. Ltd., Mumbai, India).</p>
<p>For metabolites analysis, we collected samples from various neem tissues (mature fruit, developing endosperm, mature leaf, flower and bud, fruit coat and pulp and seedling shoot and root) and dried at 37 °C for two days. The samples were ground into fine powder using Pestle and Mortar, and stored at −80 °C until use. There were no commercially internal standards available for neem metabolites quantification; therefore, we used estrone-d4 (Steraloids Inc, Newport, Rhode Island, USA) as an internal standard to construct the standard curve and also for absolute quantification. The calibration curves for the quantification of all three neem metabolites were linear over a 64-fold concentration range (azadirachtin and salanin) and 73-fold concentration range (nimbin) with linear regression correlation coefficients ranging from 0.998 to 0.999 (<xref ref-type="fig" rid="fig-1">Fig. 1D</xref>
). Typical UHPLC-MS/SRM chromatogram profile for neem metabolites from standard compounds and from the seed extract are shown in <xref ref-type="fig" rid="fig-1">Fig. 1C</xref>
. All analytes showed single sharp peak in C-18 column.</p>
<fig id="fig-1" orientation="portrait" position="float"><object-id pub-id-type="doi">10.7717/peerj.1066/fig-1</object-id>
<label>Figure 1</label>
<caption><title>Estimation of major neem metabolites from different tissues of neem.</title>
<p>(A) Neem tree, (B) structure of neem metabolites, (C) UHPLC-MS/SRM chromatogram of standards and metabolites from seed extract, (D) standard curve for all three metabolites and (E) concentration of neem metabolites from various tissues of neem tree.</p>
</caption>
<graphic xlink:href="peerj-03-1066-g001"></graphic>
</fig>
<p>Neem metabolites were extracted from the dried powder (2 mg) using 1 mL of methanol followed by 5 min sonication and centrifuged for 5 min (13,000 rpm, 10 °C), the supernatant was transferred to fresh micro-centrifuge tubes. About 5 µL of supernatant was spiked into 35 µL of methanol along with internal standard (10 µL estrone-d4 from 100 µg/mL). The analyses were done by injecting 10 µL of the sample into the UHPLC-MS/SRM system (LC-Agilent 1290 infinity series, MS–Thermo Fishers TSQ vantage). The intense product ions were selected for the LC-MS/SRM analysis [azadirachtin (703 → 567 Da), nimbin (541 → 509 Da), salanin (597 → 419 Da), and for estrone-d4 (275 → 257 Da)]. We used following LC conditions: solvent system A-Water (10 mM Ammonium Acetate) containing 0.1% FA and B-Acetonitrile containing 0.1% FA, Flow-200 µL/min, Column- C-18 column (Shim-pack, ODS III, 2.1 ×150 mm, 2 µm), Gradient- 2% B at 0 min, 2% B at 3 min, 40% B at 10 min, 95% B at 15 min, 2% B at 15.1 min, 2% B at 15.1–20 min. MS conditions: spray voltage, 3,000 V; ion transfer capillary temperature, 270 °C; source temperature 100 °C; sheath gas 18, auxillary gas 5 (arbitrary units); collision gas, argon; S-lens Voltage was optimized for individual metabolites; scan time of 50 millisec/transition and ion polarity positive. In case of azadirachtin (15.6 pg to 1 ng), nimbin (3.4 pg to 0.25 ng) and salanin (7.8 pg to 0.5 ng) on column was used to construct the standard curve. The overall scheme for the quantification of neem metabolites is shown in the <xref ref-type="fig" rid="fig-1">Fig. 1</xref>
. The UHPLC-MS/SRM chromatogram for lowest standard and metabolites from seed, standard curve for three metabolites and final quantification of neem metabolites from various tissues are illustrated. The final absolute quantification was done based on the constructed standard curve (ratio versus concentration) of individual metabolites.</p>
</sec>
<sec><title>Metabolites pathways analysis</title>
<p>The quantified metabolites data points from different explants or tissues were used to compare the gene expression pattern using WGCNA package in R (<xref rid="ref-28" ref-type="bibr">Langfelder & Horvath, 2008</xref>
) for tracking genes involved in azadirachtin biosynthetic pathway. The clustering program in WGCNA provided the initial correlation with gene expression and metabolite concentrations in different tissues of neem tree (<xref ref-type="fig" rid="fig-2">Fig. 2B</xref>
) (<xref ref-type="supplementary-material" rid="supp-7">Table S7</xref>
). The annotation of genes from KEGG and BLAST were used to select the candidate genes in secondary metabolite biosynthetic pathway. We selected the genes with best bit score for each step in terpenoid biosynthetic pathway. The selected gene annotation was confirmed by BLAST results against NCBI-nr database. In case of multiple hit for pathway function, the expression levels of genes in different tissues were compared to match the metabolite concentration by considering developing endosperm and mature leaf as contrasting datasets for azadirachtin concentration. Pearson’s correlation value was also calculated between the expression values of each gene in tissues versus Azadirachtin concentration (<xref ref-type="supplementary-material" rid="supp-8">Table S8A</xref>
). Further genes involved in conversion of squaline to azadirachtin could not be assigned to specified function. However, the genes showing a high expression level in developing endosperm and having high Pearson’s correlation with azadirachtin concentration were hypothesized to be part of azadirachtin biosynthesis. The hypothesized set of annotated genes in each pathway is listed in <xref ref-type="supplementary-material" rid="supp-8">Table S8</xref>
 and also shown in heatmap pattern (<xref ref-type="fig" rid="fig-3">Fig. 3</xref>
) using R statistical program.</p>
<fig id="fig-2" orientation="portrait" position="float"><object-id pub-id-type="doi">10.7717/peerj.1066/fig-2</object-id>
<label>Figure 2</label>
<caption><title>Gene expression and metabolite differences among different tissues of neem.</title>
<p>(A) Comparison of genes expressed commonly and uniquely in flower and bud, developing endosperm, mature leaf, mature fruit, fruit coat and pulp tissues. (B) Clustering dendrogram of samples based on gene expression values and corresponding metabolite concentration. The heat map shows the higher expression of the genes in specific tissue.</p>
</caption>
<graphic xlink:href="peerj-03-1066-g002"></graphic>
</fig>
<fig id="fig-3" orientation="portrait" position="float"><object-id pub-id-type="doi">10.7717/peerj.1066/fig-3</object-id>
<label>Figure 3</label>
<caption><title>Proposed Azadirachtin biosynthetic pathway in <italic>A. indica</italic>
.</title>
<p>Azadirachtin is the most prominent biopesticide belonging to organic molecule class called tetranortriterpenoids. Triterpenoids derived basically from squalene which in turn is derived from geranylpyrophosphate (GPP). GPP can be derived from mevalonate (MVA) or 2-C-methyl-D875 erythritol 4-phosphate (MEP) pathway. Enzymes of MVA pathway are as follows: AACT, acetyl-CoA acetyltransferase; HMGS, 3-hydroxy-3-methylglutaryl CoA synthase; HMGR, 3-hydroxy-3-methylglutaryl-coenzyme A reductase; MVK, mevalonate kinase; PMK, phosphomevalonate kinase; PMD, diphosphomevalonate decarboxylase. Enzymes of MEP pathway are DXS, 1-deoxy-D-xylulose-5-phosphate synthase; DXR, 1-deoxy-D xylulose-5-phosphate reducto-isomerase; MCT, 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase; CMK, 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase; MDS, 2-C methyl-D-erythritol 2,4-cyclodiphosphate synthase; HDS, 4-hydroxy-3-methylbut-2-enyl diphosphate synthase; HDR, 4-hydroxy-3-methylbut-2-enyl diphosphate reductase. Isopentenyl pyrophosphate isomerase (IPPI) catalyzes the isomerisation of isopentenyl pyrophosphate (IPP) to dimethylallyl pyrophosphate (DMAPP), whereas conversion of IPP to geranyl pyrophosphate (GPP) is catalyzed by geranyl pyrophosphate synthase (GPS). GPP is further converted to farnesyl-diphosphate (FPP) and squalene by farnesyl-diphosphatesynthase (FPS) and squalene synthase (SS) respectively. Solid arrows indicate known steps, whereas broken arrows represents unknown intermediates and enzymes. Numbers beside coloured blocks indicate tissue types (1, developing endosperm; 2, mature leaf; 3, mature fruit; 4, seedling root; 5, fruit coat and pulp; 6, seedling shoot; 7, open flower and flower bud) and heatmap is represented as expression RPKM value.</p>
</caption>
<graphic xlink:href="peerj-03-1066-g003"></graphic>
</fig>
</sec>
</sec>
<sec sec-type="results"><title>Results</title>
<sec><title><italic>De novo</italic>
 sequencing and assembly of neem genome</title>
<p>We sequenced the neem genome using Illumina and 454 platforms. Illumina HiSeq paired-end (2 × 100 nts) sequencing yielded 13.86 Gb of high quality data for the neem Genotype 1 (GKVK, Bangalore). D<italic>e novo</italic>
 assembly resulted in 216 Mb (version 1.1) with 21X coverage using Velvet software. This analysis produced a total of 94,780 scaffolds where 90% of the genome was covered by scaffolds length longer than 1,000 bp. The N50 was about 22 Kb with 31.13% of GC content and the longest scaffold length was 241 Kb. In addition to Illumina data, we also generated 1.13 Gb (2,762,254 reads) data using Roche 454 GS FLX + chemistry for the Genotype 1. The average read length was 410 nts and longest 454 read length was 1,596 nts. The 454 assembly was generated using MIRA program (<xref ref-type="table" rid="table-1">Table 1</xref>
). The draft genome quality was further improved through hybrid assembly by merging Illumina contigs and 454 contigs using CD-HIT-est by keeping minimum similarity cut-off of 90%. The total size of hybrid genome assembly for neem Genotype 1 was improved to 267 Mb. Assembly statistics of the improved assembly neem Genotype 1 is shown in <xref ref-type="table" rid="table-1">Table 1</xref>
. Clustering approach in hybrid assembly has significantly reduced the number of contigs from 94,780 to 68,604 and decreased the number of N’s from 2.22% (4,841,912 nts) to 1.81% (4,842,395 nts). Although clustering approach reduced the N50 from 22.3 Kb to 15.95 Kb, the longest and shortest contig length remained unchanged. Hence, the clustered hybrid assembly (version 2.1) was chosen for further detailed analyses. To confirm the completeness of neem genome assembly, we analyzed conserved eukaryotic genes (CEG) in the neem hybrid genome assembly. This analysis was able to identify 224 out of 248 complete CEGs in the neem assembly.</p>
</sec>
<sec><title>Neem genome annotation</title>
<p>Gene prediction was performed using two programs, Augustus and GenScan, which led to identification of 40,130 and 52,617 genes, respectively. In total, we identified 44,495 genes after merging and clustering of genes from these two programs. The genic region of neem genome was about 114 Mb (42.5%). The initial gene prediction was carried out without masking the repeat regions in the genome to avoid missing of simple sequence repeats in the coding regions and to predict proper exon/intron boundaries. To annotate the neem genes, annot8r program was used by incorporating functional proteins from GO, EC and KEGG and merged with BLAST results (<xref ref-type="supplementary-material" rid="supp-9">Table S9</xref>
). There were 29,050 unique genes in the neem genome that were free from any repeat elements.</p>
<p>We used RNAseq data to validate predicted neem genes (<xref ref-type="table" rid="table-2">Table 2</xref>
). This analysis revealed thousands of genes expressed in various neem tissues including flower (21,927 genes), mature fruit (17,366), developing endosperm (19,480), mature leaf (19,308), fruit coat and pulp (21,537), seedling root (20,798), and seedling shoot (20,199). The analysis also showed that 3,008 genes exhibited tissue specific expression profile, while 13,711 genes were expressed in all the tissues (<xref ref-type="fig" rid="fig-2">Fig. 2A</xref>
). We identified 80,867 transcripts (53 Mb) from <italic>de novo</italic>
 assembly of neem transcriptome using Trinity program. These transcripts were further used as supporting expression evidences for the genes that involved in metabolites biosynthesis pathways. The protein sequences for 44,495 genes were compared with proteome of 23 sequenced plant species. Of these, 23,125 genes (52%) were classified into 18,327 families (<xref ref-type="fig" rid="fig-4">Fig. 4</xref>
). Neem genome found to have 4,320 multi-gene families (<xref ref-type="supplementary-material" rid="supp-9">Table S9</xref>
).</p>
<fig id="fig-4" orientation="portrait" position="float"><object-id pub-id-type="doi">10.7717/peerj.1066/fig-4</object-id>
<label>Figure 4</label>
<caption><title>Co-orthologous groups detected by Proteinortho using UPGMA method.</title>
<p>The clustering is based on similarity of the proteome among the plant species.</p>
</caption>
<graphic xlink:href="peerj-03-1066-g004"></graphic>
</fig>
</sec>
<sec><title>Gene ortholog analysis</title>
<p>Ancestral orthogroup reconstruction was done using Wagner Parsimony (<xref ref-type="supplementary-material" rid="supp-10">Table S10</xref>
) and likelihood based on birth–death model with equal gain-loss penalty (<xref ref-type="supplementary-material" rid="supp-11">Table S11</xref>
). The orthogroup reconstruction shows that 5,122 genes gained and 2,755 genes lost in comparison with ancestral nodes based on the gene family expansion in the neem genome.</p>
<p>Along with gene classification to orthogroups, the BLAST results of proteinortho (<xref rid="ref-30" ref-type="bibr">Lechner et al., 2011</xref>
) showed 24,216 genes (54.42%) as common between neem and citrus (<xref ref-type="fig" rid="fig-5">Fig. 5A</xref>
). The comparative analyses revealed 20,279 and 21,931 genes that are unique to neem and citrus, respectively. Out of 20,279 unique genes, 5,832 genes were expressed in various neem tissues.</p>
<fig id="fig-5" orientation="portrait" position="float"><object-id pub-id-type="doi">10.7717/peerj.1066/fig-5</object-id>
<label>Figure 5</label>
<caption><title>Comparison of neem genes with other sequenced plants.</title>
<p>(A) Schematic diagram showing common and unique genes between citrus and neem genomes. (B) Number and percentage of proteins having homolog hit in query (Arabidopsis, Populus, Grapes, Castor and Rice) with greater than 60% identity in genomes of neem and citrus.</p>
</caption>
<graphic xlink:href="peerj-03-1066-g005"></graphic>
</fig>
<p>Neem orthologous genes were compared with dicot (Arabidopsis, Populus and grapes) and monocot (rice) plants (<xref ref-type="fig" rid="fig-5">Fig. 5B</xref>
). 27,498 genes out of 44,495 genes were found to have orthologs (with more than 60% identity) in other plant species. Interestingly, 38% (16,997) of genes from neem did not show any orthologs in sequenced plant genomes. This analysis identified 16,997 genes that are unique to neem, which support the presence of unique gene families in neem tree. Among these unique genes, 680 genes share minor homology with hypothetical/predicted proteins in other plant species, while 2,343 were genes found to have no sequence similarity to either predicted or known genes. The list of unique genes in neem is summarized in the <xref ref-type="supplementary-material" rid="supp-12">Table S12</xref>
. The unique genes were further filtered for presence of repeat elements and expression evidences. More than 3,000 unique genes that have no repeat elements in the neem genes have expression evidence. This indicates that neem repeat derived genes are active in various neem tissues, which are interesting for future studies.</p>
</sec>
<sec><title>Repeat content in neem tree genome</title>
<p><italic>De novo</italic>
 repeat identification in neem genome was performed using RepeatModeler (<xref rid="ref-47" ref-type="bibr">Smit & Hubley, 2008</xref>
). This analysis identified 32.53% (87 Mb) of neem genome constitute for repeat elements, of which 17.06% of repeats were not annotated to any known repeat families (<xref ref-type="table" rid="table-3">Table 3</xref>
). The long terminal repeat (LTR), retro-transposons are the major classes of known repeats, which constitute about 10% of repeats the neem genome. The detailed repeat prediction is shown in <xref ref-type="supplementary-material" rid="supp-13">Table S13</xref>
. Our repeat prediction in neem (32.53%) is comparable to rice genome. Our RepeatModeler accuracy for repeat prediction in neem genome was further confirmed by independent analyses of the other two re-sequenced neem genomes (Genotype 2 = 27.63%, Genotype 3 = 26.76%). To verify repeats in the recently reported neem genome by <xref rid="ref-26" ref-type="bibr">Krishnan et al. (2012)</xref>
, we re-built the neem genome assembly reported by Krishnan et al. short read data (<ext-link ext-link-type="NCBI:sra" xlink:href="SRA1085705">SRA1085705</ext-link>
) using SOAPdenovo2 program. During our re-analysis of neem genome assembly from Krishnan et al. dataset (<ext-link ext-link-type="NCBI:sra" xlink:href="SRA1085705">SRA1085705</ext-link>
), we found 22 to 35% of genome harbour repeats (<xref ref-type="supplementary-material" rid="supp-5">Table S5</xref>
). In addition, we identified higher gaps with Ns (14.7% to 60%) for 31 to 37 k-mers from <ext-link ext-link-type="NCBI:sra" xlink:href="SRA1085705">SRA1085705</ext-link>
 (<xref ref-type="supplementary-material" rid="supp-5">Table S5</xref>
) as compared to neem genome assembly (1.81%) from our study.</p>
<table-wrap id="table-3" orientation="portrait" position="float"><object-id pub-id-type="doi">10.7717/peerj.1066/table-3</object-id>
<label>Table 3</label>
<caption><title><italic>De novo</italic>
 repeat prediction from neem genome using Repeat Modeler program.</title>
</caption>
<alternatives><graphic xlink:href="peerj-03-1066-g010"></graphic>
<table frame="hsides" rules="groups"><colgroup span="1"><col span="1"></col>
<col span="1"></col>
<col span="1"></col>
<col span="1"></col>
<col span="1"></col>
</colgroup>
<thead><tr><th rowspan="1" colspan="1">Repeat type</th>
<th rowspan="1" colspan="1">Subclass</th>
<th rowspan="1" colspan="1">Number of elements</th>
<th rowspan="1" colspan="1">Length in bp (%)</th>
<th rowspan="1" colspan="1">% of sequences</th>
</tr>
</thead>
<tbody><tr><td rowspan="1" colspan="1">RNA elements:</td>
<td rowspan="1" colspan="1"></td>
<td rowspan="1" colspan="1"></td>
<td rowspan="1" colspan="1"></td>
<td rowspan="1" colspan="1"></td>
</tr>
<tr><td rowspan="1" colspan="1">SINEs:</td>
<td rowspan="1" colspan="1"></td>
<td rowspan="1" colspan="1">119</td>
<td rowspan="1" colspan="1">10,033bp (0.01)</td>
<td rowspan="1" colspan="1">0.00</td>
</tr>
<tr><td rowspan="1" colspan="1">LINEs:</td>
<td rowspan="1" colspan="1"></td>
<td rowspan="1" colspan="1">2,554</td>
<td rowspan="1" colspan="1">124,0701 bp (0.46)</td>
<td rowspan="1" colspan="1">0.46</td>
</tr>
<tr><td rowspan="1" colspan="1"></td>
<td rowspan="1" colspan="1">LINE1</td>
<td rowspan="1" colspan="1">1,918</td>
<td rowspan="1" colspan="1">100,2661bp (0.37)</td>
<td rowspan="1" colspan="1">0.37</td>
</tr>
<tr><td rowspan="1" colspan="1"></td>
<td rowspan="1" colspan="1">LINE2</td>
<td rowspan="1" colspan="1">119</td>
<td rowspan="1" colspan="1">39,194 bp (0.01)</td>
<td rowspan="1" colspan="1">0.01</td>
</tr>
<tr><td rowspan="1" colspan="1">LTR elements:</td>
<td rowspan="1" colspan="1"></td>
<td rowspan="1" colspan="1">51,260</td>
<td rowspan="1" colspan="1">26,838,058 bp (10.02)</td>
<td rowspan="1" colspan="1">10.02</td>
</tr>
<tr><td rowspan="1" colspan="1"></td>
<td rowspan="1" colspan="1">ERV_class1</td>
<td rowspan="1" colspan="1">462</td>
<td rowspan="1" colspan="1">130,859 bp (0.05)</td>
<td rowspan="1" colspan="1">0.05</td>
</tr>
<tr><td rowspan="1" colspan="1">DNA elements:</td>
<td rowspan="1" colspan="1"></td>
<td rowspan="1" colspan="1">17,356</td>
<td rowspan="1" colspan="1">7,112,032 bp (2.65)</td>
<td rowspan="1" colspan="1">2.65</td>
</tr>
<tr><td rowspan="1" colspan="1">Unclassified:</td>
<td rowspan="1" colspan="1"></td>
<td rowspan="1" colspan="1">16,6584</td>
<td rowspan="1" colspan="1">45,693,338 bp (17.06)</td>
<td rowspan="1" colspan="1">17.06</td>
</tr>
<tr><td rowspan="1" colspan="1">Total interspersed repeats:</td>
<td rowspan="1" colspan="1"></td>
<td rowspan="1" colspan="1"></td>
<td rowspan="1" colspan="1">80,894,162 bp (30.20)</td>
<td rowspan="1" colspan="1">30.20</td>
</tr>
<tr><td rowspan="1" colspan="1">Simple repeats:</td>
<td rowspan="1" colspan="1"></td>
<td rowspan="1" colspan="1">43,430</td>
<td rowspan="1" colspan="1">1,601,548 bp (0.60)</td>
<td rowspan="1" colspan="1">0.60</td>
</tr>
<tr><td rowspan="1" colspan="1">Low complexity:</td>
<td rowspan="1" colspan="1"></td>
<td rowspan="1" colspan="1">99,976</td>
<td rowspan="1" colspan="1">5,150,192 bp (1.92)</td>
<td rowspan="1" colspan="1">1.92</td>
</tr>
</tbody>
</table>
</alternatives>
</table-wrap>
</sec>
<sec><title>DNA polymorphism among neem genotypes</title>
<p>Simple sequence repeats (SSR), single-nucleotide polymorphisms (SNPs) and small insertions and deletions (InDels) are the most abundant DNA markers in any plant genomes. Three sequenced neem genotypes were compared to assess genome-wide genetic diversity. This analysis identified 140807, 108020 and 95840 SSRs from neem Genotype 1, Genotype 2 and Genotype 3, respectively (<xref ref-type="supplementary-material" rid="supp-14">Tables S14A</xref>
 and <xref ref-type="supplementary-material" rid="supp-14">S14B</xref>
). Genes in Genotype 1 (2,217), Genotype 2 (1,665) and Genotype 3 (1,606) were associated with the presence of SSRs motif. SSR markers in the genic regions of Genotype 1 are summarized in <xref ref-type="supplementary-material" rid="supp-14">Tables S14A</xref>
 and <xref ref-type="supplementary-material" rid="supp-14">S14B</xref>
. Tri-repeats were highest (1,841 SSRs) for genes as compared to other repeat units. The AAG/CTT (533), AG/CT (114) and A/T (127) were the most predominant SSR motifs in genic regions of neem genome (<xref ref-type="supplementary-material" rid="supp-14">Tables S14B</xref>
). The list of SSR associated genes in Genotype 1 has been provided in <xref ref-type="supplementary-material" rid="supp-15">Table S15</xref>
.</p>
<p>With the availability of three neem genomes, we developed an in-house analysis pipeline to predict <italic>in-silico</italic>
 polymorphism in SSR loci. Nearly 100 nts from upstream and downstream regions of SSR motif from Genotype 1 (reference) were used for SSR polymorphism analysis. These analyses resulted in 2,199 and 2,120 polymorphic SSRs in the Genotype 2 and Genotype 3, respectively. The concordance SSR analyses showed 571 SSRs were polymorphic among three neem genomes (<xref ref-type="supplementary-material" rid="supp-16">Table S16</xref>
). The <italic>in-silico</italic>
 identified polymorphic SSR markers can be used to test polymorphism in neem germplasm or natural plantation.</p>
<p>In addition, SNP and InDel were further analyzed among three neem genotypes using Genotype 1 as a reference genome. This analysis yielded 698,173 SNPs and 53,508 InDels for Genotype 2 and 860,215 SNPs and 66,171 InDels for Genotype 3, respectively. SNP annotation was carried out to discern presence of SNPs in the coding region of the genome. The SNP annotation showed that 80.70% and 79.16% of SNPs in Genotype 2 and 3, respectively, are located in the non-coding regions (upstream and downstream) of the genes. We identified 130,668 (2.78%) and 159,375 (3.21%) SNPs, for exonic regions of coding genes from Genotype 2 and 3, respectively. There were 31,312 and 37,864 SNPs in the coding sequence which represents the synonymous amino acid changes in protein coding regions of Genotype 2 and 3, respectively. There were 84,194 and 102,663 SNPs that showed non-synonymous amino acid substitutions in the protein coding regions of Genotype 2 and 3, respectively. The presence of SNPs in start and stop codons were less as expected. The details of SNP annotation is summarized in <xref ref-type="supplementary-material" rid="supp-17">Table S17</xref>
.</p>
</sec>
<sec><title>Azadirachtin biosynthesis pathways</title>
<p>We have quantified neem metabolites using high sensitive UHPLC-MS/SRM method, which can detect the picogram (pg) level of neem metabolites (azadirachtin—15.6 pg, nimbin—3.4 pg and salanin—7.8 pg). The concentration of neem metabolites in mature seeds (azadirachtin (11,046 pg/µg), nimbin (3,607 pg/µg) and salanin (5,235 pg/µg)) was higher than other neem tissues. The concentration of Azadirachtin was always higher in mature seeds, followed by developing endosperm, shoot, root, cambium, pulp, flower, bark and leaf. Nimbin concentration was higher in seed, followed by bark, cambium, endosperm, root, shoot, flower, pulp and leaf. Similarly for salanin, the trend was higher in seeds followed by endosperm, bark, cambium, root, shoot, leaf, pulp and flower. The quantification of neem metabolites from various tissues is shown in <xref ref-type="fig" rid="fig-1">Fig. 1E</xref>
. Three of these metabolites are always found to be higher in seeds than other tissues, therefore, we quantified these across various seed developmental stages (S1 = 10 days post seed setting, S2, S3 and S4 = 40 days post seed setting). The level of all these three metabolites was higher across developing seed stages (<xref ref-type="fig" rid="fig-6">Fig. 6B</xref>
).</p>
<fig id="fig-6" orientation="portrait" position="float"><object-id pub-id-type="doi">10.7717/peerj.1066/fig-6</object-id>
<label>Figure 6</label>
<caption><title>Correlation of gene expression using qPCR and metabolites content in various neem tissues.</title>
<p>(A) qPCR performed with genes selected from RNA-seq data. Developing seeds were selected across different stages (S1—developing seed 1, S2—developing seed 2, S3—developing seed 3, S4—developing seed 4) based on growth stages show high gene expression as compared to other tissues. (B) LC-MS quantified metabolites data from various tissues.</p>
</caption>
<graphic xlink:href="peerj-03-1066-g006"></graphic>
</fig>
<p>Genes involved in various steps from tirucallol to azadirachtin are not yet established. Therefore, we devised bioinformatics approach by clustering expressed genes along with amount of metabolites (azadirachtin, nimbin and salanin) in each tissue using Weighted Correlation Network Analysis (WGCNA) method. This WGCNA clustering analysis identified azadirachtin biosynthesis genes that are up-regulated in developing endosperm, and had a minimal expression in other tissues such as leaf, flower, fruit coat and pulp (<xref ref-type="supplementary-material" rid="supp-8">Table S8A</xref>
). From this analysis, we identified more than 150 genes that were highly expressed in developing endosperm. However, the majority of these genes have homology to other plant species with a Pearson’s co-relation value above +0.8 for azadirachtin. To validate our method, we selected 10 genes with high correlation value (≥0.9) with azadirachtin (<xref ref-type="supplementary-material" rid="supp-8">Table S8A</xref>
) to perform quantitative PCR (qPCR). For this analysis, we used RNA from leaf, callus and developing seed (S1, S2, S3, S4 different stages of seed development) from neem Genotype 1. This analysis showed that 8 out of 10 genes (<xref ref-type="fig" rid="fig-6">Fig. 6A</xref>
) have high expression in developing seeds and low expression in other tissues (leaf and callus). The qPCR results concordance with WGCNA based clustering of RNA-seq and UHPLC-MS/SRM dataset. The highly expressed genes such as transketolases (Ai02g19151 and Ai02g23582) and dehydrogenases (Ai02g25309 and Ai02g12737) (<xref ref-type="fig" rid="fig-6">Fig. 6A</xref>
) were among the top ranked in Pearson’s co-relation value in WGCNA and qPCR analysis.</p>
</sec>
<sec><title>Organelle genomes assembly and annotation</title>
<p>We filtered reads which map to chloroplast and mitochondrial genomes of other plant species. The assembly and annotation statistics for organelle genome are shown in <xref ref-type="table" rid="table-4">Table 4</xref>
. The chloroplast genome assembly contains 60 scaffolds with size of 112,958 nts, which accounted for 72% of average plant chloroplast genome. The chloroplast genome had N50 of 2,125 nts, GC content of 38.07% and longest scaffold length of 8,435 nts. Further, gene prediction and assembly of chloroplast genome was done using DOGMA (<xref rid="ref-56" ref-type="bibr">Wyman, Jansen & Boore, 2004</xref>
), which showed 77 unique genes in chloroplast genome of neem (<xref ref-type="supplementary-material" rid="supp-18">Table S18</xref>
).</p>
<table-wrap id="table-4" orientation="portrait" position="float"><object-id pub-id-type="doi">10.7717/peerj.1066/table-4</object-id>
<label>Table 4</label>
<caption><title>Mitochondria and chloroplast genomes assembly statistics.</title>
</caption>
<alternatives><graphic xlink:href="peerj-03-1066-g011"></graphic>
<table frame="hsides" rules="groups"><colgroup span="1"><col span="1"></col>
<col span="1"></col>
<col span="1"></col>
</colgroup>
<thead><tr><th rowspan="1" colspan="1">Assembly parameters</th>
<th rowspan="1" colspan="1">Mitochondrion</th>
<th rowspan="1" colspan="1">Chloroplast</th>
</tr>
</thead>
<tbody><tr><td rowspan="1" colspan="1">Total number of reads</td>
<td rowspan="1" colspan="1">15,659,391</td>
<td rowspan="1" colspan="1">22,211,576</td>
</tr>
<tr><td rowspan="1" colspan="1">k-mer</td>
<td rowspan="1" colspan="1">63</td>
<td rowspan="1" colspan="1">61</td>
</tr>
<tr><td rowspan="1" colspan="1">Assembled genome size (bp)</td>
<td rowspan="1" colspan="1">266,430</td>
<td rowspan="1" colspan="1">112,958</td>
</tr>
<tr><td rowspan="1" colspan="1">Total number of contigs</td>
<td rowspan="1" colspan="1">348</td>
<td rowspan="1" colspan="1">152</td>
</tr>
<tr><td rowspan="1" colspan="1">N50 (bp)</td>
<td rowspan="1" colspan="1">1,490</td>
<td rowspan="1" colspan="1">2,125</td>
</tr>
<tr><td rowspan="1" colspan="1">Maximum contig length (bp)</td>
<td rowspan="1" colspan="1">9,110</td>
<td rowspan="1" colspan="1">8,435</td>
</tr>
<tr><td rowspan="1" colspan="1">Minimum contig length (bp)</td>
<td rowspan="1" colspan="1">125</td>
<td rowspan="1" colspan="1">121</td>
</tr>
<tr><td rowspan="1" colspan="1">% of bases in contigs ≥1,000 bp</td>
<td rowspan="1" colspan="1">60.71</td>
<td rowspan="1" colspan="1">63.33</td>
</tr>
<tr><td rowspan="1" colspan="1">GC content %</td>
<td rowspan="1" colspan="1">43.1</td>
<td rowspan="1" colspan="1">38.07</td>
</tr>
<tr><td rowspan="1" colspan="1">No of genes predicted</td>
<td rowspan="1" colspan="1">39</td>
<td rowspan="1" colspan="1">77</td>
</tr>
</tbody>
</table>
</alternatives>
</table-wrap>
<p>The mitochondrial genome of neem was also assembled (<xref ref-type="table" rid="table-4">Table 4</xref>
). N50 for mitochondrial genome had 1,490 nts, the sequence covered 266,430 nts of the genome in 348 scaffolds. The GC content of the mitochondrial genome was 43.10%. Gene prediction and annotation of the mitochondrial assembly was done using Mitofy (<xref rid="ref-4" ref-type="bibr">Alverson et al., 2010</xref>
), which identified 39 genes out of 41 reported mitochondrial genes (<xref ref-type="supplementary-material" rid="supp-19">Table S19</xref>
).</p>
</sec>
<sec><title>Genome orthology and synteny analysis</title>
<p>Genome size and chromosomal architecture was not available for neem tree. Synteny analysis was performed to obtain conserved chromosomal blocks of neem genome and genes in a closely related plant species. Our analysis revealed that the neem genome is phylogentically related to citrus. Therefore, we used the citrus genome (nine citrus pseudomolecules) as a reference to order neem contigs. This comparative analysis anchored 24,902 neem contigs onto 9 citrus chromosomes (<xref ref-type="fig" rid="fig-7">Fig. 7</xref>
). This anchoring method ordered 161 Mb (62%) of neem genome covering, which is equivalent to 48% of citrus genome. Detailed anchoring analysis revealed that 497 syntenic blocks with 12,176 synteny hit with citrus genome (<xref ref-type="fig" rid="fig-7">Fig. 7</xref>
).</p>
<fig id="fig-7" orientation="portrait" position="float"><object-id pub-id-type="doi">10.7717/peerj.1066/fig-7</object-id>
<label>Figure 7</label>
<caption><title>Schematic representation of syntenic relationship between citrus and neem.</title>
<p>The colored line (1–9, UNK) represents the syntenic blocks in neem anchored to chromosomal region in citrus. The turquoise colored blocks show the synteny of un-anchored neem scaffolds with citrus.</p>
</caption>
<graphic xlink:href="peerj-03-1066-g007"></graphic>
</fig>
</sec>
</sec>
<sec sec-type="discussion"><title>Discussion</title>
<p>Neem is medicinally, agriculturally and environmentally important tropical tree in Indian subcontinent. Neem is well-known for its complex tetranortriterpenoids compounds such as azadirachtin, nimbin and salanin, which are the main constituents in insecticidal and pharmaceutical formulations (<xref rid="ref-6" ref-type="bibr">Brahmachari, 2004</xref>
). Unlike inorganic synthetic pesticides, neem compounds are best-known bio-pesticides, which can easily be degraded and have lower pesticidal toxicity in the environment. Biochemical compounds from neem tree have been well investigated in the 20th century (<xref rid="ref-6" ref-type="bibr">Brahmachari, 2004</xref>
). However, genetic, molecular and genomic resources are not well developed to understand genes and biochemical pathways in neem. Recently, attempts were made to generate ESTs (<xref rid="ref-41" ref-type="bibr">Rajakani et al., 2014</xref>
; <xref rid="ref-36" ref-type="bibr">Narnoliya et al., 2014</xref>
) and genomic (<xref rid="ref-26" ref-type="bibr">Krishnan et al., 2012</xref>
) resources. However, these studies have generated limited number of ESTs (<xref rid="ref-41" ref-type="bibr">Rajakani et al., 2014</xref>
; <xref rid="ref-36" ref-type="bibr">Narnoliya et al., 2014</xref>
) and also non-availability of genome assembly and genes (<xref rid="ref-26" ref-type="bibr">Krishnan et al., 2012</xref>
) in the public domain. Our study aimed to develop comprehensive genomic, transcriptomic and metabolomic resources for neem tree.</p>
<p>We sequenced three neem genomes from three distinct geographical regions of southern India. We annotated the neem genome to the best of our knowledge using available bioinformatics tools. We could able to assemble 70% (267 Mb) of genome based on estimated genome neem size by <xref rid="ref-37" ref-type="bibr">Ohri, Bhargava & Chatterjee (2004)</xref>
. More than 90% of conserved eukaryotic genes (CEGs) mapped to <italic>de novo</italic>
 assembled neem genome. Our prediction of genes (44,495) and TE-related genes (35%) are comparable with well annotated rice genes (<uri xlink:href="http://rice.plantbiology.msu.edu">http://rice.plantbiology.msu.edu</uri>
) (<xref rid="ref-24" ref-type="bibr">Kawahara et al., 2013</xref>
). More than 30,000 genes in the neem genome are supported by expression evidences, 13,711 genes expressed in most of neem tissues and 3,000 genes expressed in a tissue specific manner.</p>
<p>Our comprehensive analysis predicted about 87 Mb (33%) repeats in the neem genome in contrast to previous study (<xref rid="ref-26" ref-type="bibr">Krishnan et al., 2012</xref>
). They have reported that neem genome contains the lowest repeat content (13.03%) in the plant kingdom (<xref rid="ref-26" ref-type="bibr">Krishnan et al., 2012</xref>
). The previously published neem genome by Krishnan and colleagues has not released genome assembly and hence we cannot really make use of information. We tried to rebuild the neem genome using Krishnan et al. short reads dataset (<ext-link ext-link-type="NCBI:sra" xlink:href="SRA1085705">SRA1085705</ext-link>
). This analysis predicted more than 20% of repeats and constitutes higher gaps with lots of Ns (up to 60%) (<xref ref-type="supplementary-material" rid="supp-5">Table S5</xref>
). According to our knowledge, they have underestimated the repeat content in the neem genome and their results are not reproducible (<xref ref-type="supplementary-material" rid="supp-5">Table S5</xref>
). Our assembly generated from neem Genotype 1 showed higher repeats (33%) content and lesser Ns (less than 2%) than the study by Krishnan and colleagues.</p>
<p>Other highlight of our study is that we sequenced three neem genotypes from varied environmental conditions, which assisted us to identify DNA molecular markers such as SSRs, SNPs and InDels. We obtained about 2.9 SNPs and 0.22 InDels per 1,000 nts on the reference neem genome (Genotype 1). SNP and InDel markers density distribution in neem genome are lower than citrus genome (3.6 SNPs and 0.6 InDels per Kb) (<xref rid="ref-57" ref-type="bibr">Xu et al., 2013</xref>
). The lower SNP and InDel diversity in the neem genome and the genic region indicates that neem trees might be genetically less diverse. Neem trees might be forced to self pollinate because of bisexual nature of flowers, closed floral anatomy and lack of self incompatibility (<xref rid="ref-39" ref-type="bibr">Puri, 1999</xref>
). Other reason might be due to the presence of insect repellent compounds in leaves and flowers, which may significantly reduce cross pollination. Molecular markers (SNPs, InDels and SSRs) from this study are highly useful in identification of elite genotypes, tagging of traits and cloning of genes involved in biochemical pathways in neem through genome-wide association studies.</p>
<p>In the molecular phylogeny, neem is genetically related to sweet orange (<italic>Citrus sinensis</italic>
) at the family level (<xref rid="ref-57" ref-type="bibr">Xu et al., 2013</xref>
). A lot more genetic and genomic resources are available for citrus plant (<xref rid="ref-55" ref-type="bibr">Wu et al., 2014</xref>
) as compared to neem. Therefore we used citrus for comparative analysis with neem genome (<xref ref-type="fig" rid="fig-5">Fig. 5A</xref>
). We observed that extensive syntenic blocks (62% of neem genome) between neem and citrus chromosomes and about 50% (24,216) of neem genes were conserved in the citrus genome. Citrus has been well researched plant for limonoids (<xref rid="ref-57" ref-type="bibr">Xu et al., 2013</xref>
; Wu et al., 2014), which are highly oxygenated limonoids present in both Rutaceae and Meliaceae. Molecular resources from our study will help in dissecting common and specific limonoid pathways in neem and citrus.</p>
<p>The proposed biosynthetic pathway of azadirachtin in neem is not well studied. The tirucallol (C30 triterpene) a steroid triterpenoid, is a possible precursor for azadirachtin biosynthesis in neem (<xref rid="ref-23" ref-type="bibr">Johnson, Morgan & Peiris, 1996</xref>
; <xref rid="ref-31" ref-type="bibr">Ley, Denholm & Wood, 1993</xref>
). In the first step, two molecules of farnesyl diphosphate combine to generate tirucallol molecule followed by losing three methyl groups and oxidized to form apotirucallol (a tetranortriterpenoid, or limonoid). Then it loses the four terminal carbons (<xref rid="ref-14" ref-type="bibr">Dewick, 2011</xref>
; <xref rid="ref-31" ref-type="bibr">Ley, Denholm & Wood, 1993</xref>
). The third ring of apotirucallol is oxidized to form the C-seco limonoids, nimbin and salannin (<xref rid="ref-23" ref-type="bibr">Johnson, Morgan & Peiris, 1996</xref>
; <xref rid="ref-31" ref-type="bibr">Ley, Denholm & Wood, 1993</xref>
; <xref rid="ref-40" ref-type="bibr">Puri, 2003</xref>
; <xref rid="ref-43" ref-type="bibr">Saxena, 1989</xref>
). These molecules are heavily oxidized and cyclised to produce azadirachtin (<xref rid="ref-2" ref-type="bibr">Aerts & Mordue, 1997</xref>
; <xref rid="ref-20" ref-type="bibr">Hosfelt, 2008</xref>
).</p>
<p>Our study added genomic, transcriptomic and metabolites data to support limonoids biosynthesis pathway genes in neem. To understand the neem metabolites bio-synthesis, we quantified major metabolites using a sensitive UHPLC-MS/SRM method. We identified the known secondary metabolite pathway genes including farnesyl diphosphate synthase (Ai02g10209), squalene synthase (Ai02g1634), geranyl diphosphate synthase (Ai02g31423), mevalonate kinase (Ai02g16520) and other unannotated genes upstream to squalene (<xref ref-type="supplementary-material" rid="supp-20">Table S20</xref>
). Across different stages of developing seed (10 to 40 days after seed setting), we observed increased content of major metabolites as compared to leaf and callus tissues (<xref ref-type="fig" rid="fig-6">Fig. 6B</xref>
). In mature seeds, the concentration of neem metabolites (azadiractin (11,046 pg/µg), nimbin (3,607 pg/µg) and salanin (5,235 pg/µg)) was higher than other neem tissues as reported earlier. Azadirachtin was 5,000 fold higher in seed as compared to leaves. Azadirachtin is the major complex limonoids in neem tree. Genes involved in various steps from tirucallol to azadirachtin are not yet established. Genes identified from this study will help to identify biochemical pathways in future. Clustering of genes expression and metabolites data aided in identification of genes that associate with azadirachtin biosynthesis pathways. The comprehensive resources from our study will help to unlock the biochemical pathways in neem.</p>
</sec>
<sec><title>Conclusion</title>
<p>Neem is an important tropical evergreen tree in India, used for centuries in agriculture and traditional medicine. In this study, we report the detailed analysis of neem genomes, transcriptomes and metabolites. We identified possible candidate genes involved in azadirachtin biosynthesis pathways. Genomic resources such as sequence of genomes, genes, transcripts, SSRs, SNPs and InDels from this study will have a profound application to study diversity, traits association, with biopesticidal properties and biochemical pathways in neem and other species of Melieacea family.</p>
</sec>
<sec sec-type="supplementary-material" id="supplemental-information"><title>Supplemental Information</title>
<supplementary-material content-type="local-data" id="supp-1"><object-id pub-id-type="doi">10.7717/peerj.1066/supp-1</object-id>
<label>Table S1</label>
<caption><title>Genome assembly (version 01) and annotation of neem genotypes using Illumina PE data</title>
</caption>
<media xlink:href="peerj-03-1066-s001.xls"><caption><p>Click here for additional data file.</p>
</caption>
</media>
</supplementary-material>
<supplementary-material content-type="local-data" id="supp-2"><object-id pub-id-type="doi">10.7717/peerj.1066/supp-2</object-id>
<label>Table S2</label>
<caption><title>Gene prediction statistics using Augustus and Genscan for Genoytpe 1 hybrid genome assembly (version 2)</title>
</caption>
<media xlink:href="peerj-03-1066-s002.xls"><caption><p>Click here for additional data file.</p>
</caption>
</media>
</supplementary-material>
<supplementary-material content-type="local-data" id="supp-3"><object-id pub-id-type="doi">10.7717/peerj.1066/supp-3</object-id>
<label>Table S3</label>
<caption><title>Neem genome annotation</title>
</caption>
<media xlink:href="peerj-03-1066-s003.xls"><caption><p>Click here for additional data file.</p>
</caption>
</media>
</supplementary-material>
<supplementary-material content-type="local-data" id="supp-4"><object-id pub-id-type="doi">10.7717/peerj.1066/supp-4</object-id>
<label>Table S4</label>
<caption><title>Comparison of de-novo repeats prediction in <italic>A. indica</italic>
 with <italic>P. trichocarpa</italic>
, <italic>R. communis</italic>
 and <italic>A. thaliana</italic>
 using RepeatModeler program</title>
</caption>
<media xlink:href="peerj-03-1066-s004.xls"><caption><p>Click here for additional data file.</p>
</caption>
</media>
</supplementary-material>
<supplementary-material content-type="local-data" id="supp-5"><object-id pub-id-type="doi">10.7717/peerj.1066/supp-5</object-id>
<label>Table S5</label>
<caption><title>Assembly statistics of re-analysed Krishnan and co-workers data (<ext-link ext-link-type="NCBI:sra" xlink:href="SRP013453">SRP013453</ext-link>
)</title>
</caption>
<media xlink:href="peerj-03-1066-s005.xls"><caption><p>Click here for additional data file.</p>
</caption>
</media>
</supplementary-material>
<supplementary-material content-type="local-data" id="supp-6"><object-id pub-id-type="doi">10.7717/peerj.1066/supp-6</object-id>
<label>Table S6</label>
<caption><title>List of primers used in qPCR analysis</title>
</caption>
<media xlink:href="peerj-03-1066-s006.xls"><caption><p>Click here for additional data file.</p>
</caption>
</media>
</supplementary-material>
<supplementary-material content-type="local-data" id="supp-7"><object-id pub-id-type="doi">10.7717/peerj.1066/supp-7</object-id>
<label>Table S7</label>
<caption><title>Metabolite quantity measurement in different explants of neem tree (Genotype 1)</title>
</caption>
<media xlink:href="peerj-03-1066-s007.xls"><caption><p>Click here for additional data file.</p>
</caption>
</media>
</supplementary-material>
<supplementary-material content-type="local-data" id="supp-8"><object-id pub-id-type="doi">10.7717/peerj.1066/supp-8</object-id>
<label>Table S8</label>
<caption><title>List of genes with very high expression in developing endosperm of neem and also with high correlation with nimbin content in the tissues under study</title>
</caption>
<media xlink:href="peerj-03-1066-s008.xls"><caption><p>Click here for additional data file.</p>
</caption>
</media>
</supplementary-material>
<supplementary-material content-type="local-data" id="supp-9"><object-id pub-id-type="doi">10.7717/peerj.1066/supp-9</object-id>
<label>Table S9</label>
<caption><title>Multigene family in the neem genome</title>
</caption>
<media xlink:href="peerj-03-1066-s009.xls"><caption><p>Click here for additional data file.</p>
</caption>
</media>
</supplementary-material>
<supplementary-material content-type="local-data" id="supp-10"><object-id pub-id-type="doi">10.7717/peerj.1066/supp-10</object-id>
<label>Table S10</label>
<caption><title>Ancestral orthogroup reconstruction wagner parsimony using equal gain-loss penalty, as implemented in the program count</title>
</caption>
<media xlink:href="peerj-03-1066-s010.xls"><caption><p>Click here for additional data file.</p>
</caption>
</media>
</supplementary-material>
<supplementary-material content-type="local-data" id="supp-11"><object-id pub-id-type="doi">10.7717/peerj.1066/supp-11</object-id>
<label>Table S11</label>
<caption><title>Ancestral orthogroup reconstruction using a birth–death model that allows for lineage specific gain/loss rates, as implemented in the program count</title>
</caption>
<media xlink:href="peerj-03-1066-s011.xls"><caption><p>Click here for additional data file.</p>
</caption>
</media>
</supplementary-material>
<supplementary-material content-type="local-data" id="supp-12"><object-id pub-id-type="doi">10.7717/peerj.1066/supp-12</object-id>
<label>Table S12</label>
<caption><title>Genes unique to neem with expression and without repeat elements</title>
</caption>
<media xlink:href="peerj-03-1066-s012.xls"><caption><p>Click here for additional data file.</p>
</caption>
</media>
</supplementary-material>
<supplementary-material content-type="local-data" id="supp-13"><object-id pub-id-type="doi">10.7717/peerj.1066/supp-13</object-id>
<label>Table S13</label>
<caption><title>Description of repeats prediction in <italic>A. indica</italic>
 using RepeatModeler program</title>
</caption>
<media xlink:href="peerj-03-1066-s013.xls"><caption><p>Click here for additional data file.</p>
</caption>
</media>
</supplementary-material>
<supplementary-material content-type="local-data" id="supp-14"><object-id pub-id-type="doi">10.7717/peerj.1066/supp-14</object-id>
<label>Table S14</label>
<caption><title>Simple Sequence Repeats (SSRs) prediction for genomes and genes from 3 Genotypes of <italic>A. indica</italic>
 and their comparison</title>
</caption>
<media xlink:href="peerj-03-1066-s014.xls"><caption><p>Click here for additional data file.</p>
</caption>
</media>
</supplementary-material>
<supplementary-material content-type="local-data" id="supp-15"><object-id pub-id-type="doi">10.7717/peerj.1066/supp-15</object-id>
<label>Table S15</label>
<caption><title>The list of SSRs in genes of neem Genotype 1</title>
</caption>
<media xlink:href="peerj-03-1066-s015.xls"><caption><p>Click here for additional data file.</p>
</caption>
</media>
</supplementary-material>
<supplementary-material content-type="local-data" id="supp-16"><object-id pub-id-type="doi">10.7717/peerj.1066/supp-16</object-id>
<label>Table S16</label>
<caption><title>The list of polymorphic SSRs among 3 genotypes</title>
</caption>
<media xlink:href="peerj-03-1066-s016.xls"><caption><p>Click here for additional data file.</p>
</caption>
</media>
</supplementary-material>
<supplementary-material content-type="local-data" id="supp-17"><object-id pub-id-type="doi">10.7717/peerj.1066/supp-17</object-id>
<label>Table S17</label>
<caption><title>Summary of SNP annotation for Genotype 2 and Genotype 3 by using Genotype 1 as a reference</title>
</caption>
<media xlink:href="peerj-03-1066-s017.xls"><caption><p>Click here for additional data file.</p>
</caption>
</media>
</supplementary-material>
<supplementary-material content-type="local-data" id="supp-18"><object-id pub-id-type="doi">10.7717/peerj.1066/supp-18</object-id>
<label>Table S18</label>
<caption><title>Genes annotated from neem chloroplast</title>
</caption>
<media xlink:href="peerj-03-1066-s018.xls"><caption><p>Click here for additional data file.</p>
</caption>
</media>
</supplementary-material>
<supplementary-material content-type="local-data" id="supp-19"><object-id pub-id-type="doi">10.7717/peerj.1066/supp-19</object-id>
<label>Table S19</label>
<caption><title>Genes annotated from neem mitochondria</title>
</caption>
<media xlink:href="peerj-03-1066-s019.xls"><caption><p>Click here for additional data file.</p>
</caption>
</media>
</supplementary-material>
<supplementary-material content-type="local-data" id="supp-20"><object-id pub-id-type="doi">10.7717/peerj.1066/supp-20</object-id>
<label>Table S20</label>
<caption><title>Neem secondary metabolites biosynthesis genes</title>
</caption>
<media xlink:href="peerj-03-1066-s020.xls"><caption><p>Click here for additional data file.</p>
</caption>
</media>
</supplementary-material>
<supplementary-material content-type="local-data" id="supp-21"><object-id pub-id-type="doi">10.7717/peerj.1066/supp-21</object-id>
<label>Figure S1</label>
<caption><title>Work flow of sequencing and genome assembly of neem genome</title>
</caption>
<media xlink:href="peerj-03-1066-s021.pdf"><caption><p>Click here for additional data file.</p>
</caption>
</media>
</supplementary-material>
<supplementary-material content-type="local-data" id="supp-22"><object-id pub-id-type="doi">10.7717/peerj.1066/supp-22</object-id>
<label>Figure S2</label>
<caption><title>Schematic representations of GO classes in neem genome</title>
</caption>
<media xlink:href="peerj-03-1066-s022.pdf"><caption><p>Click here for additional data file.</p>
</caption>
</media>
</supplementary-material>
</sec>
</body>
<back><ack><p>We acknowledge Genomics facility (BT/PR3481/INF/22/140/2011) at Centre for Cellular and Molecular Platforms, Bangalore for sequencing of Neem genomes. We acknowledge Pradeep H, Aarati Karaba, Manojkumar S and Annapurna for their help in NGS library preparation and sequencing. We thank Ashmita G and Divya S for their help in manual curation of SSR markers. We are grateful to Rajanna, National Botanical Garden, University of Agricultural Sciences, GKVK campus, Bangalore for his help during neem sample collection.</p>
</ack>
<glossary content-type="abbreviations" id="glossary-1"><title>Abbreviations</title>
<def-list><def-item><term>nts</term>
<def><p>Nucleotides</p>
</def>
</def-item>
<def-item><term>UHPLC-MS/SRM</term>
<def><p>Ultrahigh performance liquid chromatography-mass spectrometry-selected reaction monitoring</p>
</def>
</def-item>
<def-item><term>Kb</term>
<def><p>Kilobases</p>
</def>
</def-item>
<def-item><term>RPKM</term>
<def><p>reads per kilo base per million</p>
</def>
</def-item>
<def-item><term>WGCNA</term>
<def><p>Weighted Correlation Network Analysis</p>
</def>
</def-item>
<def-item><term>SNP</term>
<def><p>Single nucleotide polymorphism</p>
</def>
</def-item>
<def-item><term>SSR</term>
<def><p>Simple sequence repeat</p>
</def>
</def-item>
</def-list>
</glossary>
<sec sec-type="additional-information"><title>Additional Information and Declarations</title>
<fn-group content-type="competing-interests"><title>Competing Interests</title>
<fn id="conflict-1" fn-type="conflict"><p>The authors declare there are no competing interests.</p>
</fn>
</fn-group>
<fn-group content-type="author-contributions"><title>Author Contributions</title>
<fn id="contribution-1" fn-type="con"><p><xref ref-type="contrib" rid="author-1">Nagesh A. Kuravadi</xref>
 and <xref ref-type="contrib" rid="author-2">Vijay Yenagi</xref>
 performed the experiments, analyzed the data, wrote the paper, prepared figures and/or tables.</p>
</fn>
<fn id="contribution-2" fn-type="con"><p><xref ref-type="contrib" rid="author-3">Kannan Rangiah</xref>
 performed the experiments, analyzed the data, wrote the paper, prepared figures and/or tables, reviewed drafts of the paper.</p>
</fn>
<fn id="contribution-3" fn-type="con"><p><xref ref-type="contrib" rid="author-4">HB Mahesh</xref>
 performed the experiments, analyzed the data, wrote the paper.</p>
</fn>
<fn id="contribution-4" fn-type="con"><p><xref ref-type="contrib" rid="author-5">Anantharamanan Rajamani</xref>
 and <xref ref-type="contrib" rid="author-6">Meghana D. Shirke</xref>
 analyzed the data, wrote the paper.</p>
</fn>
<fn id="contribution-5" fn-type="con"><p><xref ref-type="contrib" rid="author-7">Heikham Russiachand</xref>
 analyzed the data, prepared figures and/or tables.</p>
</fn>
<fn id="contribution-6" fn-type="con"><p><xref ref-type="contrib" rid="author-8">Ramya Malarini Loganathan</xref>
, <xref ref-type="contrib" rid="author-9">Chandana Shankara Lingu</xref>
, <xref ref-type="contrib" rid="author-10">Shilpa Siddappa</xref>
 and <xref ref-type="contrib" rid="author-11">Aishwarya Ramamurthy</xref>
 performed the experiments.</p>
</fn>
<fn id="contribution-7" fn-type="con"><p><xref ref-type="contrib" rid="author-12">BN Sathyanarayana</xref>
 performed the experiments, contributed reagents/materials/analysis tools, reviewed drafts of the paper.</p>
</fn>
<fn id="contribution-8" fn-type="con"><p><xref ref-type="contrib" rid="author-13">Malali Gowda</xref>
 conceived and designed the experiments, contributed reagents/materials/analysis tools, wrote the paper, prepared figures and/or tables, reviewed drafts of the paper.</p>
</fn>
</fn-group>
<fn-group content-type="other"><title>DNA Deposition</title>
<fn id="addinfo-1" fn-type="other"><p>The following information was supplied regarding the deposition of DNA sequences:</p>
<p>1. Whole genome of neem deposited in NCBI accession number <ext-link ext-link-type="DDBJ/EMBL/GenBank" xlink:href="AMWY00000000.1">AMWY00000000.1</ext-link>
 (Bioproject ID: <ext-link ext-link-type="uri" xlink:href="http://www.ncbi.nlm.nih.gov/bioproject/?term=PRJNA176672">PRJNA 176672</ext-link>
)</p>
<p>2. Raw reads deposited in sequence read archive database (<ext-link ext-link-type="NCBI:sra" xlink:href="SRP052002">SRP052002</ext-link>
).</p>
</fn>
</fn-group>
</sec>
<ref-list content-type="authoryear"><title>References</title>
<ref id="ref-1"><label>Aboyoun, Pages & Lawrence (2010)</label>
<element-citation publication-type="software"><person-group person-group-type="author"><name><surname>Aboyoun</surname>
<given-names>P</given-names>
</name>
<name><surname>Pages</surname>
<given-names>H</given-names>
</name>
<name><surname>Lawrence</surname>
<given-names>M</given-names>
</name>
</person-group>
<source>GenomicRanges: representation and manipulation of genomic intervals</source>
<edition designator="1">R package version 1</edition>
<year>2010</year>
<fpage>1</fpage>
<lpage>25</lpage>
</element-citation>
</ref>
<ref id="ref-2"><label>Aerts & Mordue (1997)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Aerts</surname>
<given-names>RJ</given-names>
</name>
<name><surname>Mordue</surname>
<given-names>AJ</given-names>
</name>
</person-group>
<article-title>Feeding deterrence and toxicity of neem triterpenoids</article-title>
<source>Journal of Chemical Ecology</source>
<year>1997</year>
<volume>23</volume>
<fpage>2117</fpage>
<lpage>2132</lpage>
<pub-id pub-id-type="doi">10.1023/B:JOEC.0000006433.14030.04</pub-id>
</element-citation>
</ref>
<ref id="ref-3"><label>Altschul et al. (1990)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Altschul</surname>
<given-names>SF</given-names>
</name>
<name><surname>Gish</surname>
<given-names>W</given-names>
</name>
<name><surname>Miller</surname>
<given-names>W</given-names>
</name>
<name><surname>Myers</surname>
<given-names>EW</given-names>
</name>
<name><surname>Lipman</surname>
<given-names>DJ</given-names>
</name>
</person-group>
<article-title>Basic local alignment search tool</article-title>
<source>Journal of Molecular Biology</source>
<year>1990</year>
<volume>215</volume>
<fpage>403</fpage>
<lpage>410</lpage>
<pub-id pub-id-type="doi">10.1016/S0022-2836(05)80360-2</pub-id>
<pub-id pub-id-type="pmid">2231712</pub-id>
</element-citation>
</ref>
<ref id="ref-4"><label>Alverson et al. (2010)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Alverson</surname>
<given-names>AJ</given-names>
</name>
<name><surname>Wei</surname>
<given-names>X</given-names>
</name>
<name><surname>Rice</surname>
<given-names>DW</given-names>
</name>
<name><surname>Stern</surname>
<given-names>DB</given-names>
</name>
<name><surname>Barry</surname>
<given-names>K</given-names>
</name>
<name><surname>Palmer</surname>
<given-names>JD</given-names>
</name>
</person-group>
<article-title>Insights into the evolution of mitochondrial genome size from complete sequences of <italic>Citrullus lanatus</italic>
 and <italic>Cucurbita pepo</italic>
 (Cucurbitaceae)</article-title>
<source>Molecular Biology and Evolution</source>
<year>2010</year>
<volume>27</volume>
<fpage>1436</fpage>
<lpage>1448</lpage>
<pub-id pub-id-type="doi">10.1093/molbev/msq029</pub-id>
<pub-id pub-id-type="pmid">20118192</pub-id>
</element-citation>
</ref>
<ref id="ref-5"><label>Boontong, Pandey & Changtragoon (2009)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Boontong</surname>
<given-names>C</given-names>
</name>
<name><surname>Pandey</surname>
<given-names>M</given-names>
</name>
<name><surname>Changtragoon</surname>
<given-names>S</given-names>
</name>
</person-group>
<article-title>Isolation and characterization of microsatellite markers in Indian neem (<italic>Azadirachta indica var. indica A. Juss</italic>
) and cross-amplification in Thai neem (<italic>A. indica var. siamensis Valenton</italic>
)</article-title>
<source>Conservation Genetics</source>
<year>2009</year>
<volume>10</volume>
<fpage>669</fpage>
<lpage>671</lpage>
<pub-id pub-id-type="doi">10.1007/s10592-008-9610-5</pub-id>
</element-citation>
</ref>
<ref id="ref-6"><label>Brahmachari (2004)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Brahmachari</surname>
<given-names>G</given-names>
</name>
</person-group>
<article-title>Neem-an omnipotent plant: a retrospection</article-title>
<source>Chembiochem</source>
<year>2004</year>
<volume>5</volume>
<fpage>408</fpage>
<lpage>421</lpage>
<pub-id pub-id-type="doi">10.1002/cbic.200300749</pub-id>
<pub-id pub-id-type="pmid">15185362</pub-id>
</element-citation>
</ref>
<ref id="ref-7"><label>Broughton et al. (1986)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Broughton</surname>
<given-names>HB</given-names>
</name>
<name><surname>Ley</surname>
<given-names>SV</given-names>
</name>
<name><surname>Slawin</surname>
<given-names>AMZ</given-names>
</name>
<name><surname>Williams</surname>
<given-names>DJJ</given-names>
</name>
</person-group>
<article-title>X-ray crystallographic structure determination of detigloyldihydroazadirachtin and reassignment of the structure of the limonoid insect antifeedant azadirachtin</article-title>
<source>Journal of the Chemical Society. Chemical Communications</source>
<year>1986</year>
<volume>1</volume>
<fpage>46</fpage>
<lpage>47</lpage>
<pub-id pub-id-type="doi">10.1039/c39860000046</pub-id>
</element-citation>
</ref>
<ref id="ref-8"><label>Burge & Karlin (1997)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Burge</surname>
<given-names>C</given-names>
</name>
<name><surname>Karlin</surname>
<given-names>S</given-names>
</name>
</person-group>
<article-title>Prediction of complete gene structures in human genomic DNA</article-title>
<source>Journal of Molecular Biology</source>
<year>1997</year>
<volume>268</volume>
<fpage>78</fpage>
<lpage>94</lpage>
<pub-id pub-id-type="doi">10.1006/jmbi.1997.0951</pub-id>
<pub-id pub-id-type="pmid">9149143</pub-id>
</element-citation>
</ref>
<ref id="ref-9"><label>Butterworth & Morgan (1968)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Butterworth</surname>
<given-names>JH</given-names>
</name>
<name><surname>Morgan</surname>
<given-names>ED</given-names>
</name>
</person-group>
<article-title>Isolation of a substance that suppresses feeding in locusts</article-title>
<source>Chemical Communications (London)</source>
<issue>1</issue>
<year>1968</year>
<fpage>23</fpage>
<lpage>24</lpage>
<pub-id pub-id-type="doi">10.1039/c19680000023</pub-id>
</element-citation>
</ref>
<ref id="ref-10"><label>Chevreux (2005)</label>
<element-citation publication-type="book"><person-group person-group-type="author"><name><surname>Chevreux</surname>
<given-names>B</given-names>
</name>
</person-group>
<source>MIRA: an automated genome and EST assembler</source>
<year>2005</year>
<publisher-loc>Heidelberg</publisher-loc>
<publisher-name>Ruprecht-Karls University</publisher-name>
</element-citation>
</ref>
<ref id="ref-11"><label>Cingolani et al. (2012)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Cingolani</surname>
<given-names>P</given-names>
</name>
<name><surname>Platts</surname>
<given-names>A</given-names>
</name>
<name><surname>Coon</surname>
<given-names>M</given-names>
</name>
<name><surname>Nguyen</surname>
<given-names>T</given-names>
</name>
<name><surname>Wang</surname>
<given-names>L</given-names>
</name>
<name><surname>Land</surname>
<given-names>SJ</given-names>
</name>
<name><surname>Lu</surname>
<given-names>X</given-names>
</name>
<name><surname>Ruden</surname>
<given-names>DM</given-names>
</name>
</person-group>
<article-title>A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of <italic>Drosophila melanogaster</italic>
 strain w1118; iso-2; iso-3</article-title>
<source>Fly</source>
<year>2012</year>
<volume>6</volume>
<fpage>80</fpage>
<lpage>92</lpage>
<pub-id pub-id-type="doi">10.4161/fly.19695</pub-id>
<pub-id pub-id-type="pmid">22728672</pub-id>
</element-citation>
</ref>
<ref id="ref-12"><label>Csűös (2010)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Csűös</surname>
<given-names>M</given-names>
</name>
</person-group>
<article-title>Count: evolutionary analysis of phylogenetic profiles with parsimony and likelihood</article-title>
<source>Bioinformatics</source>
<year>2010</year>
<volume>26</volume>
<fpage>1910</fpage>
<lpage>1912</lpage>
<pub-id pub-id-type="doi">10.1093/bioinformatics/btq315</pub-id>
<pub-id pub-id-type="pmid">20551134</pub-id>
</element-citation>
</ref>
<ref id="ref-13"><label>Delcher et al. (1999)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Delcher</surname>
<given-names>AL</given-names>
</name>
<name><surname>Harmon</surname>
<given-names>D</given-names>
</name>
<name><surname>Kasif</surname>
<given-names>S</given-names>
</name>
<name><surname>White</surname>
<given-names>O</given-names>
</name>
<name><surname>Salzberg</surname>
<given-names>SL</given-names>
</name>
</person-group>
<article-title>Improved microbial gene identification with GLIMMER</article-title>
<source>Nucleic Acids Research</source>
<year>1999</year>
<volume>27</volume>
<fpage>4636</fpage>
<lpage>4641</lpage>
<pub-id pub-id-type="doi">10.1093/nar/27.23.4636</pub-id>
<pub-id pub-id-type="pmid">10556321</pub-id>
</element-citation>
</ref>
<ref id="ref-14"><label>Dewick (2011)</label>
<element-citation publication-type="book"><person-group person-group-type="author"><name><surname>Dewick</surname>
<given-names>PM</given-names>
</name>
</person-group>
<source>Medicinal natural products: a biosynthetic approach</source>
<year>2011</year>
<publisher-loc>New York</publisher-loc>
<publisher-name>John Wiley & Sons</publisher-name>
</element-citation>
</ref>
<ref id="ref-15"><label>Doyle (1990)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Doyle</surname>
<given-names>JJ</given-names>
</name>
</person-group>
<article-title>Isolation of plant DNA from fresh tissue</article-title>
<source>Focus</source>
<year>1990</year>
<volume>12</volume>
<fpage>13</fpage>
<lpage>15</lpage>
</element-citation>
</ref>
<ref id="ref-16"><label>Drożdżyński & Kowalska (2009)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Drożdżyński</surname>
<given-names>D</given-names>
</name>
<name><surname>Kowalska</surname>
<given-names>J</given-names>
</name>
</person-group>
<article-title>Rapid analysis of organic farming insecticides in soil and produce using ultra-performance liquid chromatography/tandem mass spectrometry</article-title>
<source>Analytical and Bioanalytical Chemistry</source>
<year>2009</year>
<volume>394</volume>
<fpage>2241</fpage>
<lpage>2247</lpage>
<pub-id pub-id-type="doi">10.1007/s00216-009-2931-5</pub-id>
<pub-id pub-id-type="pmid">19579019</pub-id>
</element-citation>
</ref>
<ref id="ref-17"><label>Grabherr et al. (2011)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Grabherr</surname>
<given-names>MG</given-names>
</name>
<name><surname>Haas</surname>
<given-names>BJ</given-names>
</name>
<name><surname>Yassour</surname>
<given-names>M</given-names>
</name>
<name><surname>Levin</surname>
<given-names>JZ</given-names>
</name>
<name><surname>Thompson</surname>
<given-names>DA</given-names>
</name>
<name><surname>Amit</surname>
<given-names>I</given-names>
</name>
<name><surname>Adiconis</surname>
<given-names>X</given-names>
</name>
<name><surname>Fan</surname>
<given-names>L</given-names>
</name>
<name><surname>Raychowdhury</surname>
<given-names>R</given-names>
</name>
<name><surname>Zeng</surname>
<given-names>Q</given-names>
</name>
</person-group>
<article-title>Full-length transcriptome assembly from RNA-Seq data without a reference genome</article-title>
<source>Nature Biotechnology</source>
<year>2011</year>
<volume>29</volume>
<fpage>644</fpage>
<lpage>652</lpage>
<pub-id pub-id-type="doi">10.1038/nbt.1883</pub-id>
</element-citation>
</ref>
<ref id="ref-18"><label>Grimalt et al. (2011)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Grimalt</surname>
<given-names>S</given-names>
</name>
<name><surname>Thompson</surname>
<given-names>DG</given-names>
</name>
<name><surname>Coppens</surname>
<given-names>M</given-names>
</name>
<name><surname>Chartrand</surname>
<given-names>DT</given-names>
</name>
<name><surname>Shorney</surname>
<given-names>T</given-names>
</name>
<name><surname>Meating</surname>
<given-names>J</given-names>
</name>
<name><surname>Scarr</surname>
<given-names>T</given-names>
</name>
</person-group>
<article-title>Analytical study of azadirachtin and 3-tigloylazadirachtol residues in foliage and phloem of hardwood tree species by liquid chromatography–electrospray mass spectrometry</article-title>
<source>Journal of Agricultural and Food Chemistry</source>
<year>2011</year>
<volume>59</volume>
<fpage>8070</fpage>
<lpage>8077</lpage>
<pub-id pub-id-type="doi">10.1021/jf2023947</pub-id>
<pub-id pub-id-type="pmid">21726086</pub-id>
</element-citation>
</ref>
<ref id="ref-19"><label>Heasley (2011)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Heasley</surname>
<given-names>B</given-names>
</name>
</person-group>
<article-title>Synthesis of limonoid natural products</article-title>
<source>European Journal of Organic Chemistry</source>
<year>2011</year>
<volume>2011</volume>
<fpage>19</fpage>
<lpage>46</lpage>
<pub-id pub-id-type="doi">10.1002/ejoc.201001218</pub-id>
</element-citation>
</ref>
<ref id="ref-20"><label>Hosfelt (2008)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hosfelt</surname>
<given-names>J</given-names>
</name>
</person-group>
<article-title>Azadirachtin</article-title>
<source>Chemistry</source>
<year>2008</year>
<volume>150</volume>
<comment><italic>Available at <uri xlink:href="http://chemgroups.ucdavis.edu/~shaw/CHE_150_2008/DHC-Website/Azadirachtin_HosfeltJ.pdf">http://chemgroups.ucdavis.edu/~shaw/CHE_150_2008/DHC-Website/Azadirachtin_HosfeltJ.pdf</uri>
</italic>
</comment>
</element-citation>
</ref>
<ref id="ref-21"><label>Jiang & Wong (2008)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Jiang</surname>
<given-names>H</given-names>
</name>
<name><surname>Wong</surname>
<given-names>WH</given-names>
</name>
</person-group>
<article-title>SeqMap: mapping massive amount of oligonucleotides to the genome</article-title>
<source>Bioinformatics</source>
<year>2008</year>
<volume>24</volume>
<fpage>2395</fpage>
<lpage>2396</lpage>
<pub-id pub-id-type="doi">10.1093/bioinformatics/btn429</pub-id>
<pub-id pub-id-type="pmid">18697769</pub-id>
</element-citation>
</ref>
<ref id="ref-22"><label>Jiang & Wong (2009)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Jiang</surname>
<given-names>H</given-names>
</name>
<name><surname>Wong</surname>
<given-names>WH</given-names>
</name>
</person-group>
<article-title>Statistical inferences for isoform expression in RNA-Seq</article-title>
<source>Bioinformatics</source>
<year>2009</year>
<volume>25</volume>
<fpage>1026</fpage>
<lpage>1032</lpage>
<pub-id pub-id-type="doi">10.1093/bioinformatics/btp113</pub-id>
<pub-id pub-id-type="pmid">19244387</pub-id>
</element-citation>
</ref>
<ref id="ref-23"><label>Johnson, Morgan & Peiris (1996)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Johnson</surname>
<given-names>S</given-names>
</name>
<name><surname>Morgan</surname>
<given-names>ED</given-names>
</name>
<name><surname>Peiris</surname>
<given-names>CN</given-names>
</name>
</person-group>
<article-title>Development of the major triterpenoids and oil in the fruit and seeds of neem (<italic>Azadirachta indica</italic>
)</article-title>
<source>Annals of Botany</source>
<year>1996</year>
<volume>78</volume>
<fpage>383</fpage>
<lpage>388</lpage>
<pub-id pub-id-type="doi">10.1006/anbo.1996.0133</pub-id>
</element-citation>
</ref>
<ref id="ref-24"><label>Kawahara et al. (2013)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Kawahara</surname>
<given-names>Y</given-names>
</name>
<name><surname>de la Bastide</surname>
<given-names>M</given-names>
</name>
<name><surname>Hamilton</surname>
<given-names>JP</given-names>
</name>
<name><surname>Kanamori</surname>
<given-names>H</given-names>
</name>
<name><surname>McCombie</surname>
<given-names>WR</given-names>
</name>
<name><surname>Ouyang</surname>
<given-names>S</given-names>
</name>
<name><surname>Schwartz</surname>
<given-names>DC</given-names>
</name>
<name><surname>Tanaka</surname>
<given-names>T</given-names>
</name>
<name><surname>Wu</surname>
<given-names>J</given-names>
</name>
<name><surname>Zhou</surname>
<given-names>S</given-names>
</name>
<name><surname>Childs</surname>
<given-names>KL</given-names>
</name>
<name><surname>Davidson</surname>
<given-names>RM</given-names>
</name>
<name><surname>Lin</surname>
<given-names>H</given-names>
</name>
<name><surname>Quesada-Ocampo</surname>
<given-names>L</given-names>
</name>
<name><surname>Vaillancourt</surname>
<given-names>B</given-names>
</name>
<name><surname>Sakai</surname>
<given-names>H</given-names>
</name>
<name><surname>Lee</surname>
<given-names>SS</given-names>
</name>
<name><surname>Kim</surname>
<given-names>J</given-names>
</name>
<name><surname>Numa</surname>
<given-names>H</given-names>
</name>
<name><surname>Itoh</surname>
<given-names>T</given-names>
</name>
<name><surname>Buell</surname>
<given-names>CR</given-names>
</name>
<name><surname>Matsumoto</surname>
<given-names>T</given-names>
</name>
</person-group>
<article-title>Improvement of the <italic>Oryza sativa</italic>
 Nipponbare reference genome using next generation sequence and optical map data</article-title>
<source>Rice</source>
<issue>1</issue>
<year>2013</year>
<volume>6</volume>
<fpage>4</fpage>
<pub-id pub-id-type="doi">10.1186/1939-8433-6-4</pub-id>
<pub-id pub-id-type="pmid">24280374</pub-id>
</element-citation>
</ref>
<ref id="ref-25"><label>Kent (2002)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Kent</surname>
<given-names>WJ</given-names>
</name>
</person-group>
<article-title>BLAT—the BLAST-like alignment tool</article-title>
<source>Genome Research</source>
<year>2002</year>
<volume>12</volume>
<fpage>656</fpage>
<lpage>664</lpage>
<pub-id pub-id-type="doi">10.1101/gr.229202</pub-id>
<pub-id pub-id-type="pmid">11932250</pub-id>
</element-citation>
</ref>
<ref id="ref-26"><label>Krishnan et al. (2012)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Krishnan</surname>
<given-names>N</given-names>
</name>
<name><surname>Pattnaik</surname>
<given-names>S</given-names>
</name>
<name><surname>Jain</surname>
<given-names>P</given-names>
</name>
<name><surname>Gaur</surname>
<given-names>P</given-names>
</name>
<name><surname>Choudhary</surname>
<given-names>R</given-names>
</name>
<name><surname>Vaidyanathan</surname>
<given-names>S</given-names>
</name>
<name><surname>Deepak</surname>
<given-names>S</given-names>
</name>
<name><surname>Hariharan</surname>
<given-names>A</given-names>
</name>
<name><surname>Krishna</surname>
<given-names>PG</given-names>
</name>
<name><surname>Nair</surname>
<given-names>J</given-names>
</name>
</person-group>
<article-title>A draft of the genome and four transcriptomes of a medicinal and pesticidal angiosperm <italic>Azadirachta indica</italic>
</article-title>
<source>BMC Genomics</source>
<year>2012</year>
<volume>13</volume>
<fpage>464</fpage>
<pub-id pub-id-type="doi">10.1186/1471-2164-13-464</pub-id>
<pub-id pub-id-type="pmid">22958331</pub-id>
</element-citation>
</ref>
<ref id="ref-27"><label>Kurtz et al. (2004)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Kurtz</surname>
<given-names>S</given-names>
</name>
<name><surname>Phillippy</surname>
<given-names>A</given-names>
</name>
<name><surname>Delcher</surname>
<given-names>A</given-names>
</name>
<name><surname>Smoot</surname>
<given-names>M</given-names>
</name>
<name><surname>Shumway</surname>
<given-names>M</given-names>
</name>
<name><surname>Antonescu</surname>
<given-names>C</given-names>
</name>
<name><surname>Salzberg</surname>
<given-names>S</given-names>
</name>
</person-group>
<article-title>Versatile and open software for comparing large genomes</article-title>
<source>Genome Biology</source>
<issue>2</issue>
<year>2004</year>
<volume>5</volume>
<elocation-id>e1066</elocation-id>
<pub-id pub-id-type="doi">10.1186/gb-2004-5-2-r12</pub-id>
</element-citation>
</ref>
<ref id="ref-28"><label>Langfelder & Horvath (2008)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Langfelder</surname>
<given-names>P</given-names>
</name>
<name><surname>Horvath</surname>
<given-names>S</given-names>
</name>
</person-group>
<article-title>WGCNA: an R package for weighted correlation network analysis</article-title>
<source>BMC Bioinformatics</source>
<year>2008</year>
<volume>9</volume>
<fpage>559</fpage>
<pub-id pub-id-type="doi">10.1186/1471-2105-9-559</pub-id>
<pub-id pub-id-type="pmid">19114008</pub-id>
</element-citation>
</ref>
<ref id="ref-29"><label>Langmead & Salzberg (2012)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Langmead</surname>
<given-names>B</given-names>
</name>
<name><surname>Salzberg</surname>
<given-names>SL</given-names>
</name>
</person-group>
<article-title>Fast gapped-read alignment with Bowtie 2</article-title>
<source>Nature Methods</source>
<year>2012</year>
<volume>9</volume>
<fpage>357</fpage>
<lpage>359</lpage>
<pub-id pub-id-type="doi">10.1038/nmeth.1923</pub-id>
<pub-id pub-id-type="pmid">22388286</pub-id>
</element-citation>
</ref>
<ref id="ref-30"><label>Lechner et al. (2011)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Lechner</surname>
<given-names>M</given-names>
</name>
<name><surname>FindeiÃŸ</surname>
<given-names>S</given-names>
</name>
<name><surname>Steiner</surname>
<given-names>L</given-names>
</name>
<name><surname>Marz</surname>
<given-names>M</given-names>
</name>
<name><surname>Stadler</surname>
<given-names>PF</given-names>
</name>
<name><surname>Prohaska</surname>
<given-names>SJ</given-names>
</name>
</person-group>
<article-title>Proteinortho: detection of (Co-) orthologs in large-scale analysis</article-title>
<source>BMC Bioinformatics</source>
<year>2011</year>
<volume>12</volume>
<fpage>124</fpage>
<pub-id pub-id-type="doi">10.1186/1471-2105-12-124</pub-id>
<pub-id pub-id-type="pmid">21526987</pub-id>
</element-citation>
</ref>
<ref id="ref-31"><label>Ley, Denholm & Wood (1993)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ley</surname>
<given-names>SV</given-names>
</name>
<name><surname>Denholm</surname>
<given-names>AA</given-names>
</name>
<name><surname>Wood</surname>
<given-names>A</given-names>
</name>
</person-group>
<article-title>The chemistry of azadirachtin</article-title>
<source>Natural Products Reports</source>
<year>1993</year>
<volume>10</volume>
<fpage>109</fpage>
<lpage>157</lpage>
<pub-id pub-id-type="doi">10.1039/np9931000109</pub-id>
</element-citation>
</ref>
<ref id="ref-32"><label>Li et al. (2009)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Li</surname>
<given-names>H</given-names>
</name>
<name><surname>Handsaker</surname>
<given-names>B</given-names>
</name>
<name><surname>Wysoker</surname>
<given-names>A</given-names>
</name>
<name><surname>Fennell</surname>
<given-names>T</given-names>
</name>
<name><surname>Ruan</surname>
<given-names>J</given-names>
</name>
<name><surname>Homer</surname>
<given-names>N</given-names>
</name>
<name><surname>Marth</surname>
<given-names>G</given-names>
</name>
<name><surname>Abecasis</surname>
<given-names>G</given-names>
</name>
<name><surname>Durbin</surname>
<given-names>R</given-names>
</name>
<collab>Genome Project Data Processing S</collab>
</person-group>
<article-title>The sequence alignment/map format and SAMtools</article-title>
<source>Bioinformatics</source>
<year>2009</year>
<volume>25</volume>
<fpage>2078</fpage>
<lpage>2079</lpage>
<pub-id pub-id-type="doi">10.1093/bioinformatics/btp352</pub-id>
<pub-id pub-id-type="pmid">19505943</pub-id>
</element-citation>
</ref>
<ref id="ref-33"><label>Li & Godzik (2006)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Li</surname>
<given-names>W</given-names>
</name>
<name><surname>Godzik</surname>
<given-names>A</given-names>
</name>
</person-group>
<article-title>CD-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences</article-title>
<source>Bioinformatics</source>
<year>2006</year>
<volume>22</volume>
<fpage>1658</fpage>
<lpage>1659</lpage>
<pub-id pub-id-type="doi">10.1093/bioinformatics/btl158</pub-id>
<pub-id pub-id-type="pmid">16731699</pub-id>
</element-citation>
</ref>
<ref id="ref-34"><label>Metzker (2010)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Metzker</surname>
<given-names>ML</given-names>
</name>
</person-group>
<article-title>Sequencing technologies—the next generation</article-title>
<source>Nature Reviews Genetics</source>
<year>2010</year>
<volume>11</volume>
<fpage>31</fpage>
<lpage>46</lpage>
<pub-id pub-id-type="doi">10.1038/nrg2626</pub-id>
</element-citation>
</ref>
<ref id="ref-35"><label>Ming et al. (2013)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ming</surname>
<given-names>R</given-names>
</name>
<name><surname>VanBuren</surname>
<given-names>R</given-names>
</name>
<name><surname>Liu</surname>
<given-names>Y</given-names>
</name>
<name><surname>Yang</surname>
<given-names>M</given-names>
</name>
<name><surname>Han</surname>
<given-names>Y</given-names>
</name>
<name><surname>Li</surname>
<given-names>L-T</given-names>
</name>
<name><surname>Zhang</surname>
<given-names>Q</given-names>
</name>
<name><surname>Kim</surname>
<given-names>M-J</given-names>
</name>
<name><surname>Schatz</surname>
<given-names>MC</given-names>
</name>
<name><surname>Campbell</surname>
<given-names>M</given-names>
</name>
</person-group>
<article-title>Genome of the long-living sacred lotus (<italic>Nelumbo nucifera Gaertn)</italic>
</article-title>
<source>Genome Biology</source>
<year>2013</year>
<volume>14</volume>
<fpage>R41</fpage>
<pub-id pub-id-type="doi">10.1186/gb-2013-14-5-r41</pub-id>
<pub-id pub-id-type="pmid">23663246</pub-id>
</element-citation>
</ref>
<ref id="ref-36"><label>Narnoliya et al. (2014)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Narnoliya</surname>
<given-names>LK1</given-names>
</name>
<name><surname>Rajakani</surname>
<given-names>R</given-names>
</name>
<name><surname>Sangwan</surname>
<given-names>NS</given-names>
</name>
<name><surname>Gupta</surname>
<given-names>V</given-names>
</name>
<name><surname>Sangwan</surname>
<given-names>RS</given-names>
</name>
</person-group>
<article-title>Comparative transcripts profiling of fruit mesocarp and endocarp relevant to secondary metabolism by suppression subtractive hybridization in <italic>Azadirachta indica</italic>
 (neem)</article-title>
<source>Molecular Biology Reports</source>
<issue>5</issue>
<year>2014</year>
<volume>41</volume>
<fpage>R41</fpage>
<pub-id pub-id-type="doi">10.1007/s11033-014-3174-x</pub-id>
</element-citation>
</ref>
<ref id="ref-37"><label>Ohri, Bhargava & Chatterjee (2004)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ohri</surname>
<given-names>D</given-names>
</name>
<name><surname>Bhargava</surname>
<given-names>A</given-names>
</name>
<name><surname>Chatterjee</surname>
<given-names>A</given-names>
</name>
</person-group>
<article-title>Nuclear DNA amounts in 112 species of tropical hardwoods-new estimates</article-title>
<source>Plant Biology</source>
<year>2004</year>
<volume>6</volume>
<fpage>555</fpage>
<lpage>561</lpage>
<pub-id pub-id-type="doi">10.1055/s-2004-821235</pub-id>
<pub-id pub-id-type="pmid">15375726</pub-id>
</element-citation>
</ref>
<ref id="ref-38"><label>Parra, Bradnam & Korf (2007)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Parra</surname>
<given-names>G</given-names>
</name>
<name><surname>Bradnam</surname>
<given-names>K</given-names>
</name>
<name><surname>Korf</surname>
<given-names>I</given-names>
</name>
</person-group>
<article-title>CEGMA: a pipeline to accurately annotate core genes in eukaryotic genomes</article-title>
<source>Bioinformatics</source>
<year>2007</year>
<volume>23</volume>
<fpage>1061</fpage>
<lpage>1067</lpage>
<pub-id pub-id-type="doi">10.1093/bioinformatics/btm071</pub-id>
<pub-id pub-id-type="pmid">17332020</pub-id>
</element-citation>
</ref>
<ref id="ref-39"><label>Puri (1999)</label>
<element-citation publication-type="book"><person-group person-group-type="author"><name><surname>Puri</surname>
<given-names>HS</given-names>
</name>
</person-group>
<source>Neem: the divine tree. <italic>Azadirachta indica</italic>
</source>
<year>1999</year>
<publisher-loc>Amsterdam</publisher-loc>
<publisher-name>Harwood Academic Publishers</publisher-name>
</element-citation>
</ref>
<ref id="ref-40"><label>Puri (2003)</label>
<element-citation publication-type="book"><person-group person-group-type="author"><name><surname>Puri</surname>
<given-names>HS</given-names>
</name>
</person-group>
<source>Neem: the divine tree <italic>Azadirachta indica</italic>
</source>
<year>2003</year>
<publisher-loc>Boca Raton</publisher-loc>
<publisher-name>CRC Press</publisher-name>
</element-citation>
</ref>
<ref id="ref-41"><label>Rajakani et al. (2014)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Rajakani</surname>
<given-names>R</given-names>
</name>
<name><surname>Narnoliya</surname>
<given-names>L</given-names>
</name>
<name><surname>Sangwan</surname>
<given-names>NS</given-names>
</name>
<name><surname>Sangwan</surname>
<given-names>RS</given-names>
</name>
<name><surname>Gupta</surname>
<given-names>V</given-names>
</name>
</person-group>
<article-title>Subtractive transcriptomes of fruit and leaf reveal differential representation of transcripts in <italic>Azadirachta indica</italic>
</article-title>
<source>Tree Genetics & Genomes</source>
<year>2014</year>
<volume>10</volume>
<fpage>1331</fpage>
<lpage>1351</lpage>
<pub-id pub-id-type="doi">10.1007/s11295-014-0764-7</pub-id>
</element-citation>
</ref>
<ref id="ref-42"><label>Ray & Satya (2014)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ray</surname>
<given-names>S</given-names>
</name>
<name><surname>Satya</surname>
<given-names>P</given-names>
</name>
</person-group>
<article-title>Next generation sequencing technologies for next generation plant breeding</article-title>
<source>Frontiers in Plant Science</source>
<year>2014</year>
<volume>5</volume>
<fpage>367</fpage>
<pub-id pub-id-type="doi">10.3389/fpls.2014.00367</pub-id>
<pub-id pub-id-type="pmid">25126091</pub-id>
</element-citation>
</ref>
<ref id="ref-43"><label>Saxena (1989)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Saxena</surname>
<given-names>RC</given-names>
</name>
</person-group>
<source>Insecticides from neem</source>
<series>ACS Symposium series</series>
<volume>387</volume>
<year>1989</year>
<fpage>110</fpage>
<lpage>135</lpage>
</element-citation>
</ref>
<ref id="ref-44"><label>Schmid & Blaxter (2008)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Schmid</surname>
<given-names>R</given-names>
</name>
<name><surname>Blaxter</surname>
<given-names>ML</given-names>
</name>
</person-group>
<article-title>annot8r: GO, EC and KEGG annotation of EST datasets</article-title>
<source>BMC Bioinformatics</source>
<year>2008</year>
<volume>9</volume>
<fpage>180</fpage>
<pub-id pub-id-type="doi">10.1186/1471-2105-9-180</pub-id>
<pub-id pub-id-type="pmid">18400082</pub-id>
</element-citation>
</ref>
<ref id="ref-45"><label>Siddiqui (1942)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Siddiqui</surname>
<given-names>S</given-names>
</name>
</person-group>
<article-title>A note on the isolation of three new bitter principles from the nim oil</article-title>
<source>Current Science</source>
<year>1942</year>
<volume>11</volume>
<fpage>278</fpage>
<lpage>279</lpage>
</element-citation>
</ref>
<ref id="ref-46"><label>Sidhu, Kumar & Behl (2003)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Sidhu</surname>
<given-names>O</given-names>
</name>
<name><surname>Kumar</surname>
<given-names>V</given-names>
</name>
<name><surname>Behl</surname>
<given-names>H</given-names>
</name>
</person-group>
<article-title>Variability in neem (<italic>Azadirachta indica</italic>
) with respect to azadirachtin content</article-title>
<source>Journal of Agricultural and Food Chemistry</source>
<year>2003</year>
<volume>51</volume>
<fpage>910</fpage>
<lpage>915</lpage>
<pub-id pub-id-type="doi">10.1021/jf025994m</pub-id>
<pub-id pub-id-type="pmid">12568548</pub-id>
</element-citation>
</ref>
<ref id="ref-47"><label>Smit & Hubley (2008)</label>
<element-citation publication-type="other"><person-group><name><surname>Smit</surname>
<given-names>A</given-names>
</name>
<name><surname>Hubley</surname>
<given-names>R</given-names>
</name>
</person-group>
<year>2008</year>
<comment>RepeatModeler Open-1.0. 2008-2010. <italic>Available at <uri xlink:href="http://www.repeatmasker.org">http://www.repeatmasker.org</uri>
</italic>
</comment>
</element-citation>
</ref>
<ref id="ref-48"><label>Soderlund, Bomhoff & Nelson (2011)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Soderlund</surname>
<given-names>C</given-names>
</name>
<name><surname>Bomhoff</surname>
<given-names>M</given-names>
</name>
<name><surname>Nelson</surname>
<given-names>WM</given-names>
</name>
</person-group>
<article-title>SyMAP v3. 4: a turnkey synteny system with application to plant genomes</article-title>
<source>Nucleic Acids Research</source>
<year>2011</year>
<volume>39</volume>
<fpage>e68</fpage>
<lpage>e68</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gkr123</pub-id>
<pub-id pub-id-type="pmid">21398631</pub-id>
</element-citation>
</ref>
<ref id="ref-49"><label>Stanke et al. (2006)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Stanke</surname>
<given-names>M</given-names>
</name>
<name><surname>Keller</surname>
<given-names>O</given-names>
</name>
<name><surname>Gunduz</surname>
<given-names>I</given-names>
</name>
<name><surname>Hayes</surname>
<given-names>A</given-names>
</name>
<name><surname>Waack</surname>
<given-names>S</given-names>
</name>
<name><surname>Morgenstern</surname>
<given-names>B</given-names>
</name>
</person-group>
<article-title>AUGUSTUS: <italic>ab initio</italic>
 prediction of alternative transcripts</article-title>
<source>Nucleic Acids Research</source>
<year>2006</year>
<volume>34</volume>
<fpage>W435</fpage>
<lpage>W439</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gkl200</pub-id>
<pub-id pub-id-type="pmid">16845043</pub-id>
</element-citation>
</ref>
<ref id="ref-50"><label>Tan & Luo (2011)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Tan</surname>
<given-names>Q-G</given-names>
</name>
<name><surname>Luo</surname>
<given-names>X-D</given-names>
</name>
</person-group>
<article-title>Meliaceous limonoids: chemistry and biological activities</article-title>
<source>Chemical Reviews</source>
<year>2011</year>
<volume>111</volume>
<fpage>7437</fpage>
<lpage>7522</lpage>
<pub-id pub-id-type="doi">10.1021/cr9004023</pub-id>
<pub-id pub-id-type="pmid">21894902</pub-id>
</element-citation>
</ref>
<ref id="ref-51"><label>Varshney et al. (2012)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Varshney</surname>
<given-names>RK</given-names>
</name>
<name><surname>Chen</surname>
<given-names>W</given-names>
</name>
<name><surname>Li</surname>
<given-names>Y</given-names>
</name>
<name><surname>Bharti</surname>
<given-names>AK</given-names>
</name>
<name><surname>Saxena</surname>
<given-names>RK</given-names>
</name>
<name><surname>Schlueter</surname>
<given-names>JA</given-names>
</name>
<name><surname>Donoghue</surname>
<given-names>MT</given-names>
</name>
<name><surname>Azam</surname>
<given-names>S</given-names>
</name>
<name><surname>Fan</surname>
<given-names>G</given-names>
</name>
<name><surname>Whaley</surname>
<given-names>AM</given-names>
</name>
</person-group>
<article-title>Draft genome sequence of pigeonpea (<italic>Cajanus cajan</italic>
), an orphan legume crop of resource-poor farmers</article-title>
<source>Nature Biotechnology</source>
<year>2012</year>
<volume>30</volume>
<fpage>83</fpage>
<lpage>89</lpage>
<pub-id pub-id-type="doi">10.1038/nbt.2022</pub-id>
</element-citation>
</ref>
<ref id="ref-52"><label>Veitch, Boyer & Ley (2008)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Veitch</surname>
<given-names>GE</given-names>
</name>
<name><surname>Boyer</surname>
<given-names>A</given-names>
</name>
<name><surname>Ley</surname>
<given-names>SV</given-names>
</name>
</person-group>
<article-title>The azadirachtin story</article-title>
<source>Angewandte Chemie International Edition</source>
<year>2008</year>
<volume>47</volume>
<fpage>9402</fpage>
<lpage>9429</lpage>
<pub-id pub-id-type="doi">10.1002/anie.200802675</pub-id>
</element-citation>
</ref>
<ref id="ref-53"><label>Warnes, Bolker & Lumley (2008)</label>
<element-citation publication-type="software"><person-group person-group-type="author"><name><surname>Warnes</surname>
<given-names>GR</given-names>
</name>
<name><surname>Bolker</surname>
<given-names>B</given-names>
</name>
<name><surname>Lumley</surname>
<given-names>T</given-names>
</name>
</person-group>
<source>gtools: various R programming tools</source>
<edition designator="2">R package version 2</edition>
<year>2008</year>
<comment><italic>Available at <uri xlink:href="https://cran.r-project.org/web/packages/gtools/index.html">https://cran.r-project.org/web/packages/gtools/index.html</uri>
</italic>
</comment>
</element-citation>
</ref>
<ref id="ref-54"><label>Wickham (2010)</label>
<element-citation publication-type="other"><person-group><name><surname>Wickham</surname>
<given-names>H</given-names>
</name>
</person-group>
<article-title>stringr: make it easier to work with strings</article-title>
<year>2010</year>
<comment><italic>Available at <uri xlink:href="http://CRANR-projectorg/package=stringrRpackageversion04">http://CRANR-projectorg/package=stringrRpackageversion04</uri>
</italic>
</comment>
</element-citation>
</ref>
<ref id="ref-55"><label>Wu et al. (2014)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Wu</surname>
<given-names>GA</given-names>
</name>
<name><surname>Prochnik</surname>
<given-names>S</given-names>
</name>
<name><surname>Jenkins</surname>
<given-names>J</given-names>
</name>
<name><surname>Salse</surname>
<given-names>J</given-names>
</name>
<name><surname>Hellsten</surname>
<given-names>U</given-names>
</name>
<name><surname>Murat</surname>
<given-names>F</given-names>
</name>
<name><surname>Perrier</surname>
<given-names>X</given-names>
</name>
<name><surname>Ruiz</surname>
<given-names>M</given-names>
</name>
<name><surname>Scalabrin</surname>
<given-names>S</given-names>
</name>
<name><surname>Terol</surname>
<given-names>J</given-names>
</name>
<name><surname>Takita</surname>
<given-names>MA</given-names>
</name>
<name><surname>Labadie</surname>
<given-names>K</given-names>
</name>
<name><surname>Poulain</surname>
<given-names>J</given-names>
</name>
<name><surname>Couloux</surname>
<given-names>A</given-names>
</name>
<name><surname>Jabbari</surname>
<given-names>K</given-names>
</name>
<name><surname>Cattonaro</surname>
<given-names>F</given-names>
</name>
<name><surname>Del Fabbro</surname>
<given-names>C</given-names>
</name>
<name><surname>Pinosio</surname>
<given-names>S</given-names>
</name>
<name><surname>Zuccolo</surname>
<given-names>A</given-names>
</name>
<name><surname>Chapman</surname>
<given-names>J</given-names>
</name>
<name><surname>Grimwood</surname>
<given-names>J</given-names>
</name>
<name><surname>Tadeo</surname>
<given-names>FR</given-names>
</name>
<name><surname>Estornell</surname>
<given-names>LH</given-names>
</name>
<name><surname>Muñoz-Sanz</surname>
<given-names>J</given-names>
</name>
<name><surname>Ibanez</surname>
<given-names>V</given-names>
</name>
<name><surname>Herrero-Ortega</surname>
<given-names>A</given-names>
</name>
<name><surname>Aleza</surname>
<given-names>P</given-names>
</name>
<name><surname>Pérez-Pérez</surname>
<given-names>J</given-names>
</name>
<name><surname>Ramón</surname>
<given-names>D</given-names>
</name>
<name><surname>Brunel</surname>
<given-names>D</given-names>
</name>
<name><surname>Luro</surname>
<given-names>F</given-names>
</name>
<name><surname>Chen</surname>
<given-names>C</given-names>
</name>
<name><surname>Farmerie</surname>
<given-names>WG</given-names>
</name>
<name><surname>Desany</surname>
<given-names>B</given-names>
</name>
<name><surname>Kodira</surname>
<given-names>C</given-names>
</name>
<name><surname>Mohiuddin</surname>
<given-names>M</given-names>
</name>
<name><surname>Harkins</surname>
<given-names>T</given-names>
</name>
<name><surname>Fredrikson</surname>
<given-names>K</given-names>
</name>
<name><surname>Burns</surname>
<given-names>P</given-names>
</name>
<name><surname>Lomsadze</surname>
<given-names>A</given-names>
</name>
<name><surname>Borodovsky</surname>
<given-names>M</given-names>
</name>
<name><surname>Reforgiato</surname>
<given-names>G</given-names>
</name>
<name><surname>Freitas-Astúa</surname>
<given-names>J</given-names>
</name>
<name><surname>Quetier</surname>
<given-names>F</given-names>
</name>
<name><surname>Navarro</surname>
<given-names>L</given-names>
</name>
<name><surname>Roose</surname>
<given-names>M</given-names>
</name>
<name><surname>Wincker</surname>
<given-names>P</given-names>
</name>
<name><surname>Schmutz</surname>
<given-names>J</given-names>
</name>
<name><surname>Morgante</surname>
<given-names>M</given-names>
</name>
<name><surname>Machado</surname>
<given-names>MA</given-names>
</name>
<name><surname>Talon</surname>
<given-names>M</given-names>
</name>
<name><surname>Jaillon</surname>
<given-names>O</given-names>
</name>
<name><surname>Ollitrault</surname>
<given-names>P</given-names>
</name>
<name><surname>Gmitter</surname>
<given-names>F</given-names>
</name>
<name><surname>Rokhsar</surname>
<given-names>D</given-names>
</name>
</person-group>
<article-title>Sequencing of diverse mandarin, pummelo and orange genomes reveals complex history of admixture during citrus domestication</article-title>
<source>Nature Biotechnology</source>
<issue>7</issue>
<year>2014</year>
<volume>32</volume>
<fpage>656</fpage>
<lpage>662</lpage>
<pub-id pub-id-type="doi">10.1038/nbt.2906</pub-id>
</element-citation>
</ref>
<ref id="ref-56"><label>Wyman, Jansen & Boore (2004)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Wyman</surname>
<given-names>SK</given-names>
</name>
<name><surname>Jansen</surname>
<given-names>RK</given-names>
</name>
<name><surname>Boore</surname>
<given-names>JL</given-names>
</name>
</person-group>
<article-title>Automatic annotation of organellar genomes with DOGMA</article-title>
<source>Bioinformatics</source>
<year>2004</year>
<volume>20</volume>
<fpage>3252</fpage>
<lpage>3255</lpage>
<pub-id pub-id-type="doi">10.1093/bioinformatics/bth352</pub-id>
<pub-id pub-id-type="pmid">15180927</pub-id>
</element-citation>
</ref>
<ref id="ref-57"><label>Xu et al. (2013)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Xu</surname>
<given-names>Q</given-names>
</name>
<name><surname>Chen</surname>
<given-names>L-L</given-names>
</name>
<name><surname>Ruan</surname>
<given-names>X</given-names>
</name>
<name><surname>Chen</surname>
<given-names>D</given-names>
</name>
<name><surname>Zhu</surname>
<given-names>A</given-names>
</name>
<name><surname>Chen</surname>
<given-names>C</given-names>
</name>
<name><surname>Bertrand</surname>
<given-names>D</given-names>
</name>
<name><surname>Jiao</surname>
<given-names>W-B</given-names>
</name>
<name><surname>Hao</surname>
<given-names>B-H</given-names>
</name>
<name><surname>Lyon</surname>
<given-names>MP</given-names>
</name>
</person-group>
<article-title>The draft genome of sweet orange (<italic>Citrus sinensis</italic>
)</article-title>
<source>Nature Genetics</source>
<year>2013</year>
<volume>45</volume>
<fpage>59</fpage>
<lpage>66</lpage>
<pub-id pub-id-type="doi">10.1038/ng.2472</pub-id>
<pub-id pub-id-type="pmid">23179022</pub-id>
</element-citation>
</ref>
<ref id="ref-58"><label>Zerbino & Birney (2008)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Zerbino</surname>
<given-names>DR</given-names>
</name>
<name><surname>Birney</surname>
<given-names>E</given-names>
</name>
</person-group>
<article-title>Velvet: algorithms for <italic>de novo</italic>
 short read assembly using <italic>de Bruijn</italic>
 graphs</article-title>
<source>Genome Research</source>
<year>2008</year>
<volume>18</volume>
<fpage>821</fpage>
<lpage>829</lpage>
<pub-id pub-id-type="doi">10.1101/gr.074492.107</pub-id>
<pub-id pub-id-type="pmid">18349386</pub-id>
</element-citation>
</ref>
<ref id="ref-59"><label>Zhang & Wessler (2004)</label>
<element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname>
<given-names>X</given-names>
</name>
<name><surname>Wessler</surname>
<given-names>SR</given-names>
</name>
</person-group>
<article-title>Genome-wide comparative analysis of the transposable elements in the related species <italic>Arabidopsis thaliana</italic>
 and <italic>Brassica oleracea</italic>
</article-title>
<source>Proceedings of the National Academy of Sciences of the United States of America</source>
<year>2004</year>
<volume>101</volume>
<fpage>5589</fpage>
<lpage>5594</lpage>
<pub-id pub-id-type="doi">10.1073/pnas.0401243101</pub-id>
<pub-id pub-id-type="pmid">15064405</pub-id>
</element-citation>
</ref>
</ref-list>
</back>
</pmc>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Bois/explor/OrangerV1/Data/Pmc/Corpus

HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000233 | SxmlIndent | more

HfdSelect -h $EXPLOR_AREA/Data/Pmc/Corpus/biblio.hfd -nk 000233 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Bois
   |area=    OrangerV1
   |flux=    Pmc
   |étape=   Corpus
   |type=    RBID
   |clé=     PMC:4540028
   |texte=   Comprehensive analyses of genomes, transcriptomes and metabolites of neem tree
}}

Pour générer des pages wiki

HfdIndexSelect -h $EXPLOR_AREA/Data/Pmc/Corpus/RBID.i   -Sk "pubmed:26290780" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Pmc/Corpus/biblio.hfd   \
       | NlmPubMed2Wicri -a OrangerV1

This area was generated with Dilib version V0.6.25.
Data generation: Sat Dec 3 17:11:04 2016. Site generation: Wed Mar 6 18:18:32 2024

	Serveur d'exploration sur l'oranger
	Attention, ce site est en cours de développement ! Attention, site généré par des moyens informatiques à partir de corpus bruts. Les informations ne sont donc pas validées.

Serveur d'exploration sur l'oranger

Comprehensive analyses of genomes, transcriptomes and metabolites of neem tree

Comprehensive analyses of genomes, transcriptomes and metabolites of neem tree

Source :

Abstract

Links to Exploration step

Le document en format XML

Pour manipuler ce document sous Unix (Dilib)

Pour mettre un lien sur cette page dans le réseau Wicri

Pour générer des pages wiki