Serveur d'exploration sur le cobalt au Maghreb

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

De Novo Sequences of Haloquadratum walsbyi from Lake Tyrrell, Australia, Reveal a Variable Genomic Landscape

Identifieur interne : 000398 ( Pmc/Corpus ); précédent : 000397; suivant : 000399

De Novo Sequences of Haloquadratum walsbyi from Lake Tyrrell, Australia, Reveal a Variable Genomic Landscape

Auteurs : Benjamin J. Tully ; Joanne B. Emerson ; Karen Andrade ; Jochen J. Brocks ; Eric E. Allen ; Jillian F. Banfield ; Karla B. Heidelberg

Source :

RBID : PMC:4330952

Abstract

Hypersaline systems near salt saturation levels represent an extreme environment, in which organisms grow and survive near the limits of life. One of the abundant members of the microbial communities in hypersaline systems is the square archaeon, Haloquadratum walsbyi. Utilizing a short-read metagenome from Lake Tyrrell, a hypersaline ecosystem in Victoria, Australia, we performed a comparative genomic analysis of H. walsbyi to better understand the extent of variation between strains/subspecies. Results revealed that previously isolated strains/subspecies do not fully describe the complete repertoire of the genomic landscape present in H. walsbyi. Rearrangements, insertions, and deletions were observed for the Lake Tyrrell derived Haloquadratum genomes and were supported by environmental de novo sequences, including shifts in the dominant genomic landscape of the two most abundant strains. Analysis pertaining to halomucins indicated that homologs for this large protein are not a feature common for all species of Haloquadratum. Further, we analyzed ATP-binding cassette transporters (ABC-type transporters) for evidence of niche partitioning between different strains/subspecies. We were able to identify unique and variable transporter subunits from all five genomes analyzed and the de novo environmental sequences, suggesting that differences in nutrient and carbon source acquisition may play a role in maintaining distinct strains/subspecies.


Url:
DOI: 10.1155/2015/875784
PubMed: 25709557
PubMed Central: 4330952

Links to Exploration step

PMC:4330952

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">
<italic>De Novo</italic>
Sequences of
<italic>Haloquadratum walsbyi</italic>
from Lake Tyrrell, Australia, Reveal a Variable Genomic Landscape</title>
<author>
<name sortKey="Tully, Benjamin J" sort="Tully, Benjamin J" uniqKey="Tully B" first="Benjamin J." last="Tully">Benjamin J. Tully</name>
<affiliation>
<nlm:aff id="I1">Department of Biological Sciences, Dornsife College of Letters, Arts and Sciences, University of Southern California, 3616 Trousdale Parkway, Los Angeles, CA 90089, USA</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Emerson, Joanne B" sort="Emerson, Joanne B" uniqKey="Emerson J" first="Joanne B." last="Emerson">Joanne B. Emerson</name>
<affiliation>
<nlm:aff id="I2">Cooperative Institute for Research in Environmental Sciences, CIRES Building, Room 318, University of Colorado Boulder, Boulder, CO 80309, USA</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Andrade, Karen" sort="Andrade, Karen" uniqKey="Andrade K" first="Karen" last="Andrade">Karen Andrade</name>
<affiliation>
<nlm:aff id="I3">Department of Environmental Science, Policy and Management, University of California, Berkeley, 54 Mulford Hall, Berkeley, CA 94720, USA</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Brocks, Jochen J" sort="Brocks, Jochen J" uniqKey="Brocks J" first="Jochen J." last="Brocks">Jochen J. Brocks</name>
<affiliation>
<nlm:aff id="I4">Research School of Earth Sciences, The Australian National University, Canberra, ACT 0200, Australia</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Allen, Eric E" sort="Allen, Eric E" uniqKey="Allen E" first="Eric E." last="Allen">Eric E. Allen</name>
<affiliation>
<nlm:aff id="I5">Division of Biological Sciences, University of California, San Diego, La Jolla, CA 92093-0202, USA</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="I6">Marine Biology Research Division, Scripps Institution of Oceanography, La Jolla, CA 92093, USA</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Banfield, Jillian F" sort="Banfield, Jillian F" uniqKey="Banfield J" first="Jillian F." last="Banfield">Jillian F. Banfield</name>
<affiliation>
<nlm:aff id="I2">Cooperative Institute for Research in Environmental Sciences, CIRES Building, Room 318, University of Colorado Boulder, Boulder, CO 80309, USA</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Heidelberg, Karla B" sort="Heidelberg, Karla B" uniqKey="Heidelberg K" first="Karla B." last="Heidelberg">Karla B. Heidelberg</name>
<affiliation>
<nlm:aff id="I1">Department of Biological Sciences, Dornsife College of Letters, Arts and Sciences, University of Southern California, 3616 Trousdale Parkway, Los Angeles, CA 90089, USA</nlm:aff>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PMC</idno>
<idno type="pmid">25709557</idno>
<idno type="pmc">4330952</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4330952</idno>
<idno type="RBID">PMC:4330952</idno>
<idno type="doi">10.1155/2015/875784</idno>
<date when="2015">2015</date>
<idno type="wicri:Area/Pmc/Corpus">000398</idno>
<idno type="wicri:explorRef" wicri:stream="Pmc" wicri:step="Corpus" wicri:corpus="PMC">000398</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a" type="main">
<italic>De Novo</italic>
Sequences of
<italic>Haloquadratum walsbyi</italic>
from Lake Tyrrell, Australia, Reveal a Variable Genomic Landscape</title>
<author>
<name sortKey="Tully, Benjamin J" sort="Tully, Benjamin J" uniqKey="Tully B" first="Benjamin J." last="Tully">Benjamin J. Tully</name>
<affiliation>
<nlm:aff id="I1">Department of Biological Sciences, Dornsife College of Letters, Arts and Sciences, University of Southern California, 3616 Trousdale Parkway, Los Angeles, CA 90089, USA</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Emerson, Joanne B" sort="Emerson, Joanne B" uniqKey="Emerson J" first="Joanne B." last="Emerson">Joanne B. Emerson</name>
<affiliation>
<nlm:aff id="I2">Cooperative Institute for Research in Environmental Sciences, CIRES Building, Room 318, University of Colorado Boulder, Boulder, CO 80309, USA</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Andrade, Karen" sort="Andrade, Karen" uniqKey="Andrade K" first="Karen" last="Andrade">Karen Andrade</name>
<affiliation>
<nlm:aff id="I3">Department of Environmental Science, Policy and Management, University of California, Berkeley, 54 Mulford Hall, Berkeley, CA 94720, USA</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Brocks, Jochen J" sort="Brocks, Jochen J" uniqKey="Brocks J" first="Jochen J." last="Brocks">Jochen J. Brocks</name>
<affiliation>
<nlm:aff id="I4">Research School of Earth Sciences, The Australian National University, Canberra, ACT 0200, Australia</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Allen, Eric E" sort="Allen, Eric E" uniqKey="Allen E" first="Eric E." last="Allen">Eric E. Allen</name>
<affiliation>
<nlm:aff id="I5">Division of Biological Sciences, University of California, San Diego, La Jolla, CA 92093-0202, USA</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="I6">Marine Biology Research Division, Scripps Institution of Oceanography, La Jolla, CA 92093, USA</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Banfield, Jillian F" sort="Banfield, Jillian F" uniqKey="Banfield J" first="Jillian F." last="Banfield">Jillian F. Banfield</name>
<affiliation>
<nlm:aff id="I2">Cooperative Institute for Research in Environmental Sciences, CIRES Building, Room 318, University of Colorado Boulder, Boulder, CO 80309, USA</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Heidelberg, Karla B" sort="Heidelberg, Karla B" uniqKey="Heidelberg K" first="Karla B." last="Heidelberg">Karla B. Heidelberg</name>
<affiliation>
<nlm:aff id="I1">Department of Biological Sciences, Dornsife College of Letters, Arts and Sciences, University of Southern California, 3616 Trousdale Parkway, Los Angeles, CA 90089, USA</nlm:aff>
</affiliation>
</author>
</analytic>
<series>
<title level="j">Archaea</title>
<idno type="ISSN">1472-3646</idno>
<idno type="eISSN">1472-3654</idno>
<imprint>
<date when="2015">2015</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">
<p>Hypersaline systems near salt saturation levels represent an extreme environment, in which organisms grow and survive near the limits of life. One of the abundant members of the microbial communities in hypersaline systems is the square archaeon,
<italic>Haloquadratum walsbyi</italic>
. Utilizing a short-read metagenome from Lake Tyrrell, a hypersaline ecosystem in Victoria, Australia, we performed a comparative genomic analysis of
<italic>H. walsbyi</italic>
to better understand the extent of variation between strains/subspecies. Results revealed that previously isolated strains/subspecies do not fully describe the complete repertoire of the genomic landscape present in
<italic>H. walsbyi</italic>
. Rearrangements, insertions, and deletions were observed for the Lake Tyrrell derived
<italic>Haloquadratum</italic>
genomes and were supported by environmental
<italic>de novo</italic>
sequences, including shifts in the dominant genomic landscape of the two most abundant strains. Analysis pertaining to halomucins indicated that homologs for this large protein are not a feature common for all species of
<italic>Haloquadratum</italic>
. Further, we analyzed ATP-binding cassette transporters (ABC-type transporters) for evidence of niche partitioning between different strains/subspecies. We were able to identify unique and variable transporter subunits from all five genomes analyzed and the
<italic>de novo</italic>
environmental sequences, suggesting that differences in nutrient and carbon source acquisition may play a role in maintaining distinct strains/subspecies.</p>
</div>
</front>
<back>
<div1 type="bibliography">
<listBibl>
<biblStruct>
<analytic>
<author>
<name sortKey="Bodaker, I" uniqKey="Bodaker I">I. Bodaker</name>
</author>
<author>
<name sortKey="Sharon, I" uniqKey="Sharon I">I. Sharon</name>
</author>
<author>
<name sortKey="Suzuki, M T" uniqKey="Suzuki M">M. T. Suzuki</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Boujelben, I" uniqKey="Boujelben I">I. Boujelben</name>
</author>
<author>
<name sortKey="Gomariz, M" uniqKey="Gomariz M">M. Gomariz</name>
</author>
<author>
<name sortKey="Martinez Garcia, M" uniqKey="Martinez Garcia M">M. Martínez-García</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Demergasso, C" uniqKey="Demergasso C">C. Demergasso</name>
</author>
<author>
<name sortKey="Escudero, L" uniqKey="Escudero L">L. Escudero</name>
</author>
<author>
<name sortKey="Casamayor, E O" uniqKey="Casamayor E">E. O. Casamayor</name>
</author>
<author>
<name sortKey="Chong, G" uniqKey="Chong G">G. Chong</name>
</author>
<author>
<name sortKey="Balague, V" uniqKey="Balague V">V. Balagué</name>
</author>
<author>
<name sortKey="Pedr S Ali, C" uniqKey="Pedr S Ali C">C. Pedrós-Alió</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Oh, D" uniqKey="Oh D">D. Oh</name>
</author>
<author>
<name sortKey="Porter, K" uniqKey="Porter K">K. Porter</name>
</author>
<author>
<name sortKey="Russ, B" uniqKey="Russ B">B. Russ</name>
</author>
<author>
<name sortKey="Burns, D" uniqKey="Burns D">D. Burns</name>
</author>
<author>
<name sortKey="Dyall Smith, M" uniqKey="Dyall Smith M">M. Dyall-Smith</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Oren, A" uniqKey="Oren A">A. Oren</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Andrei, A" uniqKey="Andrei A">A.-Ş. Andrei</name>
</author>
<author>
<name sortKey="Banciu, H L" uniqKey="Banciu H">H. L. Banciu</name>
</author>
<author>
<name sortKey="Oren, A" uniqKey="Oren A">A. Oren</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Walsby, A E" uniqKey="Walsby A">A. E. Walsby</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Bolhuis, H" uniqKey="Bolhuis H">H. Bolhuis</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Bolhuis, H" uniqKey="Bolhuis H">H. Bolhuis</name>
</author>
<author>
<name sortKey="Te Poele, E M" uniqKey="Te Poele E">E. M. Te Poele</name>
</author>
<author>
<name sortKey="Rodriguez Valera, F" uniqKey="Rodriguez Valera F">F. Rodriguez-Valera</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Bolhuis, H" uniqKey="Bolhuis H">H. Bolhuis</name>
</author>
<author>
<name sortKey="Palm, P" uniqKey="Palm P">P. Palm</name>
</author>
<author>
<name sortKey="Wende, A" uniqKey="Wende A">A. Wende</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Beja, O" uniqKey="Beja O">O. Béjá</name>
</author>
<author>
<name sortKey="Spudich, E N" uniqKey="Spudich E">E. N. Spudich</name>
</author>
<author>
<name sortKey="Spudich, J L" uniqKey="Spudich J">J. L. Spudich</name>
</author>
<author>
<name sortKey="Leclerc, M" uniqKey="Leclerc M">M. Leclerc</name>
</author>
<author>
<name sortKey="Delong, E F" uniqKey="Delong E">E. F. DeLong</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Sherwood, J E" uniqKey="Sherwood J">J. E. Sherwood</name>
</author>
<author>
<name sortKey="Stagnitti, F" uniqKey="Stagnitti F">F. Stagnitti</name>
</author>
<author>
<name sortKey="Kokkinn, M J" uniqKey="Kokkinn M">M. J. Kokkinn</name>
</author>
<author>
<name sortKey="Williams, W D" uniqKey="Williams W">W. D. Williams</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Legault, B A" uniqKey="Legault B">B. A. Legault</name>
</author>
<author>
<name sortKey="Lopez Lopez, A" uniqKey="Lopez Lopez A">A. Lopez-Lopez</name>
</author>
<author>
<name sortKey="Alba Casado, J C" uniqKey="Alba Casado J">J. C. Alba-Casado</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Podell, S" uniqKey="Podell S">S. Podell</name>
</author>
<author>
<name sortKey="Ugalde, J A" uniqKey="Ugalde J">J. A. Ugalde</name>
</author>
<author>
<name sortKey="Narasingarao, P" uniqKey="Narasingarao P">P. Narasingarao</name>
</author>
<author>
<name sortKey="Banfield, J F" uniqKey="Banfield J">J. F. Banfield</name>
</author>
<author>
<name sortKey="Heidelberg, K B" uniqKey="Heidelberg K">K. B. Heidelberg</name>
</author>
<author>
<name sortKey="Allen, E E" uniqKey="Allen E">E. E. Allen</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Treusch, A H" uniqKey="Treusch A">A. H. Treusch</name>
</author>
<author>
<name sortKey="Kletzin, A" uniqKey="Kletzin A">A. Kletzin</name>
</author>
<author>
<name sortKey="Raddatz, G" uniqKey="Raddatz G">G. Raddatz</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Giovannoni, S J" uniqKey="Giovannoni S">S. J. Giovannoni</name>
</author>
<author>
<name sortKey="Tripp, H J" uniqKey="Tripp H">H. J. Tripp</name>
</author>
<author>
<name sortKey="Givan, S" uniqKey="Givan S">S. Givan</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Saunders, N F W" uniqKey="Saunders N">N. F. W. Saunders</name>
</author>
<author>
<name sortKey="Thomas, T" uniqKey="Thomas T">T. Thomas</name>
</author>
<author>
<name sortKey="Curmi, P M G" uniqKey="Curmi P">P. M. G. Curmi</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Burns, D G" uniqKey="Burns D">D. G. Burns</name>
</author>
<author>
<name sortKey="Camakaris, H M" uniqKey="Camakaris H">H. M. Camakaris</name>
</author>
<author>
<name sortKey="Janssen, P H" uniqKey="Janssen P">P. H. Janssen</name>
</author>
<author>
<name sortKey="Dyall Smith, M L" uniqKey="Dyall Smith M">M. L. Dyall-Smith</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Dyall Smith, M L" uniqKey="Dyall Smith M">M. L. Dyall-Smith</name>
</author>
<author>
<name sortKey="Pfeiffer, F" uniqKey="Pfeiffer F">F. Pfeiffer</name>
</author>
<author>
<name sortKey="Klee, K" uniqKey="Klee K">K. Klee</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Cuadros Orellana, S" uniqKey="Cuadros Orellana S">S. Cuadros-Orellana</name>
</author>
<author>
<name sortKey="Martin Cuadrado, A B" uniqKey="Martin Cuadrado A">A.-B. Martin-Cuadrado</name>
</author>
<author>
<name sortKey="Legault, B" uniqKey="Legault B">B. Legault</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Narasingarao, P" uniqKey="Narasingarao P">P. Narasingarao</name>
</author>
<author>
<name sortKey="Podell, S" uniqKey="Podell S">S. Podell</name>
</author>
<author>
<name sortKey="Ugalde, J A" uniqKey="Ugalde J">J. A. Ugalde</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Podell, S" uniqKey="Podell S">S. Podell</name>
</author>
<author>
<name sortKey="Emerson, J B" uniqKey="Emerson J">J. B. Emerson</name>
</author>
<author>
<name sortKey="Jones, C M" uniqKey="Jones C">C. M. Jones</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Emerson, J B" uniqKey="Emerson J">J. B. Emerson</name>
</author>
<author>
<name sortKey="Thomas, B C" uniqKey="Thomas B">B. C. Thomas</name>
</author>
<author>
<name sortKey="Andrade, K" uniqKey="Andrade K">K. Andrade</name>
</author>
<author>
<name sortKey="Allen, E E" uniqKey="Allen E">E. E. Allen</name>
</author>
<author>
<name sortKey="Heidelberg, K B" uniqKey="Heidelberg K">K. B. Heidelberg</name>
</author>
<author>
<name sortKey="Banfielda, J F" uniqKey="Banfielda J">J. F. Banfielda</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Emerson, J B" uniqKey="Emerson J">J. B. Emerson</name>
</author>
<author>
<name sortKey="Andrade, K" uniqKey="Andrade K">K. Andrade</name>
</author>
<author>
<name sortKey="Thomas, B C" uniqKey="Thomas B">B. C. Thomas</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Heidelberg, K B" uniqKey="Heidelberg K">K. B. Heidelberg</name>
</author>
<author>
<name sortKey="Nelson, W C" uniqKey="Nelson W">W. C. Nelson</name>
</author>
<author>
<name sortKey="Holm, J B" uniqKey="Holm J">J. B. Holm</name>
</author>
<author>
<name sortKey="Eisenkolb, N" uniqKey="Eisenkolb N">N. Eisenkolb</name>
</author>
<author>
<name sortKey="Andrade, K" uniqKey="Andrade K">K. Andrade</name>
</author>
<author>
<name sortKey="Emerson, J B" uniqKey="Emerson J">J. B. Emerson</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Peng, Y" uniqKey="Peng Y">Y. Peng</name>
</author>
<author>
<name sortKey="Leung, H C M" uniqKey="Leung H">H. C. M. Leung</name>
</author>
<author>
<name sortKey="Yiu, S M" uniqKey="Yiu S">S. M. Yiu</name>
</author>
<author>
<name sortKey="Chin, F Y L" uniqKey="Chin F">F. Y. L. Chin</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Li, H" uniqKey="Li H">H. Li</name>
</author>
<author>
<name sortKey="Durbin, R" uniqKey="Durbin R">R. Durbin</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Iverson, V" uniqKey="Iverson V">V. Iverson</name>
</author>
<author>
<name sortKey="Morris, R M" uniqKey="Morris R">R. M. Morris</name>
</author>
<author>
<name sortKey="Frazar, C D" uniqKey="Frazar C">C. D. Frazar</name>
</author>
<author>
<name sortKey="Berthiaume, C T" uniqKey="Berthiaume C">C. T. Berthiaume</name>
</author>
<author>
<name sortKey="Morales, R L" uniqKey="Morales R">R. L. Morales</name>
</author>
<author>
<name sortKey="Armbrust, E V" uniqKey="Armbrust E">E. V. Armbrust</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Rho, M" uniqKey="Rho M">M. Rho</name>
</author>
<author>
<name sortKey="Tang, H" uniqKey="Tang H">H. Tang</name>
</author>
<author>
<name sortKey="Ye, Y" uniqKey="Ye Y">Y. Ye</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Aziz, R K" uniqKey="Aziz R">R. K. Aziz</name>
</author>
<author>
<name sortKey="Bartels, D" uniqKey="Bartels D">D. Bartels</name>
</author>
<author>
<name sortKey="Best, A" uniqKey="Best A">A. Best</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Markowitz, V M" uniqKey="Markowitz V">V. M. Markowitz</name>
</author>
<author>
<name sortKey="Korzeniewski, F" uniqKey="Korzeniewski F">F. Korzeniewski</name>
</author>
<author>
<name sortKey="Palaniappan, K" uniqKey="Palaniappan K">K. Palaniappan</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Schloss, P D" uniqKey="Schloss P">P. D. Schloss</name>
</author>
<author>
<name sortKey="Westcott, S L" uniqKey="Westcott S">S. L. Westcott</name>
</author>
<author>
<name sortKey="Ryabin, T" uniqKey="Ryabin T">T. Ryabin</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Darling, A E" uniqKey="Darling A">A. E. Darling</name>
</author>
<author>
<name sortKey="Mau, B" uniqKey="Mau B">B. Mau</name>
</author>
<author>
<name sortKey="Perna, N T" uniqKey="Perna N">N. T. Perna</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Thompson, J D" uniqKey="Thompson J">J. D. Thompson</name>
</author>
<author>
<name sortKey="Higgins, D G" uniqKey="Higgins D">D. G. Higgins</name>
</author>
<author>
<name sortKey="Gibson, T J" uniqKey="Gibson T">T. J. Gibson</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Petersen, T N" uniqKey="Petersen T">T. N. Petersen</name>
</author>
<author>
<name sortKey="Brunak, S" uniqKey="Brunak S">S. Brunak</name>
</author>
<author>
<name sortKey="Von Heijne, G" uniqKey="Von Heijne G">G. von Heijne</name>
</author>
<author>
<name sortKey="Nielsen, H" uniqKey="Nielsen H">H. Nielsen</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Altschul, S F" uniqKey="Altschul S">S. F. Altschul</name>
</author>
<author>
<name sortKey="Madden, T L" uniqKey="Madden T">T. L. Madden</name>
</author>
<author>
<name sortKey="Sch Ffer, A A" uniqKey="Sch Ffer A">A. A. Schäffer</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Nishihara, M" uniqKey="Nishihara M">M. Nishihara</name>
</author>
<author>
<name sortKey="Yamazaki, T" uniqKey="Yamazaki T">T. Yamazaki</name>
</author>
<author>
<name sortKey="Oshima, T" uniqKey="Oshima T">T. Oshima</name>
</author>
<author>
<name sortKey="Koga, Y" uniqKey="Koga Y">Y. Koga</name>
</author>
</analytic>
</biblStruct>
</listBibl>
</div1>
</back>
</TEI>
<pmc article-type="research-article">
<pmc-dir>properties open_access</pmc-dir>
<front>
<journal-meta>
<journal-id journal-id-type="nlm-ta">Archaea</journal-id>
<journal-id journal-id-type="iso-abbrev">Archaea</journal-id>
<journal-id journal-id-type="publisher-id">ARCHAEA</journal-id>
<journal-title-group>
<journal-title>Archaea</journal-title>
</journal-title-group>
<issn pub-type="ppub">1472-3646</issn>
<issn pub-type="epub">1472-3654</issn>
<publisher>
<publisher-name>Hindawi Publishing Corporation</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="pmid">25709557</article-id>
<article-id pub-id-type="pmc">4330952</article-id>
<article-id pub-id-type="doi">10.1155/2015/875784</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Research Article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>
<italic>De Novo</italic>
Sequences of
<italic>Haloquadratum walsbyi</italic>
from Lake Tyrrell, Australia, Reveal a Variable Genomic Landscape</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Tully</surname>
<given-names>Benjamin J.</given-names>
</name>
<xref ref-type="aff" rid="I1">
<sup>1</sup>
</xref>
<xref ref-type="corresp" rid="cor1">
<sup>*</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Emerson</surname>
<given-names>Joanne B.</given-names>
</name>
<xref ref-type="aff" rid="I2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Andrade</surname>
<given-names>Karen</given-names>
</name>
<xref ref-type="aff" rid="I3">
<sup>3</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Brocks</surname>
<given-names>Jochen J.</given-names>
</name>
<xref ref-type="aff" rid="I4">
<sup>4</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Allen</surname>
<given-names>Eric E.</given-names>
</name>
<xref ref-type="aff" rid="I5">
<sup>5</sup>
</xref>
<xref ref-type="aff" rid="I6">
<sup>6</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Banfield</surname>
<given-names>Jillian F.</given-names>
</name>
<xref ref-type="aff" rid="I2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Heidelberg</surname>
<given-names>Karla B.</given-names>
</name>
<xref ref-type="aff" rid="I1">
<sup>1</sup>
</xref>
</contrib>
</contrib-group>
<aff id="I1">
<sup>1</sup>
Department of Biological Sciences, Dornsife College of Letters, Arts and Sciences, University of Southern California, 3616 Trousdale Parkway, Los Angeles, CA 90089, USA</aff>
<aff id="I2">
<sup>2</sup>
Cooperative Institute for Research in Environmental Sciences, CIRES Building, Room 318, University of Colorado Boulder, Boulder, CO 80309, USA</aff>
<aff id="I3">
<sup>3</sup>
Department of Environmental Science, Policy and Management, University of California, Berkeley, 54 Mulford Hall, Berkeley, CA 94720, USA</aff>
<aff id="I4">
<sup>4</sup>
Research School of Earth Sciences, The Australian National University, Canberra, ACT 0200, Australia</aff>
<aff id="I5">
<sup>5</sup>
Division of Biological Sciences, University of California, San Diego, La Jolla, CA 92093-0202, USA</aff>
<aff id="I6">
<sup>6</sup>
Marine Biology Research Division, Scripps Institution of Oceanography, La Jolla, CA 92093, USA</aff>
<author-notes>
<corresp id="cor1">*Benjamin J. Tully:
<email>btully@usc.edu</email>
</corresp>
<fn fn-type="other">
<p>Academic Editor: Timothy Williams</p>
</fn>
</author-notes>
<pub-date pub-type="collection">
<year>2015</year>
</pub-date>
<pub-date pub-type="epub">
<day>1</day>
<month>2</month>
<year>2015</year>
</pub-date>
<volume>2015</volume>
<elocation-id>875784</elocation-id>
<history>
<date date-type="received">
<day>19</day>
<month>6</month>
<year>2014</year>
</date>
<date date-type="rev-recd">
<day>2</day>
<month>9</month>
<year>2014</year>
</date>
<date date-type="accepted">
<day>16</day>
<month>9</month>
<year>2014</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright © 2015 Benjamin J. Tully et al.</copyright-statement>
<copyright-year>2015</copyright-year>
<license license-type="open-access">
<license-p>This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
</license>
</permissions>
<abstract>
<p>Hypersaline systems near salt saturation levels represent an extreme environment, in which organisms grow and survive near the limits of life. One of the abundant members of the microbial communities in hypersaline systems is the square archaeon,
<italic>Haloquadratum walsbyi</italic>
. Utilizing a short-read metagenome from Lake Tyrrell, a hypersaline ecosystem in Victoria, Australia, we performed a comparative genomic analysis of
<italic>H. walsbyi</italic>
to better understand the extent of variation between strains/subspecies. Results revealed that previously isolated strains/subspecies do not fully describe the complete repertoire of the genomic landscape present in
<italic>H. walsbyi</italic>
. Rearrangements, insertions, and deletions were observed for the Lake Tyrrell derived
<italic>Haloquadratum</italic>
genomes and were supported by environmental
<italic>de novo</italic>
sequences, including shifts in the dominant genomic landscape of the two most abundant strains. Analysis pertaining to halomucins indicated that homologs for this large protein are not a feature common for all species of
<italic>Haloquadratum</italic>
. Further, we analyzed ATP-binding cassette transporters (ABC-type transporters) for evidence of niche partitioning between different strains/subspecies. We were able to identify unique and variable transporter subunits from all five genomes analyzed and the
<italic>de novo</italic>
environmental sequences, suggesting that differences in nutrient and carbon source acquisition may play a role in maintaining distinct strains/subspecies.</p>
</abstract>
</article-meta>
</front>
<body>
<sec id="sec1">
<title>1. Background</title>
<p>The microbiology of low diversity, extreme hypersaline environments has been extensively studied providing a more detailed understanding of extant communities [
<xref rid="B1" ref-type="bibr">1</xref>
<xref rid="B4" ref-type="bibr">4</xref>
]. The microorganisms, including predominantly halophilic Archaea, in these extreme environments thrive in high salt concentrations that define that which can sustain life, providing experimental model systems to evaluate the limits of microbial growth. Biological stressors in these environments are not limited to high salt concentrations but also include high temperatures, intense UV exposure, fluctuating oxygen levels, and variable ionic ratios that can impact the establishment of electrochemical gradients [
<xref rid="B5" ref-type="bibr">5</xref>
]. Even with limiting oxygen concentrations, many of the microorganisms in these systems grow optimally under aerobic, heterotrophic conditions [
<xref rid="B6" ref-type="bibr">6</xref>
].</p>
<p>
<italic>H. walsbyi</italic>
is known for its dominance in hypersaline environments and its unique square-shaped morphology [
<xref rid="B7" ref-type="bibr">7</xref>
].
<italic> H. walsbyi</italic>
is an obligate halophile and has a number of adaptations that appear linking its ability to grow in environments with high salt concentrations. Due to its flat cell structure,
<italic> H. walsbyi</italic>
has the highest surface area-to-volume (s/v) ratio of any microbe. Individual cells can measure 2 
<italic>μ</italic>
m × 2 
<italic>μ</italic>
m × 0.2 
<italic>μ</italic>
m, but sheets of cells have been shown to grow on scales up to 20–40 
<italic>μ</italic>
m
<sup>2</sup>
[
<xref rid="B8" ref-type="bibr">8</xref>
]. The immense size of the
<italic> H. walsbyi</italic>
sheets is possible as the s/v ratio is only dependent on the thickness of the cells. This high s/v ratio is directly linked to the importance of membrane processes in cell development [
<xref rid="B9" ref-type="bibr">9</xref>
].
<italic> H. walsbyi</italic>
has large suite of cellular transport proteins essential to maintain its predominantly heterotrophic lifestyle [
<xref rid="B10" ref-type="bibr">10</xref>
]. However, it is also capable of utilizing bacteriorhodopsins to support phototrophic growth [
<xref rid="B11" ref-type="bibr">11</xref>
]. This process is enhanced by the presence of bacteriorhodopsin proteins on both faces of the cell, as incident light can pass through the thin cellular cytoplasm [
<xref rid="B10" ref-type="bibr">10</xref>
].
<italic> H. walsbyi</italic>
is nonmotile but utilizes gas vesicles to position itself within the water column to presumably maximize both irradiance [
<xref rid="B10" ref-type="bibr">10</xref>
] and available oxygen, as the solubility of oxygen in hypersaline environments is low [
<xref rid="B12" ref-type="bibr">12</xref>
] and the removal of oxygen through heterotrophic processes results in the limited penetration of oxygen within shallow hypersaline systems.
<italic> H. walsbyi</italic>
is unique amongst the class Halobacteriaceae in that it has a substantially lower genomic percent G + C (%G + C) content [
<xref rid="B13" ref-type="bibr">13</xref>
<xref rid="B15" ref-type="bibr">15</xref>
]. Two hypotheses have been proposed to explain this sharp deviation from the other halophilic Archaea. First, low %G + C is thought to be common among marine oligotrophs as one possible mechanism for reducing nitrogen demands [
<xref rid="B16" ref-type="bibr">16</xref>
]. More likely the low %G + C helps to decrease DNA stability caused by the overstabilization of high internal Mg
<sup>2+</sup>
concentrations, in a similar fashion as to how thermophiles have increased %G + C to increase DNA stability in high temperature environments [
<xref rid="B17" ref-type="bibr">17</xref>
]. Interestingly, 16S rRNA gene studies have shown that
<italic> H. walsbyi</italic>
has limited divergence (≤2%) in the 16S rRNA gene [
<xref rid="B4" ref-type="bibr">4</xref>
] on a global scale compared to other groups within the Halobacteriaceae (~7%), indicating that
<italic> H. walsbyi</italic>
may be more evolutionarily constrained.</p>
<p>To date, the most extensive genomic comparison of
<italic> H. walsbyi</italic>
has focused on two isolated strains cultured from salterns in Spain (str. DSM16790) [
<xref rid="B9" ref-type="bibr">9</xref>
] and Australia (str. C23) [
<xref rid="B18" ref-type="bibr">18</xref>
] that have been fully sequenced [
<xref rid="B10" ref-type="bibr">10</xref>
,
<xref rid="B19" ref-type="bibr">19</xref>
]. Some previous metagenomic work was performed from the Spain saltern using fosmid end clone libraries to explore
<italic> H. walsbyi</italic>
diversity within the system and revealed that there was underlying diversity within the system, including novel
<italic> Haloquadratum</italic>
-related clones and the identification of four genomic islands [
<xref rid="B13" ref-type="bibr">13</xref>
,
<xref rid="B20" ref-type="bibr">20</xref>
]. Comparative analysis of the complete DSM16790 and C23 genomes revealed a lack of genomic rearrangements with much of the variation between the genomes occurring as the result of insertions/deletions of small- and large-scale genomic regions [
<xref rid="B19" ref-type="bibr">19</xref>
]. The limited genomic variation of these two genomes and low 16S rRNA gene divergence has led to a proposed hypothesis that
<italic> H. walsbyi</italic>
is globally homogenous in genomic content and is either highly dispersed or under strict evolutionary constraints, resulting in limited biogeography [
<xref rid="B19" ref-type="bibr">19</xref>
]. Expanding comparative analysis of
<italic> Haloquadratum</italic>
genomes beyond previous results identified from fosmid-based metagenomes [
<xref rid="B13" ref-type="bibr">13</xref>
,
<xref rid="B20" ref-type="bibr">20</xref>
] would allow for a better understanding of conserved genomic properties within the genus.</p>
<p>Previous results from Lake Tyrrell, Victoria, Australia, a naturally occurring thalassohaline hypersaline lake, have been used to reconstruct several genomes for uncultured [
<xref rid="B21" ref-type="bibr">21</xref>
] and novel strains of halophilic Archaea [
<xref rid="B14" ref-type="bibr">14</xref>
,
<xref rid="B22" ref-type="bibr">22</xref>
] and viruses [
<xref rid="B23" ref-type="bibr">23</xref>
,
<xref rid="B24" ref-type="bibr">24</xref>
]. Previous work on halophilic Archaea in this system was performed on metagenomic samples collected in 2007 and 2008 using long-read Sanger- and 454-derived sequences. Several draft and composite genomes were generated using the long-read Sanger dataset from 2007 Lake Tyrrell metagenome, resulting in two additional
<italic> H. walsbyi</italic>
genomes (J07HQW1 and J07HQW2), as well as the first genome of a separate candidate species of the genus
<italic> Haloquadratum</italic>
(J07HQX50) [
<xref rid="B14" ref-type="bibr">14</xref>
]. In this study, we use metagenomic samples collected in 2010 using short-read Illumina paired-end sequences to explore the genomic diversity of the most abundant halophilic Archaea in the system,
<italic> Haloquadratum walsbyi</italic>
. We used
<italic> Haloquadratum</italic>
-related assemblies of the 2010 Lake Tyrrell metagenome to specifically evaluate
<italic> H. walsbyi</italic>
heterogeneity. Comparative analysis of the five
<italic> Haloquadratum</italic>
genomes and the environmental assemblies has revealed new insights into the genomic landscape of these abundant halophilic Archaea.</p>
</sec>
<sec id="sec2">
<title>2. Materials and Methods</title>
<sec id="sec2.1">
<title>2.1. Sample Collection and Metagenome Sequencing</title>
<p>Samples were collected using a serial filtration method in January 2010 (Austral summer) from evaporative brine samples collected over a 4-day time-series experiment at Lake Tyrrell in Western Victoria, Australia (35.52°S, 142,80°W). As has been reported in Heidelberg et al. [
<xref rid="B25" ref-type="bibr">25</xref>
], salt concentrations were typically > 300 g L
<sup>−1</sup>
, and samples were collected from a shallow (<20 cm depth) pool separated from the main lake by a salt barrier. As described in Narasingarao et al. [
<xref rid="B21" ref-type="bibr">21</xref>
] and Podell et al. [
<xref rid="B14" ref-type="bibr">14</xref>
], the water samples were prefiltered through a 20 
<italic>μ</italic>
m Nytex screen and then serially filtered through 3.0, 0.8, and 0.1 
<italic>μ</italic>
m polyethersulfone filters (Pall Corporation, NY, USA). DNA was extracted from the 0.1 
<italic>μ</italic>
m, 0.8 
<italic>μ</italic>
m, and 3.0 
<italic>μ</italic>
m filters, as described in Heidelberg et al. [
<xref rid="B25" ref-type="bibr">25</xref>
]. Sequencing was performed at the J. Craig Venter Institute (JCVI, Rockville, MD, USA) using a paired 2 × 100 bp Illumina sequencing platform with a 100 bp insert size.</p>
</sec>
<sec id="sec2.2">
<title>2.2. Assembly, Binning, and Annotation</title>
<p>Samples were assembled using an iterative process of assembly using IDBA-UD (v. 1.1.0) [
<xref rid="B26" ref-type="bibr">26</xref>
], binning via hierarchical clustering of tetranucleotide sequences, and sequence recruitment using the BWA aligner (v. 0.6.1-r104) [
<xref rid="B27" ref-type="bibr">27</xref>
] and the Ref_Select function of the program suite, SEAStAR (v. 0.4.17) [
<xref rid="B28" ref-type="bibr">28</xref>
]. For all assemblies, IDBA-UD was used with the default settings and the precorrection setting (- -pre_correction) turned on. For all read recruitment steps, the FASTQ files for the forward and reverse reads were aligned to the contiguous DNA sequences (contigs) using the BWA aligner with the
<italic> aln</italic>
program to generate the alignment and
<italic> samse</italic>
to generate the corresponding SAM file, with the maximum number of differences allowed for both processes set at four (-n 4). FASTA files were then generated using the ref_select command within SEAStAR.</p>
<p>For the first round of assembly, genomic data from each sample filter was independently assembled to generate contigs. Contigs greater than 5,000 bp in length were binned using hierarchical clustering of tetranucleotide sequences using Pearson's correlation value of 0.80 as a cutoff. The putative coding DNA sequences (CDSs) for all clusters of contigs with a total span of greater than 10,000 bp were determined using the program FragGeneScan (v. 1.16) [
<xref rid="B29" ref-type="bibr">29</xref>
]. Translated putative CDSs were compared using BLAST against a database of the protein sequences of 16 organisms from the family Halobacteriaceae, two organisms from the class Nanohaloarchaea (see Supplemental Table S1 available online at
<ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.1155/2014/875784">http://dx.doi.org/10.1155/2014/875784</ext-link>
). The contigs from all bins for which >60% of the translated putative CDS were assigned to the genus
<italic> Haloquadratum</italic>
were retained and used to recruit the corresponding sequence reads using SEAStAR.</p>
<p>For the second round of assembly, multiple filters from the same sample were assembled together. Contigs greater than 5,000 bp in length were then clustered, as above. Based on assessment of the clusters, it was determined that a lower threshold of Pearson's correlation (0.50) would sufficiently capture similar contigs. Thirteen bins were identified with a total span of contigs > 1 Mb (largest bin = 6.4 Mb). The sequence reads from each individual filter were then recruited to the contigs within each of the identified bins (e.g., sample LT71 was divided into three bins containing greater than 1 Mb of sequence. The sequences from the 0.1 
<italic>μ</italic>
m and 0.8 
<italic>μ</italic>
m filters were then recruited against the contigs of each bin separately to generate new FASTA files).</p>
<p>For the final round of assembly, sequences from each bin and the corresponding filter were assembled (e.g., sequences from the 0.1 
<italic>μ</italic>
m filter of sample LT71 would be in 3 FASTA files representing the bins identified in the second round of assembly). For further analysis of the
<italic> de novo</italic>
assemblies, only contigs > 50,000 bp in length were considered. Putative CDSs were determined using the RAST Annotation Server [
<xref rid="B30" ref-type="bibr">30</xref>
] with the following settings applied: gene caller = RAST; release64; automatically fix errors = yes; backfill gaps = yes.</p>
</sec>
<sec id="sec2.3">
<title>2.3. Recruitment Statistics</title>
<p>The genome sequences of the five currently available
<italic> Haloquadratum</italic>
species were downloaded from the IMG database (
<italic>Haloquadratum walsbyi</italic>
C23, DSM16790, J07HQW1, J07HQW2, and
<italic> Haloquadratum</italic>
sp. J07HQX50) [
<xref rid="B31" ref-type="bibr">31</xref>
]. The genomes of J07HQW1, J07HQW2, and J07HQX50 were reconstructed from environmental samples collected in 2007 from Lake Tyrrell. The J07HQW1 and J07HQW2 genomes have high coverage with long-read Sanger sequencing (9.0X and 8.7X, resp.) and limited gaps along the length of a single closed scaffold (gaps along 0.2% and 0.5% of the total predicted genome, resp.), for which all of the gaps are shorter in length (~50–100 bp) than is spanned by both the plasmids (8–10 kbp) and fosmids (35 kbp) used to generate sequence reads. All genomes were used in conjunction with the binned contigs generated after the second round of assembly to recruit, using SEAStAR, the maximum number of putative
<italic> Haloquadratum</italic>
-related sequences from all of the samples and filters. This pool of redundant sequences included those recruited from the entire Lake Tyrrell metagenome against the IMG genomes and pooled sequence reads from each filter of each sample against the set of contigs derived after the second round of assembly. The recruited sequences were processed to remove duplicated sequences and then identical sequences using the unique.seqs command from the mothur suite of bioinformatic tools [
<xref rid="B32" ref-type="bibr">32</xref>
]. The sequences were then processed to remove sequences that had become unmated during the process. The remaining pool of sequences was aligned against the
<italic> Haloquadratum</italic>
genomes, including extrachromosomal DNA. Alignments were performed using Geneious (v. 6.1.6) using the “Map to Reference” program with the following settings applied: medium-low sensitivity/fast; iterate up to 5 times; do not trim.</p>
<p>The three samples that had different filter fraction sizes (LT71, LT80, and LT85) were aligned to the
<italic> Haloquadratum</italic>
genomes derived previously from Lake Tyrrell (J07HQW1, J07HQW2, and J07HQX50) to estimate the relative abundance of each organism in different filter fractions. Alignments to the genomes were performed as shown above.</p>
</sec>
<sec id="sec2.4">
<title>2.4. Whole Genome Alignments and Genome Synteny</title>
<p>Contigs greater than 50,000 bp in length generated after the third round of assembly were aligned to the J07HQW1 genome using the progressive MAUVE genome aligner [
<xref rid="B33" ref-type="bibr">33</xref>
]. Contigs were manually assigned into one of four categories based on the degree of synteny compared to the reference genome: high, medium, low, and no synteny. Contigs were assigned to the high category of synteny if the resulting alignment indicated that the entire contig contained colinearized regions compared to J07HQW1 or approximately 70–90% of the contig had colinearized regions, but these regions were rearranged compared to J07HQW1. Contigs were assigned to medium category if approximately 30–70% of the contigs had colinearized regions. The low category contained sequences with less than 30% of the contig possessed colinearized regions. The none category had little to no alignment to the reference genome. Contigs in the no synteny category were further aligned to the other
<italic> Haloquadratum</italic>
genomes to determine if these sequences represented regions in the other organisms but not contained in the J07HQW1 genome.</p>
<p>Large regions of the J07HQW1 genome to which many environmental contigs aligned were identified for further analysis. The contigs and the corresponding segment of the J07HQW1 genome were aligned using the progressive MAUVE genome aligner to the
<italic> Haloquadratum walsbyi</italic>
genomes (C23, DSM16790, and J07HQW2) and manually inspected for insertions, deletions, and rearrangements. Putative gene content for all insertions and deletions was determined where possible.</p>
</sec>
<sec id="sec2.5">
<title>2.5. Genes of Interest: Halomucins and ABC-Type Transporters</title>
<p>Halomucins were annotated previously in the
<italic> Haloquadratum walsbyi</italic>
C23 and DSM16790 genomes. These sequences were used to identify similar sequences in the previously assembled genomes from Lake Tyrrell and the environmental contigs generated for this research. Alignments of putative CDSs were performed using the CLUSTALW [
<xref rid="B34" ref-type="bibr">34</xref>
] with the following settings applied: cost matrix = IUB; gap open cost = 15; gap extend cost = 6.66. Putative signal peptides were identified using SignalP (v. 4.1) with cutoffs between 0.45 and 0.51 [
<xref rid="B35" ref-type="bibr">35</xref>
].</p>
<p>A database was constructed from all of the putative protein sequences related to ABC-type transporters, including all subunits of the transporters (i.e., ATPase, substrate-binding, and permease), from all of the previously annotated
<italic> Haloquadratum</italic>
genomes. Each
<italic> Haloquadratum</italic>
genome was then compared using BLAST [
<xref rid="B36" ref-type="bibr">36</xref>
] to a subset of this database that did not include proteins within that specific genome. ABC-type transporters were identified from the environmental annotations and compared to the total database using BLAST. BLAST results were parsed for ABC-type transporter subunits that did not have significant BLAST matches to the database or had less than 80% AAID. Putative substrates and functions of the subunits were derived from available annotations and not independently verified.</p>
</sec>
</sec>
<sec id="sec3">
<title>3. Results and Discussion</title>
<sec id="sec3.1">
<title>3.1. Sample Collection and Metagenome Sequencing</title>
<p>Surface water from five time points was sampled via filtration over the course of 4 days from Lake Tyrrell in January 2010. Based on library construction and sequencing success, eight samples were chosen for inclusion in this study, including three time points for which both small (0.1 
<italic>μ</italic>
m) and large (0.8 or 3.0 
<italic>μ</italic>
m) filter fractions were obtained. The final metagenome consisted of 135,239,438 trimmed, high quality, and paired-end sequences that contained 12.7 Gbp of data (
<xref ref-type="table" rid="tab1">Table 1</xref>
).</p>
</sec>
<sec id="sec3.2">
<title>3.2. Assembly and Binning</title>
<p>An iterative assembly and binning process was used to reduce complexity and enrich
<italic> Haloquadratum</italic>
sequences in the combined dataset. The initial round of assembly generated 5,403 contigs greater than 5,000 bp in length, for which 856 bins were generated using hierarchical clustering of tetranucleotide frequencies. Of these 856 bins, 424 were putative
<italic> Haloquadratum</italic>
origin based on comparison to a database of class Halobacteriaceae and class Nanohaloarchaea, containing 2,096 contigs. Sequence reads were recruited to the contigs putatively related to
<italic> Haloquadratum</italic>
. The second round of assembly resulted in a total of 1,965 of the generated contigs that were >5,000 bp in length. These contigs were subjected to tetranucleotide hierarchical clustering, as above; however, visual inspection of the clustering relationship suggested that Pearson's correlation cutoff of 0.50 would be more inclusive of the assembly results (i.e., generating larger bins), while simultaneously dividing the dataset into distinct genomic units (Supplemental Figure S1). In total, 13 bins contained over 1 Mbp in assemblies, with the largest bin containing 6.4 Mbp of sequence data.</p>
<p>For the final round of assembly, the sequence reads from each filter fraction were recruited against the contigs within each bin from a single sample and reassembled (i.e., sample LT71 had 2 filter fractions and 3 identified bins; each filter was recruited against each bin, such that 6 total assemblies were performed) (Supplemental Table S2). Results from this round of assembly indicated that, for assembly statistics, including N50, mean length, and total length, the values increased. Only the maximum length statistic had a relatively small decline, but this decrease was offset by the increase in both N50 and mean length (
<xref ref-type="table" rid="tab2">Table 2</xref>
; Supplemental Table S2). The third round of assembly produced 195 contigs at greater than 50,000 bp in length, which was used for further analysis. Annotations of the contigs identified 27,801 putative CDSs (for additional results see Supplemental Information).</p>
</sec>
<sec id="sec3.3">
<title>3.3. Recruitment to Reference Genomes</title>
<p>Several of the 2010 Lake Tyrrell metagenome samples had multiple sequenced filter fractions (LT71, LT80, and LT85), which included a small filter fraction (0.1 
<italic>μ</italic>
m) and either a 0.8 
<italic>μ</italic>
m filter (LT71) or a 3.0 
<italic>μ</italic>
m filter (LT80 and LT82). Recruiting the 2010 metagenomic sequences against the genomes assembled from the 2007 Lake Tyrrell metagenome revealed the genomes recruited 13–30% of the total library from the 3.0 
<italic>μ</italic>
m filter, compared to <7% of the library from the 0.1 
<italic>μ</italic>
m filters. These results suggest that a majority of the
<italic> Haloquadratum</italic>
populations in the Lake Tyrrell system exist as aggregates larger than 3.0 
<italic>μ</italic>
m in size and expand on results identified in a 16S rDNA analysis of the Spanish saltern from which DSM16790 was isolated [
<xref rid="B13" ref-type="bibr">13</xref>
] (for additional results see Supplemental Information).</p>
<p>A comparative analysis of two
<italic> H. walsbyi</italic>
genomes from cultured representative originally isolated from salterns in Spain and SE Australia has suggested that
<italic> H. walsbyi</italic>
, as a species, is mostly uniform in genomic content across a large dispersal range implying a lack of biogeography for these organisms [
<xref rid="B19" ref-type="bibr">19</xref>
]. Heterogeneity within the Spanish saltern had been observed previously in an end-sequenced fosmid metagenome [
<xref rid="B13" ref-type="bibr">13</xref>
]. Sequencing of full-length fosmids revealed genomic islands (GIs) along the DSM16790 genome and determined that about ~50% of fully sequenced fosmids were nonsyntenic [
<xref rid="B20" ref-type="bibr">20</xref>
]. However, the number of available
<italic> H. walsbyi</italic>
genomes and the depth of sequencing limited both fosmid metagenome analyses. The Lake Tyrrell genomes were derived from environmental sequences from a portion of the lake used for commercial salt production, similar to saltern crystallizer ponds, while previously genomes of
<italic> Haloquadratum</italic>
were obtained using laboratory isolates [
<xref rid="B10" ref-type="bibr">10</xref>
,
<xref rid="B19" ref-type="bibr">19</xref>
]. The Lake Tyrrell genomes and
<italic> H. walsbyi</italic>
C23 are both derived from hypersaline environments in Australia separated by ~330 km allowing for further testing of the uniform nature of their genomic content. Recruitment of the environmental sequences against the
<italic> Haloquadratum</italic>
genomes, including DSM16790 and C23, was used to determine how related the Lake Tyrrell environmental sequences were to the finished genomes.</p>
<p>A subset of 128,992,632 redundant sequences was recruited to the
<italic> Haloquadratum</italic>
genomes and the contigs generated after the second round of assembly. Redundant sequences were removed via a two-step procedure: removal of (1) copies of the same sequences that were recruited to multiple reference genomes and (2) sequences that were identical. Unpaired sequences were removed, resulting in 20,477,772 nonredundant, paired, and unique sequences.</p>
<p>Results from the alignment of these sequences indicate that the most completely covered genome by the environmental sequences was the J07HQW1 genome (99.96% coverage) followed closely by the J07HQW2 genome (99.7% coverage) (
<xref ref-type="table" rid="tab3">Table 3</xref>
). The coverage of J07HQW1 and J07HQW2 is higher than the coverage of the C23 (92.3% coverage) and DSM16790 (93.3% coverage) genomes and substantially higher than the J07HQX50 genome (mean = 84.0% coverage). The lower coverage of the J07HQX50 genome is expected, as previous results have shown that J07HQX50 represents a smaller proportion of the
<italic> Haloquadratum</italic>
population (6.9% compared to 74.1% in 2007), and therefore the nonredundant sequences will contain substantially less J07HQX50-like sequences resulting in lower coverage of the genome. The mean coverage depth for the
<italic> H. walsbyi</italic>
genomes is greater than 222 times coverage (range = 0–6,090X coverage). The C23 and DSM16790 genomes have similar values for mean coverage to J07HQW1 and J07HQW2 but do not have the same percent coverage, suggesting that there is a distinction between the genomic contents of these
<italic> Haloquadratum </italic>
species. Coverage of the PL6B plasmid suggests it may be present in upwards of 32–40% of the
<italic> Haloquadratum</italic>
population (see Supplemental Information). Both J07HQW1 and J07HQW2 were not completely aligned across 100% of their respective genomes, potentially indicating that there has also been some genomic variation in the form of insertions or deletions that has occurred in approximately three years since the genomes were sampled, though there is the potential for stochastic variation in the shotgun sequencing methodology.</p>
</sec>
<sec id="sec3.4">
<title>3.4. Whole Genome Alignments</title>
<p>To explore the extent to which genome content and rearrangements influenced the alignment results for the
<italic> H. walsbyi</italic>
genomes, whole genome alignments were performed for the genomes and the environmental contigs generated after the third round of assembly. The 195 contigs > 50,000 bp in length were aligned to the J07HQW1 genome. The contigs in the high and medium synteny category, 82 and 53, respectively, had successful alignments covering much of the reference genome. For the 32 contigs assigned to the no synteny category, it was determined that all of the contigs could be assigned as related to either J07HQW2 or J07HQX50 based on alignments to the respective genome. While some of these contigs contained segments that possessed novel gene content (data not shown), a majority of the contigs, for all of the categories, could be identified as having genomic architecture similar to one of the three Lake Tyrrell
<italic> Haloquadratum</italic>
genomes.</p>
<p>Regions were selected that highlight large-scale genomic rearrangements and heterogeneous gene content. From the initial alignments that spanned the length of the J07HQW1 genome, 14 regions were identified along the J07HQW1 genome that met the criteria for further examination; the region had to (1) be at least 60,000 bp in length and (2) have multiple environmental contigs spanning the region of interest. 122 contigs were identified from these regions of interest. For each region of interest, the corresponding environmental contigs and
<italic> H. walsbyi</italic>
genomes were aligned together. These data revealed numerous features previously unseen in the comparison with C23 and DSM16790, including large genomic rearrangements and substantial genomic insertions and deletions. Three such variable regions are presented in detail below (additional information for each region in Supplemental Information).</p>
<sec id="sec3.4.1">
<title>3.4.1. Region Spanning 600,000–770,000 bp along J07HQW1</title>
<p>This region spans ~170 kbp of the J07HQW1 genome, but the corresponding regions in the other
<italic> H. walsbyi</italic>
genomes are smaller in scale as a result of a large insertion/deletion of 16 CDSs common for J07HQW1 and C23, plus additional 33 insertions along the J07HQW1 genome (
<xref ref-type="fig" rid="fig1">Figure 1</xref>
). Many of the 33 insertions along the J07HQW1 genome appear to be noncoding, although there were several annotated transposase or transposase-like CDSs, as well as annotated CDSs with putative cellular functions. The 16 CDS segment of J07HQW1 and C23 is poorly annotated but contains several homologs of
<italic> ftsZ</italic>
/GTPase domain containing CDSs, a gene family required for successful cell division, specifically in the formation of daughter cells.</p>
<p>There are five environmental contigs that appear to be more closely related to the J07HQW2 genome due to the lack of the 16 CDS segment, described above, and the presence of a ~50 kbp inversion in the same genomic landscape near the insertion segment found in C23 and J07HQW1. The full length of the environmental contigs is syntenic to the J07HQW2 genome. Interestingly, DSM16790 lacks both the 16 CDS segment and the inversion seen in J07HQW2 and the environmental contigs, suggesting that there are at least three potential orientations for this segment, and the inserted/deleted sequences are not required for the inversion. This result is interesting because numerous results indicated that J07HQW1 and J07HQW2 are present in the environment in about equal abundance. However, for the region all six environmental contigs (from four different samples) possess the J07HQW2 orientation. While difficult to understand completely, as this result may be due to the incomplete nature of metagenomic sampling, this could be evidence of a change in the dominant genomic architecture for this genomic region in Lake Tyrrell to the J07HQW2 orientation.</p>
</sec>
<sec id="sec3.4.2">
<title>3.4.2. Region Spanning 1,600,000–1,660,000 bp along J07HQW1</title>
<p>This region is shared between J07HQW1 and J07HQW2 but is split over two portions of the C23 and DSM16790 genomes separated by ~200 kbp (approximate positions: 1,240–1,270 kbp and 1,470–1,530 kbp), though all four genomes have similar gene content (
<xref ref-type="fig" rid="fig2">Figure 2</xref>
). The ~200 kbp region present in C23 and DSM16790 has previously been identified as a genomic island [
<xref rid="B20" ref-type="bibr">20</xref>
]. Gene content variation between the
<italic> H. walsbyi</italic>
genomes includes a number of putative CDSs with predicted functions.</p>
<p>The four environmental contigs have a high degree of similarity between J07HQW1 and J07HQW2, but the gene content suggests that for this region the dominant genomic architecture is that of J07HQW1. The longest environmental contig (ID: LT71_0.8_B_scaffold_0) has an additional ~40 kbp segment at the end of the sequence compared to the other contigs. This segment is a large rearrangement relative to J07HQW1 and is syntenic to a segment of the genome at the approximate position, 2,232–2,284 kbp. A single ~55 kbp environmental contig (ID: LT80_0.1_B_scaffold_30) has full synteny to the J07HQW2 genome (approximate position 3,283–3,336 kbp). Unlike the above region, these contigs support previous research that suggests the J07HQW1 genomic architecture is the more abundant gene synteny in the Lake Tyrrell system, while J07HQW2 represents a second distinct synteny. Yet despite the similarities, the largest environmental contig still represents a large-scale rearrangement of the J07HQW1 genome, potentially suggesting a genomic landscape undergoing episodes of rearrangement.</p>
</sec>
<sec id="sec3.4.3">
<title>3.4.3. Region Spanning 2,619,000–2,702,000 bp along J07HQW1</title>
<p>For this region of interest, the overall genomic structure is conserved for all four
<italic> H. walsbyi</italic>
genomes (
<xref ref-type="fig" rid="fig3">Figure 3</xref>
). There are several gene indels, including a defining feature of J07HQW1, C23, and DSM16790 compared to J07HQW2, in the form of a hypothetical protein (J07HQW1_02778). C23 has an insertion of 13 CDSs that includes a number of annotated genes, such as an ISH11-type transposase (Hqrw_3137), an ABC-type transport operon (ATPase, substrate-binding, and membrane permease subunits) without an annotated target substrate (Hqrw_3141-3145), and two homologs of CrcB proteins (Hqrw_3147 and 3148), related to camphor resistance and chromosome condensation.</p>
<p>The environmental contigs can be separated into those related to J07HQW1 (3 contigs) and those related to J07HQW2 (4 contigs) based on the presence/absence of the hypothetical protein mentioned above. The longest of the four contigs (ID: LT75_0.8_A_scaffold_6) (~160 kbp) is fully syntenic to the J07HQW2 genome (1,240–1,400 kbp). An ~86 kbp contig (ID: LT71_0.1_A_scaffold_2) contains a high degree of rearrangement compared to the J07HQW1 genome along a ~40 kbp span of the contig. This span is syntenic to seven different segments of the J07HQW1 genome and while there is some synteny to the other
<italic> Haloquadratum</italic>
genomes all of the other alignments have substantial differences between the sequences. Unlike the two previously discussed regions, the split between J07HQW1- and J07HQW2-like contigs is closer to the predicted abundances of these two species in the environment, though LT71_0.1_A_scaffold_2 has a unique genomic structure that has not been previously seen, representing a novel orientation of the
<italic> H. walsbyi</italic>
genome.</p>
<p>The
<italic> de novo</italic>
environmental contigs for the regions described above overwhelming can be assigned to one of the three
<italic> Haloquadratum</italic>
reference genomes generated from the 2007 Lake Tyrrell metagenome. Interestingly, despite evidence suggesting that J07HQW1 and J07HQW2 represent equal proportions of the
<italic> Haloquadratum</italic>
population, two of the three regions had aligned contigs that indicate one of the two
<italic> H. walsbyi</italic>
strains was the dominant genomic landscape in the environment. The
<italic> de novo</italic>
contigs do not accurately represent abundance in the environment but offer an estimate of presence/absence of each region among environmental strains. While stochastic processes may influence the set of
<italic> de novo</italic>
contigs aligned to the reference genomes, the reproduction of similar contigs with the same gene synteny from multiple samples and filters suggests that the results may represent an accurate snapshot of the genomic landscape of the
<italic> Haloquadratum</italic>
community and demonstrate recombination events that have swept to a majority of the
<italic> H. walsbyi</italic>
individuals. Further, much of the observed variability is between the genomes of the
<italic> H. walsbyi</italic>
strains; only a minority of the
<italic> de novo</italic>
contigs demonstrate genomic variation and rearrangement, suggesting that the majority of the
<italic> H. walsbyi</italic>
individuals within the community represent stable gene synteny that has persisted over the three-year period since J07HQW1 and J07HQW2 were sampled.</p>
</sec>
</sec>
<sec id="sec3.5">
<title>3.5. Genes of Interest</title>
<sec id="sec3.5.1">
<title>3.5.1. Halomucins</title>
<p>In the C23 and DSM16790 genomes, several large protein-coding sequences (>7,000 amino acids) were identified as mucin homologs, a family of high molecular weight proteins that prevent desiccation, and were given the term halomucin (
<italic>hmu</italic>
). These proteins are proposed to have a role in protecting the cell against desiccation and/or creating a localized environment with higher water activity than the surrounding environment [
<xref rid="B10" ref-type="bibr">10</xref>
,
<xref rid="B19" ref-type="bibr">19</xref>
]. Both of the J07HQW1 and J07HQW2 genomes lack annotated
<italic> hmu</italic>
genes or putative candidates.</p>
<p>The only indication of a fully functional
<italic> hmu</italic>
homolog for sequences generated from Lake Tyrrell appears to be a putative CDS on an environmental contig (referred to as
<italic> ehmu1</italic>
; short-hand for environmental-
<italic>hmu</italic>
; ID: LT85_0.1_A_scaffold_0).
<italic> ehmu1</italic>
is the longest putative CDS annotation for the 195 environmental contigs. It is similar in length to the C23 annotated Hmu (7,243 AA versus 7,836 AA, resp.), although both are smaller than the DSM16790 annotated HmuI (9,159 AA). The Hmu homologs of DSM16790 and C23 share 73.8% amino acid identity (AAID), while eHmu1 has 46.0% and 47.9% AAID to DSM16790 and C23, respectively. eHmu1 has several regions in the alignments to DSM16790 and C23 that have > 75% AAID, suggesting that several of the domains common to the Hmu protein are present in eHmu1. Signal peptide prediction of eHmu1 suggests that this putative CDS would undergo translocation across the membrane. Collectively, this evidence suggests that eHmu1 is a homolog of the
<italic> hmu</italic>
gene.</p>
<p>The second and third longest putative CDSs had identical annotations (“large exoprotein involved in heme utilization or adhesion”), were adjacent to each other on a contig from the same sample as eHmu1, but from a different filter fraction (ID: LT85_3.0_A_scaffold_9), and collectively had a length similar to eHmu1 (3,701 AA + 3,527 AA = 7,218 AA). One of these putative CDS could be aligned to the C-terminus of eHmu1 with 99.5% AAID (eHmu2) and the other to the N-terminus of eHmu1 with 94.6% AAID (eHmu3). eHmu2 lacked a detectable signal peptide sequence, while eHmu3 possessed an N-terminus signal peptide, suggesting that it could be translocated across the membrane. Examination of the alignment of eHmu2 and eHmu3 against eHmu1 reveals two deletions relative to the structure of eHmu1, with a 28 bp deletion resulting in a frameshift/stop codon (
<xref ref-type="fig" rid="fig4">Figure 4(a)</xref>
). For many bacteria and Archaea, a stop codon in a putative CDS will generally result in a nonfunctional product. The nature of eHmu2 and eHmu3 implicates that these putative CDSs represent a partially degraded environmental Hmu homolog.</p>
<p>Evidence for another degraded
<italic> hmu</italic>
homolog occurs in the J07HQX50 genome. The longest putative CDS in the genome is 2,413 AA and is annotated as “autotransporter adhesin.” The protein sequence of this putative CDS has 99.7% AAID to the C-terminus of eHmu1. Sequences upstream of the “autotransporter adhesin” contain five more putative CDSs with a collective length of 7,129 AA (2,413 + 989 + 1,285 + 667 + 878 + 897 AA). One of these CDSs (“hypothetical protein,” 878 AA) contains an N-terminus signal peptide sequence. A nucleotide alignment of this segment of the J07QX50 genome (214,814–238,458 bp) reveals that there are areas that have high nucleic acid identity (NAID) with the
<italic> ehmu1</italic>
sequence with two different portions having >98.0% NAID, while the adjacent portions had 83.9%, 40.8%, and 38.9% NAID, as well as an 81 bp gap (
<xref ref-type="fig" rid="fig4">Figure 4(b)</xref>
). The data suggest this series of putative CDSs represents a highly degraded
<italic> hmu</italic>
-like sequence. For the three
<italic> hmu</italic>
homologs detected in the dataset, two appear to be degraded to the point of no longer being functional. eHmu1 appears to be a complete
<italic> hmu</italic>
homolog with the necessary signal peptides required for translocation across the membrane and may play a role by increasing the water activity near the cell in a high salt, low water activity environment. The lack of evidence for a
<italic> hmu</italic>
homolog in the J07HQW1 and J07HQW2 genomes and the degraded nature of eHmu2, eHmu3, and the putative CDSs of the J07HQX50 genome indicates that functional
<italic> hmu</italic>
genes are not a universal adaptation for the genus [
<xref rid="B19" ref-type="bibr">19</xref>
] and suggests that other factors may be driving the successful adaption of
<italic> Haloquadratum</italic>
in high salt environments. However, it is possible that, due to the role plasmids play in haloarchaeal communities, halomucins are present on extrachromosomal genetic elements, but there is currently only evidence to support halomucins as a feature of the genomic chromosome.</p>
</sec>
<sec id="sec3.5.2">
<title>3.5.2. ABC-Type Transport Systems</title>
<p>One particular area of interest in microbial ecology is how microorganisms/strains can successfully occupy the same physical environment without evolutionary pressures driving an extinction event. Evidence suggests that J07HQW1 and J07HQW2 have persisted as distinct strains and J07HQX50 as a different species, over at least three years, such that it is likely that each occupies noncompetitive ecological niches. In examining these niche differentiations, special attention was paid to putative CDSs with annotations related to ATP-binding cassette (ABC) type transporters, a system of active transport that requires the use of ATP to move substrates across the cell membrane. These transport proteins were specifically targeted for this research because they represent the substrates for which organisms are expending energy to move across the membrane. Transport protein activity could govern the available niches that each species may occupy. Additionally, transporter protein variation can be a form of adaptation against viral predation, preventing attachment of viral particles to cell membrane.</p>
<p>The annotated ABC-type transporters for each individual genome and the putative CDSs annotated from the environmental contigs were compared to a database of the genome transporters. The goal was to identify ABC-type transporter components that either (1) do not have an ortholog in the other organisms or (2) have a degree of divergence relative to other genomes. A cutoff of 80% AAID was used to determine if a transporter sequence was divergent from those present in the other
<italic> Haloquadratum</italic>
species. This divergence is particularly relevant for the substrate-binding and permease subunits of the transporters, as changes in the protein structure may impact the specificity and efficiency of substrate transport. Variation in the ATPase subunit is likely to have a limited impact, as the functional aspect must remain consistent for the transporter to function.</p>
<p>The
<italic> Haloquadratum</italic>
genomes had between 138 and 159 annotated ABC-type transport subunits. In general, comparisons between the genomes only revealed a few protein sequences that fell below the 80% AAID threshold. For C23 and DSM16790, the average similarity for transporter components above the threshold was ~98-99% AAID. For the Lake Tyrrell genomes, the average similarity of components above the threshold was ~89–93% AAID. These values are expected as J07HQW2 (91% AAID) and J07HQX50 (89% AAID) are more divergent organisms within the
<italic> Haloquadratum</italic>
clade compared to J07HQW1, C23, and DSM16790. The high average similarity of transporter subunits above the 80% AAID threshold suggests that the diversity below the threshold represents subunits with potentially variable transport properties.</p>
<p>None of the
<italic> H. walsbyi</italic>
strains contained unique ABC-type transporter components. However, each genome did have a distinct set of variant transport components that indicate differentiations between the species/strains (Supplemental Table S3). The results from C23 revealed 10 of 150 ABC-type transporter subunits below the 80% AAID threshold. Of the 10, there were two copper (Cu) permeases (Hqrw_4112 and Hqrw_1178) and zinc (Zn) (Hqrw_2414) and urea/short-chain amides substrate-binding (Hqrw_4030) subunits. In addition to this, there was a full operonic ABC-type transporter without an assigned function (Hqrw_3142-3145). In comparison, DSM16790 had eight variant subunits (of 146 annotated subunits); five came from a complete operon related to branched-chain amino acids (HQ2192A-HQ2197A). J07HQW2 also has subunits related to branched-chain amino acid transport, but it does not include the full operon, only the permease (J07HQW2_03665 and 03668) and ATPase (J07HQW2_03669 and 03670). The other three variant subunits from DSM16790 are related to lipoprotein transport (two permeases and an ATPase) (HQ3476A-3478A), indicating that, unlike C23, which appears to have only variant transporters related to metallic ions and amides transport, DSM16790 may have variant specificity/efficiency in transporting lipoproteins.</p>
<p>J07HQW1 had 11 variant transporter subunits of 141 annotated subunits. Five of the 11 are ATPases, two are annotated as “hypothetical proteins” (J07HQW1_00367 and 00669), and the remaining four are two antimicrobial peptide (J07HQW1_00014 and 00042), one nucleoside (J07HQW1_01905), and one Fe
<sup>3+</sup>
-hydroxamate (J07HQW1_00956) permeases. The Fe
<sup>3+</sup>
-hydroxamate permease has binding sites for both Fe and cobalamin, both of which are competitively scavenged in the Lake Tyrrell system. The antimicrobial peptide permeases could be indicative of allelopathy between J07HQW1 and other organisms within the system.</p>
<p>J07HQW2 had the largest suite of variant transporter subunits for the
<italic> H. walsbyi</italic>
species with 29 of 159 subunits determined to be variants, 13 of which were annotated as ATPases (Supplemental Table S3). J07HQW2 has variant permease and substrate-binding subunits for nitrate/sulfonate/bicarbonate, spermidine/putrescine, di-/oligopeptide/Ni, phosphate/phosphonate, branched-chain amino acids, and phosphate (PhoT family transporters). The heterotrophic nature of these transport subunits is expected for
<italic> Haloquadratum</italic>
. These transporters are present in the other
<italic> H. walsbyi</italic>
genomes, but the variant substrate-binding and permease subunits of J07HQW2 may possibly indicate different sources/specificities of the substrates for these transporters.</p>
<p>Similar to J07HQW2, J07HQX50 has an entire suite of variant transporters with similar annotations to those found in the other
<italic> Haloquadratum</italic>
genomes, but these differences may be indicative of the niche differentiation between the genera (Supplemental Table S3). J07HQX50 has 138 annotated ABC-type transporter subunits. Three of these subunits were identified as not having an ortholog in the other genomes. One annotation (J07HQXv2_01450) had no similarity to other sequences in the GenBank nonredundant database and was <150 bp in length. The other two annotated subunits without
<italic> Haloquadratum</italic>
orthologs include a spermidine/putrescine substrate-binding protein (J07HQXv2_01469) and a glycine betaine/choline-binding lipoprotein permease (J07HQXv2_02756). J07HQX50 has 86 variant transporter subunits many of whose predicted substrates have been observed in the variants of the other genomes, including spermidine/putrescine, phosphate (PhoT and PstA family transporters), di-/oligopeptides, Fe
<sup>3+</sup>
, cobalamin, amides, cobalt, and branched-chain amino acids. Interestingly, 30 of the variant transporter subunits are annotated as sugar, carbohydrate, and/or monosaccharide subunits. The variety of sugar-related transporters in J07HQX50 may be an indication of the ecological niche that this organism occupies in the system, utilizing a specific subset of simple sugars that allow it to remain present in the system along with the more dominant species.</p>
<p>The environmental contigs had 734 ABC-type transporter subunits identifiable based on RAST annotations. This number includes some degree of redundancy, as there were many contigs within the dataset that may represent the same organism but collected on a different filter fraction or in a different sample. Specific numbers for any individual transporter subunit are not considered as evidence of abundance or importance. As such, five subunits were identified on the environmental contigs that did not have orthologs in the
<italic> Haloquadratum</italic>
genomes. These subunits included two unassigned ATPases, urea and nitrate substrate-binding proteins, and a nonfunctional fragment of an oligopeptide transport protein. The environmental contigs contained 96 variant transporter subunits (Supplemental Table S3). As with J07HQW2 and J07HQX50, many of the predicted substrates for these subunits are related to the heterotrophic nature of these organisms. One group of variant transporter subunits that was not identified in the other
<italic> Haloquadratum</italic>
genomes was predicted to utilize glycerol-3-phosphate and dihydroxyacetone (DHA) as substrates. It has been shown that DSM16790 has the genetic potential to convert both DHA and glycerol-3-phosphate into dihydroxyacetone phosphate (DHAP) [
<xref rid="B10" ref-type="bibr">10</xref>
]. DHAP has been shown to be an important intermediate within the phosphate and carbon metabolisms for DSM16790 because it can be used as substrate for gluconeogenesis or glycolysis or converted via glycerol-1-phosphate into a key component of archaeal membrane lipids [
<xref rid="B37" ref-type="bibr">37</xref>
]. The presence of variants for these transport units within the environmental contigs and not in the analysis of the
<italic> Haloquadratum</italic>
genomes indicates that alternative forms of these subunits may be contributing to niche maintenance in the Lake Tyrrell system.</p>
</sec>
</sec>
</sec>
<sec id="sec4">
<title>4. Concluding Remarks</title>
<p>Detailed genomic comparisons between our environmental metagenome and the publically available
<italic> Haloquadratum</italic>
genomes have revealed a number of new aspects of how these organisms thrive in hypersaline environments. Specifically, this research indicates that the previous assumptions made regarding the homogenous nature of
<italic> H. walsbyi</italic>
gene synteny are not supported when comparisons are made between more representatives of the species and genus. We found both small-scale insertions/deletions in genomic content (i.e., single genes, e.g., transport proteins and hydrolases) and large-scale rearrangements and variation in genomic content. Further, previous results suggesting that halomucins play a crucial role in the adaptation of
<italic> Haloquadratum</italic>
to high salt environments are not supported by results from Lake Tyrrell. While some small portion of Lake Tyrrell
<italic> Haloquadratum</italic>
populations has an intact halomucin homolog, there is evidence that this particular gene is under negative selective pressure in both the environment and J07HQX50. As such, halomucins probably do not represent the only mechanism by which
<italic> Haloquadratum</italic>
spp. maintain cellular homeostasis but are part of a suite of potential mechanisms that allow for growth in high salt environments. Lastly, analysis of ABC-type transporters has given insight into how each member of the genus
<italic> Haloquadratum</italic>
may maintain niche differentiation in the same physical environment, including variations in transporters for metal cofactors (e.g., Fe, Cu, Ni, and Co), phosphonates/phosphates, amides (e.g., spermidine and putrescine), and carbohydrates (e.g., monosaccharides).</p>
</sec>
<sec sec-type="supplementary-material" id="supplementary-material-sec">
<title>Supplementary Material</title>
<supplementary-material content-type="local-data" id="f1">
<caption>
<p>The Supplementary Material: provides increased levels of analysis and discussion in regards to the “Assembly and Binning”, recruitment of the metagenome on to the
<italic>H. walsbyi</italic>
genomes and plasmids from different filter fractions, and the whole genome alignment segments presented in the manuscript, including an additional fourth region of alignment spanning 3,033,000-3,387,000 bp along J07HQW1.</p>
<p>Supplementary figures: illustrate the raw results of the tetranucleotide binning protocol and offer more details for the whole genome aligned segments. Supplementary tables include details regarding the
<italic>Halobacteriaceae</italic>
genomes used to construct the local database, changes to assembly efficiency through the various round of assembly, the genes identified as variant ABC-transporter subunits, and percent recruitment of the Lake Tyrrell
<italic>H. walsbyi</italic>
genomes from different filter fractions of the metagenome.</p>
</caption>
<media xlink:href="875784.f1.pdf" mimetype="application" mime-subtype="pdf" orientation="portrait" id="d35e997" position="anchor"></media>
</supplementary-material>
</sec>
</body>
<back>
<ack>
<title>Acknowledgments</title>
<p>Funding for this was provided by the National Science Foundation (NSF) MCB Award no. 0626526 to J. Banfield, E. Allen, and K. Heidelberg. Cheetham Salt Works (Sea Lake, VIC, AUS) provided permission to collect samples. Assistance with fieldwork was provided by E. Scott, N. Eisenkolb, L.R. Comolli, and J.G. Heidelberg. Special thanks are due to A. Nguyen for providing automation in constructing the initial gene maps used in Figures
<xref ref-type="fig" rid="fig1">1</xref>
<xref ref-type="fig" rid="fig4">4</xref>
.</p>
</ack>
<sec sec-type="conflict">
<title>Conflict of Interests</title>
<p>The authors declare that there is no conflict of interests regarding the publication of this paper.</p>
</sec>
<ref-list>
<ref id="B1">
<label>1</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bodaker</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Sharon</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Suzuki</surname>
<given-names>M. T.</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Comparative community genomics in the Dead Sea: an increasingly extreme environment</article-title>
<source>
<italic>ISME Journal</italic>
</source>
<year>2010</year>
<volume>4</volume>
<issue>3</issue>
<fpage>399</fpage>
<lpage>407</lpage>
<pub-id pub-id-type="doi">10.1038/ismej.2009.141</pub-id>
<pub-id pub-id-type="other">2-s2.0-77149143533</pub-id>
<pub-id pub-id-type="pmid">20033072</pub-id>
</element-citation>
</ref>
<ref id="B2">
<label>2</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Boujelben</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Gomariz</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Martínez-García</surname>
<given-names>M.</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Spatial and seasonal prokaryotic community dynamics in ponds of increasing salinity of Sfax solar saltern in Tunisia</article-title>
<source>
<italic>Antonie van Leeuwenhoek</italic>
</source>
<year>2012</year>
<volume>101</volume>
<issue>4</issue>
<fpage>845</fpage>
<lpage>857</lpage>
<pub-id pub-id-type="doi">10.1007/s10482-012-9701-7</pub-id>
<pub-id pub-id-type="other">2-s2.0-84862672984</pub-id>
</element-citation>
</ref>
<ref id="B3">
<label>3</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Demergasso</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Escudero</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Casamayor</surname>
<given-names>E. O.</given-names>
</name>
<name>
<surname>Chong</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Balagué</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Pedrós-Alió</surname>
<given-names>C.</given-names>
</name>
</person-group>
<article-title>Novelty and spatio-temporal heterogeneity in the bacterial diversity of hypersaline Lake Tebenquiche (Salar de Atacama)</article-title>
<source>
<italic>Extremophiles</italic>
</source>
<year>2008</year>
<volume>12</volume>
<issue>4</issue>
<fpage>491</fpage>
<lpage>504</lpage>
<pub-id pub-id-type="doi">10.1007/s00792-008-0153-y</pub-id>
<pub-id pub-id-type="other">2-s2.0-46149110769</pub-id>
<pub-id pub-id-type="pmid">18347752</pub-id>
</element-citation>
</ref>
<ref id="B4">
<label>4</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Oh</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Porter</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Russ</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Burns</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Dyall-Smith</surname>
<given-names>M.</given-names>
</name>
</person-group>
<article-title>Diversity of Haloquadratum and other haloarchaea in three, geographically distant, Australian saltern crystallizer ponds</article-title>
<source>
<italic>Extremophiles</italic>
</source>
<year>2010</year>
<volume>14</volume>
<issue>2</issue>
<fpage>161</fpage>
<lpage>169</lpage>
<pub-id pub-id-type="doi">10.1007/s00792-009-0295-6</pub-id>
<pub-id pub-id-type="other">2-s2.0-77951769203</pub-id>
<pub-id pub-id-type="pmid">20091074</pub-id>
</element-citation>
</ref>
<ref id="B5">
<label>5</label>
<element-citation publication-type="book">
<person-group person-group-type="author">
<name>
<surname>Oren</surname>
<given-names>A.</given-names>
</name>
</person-group>
<person-group person-group-type="editor">
<name>
<surname>Rosenberg</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>DeLong</surname>
<given-names>E. F.</given-names>
</name>
<name>
<surname>Lory</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Stackebrandt</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Thompson</surname>
<given-names>F.</given-names>
</name>
</person-group>
<article-title>Life at high salt concentrations</article-title>
<source>
<italic>The Prokaryotes</italic>
</source>
<year>2013</year>
<publisher-loc>Berlin, Germany</publisher-loc>
<publisher-name>Springer</publisher-name>
<fpage>421</fpage>
<lpage>440</lpage>
</element-citation>
</ref>
<ref id="B6">
<label>6</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Andrei</surname>
<given-names>A.-Ş.</given-names>
</name>
<name>
<surname>Banciu</surname>
<given-names>H. L.</given-names>
</name>
<name>
<surname>Oren</surname>
<given-names>A.</given-names>
</name>
</person-group>
<article-title>Living with salt: metabolic and phylogenetic diversity of archaea inhabiting saline ecosystems</article-title>
<source>
<italic>FEMS Microbiology Letters</italic>
</source>
<year>2012</year>
<volume>330</volume>
<issue>1</issue>
<fpage>1</fpage>
<lpage>9</lpage>
<pub-id pub-id-type="doi">10.1111/j.1574-6968.2012.02526.x</pub-id>
<pub-id pub-id-type="other">2-s2.0-84859430137</pub-id>
<pub-id pub-id-type="pmid">22339687</pub-id>
</element-citation>
</ref>
<ref id="B7">
<label>7</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Walsby</surname>
<given-names>A. E.</given-names>
</name>
</person-group>
<article-title>A square bacterium</article-title>
<source>
<italic>Nature</italic>
</source>
<year>1980</year>
<volume>283</volume>
<issue>5742</issue>
<fpage>69</fpage>
<lpage>71</lpage>
<pub-id pub-id-type="doi">10.1038/283069a0</pub-id>
<pub-id pub-id-type="other">2-s2.0-0018873836</pub-id>
</element-citation>
</ref>
<ref id="B8">
<label>8</label>
<element-citation publication-type="book">
<person-group person-group-type="author">
<name>
<surname>Bolhuis</surname>
<given-names>H.</given-names>
</name>
</person-group>
<person-group person-group-type="editor">
<name>
<surname>Gunde-Cimerman</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Oren</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Plemenitas</surname>
<given-names>A.</given-names>
</name>
</person-group>
<article-title>Walsby's square archaeon</article-title>
<source>
<italic>Adaptation of Life at High Salt Concentrations in Archaea, Bacteria, and Eukarya</italic>
</source>
<year>2005</year>
<fpage>185</fpage>
<lpage>199</lpage>
</element-citation>
</ref>
<ref id="B9">
<label>9</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bolhuis</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Te Poele</surname>
<given-names>E. M.</given-names>
</name>
<name>
<surname>Rodriguez-Valera</surname>
<given-names>F.</given-names>
</name>
</person-group>
<article-title>Isolation and cultivation of Walsby's square archaeon</article-title>
<source>
<italic>Environmental Microbiology</italic>
</source>
<year>2004</year>
<volume>6</volume>
<issue>12</issue>
<fpage>1287</fpage>
<lpage>1291</lpage>
<pub-id pub-id-type="doi">10.1111/j.1462-2920.2004.00692.x</pub-id>
<pub-id pub-id-type="other">2-s2.0-10044226982</pub-id>
<pub-id pub-id-type="pmid">15560825</pub-id>
</element-citation>
</ref>
<ref id="B10">
<label>10</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bolhuis</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Palm</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Wende</surname>
<given-names>A.</given-names>
</name>
<etal></etal>
</person-group>
<article-title>The genome of the square archaeon
<italic>Haloquadratum walsbyi</italic>
: life at the limits of water activity</article-title>
<source>
<italic>BMC Genomics</italic>
</source>
<year>2006</year>
<volume>7, article 169</volume>
<pub-id pub-id-type="doi">10.1186/1471-2164-7-169</pub-id>
<pub-id pub-id-type="other">2-s2.0-33747334086</pub-id>
</element-citation>
</ref>
<ref id="B11">
<label>11</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Béjá</surname>
<given-names>O.</given-names>
</name>
<name>
<surname>Spudich</surname>
<given-names>E. N.</given-names>
</name>
<name>
<surname>Spudich</surname>
<given-names>J. L.</given-names>
</name>
<name>
<surname>Leclerc</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>DeLong</surname>
<given-names>E. F.</given-names>
</name>
</person-group>
<article-title>Proteorhodopsin phototrophy in the ocean</article-title>
<source>
<italic>Nature</italic>
</source>
<year>2001</year>
<volume>411</volume>
<issue>6839</issue>
<fpage>786</fpage>
<lpage>789</lpage>
<pub-id pub-id-type="doi">10.1038/35081051</pub-id>
<pub-id pub-id-type="other">2-s2.0-0035859080</pub-id>
<pub-id pub-id-type="pmid">11459054</pub-id>
</element-citation>
</ref>
<ref id="B12">
<label>12</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sherwood</surname>
<given-names>J. E.</given-names>
</name>
<name>
<surname>Stagnitti</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Kokkinn</surname>
<given-names>M. J.</given-names>
</name>
<name>
<surname>Williams</surname>
<given-names>W. D.</given-names>
</name>
</person-group>
<article-title>Dissolved-oxygen concentrations in hypersaline waters</article-title>
<source>
<italic>Limnology and Oceanography</italic>
</source>
<year>1991</year>
<volume>36</volume>
<issue>2</issue>
<fpage>235</fpage>
<lpage>250</lpage>
<pub-id pub-id-type="doi">10.4319/lo.1991.36.2.0235</pub-id>
</element-citation>
</ref>
<ref id="B13">
<label>13</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Legault</surname>
<given-names>B. A.</given-names>
</name>
<name>
<surname>Lopez-Lopez</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Alba-Casado</surname>
<given-names>J. C.</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Environmental genomics of “
<italic>Haloquadratum walsbyi</italic>
” in a saltern crystallizer indicates a large pool of accessory genes in an otherwise coherent species</article-title>
<source>
<italic>BMC Genomics</italic>
</source>
<year>2006</year>
<volume>7, article 171</volume>
<pub-id pub-id-type="doi">10.1186/1471-2164-7-171</pub-id>
<pub-id pub-id-type="other">2-s2.0-33748299286</pub-id>
</element-citation>
</ref>
<ref id="B14">
<label>14</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Podell</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Ugalde</surname>
<given-names>J. A.</given-names>
</name>
<name>
<surname>Narasingarao</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Banfield</surname>
<given-names>J. F.</given-names>
</name>
<name>
<surname>Heidelberg</surname>
<given-names>K. B.</given-names>
</name>
<name>
<surname>Allen</surname>
<given-names>E. E.</given-names>
</name>
</person-group>
<article-title>Assembly-driven community genomics of a hypersaline microbial ecosystem</article-title>
<source>
<italic>PLoS ONE</italic>
</source>
<year>2013</year>
<volume>8</volume>
<issue>4</issue>
<pub-id pub-id-type="publisher-id">e61692</pub-id>
<pub-id pub-id-type="doi">10.1371/journal.pone.0061692</pub-id>
<pub-id pub-id-type="other">2-s2.0-84876310485</pub-id>
</element-citation>
</ref>
<ref id="B15">
<label>15</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Treusch</surname>
<given-names>A. H.</given-names>
</name>
<name>
<surname>Kletzin</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Raddatz</surname>
<given-names>G.</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Characterization of large-insert DNA libraries from soil for environmental genomic studies of Archaea</article-title>
<source>
<italic>Environmental Microbiology</italic>
</source>
<year>2004</year>
<volume>6</volume>
<issue>9</issue>
<fpage>970</fpage>
<lpage>980</lpage>
<pub-id pub-id-type="doi">10.1111/j.1462-2920.2004.00663.x</pub-id>
<pub-id pub-id-type="other">2-s2.0-4344693429</pub-id>
<pub-id pub-id-type="pmid">15305922</pub-id>
</element-citation>
</ref>
<ref id="B16">
<label>16</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Giovannoni</surname>
<given-names>S. J.</given-names>
</name>
<name>
<surname>Tripp</surname>
<given-names>H. J.</given-names>
</name>
<name>
<surname>Givan</surname>
<given-names>S.</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Genetics: genome streamlining in a cosmopolitan oceanic bacterium</article-title>
<source>
<italic>Science</italic>
</source>
<year>2005</year>
<volume>309</volume>
<issue>5738</issue>
<fpage>1242</fpage>
<lpage>1245</lpage>
<pub-id pub-id-type="doi">10.1126/science.1114057</pub-id>
<pub-id pub-id-type="other">2-s2.0-23844446418</pub-id>
<pub-id pub-id-type="pmid">16109880</pub-id>
</element-citation>
</ref>
<ref id="B17">
<label>17</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Saunders</surname>
<given-names>N. F. W.</given-names>
</name>
<name>
<surname>Thomas</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Curmi</surname>
<given-names>P. M. G.</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Mechanisms of thermal adaptatation revealed from genomes of the anatarctic Archaea
<italic>Methanogenium frigidum</italic>
and
<italic>Methanacoccoides burtonii</italic>
</article-title>
<source>
<italic>Genome Research</italic>
</source>
<year>2003</year>
<volume>13</volume>
<issue>7</issue>
<fpage>1580</fpage>
<lpage>1588</lpage>
<pub-id pub-id-type="doi">10.1101/gr.1180903</pub-id>
<pub-id pub-id-type="other">2-s2.0-0038824870</pub-id>
<pub-id pub-id-type="pmid">12805271</pub-id>
</element-citation>
</ref>
<ref id="B18">
<label>18</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Burns</surname>
<given-names>D. G.</given-names>
</name>
<name>
<surname>Camakaris</surname>
<given-names>H. M.</given-names>
</name>
<name>
<surname>Janssen</surname>
<given-names>P. H.</given-names>
</name>
<name>
<surname>Dyall-Smith</surname>
<given-names>M. L.</given-names>
</name>
</person-group>
<article-title>Cultivation of Walsby's square haloarchaeon</article-title>
<source>
<italic>FEMS Microbiology Letters</italic>
</source>
<year>2004</year>
<volume>238</volume>
<issue>2</issue>
<fpage>469</fpage>
<lpage>473</lpage>
<pub-id pub-id-type="doi">10.1016/j.femsle.2004.08.016</pub-id>
<pub-id pub-id-type="other">2-s2.0-4444287648</pub-id>
<pub-id pub-id-type="pmid">15358434</pub-id>
</element-citation>
</ref>
<ref id="B19">
<label>19</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dyall-Smith</surname>
<given-names>M. L.</given-names>
</name>
<name>
<surname>Pfeiffer</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Klee</surname>
<given-names>K.</given-names>
</name>
<etal></etal>
</person-group>
<article-title>
<italic>Haloquadratum walsbyi</italic>
: Limited Diversity in a Global Pond</article-title>
<source>
<italic>PLoS ONE</italic>
</source>
<year>2011</year>
<volume>6</volume>
<issue>6</issue>
<pub-id pub-id-type="publisher-id">e20968</pub-id>
<pub-id pub-id-type="doi">10.1371/journal.pone.0020968</pub-id>
<pub-id pub-id-type="other">2-s2.0-79959307357</pub-id>
</element-citation>
</ref>
<ref id="B20">
<label>20</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cuadros-Orellana</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Martin-Cuadrado</surname>
<given-names>A.-B.</given-names>
</name>
<name>
<surname>Legault</surname>
<given-names>B.</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Genomic plasticity in prokaryotes: the case of the square haloarchaeon</article-title>
<source>
<italic>The ISME Journal</italic>
</source>
<year>2007</year>
<volume>1</volume>
<issue>3</issue>
<fpage>235</fpage>
<lpage>245</lpage>
<pub-id pub-id-type="doi">10.1038/ismej.2007.35</pub-id>
<pub-id pub-id-type="other">2-s2.0-34547299728</pub-id>
<pub-id pub-id-type="pmid">18043634</pub-id>
</element-citation>
</ref>
<ref id="B21">
<label>21</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Narasingarao</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Podell</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Ugalde</surname>
<given-names>J. A.</given-names>
</name>
<etal></etal>
</person-group>
<article-title>De novo metagenomic assembly reveals abundant novel major lineage of Archaea in hypersaline microbial communities</article-title>
<source>
<italic>ISME Journal</italic>
</source>
<year>2012</year>
<volume>6</volume>
<issue>1</issue>
<fpage>81</fpage>
<lpage>93</lpage>
<pub-id pub-id-type="doi">10.1038/ismej.2011.78</pub-id>
<pub-id pub-id-type="other">2-s2.0-84355162254</pub-id>
<pub-id pub-id-type="pmid">21716304</pub-id>
</element-citation>
</ref>
<ref id="B22">
<label>22</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Podell</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Emerson</surname>
<given-names>J. B.</given-names>
</name>
<name>
<surname>Jones</surname>
<given-names>C. M.</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Seasonal fluctuations in ionic concentrations drive microbial succession in a hypersaline lake community</article-title>
<source>
<italic>ISME Journal</italic>
</source>
<year>2014</year>
<volume>8</volume>
<issue>5</issue>
<fpage>979</fpage>
<lpage>990</lpage>
<pub-id pub-id-type="doi">10.1038/ismej.2013.221</pub-id>
<pub-id pub-id-type="other">2-s2.0-84899487999</pub-id>
<pub-id pub-id-type="pmid">24335829</pub-id>
</element-citation>
</ref>
<ref id="B23">
<label>23</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Emerson</surname>
<given-names>J. B.</given-names>
</name>
<name>
<surname>Thomas</surname>
<given-names>B. C.</given-names>
</name>
<name>
<surname>Andrade</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Allen</surname>
<given-names>E. E.</given-names>
</name>
<name>
<surname>Heidelberg</surname>
<given-names>K. B.</given-names>
</name>
<name>
<surname>Banfielda</surname>
<given-names>J. F.</given-names>
</name>
</person-group>
<article-title>Dynamic viral populations in hypersaline systems as revealed by metagenomic assembly</article-title>
<source>
<italic>Applied and Environmental Microbiology</italic>
</source>
<year>2012</year>
<volume>78</volume>
<issue>17</issue>
<fpage>6309</fpage>
<lpage>6320</lpage>
<pub-id pub-id-type="doi">10.1128/AEM.01212-12</pub-id>
<pub-id pub-id-type="other">2-s2.0-84866154398</pub-id>
<pub-id pub-id-type="pmid">22773627</pub-id>
</element-citation>
</ref>
<ref id="B24">
<label>24</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Emerson</surname>
<given-names>J. B.</given-names>
</name>
<name>
<surname>Andrade</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Thomas</surname>
<given-names>B. C.</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Virus-host and CRISPR dynamics in archaea-dominated hypersaline Lake tyrrell, Victoria, Australia</article-title>
<source>
<italic>Archaea</italic>
</source>
<year>2013</year>
<volume>2013</volume>
<fpage>12</fpage>
<pub-id pub-id-type="publisher-id">370871</pub-id>
<pub-id pub-id-type="doi">10.1155/2013/370871</pub-id>
<pub-id pub-id-type="other">2-s2.0-84880112474</pub-id>
</element-citation>
</ref>
<ref id="B25">
<label>25</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Heidelberg</surname>
<given-names>K. B.</given-names>
</name>
<name>
<surname>Nelson</surname>
<given-names>W. C.</given-names>
</name>
<name>
<surname>Holm</surname>
<given-names>J. B.</given-names>
</name>
<name>
<surname>Eisenkolb</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Andrade</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Emerson</surname>
<given-names>J. B.</given-names>
</name>
</person-group>
<article-title>Characterization of eukaryotic microbial diversity in hypersaline lake tyrrell, australia</article-title>
<source>
<italic>Frontiers in Microbiology</italic>
</source>
<year>2013</year>
<volume>4</volume>
<pub-id pub-id-type="publisher-id">Article 115</pub-id>
<pub-id pub-id-type="doi">10.3389/fmicb.2013.00115</pub-id>
<pub-id pub-id-type="other">2-s2.0-84880094479</pub-id>
</element-citation>
</ref>
<ref id="B26">
<label>26</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Peng</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Leung</surname>
<given-names>H. C. M.</given-names>
</name>
<name>
<surname>Yiu</surname>
<given-names>S. M.</given-names>
</name>
<name>
<surname>Chin</surname>
<given-names>F. Y. L.</given-names>
</name>
</person-group>
<article-title>IDBA-UD: a de novo assembler for single-cell and metagenomic sequencing data with highly uneven depth</article-title>
<source>
<italic>Bioinformatics</italic>
</source>
<year>2012</year>
<volume>28</volume>
<issue>11</issue>
<fpage>1420</fpage>
<lpage>1428</lpage>
<pub-id pub-id-type="publisher-id">bts174</pub-id>
<pub-id pub-id-type="doi">10.1093/bioinformatics/bts174</pub-id>
<pub-id pub-id-type="other">2-s2.0-84861760530</pub-id>
<pub-id pub-id-type="pmid">22495754</pub-id>
</element-citation>
</ref>
<ref id="B27">
<label>27</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Durbin</surname>
<given-names>R.</given-names>
</name>
</person-group>
<article-title>Fast and accurate short read alignment with Burrows-Wheeler transform</article-title>
<source>
<italic>Bioinformatics</italic>
</source>
<year>2009</year>
<volume>25</volume>
<issue>14</issue>
<fpage>1754</fpage>
<lpage>1760</lpage>
<pub-id pub-id-type="doi">10.1093/bioinformatics/btp324</pub-id>
<pub-id pub-id-type="other">2-s2.0-67649884743</pub-id>
<pub-id pub-id-type="pmid">19451168</pub-id>
</element-citation>
</ref>
<ref id="B28">
<label>28</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Iverson</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Morris</surname>
<given-names>R. M.</given-names>
</name>
<name>
<surname>Frazar</surname>
<given-names>C. D.</given-names>
</name>
<name>
<surname>Berthiaume</surname>
<given-names>C. T.</given-names>
</name>
<name>
<surname>Morales</surname>
<given-names>R. L.</given-names>
</name>
<name>
<surname>Armbrust</surname>
<given-names>E. V.</given-names>
</name>
</person-group>
<article-title>Untangling genomes from metagenomes: revealing an uncultured class of marine euryarchaeota</article-title>
<source>
<italic>Science</italic>
</source>
<year>2012</year>
<volume>335</volume>
<issue>6068</issue>
<fpage>587</fpage>
<lpage>590</lpage>
<pub-id pub-id-type="doi">10.1126/science.1212665</pub-id>
<pub-id pub-id-type="other">2-s2.0-84856659328</pub-id>
<pub-id pub-id-type="pmid">22301318</pub-id>
</element-citation>
</ref>
<ref id="B29">
<label>29</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rho</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Tang</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Ye</surname>
<given-names>Y.</given-names>
</name>
</person-group>
<article-title>FragGeneScan: predicting genes in short and error-prone reads</article-title>
<source>
<italic>Nucleic Acids Research</italic>
</source>
<year>2010</year>
<volume>38</volume>
<issue>20</issue>
<fpage>p. e191</fpage>
<pub-id pub-id-type="doi">10.1093/nar/gkq747</pub-id>
<pub-id pub-id-type="other">2-s2.0-78651326786</pub-id>
</element-citation>
</ref>
<ref id="B30">
<label>30</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Aziz</surname>
<given-names>R. K.</given-names>
</name>
<name>
<surname>Bartels</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Best</surname>
<given-names>A.</given-names>
</name>
<etal></etal>
</person-group>
<article-title>The RAST Server: rapid annotations using subsystems technology</article-title>
<source>
<italic>BMC Genomics</italic>
</source>
<year>2008</year>
<volume>9, article 75</volume>
<pub-id pub-id-type="doi">10.1186/1471-2164-9-75</pub-id>
<pub-id pub-id-type="other">2-s2.0-40549120596</pub-id>
</element-citation>
</ref>
<ref id="B31">
<label>31</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Markowitz</surname>
<given-names>V. M.</given-names>
</name>
<name>
<surname>Korzeniewski</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Palaniappan</surname>
<given-names>K.</given-names>
</name>
<etal></etal>
</person-group>
<article-title>The integrated microbial genomes (IMG) system</article-title>
<source>
<italic>Nucleic Acids Research</italic>
</source>
<year>2006</year>
<volume>34</volume>
<fpage>D344</fpage>
<lpage>D348</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gkj024</pub-id>
<pub-id pub-id-type="other">2-s2.0-33644873697</pub-id>
<pub-id pub-id-type="pmid">16381883</pub-id>
</element-citation>
</ref>
<ref id="B32">
<label>32</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Schloss</surname>
<given-names>P. D.</given-names>
</name>
<name>
<surname>Westcott</surname>
<given-names>S. L.</given-names>
</name>
<name>
<surname>Ryabin</surname>
<given-names>T.</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Introducing mothur: open-source, platform-independent, community-supported software for describing and comparing microbial communities</article-title>
<source>
<italic>Applied and Environmental Microbiology</italic>
</source>
<year>2009</year>
<volume>75</volume>
<issue>23</issue>
<fpage>7537</fpage>
<lpage>7541</lpage>
<pub-id pub-id-type="doi">10.1128/AEM.01541-09</pub-id>
<pub-id pub-id-type="other">2-s2.0-72949107142</pub-id>
<pub-id pub-id-type="pmid">19801464</pub-id>
</element-citation>
</ref>
<ref id="B33">
<label>33</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Darling</surname>
<given-names>A. E.</given-names>
</name>
<name>
<surname>Mau</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Perna</surname>
<given-names>N. T.</given-names>
</name>
</person-group>
<article-title>Progressivemauve: multiple genome alignment with gene gain, loss and rearrangement</article-title>
<source>
<italic>PLoS ONE</italic>
</source>
<year>2010</year>
<volume>5</volume>
<issue>6</issue>
<pub-id pub-id-type="publisher-id">e11147</pub-id>
<pub-id pub-id-type="doi">10.1371/journal.pone.0011147</pub-id>
<pub-id pub-id-type="other">2-s2.0-77956193448</pub-id>
</element-citation>
</ref>
<ref id="B34">
<label>34</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Thompson</surname>
<given-names>J. D.</given-names>
</name>
<name>
<surname>Higgins</surname>
<given-names>D. G.</given-names>
</name>
<name>
<surname>Gibson</surname>
<given-names>T. J.</given-names>
</name>
</person-group>
<article-title>Clustal W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice</article-title>
<source>
<italic>Nucleic Acids Research</italic>
</source>
<year>1994</year>
<volume>22</volume>
<issue>22</issue>
<fpage>4673</fpage>
<lpage>4680</lpage>
<pub-id pub-id-type="doi">10.1093/nar/22.22.4673</pub-id>
<pub-id pub-id-type="other">2-s2.0-0027968068</pub-id>
<pub-id pub-id-type="pmid">7984417</pub-id>
</element-citation>
</ref>
<ref id="B35">
<label>35</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Petersen</surname>
<given-names>T. N.</given-names>
</name>
<name>
<surname>Brunak</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>von Heijne</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Nielsen</surname>
<given-names>H.</given-names>
</name>
</person-group>
<article-title>SignalP 4.0: discriminating signal peptides from transmembrane regions</article-title>
<source>
<italic>Nature Methods</italic>
</source>
<year>2011</year>
<volume>8</volume>
<issue>10</issue>
<fpage>785</fpage>
<lpage>786</lpage>
<pub-id pub-id-type="doi">10.1038/nmeth.1701</pub-id>
<pub-id pub-id-type="other">2-s2.0-80053345905</pub-id>
<pub-id pub-id-type="pmid">21959131</pub-id>
</element-citation>
</ref>
<ref id="B36">
<label>36</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Altschul</surname>
<given-names>S. F.</given-names>
</name>
<name>
<surname>Madden</surname>
<given-names>T. L.</given-names>
</name>
<name>
<surname>Schäffer</surname>
<given-names>A. A.</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Gapped BLAST and PSI-BLAST: a new generation of protein database search programs</article-title>
<source>
<italic>Nucleic Acids Research</italic>
</source>
<year>1997</year>
<volume>25</volume>
<issue>17</issue>
<fpage>3389</fpage>
<lpage>3402</lpage>
<pub-id pub-id-type="doi">10.1093/nar/25.17.3389</pub-id>
<pub-id pub-id-type="other">2-s2.0-0030801002</pub-id>
<pub-id pub-id-type="pmid">9254694</pub-id>
</element-citation>
</ref>
<ref id="B37">
<label>37</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nishihara</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Yamazaki</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Oshima</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Koga</surname>
<given-names>Y.</given-names>
</name>
</person-group>
<article-title>
<italic>sn</italic>
-Glycerol-1-phosphate-forming activities in
<italic>Archaea</italic>
: separation of archaeal phospholipid biosynthesis and glycerol catabolism by glycerophosphate enantiomers</article-title>
<source>
<italic>Journal of Bacteriology</italic>
</source>
<year>1999</year>
<volume>181</volume>
<issue>4</issue>
<fpage>1330</fpage>
<lpage>1333</lpage>
<pub-id pub-id-type="other">2-s2.0-0033026470</pub-id>
<pub-id pub-id-type="pmid">9973362</pub-id>
</element-citation>
</ref>
</ref-list>
</back>
<floats-group>
<fig id="fig1" orientation="portrait" position="float">
<label>Figure 1</label>
<caption>
<p>Inset: a gene map of the region spanning 600,000–770,000 bp along the
<italic> H. walsbyi</italic>
J07HQW1 genome, the corresponding regions in the other
<italic> H. walsbyi</italic>
genomes, and the environmental contigs. Full image: an expanded view of the region denoted in the inset. Syntenic regions are marked by matching colors. Pink arrows denote the 16 CDS segment identified as unique to the
<italic> H. walsbyi</italic>
J07HQW1 and C23 genomes. Environmental contigs use a long form identification number whereby the first item is the sample (e.g., LT71), the second item is the filter fraction (e.g., 0.8 
<italic>μ</italic>
m), the third item is the phylogenetic bin (e.g., B), and the fourth item is the scaffold number (e.g., scaffold 13). The regions represented on the gene map from each genome are as follows: J07HQW1, 610,420–769,897 bp; J07HQW2, 2,470,010–2,313,700 bp; C23, 510,000–630,000 bp; DSM16790, 510,000–610,000 bp.</p>
</caption>
<graphic xlink:href="ARCHAEA2015-875784.001"></graphic>
</fig>
<fig id="fig2" orientation="portrait" position="float">
<label>Figure 2</label>
<caption>
<p>Inset: a gene map of the region spanning 1,600,000–1,660,000 bp along the
<italic> H. walsbyi</italic>
J07HQW1 genome, the corresponding regions in the other
<italic> H. walsbyi genomes</italic>
, and the environmental contigs. Full image: an expanded view of the region denoted in the inset. Syntenic regions are marked by matching colors. Environmental contigs use a long form identification number (see
<xref ref-type="fig" rid="fig1">Figure 1</xref>
caption). The regions represented on the gene map from each genome are as follows: J07HQW1, 1,600,465–1,660,398 bp; J07HQW2, 3,299,582–3,359,276 bp; C23, 1,240,000–1,270,000 bp and 1,530,000–1,490,000 bp; DSM16790, 1,240,690–1,270,000 bp and 1,510,000–1,470,000 bp.</p>
</caption>
<graphic xlink:href="ARCHAEA2015-875784.002"></graphic>
</fig>
<fig id="fig3" orientation="portrait" position="float">
<label>Figure 3</label>
<caption>
<p>Inset: a gene map of the region spanning 2,619,000–2,702,000 bp along the
<italic> H. walsbyi</italic>
J07HQW1 genome, the corresponding regions in the other
<italic> H. walsbyi genomes</italic>
, and the environmental contigs. Full image: an expanded view of the region denoted in the inset. Syntenic regions are marked by matching colors. Arrows highlighted in green denote individual sequences specifically identified in the paper. Environmental contigs use a long form identification number (see
<xref ref-type="fig" rid="fig1">Figure 1</xref>
caption). The regions represented on the gene map from each genome are as follows: J07HQW1, 2,619,269–2,703,165 bp; J07HQW2, 1,229,374–1,314,534 bp; C23, 2,110,000–2,220,000 bp; DSM16790, 2,030,000–2,125,000 bp.</p>
</caption>
<graphic xlink:href="ARCHAEA2015-875784.003"></graphic>
</fig>
<fig id="fig4" orientation="portrait" position="float">
<label>Figure 4</label>
<caption>
<p>(a) A comparison of halomucin homologs, eHmu1, found on the environmental contig LT80_0.1_scaffold_0 and the corresponding fragments of eHmu2 and eHmu3, found on the contig LT80_3.0_scaffold_9. The section indicated in the black box reveals the nature of the 28 bp deletion within eHmu2 that causes the frameshift resulting in a stop codon approx. 230 bp downstream. AAID: amino acid identity. (b) A comparison of the halomucin homolog eHmu1 and the corresponding region and annotations identified in
<italic> H. </italic>
sp. J07HQX50. ∗ represents the location of an 81 bp gap in a GLUG motif containing CDS from
<italic> H. </italic>
sp. J07HQX50. NAID: nucleic acid identity.</p>
</caption>
<graphic xlink:href="ARCHAEA2015-875784.004"></graphic>
</fig>
<table-wrap id="tab1" orientation="portrait" position="float">
<label>Table 1</label>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" rowspan="1" colspan="1">Sample </th>
<th align="center" rowspan="1" colspan="1">Date </th>
<th align="center" rowspan="1" colspan="1">Time
<break></break>
(24 hr)</th>
<th align="center" rowspan="1" colspan="1">Temperature
<break></break>
(°C) [
<xref rid="B11" ref-type="bibr">11</xref>
]</th>
<th align="center" rowspan="1" colspan="1">TDS (wt%)
<sup>a</sup>
<break></break>
[
<xref rid="B11" ref-type="bibr">11</xref>
] </th>
<th align="center" rowspan="1" colspan="1">pH
<break></break>
[
<xref rid="B11" ref-type="bibr">11</xref>
] </th>
<th align="center" rowspan="1" colspan="1">Filter size</th>
<th align="center" rowspan="1" colspan="1">Number of trimmed
<break></break>
sequences in library</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" rowspan="2" colspan="1">LT71 (site 1)</td>
<td align="center" rowspan="1" colspan="1">Jan. 7, 2010 </td>
<td align="center" rowspan="1" colspan="1">07:45 </td>
<td align="center" rowspan="1" colspan="1">20 </td>
<td align="center" rowspan="1" colspan="1">32 </td>
<td align="center" rowspan="1" colspan="1">7.2 </td>
<td align="center" rowspan="1" colspan="1">0.1 </td>
<td align="center" rowspan="1" colspan="1">12,845,524</td>
</tr>
<tr>
<td align="center" rowspan="1" colspan="1">Jan. 7, 2011 </td>
<td align="center" rowspan="1" colspan="1">07:45 </td>
<td align="center" rowspan="1" colspan="1">20 </td>
<td align="center" rowspan="1" colspan="1">32 </td>
<td align="center" rowspan="1" colspan="1">7.2 </td>
<td align="center" rowspan="1" colspan="1">0.8 </td>
<td align="center" rowspan="1" colspan="1">4,357,176</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">LT75 (site 1)</td>
<td align="center" rowspan="1" colspan="1">Jan. 7, 2010 </td>
<td align="center" rowspan="1" colspan="1">20:00 </td>
<td align="center" rowspan="1" colspan="1">32 </td>
<td align="center" rowspan="1" colspan="1">36 </td>
<td align="center" rowspan="1" colspan="1">7.3 </td>
<td align="center" rowspan="1" colspan="1">0.8 </td>
<td align="center" rowspan="1" colspan="1">13,475,176</td>
</tr>
<tr>
<td align="left" rowspan="2" colspan="1">LT80 (site 1)</td>
<td align="center" rowspan="1" colspan="1">Jan. 9, 2010 </td>
<td align="center" rowspan="1" colspan="1">16:50 </td>
<td align="center" rowspan="1" colspan="1">45 </td>
<td align="center" rowspan="1" colspan="1">27 </td>
<td align="center" rowspan="1" colspan="1">7.1 </td>
<td align="center" rowspan="1" colspan="1">0.1 </td>
<td align="center" rowspan="1" colspan="1">20,609,138</td>
</tr>
<tr>
<td align="center" rowspan="1" colspan="1">Jan. 9, 2011 </td>
<td align="center" rowspan="1" colspan="1">16:50 </td>
<td align="center" rowspan="1" colspan="1">45 </td>
<td align="center" rowspan="1" colspan="1">27 </td>
<td align="center" rowspan="1" colspan="1">7.1 </td>
<td align="center" rowspan="1" colspan="1">3.0 </td>
<td align="center" rowspan="1" colspan="1">9,014,584</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">LT82 (site 1)</td>
<td align="center" rowspan="1" colspan="1">Jan. 10, 2010 </td>
<td align="center" rowspan="1" colspan="1">12:50 </td>
<td align="center" rowspan="1" colspan="1">33 </td>
<td align="center" rowspan="1" colspan="1">32 </td>
<td align="center" rowspan="1" colspan="1">7.2 </td>
<td align="center" rowspan="1" colspan="1">3.0 </td>
<td align="center" rowspan="1" colspan="1">15,358,950</td>
</tr>
<tr>
<td align="left" rowspan="2" colspan="1">LT85 (site 2)</td>
<td align="center" rowspan="1" colspan="1">Jan. 10, 2010 </td>
<td align="center" rowspan="1" colspan="1">12:50 </td>
<td align="center" rowspan="1" colspan="1">37 </td>
<td align="center" rowspan="1" colspan="1">35 </td>
<td align="center" rowspan="1" colspan="1">7.1 </td>
<td align="center" rowspan="1" colspan="1">0.1 </td>
<td align="center" rowspan="1" colspan="1">52,520,328</td>
</tr>
<tr>
<td align="center" rowspan="1" colspan="1">Jan. 10, 2011 </td>
<td align="center" rowspan="1" colspan="1">12:50 </td>
<td align="center" rowspan="1" colspan="1">37 </td>
<td align="center" rowspan="1" colspan="1">35 </td>
<td align="center" rowspan="1" colspan="1">7.1 </td>
<td align="center" rowspan="1" colspan="1">3.0 </td>
<td align="center" rowspan="1" colspan="1">7,058,560</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>
<sup>a</sup>
Total dissolved solids, weight percent.</p>
</fn>
<fn>
<p>Data from Temp (°C), TDS, and pH were original published in Emerson et al. 2012 [
<xref rid="B23" ref-type="bibr">23</xref>
].</p>
</fn>
</table-wrap-foot>
</table-wrap>
<table-wrap id="tab2" orientation="portrait" position="float">
<label>Table 2</label>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" rowspan="1" colspan="1">Assembly comparison</th>
<th align="center" rowspan="1" colspan="1">Mean ratio of third round to
<break></break>
second round assemblies</th>
<th align="center" rowspan="1" colspan="1">Standard
<break></break>
deviation</th>
<th align="center" rowspan="1" colspan="1">Median</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" rowspan="1" colspan="1">N50</td>
<td align="center" rowspan="1" colspan="1">1.28</td>
<td align="center" rowspan="1" colspan="1">0.48</td>
<td align="center" rowspan="1" colspan="1">1.20</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">Maximum Contig Length</td>
<td align="center" rowspan="1" colspan="1">0.76</td>
<td align="center" rowspan="1" colspan="1">0.24</td>
<td align="center" rowspan="1" colspan="1">0.74</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">Mean Contig Length</td>
<td align="center" rowspan="1" colspan="1">1.03</td>
<td align="center" rowspan="1" colspan="1">0.47</td>
<td align="center" rowspan="1" colspan="1">0.90</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">Total Length Contained in Contigs</td>
<td align="center" rowspan="1" colspan="1">1.10</td>
<td align="center" rowspan="1" colspan="1">0.20</td>
<td align="center" rowspan="1" colspan="1">1.16</td>
</tr>
</tbody>
</table>
</table-wrap>
<table-wrap id="tab3" orientation="portrait" position="float">
<label>Table 3</label>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" rowspan="1" colspan="1">Genome ID </th>
<th align="left" rowspan="1" colspan="1">Source</th>
<th align="center" rowspan="1" colspan="1">Maximum
<break></break>
coverage</th>
<th align="center" rowspan="1" colspan="1">Mean
<break></break>
coverage </th>
<th align="center" rowspan="1" colspan="1">% Coverage</th>
<th align="center" rowspan="1" colspan="1">Number of recruited
<break></break>
sequences
<break></break>
(10
<sup>6</sup>
)</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" rowspan="1" colspan="1">
<italic>H. walsbyi </italic>
J07HQW1 </td>
<td align="left" rowspan="1" colspan="1">Metagenome Lake Tyrrell, AUS (2007) [
<xref rid="B8" ref-type="bibr">8</xref>
] </td>
<td align="center" rowspan="1" colspan="1">2075 </td>
<td align="center" rowspan="1" colspan="1">275.1 </td>
<td align="center" rowspan="1" colspan="1">99.96 </td>
<td align="center" rowspan="1" colspan="1">10.58</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">
<italic>H. walsbyi </italic>
J07HQW2 </td>
<td align="left" rowspan="1" colspan="1">Metagenome Lake Tyrrell, AUS (2007) [
<xref rid="B8" ref-type="bibr">8</xref>
]</td>
<td align="center" rowspan="1" colspan="1">6090 </td>
<td align="center" rowspan="1" colspan="1">222.3 </td>
<td align="center" rowspan="1" colspan="1">99.7 </td>
<td align="center" rowspan="1" colspan="1">8.78</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">
<italic>H. </italic>
sp. J07HQX50 Scaffold 1</td>
<td align="left" rowspan="1" colspan="1">Metagenome Lake Tyrrell, AUS (2007) [
<xref rid="B8" ref-type="bibr">8</xref>
] </td>
<td align="center" rowspan="1" colspan="1">2101 </td>
<td align="center" rowspan="1" colspan="1">97.6 </td>
<td align="center" rowspan="1" colspan="1">84.6 </td>
<td align="center" rowspan="1" colspan="1">1.68</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">
<italic>H. </italic>
sp. J07HQX50 Scaffold 2</td>
<td align="left" rowspan="1" colspan="1">Metagenome Lake Tyrrell, AUS (2007) [
<xref rid="B8" ref-type="bibr">8</xref>
] </td>
<td align="center" rowspan="1" colspan="1">2064 </td>
<td align="center" rowspan="1" colspan="1">98 </td>
<td align="center" rowspan="1" colspan="1">83.4 </td>
<td align="center" rowspan="1" colspan="1">1.6</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">
<italic>H. walsbyi </italic>
C23 </td>
<td align="left" rowspan="1" colspan="1">Cultured isolate from Geelong, AUS [
<xref rid="B23" ref-type="bibr">23</xref>
] </td>
<td align="center" rowspan="1" colspan="1">3387 </td>
<td align="center" rowspan="1" colspan="1">254.2 </td>
<td align="center" rowspan="1" colspan="1">92.3 </td>
<td align="center" rowspan="1" colspan="1">8.754</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">
<italic>H. walsbyi </italic>
C23 Plasmid PL6A</td>
<td align="left" rowspan="1" colspan="1">Cultured isolate from Geelong, AUS [
<xref rid="B23" ref-type="bibr">23</xref>
] </td>
<td align="center" rowspan="1" colspan="1">240 </td>
<td align="center" rowspan="1" colspan="1">45.4 </td>
<td align="center" rowspan="1" colspan="1">98.2 </td>
<td align="center" rowspan="1" colspan="1">0.00332</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">
<italic>H. walsbyi </italic>
C23 Plasmid PL6B</td>
<td align="left" rowspan="1" colspan="1">Cultured isolate from Geelong, AUS [
<xref rid="B23" ref-type="bibr">23</xref>
] </td>
<td align="center" rowspan="1" colspan="1">484 </td>
<td align="center" rowspan="1" colspan="1">90.1 </td>
<td align="center" rowspan="1" colspan="1">98.6 </td>
<td align="center" rowspan="1" colspan="1">0.006635</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">
<italic>H. walsbyi </italic>
C23 Plasmid PL100</td>
<td align="left" rowspan="1" colspan="1">Cultured isolate from Geelong, AUS [
<xref rid="B23" ref-type="bibr">23</xref>
] </td>
<td align="center" rowspan="1" colspan="1">1016 </td>
<td align="center" rowspan="1" colspan="1">80.1 </td>
<td align="center" rowspan="1" colspan="1">95.9 </td>
<td align="center" rowspan="1" colspan="1">0.086</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">
<italic>H. walsbyi </italic>
DSM16790 </td>
<td align="left" rowspan="1" colspan="1">Cultured isolate from Alicante, Spain [
<xref rid="B15" ref-type="bibr">15</xref>
] </td>
<td align="center" rowspan="1" colspan="1">3368 </td>
<td align="center" rowspan="1" colspan="1">254.9 </td>
<td align="center" rowspan="1" colspan="1">93.3 </td>
<td align="center" rowspan="1" colspan="1">8.76</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">
<italic>H. walsbyi </italic>
DSM16790 Plasmid</td>
<td align="left" rowspan="1" colspan="1">Cultured isolate from Alicante, Spain [
<xref rid="B15" ref-type="bibr">15</xref>
] </td>
<td align="center" rowspan="1" colspan="1">515 </td>
<td align="center" rowspan="1" colspan="1">66.1 </td>
<td align="center" rowspan="1" colspan="1">71.5 </td>
<td align="center" rowspan="1" colspan="1">0.034</td>
</tr>
</tbody>
</table>
</table-wrap>
</floats-group>
</pmc>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Terre/explor/CobaltMaghrebV1/Data/Pmc/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000398 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Pmc/Corpus/biblio.hfd -nk 000398 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Terre
   |area=    CobaltMaghrebV1
   |flux=    Pmc
   |étape=   Corpus
   |type=    RBID
   |clé=     PMC:4330952
   |texte=   De Novo Sequences of Haloquadratum walsbyi from Lake Tyrrell, Australia, Reveal a Variable Genomic Landscape
}}

Pour générer des pages wiki

HfdIndexSelect -h $EXPLOR_AREA/Data/Pmc/Corpus/RBID.i   -Sk "pubmed:25709557" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Pmc/Corpus/biblio.hfd   \
       | NlmPubMed2Wicri -a CobaltMaghrebV1 

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Tue Nov 14 12:56:51 2017. Site generation: Mon Feb 12 07:59:49 2024