Cyberinfrastructure exploration server

Full-length model of the human galectin-4 and insights into dynamics of inter-domain communication

Internal identifier: 000147 (Pmc/Corpus); previous: 000146; next: 000148

Authors: Joane K. Rustiguel; Ricardo O. S. Soares; Steve P. Meisburger; Katherine M. Davis; Kristina L. Malzbender; Nozomi Ando; Marcelo Dias-Baruffi; Maria Cristina Nonato

RBID : PMC:5027518

Abstract

Galectins are proteins involved in diverse cellular contexts due to their capacity to decipher and respond to the information encoded by β-galactoside sugars. In particular, human galectin-4, normally expressed in the healthy gastrointestinal tract, displays differential expression in cancerous tissues and is considered a potential drug target for liver and lung cancer. Galectin-4 is a tandem-repeat galectin characterized by two carbohydrate recognition domains connected by a linker-peptide. Despite their relevance to cell function and pathogenesis, structural characterization of full-length tandem-repeat galectins has remained elusive. Here, we investigate galectin-4 using X-ray crystallography, small- and wide-angle X-ray scattering, molecular modelling, molecular dynamics simulations, and differential scanning fluorimetry assays and describe for the first time a structural model for human galectin-4. Our results provide insight into the structural role of the linker-peptide and shed light on the dynamic characteristics of the mechanism of carbohydrate recognition among tandem-repeat galectins.


DOI: 10.1038/srep33633
PubMed: 27642006
PubMed Central: 5027518

Links to Exploration step

PMC:5027518

The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Full-length model of the human galectin-4 and insights into dynamics of inter-domain communication</title>
<author>
<name sortKey="Rustiguel, Joane K" sort="Rustiguel, Joane K" uniqKey="Rustiguel J" first="Joane K." last="Rustiguel">Joane K. Rustiguel</name>
<affiliation>
<nlm:aff id="a1">
<institution>Laboratório de Cristalografia de Proteínas, Faculdade de Ciências Farmacêuticas de Ribeirão Preto, Universidade de São Paulo</institution>
, SP,
<country>Brazil</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Soares, Ricardo O S" sort="Soares, Ricardo O S" uniqKey="Soares R" first="Ricardo O. S." last="Soares">Ricardo O. S. Soares</name>
<affiliation>
<nlm:aff id="a1">
<institution>Laboratório de Cristalografia de Proteínas, Faculdade de Ciências Farmacêuticas de Ribeirão Preto, Universidade de São Paulo</institution>
, SP,
<country>Brazil</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Meisburger, Steve P" sort="Meisburger, Steve P" uniqKey="Meisburger S" first="Steve P." last="Meisburger">Steve P. Meisburger</name>
<affiliation>
<nlm:aff id="a2">
<institution>Department of Chemistry, Princeton University</institution>
, Princeton, NJ,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Davis, Katherine M" sort="Davis, Katherine M" uniqKey="Davis K" first="Katherine M." last="Davis">Katherine M. Davis</name>
<affiliation>
<nlm:aff id="a2">
<institution>Department of Chemistry, Princeton University</institution>
, Princeton, NJ,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Malzbender, Kristina L" sort="Malzbender, Kristina L" uniqKey="Malzbender K" first="Kristina L." last="Malzbender">Kristina L. Malzbender</name>
<affiliation>
<nlm:aff id="a2">
<institution>Department of Chemistry, Princeton University</institution>
, Princeton, NJ,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Ando, Nozomi" sort="Ando, Nozomi" uniqKey="Ando N" first="Nozomi" last="Ando">Nozomi Ando</name>
<affiliation>
<nlm:aff id="a2">
<institution>Department of Chemistry, Princeton University</institution>
, Princeton, NJ,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Dias Baruffi, Marcelo" sort="Dias Baruffi, Marcelo" uniqKey="Dias Baruffi M" first="Marcelo" last="Dias-Baruffi">Marcelo Dias-Baruffi</name>
<affiliation>
<nlm:aff id="a3">
<institution>Departamento de Análises Clínicas, Toxicológicas e Bromatológicas, Faculdade de Ciências Farmacêuticas de Ribeirão Preto, Universidade de São Paulo</institution>
, SP,
<country>Brazil</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Nonato, Maria Cristina" sort="Nonato, Maria Cristina" uniqKey="Nonato M" first="Maria Cristina" last="Nonato">Maria Cristina Nonato</name>
<affiliation>
<nlm:aff id="a1">
<institution>Laboratório de Cristalografia de Proteínas, Faculdade de Ciências Farmacêuticas de Ribeirão Preto, Universidade de São Paulo</institution>
, SP,
<country>Brazil</country>
</nlm:aff>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PMC</idno>
<idno type="pmid">27642006</idno>
<idno type="pmc">5027518</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5027518</idno>
<idno type="RBID">PMC:5027518</idno>
<idno type="doi">10.1038/srep33633</idno>
<date when="2016">2016</date>
<idno type="wicri:Area/Pmc/Corpus">000147</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a" type="main">Full-length model of the human galectin-4 and insights into dynamics of inter-domain communication</title>
<author>
<name sortKey="Rustiguel, Joane K" sort="Rustiguel, Joane K" uniqKey="Rustiguel J" first="Joane K." last="Rustiguel">Joane K. Rustiguel</name>
<affiliation>
<nlm:aff id="a1">
<institution>Laboratório de Cristalografia de Proteínas, Faculdade de Ciências Farmacêuticas de Ribeirão Preto, Universidade de São Paulo</institution>
, SP,
<country>Brazil</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Soares, Ricardo O S" sort="Soares, Ricardo O S" uniqKey="Soares R" first="Ricardo O. S." last="Soares">Ricardo O. S. Soares</name>
<affiliation>
<nlm:aff id="a1">
<institution>Laboratório de Cristalografia de Proteínas, Faculdade de Ciências Farmacêuticas de Ribeirão Preto, Universidade de São Paulo</institution>
, SP,
<country>Brazil</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Meisburger, Steve P" sort="Meisburger, Steve P" uniqKey="Meisburger S" first="Steve P." last="Meisburger">Steve P. Meisburger</name>
<affiliation>
<nlm:aff id="a2">
<institution>Department of Chemistry, Princeton University</institution>
, Princeton, NJ,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Davis, Katherine M" sort="Davis, Katherine M" uniqKey="Davis K" first="Katherine M." last="Davis">Katherine M. Davis</name>
<affiliation>
<nlm:aff id="a2">
<institution>Department of Chemistry, Princeton University</institution>
, Princeton, NJ,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Malzbender, Kristina L" sort="Malzbender, Kristina L" uniqKey="Malzbender K" first="Kristina L." last="Malzbender">Kristina L. Malzbender</name>
<affiliation>
<nlm:aff id="a2">
<institution>Department of Chemistry, Princeton University</institution>
, Princeton, NJ,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Ando, Nozomi" sort="Ando, Nozomi" uniqKey="Ando N" first="Nozomi" last="Ando">Nozomi Ando</name>
<affiliation>
<nlm:aff id="a2">
<institution>Department of Chemistry, Princeton University</institution>
, Princeton, NJ,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Dias Baruffi, Marcelo" sort="Dias Baruffi, Marcelo" uniqKey="Dias Baruffi M" first="Marcelo" last="Dias-Baruffi">Marcelo Dias-Baruffi</name>
<affiliation>
<nlm:aff id="a3">
<institution>Departamento de Análises Clínicas, Toxicológicas e Bromatológicas, Faculdade de Ciências Farmacêuticas de Ribeirão Preto, Universidade de São Paulo</institution>
, SP,
<country>Brazil</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Nonato, Maria Cristina" sort="Nonato, Maria Cristina" uniqKey="Nonato M" first="Maria Cristina" last="Nonato">Maria Cristina Nonato</name>
<affiliation>
<nlm:aff id="a1">
<institution>Laboratório de Cristalografia de Proteínas, Faculdade de Ciências Farmacêuticas de Ribeirão Preto, Universidade de São Paulo</institution>
, SP,
<country>Brazil</country>
</nlm:aff>
</affiliation>
</author>
</analytic>
<series>
<title level="j">Scientific Reports</title>
<idno type="eISSN">2045-2322</idno>
<imprint>
<date when="2016">2016</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">
<p>Galectins are proteins involved in diverse cellular contexts due to their capacity to decipher and respond to the information encoded by β-galactoside sugars. In particular, human galectin-4, normally expressed in the healthy gastrointestinal tract, displays differential expression in cancerous tissues and is considered a potential drug target for liver and lung cancer. Galectin-4 is a tandem-repeat galectin characterized by two carbohydrate recognition domains connected by a linker-peptide. Despite their relevance to cell function and pathogenesis, structural characterization of full-length tandem-repeat galectins has remained elusive. Here, we investigate galectin-4 using X-ray crystallography, small- and wide-angle X-ray scattering, molecular modelling, molecular dynamics simulations, and differential scanning fluorimetry assays and describe for the first time a structural model for human galectin-4. Our results provide insight into the structural role of the linker-peptide and shed light on the dynamic characteristics of the mechanism of carbohydrate recognition among tandem-repeat galectins.</p>
</div>
</front>
<back>
<div1 type="bibliography">
<listBibl>
<biblStruct>
<analytic>
<author>
<name sortKey="Barondes, S H" uniqKey="Barondes S">S. H. Barondes</name>
</author>
<author>
<name sortKey="Cooper, D N" uniqKey="Cooper D">D. N. Cooper</name>
</author>
<author>
<name sortKey="Gitt, M A" uniqKey="Gitt M">M. A. Gitt</name>
</author>
<author>
<name sortKey="Leffler, H" uniqKey="Leffler H">H. Leffler</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Hughes, R C" uniqKey="Hughes R">R. C. Hughes</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Leffler, H" uniqKey="Leffler H">H. Leffler</name>
</author>
<author>
<name sortKey="Carlsson, S" uniqKey="Carlsson S">S. Carlsson</name>
</author>
<author>
<name sortKey="Hedlund, M" uniqKey="Hedlund M">M. Hedlund</name>
</author>
<author>
<name sortKey="Qian, Y" uniqKey="Qian Y">Y. Qian</name>
</author>
<author>
<name sortKey="Poirier, F" uniqKey="Poirier F">F. Poirier</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Compagno, D" uniqKey="Compagno D">D. Compagno</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Ebrahim, A H" uniqKey="Ebrahim A">A. H. Ebrahim</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Hirabayashi, J" uniqKey="Hirabayashi J">J. Hirabayashi</name>
</author>
<author>
<name sortKey="Kasai, K" uniqKey="Kasai K">K. Kasai</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="L Pez Lucendo, M F" uniqKey="L Pez Lucendo M">M. F. López-Lucendo</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Kashio, Y" uniqKey="Kashio Y">Y. Kashio</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Bi, S" uniqKey="Bi S">S. Bi</name>
</author>
<author>
<name sortKey="Earl, L A" uniqKey="Earl L">L. A. Earl</name>
</author>
<author>
<name sortKey="Jacobs, L" uniqKey="Jacobs L">L. Jacobs</name>
</author>
<author>
<name sortKey="Baum, L G" uniqKey="Baum L">L. G. Baum</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Levy, Y" uniqKey="Levy Y">Y. Levy</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Andre, S" uniqKey="Andre S">S. André</name>
</author>
<author>
<name sortKey="Wang, G N" uniqKey="Wang G">G. N. Wang</name>
</author>
<author>
<name sortKey="Gabius, H J" uniqKey="Gabius H">H. J. Gabius</name>
</author>
<author>
<name sortKey="Murphy, P V" uniqKey="Murphy P">P. V. Murphy</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Earl, L A" uniqKey="Earl L">L. A. Earl</name>
</author>
<author>
<name sortKey="Bi, S" uniqKey="Bi S">S. Bi</name>
</author>
<author>
<name sortKey="Baum, L G" uniqKey="Baum L">L. G. Baum</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Troncoso, M F" uniqKey="Troncoso M">M. F. Troncoso</name>
</author>
<author>
<name sortKey="Elola, M T" uniqKey="Elola M">M. T. Elola</name>
</author>
<author>
<name sortKey="Croci, D O" uniqKey="Croci D">D. O. Croci</name>
</author>
<author>
<name sortKey="Rabinovich, G A" uniqKey="Rabinovich G">G. A. Rabinovich</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Kim, S W" uniqKey="Kim S">S. W. Kim</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Belo, A I" uniqKey="Belo A">A. I. Belo</name>
</author>
<author>
<name sortKey="Van Der Sar, A M" uniqKey="Van Der Sar A">A. M. van der Sar</name>
</author>
<author>
<name sortKey="Tefsen, B" uniqKey="Tefsen B">B. Tefsen</name>
</author>
<author>
<name sortKey="Van Die, I" uniqKey="Van Die I">I. van Die</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Satelli, A" uniqKey="Satelli A">A. Satelli</name>
</author>
<author>
<name sortKey="Rao, P S" uniqKey="Rao P">P. S. Rao</name>
</author>
<author>
<name sortKey="Thirumala, S" uniqKey="Thirumala S">S. Thirumala</name>
</author>
<author>
<name sortKey="Rao, U S" uniqKey="Rao U">U. S. Rao</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Hayashi, T" uniqKey="Hayashi T">T. Hayashi</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Kondoh, N" uniqKey="Kondoh N">N. Kondoh</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Huflejt, M E" uniqKey="Huflejt M">M. E. Huflejt</name>
</author>
<author>
<name sortKey="Jordan, E T" uniqKey="Jordan E">E. T. Jordan</name>
</author>
<author>
<name sortKey="Gitt, M A" uniqKey="Gitt M">M. A. Gitt</name>
</author>
<author>
<name sortKey="Barondes, S H" uniqKey="Barondes S">S. H. Barondes</name>
</author>
<author>
<name sortKey="Leffler, H" uniqKey="Leffler H">H. Leffler</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Matulis, D" uniqKey="Matulis D">D. Matulis</name>
</author>
<author>
<name sortKey="Kranz, J K" uniqKey="Kranz J">J. K. Kranz</name>
</author>
<author>
<name sortKey="Salemme, F R" uniqKey="Salemme F">F. R. Salemme</name>
</author>
<author>
<name sortKey="Todd, M J" uniqKey="Todd M">M. J. Todd</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Bum Erdene, K" uniqKey="Bum Erdene K">K. Bum-Erdene</name>
</author>
<author>
<name sortKey="Leffler, H" uniqKey="Leffler H">H. Leffler</name>
</author>
<author>
<name sortKey="Nilsson, U J" uniqKey="Nilsson U">U. J. Nilsson</name>
</author>
<author>
<name sortKey="Blanchard, H" uniqKey="Blanchard H">H. Blanchard</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Bum Erdene, K" uniqKey="Bum Erdene K">K. Bum-Erdene</name>
</author>
<author>
<name sortKey="Leffler, H" uniqKey="Leffler H">H. Leffler</name>
</author>
<author>
<name sortKey="Nilsson, U J" uniqKey="Nilsson U">U. J. Nilsson</name>
</author>
<author>
<name sortKey="Blanchard, H" uniqKey="Blanchard H">H. Blanchard</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Zimbardi, A L" uniqKey="Zimbardi A">A. L. Zimbardi</name>
</author>
<author>
<name sortKey="Pinheiro, M P" uniqKey="Pinheiro M">M. P. Pinheiro</name>
</author>
<author>
<name sortKey="Dias Baruffi, M" uniqKey="Dias Baruffi M">M. Dias-Baruffi</name>
</author>
<author>
<name sortKey="Nonato, M C" uniqKey="Nonato M">M. C. Nonato</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Rustiguel, J K" uniqKey="Rustiguel J">J. K. Rustiguel</name>
</author>
<author>
<name sortKey="Kumagai, P S" uniqKey="Kumagai P">P. S. Kumagai</name>
</author>
<author>
<name sortKey="Dias Baruffi, M" uniqKey="Dias Baruffi M">M. Dias-Baruffi</name>
</author>
<author>
<name sortKey="Costa Filho, A J" uniqKey="Costa Filho A">A. J. Costa-Filho</name>
</author>
<author>
<name sortKey="Nonato, M C" uniqKey="Nonato M">M. C. Nonato</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Ideo, H" uniqKey="Ideo H">H. Ideo</name>
</author>
<author>
<name sortKey="Seko, A" uniqKey="Seko A">A. Seko</name>
</author>
<author>
<name sortKey="Yamashita, K" uniqKey="Yamashita K">K. Yamashita</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Fagherazzi, G" uniqKey="Fagherazzi G">G. Fagherazzi</name>
</author>
<author>
<name sortKey="Glatter, O" uniqKey="Glatter O">O. Glatter</name>
</author>
<author>
<name sortKey="Kratky, O" uniqKey="Kratky O">O. Kratky</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Di Lella, S" uniqKey="Di Lella S">S. Di Lella</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Rabinovich, G A" uniqKey="Rabinovich G">G. A. Rabinovich</name>
</author>
<author>
<name sortKey="Toscano, M A" uniqKey="Toscano M">M. A. Toscano</name>
</author>
<author>
<name sortKey="Jackson, S S" uniqKey="Jackson S">S. S. Jackson</name>
</author>
<author>
<name sortKey="Vasta, G R" uniqKey="Vasta G">G. R. Vasta</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Yoshida, H" uniqKey="Yoshida H">H. Yoshida</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Kato, Y" uniqKey="Kato Y">Y. Kato</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Van Weelden, S" uniqKey="Van Weelden S">S. van Weelden</name>
</author>
<author>
<name sortKey="Van Hellemond, J" uniqKey="Van Hellemond J">J. van Hellemond</name>
</author>
<author>
<name sortKey="Opperdoes, F" uniqKey="Opperdoes F">F. Opperdoes</name>
</author>
<author>
<name sortKey="Tielens, A" uniqKey="Tielens A">A. Tielens</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Niesen, F H" uniqKey="Niesen F">F. H. Niesen</name>
</author>
<author>
<name sortKey="Berglund, H" uniqKey="Berglund H">H. Berglund</name>
</author>
<author>
<name sortKey="Vedadi, M" uniqKey="Vedadi M">M. Vedadi</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Battye, T G" uniqKey="Battye T">T. G. Battye</name>
</author>
<author>
<name sortKey="Kontogiannis, L" uniqKey="Kontogiannis L">L. Kontogiannis</name>
</author>
<author>
<name sortKey="Johnson, O" uniqKey="Johnson O">O. Johnson</name>
</author>
<author>
<name sortKey="Powell, H R" uniqKey="Powell H">H. R. Powell</name>
</author>
<author>
<name sortKey="Leslie, A G" uniqKey="Leslie A">A. G. Leslie</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Evans, P" uniqKey="Evans P">P. Evans</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Evans, P R" uniqKey="Evans P">P. R. Evans</name>
</author>
<author>
<name sortKey="Murshudov, G N" uniqKey="Murshudov G">G. N. Murshudov</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Winn, M D" uniqKey="Winn M">M. D. Winn</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Mccoy, A J" uniqKey="Mccoy A">A. J. McCoy</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Adams, P D" uniqKey="Adams P">P. D. Adams</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Emsley, P" uniqKey="Emsley P">P. Emsley</name>
</author>
<author>
<name sortKey="Lohkamp, B" uniqKey="Lohkamp B">B. Lohkamp</name>
</author>
<author>
<name sortKey="Scott, W G" uniqKey="Scott W">W. G. Scott</name>
</author>
<author>
<name sortKey="Cowtan, K" uniqKey="Cowtan K">K. Cowtan</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Chen, V B" uniqKey="Chen V">V. B. Chen</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Delano, W L" uniqKey="Delano W">W. L. DeLano</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Laskowski, R A" uniqKey="Laskowski R">R. A. Laskowski</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Kim, D E" uniqKey="Kim D">D. E. Kim</name>
</author>
<author>
<name sortKey="Chivian, D" uniqKey="Chivian D">D. Chivian</name>
</author>
<author>
<name sortKey="Baker, D" uniqKey="Baker D">D. Baker</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Sali, A" uniqKey="Sali A">A. Sali</name>
</author>
<author>
<name sortKey="Blundell, T L" uniqKey="Blundell T">T. L. Blundell</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Pronk, S" uniqKey="Pronk S">S. Pronk</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Lindorff Larsen, K" uniqKey="Lindorff Larsen K">K. Lindorff-Larsen</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Hoover, W G" uniqKey="Hoover W">W. G. Hoover</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Parrinello, M" uniqKey="Parrinello M">M. Parrinello</name>
</author>
<author>
<name sortKey="Rahman, A" uniqKey="Rahman A">A. Rahman</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Miyamoto, S" uniqKey="Miyamoto S">S. Miyamoto</name>
</author>
<author>
<name sortKey="Kollman, P A" uniqKey="Kollman P">P. A. Kollman</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Hess, B" uniqKey="Hess B">B. Hess</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Jorgensen, W L" uniqKey="Jorgensen W">W. L. Jorgensen</name>
</author>
<author>
<name sortKey="Chandrasekhar, J" uniqKey="Chandrasekhar J">J. Chandrasekhar</name>
</author>
<author>
<name sortKey="Madura, J D" uniqKey="Madura J">J. D. Madura</name>
</author>
<author>
<name sortKey="Impey, R W" uniqKey="Impey R">R. W. Impey</name>
</author>
<author>
<name sortKey="Klein, M L" uniqKey="Klein M">M. L. Klein</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Thurlkill, R L" uniqKey="Thurlkill R">R. L. Thurlkill</name>
</author>
<author>
<name sortKey="Grimsley, G R" uniqKey="Grimsley G">G. R. Grimsley</name>
</author>
<author>
<name sortKey="Scholtz, J M" uniqKey="Scholtz J">J. M. Scholtz</name>
</author>
<author>
<name sortKey="Pace, C N" uniqKey="Pace C">C. N. Pace</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Kirschner, K N" uniqKey="Kirschner K">K. N. Kirschner</name>
</author>
<author>
<name sortKey="Lins, R D" uniqKey="Lins R">R. D. Lins</name>
</author>
<author>
<name sortKey="Maass, A" uniqKey="Maass A">A. Maass</name>
</author>
<author>
<name sortKey="Soares, T A" uniqKey="Soares T">T. A. Soares</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Sousa Da Silva, A W" uniqKey="Sousa Da Silva A">A. W. Sousa da Silva</name>
</author>
<author>
<name sortKey="Vranken, W F" uniqKey="Vranken W">W. F. Vranken</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Grant, B J" uniqKey="Grant B">B. J. Grant</name>
</author>
<author>
<name sortKey="Rodrigues, A P" uniqKey="Rodrigues A">A. P. Rodrigues</name>
</author>
<author>
<name sortKey="Elsawy, K M" uniqKey="Elsawy K">K. M. ElSawy</name>
</author>
<author>
<name sortKey="Mccammon, J A" uniqKey="Mccammon J">J. A. McCammon</name>
</author>
<author>
<name sortKey="Caves, L S" uniqKey="Caves L">L. S. Caves</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Humphrey, W" uniqKey="Humphrey W">W. Humphrey</name>
</author>
<author>
<name sortKey="Dalke, A" uniqKey="Dalke A">A. Dalke</name>
</author>
<author>
<name sortKey="Schulten, K" uniqKey="Schulten K">K. Schulten</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Hutchinson, E G" uniqKey="Hutchinson E">E. G. Hutchinson</name>
</author>
<author>
<name sortKey="Thornton, J M" uniqKey="Thornton J">J. M. Thornton</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="De Beer, T A" uniqKey="De Beer T">T. A. de Beer</name>
</author>
<author>
<name sortKey="Berka, K" uniqKey="Berka K">K. Berka</name>
</author>
<author>
<name sortKey="Thornton, J M" uniqKey="Thornton J">J. M. Thornton</name>
</author>
<author>
<name sortKey="Laskowski, R A" uniqKey="Laskowski R">R. A. Laskowski</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Nielsen, S S" uniqKey="Nielsen S">S. S. Nielsen</name>
</author>
<author>
<name sortKey="M Ller, M" uniqKey="M Ller M">M. Møller</name>
</author>
<author>
<name sortKey="Gillilan, R E" uniqKey="Gillilan R">R. E. Gillilan</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Skou, S" uniqKey="Skou S">S. Skou</name>
</author>
<author>
<name sortKey="Gillilan, R E" uniqKey="Gillilan R">R. E. Gillilan</name>
</author>
<author>
<name sortKey="Ando, N" uniqKey="Ando N">N. Ando</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Petoukhov, M V" uniqKey="Petoukhov M">M. V. Petoukhov</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Svergun, D" uniqKey="Svergun D">D. Svergun</name>
</author>
<author>
<name sortKey="Barberato, C" uniqKey="Barberato C">C. Barberato</name>
</author>
<author>
<name sortKey="Koch, M" uniqKey="Koch M">M. Koch</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Svergun, D" uniqKey="Svergun D">D. Svergun</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Svergun, D I" uniqKey="Svergun D">D. I. Svergun</name>
</author>
<author>
<name sortKey="Petoukhov, M V" uniqKey="Petoukhov M">M. V. Petoukhov</name>
</author>
<author>
<name sortKey="Koch, M H" uniqKey="Koch M">M. H. Koch</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Volkov, V V" uniqKey="Volkov V">V. V. Volkov</name>
</author>
<author>
<name sortKey="Svergun, D I" uniqKey="Svergun D">D. I. Svergun</name>
</author>
</analytic>
</biblStruct>
</listBibl>
</div1>
</back>
</TEI>
<pmc article-type="research-article">
<pmc-dir>properties open_access</pmc-dir>
<front>
<journal-meta>
<journal-id journal-id-type="nlm-ta">Sci Rep</journal-id>
<journal-id journal-id-type="iso-abbrev">Sci Rep</journal-id>
<journal-title-group>
<journal-title>Scientific Reports</journal-title>
</journal-title-group>
<issn pub-type="epub">2045-2322</issn>
<publisher>
<publisher-name>Nature Publishing Group</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="pmid">27642006</article-id>
<article-id pub-id-type="pmc">5027518</article-id>
<article-id pub-id-type="pii">srep33633</article-id>
<article-id pub-id-type="doi">10.1038/srep33633</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Full-length model of the human galectin-4 and insights into dynamics of inter-domain communication</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Rustiguel</surname>
<given-names>Joane K.</given-names>
</name>
<xref ref-type="aff" rid="a1">1</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Soares</surname>
<given-names>Ricardo O. S.</given-names>
</name>
<xref ref-type="aff" rid="a1">1</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Meisburger</surname>
<given-names>Steve P.</given-names>
</name>
<xref ref-type="aff" rid="a2">2</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Davis</surname>
<given-names>Katherine M.</given-names>
</name>
<xref ref-type="aff" rid="a2">2</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Malzbender</surname>
<given-names>Kristina L.</given-names>
</name>
<xref ref-type="aff" rid="a2">2</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Ando</surname>
<given-names>Nozomi</given-names>
</name>
<xref ref-type="aff" rid="a2">2</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Dias-Baruffi</surname>
<given-names>Marcelo</given-names>
</name>
<xref ref-type="aff" rid="a3">3</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Nonato</surname>
<given-names>Maria Cristina</given-names>
</name>
<xref ref-type="corresp" rid="c1">a</xref>
<xref ref-type="aff" rid="a1">1</xref>
</contrib>
<aff id="a1">
<label>1</label>
<institution>Laboratório de Cristalografia de Proteínas, Faculdade de Ciências Farmacêuticas de Ribeirão Preto, Universidade de São Paulo</institution>
, SP,
<country>Brazil</country>
</aff>
<aff id="a2">
<label>2</label>
<institution>Department of Chemistry, Princeton University</institution>
, Princeton, NJ,
<country>USA</country>
</aff>
<aff id="a3">
<label>3</label>
<institution>Departamento de Análises Clínicas, Toxicológicas e Bromatológicas, Faculdade de Ciências Farmacêuticas de Ribeirão Preto, Universidade de São Paulo</institution>
, SP,
<country>Brazil</country>
</aff>
</contrib-group>
<author-notes>
<corresp id="c1">
<label>a</label>
<email>cristy@fcfrp.usp.br</email>
</corresp>
</author-notes>
<pub-date pub-type="epub">
<day>19</day>
<month>09</month>
<year>2016</year>
</pub-date>
<pub-date pub-type="collection">
<year>2016</year>
</pub-date>
<volume>6</volume>
<elocation-id>33633</elocation-id>
<history>
<date date-type="received">
<day>26</day>
<month>05</month>
<year>2016</year>
</date>
<date date-type="accepted">
<day>31</day>
<month>08</month>
<year>2016</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright © 2016, The Author(s)</copyright-statement>
<copyright-year>2016</copyright-year>
<copyright-holder>The Author(s)</copyright-holder>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/4.0/">
<pmc-comment>author-paid</pmc-comment>
<license-p>This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit
<ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/licenses/by/4.0/">http://creativecommons.org/licenses/by/4.0/</ext-link>
</license-p>
</license>
</permissions>
<abstract>
<p>Galectins are proteins involved in diverse cellular contexts due to their capacity to decipher and respond to the information encoded by β-galactoside sugars. In particular, human galectin-4, normally expressed in the healthy gastrointestinal tract, displays differential expression in cancerous tissues and is considered a potential drug target for liver and lung cancer. Galectin-4 is a tandem-repeat galectin characterized by two carbohydrate recognition domains connected by a linker-peptide. Despite their relevance to cell function and pathogenesis, structural characterization of full-length tandem-repeat galectins has remained elusive. Here, we investigate galectin-4 using X-ray crystallography, small- and wide-angle X-ray scattering, molecular modelling, molecular dynamics simulations, and differential scanning fluorimetry assays and describe for the first time a structural model for human galectin-4. Our results provide insight into the structural role of the linker-peptide and shed light on the dynamic characteristics of the mechanism of carbohydrate recognition among tandem-repeat galectins.</p>
</abstract>
</article-meta>
</front>
<body>
<p>Galectins are a family of glycan-binding proteins characterized by their affinity for β-galactosides and the presence of one or more structurally conserved carbohydrate recognition domains (CRDs)
<xref ref-type="bibr" rid="b1">1</xref>
. With fifteen members identified in vertebrates, galectins display diversity in ligand specificity and can be found in both intracellular and extracellular environments
<xref ref-type="bibr" rid="b2">2</xref>
<xref ref-type="bibr" rid="b3">3</xref>
. Notably, galectins have been shown to act as modulators of cell behaviour by regulating signalling processes as well as inflammatory and immune responses
<xref ref-type="bibr" rid="b4">4</xref>
. Galectins are promising candidates as diagnostic markers and novel drug targets for a number of human diseases
<xref ref-type="bibr" rid="b4">4</xref>
<xref ref-type="bibr" rid="b5">5</xref>
.</p>
<p>To date, three subtypes of galectins have been identified, based on the number and structural arrangement of the CRDs: prototype, chimera and tandem-repeat
<xref ref-type="bibr" rid="b6">6</xref>
. While high-resolution structures of many full-length galectins remain elusive, crystallographic studies have revealed a significant structural similarity among CRDs. Common to most CRDs is a conserved β-sandwich fold with an overall jellyroll topology as well as a signature sequence for carbohydrate recognition
<xref ref-type="bibr" rid="b7">7</xref>
.</p>
<p>The tandem-repeat subtype of galectins contains two distinct CRDs (galectin-4N at the N-terminus and galectin-4C at the C-terminus) connected in a single polypeptide chain by a linker region
<xref ref-type="bibr" rid="b6">6</xref>
. Studies with tandem-repeat galectins have shown that the linker, which likely mediates intramolecular interactions between the CRDs, influences the potency with which a specific biological response is induced
<xref ref-type="bibr" rid="b8">8</xref>
<xref ref-type="bibr" rid="b9">9</xref>
<xref ref-type="bibr" rid="b10">10</xref>
<xref ref-type="bibr" rid="b11">11</xref>
<xref ref-type="bibr" rid="b12">12</xref>
<xref ref-type="bibr" rid="b13">13</xref>
. Other proposed roles for the linker region include protein-protein interactions, membrane insertion, and positioning the CRDs
<xref ref-type="bibr" rid="b10">10</xref>
<xref ref-type="bibr" rid="b11">11</xref>
<xref ref-type="bibr" rid="b13">13</xref>
.</p>
<p>Despite the importance of the linker, structural studies of galectins have thus far been limited to the individual CRDs or to engineered tandem-repeat galectins in which the linker has been truncated. Furthermore, the anticipated flexibility of the linker and its susceptibility to proteolysis have made structural characterization of full-length tandem-repeat galectins particularly challenging. In order to unravel the structural mechanisms that govern signalling modulation by tandem-repeat galectins, we chose human galectin-4 as our model of study. Galectin-4 belongs to the tandem-repeat category of galectins, together with galectins -6, -8, -9 and -12. Galectin-4 is largely expressed by intestinal epithelial cells and shows opposing effects depending on the type of cancer.</p>
<p>Galectin-4 functions as a tumour suppressor of human colorectal and pancreatic cancer
<xref ref-type="bibr" rid="b14">14</xref>
<xref ref-type="bibr" rid="b15">15</xref>
<xref ref-type="bibr" rid="b16">16</xref>
. By contrast, in liver and lung cancer, two of the leading causes of cancer death worldwide, galectin-4 expression leads to increased metastasis and cancer progression
<xref ref-type="bibr" rid="b17">17</xref>
<xref ref-type="bibr" rid="b18">18</xref>
, suggesting its use as a promising target for drug development
<xref ref-type="bibr" rid="b5">5</xref>
. Here, we provide the first structural characterization of the full-length human galectin-4 using X-ray crystallography, small- and wide-angle X-ray scattering (SAXS/WAXS), molecular modelling, molecular dynamics simulations, and differential scanning fluorimetry assays. Our findings reveal that full-length galectin-4 folds as a compact structure and provide insight into the process by which the linker-peptide mediates recognition through correlated movements and transient interactions. These results shed light on the structural role of galectin-4’s linker-peptide and its biological function in this important class of proteins. Moreover, the generated knowledge and experimental tools described here can be exploited to investigate the role of galectin-4 under different pathological conditions.</p>
<sec disp-level="1">
<title>Results</title>
<sec disp-level="2">
<title>Protein production and thermal analysis of galectin-4, galectin-4N and galectin-4C</title>
<p>Galectin-4 is composed of 323 amino acid residues, which can be divided into an N-terminal domain (aa 1–150; galectin-4N), a linker-peptide (aa 151–178) and a C-terminal domain (aa 179–323; galectin-4C)
<xref ref-type="bibr" rid="b19">19</xref>
(
<xref ref-type="supplementary-material" rid="S1">Supplementary Fig. S1</xref>
). The full-length protein and its individual domains, galectin-4N and galectin-4C, were cloned, overexpressed, and purified as described in the Methods section. First, the folding stability of each construct was examined by differential scanning fluorimetry (Thermofluor), a methodology used to monitor protein unfolding. By measuring the fluorescence-probe intensity as a function of temperature, thermofluor assays allow for the comparison of melting temperatures (
<italic>T</italic>
<sub>
<italic>m</italic>
</sub>
), transition profiles and thermal shift (Δ
<italic>T</italic>
<sub>
<italic>m</italic>
</sub>
) values relative to the reference curves (obtained in buffer) under different conditions. Here, a positive Δ
<italic>T</italic>
<sub>
<italic>m</italic>
</sub>
indicates thermal stabilization induced by changes in the physicochemical environment.</p>
<p>Reference curves resulted in sigmoidal profiles with respective
<italic>T</italic>
<sub>
<italic>m</italic>
</sub>
values of 55.92 ± 0.05 °C for galectin-4, 56.8 ± 0.1 °C for galectin-4N and 68.12 ± 0.05 °C for galectin-4C (
<xref ref-type="fig" rid="f1">Fig. 1a</xref>
). The thermal behaviour of galectin-4 and its domains was also evaluated against the 94 additives from the Solubility & Stability Screen kit (Hampton Research) (
<xref ref-type="supplementary-material" rid="S1">Supplementary Table S1</xref>
). Analysis of thermal shift (Δ
<italic>T</italic>
<sub>
<italic>m</italic>
</sub>
) values in the presence of additives revealed that galectin-4C displays the largest Δ
<italic>T</italic>
<sub>
<italic>m</italic>
</sub>
values and the most distinctive behaviour under changes in the physicochemical environment (
<xref ref-type="fig" rid="f1">Fig. 1b</xref>
). Lower Δ
<italic>T</italic>
<sub>
<italic>m</italic>
</sub>
values were observed for the full-length protein than for its isolated CRDs, suggesting that full-length galectin-4 gains stability from the interaction between the CRDs.</p>
<p>The thermal shift of the three constructs was also evaluated in the presence of lactose, a low-affinity β-galactoside ligand for galectin-4. A hyperbolic dependence on lactose concentration was observed, allowing for the estimation of saturating Δ
<italic>T</italic>
<sub>
<italic>m</italic>
</sub>
values of 9.4 ± 0.4 °C, 9.3 ± 0.4 °C and 9.3 ± 0.6 °C for galectin-4, galectin-4N and galectin-4C, respectively (
<xref ref-type="fig" rid="f1">Fig. 1c</xref>
). Fitting of the apparent binding constant,
<italic>k</italic>
, for lactose yields similar values for galectin-4 and galectin-4N of 53 ± 6 and 50 ± 5 mM, respectively, and a
<italic>k</italic>
of 78 ± 10 mM for galectin-4C. Apparent affinities obtained by thermofluor, which are proportional to the dissociation constants
<xref ref-type="bibr" rid="b20">20</xref>
, are in agreement with previous findings that described lactose as a weak ligand, with galectin-4C displaying 1.5 times lower affinity than galectin-4N (1.3 mM and 1.9 mM for galectin-4N and galectin-4C, respectively
<xref ref-type="bibr" rid="b21">21</xref>
<xref ref-type="bibr" rid="b22">22</xref>
).</p>
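<p>As an illustration of how a hyperbolic dependence of the thermal shift on ligand concentration can yield a saturating shift and an apparent constant, the following minimal sketch assumes a single-site model, ΔTm = ΔTm,max·[L]/(k + [L]), and recovers both parameters by Hanes-Woolf linearisation. The functional form, the concentration series and the function names are assumptions made for illustration only; they are not taken from the fitting procedure used in this work, and the data points are noise-free synthetic values generated from the galectin-4 estimates reported above.</p>

```python
def hyperbolic_shift(conc_mM, dtm_max, k_mM):
    """Thermal shift predicted by an assumed single-site hyperbolic model."""
    return dtm_max * conc_mM / (k_mM + conc_mM)


def fit_hyperbola(concs_mM, shifts):
    """Estimate (dtm_max, k) by Hanes-Woolf linearisation.

    Rearranging the hyperbola gives  c / dTm = c / dtm_max + k / dtm_max,
    a straight line in c, so least squares on (c, c / dTm) yields
    slope = 1 / dtm_max and intercept = k / dtm_max.
    Concentrations and shifts must be non-zero.
    """
    xs = list(concs_mM)
    ys = [c / s for c, s in zip(concs_mM, shifts)]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    dtm_max = 1.0 / slope
    return dtm_max, intercept * dtm_max


# Hypothetical lactose concentrations (mM) and noise-free synthetic shifts
concs = [5.0, 10.0, 25.0, 50.0, 100.0, 200.0, 400.0]
shifts = [hyperbolic_shift(c, 9.4, 53.0) for c in concs]
dtm_max_fit, k_fit = fit_hyperbola(concs, shifts)
print(round(dtm_max_fit, 2), round(k_fit, 1))
```

<p>With real, noisy melting data a non-linear least-squares fit would generally be preferable, because the linearisation distorts the error structure.</p>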
<p>Additionally, melting curves for full-length galectin-4 were evaluated at different ionic strengths and pH values using the Solubility & Stability Screen 2 kit (Hampton Research). Although the
<italic>T</italic>
<sub>
<italic>m</italic>
</sub>
for full-length galectin-4 decreased as the pH was lowered, the melting curves were consistently characteristic of a single-domain protein denaturing event, suggesting that the global structure of galectin-4 remains stable as a compact unit over a wide range of conditions.</p>
</sec>
<sec disp-level="2">
<title>Structural models for galectin-4N, galectin-4C and full-length galectin-4</title>
<p>To elucidate the full structural architecture of galectin-4, we solved the crystal structures of galectin-4N and galectin-4C at 1.48 Å and 1.78 Å resolution, respectively
<xref ref-type="bibr" rid="b23">23</xref>
<xref ref-type="bibr" rid="b24">24</xref>
(
<xref ref-type="table" rid="t1">Table 1</xref>
). The final models for galectin-4N and galectin-4C comprise residues 5 to 152 and 184 to 323, respectively, and share the same structural features previously described by Bum-Erdene and co-workers
<xref ref-type="bibr" rid="b21">21</xref>
<xref ref-type="bibr" rid="b22">22</xref>
. Both structures show the canonical β-sandwich fold arranged in a jellyroll topology, in which the monomer is formed by two antiparallel β-sheets, each composed of six (F0-F5/F0′-F5′ and S1-S6/S1′-S6′) β-strands (
<xref ref-type="fig" rid="f2">Fig. 2a</xref>
).</p>
<p>Structural analysis of the galectin-4N and galectin-4C domains, which share a root mean square deviation (RMSD) of 1.2 Å between Cα atoms, reveals a large difference in charge distribution when the electrostatic potential surface is calculated at physiological pH (7.4) (
<xref ref-type="fig" rid="f2">Fig. 2b</xref>
). The galectin-4C surface charge distribution is mostly positive, whereas the galectin-4N surface displays a more heterogeneous distribution with a positive region localized in the binding site.</p>
<p>The carbohydrate-binding site is located in a shallow pocket composed of residues present in the S4/S4′, S5/S5′ and S6/S6′ strands and the S5/S5′ adjacent loop. The residues involved are His63/236, Asn65/238, Arg67/240, Asn77/249, Trp84/256, Glu87/259 and Arg89/Lys261 in the galectin-4N/galectin-4C structures, respectively (
<xref ref-type="fig" rid="f2">Fig. 2c,d</xref>
). The S2/S2′ and S3/S3′ strands, thought to contribute to the selectivity between galectin-4N and galectin-4C domains, form an extended cleft that permits interaction with different ligands. The main amino acid substitutions in the galectin-4N/galectin-4C structures are His135/Thr309, Gln137/Glu311 and Asp139/Gln313 for the S2/S2′ strand and Arg45/Ser220, Phe47/Ala222 and Val51/Lys226 for the S3/S3′ strand (
<xref ref-type="fig" rid="f2">Fig. 2c,d</xref>
). Arg45 in the S3 strand from galectin-4N has been identified as the main residue to interact with a cholesterol sulphate ligand
<xref ref-type="bibr" rid="b25">25</xref>
and to contribute weakly to lactose-3′-sulfate interaction
<xref ref-type="bibr" rid="b22">22</xref>
. Asn224 and Lys226 (S3′ strand), as well as Glu311 and Gln313 (S2′ strand) from galectin-4C have been shown to establish additional interactions with lacto-N-tetraose and lacto-N-neotetraose ligands
<xref ref-type="bibr" rid="b21">21</xref>
. Also in galectin-4C, Ser220 was identified as responsible for A-type saccharide preference
<xref ref-type="bibr" rid="b21">21</xref>
. Additional differences are observed in the loops between strands S3/S3′-S4/S4′ and S4/S4′-S5/S5′, where insertions are observed when comparing galectin-4N and galectin-4C amino acid sequences (
<xref ref-type="fig" rid="f2">Fig. 2d</xref>
).</p>
<p>A structural model for full-length galectin-4 was obtained by combining molecular modelling and molecular dynamics (MD) simulations. First,
<italic>ab initio</italic>
prediction was used to generate different models of the linker-peptide. The best models, which share a compact structure and the presence of a short helix segment, were selected based on geometry and on agreement between observed and predicted secondary structure content. The linkers were combined as a single polypeptide chain with the X-ray structures of galectin-4N and galectin-4C, which were randomly arranged in relation to each other, giving rise to six different starting models for full-length galectin-4. The model with the lowest potential energy (
<xref ref-type="fig" rid="f3">Fig. 3a</xref>
) was subjected to conformational refinement by MD. We began with a standard backbone-restrained solvation and thermalisation (2 ns) to achieve a pressure of 1 atm and a temperature of 37 °C (310 K) in the simulation box. A 30 ns production simulation was subsequently performed to ensure that the system reached and maintained proper equilibrium. The resulting trajectory was then analysed by principal component analysis (PCA), allowing us to select the lowest energy frame, which was designated as the starting point for all further rounds of MD simulations described in this work (
<xref ref-type="fig" rid="f3">Fig. 3b</xref>
).</p>
<p>The galectin-4 model displays four antiparallel β-sheets connected by a linker-peptide that can be described as a proline-rich hinge followed by a short α-helix (amino acids 170–173) and an extended region (
<xref ref-type="fig" rid="f3">Fig. 3b</xref>
). We observe a compact structure, having overall dimensions of 74 Å × 55 Å × 45 Å, in which the CRDs interact with each other and with the linker-peptide. These interactions are stabilized by 10 hydrogen bonds and 152 non-bonded contacts (
<xref ref-type="fig" rid="f3">Fig. 3c</xref>
). The contact areas between interfaces were determined to be 465 Å
<sup>2</sup>
(galectin-4N/linker), 349 Å
<sup>2</sup>
(galectin-4N/galectin-4C) and 418 Å
<sup>2</sup>
(linker/galectin-4C).</p>
</sec>
<sec disp-level="2">
<title>Solution conformation of human galectin-4</title>
<p>To evaluate the energy-minimized full-length galectin-4 model obtained by MD (
<xref ref-type="fig" rid="f3">Fig. 3b</xref>
), the overall conformation of the protein was examined in solution by X-ray scattering, a technique that is ideally suited for probing ligand-induced conformational changes and for examining dynamic proteins that are challenging to crystallise. In-line size exclusion chromatography (SEC) was used to separate any mixtures as well as to ensure accurate background subtractions. Scattering was measured over a wide range of scattering angles on galectin-4 both in the absence of any ligands and in the presence of 30 mM lactose (
<xref ref-type="supplementary-material" rid="S1">Supplementary Fig. S2</xref>
). For each sample, approximately 500 exposures were collected as the elution flowed directly into a continuous-flow cell. In each case, sample homogeneity was confirmed in the central region of the elution peak (
<xref ref-type="supplementary-material" rid="S1">Supplementary Fig. S2</xref>
, blue regions) by singular value decomposition (SVD) and Guinier analysis (
<xref ref-type="supplementary-material" rid="S1">Supplementary Fig. S2</xref>
)
<xref ref-type="bibr" rid="b26">26</xref>
, and thus, the scattering profiles within these regions were averaged (
<xref ref-type="fig" rid="f4">Fig. 4a</xref>
, gray circles). A comparison of the experimental curve with the theoretical scattering of a model of galectin-4, in which the CRDs are non-associating (
<xref ref-type="fig" rid="f3">Fig. 3a</xref>
, dotted curve) shows a poor fit, whereas a comparison with the theoretical scattering calculated from the full-length model described above (
<xref ref-type="fig" rid="f3">Fig. 3b</xref>
, black curve) shows remarkable agreement. Consistent with this result, the
<italic>ab initio</italic>
shape reconstruction of galectin-4 derived from the SAXS data also suggests a compact conformation in which galectin-4N and galectin-4C are associated (
<xref ref-type="fig" rid="f4">Fig. 4b</xref>
). Interestingly, the scattering of galectin-4 in the presence of lactose is nearly superimposable with that of ligand-free galectin-4. Only a subtle difference is apparent at low angles, corresponding to features at large length scales. Consistent with this, Guinier analysis yields slightly different radii of gyration for galectin-4 without and with lactose of 23.7 ± 0.1 Å and 24.9 ± 0.1 Å, respectively. The subtle expansion in the conformation upon addition of lactose is best visualized by an increase in the width of the pair-distance distribution function,
<italic>P</italic>
(
<italic>r</italic>
) (
<xref ref-type="fig" rid="f4">Fig. 4c</xref>
).</p>
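<p>For readers unfamiliar with Guinier analysis, the sketch below shows how a radius of gyration can be extracted from the low-angle region, where the Guinier approximation ln I(q) = ln I(0) − q²Rg²/3 holds (typically for q·Rg below about 1.3). It is a minimal illustration on noise-free synthetic curves generated from the two Rg values reported above; the q-grid, the function names and the definition of q as 4π sin θ/λ in inverse ångströms are assumptions, and this is not the software used to process the experimental data.</p>

```python
import math


def guinier_fit(q_values, intensities):
    """Return (Rg, I0) from a straight-line fit of ln I against q**2.

    Only meaningful in the Guinier region; the slope must be negative.
    """
    xs = [q * q for q in q_values]
    ys = [math.log(i) for i in intensities]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx                    # equals -Rg**2 / 3
    intercept = mean_y - slope * mean_x  # equals ln I0
    return math.sqrt(-3.0 * slope), math.exp(intercept)


def synthetic_guinier(q_values, rg, i0=1.0):
    """Ideal Guinier-law intensities (illustrative data, not measurements)."""
    return [i0 * math.exp(-((q * rg) ** 2) / 3.0) for q in q_values]


# Hypothetical q-grid in inverse angstroms; q * Rg stays below 1.3
q = [0.010 + 0.002 * i for i in range(20)]
rg_apo, i0_apo = guinier_fit(q, synthetic_guinier(q, 23.7))
rg_lac, i0_lac = guinier_fit(q, synthetic_guinier(q, 24.9))
print(round(rg_apo, 1), round(rg_lac, 1))
```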
</sec>
<sec disp-level="2">
<title>Molecular dynamics simulations</title>
<p>We performed molecular dynamics simulations of both galectin-4 and the galectin-4-lactose complex to investigate the behaviour of the protein in the presence and absence of a ligand. For each system, we ran four independent trajectories of 100 ns using different seeds (named MD 1, MD 2, MD 3 and MD 4). Analysis of the RMSD for backbone atoms showed that all simulated systems reached equilibrium before 100 ns (
<xref ref-type="supplementary-material" rid="S1">Supplementary Figs S3 and S4</xref>
). Variations among the MD simulations showed that the apo structure adopts two main conformations: an “open” conformation with an average Rg of 23 Å and a “closed” conformation with an average Rg of 22 Å (
<xref ref-type="supplementary-material" rid="S1">Supplementary Fig. S3</xref>
). The Rg histogram for the MD simulations also revealed that in the protein-lactose complex, galectin-4 is stabilized in the “open” conformation (
<xref ref-type="supplementary-material" rid="S1">Supplementary Fig. S4</xref>
).</p>
<p>Analyses of RMSD plots for each independent domain (
<xref ref-type="supplementary-material" rid="S1">Supplementary Figs S3 and S4</xref>
) reveal that galectin-4C remained stable throughout the MD trajectory. Galectin-4N was shown to converge to similar structures, with an average deviation of 1.4 Å. Larger conformational fluctuations were observed in the linker-peptide, as expected for this type of disordered secondary structural element (
<xref ref-type="supplementary-material" rid="S1">Supplementary Figs S3 and S4</xref>
).</p>
</sec>
<sec disp-level="2">
<title>Inter-domain communication in galectin-4</title>
<p>To ensure that the analysis was performed on a well-thermalised system, we extended the MD 1 simulation to 250 ns and compared the 150 ns interval between 100 and 250 ns for both simulations (with and without lactose). RMSD plots (
<xref ref-type="fig" rid="f5">Fig. 5a,b</xref>
) consistently showed differing galectin-4 behaviour in the absence and presence of lactose. In both cases, the linker-peptide generally demonstrated the highest deviation values, which are correlated with conformational changes associated with the full-length structure (
<xref ref-type="fig" rid="f5">Fig. 5a,b</xref>
). Moreover, in the presence of the ligand, the galectin-4N domain showed a higher structural variability than galectin-4C.</p>
<p>For both MD simulations, we evaluated mobility using root mean square fluctuation (RMSF) box charts (
<xref ref-type="fig" rid="f5">Fig. 5c</xref>
). The average RMSF was 1.0 ± 0.4 Å for galectin-4 and 2.0 ± 0.8 Å for the galectin-4-lactose system. Overall, the highest B-factors were in the galectin-4-lactose system, indicating greater flexibility than galectin-4 without lactose (
<xref ref-type="fig" rid="f5">Fig. 5c</xref>
, inset). In both cases, the flexible regions were mainly found on the N-terminus, linker-peptide and regions between β-strands, with an emphasis on seven loops of galectin-4 (S3-S4, S5-S6, S3′-S4′, S4′-S5′, S5′-S6′, F4′-F5′ and F5′-S2′) and sixteen loops of galectin-4-lactose (F0-S1, F2-S3, S3-S4, S4-S5, S5-S6, S6-F3, S2-F1, F0′-S1′, F2′-S3′, S3′-S4′, S4′-S5′, S5′-S6′, S6′-F3′, F4′-F5′, F5′-S2′ and S2′-F1′).</p>
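<p>The RMSF values discussed above measure how far each atom strays, on average, from its own mean position over the trajectory. The following minimal sketch computes a per-atom RMSF for a toy trajectory. It assumes that all frames have already been superposed on a common reference, and the data layout and function name are illustrative choices rather than the analysis tools used in this work.</p>

```python
import math


def rmsf(frames):
    """Per-atom root mean square fluctuation.

    frames[t][i] is the (x, y, z) position of atom i in frame t; frames are
    assumed to be superposed on a common reference beforehand.
    """
    n_frames = len(frames)
    n_atoms = len(frames[0])
    result = []
    for i in range(n_atoms):
        # Mean position of atom i over the trajectory
        mean = [sum(frame[i][d] for frame in frames) / n_frames for d in range(3)]
        # Mean squared displacement from that mean position
        msd = sum(
            sum((frame[i][d] - mean[d]) ** 2 for d in range(3))
            for frame in frames
        ) / n_frames
        result.append(math.sqrt(msd))
    return result


# Toy trajectory: atom 0 is static, atom 1 oscillates by one unit along x
toy = [
    [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)],
    [(0.0, 0.0, 0.0), (-1.0, 0.0, 0.0)],
    [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)],
    [(0.0, 0.0, 0.0), (-1.0, 0.0, 0.0)],
]
toy_rmsf = rmsf(toy)
print(toy_rmsf)
```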
<p>This protein flexibility is related to the nature of intramolecular interactions. Hydrogen bond pairs with more than 10% occupancy were analysed between domains (
<xref ref-type="supplementary-material" rid="S1">Supplementary Table S2</xref>
). For the MD simulation without lactose, we observed four H-bond pairs between galectin-4N/linker, five between galectin-4C/linker and four between galectin-4N/galectin-4C, of which only five had greater than 50% occupancy. With lactose, there are seven H-bond pairs between galectin-4N/linker, nine between galectin-4C/linker and five between galectin-4N/galectin-4C; however, only eight pairs interacted more than 50% of the time. Although the two MD simulations share only one H-bond pair, 148ASN(D22)-171HIS(ND1), eight common residues are involved in different H-bonding interactions. Moreover, a structural comparison between simulations at 250 ns revealed that the main interactions are non-bonded contacts, many of which involve the same residues in both systems.</p>
<p>Due to its more compact structure, the model without ligand showed larger interface areas than the galectin-4-lactose complex (
<xref ref-type="supplementary-material" rid="S1">Supplementary Fig. S5</xref>
). The contact areas between surfaces in galectin-4 were determined to be 540 Å
<sup>2</sup>
(galectin-4N/linker), 481 Å
<sup>2</sup>
(galectin-4N/galectin-4C) and 334 Å
<sup>2</sup>
(linker/galectin-4C). For the structure with lactose, these values were 325 Å
<sup>2</sup>
, 202 Å
<sup>2</sup>
and 428 Å
<sup>2</sup>
, respectively. These interface areas suggest that in the first system the linker-peptide is shifted towards galectin-4N, while in the system with lactose it is shifted towards galectin-4C. The dynamic nature of the interface, where the interactions are sustained by transient contacts, gives this region an intrinsic flexibility.</p>
<p>Principal component analysis (PCA) was used to estimate the primary domain motions (
<xref ref-type="fig" rid="f5">Fig. 5d,e</xref>
). The results indicate that only a portion of the linker showed significant movement in the simulation without lactose. In contrast, both CRDs showed opposing rotational movements in the presence of lactose (
<xref ref-type="fig" rid="f5">Fig. 5d,e</xref>
). According to the RMSD plot (
<xref ref-type="fig" rid="f5">Fig. 5b</xref>
), the structural rearrangement in the linker is associated with a movement that pushes the CRDs in opposite directions (
<xref ref-type="fig" rid="f5">Fig. 5d,e</xref>
).</p>
<p>Additionally, correlation plots showed that both structures, galectin-4 and galectin-4-lactose, have different structural correlation patterns (
<xref ref-type="fig" rid="f5">Fig. 5f,g</xref>
). Galectin-4 mainly showed positive intra-domain correlations, with few anti-correlated movements between CRDs. Although the linker showed high flexibility, its movement was not correlated with any domain (
<xref ref-type="fig" rid="f5">Fig. 5f</xref>
). The galectin-4-lactose complex, in contrast, showed a larger number of positive and negative correlations (
<xref ref-type="fig" rid="f5">Fig. 5g</xref>
), involving residues of all domains.</p>
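<p>Correlation plots of this kind are commonly built from a normalised covariance of atomic displacements, C(i,j) = ⟨Δr(i)·Δr(j)⟩ / √(⟨Δr(i)²⟩⟨Δr(j)²⟩), which ranges from −1 (anti-correlated) to +1 (correlated). The sketch below evaluates this quantity for pairs of atoms in a toy trajectory. The formula is the standard dynamic cross-correlation definition and is assumed here for illustration; it may differ in detail from the procedure used to generate the plots in this work, and the function name and data are hypothetical.</p>

```python
import math


def cross_correlation(frames, i, j):
    """Normalised displacement correlation between atoms i and j.

    frames[t][k] is the (x, y, z) position of atom k in frame t (frames
    assumed superposed). Undefined if either atom never moves.
    """
    n = len(frames)
    mean_i = [sum(f[i][d] for f in frames) / n for d in range(3)]
    mean_j = [sum(f[j][d] for f in frames) / n for d in range(3)]
    cov = var_i = var_j = 0.0
    for f in frames:
        di = [f[i][d] - mean_i[d] for d in range(3)]
        dj = [f[j][d] - mean_j[d] for d in range(3)]
        cov += sum(a * b for a, b in zip(di, dj))
        var_i += sum(a * a for a in di)
        var_j += sum(b * b for b in dj)
    return cov / math.sqrt(var_i * var_j)


# Toy trajectory: atoms 0 and 1 move together, atom 2 moves in opposition
toy = [
    [(1.0, 0.0, 0.0), (6.0, 0.0, 0.0), (-1.0, 0.0, 0.0)],
    [(-1.0, 0.0, 0.0), (4.0, 0.0, 0.0), (1.0, 0.0, 0.0)],
    [(1.0, 0.0, 0.0), (6.0, 0.0, 0.0), (-1.0, 0.0, 0.0)],
    [(-1.0, 0.0, 0.0), (4.0, 0.0, 0.0), (1.0, 0.0, 0.0)],
]
c_same = cross_correlation(toy, 0, 1)
c_opposite = cross_correlation(toy, 0, 2)
print(c_same, c_opposite)
```

<p>In this picture, two rigid domains that rotate in opposite directions would give a block of negative values between their residues, while residues within each domain remain positively correlated.</p>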
<p>Despite this movement, the low RMSD of each domain throughout the trajectory (
<xref ref-type="fig" rid="f5">Fig. 5b</xref>
) indicates low structural variability. Even so, galectin-4N and galectin-4C show long-range anti-correlated movements with respect to each other (
<xref ref-type="fig" rid="f5">Fig. 5g</xref>
). The combination of these two behaviours reflects a correlated movement of rigid bodies mediated by the exchange of weak interactions with the linker.</p>
</sec>
</sec>
<sec disp-level="1">
<title>Discussion</title>
<p>It is well known that CRDs share a conserved β-sandwich fold and that there is a sequence signature for carbohydrate recognition and binding (
<xref ref-type="supplementary-material" rid="S1">Supplementary Fig. S1</xref>
)
<xref ref-type="bibr" rid="b7">7</xref>
. However, one of the most notable properties of galectins and their CRDs is the meticulous way in which they discriminate among different glycans, resulting in a variable and complex biological response
<xref ref-type="bibr" rid="b27">27</xref>
<xref ref-type="bibr" rid="b28">28</xref>
.</p>
<p>Studies have demonstrated that the tandem-repeat galectins are more potent than galectins-1 and -3 in activating signalling in T cells and neutrophils
<xref ref-type="bibr" rid="b9">9</xref>
<xref ref-type="bibr" rid="b12">12</xref>
<xref ref-type="bibr" rid="b13">13</xref>
. In addition, they display a broad spectrum of biological activities as major signalling modulators both inside and outside the cell. This characteristic suggests that a combination of two distinct CRDs and a linker-peptide brings together chemical, structural and dynamic diversity able to impact on potency and on the plurality of carbohydrate-dependent events involved in their signalling ability and adhesive properties
<xref ref-type="bibr" rid="b10">10</xref>
.</p>
<p>The impact of tandem-repeat galectins on biological response has been associated with structural flexibility, relative orientation, and spacing between CRDs
<xref ref-type="bibr" rid="b9">9</xref>
. However, structural and dynamic characteristics of tandem-repeat galectins, including the type of interactions between CRDs and the linker-peptide, remain elusive and thus merit concentrated investigative efforts. Despite the importance of this class of proteins in both physiological and pathological processes, the flexibility imposed by the linker and its susceptibility to proteolysis
<xref ref-type="bibr" rid="b29">29</xref>
have made these studies very challenging.</p>
<p>As an important step toward assessing the underlying mechanisms that govern the function of tandem-repeat galectins acting on multiple targets, we present for the first time a structural model of human galectin-4 based on a combination of theoretical and experimental approaches. The final model of galectin-4, constructed based on X-ray crystallography, molecular modelling and MD simulations and further supported by SAXS experiments, reveals that galectin-4 folds as a compact structure in which the CRDs interact both with each other and with the linker-peptide (
<xref ref-type="fig" rid="f3">Fig. 3b</xref>
). The galectin-4 domains, galectin-4N, galectin-4C and the linker-peptide, were found to be mainly connected by weak (hydrogen bonds and other non-bonded interactions) and transient contacts, revealing the dynamic nature of the interfacial interactions (
<xref ref-type="supplementary-material" rid="S1">Supplementary Table S2</xref>
).</p>
<p>Experimental evidence for interaction between the CRDs was also observed when comparing the thermal denaturation profiles of the full-length galectin-4 with its independent domains (
<xref ref-type="fig" rid="f1">Fig. 1a</xref>
). Although there was an 11 °C difference between the melting temperatures of the CRDs, large enough to be distinguished if the unfolding process were characterized by sequential (non-cooperative) unfolding of the CRDs, the profile for the melting curve obtained for full-length galectin-4 was consistent with a single-domain protein denaturing event (
<xref ref-type="fig" rid="f1">Fig. 1a</xref>
). The same profile was observed when galectin-4 was subjected to different pH values, ionic strengths and additives. This result reinforces the hypothesis that CRDs are not only associated under physiological conditions, but also remain together under diverse conditions, including those that mimic acidic extracellular microenvironments characteristic of tumour tissue
<xref ref-type="bibr" rid="b30">30</xref>
in which the protein is often present.</p>
<p>Corroborating the idea of a compact structure, full-length galectin-4 was also shown to be more stable than its independent domains (
<xref ref-type="fig" rid="f1">Fig. 1b</xref>
). In fact, a comparison of the melting curves of galectin-4, galectin-4N and galectin-4C allowed us to compare the behaviour of isolated CRDs with full-length galectin-4 and infer the individual contribution of each CRD to the galectin-4 structure.</p>
<p>Differences between the galectin-4N and galectin-4C melting curves under the different conditions are notable (
<xref ref-type="fig" rid="f1">Fig. 1b</xref>
,
<xref ref-type="supplementary-material" rid="S1">Supplementary Table S1</xref>
) and can be explained as a consequence of variation in their chemical properties, i.e., number and charge distribution of amino acids among CRDs (
<xref ref-type="fig" rid="f2">Fig. 2b</xref>
). Galectin-4C was shown to be more sensitive to changes in the chemical environment, displaying larger thermal shift (Δ
<italic>T</italic>
<sub>
<italic>m</italic>
</sub>
) values, but it appears more stable than galectin-4N overall (
<xref ref-type="fig" rid="f1">Fig. 1b</xref>
,
<xref ref-type="supplementary-material" rid="S1">Supplementary Table S1</xref>
). In agreement, MD data show that galectin-4C is more rigid (
<xref ref-type="fig" rid="f5">Fig. 5b</xref>
), a requirement to compensate for increased thermal fluctuations. In contrast, the larger RMSD values observed during simulation reveal that galectin-4N can be more plastic (
<xref ref-type="fig" rid="f5">Fig. 5b</xref>
), a characteristic that allows this domain to be more promiscuous in carbohydrate recognition and binding, as well as more potent in achieving a biological response.</p>
<p>Careful analysis of melting curves and thermal shift values under different chemical environments reveals that galectin-4 takes advantage of the stability of both domains to remain stable over a larger range of chemical conditions, i.e., the most stable domain governs the denaturation process of galectin-4 (
<xref ref-type="fig" rid="f1">Fig. 1b</xref>
). This combined response is a reflection of its compact structure and of the ability of the linker-peptide to switch back and forth between CRDs, which allows transient interactions to stabilize the more susceptible domain (
<xref ref-type="supplementary-material" rid="S1">Supplementary Table S2</xref>
).</p>
<p>The similarity between the hyperbolic dependence on lactose concentration for galectin-4 and galectin-4N indicates that the response for the full-length protein is governed by a single binding site with similar properties to those of the galectin-4N domain (
<xref ref-type="fig" rid="f1">Fig. 1c</xref>
). The lack of clear evidence of the contribution of the galectin-4C binding site to full-length protein behaviour (
<xref ref-type="fig" rid="f1">Fig. 1c</xref>
) can be explained as resulting from the contribution of the linker, as observed in our MD simulations (
<xref ref-type="supplementary-material" rid="S1">Supplementary Fig. S5</xref>
). Whether the cross talk between galectin-4N and galectin-4C has a positive or a negative impact on galectin-4C lactose recognition remains to be elucidated.</p>
<p>Thermofluor studies complemented by our MD data provide insight into protein flexibility under different conditions. These results demonstrate that the sequence variation among galectin-4 CRDs, although preserving the integrity of the CRD β-fold sandwich and sequence signature for carbohydrate recognition, enables the CRDs to respond differently to a given chemical environment. Thus, physiologically, the CRDs not only work as agents of glycan recognition, but can also be considered biochemical sensors of the microenvironment, important for adapting the lectin properties of galectin-4 to different conditions and thereby ensuring its biological impact in distinct physiological and pathological processes.</p>
<p>Unlike the apo protein, the galectin-4-lactose complex is stabilized in an open conformation, characterized by a hinge-bending motion (
<xref ref-type="fig" rid="f5">Fig. 5d,e</xref>
) and a decrease in contact areas between domains (
<xref ref-type="supplementary-material" rid="S1">Supplementary Fig. S5</xref>
). Consistent with our MD results (
<xref ref-type="supplementary-material" rid="S1">Supplementary Fig. S4</xref>
), an increase in radius of gyration is observed by SAXS in the presence of lactose. Covariance analysis showed that the movement between linker and CRDs is directly correlated (
<xref ref-type="fig" rid="f5">Fig. 5g</xref>
). At the same time, analysis of both RMSD and RMSF distributions demonstrates that both CRDs move as rigid bodies, without any significant intra-domain distortion or disruption of the carbohydrate-binding site (
<xref ref-type="fig" rid="f5">Fig. 5b,c</xref>
).</p>
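<p>As a rough illustration of why a hinge opening raises the radius of gyration, the following sketch treats each CRD as a single point mass. This is a deliberate oversimplification and not the analysis performed in this work; the function name, coordinates and masses are hypothetical.</p>

```python
import math

def radius_of_gyration(coords, masses):
    """Mass-weighted radius of gyration of a set of 3D points."""
    total = sum(masses)
    # Centre of mass, one component at a time.
    center = [sum(m * c[k] for m, c in zip(masses, coords)) / total for k in range(3)]
    # Mass-weighted mean squared distance from the centre of mass.
    msd = sum(
        m * sum((c[k] - center[k]) ** 2 for k in range(3))
        for m, c in zip(masses, coords)
    ) / total
    return math.sqrt(msd)

# Two equal point masses standing in for the two CRDs (arbitrary length units).
closed = [(0.0, 0.0, 0.0), (3.0, 0.0, 0.0)]   # hypothetical compact arrangement
opened = [(0.0, 0.0, 0.0), (4.0, 0.0, 0.0)]   # hypothetical open arrangement

rg_closed = radius_of_gyration(closed, [1.0, 1.0])
rg_open = radius_of_gyration(opened, [1.0, 1.0])
print(rg_closed, rg_open)
```

<p>In this toy model the radius of gyration is simply half the inter-domain distance, so any motion that separates the domains increases it, whereas rigid-body motion within a domain leaves it unchanged.</p>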
<p>Together, thermofluor, SAXS and MD analyses associate this lactose-stabilized, elbow-hinged switch in the full-length galectin-4 with a gain of thermal stability in each individual CRD domain (
<xref ref-type="fig" rid="f1">Fig. 1c</xref>
) and flexibility (
<xref ref-type="fig" rid="f5">Fig. 5c</xref>
). In other words, the enthalpy gain associated with lactose binding is compensated by an entropy loss within CRD domains and is correlated with an entropy gain in the full structure.</p>
<p>Our work also sheds light on the role of the linker-peptide as a key element in tandem-repeat galectins. In the galectin-4 model, the linker was observed to function as a molecular hinge that mediates the interaction between the CRDs (
<xref ref-type="fig" rid="f3">Fig. 3c</xref>
), thanks to its high content of proline residues (28.6%), which imposes severe restrictions on the conformation and movement of this region. In fact, a comparison among the five known tandem-repeat galectins and their isoforms reveals the existence of ten different linker-peptides characterized by high variability in length and amino acid distribution, but sharing a high content of proline residues (
<xref ref-type="supplementary-material" rid="S1">Supplementary Fig. S1</xref>
). This feature affects the global structure of tandem-repeat galectins and the manner in which the linker-peptide coordinates the movement and distance between CRDs. Thus, it is reasonable to predict that each member of the tandem-repeat galectin subfamily possesses a structural arrangement that depends on features of all individual domains. Galectin-4 and its homologue galectin-6, for example, share high sequence identity but have very distinct linker-peptides capable of offering unique structural and dynamic features for each protein, and in turn unique biological roles. Our model for galectin-4 provides the basis for further investigation.</p>
<p>Notably, all tandem-repeat galectin linker-peptides share proline-rich regions (PRRs). Besides their influence on protein structure and stability, PRRs are also described as binding domains
<xref ref-type="bibr" rid="b31">31</xref>
. In particular, they have a unique architecture which allows them to participate in molecular interactions that rely on multiple weak binding sites
<xref ref-type="bibr" rid="b31">31</xref>
. This architecture is characterized by restricted mobility, which reduces the unfavourable entropy loss of peptides upon binding. It is further influenced by the flat hydrophobic surface of prolines and the characteristics of the amide bond preceding proline, which make it a strong hydrogen bond acceptor. The unique architecture of PRRs can be particularly important in protein-protein and protein-nucleic acid interactions involved in intracellular signalling dependent on tandem-repeat galectins
<xref ref-type="bibr" rid="b4">4</xref>
. In particular, the continuous surface observed in galectin-4, as a consequence of its single domain arrangement, may favour protein-protein interactions including galectin-4 dimerization, as previously observed
<xref ref-type="bibr" rid="b25">25</xref>
. This is in contrast to a scenario in which the CRDs are flexible and move independently of each other.</p>
<p>In summary, a multi-technique approach has allowed us to investigate the structure of galectin-4 and its thermal and dynamic behaviours. Our results suggest that changes in the physicochemical environment have a direct effect on the ability of the CRDs to reach different conformational states, and in turn modulate ligand recognition. The relative positions of the CRDs and the extent of cross talk between them depend on the structural features of the linker-peptide, in an orchestrated mechanism of detection and response to a cellular stimulus.</p>
</sec>
<sec disp-level="1">
<title>Methods</title>
<sec disp-level="2">
<title>Protein cloning, expression and purification</title>
<p>The human galectin-4 open reading frame (GenBank: CR536544.1), coding for amino acids 1–323, was amplified from a previously constructed plasmid encoding galectin-4 and was cloned into the
<italic>Eco</italic>
RI/
<italic>Xho</italic>
I site of the pET-28a (Novagen) modified vector, pET-28a-SUMO. This vector was designed to produce an N-terminal His-tagged SUMO fusion protein via the insertion of a carrier ubiquitin-like protein, SMT3 from
<italic>Saccharomyces cerevisiae</italic>
(UniProtKB/Swiss-prot: Q12306.1), between the
<italic>Nhe</italic>
I and
<italic>Bam</italic>
HI sites. DNA sequencing confirmed proper insertion of the galectin-4 gene fragment into the pET28a-SUMO vector.
<italic>Escherichia coli</italic>
Rosetta (DE3) cells (Novagen), transformed with the expression vector, were cultured in LB media containing 34 μg ml
<sup>−1</sup>
chloramphenicol and 30 μg ml
<sup>−1</sup>
kanamycin at 37 °C. Overproduction of recombinant galectin-4 was induced by adding 50 μM of isopropyl β-D-1-thiogalactopyranoside once the optical density OD
<sub>600</sub>
reached 0.5. Growth continued for 24 h at 25 °C and 180 rev min
<sup>−1</sup>
. Cells were harvested by centrifugation at 10,000
<italic>g</italic>
for 10 minutes at 4 °C. The cell pellet was kept on ice and suspended in lysis buffer (50 mM monosodium phosphate pH 8.0, 600 mM NaCl, 14 mM β-mercaptoethanol and 1 tablet of EDTA-free SIGMA
<italic>FAST</italic>
<sup>TM</sup>
protease inhibitor cocktail). Cells were subsequently disrupted by ten 30 s, 10 W sonication pulses applied at 30 s intervals. The lysate was then clarified by centrifugation at 4 °C and 16,000 
<italic>g</italic>
for 30 minutes. The resulting supernatant was loaded onto a Ni-NTA column pre-equilibrated with buffer A (50 mM monosodium phosphate pH 8.0, 600 mM NaCl and 14 mM β-mercaptoethanol). The column was washed with a step gradient of 0 and 25 mM imidazole added to buffer A, at ten column volumes each. The His
<sub>6</sub>
-SUMO-galectin-4 fusion eluted with ten column volumes of buffer A plus 500 mM imidazole. Protein fractions were identified by their absorbance at 280 nm, pooled, concentrated using a 10 kDa cut-off centrifugal filter unit Amicon
<sup>®</sup>
Ultra-15 (Millipore) and dialyzed against buffer A. The His
<sub>6</sub>
-tagged SUMO was cleaved by ULP1 protease (Ubiquitin-like-specific Protease 1, EC 3.4.22.68) for 16 h at 8 °C. The sample was subsequently loaded onto a Ni-NTA resin column where galectin-4 was separated from ULP1 and SUMO through elution with buffer A plus 25 mM imidazole.</p>
<p>Galectin-4N (N-terminal domain from human galectin-4, residues 1–152)
<xref ref-type="bibr" rid="b23">23</xref>
and galectin-4C (C-terminal domain from human galectin-4, residues 179–323)
<xref ref-type="bibr" rid="b24">24</xref>
were cloned, expressed and purified as previously described. All three proteins were further submitted to size exclusion chromatography using a Superdex200 10/300 column (GE Healthcare) pre-equilibrated with 50 mM HEPES pH 7.2, 150 mM NaCl and 14 mM β-mercaptoethanol. Purity of the resultant fractions was analysed by SDS-PAGE stained with Coomassie Brilliant Blue.</p>
</sec>
<sec disp-level="2">
<title>Thermofluor for galectin-4, galectin-4N and galectin-4C</title>
<p>Thermofluor was used to map the response to chemical environments of galectin-4 and its domains galectin-4N and galectin-4C. The experiments were conducted in an Mx3005P RT-PCR (Agilent Technologies) using SYPRO
<sup>®</sup>
orange (492/610 nm) (Invitrogen) as a fluorescent probe to detect exposed hydrophobic regions of the proteins. Samples were filtered through 0.2 μm membranes (Millipore) and quantified at 280 nm based on the theoretical molar extinction coefficient. Analysis of the proteins’ thermal denaturation profiles was performed using a 96-well PCR plate (Agilent Technologies). The samples were heated from 25 °C to 95 °C at 1 °C/min and fluorescence measurements were taken. Thermal melting curves were processed as in the protocol described by Niesen and co-workers
<xref ref-type="bibr" rid="b32">32</xref>
, and the melting temperature was obtained using GraphPad Prism software (
<ext-link ext-link-type="uri" xlink:href="http://www.graphpad.com">www.graphpad.com</ext-link>
). For a comparison of the galectin-4, galectin-4N and galectin-4C denaturation profiles, we initiated a 20 μl reaction containing 10 μM protein in 25 mM HEPES pH 7.2, 75 mM NaCl, 7 mM β-mercaptoethanol and 5X SYPRO
<sup>®</sup>
orange. In the same conditions, the behaviour of galectin-4 and its domains was assessed using the Solubility and Stability Screen (Hampton Research). Evaluation of the proteins’ behaviour in the presence of lactose was performed using serial dilution from a parent solution of 409.6 mM lactose. The behaviour of galectin-4 at different pHs and ionic strengths was assessed using the Solubility and Stability Screen 2
<sup>TM</sup>
(Hampton Research). Here, we initiated a 20 μl reaction containing 2.8 μM protein in 2.5 mM HEPES pH 7.2, 7.5 mM NaCl, 0.7 mM β-mercaptoethanol and 5X SYPRO
<sup>®</sup>
orange.</p>
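<p>The idea of extracting a melting temperature and a thermal shift from a thermofluor curve can be sketched as follows. This is a minimal illustration on synthetic data and not the procedure used in this work: the curves are generated from a Boltzmann sigmoid with made-up parameters, and the midpoint is taken as the temperature of the steepest fluorescence increase, a simpler stand-in for the curve fitting performed in GraphPad Prism. All function names and numerical values are hypothetical; only the 25 to 95 °C range is taken from the text.</p>

```python
import math

def boltzmann(temp, bottom, top, tm, slope):
    """Two-state Boltzmann sigmoid, a common description of a thermofluor curve."""
    return bottom + (top - bottom) / (1.0 + math.exp((tm - temp) / slope))

def estimate_tm(temps, fluorescence):
    """Return the temperature where dF/dT (central difference) is largest."""
    best_temp, best_slope = None, float("-inf")
    for i in range(1, len(temps) - 1):
        slope = (fluorescence[i + 1] - fluorescence[i - 1]) / (temps[i + 1] - temps[i - 1])
        if slope > best_slope:
            best_temp, best_slope = temps[i], slope
    return best_temp

# Synthetic curves sampled every 1 degree C from 25 to 95 degrees C.
temps = [25.0 + i for i in range(71)]
reference = [boltzmann(t, 100.0, 1000.0, 55.0, 2.0) for t in temps]    # assumed Tm 55
with_ligand = [boltzmann(t, 100.0, 1000.0, 58.0, 2.0) for t in temps]  # assumed Tm 58

tm_ref = estimate_tm(temps, reference)
tm_lig = estimate_tm(temps, with_ligand)
delta_tm = tm_lig - tm_ref  # thermal shift relative to the reference condition
print(tm_ref, tm_lig, delta_tm)
```

<p>On real data the derivative is noisy, so a smoothing step or a sigmoid fit would normally be preferred; the derivative approach is shown only because it needs no fitting library.</p>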
</sec>
<sec disp-level="2">
<title>Protein crystallisation, data collection and structural analysis</title>
<p>The galectin-4N and galectin-4C domains were crystallised as previously described
<xref ref-type="bibr" rid="b23">23</xref>
<xref ref-type="bibr" rid="b24">24</xref>
. Cryogenic X-ray diffraction data for galectin-4N and galectin-4C were collected at the Diamond Light Source (beamline I04-1) and the SSRL/SLAC National Accelerator Laboratory (beamline BL12-2), respectively. The data were indexed with MOSFLM
<xref ref-type="bibr" rid="b33">33</xref>
and reduction was performed with Scala
<xref ref-type="bibr" rid="b34">34</xref>
and Aimless
<xref ref-type="bibr" rid="b35">35</xref>
in the CCP4 suite
<xref ref-type="bibr" rid="b36">36</xref>
. The structure of galectin-4N was determined to 1.48 Å resolution using the previous solution
<xref ref-type="bibr" rid="b23">23</xref>
as a search model in Phaser
<xref ref-type="bibr" rid="b37">37</xref>
, implemented in the PHENIX suite
<xref ref-type="bibr" rid="b38">38</xref>
. The galectin-4C structure was determined to 1.78 Å resolution as described
<xref ref-type="bibr" rid="b24">24</xref>
. Model building and refinement were performed with Coot
<xref ref-type="bibr" rid="b39">39</xref>
and phenix.refine
<xref ref-type="bibr" rid="b38">38</xref>
. The quality of the final models was validated by MolProbity
<xref ref-type="bibr" rid="b40">40</xref>
, where Ramachandran statistics indicate that 98.1% of residues lie in the favoured regions with no outliers for both galectin-4N and galectin-4C final models. Figures were prepared with PyMOL
<xref ref-type="bibr" rid="b41">41</xref>
. Diffraction data and refinement statistics are shown in
<xref ref-type="table" rid="t1">Table 1</xref>
. Structures were analysed with Coot
<xref ref-type="bibr" rid="b39">39</xref>
, PyMOL
<xref ref-type="bibr" rid="b41">41</xref>
and PDBsum
<xref ref-type="bibr" rid="b42">42</xref>
.</p>
</sec>
<sec disp-level="2">
<title>Modelling of linker-peptide and full-length galectin-4 construction</title>
<p>A sequence of 33 amino acid residues (from 153 to 185,
<underline>QPLRPQGPPMMPPYPGPGHCHQQLNS</underline>
LPTMEGP, in which the underlined region corresponds to the linker-peptide) from galectin-4 was submitted to the ROBETTA server
<xref ref-type="bibr" rid="b43">43</xref>
for
<italic>ab initio</italic>
structure prediction. Geometry idealization was performed for all resulting models using the
<italic>phenix.geometry_minimization</italic>
program
<xref ref-type="bibr" rid="b38">38</xref>
and results were evaluated based on model quality with the MolProbity server. Crystallographic structures of galectin-4N and galectin-4C together with the top two linker-peptide models were used to build six different structures for galectin-4 using MODELLER v9.14
<xref ref-type="bibr" rid="b44">44</xref>
. Two optimization steps were implemented in the model-generating script: the Variable Target Function Method (VTFM) and molecular dynamics (MD) simulations. Conjugate gradient and simulated annealing were implemented between the VTFM and MD routines. The resultant full-length models were also submitted to geometry idealization and analysed with the MolProbity server. As with the linker-peptide, the structures were compared and the best model was used for preliminary molecular dynamics simulations.</p>
</sec>
<sec disp-level="2">
<title>Molecular dynamics simulations</title>
<p>Molecular dynamics simulations were carried out using the GROMACS package
<xref ref-type="bibr" rid="b45">45</xref>
along with the AMBER99sb-ILDN force field parameters
<xref ref-type="bibr" rid="b46">46</xref>
. The temperature and pressure were set to 310 K and 1 atm, and controlled by the Nosé-Hoover
<xref ref-type="bibr" rid="b47">47</xref>
and Parrinello-Rahman
<xref ref-type="bibr" rid="b48">48</xref>
algorithms, respectively. The electrostatic interactions of each atom were treated with the Particle Mesh Ewald scheme and, like the non-bonded interactions (described by the Lennard-Jones potential), were limited to a cut-off radius of 1.0 nm. All water-bonded interactions were constrained by the SETTLE algorithm
<xref ref-type="bibr" rid="b49">49</xref>
, whereas LINCS
<xref ref-type="bibr" rid="b50">50</xref>
was used to constrain the bonded interactions of the protein. The integration time step of the leap-frog algorithm was set to 2 fs.</p>
</sec>
<sec disp-level="2">
<title>Galectin-4 starting MD model</title>
<p>The homology model was enclosed and centred in a dodecahedron box within a distance of 1.2 nm from the faces, and the system was explicitly solvated with the TIP3P water model
<xref ref-type="bibr" rid="b51">51</xref>
. The pH of each system was set indirectly to neutral according to the corresponding ionization states of the amino acid side chains of the protein
<xref ref-type="bibr" rid="b52">52</xref>
. Therefore, the addition of counter ions Na
<sup>+</sup>
and Cl
<sup>−</sup>
was controlled to neutralize the protein charges and reach an ionic strength of 150 mM. In order to remove spurious molecular contacts, a steepest descent energy minimization was carried out, levelling the total potential energy of the system to a value smaller than 2000 kJ.mol
<sup>−1</sup>
.nm
<sup>−1</sup>
. Then a position restraint with a force constant of 1000 kJ.mol
<sup>−1</sup>
.nm
<sup>−2</sup>
was applied to the
<italic>xyz</italic>
coordinates of the backbone atoms for 2 ns in order to adjust the solvation layer on the surface of the protein. Afterwards, we produced a 30 ns trajectory, which allowed us to thermalize the system as well as adapt the protein structure to an aqueous environment. From the resulting trajectory, we performed principal component analysis using a covariance matrix and obtained the set of eigenvectors in order to sample its conformational space. We then selected the first and second projections, and fed the values to generate a trajectory on the average structure. The potential energy of the resulting model was minimized using the method of steepest descent.</p>
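<p>The number of ions needed to neutralize the protein and reach a target salt concentration follows from simple arithmetic, sketched here under stated assumptions. The box volume and net protein charge in the example are hypothetical placeholders, not values from this work, and the whole box volume is used in place of the solvent volume, which slightly overestimates the ion count.</p>

```python
AVOGADRO = 6.02214076e23   # particles per mole
LITRES_PER_NM3 = 1.0e-24   # one cubic nanometre expressed in litres

def ion_counts(box_volume_nm3, salt_molar, protein_charge):
    """Return (n_Na, n_Cl) for a given salt concentration plus neutralization."""
    # Salt pairs needed to reach the target concentration in the box volume.
    pairs = round(salt_molar * box_volume_nm3 * LITRES_PER_NM3 * AVOGADRO)
    n_na, n_cl = pairs, pairs
    # Extra counter-ions that cancel the net charge of the protein.
    if protein_charge > 0:
        n_cl += protein_charge
    else:
        n_na += -protein_charge
    return n_na, n_cl

# Hypothetical example: 1000 nm^3 box, 150 mM NaCl, protein net charge of -3.
n_na, n_cl = ion_counts(1000.0, 0.150, -3)
print(n_na, n_cl)
```

<p>At 150 mM the conversion works out to roughly 0.09 ion pairs per cubic nanometre, so the hypothetical box above receives 90 pairs plus three extra sodium ions.</p>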
</sec>
<sec disp-level="2">
<title>Galectin-4 molecular dynamics: equilibrium and production</title>
<p>The final galectin-4 model from MD energy minimization was submitted to four 100 ns trajectories in the absence and presence of the lactose ligand (β-D-galactopyranosyl-D-glucose), using different seeds. The starting complex model was built by three-dimensional superimposition of each CRD from galectin-4 with the CRDs from galectin-8 (PDB ID 3VKL). The side chains of residues from the binding site of galectin-4 were positioned as in galectin-8, complexed with lactose. Next, lactose was transferred into the binding site of galectin-4. The ligand was built and parameterized with the Glycam
<xref ref-type="bibr" rid="b53">53</xref>
server
<xref ref-type="bibr" rid="b54">54</xref>
. We performed the solvation, energy minimization and restriction steps in the same way as described above for the protein model. The resulting structure and topology files were converted to the GROMACS notation with
<italic>acpype</italic>
<xref ref-type="bibr" rid="b55">55</xref>
and the runs were analysed by GROMACS tools, Bio3D
<xref ref-type="bibr" rid="b56">56</xref>
, VMD
<xref ref-type="bibr" rid="b57">57</xref>
and PyMOL
<xref ref-type="bibr" rid="b41">41</xref>
. Secondary structure was assessed with the PROMOTIF program
<xref ref-type="bibr" rid="b58">58</xref>
implemented in PDBsum analysis
<xref ref-type="bibr" rid="b59">59</xref>
.</p>
</sec>
<sec disp-level="2">
<title>X-ray Scattering of full-length galectin-4</title>
<p>X-ray scattering measurements were performed at the G1 Station of the Cornell High Energy Synchrotron Source (CHESS) using 11.75 keV X-rays with a flux of 10
<sup>11</sup>
photons per second at a beam size of 250 × 480 μm
<sup>2</sup>
. Small-angle and wide-angle X-ray scattering (SAXS/WAXS) images were collected simultaneously on two photon-counting detectors (Pilatus 100K) at sample-to-detector distances of 1.47 m and 0.42 m respectively. The SAXS detector covered a
<italic>q</italic>
-range of 0.014 to 0.336 Å
<sup>−1</sup>
, and the WAXS detector covered a
<italic>q-</italic>
range of 0.338 to 0.960 Å
<sup>−1</sup>
, where
<italic>q</italic>
is the momentum transfer, defined as
<italic>q</italic>
 = (4π/λ)sin(2θ/2), where λ is the X-ray wavelength and 2θ is the scattering angle. Samples were passed continuously through an
<italic>in vacuo</italic>
X-ray sample cell
<xref ref-type="bibr" rid="b60">60</xref>
via an in-line size exclusion column (GE Superdex 200 5/15GL) operated by a room-temperature GE Äkta Purifier using a flow rate of 0.075 ml min
<sup>−1</sup>
. The column was pre-equilibrated with the running buffer, consisting of 50 mM HEPES pH 7.2, 140 mM NaCl, and 9 mM DTT (−lactose), or the same buffer with 30 mM lactose added (+lactose). Protein samples were injected into a 50 μL loop at a concentration of 22.6 mg ml
<sup>−1</sup>
(+lactose) and 20 mg ml
<sup>−1</sup>
(−lactose). Approximately 500 eight-second exposures were collected per sample. Images were integrated and normalized by the incident X-ray intensity as measured by an N
<sub>2</sub>
-filled ion chamber located after the beam-defining slits. Data were processed and analysed following established protocols
<xref ref-type="bibr" rid="b61">61</xref>
using the ATSAS suite of programs
<xref ref-type="bibr" rid="b62">62</xref>
and custom code written in MATLAB. Predicted SAXS profiles were calculated using CRYSOL
<xref ref-type="bibr" rid="b63">63</xref>
with a maximum order of harmonics of 35 and a Fibonacci grid of order 18. The SAXS and WAXS regions were merged prior to pair distance distribution analysis in GNOM
<xref ref-type="bibr" rid="b64">64</xref>
.
<italic>Ab initio</italic>
shape reconstructions were performed in GASBOR
<xref ref-type="bibr" rid="b65">65</xref>
. Ten models were generated with 323 dummy residues and subsequently aligned and averaged in DAMAVER
<xref ref-type="bibr" rid="b66">66</xref>
. The final, most probable model had a normalized spatial discrepancy (NSD) of 1.07 with a standard deviation of 0.03.</p>
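<p>The relation between X-ray energy, scattering angle and momentum transfer used above can be illustrated with a short calculation. This is a sketch only: the helper names are hypothetical, and the conversion constant of 12.39842 keV·Å for the product of the Planck constant and the speed of light is a standard value rather than one quoted in this work. The energy, sample-to-detector distance and upper SAXS limit are taken from the text.</p>

```python
import math

HC_KEV_ANGSTROM = 12.39842  # Planck constant times speed of light, keV * Angstrom

def wavelength_angstrom(energy_kev):
    """X-ray wavelength in Angstrom for a photon energy in keV."""
    return HC_KEV_ANGSTROM / energy_kev

def momentum_transfer(two_theta_rad, wavelength):
    """q = (4*pi/lambda) * sin(2theta / 2), with 2theta the scattering angle."""
    return (4.0 * math.pi / wavelength) * math.sin(two_theta_rad / 2.0)

def scattering_angle(q, wavelength):
    """Inverse relation: scattering angle 2theta (radians) for a given q."""
    return 2.0 * math.asin(q * wavelength / (4.0 * math.pi))

def detector_radius_mm(q, wavelength, distance_mm):
    """Radial position on a flat detector normal to the beam."""
    return distance_mm * math.tan(scattering_angle(q, wavelength))

wavelength = wavelength_angstrom(11.75)                           # 11.75 keV X-rays
radius_saxs_edge = detector_radius_mm(0.336, wavelength, 1470.0)  # q = 0.336 1/A at 1.47 m
print(round(wavelength, 4), round(radius_saxs_edge, 1))
```

<p>The calculation assumes a flat detector perpendicular to the direct beam; an offset or tilted detector would need the corresponding geometric correction.</p>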
</sec>
</sec>
<sec disp-level="1">
<title>Additional Information</title>
<p>
<bold>Accession codes:</bold>
Atomic coordinates and structure factors have been deposited in the Protein Data Bank under accession codes 4XZP (galectin-4N) and 5CBL (galectin-4C).</p>
<p>
<bold>How to cite this article</bold>
: Rustiguel, J. K.
<italic>et al</italic>
. Full-length model of the human galectin-4 and insights into dynamics of inter-domain communication.
<italic>Sci. Rep.</italic>
<bold>6</bold>
, 33633; doi: 10.1038/srep33633 (2016).</p>
</sec>
<sec sec-type="supplementary-material" id="S1">
<title>Supplementary Material</title>
<supplementary-material id="d33e24" content-type="local-data">
<caption>
<title>Supplementary Information</title>
</caption>
<media xlink:href="srep33633-s1.pdf"></media>
</supplementary-material>
</sec>
</body>
<back>
<ack>
<p>The authors are grateful to Dr. Humberto D’Muniz Pereira for crystallographic data collection of galectin-4N, Dr. Patricia R. Feliciano and Ricardo P. de Pádua for galectin-4 sample preparation for SAXS and Drs. Richard Gillilan and Alvin Acerbo for assistance with the SEC-SAXS setup. We thank Diamond Light Source and Stanford Synchrotron Radiation Lightsource, SLAC National Accelerator Laboratory for time on beamlines I04-1 and BL12-2, respectively. SLAC National Accelerator Laboratory is supported by the US Department of Energy, Office of Science, Office of Basic Energy Sciences under contract number DE-AC02-76SF00515. Molecular dynamics simulations were performed on the Data Analysis and Visualization Cyberinfrastructure (NSF OCI-0959097) and IBM Bluegene/Q (Rice University and University of São Paulo cooperation). CHESS is supported by the NSF & NIH/NIGMS via NSF award DMR-1332208, and the MacCHESS resource is supported by NIGMS award GM-103485. This work was supported by a grant to J.K.R. (2010/16153-2) and M.C.N. (2011/21811-1) from Fundação de Amparo à Pesquisa do Estado de São Paulo, to M.D.B. from Núcleo de Apoio à Pesquisa em Doenças Inflamatórias (NAPDIN, 11.1.21625.01.0) and a National Institutes of Health grant (GM100008) to N.A.</p>
</ack>
<ref-list>
<ref id="b1">
<mixed-citation publication-type="journal">
<name>
<surname>Barondes</surname>
<given-names>S. H.</given-names>
</name>
,
<name>
<surname>Cooper</surname>
<given-names>D. N.</given-names>
</name>
,
<name>
<surname>Gitt</surname>
<given-names>M. A.</given-names>
</name>
&
<name>
<surname>Leffler</surname>
<given-names>H.</given-names>
</name>
<article-title>Galectins. Structure and function of a large family of animal lectins</article-title>
.
<source>J Biol Chem</source>
<volume>269</volume>
,
<fpage>20807</fpage>
<lpage>20810</lpage>
(
<year>1994</year>
).
<pub-id pub-id-type="pmid">8063692</pub-id>
</mixed-citation>
</ref>
<ref id="b2">
<mixed-citation publication-type="journal">
<name>
<surname>Hughes</surname>
<given-names>R. C.</given-names>
</name>
<article-title>Secretion of the galectin family of mammalian carbohydrate-binding proteins</article-title>
.
<source>Biochim Biophys Acta</source>
<volume>1473</volume>
,
<fpage>172</fpage>
<lpage>185</lpage>
(
<year>1999</year>
).
<pub-id pub-id-type="pmid">10580137</pub-id>
</mixed-citation>
</ref>
<ref id="b3">
<mixed-citation publication-type="journal">
<name>
<surname>Leffler</surname>
<given-names>H.</given-names>
</name>
,
<name>
<surname>Carlsson</surname>
<given-names>S.</given-names>
</name>
,
<name>
<surname>Hedlund</surname>
<given-names>M.</given-names>
</name>
,
<name>
<surname>Qian</surname>
<given-names>Y.</given-names>
</name>
&
<name>
<surname>Poirier</surname>
<given-names>F.</given-names>
</name>
<article-title>Introduction to galectins</article-title>
.
<source>Glycoconj J</source>
<volume>19</volume>
,
<fpage>433</fpage>
<lpage>440</lpage>
(
<year>2004</year>
).
<pub-id pub-id-type="pmid">14758066</pub-id>
</mixed-citation>
</ref>
<ref id="b4">
<mixed-citation publication-type="journal">
<name>
<surname>Compagno</surname>
<given-names>D.</given-names>
</name>
<etal></etal>
.
<article-title>Galectins: major signaling modulators inside and outside the cell</article-title>
.
<source>Curr Mol Med</source>
<volume>14</volume>
,
<fpage>630</fpage>
<lpage>651</lpage>
(
<year>2014</year>
).
<pub-id pub-id-type="pmid">24894174</pub-id>
</mixed-citation>
</ref>
<ref id="b5">
<mixed-citation publication-type="journal">
<name>
<surname>Ebrahim</surname>
<given-names>A. H.</given-names>
</name>
<etal></etal>
.
<article-title>Galectins in cancer: carcinogenesis, diagnosis and therapy</article-title>
.
<source>Ann Transl Med</source>
<volume>2</volume>
,
<fpage>88</fpage>
(
<year>2014</year>
).
<pub-id pub-id-type="pmid">25405163</pub-id>
</mixed-citation>
</ref>
<ref id="b6">
<mixed-citation publication-type="journal">
<name>
<surname>Hirabayashi</surname>
<given-names>J.</given-names>
</name>
&
<name>
<surname>Kasai</surname>
<given-names>K.</given-names>
</name>
<article-title>The family of metazoan metal-independent beta-galactoside-binding lectins: structure, function and molecular evolution</article-title>
.
<source>Glycobiology</source>
<volume>3</volume>
,
<fpage>297</fpage>
<lpage>304</lpage>
(
<year>1993</year>
).
<pub-id pub-id-type="pmid">8400545</pub-id>
</mixed-citation>
</ref>
<ref id="b7">
<mixed-citation publication-type="journal">
<name>
<surname>López-Lucendo</surname>
<given-names>M. F.</given-names>
</name>
<etal></etal>
.
<article-title>Growth-regulatory human galectin-1: crystallographic characterisation of the structural changes induced by single-site mutations and their impact on the thermodynamics of ligand binding</article-title>
.
<source>J Mol Biol</source>
<volume>343</volume>
,
<fpage>957</fpage>
<lpage>970</lpage>
(
<year>2004</year>
).
<pub-id pub-id-type="pmid">15476813</pub-id>
</mixed-citation>
</ref>
<ref id="b8">
<mixed-citation publication-type="journal">
<name>
<surname>Kashio</surname>
<given-names>Y.</given-names>
</name>
<etal></etal>
.
<article-title>Galectin-9 induces apoptosis through the calcium-calpain-caspase-1 pathway</article-title>
.
<source>J Immunol</source>
<volume>170</volume>
,
<fpage>3631</fpage>
<lpage>3636</lpage>
(
<year>2003</year>
).
<pub-id pub-id-type="pmid">12646627</pub-id>
</mixed-citation>
</ref>
<ref id="b9">
<mixed-citation publication-type="journal">
<name>
<surname>Bi</surname>
<given-names>S.</given-names>
</name>
,
<name>
<surname>Earl</surname>
<given-names>L. A.</given-names>
</name>
,
<name>
<surname>Jacobs</surname>
<given-names>L.</given-names>
</name>
&
<name>
<surname>Baum</surname>
<given-names>L. G.</given-names>
</name>
<article-title>Structural features of galectin-9 and galectin-1 that determine distinct T cell death pathways</article-title>
.
<source>J Biol Chem</source>
<volume>283</volume>
,
<fpage>12248</fpage>
<lpage>12258</lpage>
(
<year>2008</year>
).
<pub-id pub-id-type="pmid">18258591</pub-id>
</mixed-citation>
</ref>
<ref id="b10">
<mixed-citation publication-type="journal">
<name>
<surname>Levy</surname>
<given-names>Y.</given-names>
</name>
<etal></etal>
.
<article-title>It depends on the hinge: a structure-functional analysis of galectin-8, a tandem-repeat type lectin</article-title>
.
<source>Glycobiology</source>
<volume>16</volume>
,
<fpage>463</fpage>
<lpage>476</lpage>
(
<year>2006</year>
).
<pub-id pub-id-type="pmid">16501058</pub-id>
</mixed-citation>
</ref>
<ref id="b11">
<mixed-citation publication-type="journal">
<name>
<surname>André</surname>
<given-names>S.</given-names>
</name>
,
<name>
<surname>Wang</surname>
<given-names>G. N.</given-names>
</name>
,
<name>
<surname>Gabius</surname>
<given-names>H. J.</given-names>
</name>
&
<name>
<surname>Murphy</surname>
<given-names>P. V.</given-names>
</name>
<article-title>Combining glycocluster synthesis with protein engineering: an approach to probe into the significance of linker length in a tandem-repeat-type lectin (galectin-4)</article-title>
.
<source>Carbohydr Res</source>
<volume>389</volume>
,
<fpage>25</fpage>
<lpage>38</lpage>
(
<year>2014</year>
).
<pub-id pub-id-type="pmid">24698724</pub-id>
</mixed-citation>
</ref>
<ref id="b12">
<mixed-citation publication-type="journal">
<name>
<surname>Earl</surname>
<given-names>L. A.</given-names>
</name>
,
<name>
<surname>Bi</surname>
<given-names>S.</given-names>
</name>
&
<name>
<surname>Baum</surname>
<given-names>L. G.</given-names>
</name>
<article-title>Galectin multimerization and lattice formation are regulated by linker region structure</article-title>
.
<source>Glycobiology</source>
<volume>21</volume>
,
<fpage>6</fpage>
<lpage>12</lpage>
(
<year>2011</year>
).
<pub-id pub-id-type="pmid">20864568</pub-id>
</mixed-citation>
</ref>
<ref id="b13">
<mixed-citation publication-type="journal">
<name>
<surname>Troncoso</surname>
<given-names>M. F.</given-names>
</name>
,
<name>
<surname>Elola</surname>
<given-names>M. T.</given-names>
</name>
,
<name>
<surname>Croci</surname>
<given-names>D. O.</given-names>
</name>
&
<name>
<surname>Rabinovich</surname>
<given-names>G. A.</given-names>
</name>
<article-title>Integrating structure and function of ‘tandem-repeat’ galectins</article-title>
.
<source>Front Biosci (Schol Ed)</source>
<volume>4</volume>
,
<fpage>864</fpage>
<lpage>887</lpage>
(
<year>2012</year>
).
<pub-id pub-id-type="pmid">22202096</pub-id>
</mixed-citation>
</ref>
<ref id="b14">
<mixed-citation publication-type="journal">
<name>
<surname>Kim</surname>
<given-names>S. W.</given-names>
</name>
<etal></etal>
.
<article-title>Abrogation of galectin-4 expression promotes tumorigenesis in colorectal cancer</article-title>
.
<source>Cell Oncol (Dordr)</source>
<volume>36</volume>
,
<fpage>169</fpage>
<lpage>178</lpage>
(
<year>2013</year>
).
<pub-id pub-id-type="pmid">23378274</pub-id>
</mixed-citation>
</ref>
<ref id="b15">
<mixed-citation publication-type="journal">
<name>
<surname>Belo</surname>
<given-names>A. I.</given-names>
</name>
,
<name>
<surname>van der Sar</surname>
<given-names>A. M.</given-names>
</name>
,
<name>
<surname>Tefsen</surname>
<given-names>B.</given-names>
</name>
&
<name>
<surname>van Die</surname>
<given-names>I.</given-names>
</name>
<article-title>Galectin-4 reduces migration and metastasis formation of pancreatic cancer cells</article-title>
.
<source>PLoS One</source>
<volume>8</volume>
,
<fpage>e65957</fpage>
(
<year>2013</year>
).
<pub-id pub-id-type="pmid">23824659</pub-id>
</mixed-citation>
</ref>
<ref id="b16">
<mixed-citation publication-type="journal">
<name>
<surname>Satelli</surname>
<given-names>A.</given-names>
</name>
,
<name>
<surname>Rao</surname>
<given-names>P. S.</given-names>
</name>
,
<name>
<surname>Thirumala</surname>
<given-names>S.</given-names>
</name>
&
<name>
<surname>Rao</surname>
<given-names>U. S.</given-names>
</name>
<article-title>Galectin-4 functions as a tumor suppressor of human colorectal cancer</article-title>
.
<source>Int J Cancer</source>
<volume>129</volume>
,
<fpage>799</fpage>
<lpage>809</lpage>
(
<year>2011</year>
).
<pub-id pub-id-type="pmid">21064109</pub-id>
</mixed-citation>
</ref>
<ref id="b17">
<mixed-citation publication-type="journal">
<name>
<surname>Hayashi</surname>
<given-names>T.</given-names>
</name>
<etal></etal>
.
<article-title>Galectin-4, a novel predictor for lymph node metastasis in lung adenocarcinoma</article-title>
.
<source>PLoS One</source>
<volume>8</volume>
,
<fpage>e81883</fpage>
(
<year>2013</year>
).
<pub-id pub-id-type="pmid">24339976</pub-id>
</mixed-citation>
</ref>
<ref id="b18">
<mixed-citation publication-type="journal">
<name>
<surname>Kondoh</surname>
<given-names>N.</given-names>
</name>
<etal></etal>
.
<article-title>Identification and characterization of genes associated with human hepatocellular carcinogenesis</article-title>
.
<source>Cancer Res</source>
<volume>59</volume>
,
<fpage>4990</fpage>
<lpage>4996</lpage>
(
<year>1999</year>
).
<pub-id pub-id-type="pmid">10519413</pub-id>
</mixed-citation>
</ref>
<ref id="b19">
<mixed-citation publication-type="journal">
<name>
<surname>Huflejt</surname>
<given-names>M. E.</given-names>
</name>
,
<name>
<surname>Jordan</surname>
<given-names>E. T.</given-names>
</name>
,
<name>
<surname>Gitt</surname>
<given-names>M. A.</given-names>
</name>
,
<name>
<surname>Barondes</surname>
<given-names>S. H.</given-names>
</name>
&
<name>
<surname>Leffler</surname>
<given-names>H.</given-names>
</name>
<article-title>Strikingly different localization of galectin-3 and galectin-4 in human colon adenocarcinoma T84 cells. Galectin-4 is localized at sites of cell adhesion</article-title>
.
<source>J Biol Chem</source>
<volume>272</volume>
,
<fpage>14294</fpage>
<lpage>14303</lpage>
(
<year>1997</year>
).
<pub-id pub-id-type="pmid">9162064</pub-id>
</mixed-citation>
</ref>
<ref id="b20">
<mixed-citation publication-type="journal">
<name>
<surname>Matulis</surname>
<given-names>D.</given-names>
</name>
,
<name>
<surname>Kranz</surname>
<given-names>J. K.</given-names>
</name>
,
<name>
<surname>Salemme</surname>
<given-names>F. R.</given-names>
</name>
&
<name>
<surname>Todd</surname>
<given-names>M. J.</given-names>
</name>
<article-title>Thermodynamic stability of carbonic anhydrase: measurements of binding affinity and stoichiometry using ThermoFluor</article-title>
.
<source>Biochemistry</source>
<volume>44</volume>
,
<fpage>5258</fpage>
<lpage>5266</lpage>
(
<year>2005</year>
).
<pub-id pub-id-type="pmid">15794662</pub-id>
</mixed-citation>
</ref>
<ref id="b21">
<mixed-citation publication-type="journal">
<name>
<surname>Bum-Erdene</surname>
<given-names>K.</given-names>
</name>
,
<name>
<surname>Leffler</surname>
<given-names>H.</given-names>
</name>
,
<name>
<surname>Nilsson</surname>
<given-names>U. J.</given-names>
</name>
&
<name>
<surname>Blanchard</surname>
<given-names>H.</given-names>
</name>
<article-title>Structural characterization of human galectin-4 C-terminal domain: elucidating the molecular basis for recognition of glycosphingolipids, sulfated saccharides and blood group antigens</article-title>
.
<source>FEBS J</source>
<volume>282</volume>
,
<fpage>3348</fpage>
<lpage>3367</lpage>
(
<year>2015</year>
).
<pub-id pub-id-type="pmid">26077389</pub-id>
</mixed-citation>
</ref>
<ref id="b22">
<mixed-citation publication-type="journal">
<name>
<surname>Bum-Erdene</surname>
<given-names>K.</given-names>
</name>
,
<name>
<surname>Leffler</surname>
<given-names>H.</given-names>
</name>
,
<name>
<surname>Nilsson</surname>
<given-names>U. J.</given-names>
</name>
&
<name>
<surname>Blanchard</surname>
<given-names>H.</given-names>
</name>
<article-title>Structural characterisation of human galectin-4 N-terminal carbohydrate recognition domain in complex with glycerol, lactose, 3′-sulfo-lactose, and 2′-fucosyllactose</article-title>
.
<source>Sci Rep</source>
<volume>6</volume>
,
<fpage>20289</fpage>
(
<year>2016</year>
).
<pub-id pub-id-type="pmid">26828567</pub-id>
</mixed-citation>
</ref>
<ref id="b23">
<mixed-citation publication-type="journal">
<name>
<surname>Zimbardi</surname>
<given-names>A. L.</given-names>
</name>
,
<name>
<surname>Pinheiro</surname>
<given-names>M. P.</given-names>
</name>
,
<name>
<surname>Dias-Baruffi</surname>
<given-names>M.</given-names>
</name>
&
<name>
<surname>Nonato</surname>
<given-names>M. C.</given-names>
</name>
<article-title>Cloning, expression, purification, crystallization and preliminary X-ray diffraction analysis of the N-terminal carbohydrate-recognition domain of human galectin-4</article-title>
.
<source>Acta Crystallogr Sect F Struct Biol Cryst Commun</source>
<volume>66</volume>
,
<fpage>542</fpage>
<lpage>545</lpage>
(
<year>2010</year>
).</mixed-citation>
</ref>
<ref id="b24">
<mixed-citation publication-type="journal">
<name>
<surname>Rustiguel</surname>
<given-names>J. K.</given-names>
</name>
,
<name>
<surname>Kumagai</surname>
<given-names>P. S.</given-names>
</name>
,
<name>
<surname>Dias-Baruffi</surname>
<given-names>M.</given-names>
</name>
,
<name>
<surname>Costa-Filho</surname>
<given-names>A. J.</given-names>
</name>
&
<name>
<surname>Nonato</surname>
<given-names>M. C.</given-names>
</name>
<article-title>Recombinant expression, purification and preliminary biophysical and structural studies of C-terminal carbohydrate recognition domain from human galectin-4</article-title>
.
<source>Protein Expr Purif</source>
<volume>118</volume>
,
<fpage>39</fpage>
<lpage>48</lpage>
(
<year>2016</year>
).
<pub-id pub-id-type="pmid">26432949</pub-id>
</mixed-citation>
</ref>
<ref id="b25">
<mixed-citation publication-type="journal">
<name>
<surname>Ideo</surname>
<given-names>H.</given-names>
</name>
,
<name>
<surname>Seko</surname>
<given-names>A.</given-names>
</name>
&
<name>
<surname>Yamashita</surname>
<given-names>K.</given-names>
</name>
<article-title>Recognition mechanism of galectin-4 for cholesterol 3-sulfate</article-title>
.
<source>J Biol Chem</source>
<volume>282</volume>
,
<fpage>21081</fpage>
<lpage>21089</lpage>
(
<year>2007</year>
).
<pub-id pub-id-type="pmid">17545668</pub-id>
</mixed-citation>
</ref>
<ref id="b26">
<mixed-citation publication-type="journal">
<name>
<surname>Fagherazzi</surname>
<given-names>G.</given-names>
</name>
<article-title>Small angle X-ray scattering</article-title>
edited by
<name>
<surname>Glatter</surname>
<given-names>O.</given-names>
</name>
&
<name>
<surname>Kratky</surname>
<given-names>O.</given-names>
</name>
.
<source>Acta Crystallographica Section A</source>
<volume>39</volume>
,
<fpage>500</fpage>
(
<year>1983</year>
).</mixed-citation>
</ref>
<ref id="b27">
<mixed-citation publication-type="journal">
<name>
<surname>Di Lella</surname>
<given-names>S.</given-names>
</name>
<etal></etal>
.
<article-title>When galectins recognize glycans: from biochemistry to physiology and back again</article-title>
.
<source>Biochemistry</source>
<volume>50</volume>
,
<fpage>7842</fpage>
<lpage>7857</lpage>
(
<year>2011</year>
).
<pub-id pub-id-type="pmid">21848324</pub-id>
</mixed-citation>
</ref>
<ref id="b28">
<mixed-citation publication-type="journal">
<name>
<surname>Rabinovich</surname>
<given-names>G. A.</given-names>
</name>
,
<name>
<surname>Toscano</surname>
<given-names>M. A.</given-names>
</name>
,
<name>
<surname>Jackson</surname>
<given-names>S. S.</given-names>
</name>
&
<name>
<surname>Vasta</surname>
<given-names>G. R.</given-names>
</name>
<article-title>Functions of cell surface galectin-glycoprotein lattices</article-title>
.
<source>Curr Opin Struct Biol</source>
<volume>17</volume>
,
<fpage>513</fpage>
<lpage>520</lpage>
(
<year>2007</year>
).
<pub-id pub-id-type="pmid">17950594</pub-id>
</mixed-citation>
</ref>
<ref id="b29">
<mixed-citation publication-type="journal">
<name>
<surname>Yoshida</surname>
<given-names>H.</given-names>
</name>
<etal></etal>
.
<article-title>X-ray structure of a protease-resistant mutant form of human galectin-8 with two carbohydrate recognition domains</article-title>
.
<source>FEBS J</source>
<volume>279</volume>
,
<fpage>3937</fpage>
<lpage>3951</lpage>
(
<year>2012</year>
).
<pub-id pub-id-type="pmid">22913484</pub-id>
</mixed-citation>
</ref>
<ref id="b30">
<mixed-citation publication-type="journal">
<name>
<surname>Kato</surname>
<given-names>Y.</given-names>
</name>
<etal></etal>
.
<article-title>Acidic extracellular microenvironment and cancer</article-title>
.
<source>Cancer Cell Int</source>
<volume>13</volume>
,
<fpage>89</fpage>
(
<year>2013</year>
).
<pub-id pub-id-type="pmid">24004445</pub-id>
</mixed-citation>
</ref>
<ref id="b31">
<mixed-citation publication-type="journal">
<name>
<surname>van Weelden</surname>
<given-names>S.</given-names>
</name>
,
<name>
<surname>van Hellemond</surname>
<given-names>J.</given-names>
</name>
,
<name>
<surname>Opperdoes</surname>
<given-names>F.</given-names>
</name>
&
<name>
<surname>Tielens</surname>
<given-names>A.</given-names>
</name>
<article-title>New functions for parts of the Krebs cycle in procyclic
<italic>Trypanosoma brucei</italic>
, a cycle not operating as a cycle</article-title>
.
<source>Journal of Biological Chemistry</source>
<volume>280</volume>
,
<fpage>12451</fpage>
<lpage>12460</lpage>
(
<year>2005</year>
).
<pub-id pub-id-type="pmid">15647263</pub-id>
</mixed-citation>
</ref>
<ref id="b32">
<mixed-citation publication-type="journal">
<name>
<surname>Niesen</surname>
<given-names>F. H.</given-names>
</name>
,
<name>
<surname>Berglund</surname>
<given-names>H.</given-names>
</name>
&
<name>
<surname>Vedadi</surname>
<given-names>M.</given-names>
</name>
<article-title>The use of differential scanning fluorimetry to detect ligand interactions that promote protein stability</article-title>
.
<source>Nat Protoc</source>
<volume>2</volume>
,
<fpage>2212</fpage>
<lpage>2221</lpage>
(
<year>2007</year>
).
<pub-id pub-id-type="pmid">17853878</pub-id>
</mixed-citation>
</ref>
<ref id="b33">
<mixed-citation publication-type="journal">
<name>
<surname>Battye</surname>
<given-names>T. G.</given-names>
</name>
,
<name>
<surname>Kontogiannis</surname>
<given-names>L.</given-names>
</name>
,
<name>
<surname>Johnson</surname>
<given-names>O.</given-names>
</name>
,
<name>
<surname>Powell</surname>
<given-names>H. R.</given-names>
</name>
&
<name>
<surname>Leslie</surname>
<given-names>A. G.</given-names>
</name>
<article-title>iMOSFLM: a new graphical interface for diffraction-image processing with MOSFLM</article-title>
.
<source>Acta Crystallogr D Biol Crystallogr</source>
<volume>67</volume>
,
<fpage>271</fpage>
<lpage>281</lpage>
(
<year>2011</year>
).
<pub-id pub-id-type="pmid">21460445</pub-id>
</mixed-citation>
</ref>
<ref id="b34">
<mixed-citation publication-type="journal">
<name>
<surname>Evans</surname>
<given-names>P.</given-names>
</name>
<article-title>Scaling and assessment of data quality</article-title>
.
<source>Acta Crystallogr D Biol Crystallogr</source>
<volume>62</volume>
,
<fpage>72</fpage>
<lpage>82</lpage>
(
<year>2006</year>
).
<pub-id pub-id-type="pmid">16369096</pub-id>
</mixed-citation>
</ref>
<ref id="b35">
<mixed-citation publication-type="journal">
<name>
<surname>Evans</surname>
<given-names>P. R.</given-names>
</name>
&
<name>
<surname>Murshudov</surname>
<given-names>G. N.</given-names>
</name>
<article-title>How good are my data and what is the resolution?</article-title>
<source>Acta Crystallogr D Biol Crystallogr</source>
<volume>69</volume>
,
<fpage>1204</fpage>
<lpage>1214</lpage>
(
<year>2013</year>
).
<pub-id pub-id-type="pmid">23793146</pub-id>
</mixed-citation>
</ref>
<ref id="b36">
<mixed-citation publication-type="journal">
<name>
<surname>Winn</surname>
<given-names>M. D.</given-names>
</name>
<etal></etal>
.
<article-title>Overview of the CCP4 suite and current developments</article-title>
.
<source>Acta Crystallogr D Biol Crystallogr</source>
<volume>67</volume>
,
<fpage>235</fpage>
<lpage>242</lpage>
(
<year>2011</year>
).
<pub-id pub-id-type="pmid">21460441</pub-id>
</mixed-citation>
</ref>
<ref id="b37">
<mixed-citation publication-type="journal">
<name>
<surname>McCoy</surname>
<given-names>A. J.</given-names>
</name>
<etal></etal>
.
<article-title>Phaser crystallographic software</article-title>
.
<source>J Appl Crystallogr</source>
<volume>40</volume>
,
<fpage>658</fpage>
<lpage>674</lpage>
(
<year>2007</year>
).
<pub-id pub-id-type="pmid">19461840</pub-id>
</mixed-citation>
</ref>
<ref id="b38">
<mixed-citation publication-type="journal">
<name>
<surname>Adams</surname>
<given-names>P. D.</given-names>
</name>
<etal></etal>
.
<article-title>PHENIX: a comprehensive Python-based system for macromolecular structure solution</article-title>
.
<source>Acta Crystallogr D Biol Crystallogr</source>
<volume>66</volume>
,
<fpage>213</fpage>
<lpage>221</lpage>
(
<year>2010</year>
).
<pub-id pub-id-type="pmid">20124702</pub-id>
</mixed-citation>
</ref>
<ref id="b39">
<mixed-citation publication-type="journal">
<name>
<surname>Emsley</surname>
<given-names>P.</given-names>
</name>
,
<name>
<surname>Lohkamp</surname>
<given-names>B.</given-names>
</name>
,
<name>
<surname>Scott</surname>
<given-names>W. G.</given-names>
</name>
&
<name>
<surname>Cowtan</surname>
<given-names>K.</given-names>
</name>
<article-title>Features and development of Coot</article-title>
.
<source>Acta Crystallogr D Biol Crystallogr</source>
<volume>66</volume>
,
<fpage>486</fpage>
<lpage>501</lpage>
(
<year>2010</year>
).
<pub-id pub-id-type="pmid">20383002</pub-id>
</mixed-citation>
</ref>
<ref id="b40">
<mixed-citation publication-type="journal">
<name>
<surname>Chen</surname>
<given-names>V. B.</given-names>
</name>
<etal></etal>
.
<article-title>MolProbity: all-atom structure validation for macromolecular crystallography</article-title>
.
<source>Acta Crystallogr D Biol Crystallogr</source>
<volume>66</volume>
,
<fpage>12</fpage>
<lpage>21</lpage>
(
<year>2010</year>
).
<pub-id pub-id-type="pmid">20057044</pub-id>
</mixed-citation>
</ref>
<ref id="b41">
<mixed-citation publication-type="journal">
<name>
<surname>DeLano</surname>
<given-names>W. L.</given-names>
</name>
<article-title>Use of PYMOL as a communications tool for molecular science</article-title>
.
<source>Abstracts of Papers of the American Chemical Society</source>
<volume>228</volume>
,
<fpage>U313</fpage>
<lpage>U314</lpage>
(
<year>2004</year>
).</mixed-citation>
</ref>
<ref id="b42">
<mixed-citation publication-type="journal">
<name>
<surname>Laskowski</surname>
<given-names>R. A.</given-names>
</name>
<etal></etal>
.
<article-title>PDBsum: a Web-based database of summaries and analyses of all PDB structures</article-title>
.
<source>Trends Biochem Sci</source>
<volume>22</volume>
,
<fpage>488</fpage>
<lpage>490</lpage>
(
<year>1997</year>
).
<pub-id pub-id-type="pmid">9433130</pub-id>
</mixed-citation>
</ref>
<ref id="b43">
<mixed-citation publication-type="journal">
<name>
<surname>Kim</surname>
<given-names>D. E.</given-names>
</name>
,
<name>
<surname>Chivian</surname>
<given-names>D.</given-names>
</name>
&
<name>
<surname>Baker</surname>
<given-names>D.</given-names>
</name>
<article-title>Protein structure prediction and analysis using the Robetta server</article-title>
.
<source>Nucleic Acids Res</source>
<volume>32</volume>
,
<fpage>W526</fpage>
<lpage>531</lpage>
(
<year>2004</year>
).
<pub-id pub-id-type="pmid">15215442</pub-id>
</mixed-citation>
</ref>
<ref id="b44">
<mixed-citation publication-type="journal">
<name>
<surname>Sali</surname>
<given-names>A.</given-names>
</name>
&
<name>
<surname>Blundell</surname>
<given-names>T. L.</given-names>
</name>
<article-title>Comparative protein modelling by satisfaction of spatial restraints</article-title>
.
<source>J Mol Biol</source>
<volume>234</volume>
,
<fpage>779</fpage>
<lpage>815</lpage>
(
<year>1993</year>
).
<pub-id pub-id-type="pmid">8254673</pub-id>
</mixed-citation>
</ref>
<ref id="b45">
<mixed-citation publication-type="journal">
<name>
<surname>Pronk</surname>
<given-names>S.</given-names>
</name>
<etal></etal>
.
<article-title>GROMACS 4.5: a high-throughput and highly parallel open source molecular simulation toolkit</article-title>
.
<source>Bioinformatics</source>
<volume>29</volume>
,
<fpage>845</fpage>
<lpage>854</lpage>
(
<year>2013</year>
).
<pub-id pub-id-type="pmid">23407358</pub-id>
</mixed-citation>
</ref>
<ref id="b46">
<mixed-citation publication-type="journal">
<name>
<surname>Lindorff-Larsen</surname>
<given-names>K.</given-names>
</name>
<etal></etal>
.
<article-title>Improved side-chain torsion potentials for the Amber ff99SB protein force field</article-title>
.
<source>Proteins</source>
<volume>78</volume>
,
<fpage>1950</fpage>
<lpage>1958</lpage>
(
<year>2010</year>
).
<pub-id pub-id-type="pmid">20408171</pub-id>
</mixed-citation>
</ref>
<ref id="b47">
<mixed-citation publication-type="journal">
<name>
<surname>Hoover</surname>
<given-names>W. G.</given-names>
</name>
<article-title>Canonical dynamics: Equilibrium phase-space distributions</article-title>
.
<source>Phys Rev A Gen Phys</source>
<volume>31</volume>
,
<fpage>1695</fpage>
<lpage>1697</lpage>
(
<year>1985</year>
).
<pub-id pub-id-type="pmid">9895674</pub-id>
</mixed-citation>
</ref>
<ref id="b48">
<mixed-citation publication-type="journal">
<name>
<surname>Parrinello</surname>
<given-names>M.</given-names>
</name>
&
<name>
<surname>Rahman</surname>
<given-names>A.</given-names>
</name>
<article-title>Polymorphic transitions in single crystals: A new molecular dynamics method</article-title>
.
<source>Journal of Applied Physics</source>
<volume>52</volume>
(
<year>1981</year>
).</mixed-citation>
</ref>
<ref id="b49">
<mixed-citation publication-type="journal">
<name>
<surname>Miyamoto</surname>
<given-names>S.</given-names>
</name>
&
<name>
<surname>Kollman</surname>
<given-names>P. A.</given-names>
</name>
<article-title>Settle: An analytical version of the SHAKE and RATTLE algorithm for rigid water models</article-title>
.
<source>Journal of Computational Chemistry</source>
<volume>13</volume>
,
<fpage>952</fpage>
<lpage>962</lpage>
(
<year>1992</year>
).</mixed-citation>
</ref>
<ref id="b50">
<mixed-citation publication-type="journal">
<name>
<surname>Hess</surname>
<given-names>B.</given-names>
</name>
<article-title>P-LINCS: A parallel linear constraint solver for molecular simulation</article-title>
.
<source>J Chem Theory Comput</source>
<volume>4</volume>
,
<fpage>116</fpage>
<lpage>122</lpage>
(
<year>2008</year>
).
<pub-id pub-id-type="pmid">26619985</pub-id>
</mixed-citation>
</ref>
<ref id="b51">
<mixed-citation publication-type="journal">
<name>
<surname>Jorgensen</surname>
<given-names>W. L.</given-names>
</name>
,
<name>
<surname>Chandrasekhar</surname>
<given-names>J.</given-names>
</name>
,
<name>
<surname>Madura</surname>
<given-names>J. D.</given-names>
</name>
,
<name>
<surname>Impey</surname>
<given-names>R. W.</given-names>
</name>
&
<name>
<surname>Klein</surname>
<given-names>M. L.</given-names>
</name>
<article-title>Comparison of simple potential functions for simulating liquid water</article-title>
.
<source>Journal of Chemical Physics</source>
<volume>79</volume>
,
<fpage>926</fpage>
<lpage>935</lpage>
(
<year>1983</year>
).</mixed-citation>
</ref>
<ref id="b52">
<mixed-citation publication-type="journal">
<name>
<surname>Thurlkill</surname>
<given-names>R. L.</given-names>
</name>
,
<name>
<surname>Grimsley</surname>
<given-names>G. R.</given-names>
</name>
,
<name>
<surname>Scholtz</surname>
<given-names>J. M.</given-names>
</name>
&
<name>
<surname>Pace</surname>
<given-names>C. N.</given-names>
</name>
<article-title>pK values of the ionizable groups of proteins</article-title>
.
<source>Protein Sci</source>
<volume>15</volume>
,
<fpage>1214</fpage>
<lpage>1218</lpage>
(
<year>2006</year>
).
<pub-id pub-id-type="pmid">16597822</pub-id>
</mixed-citation>
</ref>
<ref id="b53">
<mixed-citation publication-type="journal">
<name>
<surname>Kirschner</surname>
<given-names>K. N.</given-names>
</name>
,
<name>
<surname>Lins</surname>
<given-names>R. D.</given-names>
</name>
,
<name>
<surname>Maass</surname>
<given-names>A.</given-names>
</name>
&
<name>
<surname>Soares</surname>
<given-names>T. A.</given-names>
</name>
<article-title>A glycam-based force field for simulations of lipopolysaccharide membranes: parametrization and validation</article-title>
.
<source>J Chem Theory Comput</source>
<volume>8</volume>
,
<fpage>4719</fpage>
<lpage>4731</lpage>
(
<year>2012</year>
).
<pub-id pub-id-type="pmid">26605626</pub-id>
</mixed-citation>
</ref>
<ref id="b54">
<mixed-citation publication-type="other">Woods Group.
<italic>GLYCAM Web</italic>
,
<ext-link ext-link-type="uri" xlink:href="http://glycam.org/">http://glycam.org/</ext-link>
(2005–2015).</mixed-citation>
</ref>
<ref id="b55">
<mixed-citation publication-type="journal">
<name>
<surname>Sousa da Silva</surname>
<given-names>A. W.</given-names>
</name>
&
<name>
<surname>Vranken</surname>
<given-names>W. F.</given-names>
</name>
<article-title>ACPYPE - AnteChamber PYthon Parser interfacE</article-title>
.
<source>BMC Res Notes</source>
<volume>5</volume>
,
<fpage>367</fpage>
(
<year>2012</year>
).
<pub-id pub-id-type="pmid">22824207</pub-id>
</mixed-citation>
</ref>
<ref id="b56">
<mixed-citation publication-type="journal">
<name>
<surname>Grant</surname>
<given-names>B. J.</given-names>
</name>
,
<name>
<surname>Rodrigues</surname>
<given-names>A. P.</given-names>
</name>
,
<name>
<surname>ElSawy</surname>
<given-names>K. M.</given-names>
</name>
,
<name>
<surname>McCammon</surname>
<given-names>J. A.</given-names>
</name>
&
<name>
<surname>Caves</surname>
<given-names>L. S.</given-names>
</name>
<article-title>Bio3d: an R package for the comparative analysis of protein structures</article-title>
.
<source>Bioinformatics</source>
<volume>22</volume>
,
<fpage>2695</fpage>
<lpage>2696</lpage>
(
<year>2006</year>
).
<pub-id pub-id-type="pmid">16940322</pub-id>
</mixed-citation>
</ref>
<ref id="b57">
<mixed-citation publication-type="journal">
<name>
<surname>Humphrey</surname>
<given-names>W.</given-names>
</name>
,
<name>
<surname>Dalke</surname>
<given-names>A.</given-names>
</name>
&
<name>
<surname>Schulten</surname>
<given-names>K.</given-names>
</name>
<article-title>VMD: visual molecular dynamics</article-title>
.
<source>J Mol Graph</source>
<volume>14</volume>
,
<fpage>33</fpage>
<lpage>38</lpage>
, 27–28 (
<year>1996</year>
).
<pub-id pub-id-type="pmid">8744570</pub-id>
</mixed-citation>
</ref>
<ref id="b58">
<mixed-citation publication-type="journal">
<name>
<surname>Hutchinson</surname>
<given-names>E. G.</given-names>
</name>
&
<name>
<surname>Thornton</surname>
<given-names>J. M.</given-names>
</name>
<article-title>PROMOTIF–a program to identify and analyze structural motifs in proteins</article-title>
.
<source>Protein Sci</source>
<volume>5</volume>
,
<fpage>212</fpage>
<lpage>220</lpage>
(
<year>1996</year>
).
<pub-id pub-id-type="pmid">8745398</pub-id>
</mixed-citation>
</ref>
<ref id="b59">
<mixed-citation publication-type="journal">
<name>
<surname>de Beer</surname>
<given-names>T. A.</given-names>
</name>
,
<name>
<surname>Berka</surname>
<given-names>K.</given-names>
</name>
,
<name>
<surname>Thornton</surname>
<given-names>J. M.</given-names>
</name>
&
<name>
<surname>Laskowski</surname>
<given-names>R. A.</given-names>
</name>
<article-title>PDBsum additions</article-title>
.
<source>Nucleic Acids Res</source>
<volume>42</volume>
,
<fpage>D292</fpage>
<lpage>296</lpage>
(
<year>2014</year>
).
<pub-id pub-id-type="pmid">24153109</pub-id>
</mixed-citation>
</ref>
<ref id="b60">
<mixed-citation publication-type="journal">
<name>
<surname>Nielsen</surname>
<given-names>S. S.</given-names>
</name>
,
<name>
<surname>Møller</surname>
<given-names>M.</given-names>
</name>
&
<name>
<surname>Gillilan</surname>
<given-names>R. E.</given-names>
</name>
<article-title>High-throughput biological small-angle X-ray scattering with a robotically loaded capillary cell</article-title>
.
<source>J Appl Crystallogr</source>
<volume>45</volume>
,
<fpage>213</fpage>
<lpage>223</lpage>
(
<year>2012</year>
).
<pub-id pub-id-type="pmid">22509071</pub-id>
</mixed-citation>
</ref>
<ref id="b61">
<mixed-citation publication-type="journal">
<name>
<surname>Skou</surname>
<given-names>S.</given-names>
</name>
,
<name>
<surname>Gillilan</surname>
<given-names>R. E.</given-names>
</name>
&
<name>
<surname>Ando</surname>
<given-names>N.</given-names>
</name>
<article-title>Synchrotron-based small-angle X-ray scattering of proteins in solution</article-title>
.
<source>Nat Protoc</source>
<volume>9</volume>
,
<fpage>1727</fpage>
<lpage>1739</lpage>
(
<year>2014</year>
).
<pub-id pub-id-type="pmid">24967622</pub-id>
</mixed-citation>
</ref>
<ref id="b62">
<mixed-citation publication-type="journal">
<name>
<surname>Petoukhov</surname>
<given-names>M. V.</given-names>
</name>
<etal></etal>
.
<article-title>New developments in the ATSAS program package for small-angle scattering data analysis</article-title>
.
<source>J Appl Crystallogr</source>
<volume>45</volume>
,
<fpage>342</fpage>
<lpage>350</lpage>
(
<year>2012</year>
).
<pub-id pub-id-type="pmid">25484842</pub-id>
</mixed-citation>
</ref>
<ref id="b63">
<mixed-citation publication-type="journal">
<name>
<surname>Svergun</surname>
<given-names>D.</given-names>
</name>
,
<name>
<surname>Barberato</surname>
<given-names>C.</given-names>
</name>
&
<name>
<surname>Koch</surname>
<given-names>M.</given-names>
</name>
<article-title>CRYSOL - A program to evaluate x-ray solution scattering of biological macromolecules from atomic coordinates</article-title>
.
<source>Journal of Applied Crystallography</source>
<volume>28</volume>
,
<fpage>768</fpage>
<lpage>773</lpage>
(
<year>1995</year>
).</mixed-citation>
</ref>
<ref id="b64">
<mixed-citation publication-type="journal">
<name>
<surname>Svergun</surname>
<given-names>D.</given-names>
</name>
<article-title>Determination of the regularization parameter in indirect-transform methods using perceptual criteria</article-title>
.
<source>Journal of Applied Crystallography</source>
<volume>25</volume>
,
<fpage>495</fpage>
<lpage>503</lpage>
(
<year>1992</year>
).</mixed-citation>
</ref>
<ref id="b65">
<mixed-citation publication-type="journal">
<name>
<surname>Svergun</surname>
<given-names>D. I.</given-names>
</name>
,
<name>
<surname>Petoukhov</surname>
<given-names>M. V.</given-names>
</name>
&
<name>
<surname>Koch</surname>
<given-names>M. H.</given-names>
</name>
<article-title>Determination of domain structure of proteins from X-ray solution scattering</article-title>
.
<source>Biophys J</source>
<volume>80</volume>
,
<fpage>2946</fpage>
<lpage>2953</lpage>
(
<year>2001</year>
).
<pub-id pub-id-type="pmid">11371467</pub-id>
</mixed-citation>
</ref>
<ref id="b66">
<mixed-citation publication-type="journal">
<name>
<surname>Volkov</surname>
<given-names>V. V.</given-names>
</name>
&
<name>
<surname>Svergun</surname>
<given-names>D. I.</given-names>
</name>
<article-title>Uniqueness of
<italic>ab initio</italic>
shape determination in small-angle scattering</article-title>
.
<source>Journal of Applied Crystallography</source>
<volume>36</volume>
,
<fpage>860</fpage>
<lpage>864</lpage>
(
<year>2003</year>
).</mixed-citation>
</ref>
</ref-list>
<fn-group>
<fn>
<p>
<bold>Author Contributions</bold>
M.C.N. and M.D.B. conceived the project. J.K.R. carried out the protein production, crystallography, molecular modelling, thermal stability assays and molecular dynamics simulation analysis under supervision of M.C.N.; R.O.S.S. carried out the molecular dynamics simulations and analysis under supervision of M.C.N.; K.L.M., S.P.M. and K.M.D. carried out the SAXS experiments and data analysis under supervision of N.A.; J.K.R. and M.C.N. wrote the paper with input from all authors. All authors discussed the results and implications and commented on the manuscript at all stages.</p>
</fn>
</fn-group>
</back>
<floats-group>
<fig id="f1">
<label>Figure 1</label>
<caption>
<title>Thermofluor assays.</title>
<p>(
<bold>a</bold>
) Normalized thermal denaturation curves for galectin-4, galectin-4N and galectin-4C. Measured apparent unfolding temperatures were 55.92 ± 0.05 °C for galectin-4, 56.8 ± 0.1 °C for galectin-4N and 68.12 ± 0.05 °C for galectin-4C. (
<bold>b</bold>
) Evaluation of the thermal shift profile for galectin-4, galectin-4N and galectin-4C across different categories of additives. Bars show all additives that produce interpretable transitions with a positive and/or negative thermal shift for the three proteins. Compounds and their respective thermal shift values are listed in
<xref ref-type="supplementary-material" rid="S1">Supplementary Table S1</xref>
. (
<bold>c</bold>
) Thermal shift profile as a function of lactose concentration.</p>
</caption>
<graphic xlink:href="srep33633-f1"></graphic>
</fig>
<fig id="f2">
<label>Figure 2</label>
<caption>
<title>Crystal structures of galectin-4N and galectin-4C.</title>
<p>(
<bold>a</bold>
) Overall β-sandwich fold of galectin-4N (blue) and galectin-4C (pink) structures. The antiparallel β-sheets are shown in blue (F0-F5) and cyan (S1-S6a/b) for galectin-4N, and pink (F0′-F5′) and light pink (S1′-S6a′) for galectin-4C. (
<bold>b</bold>
) Electrostatic potential surface for both the galectin-4N and galectin-4C structures. Front view (β-sheet S1-S6/S1′-S6′) and back view (β-sheet F0-F5/F0′-F5′). The circle marks the canonical binding site. (
<bold>c</bold>
) Canonical (pink) and extended (yellow) binding sites of galectin-4 domains. The main residues involved in binding interactions are represented as sticks. (
<bold>d</bold>
) Sequence alignment of galectin-4N and galectin-4C showing secondary structure elements. Conserved residues are marked in bold. Residues of the canonical carbohydrate-binding site are highlighted in pink; the star marks the only conservative substitution among the binding-site residues of the two domains. Extended binding site residues are highlighted in yellow.</p>
</caption>
<graphic xlink:href="srep33633-f2"></graphic>
</fig>
<fig id="f3">
<label>Figure 3</label>
<caption>
<title>Model of full-length galectin-4.</title>
<p>(
<bold>a</bold>
) Cartoon representation of the initial model for the full-length protein. (
<bold>b</bold>
) Overall fold of galectin-4 model after equilibrium dynamics and geometry optimization. (
<bold>c</bold>
) Representation of inter-domain interactions mediated by hydrogen bonds.</p>
</caption>
<graphic xlink:href="srep33633-f3"></graphic>
</fig>
<fig id="f4">
<label>Figure 4</label>
<caption>
<title>Solution conformation of full-length galectin-4 examined by X-ray scattering.</title>
<p>(
<bold>a</bold>
) The experimental scattering of galectin-4 in the absence of ligand (gray) is well fit by the theoretical scattering of the full-length model in
<xref ref-type="fig" rid="f3">Fig. 3b</xref>
(solid line), confirming that the two CRDs associate in solution. In contrast, a comparison of the experimental scattering to the theoretical scattering of the model found in
<xref ref-type="fig" rid="f3">Fig. 3a</xref>
in which the CRDs are non-associating (dotted), shows a poor fit. (
<bold>b</bold>
) An
<italic>ab initio</italic>
shape reconstruction generated from ligand-free galectin-4 scattering data also shows good agreement with the full-length model. (
<bold>c</bold>
) Addition of lactose leads to a subtle expansion in the width of the pair-distance distribution function,
<italic>P</italic>
(
<italic>r</italic>
), and a slight increase in radius of gyration.</p>
</caption>
<graphic xlink:href="srep33633-f4"></graphic>
</fig>
<fig id="f5">
<label>Figure 5</label>
<caption>
<title>RMSD plots for molecular dynamics simulation with (+lactose) and without (−lactose) lactose, 150 ns trajectories.</title>
<p>RMSD per domain (
<bold>a</bold>
) (−lactose) and (
<bold>b</bold>
) (+lactose). (
<bold>c</bold>
) RMSF box chart for MD simulations without and with lactose and cartoon putty representation of mobility through the trajectory (inset); the blue-white-magenta scale represents the calculated B-factor from 0 to 250 Å
<sup>2</sup>
. Porcupine plot of the first eigenvector generated through principal component analysis of the representative structure with lactose in (
<bold>d</bold>
) front view and (
<bold>e</bold>
) bottom view. The vectors, represented as blue arrows, show the tendency of movement. Plots of atomic correlations from MD without lactose (
<bold>f</bold>
) and with lactose (
<bold>g</bold>
). Correlated movements are shown in pink and anticorrelated movements in blue (see scale bar). The bars indicate the portion of the graph relating to each domain: white for galectin-4N, light gray for the linker and dark gray for galectin-4C.</p>
</caption>
<graphic xlink:href="srep33633-f5"></graphic>
</fig>
<table-wrap position="float" id="t1">
<label>Table 1</label>
<caption>
<title>Data collection and refinement statistics.</title>
</caption>
<table frame="hsides" rules="groups" border="1">
<colgroup>
<col align="left"></col>
<col align="center"></col>
</colgroup>
<thead valign="bottom">
<tr>
<th align="left" valign="top" charoff="50"> </th>
<th align="center" valign="top" charoff="50">galectin-4N</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td colspan="2" align="left" valign="top" charoff="50">
<bold>Data collection</bold>
</td>
</tr>
<tr>
<td align="left" valign="top" charoff="50"> Space group</td>
<td align="center" valign="top" charoff="50">P6
<sub>1</sub>
22</td>
</tr>
<tr>
<td colspan="2" align="left" valign="top" charoff="50">Cell dimensions</td>
</tr>
<tr>
<td align="left" valign="top" charoff="50">
<italic>a, b, c</italic>
(Å)</td>
<td align="center" valign="top" charoff="50">72.55, 72.55, 110.30</td>
</tr>
<tr>
<td align="left" valign="top" charoff="50">
<italic></italic>
α, β, γ (°)</td>
<td align="center" valign="top" charoff="50">90, 90, 120</td>
</tr>
<tr>
<td align="left" valign="top" charoff="50">
<italic></italic>
Resolution (Å)</td>
<td align="center" valign="top" char="(" charoff="50">31.73–1.48(1.56–1.48)</td>
</tr>
<tr>
<td align="left" valign="top" charoff="50">
<italic> R</italic>
<sub>
<italic>sym</italic>
</sub>
</td>
<td align="center" valign="top" char="(" charoff="50">0.056(0.543)</td>
</tr>
<tr>
<td align="left" valign="top" charoff="50">
<italic>I</italic>/σ<italic>I</italic>
</td>
<td align="center" valign="top" char="(" charoff="50">23.5(4.8)</td>
</tr>
<tr>
<td align="left" valign="top" charoff="50">
<italic></italic>
Completeness (%)</td>
<td align="center" valign="top" char="(" charoff="50">100.0(100.0)</td>
</tr>
<tr>
<td align="left" valign="top" charoff="50">
<italic></italic>
Redundancy</td>
<td align="center" valign="top" char="(" charoff="50">11.9(12.3)</td>
</tr>
<tr>
<td align="left" valign="top" charoff="50">
<italic></italic>
No. total reflections</td>
<td align="center" valign="top" char="(" charoff="50">350,098(51,170)</td>
</tr>
<tr>
<td align="left" valign="top" charoff="50">
<italic></italic>
No. unique reflections</td>
<td align="center" valign="top" char="(" charoff="50">29,321(4,172)</td>
</tr>
<tr>
<td colspan="2" align="left" valign="top" charoff="50">Refinement</td>
</tr>
<tr>
<td align="left" valign="top" charoff="50">
<italic></italic>
Resolution (Å)</td>
<td align="center" valign="top" charoff="50">1.48</td>
</tr>
<tr>
<td align="left" valign="top" charoff="50">
<italic> R</italic>
<sub>
<italic>work</italic>
</sub>
/
<italic>R</italic>
<sub>
<italic>free</italic>
</sub>
</td>
<td align="center" valign="top" charoff="50">15.0/18.4</td>
</tr>
<tr>
<td colspan="2" align="left" valign="top" charoff="50">No. atoms</td>
</tr>
<tr>
<td align="left" valign="top" charoff="50">
<italic></italic>
Protein</td>
<td align="center" valign="top" charoff="50">1231</td>
</tr>
<tr>
<td align="left" valign="top" charoff="50">
<italic></italic>
Ligand/ion</td>
<td align="center" valign="top" charoff="50">1</td>
</tr>
<tr>
<td align="left" valign="top" charoff="50">
<italic></italic>
Water</td>
<td align="center" valign="top" charoff="50">174</td>
</tr>
<tr>
<td colspan="2" align="left" valign="top" charoff="50">
<italic>B-</italic>
factors</td>
</tr>
<tr>
<td align="left" valign="top" charoff="50">
<italic></italic>
Protein</td>
<td align="center" valign="top" charoff="50">22.90</td>
</tr>
<tr>
<td align="left" valign="top" charoff="50">
<italic></italic>
Ligand/ion</td>
<td align="center" valign="top" charoff="50">12.50</td>
</tr>
<tr>
<td align="left" valign="top" charoff="50">
<italic></italic>
Water</td>
<td align="center" valign="top" charoff="50">35.0</td>
</tr>
<tr>
<td colspan="2" align="left" valign="top" charoff="50">r.m.s. deviations</td>
</tr>
<tr>
<td align="left" valign="top" charoff="50">
<italic></italic>
Bond lengths (Å)</td>
<td align="center" valign="top" charoff="50">0.006</td>
</tr>
<tr>
<td align="left" valign="top" charoff="50">
<italic></italic>
Bond angles (°)</td>
<td align="center" valign="top" charoff="50">1.11</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="t1-fn1">
<p>Values in parentheses are for highest-resolution shell. Each dataset was collected from a single crystal.</p>
</fn>
</table-wrap-foot>
</table-wrap>
</floats-group>
</pmc>
</record>

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/CyberinfraV1/Data/Pmc/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000147 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Pmc/Corpus/biblio.hfd -nk 000147 | SxmlIndent | more

To add a link to this page within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    CyberinfraV1
   |flux=    Pmc
   |étape=   Corpus
   |type=    RBID
   |clé=     PMC:5027518
   |texte=   Full-length model of the human galectin-4 and insights into dynamics of inter-domain communication
}}

To generate wiki pages

HfdIndexSelect -h $EXPLOR_AREA/Data/Pmc/Corpus/RBID.i   -Sk "pubmed:27642006" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Pmc/Corpus/biblio.hfd   \
       | NlmPubMed2Wicri -a CyberinfraV1 

Wicri

This area was generated with Dilib version V0.6.25.
Data generation: Thu Oct 27 09:30:58 2016. Site generation: Sun Mar 10 23:08:40 2024