Glycobiology Advance Access originally published online on October 31, 2008
Glycobiology 2009 19(2):112-117; doi:10.1093/glycob/cwn121
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Communication |
Structural analysis of the recognition mechanism of poly-N-acetyllactosamine by the human galectin-9 N-terminal carbohydrate recognition domain
3 Structural Biology Research Center, Photon Factory, Institute of Materials Structure Science, High Energy Accelerator Research Organization (KEK), Tsukuba, Ibaraki 305-0801
4 Department of Endocrinology, Faculty of Medicine, Kagawa University, Kagawa 761-0793
5 Department of Applied Biological Chemistry, Faculty of Agriculture, Shizuoka University, Shizuoka 422-8529, Japan
1 To whom correspondence should be addressed: Tel: +81-29-879-6177; Fax: +81-29-879-6179; e-mail: ryuichi.kato{at}kek.jp
Received on April 24, 2008; revised on October 10, 2008; accepted on October 27, 2008
| Abstract |
|---|
|
|
|---|
Galectins are a family of β-galactoside-specific lectins bearing a conserved carbohydrate recognition domain. Interactions between galectins and poly-N-acetyllactosamine sequences are critical in a variety of biological processes. Galectin-9, a member of the galectin family, has two carbohydrate recognition domains at both the N- and C-terminal regions. Here we report the crystal structure of the human galectin-9 N-terminal carbohydrate recognition domain in complex with N-acetyllactosamine dimers and trimers. These complex structures revealed that the galectin-9 N-terminal carbohydrate recognition domain can recognize internal N-acetyllactosamine units within poly-N-acetyllactosamine chains. Based on these complex structures, we propose two putative recognition modes for poly-N-acetyllactosamine binding by galectins.
Key words: carbohydrate recognition / galectin / poly-N-acetyllactosamine / X-ray structure
| Introduction |
|---|
|
|
|---|
Galectins, widely distributed animal lectins, share a conserved carbohydrate recognition domain (CRD) of approximately 130 amino acids; this sequence is tightly folded into a sandwich structure of 5- to 6-stranded β-sheets (Barondes et al. 1994
The tandem-repeat-type galectin human galectin-9 was originally isolated from tumor cells in Hodgkin's disease (Tureci et al. 1997
). This protein was determined to be identical to the potent eosinophil chemoattractant ecalectin, originally cloned from a human T-cell line (Matsumoto et al. 1998
; Matsushita et al. 2000
). The chemoattractant activity of galectin-9 depends on its carbohydrate-binding activity and requires both CRDs (Matsushita et al. 2000
). Although the exact structure of the target carbohydrates recognized by human galectin-9 is unclear, biochemical analyses have revealed that both the N-terminal CRD (NCRD) and C-terminal CRD (CCRD) of human galectin-9 have a high affinity for repeating β1-3-linked N-acetyllactosamine (Galβ1-4GlcNAc; LacNAc) residues (poly-N-acetyllactosamine) (Hirabayashi et al. 2002
; Sato et al. 2002
). The dissociation constants of human galectin-9 NCRD for LacNAc dimers (LN2), trimers (LN3), and pentamers (LN5) are 3.0, 0.81, and 0.12 µM, respectively, and those of CCRD are 17, 2.7, and 0.41 µM, respectively (Hirabayashi et al. 2002
). This enhanced affinity for larger repeating LacNAc sequences was shared by both CRDs, but was more prominent for the NCRD. The particular preference for poly-N-acetyllactosamines should help elucidate the molecular mechanisms by which human galectin-9 exerts its varied biological functions.
Poly-N-acetyllactosamine chains are widely distributed in mammalian cells in multiple species (Fukuda 1985
). These motifs are preferentially bound by other galectins with high avidity, although the affinities for single LacNAc residues are relatively low (Barondes et al. 1994
; Di Virgilio et al. 1999
; Hirabayashi et al. 2002
). The molecular mechanism governing the interaction between galectins and glycoconjugates containing repeating LacNAc motifs is still unresolved. In this report, we determined the crystal structures of the human galectin-9 NCRD in complex with LN2 and LN3. We discuss the possible interactions between the galectin-9 NCRD and poly-N-acetyllactosamine residues.
| Results |
|---|
|
|
|---|
Crystal structure of galectin-9 N-terminal CRD in complex with LN2
To investigate the mechanism by which the human galectin-9 NCRD and poly-N-acetyllactosamine interact, we determined the crystal structures of the human galectin-9 NCRD in the presence of LN2 or LN3. The crystal structure of human galectin-9 NCRD in complex with LN2 (called LN2-bound crystal 1 throughout this manuscript) was refined to a 1.8 Å resolution. The crystal structure demonstrated that no obvious structural difference was found between the human galectin-9 NCRD-lactose complex structure reported recently by our group (Nagae et al. 2008
In LN2-bound crystal 1, the protein structures of the two complexes in the asymmetric unit (referred to as complexes A and B) are essentially identical with a root mean square deviation (rmsd) of 0.37 Å for all C
atoms (Figure 1A). The interaction modes with the carbohydrates, however, are quite different between the complexes. The protein molecule in complex A interacts closely with the reducing end of the LacNAc moiety (Figure 1B, left), while the protein molecule in complex B interacts tightly with the nonreducing end (Figure 1B, right). Thus, the carbohydrate molecules bound in these complexes are shifted by one complete LacNAc unit. The amino acid residues that interact with the LN2 molecules in both complexes are identical, with the exception of Asn48 and Asn137. These two asparagine residues form additional interaction sites for the ligand in complex A.
|
We obtained another crystal which also belongs to an orthorhombic space group with different cell constants (referred to as LN2-bound crystal 2). The asymmetric unit of LN2-bound crystal 2 contains four galectin-9 NCRD-LN2-bound complexes. The interaction modes of these complexes were also divided into two classes. Protein–ligand interactions for three of the molecules, conformed to that seen in complex A of the LN2-bound crystal 1 where the interaction is with the reducing end of the LN2, whilst one of the protein molecules interacts with the nonreducing end. The backbones of these four protein molecules superimpose well on each other with rmsd values within 0.30 Å. The c-axis of crystal 1 is half that of the b-axis of crystal 2 and the former has lost the screw axis (Table I). We reprocessed the crystal 2 data to interchange the b- and c-axes and confirmed that the crystal packings of the protein molecules in LN2-bound crystal 2 are almost identical with those of LN2-bound crystal 1. The main difference in the packing is derived from positional differences of the carbohydrates.
|
Crystal structure of galectin-9 N-terminal CRD in complex with LN3
Crystals of the human galectin-9 NCRD-LN3-bound complex were found to adopt two crystal forms, one of which exhibited P212121 space group symmetry (LN3-bound crystal 1) while the other belonged to the P41212 group (LN3-bound crystal 2). The cell constants of LN3-bound crystal 1 are almost identical to those of LN2-bound crystal 2 (Table I). The asymmetric unit of the LN3-bound crystal 1 also contains four complexes, designated complexes A–D. In these four complexes, the positions of the carbohydrate molecules could also be divided into two classes. The protein molecules in both complexes A and B primarily recognize the third LacNAc unit (fifth and sixth carbohydrate residues) from the nonreducing end, while complexes C and D interact with the second LacNAc unit (third and fourth carbohydrate residues) from the nonreducing end (Table II). The carbohydrate recognition mode of complex B is similar with that of complex A, with the exception of the interactions of the main chain oxygen atom of Ser54. The recognition modes of complexes C and D are identical to each other. The four protein molecules in the asymmetric unit are superimposable with rmsd values within 0.30 Å (Figure 2A).
|
|
In LN3-bound crystal 2, only one complex existed in the asymmetric unit. The protein recognizes the second LacNAc unit from the nonreducing end (Figure 2B). The carbohydrate interaction in LN3-bound crystal 2 is identical to that seen in complexes C and D of LN3-bound crystal 1.
| Discussion |
|---|
|
|
|---|
Our determination of the crystal structures of the human galectin-9 NCRD in complex with varying numbers of LacNAc repeats provides the first structural insights into the mechanisms of interaction between the galectin CRD and poly-N-acetyllactosamine. The structures of LN2- and LN3-bound complexes clearly indicate that the human galectin-9 NCRD could recognize three continuous carbohydrate residues (GlcNAcβ1-3Galβ1-4GlcNAc) irrespective of the positions of the LacNAc unit (Table II). The amino acid residues involved in β-galactoside recognition are identical among human galectin-1, -3, and galectin-9 NCRD (Leonidas et al. 1998
Although previous frontal affinity chromatography analyses revealed that the longer carbohydrates show increasing affinity for human galectin-9 NCRD (LN2, 3.0 µM of Kd value; LN3, 0.81 µM of Kd value; LN5, 0.12 µM of Kd value) (Hirabayashi et al. 2002
), no additional interaction was observed between LN2- and LN-3 complexes. In frontal affinity chromatography analyses, retardation of elution of carbohydrate ligand molecules from lectin immobilized column is measured, and an association constant is calculated from the retardation (Hirabayashi et al. 2002
). Therefore, apparent affinity may increase by the multivalent binding effect of human galectin-9 NCRD to longer carbohydrates. Or else, two CRDs may simultaneously bind to one carbohydrate molecule.
Structural comparison among other galectins gives the potential effect of the local structural differences on the various interaction modes between galectins and poly-N-acetyllactosamine. Superposition of human galectin-1 CRD (PDB code: 1GZW, Lopez-Lucendo et al. 2004
) on the LN3-bound complexes suggests that there are structural conflicts between LN3 and N-terminus of human galectin-1. Galectin-1 is thought to bind to nonreducing terminal LacNAc units rather than mid-chain LacNAc residues within a poly-N-acetyllactosamine chain (Solomon et al. 1991
; Di Virgilio et al. 1999
; Stowell et al. 2004
). Thus, these conflicts would affect the interaction with poly-N-acetyllactosamine. Both the galectin-3 and -7 CRDs (Leonidas et al. 1998
; Seetharaman et al. 1998
) display close structural similarities to the human galectin-9 NCRD. In the case of human galectin-3 (PDB code: 1A3K
[PDB]
), replacement of Asn137 in the human galectin-9 NCRD with Ser237 may affect the interaction with poly-N-acetyllacotsamine chains. When human galaectin-3 CRD is structurally superimposed to the human galectin-9 NCRD-LN2 or -LN3 complex, there are no structural conflicts between human galectin-3 and each oligosaccharide. In contrast, superposition of human galectin-7 (PDB code: 1BKZ) on human galectin-9 NCRD-LN3 complex suggests that the complexes A and B in LN3-bound crystal 1 revealed heavy steric conflicts between the LN3 molecule and the side chain of Gln42, which is located in close proximity of the carbohydrate binding site. These structural comparison explains well the previous observations that the affinities of the human galectin-3 CRD for repeating LacNAc chains increased with the increasing number of carbohydrate repeats, while human galectin-7 exhibited comparable affinities irrespective of ligand length (Hirabayashi et al. 2002
).
Galectin-1 and -2 form noncovalent homodimers in solution (Morris et al. 2004
); galectin-3 forms oligomers via interactions of their N-terminal domains in the presence of multivalent carbohydrate (Ahmad et al. 2004
). The tandem-repeat-type galectins are bivalent, possessing two CRDs. The interaction mode between human galectin-9 CCRD and poly-N-acetyllactosamine still remains unclear. To presume the relative positions of two CRDs, we generated a hypothetical model structure for the human galectin-9 NCRD in complex with LN6 based on the LN3-bound complexes (data not shown). The model structure implies that two human galectin-9 CRD molecules can bind simultaneously to a single LN6 molecule, and the distance between two CRDs is about 45 Å. The long linker peptide (61 amino acids) of human galectin-9 might connect both CRDs, which bind poly-N-acetyllactosamine chains by intra- or intermolecular manner. These differences in the quaternary structure may account for the formation of a crosslinking network of multivalent carbohydrates (Brewer 2002
). Molecular modeling of the low-energy conformers of poly-N-acetyllactosamine revealed generally extended structures (Atrih et al. 2005
), which may make galectins able to access each LacNAc unit externally. Our results suggest the possibility of a variety of interaction modes between galectins and poly-N-acetyllactosamine. These differences may be responsible for the physiological variety among the galectins and related biological systems.
| Material and methods |
|---|
|
|
|---|
Protein purification and crystallization
The recombinant human galectin-9 NCRD was expressed and purified as described previously (Nagae et al. 2008
Data collection, structure determination, and refinement
Synchrotron data were collected at beamlines BL-5A and AR-NW12A at the Photon Factory (Tsukuba, Japan). All datasets were processed and scaled using the HKL2000 program software (Otwinowski and Minor 1997
). The phase of the LN2-bound crystal 1 was determined by molecular replacement using Morlep software from the CCP4 program suite (CCP4 1994
). We used the crystal structure of the human galectin-9 NCRD–lactose complex (PDB code: 2EAK; Nagae et al. 2008
) without lactose and solvent molecules as a search model. The phases of the other carbohydrate complex structures were determined utilizing the LN2-bound crystal 1 structure as a search model. Model reconstructions were performed manually using Xfit software from XtalView (McRee 1999
). Crystallographic refinement was performed using REFMAC5 software from the CCP4 suite (CCP4 1994
). Quality of the protein models was assessed with the PROCHECK software (Laskowski et al. 1993
). Data collection and refinement statistics are summarized in Table I. Figures were created with the Molscript (Kraulis 1991
), Raster3D (Merrit and Murphy 1994
), and PyMOL (DeLano Scienctific LLC) programs.
Coordinates for the galectin-9 NCRDs have been deposited in the Protein Data Bank under the following accession numbers: 2ZHK (LN2-bound crystal 1), 2ZHL (LN2-bound crystal 2), 2ZHM (LN3-bound crystal 1), and 2ZHN (LN3-bound crystal 2).
| Funding |
|---|
|
|
|---|
The Ministry of Education, Culture, Sports, Science and Technology of Japan (17046024 for scientific research and the Protein 3000 project).
| Conflict of interest statement |
|---|
|
|
|---|
None declared.
| Acknowledgements |
|---|
We thank the staff at beamlines BL-5A and AR-NW12A at High Energy Accelerator Research Organization - Photon Factory, Tsukuba, Japan, for providing data collection facilities and support.
| Footnotes |
|---|
2 Present address: Institute for Protein Research, Osaka University, Suita, Osaka 565-0871, Japan.
| Abbreviations |
|---|
CCRD, C-terminal CRD; CRD, carbohydrate recognition domain; GlcNAc, N-acetylglucosamine; LacNAc, N-acetyllact osamine; LN2, N-acetyllactosamine dimer; LN3, N-acetyllactosamine trimer; LN6, N-acetyllactosamine hexamer; NCRD, N-terminal CRD; PEG, polyethylene glycol; rmsd, root mean square deviation
| References |
|---|
|
|
|---|
Ahmad N, Gabius HJ, Andre S, Kaltner H, Sabesan S, Roy R, Liu B, Macaluso F, Brewer CF. Galectin-3 precipitates as a pentamer with synthetic multivalent carbohydrates and forms heterogeneous cross-linked complexes. J Biol Chem. (2004) 279:10841–10847.
Atrih A, Richardson JM, Prescott AR, Ferguson MAJ. Trypanosoma brucei glycoproteins contain novel giant poly-N-acetyllactosamine carbohydrate chains. J Biol Chem. (2005) 280:865–871.
Barondes SH, Cooper DNW, Gitt MA, Leffler H. Galectins. Structure and function of a large family of animal lectins. J Biol Chem. (1994) 269:20807–20810.
Brewer CF. Binding and cross-linking properties of galectins. Biochim Biophys Acta (2002) 1572:255–262.[Medline]
CCP4 (Collaborative Computational Project 4). The CCP4 suite: Programs for protein crystallography. Acta Crystallogr. (1994) D50:760–763.[Web of Science]
Cooper DNW. Galectinomics: finding themes in complexity. Biochim Biophys Acta. (2002) 1572:198–202.[Medline]
Di Virgilio S, Glushka J, Moremen K, Pierce M. Enzymatic synthesis of natural and 13C enriched linear poly-N-acetyllactosamine as ligands for galectin-1. Glycobiology. (1999) 9:353–364.
Fukuda M. Cell surface glycoconjugates as onco-differentiation markers in hematopoietic cells. Biochim Biophys Acta. (1985) 780:119–150.[Medline]
Hirabayashi J, Hashidate T, Arata Y, Nishi N, Nakamura T, Hirashima M, Urashima T, Oka T., Futai M, Muller WEG, et al. Oligosaccharide specificity of galectins: A search by frontal affinity chromatography. Biochim Biophys Acta. (2002) 1572:232–254.[Medline]
Hirabayashi J, Kasai K. The family of metazoan metal-independent beta-galactoside-binding lectins: Structure, function and molecular evolution. Glycobiology (1993) 3:297–304.
Hiraki M, Kato R, Nagai M, Satoh T, Hirano S, Ihara K, Kudo N, Nagae M, Kobayashi M, Inoue M, et al. Development of an automated large-scale protein-crystallization and monitoring system for high-throughput protein-structure analyses. Acta Crystallogr. (2006) D62:1058–1065.
Kasai K, Hirabayashi J. Galectins: A family of animal lectins that decipler glycocodes. J Biochem. (1996) 119:1–8.
Kraulis PJ. MOLSCRIPT: A program to produce both detailed and schematic plots of protein structures. J Appl Crystallogr. (1991) 24:946–950.[CrossRef][Web of Science]
Laskowski RA, MacArthur MW, Moss DS, Thornton JM. PROCHECK: a program to check the stereochemical quality of protein structures. J Appl Cryst (1993) 26:283–291.[CrossRef][Web of Science]
Leonidas DD, Vatzaki EH, Vorum H, Celis JE, Madsen P, Acharya KR. Structural basis for the recognition of carbohydrates by human galectin-7. Biochemistry (1998) 37:13930–13940.
Lopez-Lucendo MF, Solis D, Andre S, Hirabayashi J, Kasai K, Kaltner H, Gabius H-J, Romero A. Growth-regulatory human galectin-1: Crystallographic characterization of the structural changes induced by single-site mutations and their impact on the thermodynamics of ligand binding. J Mol Biol. (2004) 343:957–970.[CrossRef][Web of Science][Medline]
Matsumoto R, Matsumoto H, Seki M, Hata M, Asano Y, Kanegasaki S, Stevens RL, Hirashima M. Human ecalectin, a variant of human galectin-9, is a novel eosinophil chemoattractant produced by T lymphocytes. J Biol Chem. (1998) 273:16976–16984.
Matsushita N, Nishi N, Seki M, Matsumoto R, Kuwabara I, Liu F-T, Hata Y, Nakamura T, Hirashima M. Requirement of divalent galactoside-binding activity of ecalectin/galectin-9 for eosinophil chemoattraction. J Biol Chem (2000) 275:8355–8360.
McRee DE. XtalView/Xfit – A versatile program for manipulating atomic coordinates and electron density. J Struct Biol (1999) 125:156–165.[CrossRef][Web of Science][Medline]
Merrit EA, Murphy MEP. Raster3D Version 2.0. A program for photorealistic molecular graphics. Acta Crystallogr (1994) D50:869–873.
Morris S, Ahmad N, Andre S, Kaltner H, Gabius HJ, Brenowitz M, Brewer F. Quaternary solution structures of galectins-1, -3, and -7. Glycobiology (2004) 14:293–300.
Murata T, Amarume S, Hattori T, Tokuyama S, Tokuyama K, Kawagishi H, Usui T. Purification and characterization of a chitinase from Amycolatopsis orientalis with N-acetyllactosamine-repeating unit releasing activity. Biochem Biophys Res Commun (2005) 336:514–520.[Medline]
Nagae M, Nakamura-Tsuruta S, Nishi N, Hirabayashi J, Wakatsuki S, Kato R. Structural analysis of the human galectin-9 N-terminal carbohydrate recognition domain reveals unexpected properties that differ from the mouse orthologue. J Mol Biol (2008) 375:119–135.[CrossRef][Web of Science][Medline]
Nagae M, Nishi N, Murata T, Usui T, Nakamura T, Wakatsuki S, Kato R. Crystal structure of the galectin-9 N-terminal carbohydrate recognition domain from Mus musculus reveals the basic mechanism of carbohydrate recognition. J Biol Chem (2006) 281:35884–35893.
Otwinowski Z, Minor W. Processing of x-ray diffraction data collected in oscillation mode. Methods Enzymol (1997) 276:307–326.[CrossRef][Web of Science]
Sato M, Nishi N, Shoji H, Seki M, Hashidate T, Hirabayashi J, Kasai K, Hata Y, Suzuki S, Hirashima M, et al. Functional analysis of the carbohydrate recognition domains and a linker peptide of galectin-9 as to eosinophil chemoattractant activity. Glycobiology (2002) 12:191–197.
Seetharaman J, Kanigsberg A, Slaaby R, Leffler H, Barondes SH, Rini JM. X-ray crystal structure of the human galectin-3 carbohydrate recognition domain at 2.1-Å resolution. J Biol Chem (1998) 273:13047–13052.
Solomon JC, Stoll MS, Penfold P, Abbott WM, Childs RA, Hanfland P, Feizi T. Studies of the binding specificity of the soluble 14,000-dalton bovine heart muscle lectin using immobilised glycolipids and neoglycolipids. Carbohydr Res (1991) 213:293–307.[CrossRef][Web of Science][Medline]
Stowell SR, Dias-Baruffi M, Penttila L, Renkonen O, Nyame AK, Cummings RD. Human galectin-1 recognition of poly-N-acetyllactosamine and chimeric polysaccharides. Glycobiology (2004) 14:157–167.
Tureci O, Schmitt H, Fadle N, Pfreundschuh M, Sahin U. Molecular definition of a novel human galectin which is immunogenic in patients with Hodgkin's disease. J Biol Chem. (1997) 272:6416–6422.
![]()
CiteULike
Connotea
Del.icio.us What's this?
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

