Skip Navigation


Glycobiology Advance Access originally published online on October 23, 2006
Glycobiology 2007 17(2):165-184; doi:10.1093/glycob/cwl062
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrowOA All Versions of this Article:
17/2/165    most recent
cwl062v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (7)
Right arrow Disclaimer
Google Scholar
Right arrow Articles by Wu, A. M.
Right arrow Articles by Gabius, H.-J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Wu, A. M.
Right arrow Articles by Gabius, H.-J.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© 2006 The Author(s)
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

Activity–structure correlations in divergent lectin evolution: fine specificity of chicken galectin CG-14 and computational analysis of flexible ligand docking for CG-14 and the closely related CG-16

Albert M. Wu1,2, Tanuja Singh2, Jia-Hau Liu2, Mickael Krzeminski3, Roland Russwurm4, Hans-Christian Siebert4, Alexandre M.J.J. Bonvin3, Sabine André4 and Hans-Joachim Gabius4

2 Glyco-Immunochemistry Research Laboratory, Institute of Molecular and Cellular Biology, College of Medicine, Chang-Gung University, Kwei-san, Tao-yuan 333, Taiwan
3 Department of NMR Spectroscopy, Bijvoet Center for Biomolecular Research, Utrecht University, Padualaan 8, 3584 CH Utrecht, The Netherlands
4 Institute of Physiological Chemistry, Faculty of Veterinary Medicine, Ludwig Maximilians University, Veterinärstr. 13, 80539 Munich, Germany


1 To whom correspondence should be addressed; e-mail: amwu{at}mail.cgu.edu.tw

Received on June 20, 2006; revised on September 18, 2006; accepted on October 15, 2006


    Abstract
 Top
 Abstract
 Introduction
 Results
 Discussion
 Materials and methods
 Supplementary data
 Acknowledgments
 References
 
Gene duplication and sequence divergence are driving forces toward establishing protein families. To examine how sequence changes affect carbohydrate specificity, the two closely related proto-type chicken galectins CG-14 and CG-16 were selected as models. Binding properties were analyzed using a highly sensitive solid-phase assay. We tested 56 free saccharides and 34 well-defined glycoproteins. The two galectins share preference for the II (Galß1-4GlcNAc) versus I (Galß1-3GlcNAc) version of ß-galactosides. A pronounced difference is found owing to the reactivity of CG-14 with histo-blood group ABH active oligosaccharides and A/B active glycoproteins. These experimental results prompted to determine activity–structure correlations by modeling. Computational analysis included consideration of the flexibility of binding partners and the presence of water molecules. It provided a comparative description of complete carbohydrate recognition domains, which had so far not been characterized in animal galectins. The structural models assigned II, I selectivity to a region downstream of the central Trp moiety. Docking revealed that the tetrasaccharides can be accommodated in their free-state low-energy conformations. CG-14's preference for A versus B epitopes could be attributed to a contact between His124 and the N-acetyl group of GalNAc. Regarding intergalectin comparison, the Ala53/Cys51 exchange affects the interaction potential of His54/His52. Close inspection of simulated dynamic interplay revealed reorientation of His124 at the site of the His124/Glu123 substitution, with potential impact on ligand dissociation. In summary, this study identifies activity differences and provides information on their relation to structural divergence, epitomizing the value of this combined approach beyond galectins.

Key words: blood group / docking / evolution / glycoprotein / lectin / mutation / protein–carbohydrate interaction


    Introduction
 Top
 Abstract
 Introduction
 Results
 Discussion
 Materials and methods
 Supplementary data
 Acknowledgments
 References
 
Gene duplication and ensuing sequence diversification by mutations are key factors in the process to generate proteins with closely related structural properties (Thomas 1993Go). The introduction of amino acid substitutions can entail emergence of new and distinctive properties in members of a protein family. Such products are attractive study objects in the quest to delineate activity–structure relations. Of special interest for thorough study are those cases in which homologous proteins are differentially expressed, because they are then apparently assigned to nonidentical biological functions. Under these circumstances, detailed comparative analysis is expected to teach salient lessons on the versatility of a common domain acquired by amino acid substitutions. Toward this end, a corresponding research program starts with the definition of sequence divergence in the selected protein pair, followed by monitoring the presence of proteins in tissues. The next step then is biochemical determination of binding properties to delineate reactivity profiles with physiological ligands. These efforts are especially well invested when dealing with important regulators of cellular activities.

Owing to the emerging significance of protein (lectin)–carbohydrate recognition for a wide range of biological activities such as adhesion, growth control, migration, or glycan routing (Gabius 1997Go, 2001Go, 2006Go; Villalobo et al. 2006Go), we herein present a model study along this given line. We focus on a family of animal lectins exerting these listed functions, i.e. galectins. The intrafamily diversification of these cellular effectors has been carefully mapped at the level of sequences and phylogenetic dendrograms (Cooper 2002Go; Houzelstein et al. 2004Go). This information guides to the prioritization among members of this lectin family. As consequence, the case of two proto-type chicken galectins (CGs), termed CG-14 and CG-16 owing to their molecular weights as determined by gel electrophoresis, has attracted attention for a pilot study. A close inspection of this protein pair beyond structural relationship reveals further favorable features which give high priority to the study of their binding characteristics.

Structurally, the sequence alignment yielded 71 substitutions per 134 amino acid residues, and ensuing calculations on mutational rates suggested that the origin of the genes for these two lectins dates back to a duplication event at around the divergence of birds from mammals approximately 3 x 108 years ago (Sakakura et al. 1990Go). Evidently, neither the common ß-sandwich fold nor the position of the Trp moiety in the central part of the carbohydrate recognition domain (CRD) is impaired by these sequence changes (Siebert et al. 1997Go; Varela et al. 1999Go). Thus, the basic mode of interaction of these two galectins with the ß-galactoside core of a glycan is maintained. But it is an open question whether the fine specificities for complex glycans will also be constant. The importance of answering the question on the CGs' binding properties to glycoproteins (gps) is underscored by the conspicuous differences in their developmental regulation. These were observed in dermal morphogenesis and organ-specific gene expression, which give reason to assume nonredundant functionalities (Sakakura et al. 1990Go; Akimoto et al. 1993Go, 1995Go). Experimentally, the two galectins also differ in hemagglutination and axon growth/guidance activities, CG-16 being a strong cross-linking agent and capable of blocking proliferation of neuroblastoma (SK-N-MC) cells (Schneller et al. 1995Go; Kopitz et al. 2004Go; André, Kaltner et al. 2005Go).

In order to distinguish sugar-encoded signals on the cell surface, galectins present extended binding sites to complex glycans of natural glycoconjugates (Leffler and Barondes, 1986Go; Kopitz et al. 2001Go; Hirabayashi et al. 2002Go; Siebert et al. 2003Go; Sturm et al. 2004Go; Fischer et al. 2005Go). It is therefore timely to resolve the issue whether and to what extent the fine specificities of these two homologous lectins will have been subject to divergence. In addition, spatial factors of cellular glycans imposed by clustering or substitutions are known to modulate lectin affinity significantly (André, Cejas et al. 1999Go; Andre et al. 2001Go; Andre et al. 2003Go; Andre, Unverzagt et al. 2004Go; Unverzagt et al. 2002Go; Dam et al. 2005Go). Consequently, we extended our panel of glycosubstances beyond commonly used free saccharides. In order to deliberately simulate the natural situation where both sequence and topological features contribute to lectin reactivity, 33 well-characterized gps and a bacterial polysaccharide were also tested. The molecular details of these probes' glycosylation, especially at the branch ends relevant as docking sites for galectins, and the binding profile of CG-16 had been described in our previous report (Wu et al. 2001Go). Here, we first document the results for CG-14. They were obtained under identical conditions to complete the comparative monitoring and to ensure its validity. Following the presentation of these experimental results, activity–structure relations derived from extensive in silico work on the two CRDs are reported. The computational docking analysis with state-of-the-art software provided detailed insights into the nature of the binding sites beyond the core ß-galactoside. Ligand positioning was systematically graded according to the level of interaction energy, and contributions of individual amino acids to the contact to histo-blood group ABH tri- and tetrasaccharides were inferred. Hereby, effects of mutational substitutions in the CRDs could be determined for the first time in this lectin family. Equally important, the probing into the lectin–oligosaccharide interaction included simulation of flexibility for both binding partners. In contrast to crystallographic analysis, the potential for mutual adaptation could thus be inferred. In summary, the given results will have relevance beyond this model pair, because no information on structural aspects of binding of these determinants by mammalian lectins and the inherent dynamics is so far available. In more general terms, they established proof-of-principle for the combined strategy also suitable for the design of custom-made lectin variants and inhibitors.


    Results
 Top
 Abstract
 Introduction
 Results
 Discussion
 Materials and methods
 Supplementary data
 Acknowledgments
 References
 
CG-14–glycan interaction
The first aim was to study the interaction profile of CG-14 by a highly sensitive microtiter plate enzyme-linked lectinosorbent assay (ELLSA) with a panel of natural gps. These test substances present a spectrum of distinct epitopes at their spatially accessible branch ends, simulating the physiological situation. Of particular note, their application in combination with free saccharides allows us to determine the effect of spatial parameters of glycan presentation on a protein backbone. Biotinylated CG-14 was used as probe. Labeling in the presence of ligand resulted in a loss of binding activity of less than 25%, as determined by hemagglutination and solid-phase assays (data not shown). The results obtained with different gps and a bacterial polysaccharide are summarized in Table I. Evidently, CG-14 selects distinct epitopes (Table I). It reacted strongly with three histo-blood group Ah active gps, namely Cyst MSM 10% ppt, Cyst Mcdon and Cyst MSS 10% 2 x ppt (1–3), and one histo-blood group precursor equivalent gp (11). Binding to these gps reached absorbance values (A405) of >3.0 within 4 h and required less than 14 ng gp for 1.5 units of absorbance. CG-14 also bound well to other histo-blood group Ah and Bh active gps (such as cyst Beach phenol insoluble, cyst 9 and 14 phenol insoluble 4–6) and I/II-containing precursor gps (cyst Beach P-1, cyst OG 10% 2 x, and cyst MSS 1st Smith degraded 12–14). Not all ß-galactosides were suitable binding partners. CG-14 reacted weakly or not at all with blood group H or Leb active substances (cyst N-1 Lea 20% 2 x, cyst Tighe, and JS phenol insoluble 8–10), multi-II antennary gps (sialylated and asialo rat sublingual gp (RSL), human {alpha}1-acid gp with/without sialylation and fetuin 15, 16, 20, 26–28, and also the Pneumococcus type 14 polysaccharide 17), T/Tn-containing salivary gps (sialylated and ovine, bovine and porcine salivary asialo gps 21, 29–32, active and inactive antifreeze gps 25 and 33, human glycophorin and its asialo derivative 22 and 23), and sheep hydatid cyst gp (34). Evidently, the topological way an epitope is presented matters significantly for ligand properties. As parameter for comparison, the amount of gp used for coating is listed in Table I. Because the actual amount of glycans adsorbed on the surface of the microtiter plate wells had not been determined in this assay setting, their relative binding properties were further examined in an inhibition assay. To mediate carbohydrate-dependent CG-14 binding to the plate surface, a gp presenting I, II determinants was selected for adsorption, as done previously in the study of CG-16 properties (Wu et al. 2001Go).


View this table:
[in this window]
[in a new window]

 
Table I. Binding of CG-14 to human blood group A, B, H, P1 Lea, and Leb active, sialo, and asialo gps by ELLSA*

 
Inhibition of CG-14–glycan interaction by various polyvalent gps
The ability of various gps to inhibit the binding of CG-14 to a Galß1–3/4GlcNAc containing gp (Beach P-1) purified from human ovarian cyst fluid was systematically analyzed by ELLSA. Five free saccharides were included for internal calibration of the inhibitory potency of the gps. The results are compiled in Table II, and the inhibition curves are shown in Figure 1. Among the natural glycocompounds tested for the inhibition of this interaction, a gp presenting Ah determinants (1, cyst MSM 10% ppt) was the best. The CG-14–glycan interaction was also strongly inhibited by other gps with blood group Ah or Bh activity and their precursors (2–10). Their reactivities were 1.9 x 103, 3.5 x 103, and 9.0 x 104 times higher than that of monomeric Galß1-4GlcNAc (II), Galß1-4Glc (L), and Gal (16, 17, and 21), respectively. The inhibitory potencies of the gps, in general, are in accord with the maximum absorbances recorded in the binding assay (Tables I and II). However, a problem arises when carefully examining the quantitative data and applying the stoichiometric criterion that at least equal molarities of sugar unit and lectin should be reached to result in substantial inhibition. Evidently, the most potent gps reach a higher level of inhibitory capacity than expected on the basis of this criterion. Having herewith determined the reactivity of CG-14 to a series of gps in two assay settings, we next proceeded to define the binding specificity with free sugar compounds. These assays included epitopes known from glycolipids and galactose with aglyconic substituents.


View this table:
[in this window]
[in a new window]

 
Table II. Amount of various gps and free saccharides giving 50% inhibition of binding of CG-14 (125 ng/50 µL) to Beach P-1 (50 ng/50 µL)a

 

Figure 0621
View larger version (18K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Fig. 1. Inhibition of CG-14 binding to a Galß1-3/4GlcNAc-containing gp (Cyst Beach P-1) coated on the surface of ELLSA plate wells by a panel of gps presenting distinct branch-end epitopes. The quantity of gp in the coating solution was 50 ng per well. The quantity of lectin used for the inhibition assay was 125 ng per well. Total volume was 50 µL. A405 was recorded after a 4-h incubation period with substrate. The amount (ng) of gp required to reach 50% inhibition was determined.

 
Inhibition of CG-14–glycan interaction by mono-, di-, and oligosaccharides
These measurements were performed under conditions identical to the assays with the panel of the gps. The complete results are presented in Table III and Figure 2 for substances reaching a level of 50% inhibition and in Table IV for substances which are less active. To enable direct comparison between the inhibitory potencies of gps and free saccharides, we had added five mono- and disaccharides at positions 16, 17, 18, 20, and 21 to Table II. Among the saccharides tested, Galß1-4GlcNAcß1-3Galß1-4Glc (IIß1–3L), [Fuc{alpha}1-2]Galß1-3GlcNAcß1-3Galß1-4Glc (H active Iß1–3L), and the tri-antennary N-linked glycopeptides, with Gal(ß1-4)GlcNAc at the nonreducing end (Tri-II), were the best, being 5.0, 5.0, 2.9 and 5.0 x 102, 5.0 x 102, 2.9 x 102 times more active than Galß1-4GlcNAc (II) and Gal, respectively (1–3 compared with 9 and 7, Table III, Figure 2A). Apparently, the 2,4,2-branched tri-antennary N-glycan from asialofetuin is slightly better than a bi-antennary hexasaccharide, and the presence of three N-glycans in asialofetuin conveys ligand properties to the gp, which are inferior to those of O-glycan-bearing gps from human ovarian cyst fluid. H active Iß1–3L pentasaccharide (2) was 3.8-fold more potent than Iß1–3L (5), whereas H active L (6) was 2.6 times more potent than L (13). These results indicate that introduction of LFuc{alpha}1-2 to parent Iß1–3L or L significantly enhances the binding affinity. In line with the data on gp, binding the completion of A- and B-type headgroups yielded 3.7- or 3.8-fold better inhibitors compared with the fucose-free disaccharides (10 and 27 versus 21 and 34). In direct comparison, the A-type trisaccharide (10) clearly exceeded the B-type glycan (27). Bi-antennary hexasaccharides IIß1–3(IIß1–6)L and Iß1–3(IIß1–6)L were 2.2 and 6.5 times better than Galß1-3/4GlcNAc (I/II) (4 versus 9 and 7 versus 24, Table III), indicating that the affinity of CG-14 for binding partners is significantly enhanced by the length of carbohydrate sequence and the number of antennae in free Galß1-containing ligands. Introduction of an I epitope into bivalent sugar lowered the affinity, and a ß1-6 extension of II did not enhance it (7 and 8, Table III). These two compounds were about 2.9 times less active than tri-antennary Galß1-4GlcNAc (3, Table III), but still 6.5 times more active than Galß1-3GlcNAc (I). Gal(ß1-6)GlcNAc did not reach 50% inhibition, revealing the importance of the ß1–4 linkage at the core galactose unit. These results show that CG-14 has a pronounced preference at this site in the order: Galß1-4 > Galß1-3 > Galß1-6.


View this table:
[in this window]
[in a new window]

 
Table III. Amount of various saccharides giving 50% inhibition of binding of CG-14 (125 ng/50 µL) to Beach P-1 (50 ng/50 µL)a

 

Figure 0622
View larger version (23K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Fig. 2. Inhibition of CG-14 binding to a Galß1-3/4GlcNAc-containing gp (Cyst Beach P-1) coated on the surface of ELLSA plate wells by various free saccharides. The quantity of gp in the coating solution was 50 ng per well. The lectin (250 ng per well) was preincubated with an equal volume of serially diluted inhibitor. The final CG-14 content was 125 ng per well. Total volume was 50 µL. A405 was recorded after a 4-h incubation period with substrate. The amount (nmol) of free saccharides required to reach 50% inhibition was determined.

 

View this table:
[in this window]
[in a new window]

 
Table IV. Reduction of binding of CG-14 (125 ng/50 µL) to Beach P-1 (50 ng/50 µL) by saccharides acting as weak inhibitors

 
Testing further disaccharides revealed a clear preference pattern: GalNAcß1-3Gal (P), GalNAc{alpha}1-3Gal (A), GalNAc{alpha}1-3GalNAc (F), and Galß1-3GalNAc (T) were from 28.5, 18.2 to 16.7 times more active than Gal (14, 21–23 versus 37), but 30 times less active than IIß1–3L (Type 2; no. 1) (Table III). These results indicate that the N-acetyl amino group of the GalNAc residue at the penultimate position (T) might interfere with optimal ligand accommodation, that a histo-blood-group-like {alpha}1-3 extension and its ß-anomer are tolerated, and that the ß1-4 linkage is better than the ß1-3 linkage. GalNAc itself, irrespective of its anomer, is a very weak inhibitor. Glycotopes P{alpha} (GalNAcß1-3Gal{alpha}1-Me) and B{alpha} (Gal{alpha}1-3Gal{alpha}1-Me) were 1.4–15.4 times better than P and B without fixed anomeric position (12 versus 14 and 20 versus 34), respectively, illustrating that anomeric arrest in P{alpha} and B{alpha} epitopes can play a role in binding (Table III). Further testing of building blocks of natural glycans helped to dissect the contributions of distinct structures to the binding of CG-14 to a gp carrying several determinants. In line with the lack of activity of mucin with Tn and T{alpha} (Table III), these two epitopes as free saccharides are weakly inhibitory (Table IV). Activity of the histo-blood precursor gps in Table II is most likely attributable to the I, II structures. Having herewith completed the analysis of a series of 90 glycosubstances as inhibitors of CG-14 binding, it is possible to present a detailed comparison between the properties of CG-14/CG-16, taking the previously published data on CG-16 into account (Wu et al. 2001Go).

Comparison of binding properties
The fact that we deliberately kept all experimental parameters constant in the measurements on the two CGs ensured that the intended comparison of the data sets is valid. The results presented in Tables IIV and Figures 1 and 2 on CG-14 as well as in our previous report on CG-16 (Wu et al. 2001Go) can thus be directly compared. To help the reader identify differences, the two data sets were scrutinized stepwise to spot any disparities. Table V documents the individual aspects of the carbohydrate specificities, arranged into eight categories. Obviously, the two homologous lectins share pronounced preference to II. The relative levels of discrimination between II and I were 20-fold for CG-16 and 6.5-fold for CG-14. Polyvalency is discriminatively sensed by CG-14. When further comparing the reactivity profiles of the tested glycotopes, the assays revealed differences in gp binding regarding substitutions which led to histo-blood group AB epitopes (Table V). They are reflected on the level of assays with free oligosaccharides. Whereas the A- and B-type trisaccharides are potent inhibitors in the case of CG-14, the interaction of CG-16 with first Smith degraded cyst MSS gp could hardly be reduced by their presence, only H active L (Fuc{alpha}1-2Galß1-4Glc) serving as inhibitor with about 10-fold diminished activity relative to lactose (data not shown). Evidently, the presence of the A- and B-type extensions in gps makes marked difference. These observations prompted a close inspection of the CRDs for activity–structure relations. The current lack of information on how animal galectins accommodate these types of oligosaccharides adds a further incentive to address this issue. As illustrated in Figure 3, the two CGs are structurally closely related. Their CRDs are placed at equivalent positions in the jelly-roll-like folding pattern. Whereas the position of lactose hosted in each CRD can readily be depicted, as shown in the middle panel of Figure 4, it is yet an open question how the two homologous CGs interact with ABH type epitopes. To answer it, we performed a computational modeling and docking study. Of note, the binding partners maintained full flexibility in the interaction study, a new feature for docking studies with lectins.


View this table:
[in this window]
[in a new window]

 
Table V. Comparison of the binding properties of the two proto-type CGs CG-14 and CG-16

 

Figure 0623
View larger version (90K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Fig. 3. Comparison of structural aspects between CG-14 and CG-16. Their sequences were set into relation using the program CLUSTAL W (1.60) for multiple sequence alignment (http://bioportal.bic.nus.edu.sg/clustalw/clustalw.html). Identical residues in both sequences are indicated in white letters on black background, similarities in black letters on gray background (top part). The middle and bottom parts present schematic representations of the crystal structure of homodimeric CG-16 (PDB 1QMJ) [(A) and (C)] and the structure of CG-14 protomer [(B) and (D)] derived from homology-based calculations (André, Kaltner et al. 2005Go) in two versions depicting either the protein backbone with the set of antiparallel ß-sheets [(A) and (B)] or the contour surface of the two proteins [(C) and (D)]. The latter version is also used in the real-time documentation of the dynamics in the interaction process (http://www.nmr.chem.uu.nl/haddock/movies/cg.html). The contact site for Galß1-4GlcNAc (II) in the CRDs is highlighted by appropriately positioning the disaccharide. Details on the individual amino acids in the vicinity of the ligands of increasing degree of substitution for the two CRDs are given in Figure 4.

 

Figure 0624
View larger version (48K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Fig. 4. Graphical illustration of the CRDs of CG-16 (left panel) and CG-14 (right panel) pinpointing the contact sites for Galß1-4GlcNAc (upper panel) and histo-blood group A tetrasaccharide. Positioning of key contact sites was deliberately kept constant for direct comparison. Both sugars are drawn in low-energy conformations, and the comparison identifies the regions of each CRD responsible for contact to the {alpha}1-2/3 substitutions on the core galactose moiety.

 
Computational modeling and docking
The generation of the model for CG-14—together with crystallographic information on CG-16—was the first step in our modeling study. The ensuing comparison of the set of key amino acids in the regions of contact for the disaccharide of CG-14 and CG-16 revealed identity (Figure 4, upper panel). Sequence divergence concerned the immediate vicinity of these moieties. Notably, the orientation of the Trp residues will probably not be kept constant, at least in part due to the Thr67/Glu69 substitution following the common Gly to their side. As evidence presented in the last paragraph intimates, the sequence stretch downstream of the Trp moiety might be relevant for II, I selectivity. The models then served as platform to visualize contact sites for ABH epitopes. The sequence additions to turn I, II structures into ABH epitopes will most likely occupy the space, where two amino acid changes are apparent (Figure 4, bottom panel). In detail, Cys51/Ala53 and Glu123/His124 substitutions, among others, separate CG-16 from CG-14. In order to determine whether these differences may have a bearing on ligand binding, we proceeded from rigid-body docking to three stages of increasing level of model-building refinement. First, flexibility was allowed for the amino acid side chains and then for both protein backbone and the side chains, the ligand being treated as flexible throughout the entire procedure. Finally, calculations on complexes selected due to most favorable binding energy were run with explicit consideration of water molecules.

The first lesson taught by the in silico work was that the ligand's free-state low-energy conformation was not distorted by contact with the CRDs. In other words, structures corresponding to populated areas of {Phi}, {Psi}, E-maps for the ABH epitopes (Figure 5) will fit into the respective section of the CRD of both CGs. They will not be subject to a major alteration from their free-state parameters. Such a conformation could thus be drawn from Figure 4 (bottom panel). The positions of three energetically preferred constellations of the ligand associated with the CGs in the simulations are marked in the {Phi}, {Psi}, E-maps in Figure 5. The energy grading in these maps also epitomizes that only a portion of the conformational space is accessible for these oligosaccharides in the free state. This conformational preference turns into a valuable asset for affinity generation when conformers from this region are selected for binding. The actual numbers of average dihedral angles of the glycosidic bonds are listed in Table VI. In essence, they signify that the free-state low-energy topology is suited for complex formation. To answer the question whether it is common for these histo-blood group determinants to retain the free-state conformation, we scoured records on structures of their complexes with proteins. As a result, crystallographic information on three cases involving the {alpha}1-2/3 substitutions on the galactose moiety is given in Table VI for comparison. These data sets support the concept that the free-state low-energy conformation of these oligosaccharides behaves like a "key-like" structure selected for interaction. Nonetheless, to preclude missing a physiologically relevant bound-state conformation, full ligand flexibility was enforced without resulting in a viable alternative. Both CGs therefore select conformers from energetically preferred regions of the {Phi}, {Psi}, E-maps for binding. Their low positional root mean square deviation (RMSD) values from the average structure (between 0.57 and 0.84 Å) signify that these ensembles are structurally well defined. Inspecting the contact sites closely, we find that there is no clash between an amino acid side chain and the ligand, which might preclude its access to CG-16's CRD. Having herewith identified new topological aspects of ligand accommodation for animal galectins shown in Figure 4 (bottom panel) and most probably excluded steric hindrance as explanation for the difference, we proceeded to characterize the energetic contributions of each amino acid/monosaccharide to the overall interaction energy, distinguishing between Coulomb and van der Waals energies.


Figure 0625
View larger version (50K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Fig. 5. Illustration of {Phi}, {Psi}, E-plots of the glycosidic linkages in GalNAc{alpha}1-3Gal [(A) and (B)] and in Fuc{alpha}1-2Gal [(C) and (D)] of the histo-blood group A-tetrasaccharide, with the internal energy calibration from 0 to 5 kcal/mol. The positions of three low-energy conformations of the sugar ligand in complex with CG-16 [(A) and (C)] or CG-14 [(C) and (D)] are defined by the numbers 1, 2, and 3 in the maps.

 

View this table:
[in this window]
[in a new window]

 
Table VI. Average dihedral {Phi} and {Psi} angles of the glycosidic linkages in histo-blood group H-trisaccharide and the A-tetrasaccharide in the free state, in complex with CG-14/CG-16 on the basis of calculations or in complex with other receptor types in crystals

 
The overall distribution of energy contributions for the interaction between the tri- and tetrasaccharides and CRDs of CG-14 and CG-16 reflected a high degree of structural homology (Tables VII and VIII). These data characterize and grade in energetic terms the region responsible for hosting {alpha}1-2/3 substitutions in the two animal galectins. As a consequence, it is possible to offer a reasoning to rationalize the measured preference of CG-14 for histo-blood group A versus B epitopes, as seen in Tables I and II. Close inspection of patterns of interaction in silico revealed contact between the oxygen atom of the N-acetyl group of the {alpha}1-3-linked GalNAc moiety and the imidazole ring of His124 in CG-14, also shown in Figure 4 (bottom panel). Its loss in the histo-blood group B-tetrasaccharide might account for the reduced binding/inhibition capacity of gps with histo-blood group B in contrast to A epitopes. Regarding the differential reactivity of AB epitopes for CG-14 versus CG-16, as given in category 6 of Table V, the presented energetic considerations provide no immediate clue. What could prove helpful besides the estimation of energy terms, though, is thorough monitoring of the dynamic way the ligand–receptor pair is formed. The respective simulations indicated that rigid-body docking will inevitably miss rearrangements of the local environment as a response to the establishing contact. Indeed, the analysis of the routes to mutual structural adaptation offered a hint to pinpoint a potentially relevant difference between the CGs. To allow the reader to follow the reasoning, files visualizing details of the simulated dynamics of the molecular interplay are instructive and are made accessible as Supplementary data.


View this table:
[in this window]
[in a new window]

 
Table VII. Compilation of values for energy of interactiona between CG-14 and the histo-blood group H-trisaccharide and A-tetrasaccharide determinants (for spatial positioning of the listed amino acid residues, please see Figure 4)

 

View this table:
[in this window]
[in a new window]

 
Table VIII. Compilation of values for energy of interactiona between CG-16 and the histo-blood group H-trisaccharide and A-tetrasaccharide determinants (for spatial positioning of the listed amino acid residues, please see Figure 4)

 
The two substitutions in the first paragraph of this section were apparently not neutral in topological aspects. The Ala53/Cys51 exchange affected the orientation of the equivalent His54/His52 residues (Figure 4) and thereby the Coulomb energy term. This effect is clearly seen in Tables VII and VIII. These two tables list major energy contributions by individual amino acids. By itself, the noted substitution has no such direct influence on the energy balance sheet. Intriguingly, the second substitution, i.e. His124/Glu123, yet appears to be responsible for a notable difference in CRD–ligand interplay with potentially direct relevance to the experimental observation. The way the recognition process develops can best be followed by accessing the graphic files available at http://www.nmr.chem.uu.nl/haddock/movies/cg.html and looking at the movies. They document the different stages of the simulations of the interaction between the CGs and the histo-blood group H-trisaccharide/A-tetrasaccharide. Starting with the view on ligand flexibility, the four files visualize increasing degree of flexibility also on the level of the protein. Thereby, the impression of a fine-tuned mutual adaptation process reaching a snug fit is conveyed, especially in the case of CG-14 and the tetrasaccharide. Crucial amino acids are marked in the visual presentations for assignment of surface profile changes. By doing so, the re-orientation of the imidazole ring of His124 of CG-14 as consequence of tetrasaccharide movement into the CRD of CG-14 can be followed. This particular movement might impede ligand dissociation, singling the respective substitution out for further analysis.

With respect to the different levels of selectivity for I and II structures, the graphic files and Figure 4 draw attention to the Glu73/Glu71 moieties (and Arg75/Arg73) which come close to the GlcNAc ring and their environment. The relatively strong interaction of the GlcNAc residue to the Glu73 moiety in CG-14 is a major factor to restrict internal flexibility to a single {Phi}{Psi} combination (Table VI). Next, sequence divergence starting from the conserved Trp-Gly motif (Thr72-Glu73-Gln74 in CG-14, Glu70-Glu71-Asp72 in CG-16) is found in this region. This difference is conducive for disparate ligand specificity. Conversely, a stretch of sequence identity from the central Trp moiety downstream up to Arg75-Glu76 is spotted between CG-14 and human galectin-1. Fittingly, this lectin reacts 2.5-fold more potently with II versus I disaccharides in an inhibition assay using asialofetuin as ligand (Sparrow et al. 1987Go). Equally fitting in terms of an activity–structure correlation, the N-terminal domain of rat galectin-4 with its preference to I and also to T deviates from this shared sequence markedly with Lys-Glu-Glu-Lys-Lys, and rat galectin-5 with similar I and II reactivity also shows divergence with Pro-Glu-Glu-Arg-Ser.


    Discussion
 Top
 Abstract
 Introduction
 Results
 Discussion
 Materials and methods
 Supplementary data
 Acknowledgments
 References
 
The first aim of this study was to define the binding profile of CG-14. The specificity to glycotopes was determined in a highly sensitive solid-phase assay using free saccharides as inhibitors. It is yet becoming clear that not only the sequence but also the topology of ligand presentation contributes significantly to the affinity regulation of lectin–carbohydrate interactions. These factors evidently cooperate to endow cell surface glycans with their characteristic binding preferences toward lectins. In order to adequately take this emerging factor into account besides the glycotope specificity, we performed binding assays with a panel of well-characterized gps. In comparison with previous reports on CG-14, using frontal affinity chromatography or binding to neoglycoproteins, our data are in accord with its weak reactivity against sialylated N-glycans, an increase in affinity upon chain elongation of LacNAc and the binding to the complex-type tri-antennary glycans irrespective of the branching mode (Hirabayashi et al. 2002Go; André et al. 2006Go). These results thus served as positive quality controls. Moving on, especially the series with gps prepared from human ovarian cyst fluid revealed a pronounced responsiveness of CG-14 to clustered multivalency. This finding underscores the importance of combining measurements on glycotope specificity with topology of glycan presentation.

Addressing the second issue of our study, i.e. systematic comparison of the presented data sets with the published evidence on CG-16 (Wu et al. 2001Go), differences were detected. CG-16 exhibits a rather strong preference for II versus I relative to CG-14. In line with previous solid-phase assay data (Wu et al. 2001Go), the free enthalpy change measured by titration calorimetry reached 0.6 kJ/mol between II/I as ligands (Bharadwaj et al. 1999Go). The way CG-16 separates II/I structures is favorable to localize such glycotopes in cells and tissues, as well as to help fractionate complex glycan mixtures with respect to these types of galactosylation. Thus, this avian lectin can be added as tool to the array of plant lectins commonly used in histochemistry and lectin affinity chromatography (Osawa and Tsuji, 1987Go; Peumans and van Damme, 1998Go; Rüdiger and Gabius, 2001Go; Gabius et al. 2004Go; Manning et al. 2004Go). Of note, the previously analyzed rat galectins-4 (N-domain) and -5 either predominantly bind I or show no significant capacity for discrimination (Wu et al. 2002Go, 2004Go, 2006Go). Interestingly, the considerable reactivity of galectin-4 to T or Tn glycotopes not shared by CGs attests to the occurrence of further marked affinity changes within the galectin family (Wu et al. 2002Go, 2004Go). They can only thoroughly be mapped by systematic application of this approach. A further distinct difference in ligand selection between CG-14 and CG-16 concerns gps with the histo-blood group AB epitopes.

These two detected activity changes prompted us to initiate computer modeling. It was performed with the aim to discover activity–structure relations which might explain emergence of substitutions at distinct sites in functional terms. In view of the clinical relevance of mammalian galectins in tumor progression, the development of reliable test protocols for computational simulations could be helpful in drug designs. Its aim will be to devise potent blocking reagents, e.g. to block galectin-1-dependent glioma invasion (Camby et al. 2002Go). The recent cases of the C-type lectins DC-SIGN/DC-SIGNR and their Val351/Ser363 substitution (it eliminates van der Waals interactions of DC-SIGNR to the 2-hydroxyl group of L-fucose in Lex epitopes and restricts its binding to high-mannose-type N-glycans) (Feinberg et al. 2001Go; Guo et al. 2004Go; van Liempt et al. 2004Go) and the extended network of interactions in E- versus P-selectins facilitated by a rotation of the Asn83 residue and a movement of the sialic acid moiety in the sLex glycotope (Somers et al. 2000Go) exemplify ways how sequence divergence can entail diversity in ligand selection. Furthermore, the four haplotypes of the human mannan-binding lectin (wild-type, Gly34Asp, Gly37Gln, Arg32Cys) illustrate the relevance of allelic variance for a lectin's functionality regulated by the quaternary structure (Dean et al. 2006Go). These advances have been mainly expedited by crystallography. Computer modeling faces at least two serious challenges in the case of galectins. First, long-range effects of substitutions may be difficult to predict. As determined in crystals of Cys2Ser/Arg111His mutants of human galectin-1, inversion by 180° of the side chain of Asp123 and shifts in orientations of His52/Trp68 were caused by sequence changes distant to the CRD (López-Lucendo et al. 2004Go). Knowledge-based modeling is thus prone to not fully cover such effects. Next, ligand accommodation can involve a dynamic series of mutual steps of structural adaptation. Arguing against a rigid lock-and-key recognition in solution, small-angle neutron scattering was instrumental in determining a significant decrease in the gyration radius of human galectin-1 by ligand binding (He et al. 2003Go). This challenge can be met by introducing flexibility to the simulations for both binding partners. Therefore, rigid-body docking was deliberately avoided in our analysis, which strived for examining aspects of the dynamics of the molecular interplay. This objective engenders the illustration of the recognition process by motion pictures. They are accessible to the reader in addition to common figures.

With focus on the ligand, the modeling indicates that the binding of the histo-blood group tri- and tetrasaccharides is not accompanied by a major distortion of the free-state low-energy conformations. They were previously described by molecular mechanics calculations and NMR spectroscopy (Yan and Bush, 1990Go; Imberty et al. 1995Go; Koca et al. 1995Go; Casset et al. 1996Go; Otter et al. 1999Go; Azurmendi and Bush, 2002Go). These rather rigid ligands (please see Figure 5 for accessible conformational space) appear to home into the CRDs without a major entropic penalty in the thermodynamic balance sheet owing to the loss of conformational freedom. The implied selection of a low-energy conformer present in solution has already been described for CG-16 and the disaccharide Galß1-2Gal (Siebert et al. 1996Go). The computed topology of the interactions between the tri- and tetrasaccharides and the CRDs provide a first view on the extended binding site of these animal lectins for {alpha}1-2/{alpha}1-3-substituted ß-galactosides. So far, respective information was only available from a protein of the basidiomycete Coprinopsis cinerea (Walser et al. 2004Go). The presented interaction analysis details the individual contributions at the level of amino acids and sugar units. A substantial portion originates from the GalNAc unit, exceeding Fuc as source of binding energy significantly. That this region of the CRD is most likely relevant for physiological ligands is epitomized by the 2.75 and 8.44 kJ/mol gains in free enthalpy when testing Galß1-4Glc in comparison with the histo-blood group H-trisaccharide and A-tetrasaccharide (at 279.6–280.4 K) and human galectin-3 (Bachhawat-Sikder et al. 2001Go). Moreover, placement of bulky substituents at the 3'-position of galactose such as a 4-methoxy-2,3,5,6-tetrafluorobenzamido moiety which stacks against Arg144 is documented for human galectin-3 (Sörme et al. 2005Go).

With focus on the protein, the modeling data presented make it possible to relate distinct sequence substitutions to alterations in direct contacts and/or orientations of crucial side chains in the model of the two CGs. It makes sense to avoid a mutational loss of an indispensable contact site but to modulate its functionality by altering its immediate vicinity, as evidenced for the region downstream of the central Trp residue and also the His124/Glu123 substitution. In addition to I/II preference, the vicinity of the Trp residue comes into contact with the carboxyl group of sialyllactose in human galectin-1 (Ala in position 67) (Siebert et al. 2003Go). It thus regulates the activity of sialylated glycans, too. By making precise calculation-based predictions on the spatial vicinity between distinct constituents of ligand and lectin, this work gives research a clear direction to map the spatial aspects of the interaction experimentally. For this purpose, monitoring saturation transfer in NMR-spectroscopic experiments can prove valuable (Siebert et al. 2003Go). Besides addressing questions on the impact of sequence changes, the computational analysis will also find application in the design of inhibitors, as noted at the end of the Introduction. Equally important, rational design of new lectin variants will become feasible. Explicitly, this kind of in silico analysis may help gauge the effect of mutagenesis when aiming at producing galectin variants with custom-made properties as laboratory tools. Finally, the full consideration of flexibility for both binding partners visualizes that an intricate structural interplay can allow for initial contact formation. The insights into the routes toward an optimal fit which the simulations provided can help narrow the existing gap in our understanding between the static view of ligand binding obtained by crystallography and not-yet-rigorously defined dynamics in solution.


    Materials and methods
 Top
 Abstract
 Introduction
 Results
 Discussion
 Materials and methods
 Supplementary data
 Acknowledgments
 References
 
Free saccharides and glycopeptides
Mono-, di-, and oligosaccharides were purchased from or prepared by Dextra (Reading, Berkshire, UK) and Sigma (Munich, Germany). Tri-antennary glycopeptides with Gal(ß1-4)GlcNAc termini and a small percentage of Gal(ß1-3)GlcNAc termini in the {alpha}1-3 branch were obtained from asialofetuin after pronase digestion and repeated fractionation by BioGel P-4 400 mesh column chromatography (Wu et al. 1998Go). The Tn-presenting glycopeptides (Tn clusters) used for this study were mixtures obtained in the filterable fraction (molecular mass cut-off <3000 Da) (Wu et al. 1997Go).

Gps and polysaccharides
Sources and further processing of these compounds have been given in detail in the previous report on CG-16 (Wu et al. 2001Go). The predominant galectin-reactive sugar determinant of each test substance is listed in Tables I and II, for example, defining gps from human ovarian cyst fluid (i.e. Cyst Beach, MSS, Tighe, Mcdon, and JS phenol insoluble) to harbor I/II and histo-blood group ABH determinants.

Cloning, expression, and purification of CG-14
Total RNA from embryonic chicken kidney (14th day of incubation) was isolated using an RNAeasy kit (Qiagen, Hilden, Germany) according to the manufacturer's protocols; 1 µg of the resulting total RNA was used as template for first-strand synthesis of cDNA. Reverse transcription was performed for 50 min at 42 °C in a 20 µL reaction volume containing first-strand buffer (50 mM Tris–HCl, pH 8.3, 75 mM KCl, 15 mM MgCl2), dNTPs (1 mM each), oligo(dT)12–18 primer (0.5 µg), 200 units SuperscriptTM II RNase H reverse transcriptase (Invitrogen/Life Technologies, Karlsruhe, Germany), and diethylpyrocarbonate-treated water. The reaction was terminated by incubation at 70 °C for 15 min, and 2 µL of this mixture was transferred into a new tube for PCR amplification, with the sense primer 5'-CGTACGGGATCCATCTCTTGTCAGGGACC-3' and the antisense primer 5'-CGCTAGGGATCCTTACCCCAGCTGACAG-3' harboring BamHI restriction sites (underlined). The reaction mixture was prepared as recommended by the manufacturer of the Taq polymerase (Qiagen). After an initial denaturation step at 94 °C for 4 min, 35 cycles (denaturation at 94 °C for 45 s, annealing at 63 °C for 45 s, extension at 72 °C for 1 min) were run. The final extension step was performed at 72 °C for 5 min. The produced amplicon was ligated into the EcoRV-linearized pET-Blue-1 AccepTorTM vector (Novagen®, Merck Biosciences, Darmstadt, Germany) having single 3'dU overhangs and propagated in the commercially available Escherichia coli strain NovaBlue. DNA sequencing ascertained complete agreement to the cDNA sequence as published in GenBank (accession no. M11674 [GenBank] ). To facilitate cloning into an efficient expression system, further sequence modification and subcloning were necessary. Therefore, using the pET-Blue-1/CG-14 plasmid as template in a megaprimer PCR, the HindIII restriction site was impaired by a single nucleotide exchange from C to T at position 93, which caused no amino acid substitution. The amplicon obtained in this mutagenesis procedure was used for a further standard PCR, with the sense primer 5'-CGCTAGCCATGGCTTGTCAGGGACC-3' containing an internal NocI restriction site (underlined) and the antisense primer 5'-CGTACGAAGCTTTTACTCCCAGCTGACAG-3' with an internal HindIII restriction site (underlined). The steps of the PCR protocol were the same as outlined above in this paragraph. Following digestion of the product exploiting the restriction sites, the amplicon was cloned into the NcoI/HindIII sites of the prokaryotic expression vector pQE60 (Qiagen) used for transformation. Bacteria were grown in terrific broth medium (Roth, Karlsruhe, Germany) at 37 °C, and recombinant production was performed under conditions defined to be optimal by systematic testing, applying isopropyl- ß-D-thiogalactoside at a final concentration of 0.5 mM to yield approximately 6 mg CG-14/L. Purification by affinity chromatography on lactosylated Sepharose 4B, obtained after resin activation by divinyl sulfone, as crucial step, quality and purity controls by one- and two-dimensional SDS–PAGE/silver staining and mass spectrometry, biotinylation under activity-preserving conditions and controls for activity by hemagglutination, and solid-phase assays with immunological detection of extent of bound lectin with/without label were performed as described in Gabius et al. (1984)Go, Gabius (1990)Go, André, Kaltner et al. (2004)Go, and André, Kojima et al. (2005Go).

Lectin-enzyme binding assay
The assay was performed according to the procedures and conditions described previously (Duk et al. 1994). The volume of each glycosubstance-containing solution applied to the plate for coating was 50 µL/well, and all incubations, except for the coating step performed overnight at 4 °C, were carried out at room temperature (20–25 °C). Tris-buffered saline (TBS; 0.05 M Tris–HCl, 0.15 M NaCl, pH 7.35) containing 0.05% Tween 20 (TBS-T) was used to prepare the solutions and for washing plate wells between incubation steps.

The 96-well microtiter plates (F96 Maxisorp, Nunc-Immuno plate, Kamstrupvej, Denmark) were coated with gps in 0.05 M carbonate buffer (0.05 M NaHCO3/0.05 M Na2CO3), pH 9.6, overnight at 4 °C. After washing the plate three or four times with 200 µL TBS-T, solution with biotinylated CG-14 (250 ng/well) was added and the plate was incubated for 30 min. The plates were then carefully washed three or four times to remove unabsorbed lectin, and the ExtrAvidin/Alkaline Phosphatase Conjugate (diluted 1 : 10 000, Sigma) was then added to detect the specifically bound probes. After a 1 h incubation step, the plates were again washed at least four times with TBS-T and then incubated with p-nitrophenyl phosphate (Sigma phosphatase substrate 5 mg tablets) in 0.05 M carbonate buffer, pH 9.6, containing 1 mM MgCl2 (one tablet/5 mL). The resulting absorbance was read at 405 nm in a microtiter plate reader after 2–4-h incubation at room temperature in the dark with the substrate-containing solution.

For inhibition studies, serially diluted inhibitor samples were mixed with an equal volume of lectin solution containing a fixed amount of CG-14 (250 ng/well). The control lectin sample was diluted 2-fold with TBS-T. After incubation at room temperature for 1 h, the binding of these samples (lectin–inhibitor mixture) to cyst Beach P-1-coated plate was determined as described in the paragraph above. The inhibitory activity of each compound was delineated from the inhibition curves. It is presented in the tables as the amount of inhibitor (ng or nmol per well) resulting in 50% inhibition of the control level of lectin binding in the absence of any inhibitor.

All experiments were performed in duplicate or triplicate, and data are presented as the mean values of the combined results. The standard deviation did not exceed 10% and in most experiments was even less than 5% of the mean value. Blank wells, where either the coating step or that of incubation with biotinylated lectin was omitted from the standard protocol, gave low absorbance values (below 0.1) after a 2–4-h period of incubation with the chromogenic substrate at room temperature. This showed that blocking the wells before lectin addition was not necessary to achieve low background values, when Tween 20 was present in the buffer.

Computational analysis
The crystal structure of CG-16 (PDB code 1QMJ) and a homology-based structural model of CG-14 (André, Kaltner et al. 2005Go) were used for further in silico processing in the modeling analysis, together with low-energy conformations of the histo-blood group ABH tri- and tetrasaccharides derived from crystal structures (PDB codes 1ULD and 1ULF) and molecular mechanics calculations yielding detailed {Phi}, {Psi}, E-plots to define the conformational space of the ligands (Siebert et al. 2000Go). Systematic investigations on the way these ligands can be accommodated in the CRDs of both galectins were performed by the information-driven docking program HADDOCK (High Ambiguity Driven DOCKing) (version 1.3) (Dominguez et al. 2003Go). The parallhdg.5.3 data set (Linge et al. 2003Go), which includes the optimized potentials for liquid simulations (OPLS) parameters for nonbonded interactions (Jorgenson and Tirado-Rives, 1988Go), was applied to the protein part, and respective sets for the carbohydrates were adapted from crystallography and NMR system (Brunger et al. 1998Go). Coulomb/van der Waals energy terms characterizing the interaction were calculated using a 8.5 Å cut-off value. The dielectric constant {varepsilon} was set to 10 in the first stage of the calculations, when no solvent molecules were present. When water molecules were explicitly added for the second part of the simulations, {varepsilon} was set to 1. In order to figure out contributions of individual amino acid residues and monosaccharides to the energy balance of the complete interaction process, as carried out in detail previously for a carbohydrate-binding 15-mer peptide (Siebert et al. 2006Go), the two energetic terms were separately computed for each contact between an amino acid of the lectin and a sugar moiety below the cut-off limit.

Modeling initial contact formation between ligand and lectin was guided by the evidence that the CGs maintain the characteristic way a ligand's central galactose unit resides in the vicinity of the Trp moiety of the CRD (Siebert et al. 1997Go; André, Kaltner et al. 2005Go). In detail, the distance restraints for defining the relative positions of the binding partners suitable to drive the simulations were derived from inspection of the mode of interaction between human galectin-1 and galactose as part of lactose in the crystal state (PDB code 1W6M), using the LigPlot program (Wallace et al. 1995Go). Five distance restraints (with distance values between 2 and 4 Å) involving the same type of amino acid at equivalent positions were accordingly obtained in the cases of both CGs. They involve atom pairs of the sugar and the respective contacting amino acid residue, namely the oxygen atom O1 of galactose and the hydrogen atom NH{varepsilon}2 of His44 (CG-16) or His46 (CG-14), the oxygen atoms O4 and the O5 atoms of the sugar and the hydrogen atom NH2{eta} of Arg48 (CG-16) or Arg50 (CG-14), the oxygen atom O6 of the sugar and the hydrogen atom NH{delta}2 of Asn61 (CG-16) or Asn63 (CG-14), as well as the HO6 hydrogen atom of the ligand and the oxygen atom O{varepsilon} of Glu71(CG-16) or Glu73 (CG-14). Having defined this set of knowledge-based interatom distances, the ensuing calculations were divided into separate stages of increasing level of refinement in order to describe the conformational space of both binding partners in the complex comprehensively. Protein–ligand docking allowing full flexibility of the oligosaccharide was performed during the simulated annealing stage of HADDOCK. In other words, rigid-body docking was avoided. The hardware for these computations was a Linux Beowulf cluster consisting of 16 dual processor nodes (Opteron 246, 2 GHz). To gauge the incurred CPU time costs, one complete docking run took approximately 8 h on 10 processors. The first stage encompassed a conformational search at high temperature (2000 K), followed by three annealing stages (please see Dominguez et al. 2003Go for program details). The total number of integration steps was 16 000. The ligand maintained full flexibility throughout the protocol to let it adopt its bound-state conformation should complex formation be accompanied by a distortion of the free-state low-energy structure. Regarding the lectins, the degree of flexibility was increased stepwise, first only for side chains of the amino acids and later for the CGs' backbones as well. As a result, a total of 300 models of the complex between each galectin and ligand were generated, which were then systematically ranked according to their interaction energy. The top 250 structures of each complex were subjected to a final refinement in the presence of explicit water molecules. The resulting models were grouped into clusters on the basis of similarity of their binding modes. Clustering was performed using a 1 Å cut-off value on the basis of the pairwise RMSD matrix for the carbon atoms of each ligand after fitting on the backbone atoms of the protein. The cluster harboring the most favorable intermolecular energy terms was selected as representative model of the complex, and this structure was closely examined regarding energetic aspects of the intermolecular recognition on the basis of amino acids/monosaccharides. The motion pictures visualizing the docking process were generated with Pymol (http://www.pymol.org).


    Supplementary data
 Top
 Abstract
 Introduction
 Results
 Discussion
 Materials and methods
 Supplementary data
 Acknowledgments
 References
 
Supplementary data are available at Gycobiology online (http://glycob.oxfordjournals.org/).


    Acknowledgments
 Top
 Abstract
 Introduction
 Results
 Discussion
 Materials and methods