Skip Navigation


Glycobiology Advance Access originally published online on January 19, 2007
Glycobiology 2007 17(5):23R-34R; doi:10.1093/glycob/cwm005
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrowOA All Versions of this Article:
17/5/23R    most recent
cwm005v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Disclaimer
Google Scholar
Right arrow Articles by Bishop, J. R.
Right arrow Articles by Gagneux, P.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Bishop, J. R.
Right arrow Articles by Gagneux, P.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© 2007 The Author(s)
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.


REVIEW

Evolution of carbohydrate antigens—microbial forces shaping host glycomes?

Joseph R. Bishop2 and Pascal Gagneux1,2

2 Glycobiology Research and Training Center, Cellular and Molecular Medicine-East, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA 92093-0687


1 To whom correspondence should be addressed; e-mail: pgagneux{at}ucsd.edu

Received on December 5, 2006; revised on January 10, 2007; accepted on January 10, 2007


    Abstract
 Top
 Abstract
 Introduction
 Distribution of glycans within...
 Why do glycans evolve?
 Disclaimer about limitations of...
 Glycans in metazoan animals
 Variation in animals glycan...
 Genes coding for glycan...
 Animal lectin intractions and...
 The evolutionary glycan arms...
 Glycans as innate markers...
 Nonself glycan and adaptive...
 Host symbiont coevolution
 Adaptation by glycan loss
 Adaptation by glycan gain
 Is there neutral variation...
 Glycans as markers of...
 Appreciating complex microbial...
 Fundamental asymmetries in...
 Future directions and...
 Conflict of interest statement
 Acknowledgments
 References
 
Many glycans show remarkably discontinuous distribution across evolutionary lineages. These differences play major roles when organisms belonging to different lineages interact as host–pathogen or host–symbiont. Certain lineage-specific glycans have become important signals for multicellular host organisms, which use them as molecular signatures of their pathogens and symbionts through recognition by a toolkit of innate defense molecules. In turn, pathogens have evolved to exploit host lineage-specific glycans and are constantly shaping the glycomes of their hosts. These interactions take place in the face of numerous critical endogenous functions played by glycans within host organisms. Whether due to simple evolutionary divergence or adaptive changes under natural selection resulting from endogenous functional requirements, once different lineages elaborate on differential glycomes these mutual differences provide opportunities for host exploitation and/or pathogen defense between lineages. Such phylogenetic molecular recognition mechanisms will augment and likely contribute to the maintenance of lineage-specific differences in glycan repertoires.

Key words: glycan / co-evolution / host-pathogen / animal lectin


    Introduction
 Top
 Abstract
 Introduction
 Distribution of glycans within...
 Why do glycans evolve?
 Disclaimer about limitations of...
 Glycans in metazoan animals
 Variation in animals glycan...
 Genes coding for glycan...
 Animal lectin intractions and...
 The evolutionary glycan arms...
 Glycans as innate markers...
 Nonself glycan and adaptive...
 Host symbiont coevolution
 Adaptation by glycan loss
 Adaptation by glycan gain
 Is there neutral variation...
 Glycans as markers of...
 Appreciating complex microbial...
 Fundamental asymmetries in...
 Future directions and...
 Conflict of interest statement
 Acknowledgments
 References
 
Carbohydrates makeup a substantial portion of the biomass on earth, mostly in the form of the two structural polysaccharides—cellulose from plants and chitin from arthropods and fungi. All known living organisms also display an array of free or covalently attached carbohydrates collectively known as glycans (Varki et al. 1999Go). Some of these complex molecules decorate the surface of cells and are secreted into the surrounding environment where they function in a wide variety of processes required for life including structural support, protection, recognition, localization, and information/nutrient transfer. The precise compositions and combinations of different carbohydrates making up the glycan repertoire of each species can differ dramatically. The rapid development of glycomics methods (Raman et al. 2005Go) is bound to greatly increase our knowledge about natural glycan diversity, and evolutionary considerations will be crucial for interpreting glycan function within and between organisms (Varki 2006Go).

Despite recent advances, we are yet to have a complete inventory of naturally occurring monosaccharides used to produce the glycan portion of these molecules, as many members of the Bacteria and Archaea domains synthesize a number of specialized carbohydrates (Schaffer et al. 2001Go). In contrast, metazoan animals build most of their glycans from a very limited number of monosaccharide building blocks, allowing us to consider how these molecules might have evolved over time. Most metazoan glycoconjugates are built from six classes of monosaccharides including sialic acids, hexoses, hexosamines, deoxyhexoses, pentoses, and uronic acids (Varki et al. 1999Go) see Box 1. These monosaccharides can of course be modified to create greater complexity at the single monosaccharide level. Furthermore, the individual carbohydrate units can be attached via a variety of glycosidic linkages, into highly complex linear or branched structures. Thus in theory, there is virtually no limit to the number of different glycans that can be generated. In practice though metazoan animals seem to generate only a limited range of these possibilities.

It would be impossible to do justice to the overwhelming diversity of natural glycans and their functions in one review. Fortunately, a number of excellent recent reviews and texts address the biology of individual classes of glycans as well as their endogenous ligands, the glycan-binding animal lectins (Staudacher et al. 1999Go; Angata and Varki 2002Go; Esko and Selleck 2002Go; Spiro 2002Go; Lowe and Marth 2003Go; Varki and Angata 2006Go). The aim of this review is to address the taxonomic distribution of glycans and to reflect on the processes that are shaping this distribution. Our main focus will be on how interactions between multicellular animal hosts and their microbial or viral pathogens as well as symbionts may have contributed to the observed lineage-specific constellations of certain glycans, especially extracellular glycans.


    Distribution of glycans within the tree of life
 Top
 Abstract
 Introduction
 Distribution of glycans within...
 Why do glycans evolve?
 Disclaimer about limitations of...
 Glycans in metazoan animals
 Variation in animals glycan...
 Genes coding for glycan...
 Animal lectin intractions and...
 The evolutionary glycan arms...
 Glycans as innate markers...
 Nonself glycan and adaptive...
 Host symbiont coevolution
 Adaptation by glycan loss
 Adaptation by glycan gain
 Is there neutral variation...
 Glycans as markers of...
 Appreciating complex microbial...
 Fundamental asymmetries in...
 Future directions and...
 Conflict of interest statement
 Acknowledgments
 References
 
Glycans occur in a discontinuous and puzzling distribution across evolutionary lineages. Examples of discontinuously distributed glycans are presented in Table I. The hypothetical evolutionary relationships of living organisms can be depicted in the form of phylogenetic trees. Figure 1 shows three phylogenies depicting the evolutionary relationships: between the three domains of life (Figure 1A), among Eukarya (Figure 1B) and among the anthropoid primates (Figure 1C), respectively, along with the distribution patterns of selected glycan across different evolutionary lineages. As seen, the distribution patterns of glycans fall into four general patterns.

  1. Glycans conserved across many taxa. In contrast to ribosomal RNA that is present in all living organisms, thus allowing the reconstruction of these phylogenies, no single glycan structure has been conserved to the same extent. An example for a relatively conserved class of glycan would be N-glycans found in organisms of all the three primary lineages of life, albeit absent from many bacteria (Figure 1A).
  2. Glycans specific to a particular lineage, such as capsule murein peptidoglycans in bacteria (Figure 1A) or gangliosides in vertebrates (Figure 1B).
  3. Glycans similar across distant taxa, examples include glycosaminoglycans found in metazoans and bacteria (Figure 1A); cellulose in plants, bacteria and tunicates; sialic acids (long thought to be unique to metazoan animals) of the deuterostome lineage and also found in many bacteria and in cephalopod mollusks (squid and octopus); or Gal(Fuc alpha 1–4) N-acetylglucosamine (GlcNAc) (Lewis A) only found in primates, some other vertebrates, plants, and few pathogenic bacteria (Figure 1B), and
  4. Glycans conspicuously absent from very restricted taxa only (species, families, or higher units) within lineages that otherwise possess such glycans. Examples include Gal alpha 1–4Gal beta1–4GlcNAc present in most vertebrates but absent in mammals and some birds (Figure 1B); Gal alpha 1–3 Gal beta 1–4GlcNAc (alpha-Gal) present in most mammals, but absent in Old World monkeys, apes and humans (Catarrhines), and N-glycolylneuraminic acid (Neu5Gc) present in most vertebrates but absent in humans (Figure 1C).


View this table:
[in this window]
[in a new window]

 
Table I. Some glycans with strikingly discontinuous taxonomic distribution

 

Figure 1
View larger version (50K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Fig. 1. Repeated phylogenies and glycan distributions for (A) the three domains of life, (B) eukaryote lineages, and (C) anthropoid primate lineages. Phylogenetic trees are redrawn and modified from Purvis (1995)Go, Angata and Varki (2002)Go, and Stearns and Hoekstra (2005)Go.

 

    Why do glycans evolve?
 Top
 Abstract
 Introduction
 Distribution of glycans within...
 Why do glycans evolve?
 Disclaimer about limitations of...
 Glycans in metazoan animals
 Variation in animals glycan...
 Genes coding for glycan...
 Animal lectin intractions and...
 The evolutionary glycan arms...
 Glycans as innate markers...
 Nonself glycan and adaptive...
 Host symbiont coevolution
 Adaptation by glycan loss
 Adaptation by glycan gain
 Is there neutral variation...
 Glycans as markers of...
 Appreciating complex microbial...
 Fundamental asymmetries in...
 Future directions and...
 Conflict of interest statement
 Acknowledgments
 References
 
Divergence
Like that of any biological molecule, glycan evolution is likely to occur simply due to the divergence of evolutionary lineages. Phylogenies (literally: "history of lineages") come about mostly by the successive bifurcation of lineages, as populations derived from a common ancestor cease to exchange genetic information (i.e., become reproductively isolated). See Box 2 for a list of some key evolution terminology. The genetic tool kits responsible for glycan synthesis and modification of different lineages are subsequently shaped by independent mutational histories, causing the glycan repertoires (glycomes) of different lineages to diverge as well. An example would be the use of cellulose in plants but not in metazoans, with the exception of tunicates (Figure 1B). Divergence involves much historical contingency, where random changes in different lineages, such as the recruitment of certain glycan types over others for specific functions, limit the future evolution of their glycomes.

Natural selection
Selective pressures resulting from recognition processes disproportionately affect the glycans covering cell surfaces. Natural selection acts on glycans, either by favoring the maintenance of a particular glycan (stabilizing or purifying selection) or by diminishing survival and/or reproductive success of organisms carrying a certain glycan (negative selection). Maintenance of the N-glycan synthesis pathway in all eukaryotes is an example of stabilizing selection, since disruptions often lead to lethal consequences (Chui et al. 2001Go; Schachter 2002Go). Negative selection on glycans could occur whenever an important pathogen exploits a particular glycan as a receptor for infection. Positive selection would entail selection for rapid change in glycans e.g., to accommodate novel endogenous functions.

Convergence
Still another mechanism for generating diversity occurs when organisms belonging to distantly related lineages recruit or "reinvent" similar subsets of glycan repertoires. Such parallel events may be due to particular demands of the environment or be due to random recruitment of ancestral synthetic pathways. The existence of the Lewis A antigen [Gal beta 1–4 (Fuc alpha 1–4)GlcNAc] in Catarrhines and in plants could be such an example, as the enzymes involved in its synthesis have very different genomic sequences (Palma et al. 2001Go; Javaud et al. 2003Go) (Figure 1B). Alternatively, what appears as convergent evolution could result from the differential retention of ancestral enzymatic tool kits confined to a few distantly related lineages.

Coevolution
When organisms belonging to different lineages repeatedly interact, as is the case in most natural ecological communities, then their glycomes can become involved in coevolutionary processes. Thus, the interactions of two distinct glycomes of the interacting lineages directly influence their mutual evolution. There is ample evidence for coevolution in glycan diversity in the interactions of microbes and their animal hosts. These cases of coevolution involve two distinct phenomena: (i) independent evolution of enzymatic tool kits for the production of identical molecules in microbes. Examples include glycans found almost exclusively in multicellular hosts and in their microbial pathogens such as glycosaminoglycans and sialic acids (Figure 1B), and (ii) synthesis of "mimic", molecules not identical but very similar to hosts glycans such as polylegionaminic acids by Legionella or pseudaminic acid by Pseudomonas (Knirel et al. 1987Go; Kooistra et al. 2001Go). The Lewis A antigen is also found in certain strains of Helicobacter pylori, which infect humans, likely reflecting coevolution (Monteiro et al. 1998Go). Coevolution could also be occurring via horizontal gene transfer between metazoans and their bacterial pathogens, as has been discussed for genes involved in sialic acid synthesis (Angata and Varki 2002Go).


    Disclaimer about limitations of evolutionary research
 Top
 Abstract
 Introduction
 Distribution of glycans within...
 Why do glycans evolve?
 Disclaimer about limitations of...
 Glycans in metazoan animals
 Variation in animals glycan...
 Genes coding for glycan...
 Animal lectin intractions and...
 The evolutionary glycan arms...
 Glycans as innate markers...
 Nonself glycan and adaptive...
 Host symbiont coevolution
 Adaptation by glycan loss
 Adaptation by glycan gain
 Is there neutral variation...
 Glycans as markers of...
 Appreciating complex microbial...
 Fundamental asymmetries in...
 Future directions and...
 Conflict of interest statement
 Acknowledgments
 References
 
While we would certainly agree with the statement that "nothing in glycobiology makes sense, except in the light of evolution" (Varki 2006Go), we must also realize that evolution only occurred once and that evolution does not follow well-defined rules (Lewontin 2002Go). This situation is somewhat alleviated by the fact that after lineages diverge, more often than not they remain separated for good and, thus provide researchers with large numbers of iterations ("pseudo samples") for which evolutionary processes have occurred independently. The study of these divergent lineages provides a good opportunity to elucidate evolutionary mechanisms.

A further limitation arises with regard to glycan changes in rapidly evolving organisms such as microbes or viruses, as it is impossible to gain information from long-extinct pathogens, which leave no fossils. The speed of evolution in pathogens means that the identity of past pathogens will never be known and that many current pathogens may be descendents of earlier innocuous microbes or even former symbionts. Rapid evolutionary rates are also associated with homoplasy, i.e., if the observed similarity between glycans is not necessarily due to recent shared ancestry but could have evolved independently in different lineages (convergence). In the era of genomics, the ability to investigate the genomic sequences of the genes coding for enzymes that assemble and modify glycans in different lineages provides a powerful means of reconstructing the evolutionary history of glycosylation by determining key events in the establishment of glycan synthesis machinery.


    Glycans in metazoan animals
 Top
 Abstract
 Introduction
 Distribution of glycans within...
 Why do glycans evolve?
 Disclaimer about limitations of...
 Glycans in metazoan animals
 Variation in animals glycan...
 Genes coding for glycan...
 Animal lectin intractions and...
 The evolutionary glycan arms...
 Glycans as innate markers...
 Nonself glycan and adaptive...
 Host symbiont coevolution
 Adaptation by glycan loss
 Adaptation by glycan gain
 Is there neutral variation...
 Glycans as markers of...
 Appreciating complex microbial...
 Fundamental asymmetries in...
 Future directions and...
 Conflict of interest statement
 Acknowledgments
 References
 
In metzoan animals, cell surfaces are covered with an electron dense coating of glycoconjugates known as the glycocalyx. Further, glycans are directly secreted as polymers or attached to proteins into the extracellular matrix and body fluids. This glycan landscape is often (for functional and historical reasons) characteristic of both species and particular cell types. (Paulson and Colley 1989Go; Roth 1996Go). Four basic types of glycoconjugates are present in metazoans including N-linked, O-linked, glycolipids, and proteoglycans (Varki et al. 1999Go). These molecules play a large array of functions required for life including support, signaling, protein folding, and protection (Table II).


View this table:
[in this window]
[in a new window]

 
Table II. Endogenous functions of glycans in metazoans that go beyond structural function

 
Why vertebrates use only such a small fraction of monosaccharide types for the assembly of their glycans remains a mystery (Box 1). For example, what is the reason why vertebrates, unlike plants do not carry terminal xylose on their N-glycans or incorporate any trehalose in their glycan repertoire? Absences of such structures are likely to represent cases of lineage-specific evolutionary happenstance (contingency), whereby the independent mutational history of different lineages has led to differential evolution of glycan biosynthesis enzymes along separate lineages. Paradoxically, however, even with their relatively reduced panel of monosaccharides (compared to bacteria for example), vertebrates generate a staggering amount of structural variation by combining just nine principal monosaccharides into chains of varying lengths and degrees of branching on differentially decorated proteins and lipids (Manzi et al. 2000Go). It also appears that despite the relative small number of different building blocks (monosaccharides),vertebrates produce much more complex branched N-glycans than many other lineages (Varki et al. 1999Go).


Box 1. Principal building blocks of vertebrate glycans

Sialic acids: e.rg. N-acetylneuraminic acid (Neu5Ac. N-glycolylneuraminic acid (Neu5Gc)

Hexoses: Glucose, mannose, galactose (Gal)

Hexosamines: N-acetylglucosamine (GlcNAc. N-acetylgalactoseamine GalNAc

Deoxyhexoses: Fucose (Fuc)

Pentoses: Xylose

Uronic acids: Iduronic acid, glucuronic acid

 


Box 2. Glossary of evolution terminology

Antagonistic coevolution: "evolutionary arms race", where changes in one lineage of a pair of host-parasite lineages are prompted by or prompt changes in the other lineage.

Convergence: similarity between taxa despite independent evolutionary histories.

Catarrhine: primates belonging to Old World monkeys, apes and humans.

Demographic bottleneck: strong reduction in population size.

Divergence: differences between taxa due to independent evolutionary histories.

Domain: one of the three radiations of life including the Archaea, Bacteria, and Eukarya.

Founder event: establishment of new populations by small numbers of founder individuals.

Frequency dependent selection: when the fitness of a genotype depends on its frequency.

Genetic drift: random variation in gene frequency from one generation to another.

Historical contingency: the effect of random event on the probability of subsequent events in a lineage.

Homoplasy: similarities in character states for reasons other than inheritance from a common ancestor. These include convergence, parallelism, and reversal.

Lineage: group of organisms sharing a common ancestor (monophyletic).

Phylogeny: hypothetical history of related lineages based on DNA sequences or any other heritable derived traits.

Purifying (stabilizing) selection: a type of selection that removes individuals from both ends of a phenotypic distribution thus maintaining the same distribution mean.

Trade-off: the balancing of different selection pressures especially when these have opposing directions.

 


    Variation in animals glycan antigens in time and space
 Top
 Abstract
 Introduction
 Distribution of glycans within...
 Why do glycans evolve?
 Disclaimer about limitations of...
 Glycans in metazoan animals
 Variation in animals glycan...
 Genes coding for glycan...
 Animal lectin intractions and...
 The evolutionary glycan arms...
 Glycans as innate markers...
 Nonself glycan and adaptive...
 Host symbiont coevolution
 Adaptation by glycan loss
 Adaptation by glycan gain
 Is there neutral variation...
 Glycans as markers of...
 Appreciating complex microbial...
 Fundamental asymmetries in...
 Future directions and...
 Conflict of interest statement
 Acknowledgments
 References
 
Transient glycan variation in animals has been documented during key processes such as pregnancy, lactation, infection, or acute phase response, whereas ontogenetic glycan variation plays key roles in the regulation of metazoan development (Haltiwanger and Lowe 2004Go). Glycans can also vary in space, as different compartments and adjacent tissues in many animal species carry different glycan repertoires. In a given metazoan, one could detect different glycan distribution from the outside as the secreted mucins and bound mucins of the mucous membranes, the epithelia, the basal layers, the stroma, the endothelia of blood vessels, the different types of immune cells, the cells of the peripheral and central nervous system, and the reproductive systems all usually vary with respect to their surface glycans (Ohtsubo and Marth 2006Go). Such variation and its distribution might reflect trade-offs between the needs for endogenous function and adaptation to external selective pressures from pathogens or accommodation of important symbionts.


    Genes coding for glycan biosynthetic enzymes have undergone substantial expansion contributing to glycan diversity in metazoans
 Top
 Abstract
 Introduction
 Distribution of glycans within...
 Why do glycans evolve?
 Disclaimer about limitations of...
 Glycans in metazoan animals
 Variation in animals glycan...
 Genes coding for glycan...
 Animal lectin intractions and...
 The evolutionary glycan arms...
 Glycans as innate markers...
 Nonself glycan and adaptive...
 Host symbiont coevolution
 Adaptation by glycan loss
 Adaptation by glycan gain
 Is there neutral variation...
 Glycans as markers of...
 Appreciating complex microbial...
 Fundamental asymmetries in...
 Future directions and...
 Conflict of interest statement
 Acknowledgments
 References
 
A substantial fraction (1–2%) of animal genes function in glycan biosynthesis and modification. Unlike genes coding for a single protein product, these enzymes work in an "assembly-line" like system of glycan synthesis pathways (Lowe and Marth 2003Go). These pathways allow organisms to generate rapid phenotypic changes based on posttranslational modification of their glycoconjugates. Glycosyltransferases, which catalyze the addition of sugars to growing glycan chains and proteins, have been subject to multiple lineage-specific expansions via gene duplication (Lespinet et al. 2002Go). In mammals, for example, there are 9 fucosyl transferases, compared to 4 in Drosophila and up to 18 in Caenorhabdiitis elegans (Javaud et al. 2003Go). In the case of fucosyl transfereases, genomic analyses have determined that the more ancient genes in mammals had multiple exons and typically encode enzymes that transfer fucose near the base of the N- or O-peptidic sequences, whereas more recent genes are monoexonic and encode transferases acting at the periphery (termini) of glycans (Oriol et al. 1999Go). A variety of genetic mechanisms caused this expansion, including duplication, exon shuffling, point mutations, and transposition (Javaud et al. 2003Go). Similar examples are seen in a host of enzymes including sialyltransferases (19 in mammals) and heparan sulfate proteoglycan modification enzymes (15 in mammals) (Esko and Lindahl 2001Go; Harduin-Lepers et al. 2005Go). As the majority of these enzymes reside in the endoplasmic reticulum (ER)–Golgi secretory system, these organelles have rightly been called the "evolvability" module of animal cells providing organisms with machinery for generating variation through combinatorial modification of expressed proteins (Kirschner and Gerhart 1998Go).

Genetic studies in model organisms with null mutations in biosynthesis genes have proved that many glycans are required for proper metazoan development, as these mutations produce phenotypes ranging from embryonal lethality to growth defects to impaired morphogenesis and cognitive function—but some can also have no obvious effects under laboratory conditions (Natsuka and Lowe 1994Go; Kotani et al. 2001Go; Lowe and Marth 2003Go; Kudo et al. 2006Go). It is conspicuous that the consequences of experimental abolition of many glycans are often not evident in animal cell cultures, even when these prove to be lethal as early as the embryonic stage in the whole organism from which the cells are cultured (Grobe et al. 2002Go). These findings point to key functions of glycans for multicellular development, but they also leave open the possibility that a certain fraction of animal glycans can be selectively neutral, i.e., these can be altered without incurring major fitness costs to the organism. Laboratory studies looking at consequences of experimental glycan alteration for individuals are unlikely to shed light on population-level effects of glycan polymorphism, such as the proposed protective effects in preventing the rapid spread of pathogens due to herd immunity-related mechanisms (Gagneux and Varki 1999Go). This idea remains untested in part because such effects would be based on populations rather than individuals.


    Animal lectin intractions and glycan evolution
 Top
 Abstract
 Introduction
 Distribution of glycans within...
 Why do glycans evolve?
 Disclaimer about limitations of...
 Glycans in metazoan animals
 Variation in animals glycan...
 Genes coding for glycan...
 Animal lectin intractions and...
 The evolutionary glycan arms...
 Glycans as innate markers...
 Nonself glycan and adaptive...
 Host symbiont coevolution
 Adaptation by glycan loss
 Adaptation by glycan gain
 Is there neutral variation...
 Glycans as markers of...
 Appreciating complex microbial...
 Fundamental asymmetries in...
 Future directions and...
 Conflict of interest statement
 Acknowledgments
 References
 
In many cases, the endogenous function of glycans requires interaction with proteins, and recent decades have seen the discovery of a growing list of animal lectins with specific carbohydrate recognition domains (CRD) (Drickamer and Taylor 1993Go; Gabius 1997Go; Probstmeier and Pesheva 1999Go; Rini and Lobsanov 1999Go). Binding is usually highly specific for glycan type, as defined by its monosaccharide composition and the nature of the glycosidic linkages by which these are connected (Drickamer and Taylor 1993Go; Kaltner and Stierstorfer 1998Go; Kilpatrick 2002Go). Lectin–glycan interactions can mediate a variety of cell–cell recognition events including interspecies (host–pathogen), intraspecies (fertilization and gestation), and intercellular (development and immune regulation) interactions (Gabius et al. 2002Go). Much of the diversity of metazoan glycans is found at the periphery of the glycans involving terminal portions capped by sialic acids, fucose, galactose or GalNac or, in the case of proteoglycans, by directed modification of linear glycan chains (Varki 1993Go; Esko and Lindahl 2001Go). This increase in diversity towards the exterior termini of glycans on the surfaces of mammalian cells (Dennis et al. 1999Go; Varki 2006Go), combined with the fact that animal lectins often recognize specific glycan structures found on the termini, strongly suggests that recognition processes rather than simple divergence are driving this diversity. It is no surprise therefore, that microbial and viral pathogens of metazoan hosts have evolved their own sets of lectins to exploit these molecules for host recognition, attachment and tissue tropism (Karlsson 1995Go; Sharon 1996Go; Rostand and Esko 1997Go).


    The evolutionary glycan arms race
 Top
 Abstract
 Introduction
 Distribution of glycans within...
 Why do glycans evolve?
 Disclaimer about limitations of...
 Glycans in metazoan animals
 Variation in animals glycan...
 Genes coding for glycan...
 Animal lectin intractions and...
 The evolutionary glycan arms...
 Glycans as innate markers...
 Nonself glycan and adaptive...
 Host symbiont coevolution
 Adaptation by glycan loss
 Adaptation by glycan gain
 Is there neutral variation...
 Glycans as markers of...
 Appreciating complex microbial...
 Fundamental asymmetries in...
 Future directions and...
 Conflict of interest statement
 Acknowledgments
 References
 
The ubiquitous presence of species-specific glycans on host cells and secretions predispose these as convenient receptors to be exploited by microbes for host recognition, attachment, and invasion by way of a wide array of microbial and viral lectins including adhesins, pili, fimbriae, and hemagglutinins (Gilboa-Garber and Garber 1989Go; Wadström and Ljungh 1999Go). For specific examples of glycan-mediated host–pathogen interactions, we refer the reader to several excellent reviews (Sharon 1996Go; Rostand and Esko 1997Go; Hooper and Gordon 2001Go; Olofsson and Bergstrom 2005Go). Host invasion often occurs via the large epithelial layers lining the external cavities of vertebrates, which participate in gas exchange, olfaction, nutrient uptake, secretion, and reproduction. Outer epithelia are characterized by mucous covered membranes derived from glycoconjugates (mucins, proteoglycans, etc) that form an important interface between these animals and their environments replete with ubiquitous microbes. Rather than providing convenient points of invasion, mucin secretions may act as efficient decoys or smokescreens absorbing intruding pathogens before they can invade (Perrier et al. 2006Go). Apart from host recognition and invasion, microbes can further exploit multicellular host glycans in a variety of ways: they can (i) scavenge host glycans and use these as carbon source (Sonnenburg et al. 2005Go). (ii) engage in host mimicry by synthesizing glycans identical or nearly identical to those of the host (Martin et al. 1997Go; Bersudsky et al. 2000Go; Harvey et al. 2001Go; Vimr et al. 2004Go), and (iii) modulate host glycans by expressing glycosidases to destroy host decoy glycans or to expose more appropriate underlying saccharides for lectin interaction (Dwarakanath et al. 1995Go; Vimr et al. 2004Go). The intracellular parasite Trypanosoma cruzii even expresses an enzyme that transfers host sialic acids to its own cell surface as a type of camouflage (Colli 1993Go).

It may seem that rapid glycan structural change in response to pathogenic microbes would be the best route to evade infection, however, change of glycans has the potential of negatively affecting critical endogenous functions or jeopardizing successful interaction with symbionts (see Hostsymbiont coevolution). Given this, we speculate that microbe driven alteration in host glycan structure is more likely if the change minimally affects endogenous function(s) or if the selection is strong enough to outweigh the impaired endogenous function. Similarly, pathogens can evolve to counter-adapt to changes in host glycan structure by altering ligand/receptor specificity. Such antagonistic coevolution (also called "evolutionary arms race") is known to lead to rapid evolutionary change (Buckling and Rainey 2002Go). It appears that the ongoing arms race between microbes and their animal hosts is constantly shaping the makeup of glycans of both sides, and such glycan changes must be considered against the background of "normal" glycan variation.

Owing to the observed glycome differences in distant lineages, it is tempting to speculate that glycome differences often represent insurmountable barriers for pathogens of one distant lineage for infecting hosts of another lineage. For example, plants and animals share few terminal glycans and, with one possible exception (Gibbs and Weiller 1999Go), there seem to be no plant pathogens that also infect animals or vice versa.


    Glycans as innate markers of nonself
 Top
 Abstract
 Introduction
 Distribution of glycans within...
 Why do glycans evolve?
 Disclaimer about limitations of...
 Glycans in metazoan animals
 Variation in animals glycan...
 Genes coding for glycan...
 Animal lectin intractions and...
 The evolutionary glycan arms...
 Glycans as innate markers...
 Nonself glycan and adaptive...
 Host symbiont coevolution
 Adaptation by glycan loss
 Adaptation by glycan gain
 Is there neutral variation...
 Glycans as markers of...
 Appreciating complex microbial...
 Fundamental asymmetries in...
 Future directions and...
 Conflict of interest statement
 Acknowledgments
 References
 
Many microbes seem to be affected by lineage-dependent constraints such as the glycan composition of their cell walls. The highly conserved capsule glycans in pathogenic microbes can be exploited by multicellular hosts as "pathogen associated molecular patterns" (PAMPs) or "microbial motifs" and used as target molecules for (Kawabata and Tsuda 2002Go) pathogen recognition receptors of the innate immune system (Weis et al. 1998Go). Typical structures associated with bacterial lineages and used as PAMPs by multicellular host lectins are lipopolysaccharides of the outer membrane of gram-negative bacteria, lipoteichoic acid of gram-positive bacteria, high mannose glycan and betaglucans of fungi. Multicellular hosts have been able to exploit such PAMPs to the extent where their recognition is encoded in the germ line of the host in the form of innate immune receptors such as toll-like receptors, DC-SIGN, or mannose-binding lectins of dendritic cells (Cherayil et al. 1990Go; Akira et al. 2001Go; Appelmelk et al. 2003Go). These glycan recognition molecules are essential for survival and some are even shared by metazoa and plants (Toll-like receptors). Innate immune systems of invertebrates seem to compensate for the absence of an adaptive immune system by having special lectins with divergent ligand specificities for recognizing different polysaccharides of pathogen membranes as well (Zhu et al. 2006Go). Given the multitude of lectin-based innate immune recognition mechanisms, it appears that glycans have formed a substantial part of the basis for lineage–specific recognition of prevailing pathogens via innate immune systems (Janeway and Medzhitov 2002Go).

Further, by expanding terminal glycan structures which are absent from pathogen lineages, metazoan hosts can recruit these same structures as innate determinants of self. Mammals have evolved 19 sialic acid glycosyltranseferases and utilize the absence of this terminal glycan for the detection of nonself. Lack of sialic acid on any cell surface perturbs factor H binding and allows complement molecules to be deposited on the surface leading to an immune attack (Pangburn et al. 2000Go). Simultaneously, a family of endogenous mammalian lectins called Siglecs mediate immune cell functions based on the presence of sialic acid (Crocker and Varki 2001Go). Thus, host innate immune systems directly target microbe glycans and readily detect the absence of self-glycans as well (Janeway and Medzhitov 2002Go).


    Nonself glycan and adaptive immunity
 Top
 Abstract
 Introduction
 Distribution of glycans within...
 Why do glycans evolve?
 Disclaimer about limitations of...
 Glycans in metazoan animals
 Variation in animals glycan...
 Genes coding for glycan...
 Animal lectin intractions and...
 The evolutionary glycan arms...
 Glycans as innate markers...
 Nonself glycan and adaptive...
 Host symbiont coevolution
 Adaptation by glycan loss
 Adaptation by glycan gain
 Is there neutral variation...
 Glycans as markers of...
 Appreciating complex microbial...
 Fundamental asymmetries in...
 Future directions and...
 Conflict of interest statement
 Acknowledgments
 References
 
Jawed vertebrates have the capacity to generate virtually unlimited variation of receptors with their adaptive immune systems. This important evolutionary innovation provides these animals with a flexible system, capable of learning (affinity maturation) and experienced-based memory. This innovation is also double edged, as antibodies targeting foreign peptides may cross-react with host epitopes including glycans (Hedrick 2004Go), such as infection with Campylobacter, which can result in the autoimmune disease Guillain–Barré syndrome (Ang et al. 2004Go). Ironically, the adaptive immune system itself became possible by recruitment of recombination-associated genes (RAG), which are themselves of viral origin (Du Pasquier 2004Go). Some glycans have the capacity to elicit immune responses when introduced into animals, which do not possess the same structure as part of their glycoprofile/glycan portfolio (Schauer 1988Go). More importantly, perhaps, glycans often provide crucial parts of antigenic epitopes found on glycolipids or glycoproteins. Recent studies have shown, for example, that lineage-specific glycans on plant glycoproteins are major antigens and are responsible for human allergies to plants (Bardor et al. 2003Go). Humans also produce anti-Neu5Gc antibodies against this otherwise very common mammalian sialic acid (Nguyen et al. 2005Go). Also, vaccines such as the Haemophilus influnezae b (Hib) vaccine take advantage of the fact that when conjugated to bacterial proteins (such as toxins) glycan antigens generate a strong T-cell dependent immune response (Kelly et al. 2004Go). Finally, some of the most potent adjuvants in mammals are glycans from very distantly related taxa such as the mollusk keyhole limpet or horseshoe crab (Jennemann et al. 1994Go). Thus, it seems likely that vertebrates can generate specific antibodies to pathogen glycoproteins, however, these must be limited to those that fail to recognize self-glycans.


    Host symbiont coevolution
 Top
 Abstract
 Introduction
 Distribution of glycans within...
 Why do glycans evolve?
 Disclaimer about limitations of...
 Glycans in metazoan animals
 Variation in animals glycan...
 Genes coding for glycan...
 Animal lectin intractions and...
 The evolutionary glycan arms...
 Glycans as innate markers...
 Nonself glycan and adaptive...
 Host symbiont coevolution
 Adaptation by glycan loss
 Adaptation by glycan gain
 Is there neutral variation...
 Glycans as markers of...
 Appreciating complex microbial...
 Fundamental asymmetries in...
 Future directions and...
 Conflict of interest statement
 Acknowledgments
 References
 
Metazoans must tolerate huge numbers of microbial (nonself) symbionts. Thus, host immune systems must accommodate "a vast consortium of symbiotic bacteria" and all their surface glycans, while distinguishing them from pathogens (Cash et al. 2006Go) (Ironically, one of the reasons such microbes are essential is that vertebrates can extract valuable nutrients from the abundant, but biochemically inaccessible, plant structural polysaccharides only with microbial enzymatic help.). Host glycans appear to play crucial roles for providing symbiotic microbes with attractive niches ("welcome mats") while discriminating against pathogens. For example, mammalian hosts have microbe-binding lectins lacking complement recruitment domains for gut symbionts (Cash et al. 2006Go). Gut micro flora can specifically modulate the gut glycosylation pattern (Freitas et al. 2002Go), and in mammals some of these effects are important for the establishment of proper host glycosylation after weaning (Bry et al. 1996Go). In addition, hosts need to have mechanisms for monitoring symbiotic microbe communities that are capable of turning into "pathologic communities", given the wrong circumstances (Ley et al. 2006Go). Successful sequestration of important microbes is a prerequisite for successful symbiosis and avoidance of invasion/infection. As such, symbiont management by hosts could be considered a stepping-stone for control of pathogenic microbes e.g., via secretion of antimicrobial peptides. In setting the tolerance appropriately from zero against pathogens and to substantial for well-sequestered symbiotic microbes, one role of glycans has been termed "legislators of host–microbial interactions" and may have played a role in the distribution of glycans among divergent lineages (Hooper and Gordon 2001Go).


    Adaptation by glycan loss
 Top
 Abstract
 Introduction
 Distribution of glycans within...
 Why do glycans evolve?
 Disclaimer about limitations of...
 Glycans in metazoan animals
 Variation in animals glycan...
 Genes coding for glycan...
 Animal lectin intractions and...
 The evolutionary glycan arms...
 Glycans as innate markers...
 Nonself glycan and adaptive...
 Host symbiont coevolution
 Adaptation by glycan loss
 Adaptation by glycan gain
 Is there neutral variation...
 Glycans as markers of...
 Appreciating complex microbial...
 Fundamental asymmetries in...
 Future directions and...
 Conflict of interest statement
 Acknowledgments
 References
 
A drastic mechanism for hosts to alter the glycan composition of their cell surfaces is to abolish the expression of a terminal glycan structure in order to curtail pathogen interaction. The complete loss of a particular glycan usually involves inactivating mutations of one or more genes involved in assembly followed by the fixation of the inactive allele across the population. Fixation of such mutations can come about due to selection for absence of the glycan or by genetic drift due to small population size (founder events or demographic bottlenecks). The complete loss a glycan modification, which is otherwise very common in many closely related lineages (e.g., alpha-Gal in Catarrhines or Neu5Gc in humans) has at least two advantages: (i) the loss quickly prevents recognition by pathogens using structure as a receptor and (ii) it opens the possibility of adding the abolished glycan to the panel of nonself-glycans recognized by adaptive immunity. For example, in the human and other primate blood groups, the absence of a glycan type is also accompanied by the presence of antibody against the missing glycan (Clausen and Hakomori 1989Go).

Of course, there is a potential cost to such an adaptive glycan loss. If the nonfunctional allele responsible for the loss becomes fixed in the population, the lost glycan will likely be lost forever, as random mutations are much more likely to further incapacitate a gene rather than to revive its function. A further cost will result when a glycan with important endogenous functions is lost (e.g., due to very strong negative selection by a pathogen), as this will require subsequent compensatory changes in the endogenous lectins. It follows that the set of endogenous lectins of each lineage can be expected to closely mirror that lineage's glycan repertoire as far as endogenous function is concerned. The human specific changes in several siglec genes might be an example for such compensatory changes, as humans have lost the ability to make Neu5Gc and some of their sialic-acid-binding siglecs have shifted from binding both Neu5Gc and Neu5Ac to a strong preference for binding Neu5Ac (Brinkman-Van der Linden et al. 2000Go). The potential costs associated with such radical glycan remodeling are illustrated by the many different forms of congenital disorders of glycosylation involving deficiencies in N-glycan synthesis (even if each particular form is rare) (Aebi and Hennet 2001Go). It has been suggested that selection for altered levels of N-glycan synthesis could be linked to an inhibitory effect on viral replication (Freeze and Westphal 2001Go). Most animal populations are likel