Skip Navigation


Glycobiology Advance Access originally published online on November 29, 2005
Glycobiology 2006 16(4):47R-62R; doi:10.1093/glycob/cwj066
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow All Versions of this Article:
16/4/47R    most recent
cwj066v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Disclaimer
Google Scholar
Right arrow Articles by Kelleher, D. J.
Right arrow Articles by Gilmore, R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Kelleher, D. J.
Right arrow Articles by Gilmore, R.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2005. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oxfordjournals.org

REVIEW

An evolving view of the eukaryotic oligosaccharyltransferase

Daniel J. Kelleher and Reid Gilmore1

Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, MA 01605-2324


1 To whom correspondence should be addressed; e-mail: reid.gilmore{at}umassmed.edu

accepted on November 23, 2005


    Abstract
 Top
 Abstract
 Introduction
 Ancient origin of N-linked...
 Oligosaccharide donors for the...
 Defects in donor assembly...
 Acceptor substrates and peptide...
 Context factors that influence...
 Subunit composition of the...
 The STT3 subunit contains...
 Organization of subunits into...
 Donor-substrate selection by the...
 Conclusions
 References
 
Asparagine-linked glycosylation (ALG) is one of the most common protein modification reactions in eukaryotic cells, as many proteins that are translocated across or integrated into the rough endoplasmic reticulum (RER) carry N-linked oligosaccharides. Although the primary focus of this review will be the structure and function of the eukaryotic oligosaccharyltransferase (OST), key findings provided by the analysis of the archaebacterial and eubacterial OST homologues will be reviewed, particularly those that provide insight into the recognition of donor and acceptor substrates. Selection of the fully assembled donor substrate will be considered in the context of the family of human diseases known as congenital disorders of glycosylation (CDG). The yeast and vertebrate OST are surprisingly complex hetero-oligomeric proteins consisting of seven or eight subunits (Ost1p, Ost2p, Ost3p/Ost6p, Ost4p, Ost5p, Stt3p, Wbp1p, and Swp1p in yeast; ribophorin I, DAD1, N33/IAP, OST4, STT3A/STT3B, Ost48, and ribophorin II in mammals). Recent findings from several laboratories have provided overwhelming evidence that the STT3 subunit is critical for catalytic activity. Here, we will consider the evolution and assembly of the eukaryotic OST in light of recent genomic evidence concerning the subunit composition of the enzyme in diverse eukaryotes.

Key words: dolichol / endoplasmic reticulum / N-linked oligosaccharide / oligosaccharyltransferase


    Introduction
 Top
 Abstract
 Introduction
 Ancient origin of N-linked...
 Oligosaccharide donors for the...
 Defects in donor assembly...
 Acceptor substrates and peptide...
 Context factors that influence...
 Subunit composition of the...
 The STT3 subunit contains...
 Organization of subunits into...
 Donor-substrate selection by the...
 Conclusions
 References
 
In eukaryotic cells, a critical step in asparagine-linked glycosylation (ALG) of proteins is catalyzed by the oligosaccharyltransferase (OST), an integral membrane protein that mediates the en bloc transfer of a preassembled high-mannose oligosaccharide onto asparagine residues of nascent polypeptides entering the lumen of the rough endoplasmic reticulum (RER). The dolichol pyrophosphate-linked oligosaccharide (OS-PP-Dol) donor (Glc3Man9GlcNAc2-PP-Dol in higher eukaryotes) for glycosylation is sequentially assembled by glycosyltransferases located on the cytoplasmic and lumenal faces of the RER membrane. The assembly pathway for dolichol oligosaccharides has been reviewed in detail (Burda and Aebi, 1999Go), so here we will focus on recent discoveries that impact donor-substrate recognition, selection and catalysis by the OST. Although the OST has been the focus of a several reviews during the past decade (Silberstein and Gilmore, 1996Go; Knauer and Lehle, 1999Go; Dempski and Imperiali, 2002Go; Yan and Lennarz, 2005Go), recent developments in this field make this review timely.

N-Linked glycosylation of nascent glycoproteins is temporally coupled to the protein translocation reaction that occurs as, or immediately after, the polypeptide is synthesized. The coordination of N-linked glycosylation and protein translocation necessitates a spatial localization of the OST adjacent to the protein translocation channel. Even though the OST has unrestricted access to nascent polypeptides entering the ER lumen, a surprising percentage of consensus glycosylation sites are not modified in vivo. Here, we review the effect of acceptor peptide sequence as well as sequence context factors that influence N-linked glycosylation of N-X-T/S sites in vivo and in vitro.

The elucidation of the roles of the noncatalytic OST subunits remains an important issue. In this article, we review the biochemical and genetic characterization of the yeast and mammalian OST subunits, in the context of emerging information about the evolution of the OST that is being deduced from genomics data. Potential roles for several subunits can now be proposed based upon the available data and upon the differences in the enzymatic properties of the OST in diverse organisms.


    Ancient origin of N-linked glycosylation
 Top
 Abstract
 Introduction
 Ancient origin of N-linked...
 Oligosaccharide donors for the...
 Defects in donor assembly...
 Acceptor substrates and peptide...
 Context factors that influence...
 Subunit composition of the...
 The STT3 subunit contains...
 Organization of subunits into...
 Donor-substrate selection by the...
 Conclusions
 References
 
N-Linked glycosylation of proteins was initially thought to be restricted to eukaryotic organisms. The first evidence that this was not the case came from the structural analysis of alkali-insensitive glycopeptides derived from Halobacterium cell surface glycoprotein (CSG) and flagellins (for a review, see Lechner and Wieland, 1989Go). These Halobacterium salinarum proteins are modified with tetrasaccharides composed of glucose and sulfated glucuronic and iduronic acids that are linked to asparagine residues through glucose (Wieland et al., 1983Go, 1985Go). The glycosylated asparagines are in an N-X-T/S motif that matches the sequon for N-linked glycosylation in eukaryotic organisms (Wieland et al., 1985Go). Furthermore, a dolichol-phosphate-linked tetrasaccharide serves as the oligosaccharide donor for N-linked glycosylation in H. salinarum (Lechner et al., 1985Go). A second glycosaminoglycan-like oligosaccharide is attached to the H. salinarum CSG through an asparaginyl-N-acetylgalactosamine linkage at an N-A-S site (Paul et al., 1986Go) using a dolichol-pyrophosphate-linked oligosaccharide as a donor (Lechner and Wieland, 1989Go). In contrast to what we know about the acceptor-substrate specificity for the eukaryotic OST, addition of the glycosaminoglycan-like oligosaccharide to CSG still occurred when the serine in the N-A-S sequon was replaced with a valine or leucine residue (Zeitler et al., 1998Go). Thus, in H. salinarum, two structurally different N-linked oligosaccharides are attached to N-X-S/T sites by what are likely distinct enzymes based upon differences in donor substrate structure and acceptor substrate specificity. The flagellins of Methanococcus voltae are N-glycosylated at multiple N-X-T/S sites with a trisaccharide through an asparaginyl-ß-GlcNAc linkage (Voisin et al., 2005Go). The most abundant oligosaccharide donor in Haloferax volcanii is mannosyl-(ß1–4)galactosyl phosphodolichol (Kuntz et al., 1997Go). Taken together, it appears that donor substrates for N-glycosylation in archaebacterial organisms are structurally diverse, yet are assembled upon dolichol phosphate or dolichol pyrophosphate carriers.

Recent evidence indicates that several eubacteria, including Campylobacter jejuni, have acquired a 14-gene pgl (protein glycosylation) operon that is involved in N-linked glycosylation of cell surface proteins. Protein sequence analysis of the pgl operon (Szymanski et al., 1999Go) revealed that PglB is homologous to the STT3 subunit of the eukaryotic OST. Mutagenesis of the pglB locus and chemical deglycosylation of C. jejuni membrane proteins led to the conclusion that the pgl operon is involved in protein glycosylation (Szymanski et al., 1999Go). We refer the reader to a recent review of eubacterial glycosylation for more information concerning the roles of the pgl gene products in the assembly pathway of the undecaprenyl-pyrophosphate-linked oligosaccharide donor for N-linked glycosylation in eubacteria (Szymanski and Wren, 2005Go).


    Oligosaccharide donors for the eukaryotic OST
 Top
 Abstract
 Introduction
 Ancient origin of N-linked...
 Oligosaccharide donors for the...
 Defects in donor assembly...
 Acceptor substrates and peptide...
 Context factors that influence...
 Subunit composition of the...
 The STT3 subunit contains...
 Organization of subunits into...
 Donor-substrate selection by the...
 Conclusions
 References
 
In eukaryotic organisms, the oligosaccharide donor for N-linked glycosylation of proteins is assembled on the carrier lipid dolichol pyrophosphate by the sequential addition of the monosaccharides GlcNAc, Man, and Glc (Figure 1). Chemically synthesized donor substrate analogues that replace dolichol-PP with phytanyl-PP, dihydrofarnesyl-PP, or citronellyl-PP are not substrates for the OST (Fang et al., 1995Go), but polyprenol-PP-linked oligosaccharides are utilized as donors in the Chinese hamster ovary cell line B211 (Kaiden et al., 1998Go).


Figure 1
View larger version (16K):
[in this window]
[in a new window]
 
Fig. 1. The structure and assembly pathway for eukaryotic dolichol-linked oligosaccharides. GlcNAc residues are designated by circles, mannose residues by squares, and glucose residues by triangles. Numbers within saccharide units designate the order of saccharide addition. Red residues are transferred to the dolichol-pyrophosphate carrier on the cytoplasmic face of the RER, blue residues are added on the lumenal side. The yeast gene(s) that encodes each glycosyltransferase is indicated if known. Currently described subtypes of CDG-1 are identified by the step in OS-PP-Dol assembly that is affected. The role of Alg11p in transfer of residue 7 is uncertain at this time.

 

The assembly pathway for Glc3Man9GlcNAc2-PP-Dol has been deduced from the analysis of intermediates that accumulate in vitro in membranes from wild-type cells (Chapman et al., 1979Go; Liu et al., 1979Go) and by the analysis of the yeast alg mutants that lack glycosyltransferase activities (Huffaker and Robbins, 1983Go; Burda and Aebi, 1999Go). Most of the glycosyltransferases (ALG proteins) that mediate Glc3Man9GlcNAc2-PP-Dol assembly have now been identified (Figure 1). OS-PP-Dol assembly initiates on the cytoplasmic face of the RER, where sugar nucleotides (UDP-GlcNAc and GDP-Man) serve as the saccharide donors for the addition of the first seven residues (Figure 1, red symbols) to yield Man5GlcNAc2-PP-Dol. The initial step in OS-PP-Dol assembly is catalyzed by N-acetylglucosaminyl phosphate transferase (ALG7) that forms the phosphodiester linkage between GlcNAc-1-phosphate (derived from UDP-GlcNAc) and dolichol-monophosphate.

The most recently identified glycosyltransferase in the OS-PP-Dol assembly pathway is the ALG13–14 heterodimer that mediates the addition of the second GlcNAc residue to form Dol-PP-GlcNAc2 (Bickel et al., 2005Go; Chantret et al., 2005Go; Gao et al., 2005Go). Dol-PP-GlcNAc2 is the in vivo oligosaccharide donor for N-linked glycosylation in Giardia lamblia (Samuelson et al., 2005Go). The yeast or vertebrate OST will utilize Dol-PP-GlcNAc2 as a donor substrate in an in vitro assay (Sharma et al., 1981Go; Imperiali and Shannon, 1991Go). The importance of the two N-acetyl moieties in Dol-PP-GlcNAc2 has been investigated using several synthetic dolichol-linked oligosaccharides (Tai and Imperiali, 2001Go). Dol-PP-GlcNAc-Glc, like the minimal in vivo donor Dol-PP-GlcNAc2, is a functional donor for the OST, indicating that the 2'-N-acetyl modification of the second sugar is not critical for OST recognition and catalysis. Although glycopeptide products bearing a single GlcNAc were detected by mass spectroscopy when Dol-PP-GlcNAc was tested as the donor, Dol-PP-GlcNAc is clearly a very poor donor substrate for the OST (Bause et al., 1995Go; Tai and Imperiali, 2001Go). Dol-PP-2-deoxyfluoroglucose-GlcNAc and Dol-PP-N-trifluoroacetylglucosamine-GlcNAc are not substrates but are instead OST inhibitors. These results demonstrate that the N-acetyl modification on the proximal sugar is critical for OST catalysis, whereas the presence and structure of a second sugar residue (GlcNAc > Glc > no sugar) increases enzyme activity (Tai and Imperiali, 2001Go).

Until recently, it was not clear whether Alg2 transfers residue 4 (an {alpha}-1,3 linked mannose) or residue 5 (an {alpha}-1,6 linked mannose). Analysis of human ALG2 mutant fibroblasts (congenital disorders of glycosylation-Ii [CDG-Ii]) demonstrated that Alg2 is an {alpha}-1,3 mannosyltransferase (Thiel et al., 2003Go), which provides the substrate for addition of residue 5 by a currently unidentified mannosyltransferase (Figure 1). Man3GlcNAc2-PP-Dol accumulates as a major lipid-linked oligosaccharide in yeast alg11{Delta} cells providing evidence that Alg11p is an {alpha}-1,2 mannosyltransferase responsible for the addition of residue 6 (Helenius et al., 2002Go). Although structural analysis of the protein-linked oligosaccharides synthesized by alg11{Delta} cells had suggested that Alg11p adds residue 7 (Cipollo et al., 2001Go), the latter analysis is complicated by elongation of the alg11{Delta} assembly intermediate by lumenal mannosyltransferases. Given that both residues 6 and 7 are {alpha}-1,2 linked mannose, we speculate that a single enzyme (Alg11p) acts sequentially to mediate both additions. Support for this hypothesis is provided by the absence of a candidate GDP-Man-dependent mannosyltransferase of unknown function encoded by the yeast genome. Man5GlcNAc2-PP-Dol is then flipped from the cytoplasmic face of the ER into the lumen in a reaction that is dependent upon Rft1 in vivo (Helenius et al., 2002Go).

The dolichol monosaccharides Dol-P-Man and Dol-P-Glc serve as the sugar donors for the three lumenal mannosyltransferases (ALG3, ALG12, and ALG9) and three glucosyltransferases (ALG6, ALG8, and ALG10) that sequentially transfer residues 8–14 onto the OS-PP-Dol (Figure 1). The ordered addition of the sugar residues is maintained by the remarkable substrate specificity of the glycosyltransferases. For example, the predominant OS-PP-Dol compound synthesized by the alg3{Delta} yeast mutant is Man5GlcNAc2-PP-Dol not Glc3Man5GlcNAc2-PP-Dol or Glc3Man7GlcNAc2-PP-Dol (Verostek et al., 1993Go; Aebi et al., 1996Go; Kelleher et al., 2001Go) indicating that Man5GlcNAc2-PP-Dol is a poor substrate for the Alg6 and Alg12 enzymes. Additional insight into this stepwise assembly pathway has been provided by the discovery that Alg9 transfers two {alpha}-1,2-linked mannose residues (Figure 1, residues 9 and 11). The first addition yields the Man7GlcNAc2-PP-Dol intermediate (Burda et al., 1996Go; Cipollo and Trimble, 2000Go) that is the substrate for Alg12. Alg9 then adds the ninth and final mannose residue to oligosaccharide donor (Frank and Aebi, 2005Go).


    Defects in donor assembly are responsible for CDG-I
 Top
 Abstract
 Introduction
 Ancient origin of N-linked...
 Oligosaccharide donors for the...
 Defects in donor assembly...
 Acceptor substrates and peptide...
 Context factors that influence...
 Subunit composition of the...
 The STT3 subunit contains...
 Organization of subunits into...
 Donor-substrate selection by the...
 Conclusions
 References
 
OS-PP-Dols lacking the terminal glucose residue are less effective donors for the vertebrate and yeast OST both in vitro and in vivo (Turco et al., 1977Go; Trimble et al., 1980Go; Burda and Aebi, 1998Go; Karaoglu et al., 2001Go). Enzymatic defects that interfere with the biosynthesis of Glc3Man9GlcNAc2-PP-Dol are responsible for a family of diseases referred to as CDG-I (for a recent review of CDG, see Freeze and Aebi, 2005Go).

Reductions in phosphomannose isomerase activity (PMI; CDG-Ib), phosphomannomutase activity (PMM, CDG-Ia), and Dol-P-Man synthase activity (CDG-Ie) reduce the biosynthesis of the mannose donors (GDP-mannose and Dol-P-Man), thereby causing an accumulation of diverse assembly intermediates (as reviewed in Aebi and Hennet, 2001Go). The role of the vertebrate LEC35 gene product in OS-PP-Dol assembly was unclear as Lec35 mutants accumulate Man5GlcNAc2-PP-Dol (Lehrman and Zeng, 1989Go) but are not defective in Alg3 activity and have normal levels of Dol-P-Man. Lec35 cells were recently shown to be defective in all aspects of Dol-P-Man and Dol-P-Glc utilization (Anand et al., 2001Go), hence the LEC35 gene product is now referred to as mannose-P-dolichol utilization (MPDU). Deficiencies in human MPDU are responsible for CDG-If (Kranz et al., 2001Go; Schenk et al., 2001Go).

Several recently identified CDG-I subtypes (Figure 1, subtypes Ig, Ih, and Il) cause the accumulation of an OS-PP-Dol assembly intermediate that is a poor substrate for the OST (Chantret et al., 2002Go; Frank et al., 2004Go; Schollen et al., 2004Go). CDG-Ij, Ik, and Ii are caused by mutations that reduce the activity of early acting glycosyltransferases (ALG7, ALG1, and ALG2), thereby reducing the steady-state pool of Glc3Man9GlcNAc2-PP-Dol that can serve as the oligosaccharide donor (Thiel et al., 2003Go; Wu et al., 2003Go; Grubenmann et al., 2004Go).


    Acceptor substrates and peptide inhibitors of the OST
 Top
 Abstract
 Introduction
 Ancient origin of N-linked...
 Oligosaccharide donors for the...
 Defects in donor assembly...
 Acceptor substrates and peptide...
 Context factors that influence...
 Subunit composition of the...
 The STT3 subunit contains...
 Organization of subunits into...
 Donor-substrate selection by the...
 Conclusions
 References
 
Sequencing of N-glycopeptides derived from eukaryotic glycoproteins established that N-glycosylated asparagine residues are within an N-X-T/S consensus sequon, where X can be any residue except proline (Marshall, 1972Go). Provided that the N- and C-termini are blocked by reagents that mimic peptide bonds, N-X-T tripeptides are glycosylated by the OST in vitro (Welply et al., 1983Go). N-Glycosylated N-X-C sites are present in human von Willebrand factor and serum protein C (Titani et al., 1986Go; Miletich and Broze, 1990Go). In vitro assays using synthetic peptide substrates with internal N-X-T/S/C sites showed that the kinetic parameters of the OST (peptide Km and Vmax) are strongly influenced by the side chain in the +2 position relative to asparagine, with N-X-T serving as a better substrate than N-X-S, which is in turn much better than N-X-C (Bause, 1984Go; Breuer et al., 2001Go). Peptides containing asparagine analogues have been tested as potential OST substrates and inhibitors. As expected from the sequence of natural glycopeptides, the sequon analogues Q-X-T and D-X-T are neither substrates nor high-affinity inhibitors (Welply et al., 1983Go; Bause et al., 1995Go). Of more than a dozen tested peptides wherein asparagine is replaced by an amino acid analogue (Welply et al., 1983Go; Bause et al., 1995Go; Xu T., Werner M., et al., 1998), ß-hydroxyasparagine was found to yield a low affinity substrate, whereas diaminobutanoic acid (Amb) yielded a promising lead compound for OST inhibitors (Bause et al., 1995Go), as discussed below.

The yeast and vertebrate OST have a surprisingly low affinity (Kp ~ 10–30 µM) for a synthetic tripeptide substrate (e.g., N-Ac-N-[125I]Y-T-NH2; Karaoglu et al., 2001Go; Kelleher et al., 2003Go). Hydrophobic N-terminal blocking groups (N-benzoyl and N-octanoyl) enhance peptide-binding affinity relative to an N-acetyl blocking group, whereas bulky N-terminal blocking groups (N-t-butoxycarbonyl or N-4-benzoyl-benzoyl) reduce peptide-binding affinity (Welply et al., 1983Go; Xu T., Khanna H., et al., 1998). Taken together with the effect of X-residue substitutions within an N-X-S sequon (Shakin-Eshleman et al., 1996Go), these observations indicate that residues other than asparagine and the hydroxy amino acid (S or T) can facilitate acceptor-substrate binding to the OST active site.

Two strategies have been used to obtain high-affinity substrates and more potent OST inhibitors. Conformationally constrained peptides that adopt ß-turn or asparagine-turn conformations have been tested as OST substrates, and it was found that peptides that adopt the latter, but not former, conformation are OST substrates (Imperiali et al., 1992Go). Cyclization of synthetic OST substrates has been achieved by covalently linking an N-terminal blocking group on asparagine to the cysteine side chain in a N-C-T sequon to obtain the cyclo(hex-Asn-Cys)-Thr series of substrates or to an Amb-C-T tripeptide to obtain the cyclo(hex-Amb-Cys)-Thr series of OST inhibitors (Hendrickson et al., 1996Go; Kellenberger et al., 1997Go). Peptide cyclization appears to cause a relatively modest increase in the binding affinity relative to noncyclic-control peptides that have a hydrophobic, N-terminal blocking group (Kellenberger et al., 1997Go; Peluso et al., 2002Go). In contrast, a dramatic increase in the binding affinity (Ki < 50 nM) of OST inhibitors has been achieved by extending the C-terminus of the peptides by several residues to obtain cyclo(hex-Amb-Cys)-Thr-Val-Thr-Nph-NH2, where Nph corresponds to para-nitrophenylalanine (Hendrickson et al., 1996Go). Peptide extensions that included one or more charged amino acids (e.g., K-K-Nph-NH2, E-K-Nph-NH2, or V-E-Nph-NH2) yielded markedly less-effective inhibitors or substrates (Kellenberger et al., 1997Go). The design of the Val-Thr-Nph-NH2 extension was based upon a careful reexamination (Imperiali and colleagues) of the statistical frequency of amino acids at the +3 and +4 residues adjacent to utilized and nonutilized N-X-T sites (Gavel and Von Heijne, 1990).


    Context factors that influence N-linked glycosylation of sequons
 Top
 Abstract
 Introduction
 Ancient origin of N-linked...
 Oligosaccharide donors for the...
 Defects in donor assembly...
 Acceptor substrates and peptide...
 Context factors that influence...
 Subunit composition of the...
 The STT3 subunit contains...
 Organization of subunits into...
 Donor-substrate selection by the...
 Conclusions
 References
 
It has been estimated that ~35% of the potential glycosylation sites in proteins that enter the ER lumen are not modified (Petrescu et al., 2004Go), and others are modified with low efficiency. Sequence analysis of a large collection of utilized and nonutilized N-X-T/S sites in proteins that enter the secretory pathway showed that a proline residue in the +3 site relative to asparagine reduces the statistical frequency of N-glycosylation (Gavel and Von Heijne, 1990). Hexapeptides with internal N-P-T sites or N-X-T-P sites are neither substrates nor low-affinity inhibitors (Bause, 1983Go). The statistical analysis of utilized N-glycosylation sites also supports the preference for N-X-T sites relative to N-X-S sites (Gavel and Von Heijne, 1990). The influence of amino acids in the X position of N-X-T and N-X-S sites has been tested in the context of the rabies virus glycoprotein. Although the identity of the X-residue had little effect upon the glycosylation efficiency of N-X-T sites, several residues (Glu, Asp, Trp, and Leu) significantly reduced the glycosylation efficiency of N-X-S sites (Shakin-Eshleman et al., 1996Go). Analysis of several proteins with overlapping sequons (N-N-T/S-T/S) has shown that a single glycan is attached to such a site (Lockridge et al., 1987Go; Reddy et al., 1988Go, 1999Go). Recent results support the conclusion that an N-linked glycan at an asparagine residue reduces access of a closely spaced sequon to the OST active site (Karamyshev et al., 2005).

Potential glycosylation sites that are located <12–14 residues from a transmembrane (TM) span are not utilized by the vertebrate OST, indicating that the OST active site is ~30–40 Å above the lumenal membrane surface of the RER (Nilsson and von Heijne, 1993). Not only does this distance restraint on glycosylation of membrane proximal N-X-T/S sites explain a lack of glycosylation of some integral membrane proteins, it is relevant to the glycosylation scanning mutagenesis procedure that has been used to elucidate the topology of membrane proteins (Hresko et al., 1994Go). Sequons inserted into short lumenal loops are not glycosylated (Popov et al., 1997Go). Glycosylation sites near the C-terminus of proteins are also less likely to be utilized (Gavel and Von Heijne, 1990), presumably because chain termination allows the protein to move more rapidly past the OST active site. Glycosylation of N-X-T/S sites can also be reduced by a disulfide bond involving a nearby cysteine residue (Allen et al., 1995Go) or delayed or inhibited if the N-X-T/S site is near the signal peptidase cleavage site (Chen et al., 2001Go). Mutations that prevent folding of a glycoprotein may be accompanied by N-glycosylation at sequons that are not normally modified (McGinnes and Morrison, 1994Go), indicating that rapid protein folding may be responsible for inefficient modification of some N-X-T/S sites. A detailed analysis of the extent of glycosylation at each of the 14 sites in yeast invertase suggests that additional sequence context factors may influence the efficiency of modification of individual glycosylation sites (Reddy et al., 1988Go).


    Subunit composition of the OST in eukaryotic organisms
 Top
 Abstract
 Introduction
 Ancient origin of N-linked...
 Oligosaccharide donors for the...
 Defects in donor assembly...
 Acceptor substrates and peptide...
 Context factors that influence...
 Subunit composition of the...
 The STT3 subunit contains...
 Organization of subunits into...
 Donor-substrate selection by the...
 Conclusions
 References
 
The OST has been most extensively investigated in the yeast (Saccharomyces cerevisiae) and vertebrate (canine pancreas) experimental systems. A combination of protein biochemistry and yeast molecular genetics experiments has led to the identification of eight polypeptides (Figure 2A) that are assembled into a heterooctameric yeast OST complex composed of one copy each of Ost1p, Ost2p, Ost3p or Ost6p, Ost4p, Ost5p, Wbp1p, Swp1p, and Stt3p (Karaoglu et al., 1997Go; Spirig et al., 1997Go). Although several papers describe yeast OST complexes that apparently lack 2–4 of the above subunits (Kelleher and Gilmore, 1994Go; Knauer and Lehle, 1994Go; Pathak and Imperiali, 1997Go), this discrepancy is explained by the difficulty in detecting the diffusely migrating Stt3 protein as well as the three low molecular weight subunits (Ost2p, Ost4p, and Ost5p) on sodium dodecyl sulfate (SDS)-polyacrylamide gels (Karaoglu et al., 1997Go). Protein sequence database searches and protein purification results indicate that the yeast and vertebrate OST complexes have very similar subunit compositions (Kelleher and Gilmore, 1997Go; Kelleher et al., 2003Go). Canine OST complexes composed of ribophorin I (Ost1p), ribophorin II (Swp1p), OST48 (Wbp1p), DAD1 (Ost2p), and an STT3 homologue have been purified from canine pancreas (Kelleher et al., 2003Go). Below, the subunits of the OST are discussed in their approximate order of discovery.


Figure 2
View larger version (25K):
[in this window]
[in a new window]
 
Fig. 2. The OST complex in diverse eukaryotes. The lumenal side of the ER membrane is at the top of each panel. The experimentally verified or predicted topologies for each subunit are shown. A black curved arrow designates a cleaved N-terminal signal sequence; colored arrowheads designate the N-termini of proteins that lack cleaved signal sequences. The OST subunits are color coded to designate the three putative subcomplexes (blue for Ost1-Ost5, red for Swp1-Wbp1-Ost2, green for Stt3-Ost3-Ost4) and are drawn to scale. Yellow lines in panels AD designate N-terminal (ribophorin II) or C-terminal (ribophorin I) extensions that are present/predicted as indicated by the labeled arrows. (A) The subunit composition of the fungal (Saccharomyces cerevisiae) and vertebrate (Canis familiaris) OST complexes are based upon biochemical and genetic evidence summarized in the text. Predicted subunit compositions for organisms other than Saccharomyces cerevisiae and Canis familiaris (AE) are based upon protein and DNA sequence database searches using Saccharomyces cerevisiae and human OST sequences as the query sequence. (A) Predicted OST complexes for other fungi (Schizosaccharomyces pombe, Cryptococcus neoformans, and Aspergillus nidulans) and vertebrates (Homo sapiens and Mus musculus) have seven or eight subunits (see text). (B) Predicted OST complexes for insects (Drosophila melanogaster and Anopheles gambiae), plants (Arabidopsis thaliana and Oryza sativa), nematodes (Caenorhabdilis elegans), Dictyostelium discoideum, and Tetrahymena thermophila have seven subunits. OST complexes from protists are predicted to have six subunits (C, Cryptosporidium parvum), four subunits (D, Entamoeba histolytica, Trichomonas vaginalis, and Plasmodium falciparum) or a single subunit (E, Giardia lamblia, Trypanosoma cruzi, and Trypanosoma brucei).

 

Wbp1 (OST48)
The Wbp1 protein (OST48 in vertebrates) was the first identified subunit of the yeast OST (te Heesen et al., 1992) and one of the first three identified subunits (ribophorin I, ribophorin II, and OST48) of the purified canine OST (Kelleher et al., 1992Go; Silberstein et al., 1992Go). Wbp1 (OST48) proteins have an N-terminal signal sequence that is cleaved in vivo (te Heesen et al., 1991; Silberstein et al., 1992Go), N-linked glycans (yeast Wbp1 but not canine OST48), a hydrophobic segment near the C-terminus that serves as a TM span, and a short cytoplasmic tail that has a C-terminal dilysine ER retention/retrieval motif (Gaynor et al., 1994Go; Fu and Kreibich, 2000Go). These sequence features demonstrated that Wbp1 is a type I (Nlum-Ccyt) integral membrane protein with a large lumenal domain (Figure 2A). Wbp1 proteins are encoded by genomes from diverse eukaryotes including vertebrates, fungi, nematodes, arthropods, plants, Dictyostelium discoideum, and many, but not all protists (Figure 2A–D), and show little variation in polypeptide length between organisms. Protein sequence comparisons suggest that Wbp1 proteins are members of the GldG/intraflagellar transport (GIFT) domain protein family that includes diverse prokaryotic and eukaryotic proteins (Beatson and Ponting, 2004Go). As the X-ray structure of a GIFT domain has not been solved, the biological function and structure of this domain in the Wbp1 proteins is not known.

Point mutations in Wbp1 reduce in vivo and in vitro OST activity (te Heesen et al., 1992) and destabilize the OST complex (Karaoglu et al., 1997Go). Deletion of the Wbp1 TM span or replacement of this segment with other hydrophobic sequences is not tolerated because of lack of incorporation of these mutant proteins into the OST complex (Li et al., 2003Go). Two papers that predated the identification of STT3 as the active-site subunit suggested that Wbp1/OST48 provides the oligosaccharide donor and acceptor binding sites that are critical for enzyme activity. Covalent inactivation of the yeast OST with a sulfhydryl reagent can be blocked by the presence of Dol-PP-GlcNAc2, indicating that a cysteine residue, which was mapped to Wbp1, is located in the dolichol-oligosaccharide binding site of the OST (Pathak et al., 1995Go). The significance of this result cannot be discounted given that enzyme kinetic experiments now suggest that the OST has two independent OS-PP-Dol binding sites (Karaoglu et al., 2001Go). OST48 and a 68-kDa protein (thought to be ribophorin I at the time, but probably STT3A instead) were labeled with a suicide substrate analogue, wherein the hydroxy amino acid (T or S) was replaced by epoxyethylglycine (Bause et al., 1997Go).

Ost1 (ribophorin I)
The Ost1/ribophorin I subunit is present in the purified yeast and vertebrate OST complexes. The yeast OST1 gene was isolated based upon peptide sequence data (Silberstein S., Collins P.G., Kelleher D.J., Rapiejko P.J., et al., 1995) and identified in a genetic screen for mutations that reduce OST activity (Reiss et al., 1997Go). The Ost1 proteins have an N-terminal signal sequence that is cleaved in vivo, N-X-T/S sites that carry N-linked glycans in yeast and vertebrates, a single TM span and a C-terminal cytosolic domain, hence Ost1 proteins have a type I (Nlum-Ccyt) topology (Crimaudo et al., 1987Go; Silberstein S., Collins P.G., Kelleher D.J., Rapiejko P.J., et al., 1995). Protein sequence database searches reveal that Ost1/ribophorin I orthologues are encoded by all fully sequenced eukaryotic genomes except Encephalitozoon cuniculi, G. lamblia, and the kinetoplastids (Figure 2A–D). The cytoplasmic domains of Ost1/ribophorin I proteins are either <10 residues in length (fungi, insects, and many protists) or 140–160 residues in length (vertebrates, plants, nematodes, and Plasmodium). Consensus protein secondary structure prediction algorithms (http://npsa-pbil.ibcp.fr/.) indicate that the longer cytoplasmic tails have a high probability of being {alpha}-helical.

Point mutations in Ost1 reduce N-linked glycosylation in vivo and reduce in vitro OST activity (Silberstein S., Collins P.G., Kelleher D.J., Rapiejko P.J., et al., 1995). Labeling of Ost1 with photoreactive acceptor substrates was detected using an immunoprecipitation strategy (Yan et al., 1999Go). However, subsequent studies did not support the initial conclusion that Ost1 was the active site subunit.

Ribophorins I and II are abundant proteins of the mammalian RER (Kreibich et al., 1978Go) that are present in roughly equal amounts relative to membrane-bound ribosomes (Marcantonio et al., 1984Go). Antibodies raised against the cytoplasmic domain of ribophorin I interfere with protein translocation across the RER by preventing ribosome targeting to the Sec61 complex (Yu et al., 1990Go), thereby indicating that the OST is adjacent to the protein translocation channel. The location of OST adjacent to the translocon allows the cotranslational modification of the nascent polypeptide as it enters the lumen of the RER (Chen et al., 1995Go).

Swp1 (ribophorin II)
The yeast SWP1 gene was isolated as a high-copy suppressor of a wbp1 mutant (te Heesen et al., 1993), and Swp1 (ribophorin II) is present in the purified yeast and canine OST complexes. The Swp1 proteins have an N-terminal signal sequence that is cleaved in vivo, an N-glycosylation sequon that is modified in ribophorin II, and three hydrophobic segments located near the C-terminus (Rosenfeld et al., 1984Go; Crimaudo et al., 1987Go; te Heesen et al., 1993; Kelleher and Gilmore, 1994Go). Protease digestion experiments have shown that C-terminus of ribophorin II is insensitive to protease digestion in intact microsomes (Crimaudo et al., 1987Go) consistent with a three TM (Nlum-Ccyt) topology depicted in Figure 2A. Swp1/ribophorin II proteins are only encoded by a subset of eukaryotic genomes (Figure 2A and B) suggesting that this subunit was a relatively late addition to the OST complex. Nematode and fungi genomes encode truncated forms of Swp1/ribophorin II that lack an N-terminal domain that is predicted to be {alpha}-helical. RNAi of Caenorhabditis elegans ribophorin II causes an embryonic lethal phenotype (Fraser et al., 2000Go) consistent with the previous demonstration that Swp1 is essential in yeast (te Heesen et al., 1993).

Ost2 (DAD1)
Ost2 was initially detected as a 16-kD subunit in the purified yeast OST complex (Kelleher and Gilmore, 1994Go) and was subsequently shown to be present in the mammalian OST complex (Kelleher and Gilmore, 1997Go) using antibodies specific for the vertebrate homologue DAD1 (Nakashima et al., 1993Go). Ost2 (DAD1) subunits are low molecular weight (112–130 residues) proteins that consist of a polar N-terminal segment of variable length, followed by three conserved hydrophobic segments. Protease accessibility experiments have shown that the N-terminus and the hydrophilic segment between TM2 and TM3 are exposed on the cytoplasmic face of the membrane (Silberstein S., Collins P.G., Kelleher D.J., and Gilmore R., 1995; Kelleher and Gilmore, 1997Go), consistent with the 3-TM (Ncyt-Clum) topology model shown in Figure 2A. An alternative topology model for Ost2p that depicts a 3-TM span protein with an inverted topology (Yan et al., 2003Go) is incompatible with the protease accessibility data cited above and is not in agreement with an Ost2p-reporter gene fusion experiment that suggests that the C-terminus is located within the RER lumen (Yan et al., 2005Go). An alternative 2-TM (Ncyt-Ccyt) span model for DAD1 (Makishima et al., 1997Go) is based upon the assumption that the third hydrophobic segment is too polar to be integrated into the membrane. We would argue that TM3 of Ost2/DAD1 contacts the TM spans of other OST subunits, rather than phospholipid when Ost2/DAD is assembled into the OST complex. Ost2/DAD1 proteins are encoded by all fully sequenced eukaryotic genomes except E. cuniculi, G. lamblia, and the kinetoplastids (Figure 2A–D).

Point mutations in the TM spans of Ost2 cause in vivo and in vitro reductions in OST activity at the restrictive temperature (Silberstein S., Collins P.G., Kelleher D.J., and Gilmore R., 1995). A temperature sensitive mutation in the hamster DAD1 (defender against death) protein causes an apoptotic phenotype in the tsBN7 cell line because of instability of DAD1 (Nakashima et al., 1993Go). Degradation of DAD1 at the restrictive temperature results in a severe defect in OST activity (Makishima et al., 1997Go; Sanjay et al., 1998Go). The apoptotic phenotype of tsBN7 cells is most likely because of the irreversible induction of the unfolded protein response (UPR) pathway caused by the accumulation of unfolded glycoproteins, as tunicamycin can also induce apoptosis in plant and vertebrate cell lines (Pérez-Salva and Mollinedo, 1995Go; Walker et al., 1998Go; Iwata and Koizumi, 2005Go).

STT3
Mutations in the yeast STT3 locus were isolated in a screen for yeast that showed enhanced sensitivity to staurosporine (Yoshida et al., 1992Go), an inhibitor of protein kinase C. Additional stt3 alleles were obtained in genetic screens for mutations that confer a synthetic lethality phenotype when combined with the wbp1-2 allele (Zufferey et al., 1995Go) or the alg5{Delta} allele (Reiss et al., 1997Go). Because the yeast STT3 gene encodes a 78-kDa protein that was not detected in the purified yeast OST complex (Kelleher and Gilmore, 1994Go; Knauer and Lehle, 1994Go; Pathak and Imperiali, 1997Go), it was initially unclear whether Stt3 was an OST subunit (Yoshida et al., 1995Go; Zufferey et al., 1995Go). Subsequent immunopurification experiments using yeast strains that express affinity-tagged OST subunits (STT3-HA, STT3-protein A, or OST3-HA) demonstrated that Stt3 is in an equimolar complex with the other OST subunits (Karaoglu et al., 1997Go; Spirig et al., 1997Go).

The topology of the Stt3 proteins is more difficult to predict and experimentally verify than any of the other OST subunits. Eukaryotic Stt3 proteins consist of an N-terminal hydrophobic domain composed of 12–13 predicted TM spans followed by a C-terminal hydrophilic domain that is located within the RER lumen (Zufferey et al., 1995Go). The N-termini of canine STT3A and yeast Stt3 is exposed to the cytosol, hence eukaryotic STT3 proteins must have an uneven number of TM spans (e.g., 11 or 13) and a Ncyt-Clum topology (Kelleher et al., 2003Go; Kim et al., 2005Go). STT3 topology models with 13 TM spans (as depicted in Figure 2) are based on the assumption that each of the 13 hydrophobic segments are bona fide TM spans, whereas 11 TM span models necessitate that two of the hydrophobic segments are instead exposed on the cytosolic or lumenal faces of the membrane. A comprehensive investigation of the topology of the yeast and mouse STT3 proteins provided support for two conflicting 11 TM span models (Kim et al., 2005Go), suggesting that structural biology data will be needed to define the topology of the Stt3 proteins.

As initially suggested by the alignment of STT3 protein sequences from S. cerevisiae, C. elegans, and Mus musculus (Zufferey et al., 1995Go), the Stt3 subunit is the most highly conserved polypeptide in the OST complex. With the exception of E. cuniculi, which lacks N-linked glycosylation, all fully sequenced eukaryotic genomes encode at least one member of the Stt3 protein family (Figures 2 and 3). Vertebrate, plant, and insect genomes encode two STT3 proteins that we have designated as STT3A and STT3B (Kelleher et al., 2003Go). Vertebrate STT3A was initially named TM conserved (TMC) (Lissy et al., 1996Go), whereas STT3B is also known as source of immunodominant major histocompatibility complex-associated peptides (SIMP) (McBride et al., 2002Go). Phylogenetic tree analysis shows that STT3B proteins are more closely related to the fungal and nematode STT3 proteins, whereas the STT3A protein cluster includes STT3 from D. discoideum (Figure 3). Several protist genomes encode multiple STT3 proteins that reflect gene duplication events that are unrelated to the derivation of the STT3A and STT3B subfamilies. For example, four consecutive genes on one chromosome encode the predicted Leishmania major STT3 proteins. To our knowledge, it is not known whether multiple STT3 proteins are simultaneously expressed in protist organisms.


Figure 3
View larger version (17K):
[in this window]
[in a new window]
 
Fig. 3. Phylogenetic tree analysis of STT3 proteins. STT3 protein sequences were obtained by protein and DNA sequence database searches (Kelleher et al., 2003Go; Samuelson et al., 2005Go) using the Saccharomyces cerevisiae Stt3 and human STT3A as query sequences. Representative fungal and metazoan STT3 sequences as well as all currently available protist sequences were selected for alignment. Several archaebacterial and eubacterial STT3 homologues were used to root the phylogenetic tree.

 

Ost3/Ost6 (N33 and IAP)
The Ost3 protein was initially detected as a 34-kD subunit in the yeast OST complex (Kelleher and Gilmore, 1994Go). Ost6p is a homologue of Ost3p that is assembled into a second, less abundant, form of the yeast OST complex that lacks Ost3p as a subunit (Karaoglu et al., 1997Go; Knauer and Lehle, 1999Go; Spirig et al., 2005Go). The Ost3 family of proteins have an N-terminal signal sequence that is cleaved in vivo in yeast, a 160–180 residue N-terminal lumenal domain that includes one thioredoxin repeat (Fetrow et al., 2001Go) followed by four predicted TM spans (Karaoglu et al., 1995Go). Consequently, the Ost3 proteins are integrated into the RER with a 4-TM (Nlum-Clum) topology (Figure 2A). Recently, the 4-TM (Nlum-Clum) topology for the Ost3 family has been disputed based upon results obtained with a reporter–gene fusion strategy (Yan et al., 2003Go, 2005Go). An alternative topology model for Ost3 predicts that a very short, weakly hydrophobic segment within the thioredoxin motif of Ost3p functions as a fifth TM span, thereby inverting the topology of Ost3 to yield a 5-TM (Nlum-Ccyt) topology (Yan et al., 2003Go, 2005Go). Both termini of Ost6 are proposed to face the cytosol in a 4-TM (Ncyt-Ccyt) topology (Yan et al., 2005Go), which would expose the N-terminal signal sequence and the thioredoxin domain of Ost6 on to the cytoplasmic side of the membrane (Chavan et al., 2005Go). How can we explain the discrepancy between these recent proposals and the previously accepted topology model shown in Figure 2A? Although reporter–domain fusions have been used extensively to define the topology of bacterial and vertebrate membrane proteins, this strategy can be less satisfactory in yeast (as discussed by Kim H., Melen K., et al., 2003), because integration of membrane proteins can be perturbed by an N-terminal fusion joint that inactivates a signal sequence or by a C-terminal fusion joint that interferes with integration of the final TM span (Green and Walter, 1992Go).

Protein sequence database searches show that members of the Ost3 family of proteins are encoded by all fully sequenced vertebrate, nematode, fungi (except E. cuniculi), arthropod, and plant genomes. In addition, Ost3 homologues are encoded by D. discoideum, Tetrahymena thermophila, and Cryptosporidium parvum. The gene duplication event that is responsible for the OST6 gene is restricted to the Saccharomycetaceae family of budding yeasts. Phylogenetic tree analysis using currently available fungal Ost3/Ost6 sequences indicates that other fungal genomes encode a protein that is more closely related to Ost3 than to Ost6p (not shown). Vertebrate genomes encode two closely related Ost3 homologues that are annotated as N33 and IAP (MacGrogan et al., 1996Go; Kelleher et al., 2003Go). Unlike Ost3 and Ost6 that are present in the purified yeast OST complex (Kelleher and Gilmore, 1994Go; Karaoglu et al., 1997Go) and comigrate with the OST on Blue-Native gels (Knauer and Lehle, 1999Go), N33 and IAP appear to be loosely associated with the canine OST complex and dissociate at various stages during enzyme purification (Kelleher et al., 2003Go) or during Blue-Native polyacrylamide gel electrophoresis (Shibatani et al., 2005Go).

Disruption of the yeast OST3 or OST6 genes does not cause a growth defect at any temperature in an otherwise wild-type strain (Karaoglu et al., 1995Go; Knauer and Lehle, 1999Go). Loss of Ost3p expression in yeast causes a modest, and substrate dependent, reduction in the in vivo glycosylation of proteins (Karaoglu et al., 1995Go). The mild phenotypes of the ost3{Delta} strain are explained in part by redundancy, as an ost3{Delta}ost6{Delta} strain has a more severe defect in glycosylation than either single mutant (Knauer and Lehle, 1999Go; Spirig et al., 2005Go). The C. elegans OST3 gene (ZK686.3) is essential; RNAi of ZK686.3 causes an embryonic lethal phenotype (Kamath et al., 2003Go) because of a defect in cytokinesis (Gonczy et al., 2000Go).

Ost4p
The yeast OST4 gene was identified in a screen for yeast mutants that are resistant to sodium vanadate and sensitive to hygromycin B (Chi et al., 1996Go), two properties that are shared by many yeast mutants that impact different steps in cell-wall biosynthesis (Dean, 1995Go). Point mutations, or the complete deletion, of the nonessential yeast OST4 gene causes severe in vivo hypoglycosylation of glycoproteins and a temperature sensitive growth defect (Chi et al., 1996Go). Remarkably, the OST4 gene encodes a 36-residue polypeptide (Chi et al., 1996Go), which is one of the smallest verified proteins encoded by the yeast genome. The single TM span in Ost4p is preceded by acidic residues and followed by basic residues thus favoring integration in the type I (Nlum-Ccyt) topology as initially proposed (Karaoglu et al., 1997Go; Spirig et al., 1997Go). Although a type I topology for Ost4p has been disputed (Kim et al., 2000Go), a more recent study confirmed the original prediction (Kim H., Yan Q., et al., 2003).

Homologues of yeast Ost4 are encoded by fungi, vertebrate, nematode, insect, plant, and several protist genomes (Figure 2A–C). Relatively few of these OST4 sequences are currently annotated in protein sequence databases. For example, the human and mouse expressed sequence tags (EST) databases contain highly expressed mRNAs that encode the 37-residue Ost4 homologue shown in Figure 4. Ost4 proteins consist of conserved lumenal and TM segments followed by a poorly conserved cytoplasmic domain of variable length (Figure 4). Mass spectroscopy of the purified canine OST complex has shown that a protein with a mass (4193 Da) consistent with the vertebrate Ost4 sequence is present in the complex (Kelleher and Gilmore, unpublished data). Hence we are confident that Ost4 is an OST subunit in diverse eukaryotes (Figure 2A–C).


Figure 4
View larger version (31K):
[in this window]
[in a new window]
 
Fig. 4. Alignment of eukaryotic OST4 sequences. OST4 sequences were obtained by protein and DNA sequence database searches using Saccharomyces cerevisiae Ost4 as the query sequence. Representative fungal and metazoan OST4 sequences as well as all currently available protist sequences were selected for alignment using ClustalW. Regions of homology are boxed, with amino acid identities designated by dark shading. Replacement of hydrophobic residues (M18 to I24) in the TM span of Saccharomyces cerevisiae Ost4 with lysine or aspartic acid caused temperature sensitive growth defects and destabilized the interaction between Stt3 and Ost4 (asterisks) or destabilized the interaction between Ost4 and Ost3 (asterisks and diamonds) (Kim et al., 2000Go).

 

Ost5
Ost5 was initially detected as a 9-kDa subunit in the yeast OST complex (Kelleher and Gilmore, 1994Go). The OST5 gene was identified in a screen for yeast genes that encode OST subunits (Reiss et al., 1997Go). Deletion of the yeast OST5 gene causes a minor defect in the in vivo glycosylation of yeast proteins and a reduction in OST activity that is not caused by an altered affinity for donor or acceptor substrates (Reiss et al., 1997Go). Ost5p is an 86-residue protein with two-predicted TM spans. The 2-TM Ncyt-Ccyt topology that was predicted for Ost5p (Reiss et al., 1997Go) has been confirmed (Yan et al., 2005Go). Although putative Ost5 homologues can be identified in metazoan organisms using protein sequence database searches (Kelleher and Gilmore, unpublished results), biochemical evidence that these polypeptides are OST subunits has not been obtained (Kelleher et al., 2003Go; Shibatani et al., 2005Go).


    The STT3 subunit contains the catalytic site of the OST
 Top
 Abstract
 Introduction
 Ancient origin of N-linked...
 Oligosaccharide donors for the...
 Defects in donor assembly...
 Acceptor substrates and peptide...
 Context factors that influence...
 Subunit composition of the...
 The STT3 subunit contains...
 Organization of subunits into...
 Donor-substrate selection by the...
 Conclusions
 References
 
The five OST subunits (Stt3, Ost1, Ost2, Swp1, and Wbp1) that are encoded by essential yeast genes were the logical candidates for the active-site subunit of the OST. Of these five proteins, only the Stt3 protein has archaebacterial homologues (Spirig et al., 1997Go). Alignment of eukaryotic and archaebacterial STT3 proteins resulted in the identification of a short invariant sequence (WWDXG motif) that is present in all members of the STT3 family (Spirig et al., 1997Go; Burda and Aebi, 1999Go). During the past several years, overwhelming evidence has been obtained indicating that Stt3 family proteins catalyze N-linked glycosylation in eukaryotes, archaebacteria and in selected eubacteria.

The most direct evidence that Stt3 proteins are the catalytic subunits of the eukaryotic OST was provided by an elegant in vivo analysis of the C. jejuni pgl operon. N-linked glycosylation of AcrA, a C. jejuni membrane protein was dependent upon a wild-type PglB locus (Wacker et al., 2002Go). Moreover, the expression of AcrA and the complete pgl operon in Escherichia coli reconstituted N-linked glycosylation activity in the E. coli host unless point mutations were introduced into the WWDYG motif in PglB (Wacker et al., 2002Go). Genetic analysis of PglB has been facilitated by the nonessential nature of N-linked glycosylation in C. jejuni and in the E. coli host system. Structural analysis of the resulting glycopeptide showed that the modified asparagine (N*) was present within the peptide DFN*VSK (Wacker et al., 2002Go), where the underlined sequence matches the N-X-T/S sequon for eukaryotic N-linked glycopeptides. The N-X-T/S sequence is necessary, but not sufficient for N-linked glycosylation in C. jejuni, as several N-X-T/S sites in the AcrA and HisJ glycoproteins are not modified (Nita-Lazar et al., 2005Go). However, because 35% of N-X-T/S sequons are not modified by the eukaryotic OST, further work will be required to define a minimum consensus site for PglB. The heptasaccharide (GalNAc-{alpha}1,4-GalNAc-{alpha}1,4-(Glcß1,3)-GalNAc-{alpha}1,4-GalNAc-{alpha}1, 4-GalNAc-{alpha}1,3-Bac) transferred to proteins by the C. jejuni PglB protein bears little resemblance to the high-mannose oligosaccharide donor assembled by eukaryotic cells (Young et al., 2002Go), except that the s