Glycobiology Advance Access originally published online on September 22, 2004
Glycobiology 2005 15(2):153-164; doi:10.1093/glycob/cwh151
Glycobiology vol. 15 no. 2 © Oxford University Press 2005; all rights reserved.
Prediction, conservation analysis, and structural characterization of mammalian mucin-type O-glycosylation sites
Center for Biological Sequence Analysis, BioCentrum, Building 208, Technical University of Denmark, DK-2800 Lyngby, Denmark
1 To whom correspondence should be addressed; e-mail: karin.julenius{at}sbc.su.se
Received on January 9, 2004; revised on September 15, 2004; accepted on September 15, 2004
O-GalNAc-glycosylation is one of the main types of glycosylation in mammalian cells. No consensus recognition sequence for the O-glycosyltransferases is known, making prediction methods necessary to bridge the gap between the large number of known protein sequences and the small number of proteins experimentally investigated with regard to glycosylation status. From O-GLYCBASE, a total of 86 mammalian proteins experimentally investigated for in vivo O-GalNAc sites were extracted. Comparisons of mammalian protein homologs showed that a glycosylated serine or threonine is less likely to be precisely conserved than a nonglycosylated one. The Protein Data Bank was analyzed for structural information, and 12 glycosylated structures were obtained. All positive sites were found in coil or turn regions. A method for predicting the location of mucin-type glycosylation sites was trained using a neural network approach. The best overall network used as input amino acid composition, averaged surface accessibility predictions, and substitution matrix profile encoding of the sequence. To improve prediction on isolated (single) sites, networks were trained on isolated sites only. The final method combines predictions from the best overall network and the best isolated site network; it correctly predicted 76% of the glycosylated residues and 93% of the nonglycosylated residues. NetOGlyc 3.1 can predict sites for completely new proteins without loss of performance. The fact that the sites could be predicted from averaged properties, together with the fact that glycosylation sites are not precisely conserved, indicates that mucin-type glycosylation is in most cases a bulk property and not a very site-specific one. NetOGlyc 3.1 is available at www.cbs.dtu.dk/services/netoglyc.
Key words: machine learning / mucin-type / neural networks / O-glycosylation / prediction
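As a rough illustration of two ideas in the abstract, the sketch below computes a windowed amino acid composition around each serine/threonine candidate and evaluates a set of predictions as the fraction of glycosylated and nonglycosylated residues correctly predicted. It is not the NetOGlyc implementation: the function names, the window half-width, and the toy sequence and site labels are all assumptions made for illustration, and the abstract does not state the window size or encoding details actually used.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def candidate_sites(sequence):
    """Return 0-based positions of all Ser/Thr residues (potential O-GalNAc sites)."""
    return [i for i, aa in enumerate(sequence) if aa in "ST"]


def window_composition(sequence, position, half_width=10):
    """Fraction of each amino acid in a window centred on `position`.

    The half-width of 10 is an arbitrary illustrative choice, not a value
    taken from the paper. Windows are truncated at the sequence ends.
    """
    start = max(0, position - half_width)
    end = min(len(sequence), position + half_width + 1)
    window = sequence[start:end]
    counts = Counter(window)
    return {aa: counts.get(aa, 0) / len(window) for aa in AMINO_ACIDS}


def sensitivity_specificity(candidates, true_sites, predicted_sites):
    """Fraction of glycosylated and of nonglycosylated candidates predicted correctly.

    Assumes at least one glycosylated and one nonglycosylated candidate.
    """
    candidates = set(candidates)
    positives = candidates & set(true_sites)
    negatives = candidates - positives
    predicted = set(predicted_sites)
    sensitivity = len(positives & predicted) / len(positives)
    specificity = len(negatives - predicted) / len(negatives)
    return sensitivity, specificity


if __name__ == "__main__":
    toy_sequence = "PTTAPSAG"          # hypothetical sequence, not from the data set
    sites = candidate_sites(toy_sequence)
    features = {pos: window_composition(toy_sequence, pos, half_width=1) for pos in sites}
    print(sites)
    print(features[1]["T"])
    # Hypothetical labels and predictions, purely for illustration.
    print(sensitivity_specificity(sites, true_sites={1, 2}, predicted_sites={1, 5}))
```

In this reading, the reported 76% and 93% correspond to the first and second values returned by `sensitivity_specificity`, computed over all serine and threonine residues in the evaluated proteins.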