Glycobiology Advance Access originally published online on October 20, 2005
Glycobiology 2006 16(5):71R-81R; doi:10.1093/glycob/cwj049
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
REVIEW |
GLYCOSCIENCES.de: an Internet portal to support glycomics and glycobiology research
2 Present address: Division of Biological Engineering, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139
3 Present address: Mannheim Medical Research Center, University of Heidelberg, Theodor-Kutzer-Ufer 1-3, D-68135 Mannheim, Germany
4 Present address: Bijvoet Center for Biomolecular Research, Utrecht University, Padualaan 8, NL-3584 CH Utrecht, The Netherlands
Spectroscopic Department (B090), German Cancer Research Center, Molecular Modelling, Im Neuenheimer Feld 280, D-69120 Heidelberg, Germany
1 To whom correspondence should be addressed; w.vonderlieth{at}dkfz.de
Received on August 25, 2005; revised on October 7, 2005; accepted on October 11, 2005
| Abstract |
|---|
The development of glycan-related databases and bioinformatics applications is considerably lagging behind compared with the wealth of available data and software tools in genomics and proteomics. Because the encoding of glycan structures is more complex, most of the bioinformatics approaches cannot be applied to glycan structures. No standard procedures exist where glycan structures found in various species, organs, tissues or cells can be routinely deposited. In this article the concepts of the GLYCOSCIENCES.de portal are described. It is demonstrated how an efficient structure-based cross-linking of various glycan-related data originating from different resources can be accomplished using a single user interface. The structure oriented retrieval optionsexact structure, substructure, motif, composition and sugar componentsare discussed. The types of available datareferences, composition, spatial structures, nuclear magnetic resonance (NMR) shifts (experimental and estimated), theoretically calculated fragments and Protein Database (PDB) entriesare exemplified for Man3. The free availability and unrestricted use of glycan-related data is an absolute prerequisite to efficiently share distributed resources. Additionally, there is an urgent need to agree to a generally accepted exchange format as well as to a common software interface. An open access repository for glyco-related experimental data will secure that the loss of primary data will be considerably reduced.
Key words: databases/glycobioinformatics/glycomics/structure retrieval
| Introduction |
|---|
With the awareness that the human genome encodes for a significant smaller number of genes than estimated from genomes of lower organisms like yeast (Sequencing, 2004
Such data are spread over many publications, using different formats and representations. Because a variety of graphical cartoons are often used to report the detected glycan structures, the retrospective extraction of these data has to be done by well-trained people, which is time intensive and costly. Consequently, no comprehensive routinely updated freely available database for glycans is currently available.
The Complex Carbohydrate Structure Database (CCSD) (Doubet et al., 1989
; Doubet and Albersheim, 1992
)often named as CarbBank according to the retrieval software to access the datawas developed and maintained by the Complex Carbohydrate Research Centre of the University of Georgia (USA). It was the largest effort during the 1990s to collect glycan structures mainly through retrospective manual extraction from literature. The main issue of this approach was to easily find all publications in which specific carbohydrate structures were reported. However, when the funding stopped during the second half of the 1990s, CarbBank was not further developed and CCSD no longer updated. Nevertheless, with about 45,000 entries of about 20,000 different structures, the CCSD is still the largest repository of glycan-related data. The text-oriented structural description of glycans as developed by CarbBank has been adopted by most of the newer approaches. Also the three larger database projects which emerged in recent yearsKyoto Encyclopedia of Genes and Genomes (KEGG)-glycan (Aoki et al., 2004
; Hashimoto et al., 2005
), the bioinformatics core of the US Consortium for Functional Glycomics (CFG) (http://www.functionalglycomics.org) as well as the GLYCOSCIENCES.de portal (http://www.glycosciences.de) (Goetz et al., 2004
) [the former SWEET-DB (Loss et al., 2002
)]provide access to CCSD entries and enable amongst others a CarbBank-like (Doubet et al., 1989
; Doubet and Albersheim, 1992
) representation of glycan structures. However, depending on the scientific background each of the three groups originally came from, the CCSD glycan data have been incorporated in diverse ways to support research in different areas of glycobiology. All three implementations in common are that the retrieval of glycan structures and related data is offered through a (sub)structure search. A speciality of the KEGG-glycan approach (Aoki et al., 2003
, 2005
) is to provide access to sugar structures through manually drawn pathway maps representing current knowledge on glycan biosynthesis and metabolism of various species. As all known enzymes involved in making and degrading the glycans are also reported, a direct link to genomics and proteomics data is established. The CFG databases are primarily designed to include experimental data produced by the various cores of the consortium. An especially useful approach is the glycan array screening facility. It is the first freely available database presenting reliable and consistent data on protein carbohydrate interactions. In such a way access to carbohydrates which are recognized by a specific lectin are provided. The GLYCOSCIENCES.de portal is an attempt to link glycan-related data originating from various resources through a unique structural description (see supplementary material 1). Special emphasis has been made to provide an easy access to the available experimentally determined spatial structures of glycans and to analyze their interactions with proteins.
In this article, we will present the underlying concepts of the GLYCOSCIENCES.de portal and describe its current status with emphasis on recently added new features. Additionally, we will demonstrate how an efficient structure-based cross-linking of various glycan-related data originating from different resources can be accomplished using a single-user interface. Finally, we will discuss the required next steps to link smaller projects into a powerful network of online connected glycan databases.
| Basic concepts |
|---|
Comparable to data collections of gene and protein sequences, glycan structures are a logical element to arrange all associated data including experiments, references, spectra and biological occurrence. However, since oligosaccharides can exhibit varying and multiple linkages, they cannot be described as simple sequences. To use a glycan structure as identifier in database applications, a special encoding scheme is required, which transforms branched oligosaccharide chains into a unique linear description. The GLYCOSCIENCES.de portal uses the LInear Notation for Unique description of Carbohydrate Sequences (LINUCS) notation (Bohne-Lang et al., 2001
Another practical complicacy to develop informatics tools for glycosciences originates from the fact that different scientific communities use different ways to represent complex carbohydrate structures. Medically and biologically oriented scientists prefer to report compositions or cartoon-like representation. In contrast chemists favor a complete structural description either in form of an implicit identification of the stereochemistry through symbolic names (e.g., a-D-Galp) or through the use of pseudo three-dimensional (3D) plots, where the stereochemistry of each atom is explicitly drawn utilizing wedges to indicate the spatial orientations of bonds. On the other hand X-ray crystallographers, NMR spectroscopists and molecular modelers often prefer 3D representations of glycans. Although all mentioned representations of glycan structures should be offered and users should have a choice to select their preferred depiction, only one internal descriptionthis is the LINUCS notation in GLYCOSCIENCES.deis in principle required from which all the other representations can be automatically generated.
There is currently an ongoing debate about the most convenient way how users can input carbohydrate structures. Java-based graphical approaches (Kikuchi et al., 2005
) are discussed, as well as tools where monosaccharide residues and their linkages can be selected from pull-down menus and input into spreadsheets (Cooper et al., 2003
). The GLYCOSCIENCES.de portal currently supports only the input using the so called text-oriented extended IUPAC description, which was introduced by CarbBank. This approach has the advantage, that a carbohydrate structure can be easily transferred between various applications using the normal copypaste mechanism provided by any browser. Because graphically oriented input facilities are now freely available like KEGG-Draw (Aoki et al., 2004
), it is aimed to integrate such tools. For novice users and those who only seldom use the portal, an intuitive graphical interface will reduce the barrier to access the data.
The glycan structure may not be the only desired way to retrieve data. Standard database technology allows storing data in separate tables, which can be cross-referenced by unique identifies. In principle, any stored data can be used for user queries, provided the underlying data model is appropriately structured according to the needs of specific scientific questions. Besides various options of structural retrieval the GLYCOSCIENCES.de portal has implemented options to access glycan-related data through the input of experimental data like NMR, MS, crystallographic data or biological occurrence.
| Structural retrieval |
|---|
According to the varying needs of specific research questions, the GLYCOSCIENCES.de portal provides several structure-oriented options to recall glycan-related data. The retrieval of exactly matching glycans is the most traditional way to access a database. Here, it is accomplished through the lookup of the LINUCS (Bohne-Lang et al., 2001
To retrieve glycans based on composition of residues is an especially useful option for larger structures, which also exhibit residues whose masses differ from that of frequently occurring hexoses (see supplementary material 5/6). It can be searched for the exact number of contained residues as well as ranges (e.g., between 2 and 6 Hex or more than 3 NeuAc residues).
| Substructure search |
|---|
Substructure search is the most frequently used way to look for glycan structures. The GLYCOSCIENCES.de portal offers a so-called "beginner" spreadsheet, where a limited number of residues and linkages can be selected from a pull-down menu. The advanced substructure search requires to manually type the name of to be retrieved monosaccharide units and their linkages. Wildcards either for a single character (?)e.g., if it is unknown if the anomeric carbon is
or ßor for any zero or more characters (*) are supported. Figure 1 depicts the search query for a trisaccharide substructure with various wildcards. Some typical retrieved structures are also shown where the matched substructures are indicated in red. The advanced mode allows to search for two substructures simultaneously, which can be logically connected by an "AND" or an "OR" operation. Figure 2 shows some examples of structures, which contain the N-glycan core as well as the LewisX motif. For both substructure retrieval options, the user can indicate whether he/she wants to access only entries, for which specific resources like NMR spectra or PDB structures are available. For all retrieved structures, direct links to the other stored data are enabled activating the corresponding buttons.
|
|
Two substructures, each containing up to five residues, can be searched in a single query. Because typical carbohydrate structures exhibit between three and about twelve residues, the implemented search is capable to deal with most types of substructures.
| Motif search |
|---|
The motif search enables to retrieve all entries, which exhibit substructures having names like LewisX, blood group H antigen or GM3 (see supplementary material 7). Currently, about 50 frequently used motifs are retrievable. This list can be easily expanded, since simply the connections of residues for a new substructure, its representation using a IUPAC text description and the associated common name have to be added to an editable ASCII-file. The motif search also allows searching for O-glycan core structures (Figures 3 and 4).
|
|
Because N-glycans show a structural high complexity, a separate search interface was implemented (see supplementary material 8) to customize the query by looking for types of N-glycans (high mannose, complex or hybrid), the number of antennas, the type and the number of terminating residues as well as bisecting and core-fucosylation. Additionally, the retrieval of N-glycans can be combined with the motif search (see Figure 4 for some typical results).
| Linked databases |
|---|
The various structure-based retrieval options provide access to several data resources:
- References taken from CCSD up to 1997 and some newly entered references.
- 3D coordinates automatically generated with the SWEET-II service (Bohne et al., 1999
).
- 1H and 13C NMR-spectra: about 1500 spectra taken from Sugabase (van Kuik and Vliegenthart, 1992
; van Kuik et al., 1992
) plus about 2000 NMR spectra manually extracted from the literature, both represented as lists of peaks, which are assigned to a certain atom.
- Estimated 1H and 13C NMR-shift lists for those structures, where no experimental NMR spectra are available.
- Masses of theoretically calculated fragments (Lohmann and von der Lieth, 2003
, 2004
).
- Ligands, N- and O-glycans contained in PDB. The automatic assignment of carbohydrate chains is based on 3D coordinates and their connectivity (Lütteke et al., 2004
).
- 3D conformational maps for many glycosidic linkages.
Figure 5 depicts the type of experimental, thereof derived and other generated data, which can be retrieved through the GLYCOSCIENCES.de portal for a specific carbohydrate. Here the N-glycan core region (Man3) is taken as an example as provided for the exact structure search (click on Trimannsoyl core N-glycan to activate Man3). At first, the user gets an overview where the availability of each type of data is indicated. Activating the corresponding fields a full display of the associated data is presented. By default only the carbohydrate structure will be displayed using the CarbBank representation. In Figure 5a the contained motif, the general structure data as well as the composition are depicted.
|
| Theoretical 3D structures |
|---|
Clicking on the "theor(etical) 3D co-ord(inate)" button below the structure, a spatial structure of Man3 is displayed, which has been automatically generated using the SWEET-II service (Bohne et al., 1999). Subsequently the initially constructed geometry was optimized with the MM3 force field as implemented in die TINKER-software package (http://dasher.wustl.edu/tinker/). The JMOL (http://jmol.sourceforge.net/) JAVA applet is used for the 3D molecular display. Because JMOL performs well for all commonly used browsers, no additional software has to be installed locally by the user. One can easily rotate the molecule, choose between various display options and input RASMOL (Sayle and Milner-White, 1995
| NMR |
|---|
If measured 1H and 13C-NMR shifts are available, the stored data will be presented when activating the corresponding button (Figure 5c). A list of all shifts and their assignments to atoms is displayed. The linkage paththis is the list of glycosidic attachment positions starting from the non-reducing endis used to provide a unique identification of each residue within a given glycan chain. If noor not for all atoms of a given structureexperimental shifts are stored, a procedure to estimate 1H and 13C-NMR is implemented (Figure 5d). The procedure to estimate NMR shifts can be invoked by following the NMR link on the databases tab at the top of the page. The NMR shift estimation is based on an appropriate encoding of the structural environment for each atom in the database, which is stored in a table together with the assigned shift value. For the estimation of 1H and 13C-NMR shifts, the same encoding is applied to the input structure, all stored shifts values for a given code are recalled and a statistical analysis is performed. Normally, the mean of all stored shifts is taken for the prediction. The mean, minimum, maximum and standard deviation for the distribution of shifts is displayed for each estimated shift. The user has the possibility to examine the underlying distribution of shifts in detail and retrieve the original spectra from which specific shifts originate.
| MS |
|---|
To assist the interpretation of MS spectra, the total mass and all possible fragmentations of the glycosidic linkage are displayed when activating the corresponding button (Figure 5e). By default all B, C and X, Y fragments are presented. The other theoretically possible fragments can be easily looked up when posting the structure to the GlycoFragment (Lohmann and von der Lieth, 2003
| Experimental 3D structures |
|---|
The PDB (Berman et al., 2000
| Several derived data sets are generated from the detected structures and their protein environment |
|---|
- GlyVicinity: (Lütteke et al., 2005
) All amino acids in the spatial vicinity of each carbohydrate moiety and the interacting atoms are stored in a separate file and can be analyzed in various ways with the help of the GlyVicinity interface.
- Glyseq: (Lütteke et al., 2005
) A statistical analysis of the frequency of amino acids found in the neighborhood of the detected N- and O-glycosylation sites can be performed.
- GlyTorsion: (Lütteke et al., 2005
) The torsion angles of the glycosidic linkages between two sugar rings, which dominantly determine the 3D shape of carbohydrate structures, are automatically detected for attached glycans as well as ligands and are stored in a separate file. They can be recalled to build 3D structures of attached glycan, as it is done in the GlyProt (Bohne-Lang and von der Lieth, 2005
) (Table I) service, to be compared with theoretically calculated conformational maps (GlycoMapsDB) or to judge if a carbohydrate 3D structure is reasonable (Carp).
|
Last but not least, the GLYCOSCENCES.de portal provides access to all stored references for a given structure (Figure 5h). An automatic link to abstracts as provided by the PubMed service is enabled. However, since the systematic update of CarbBank stopped in 1998 and only references containing NMR peak-lists have been added, not the complete literature is covered.
| Summary and discussion |
|---|
Table I presents a list of the purposes of all applications accessible through the GLYCOSCIENCES.de portal. The structure-oriented integration of glycan-related scientific data originating from various resources can be regarded as a proof of principle, that the users dream recently expressed at the joint meeting of the US and Japanese consortia for glycomics might become reality within a foreseeable future: "It was agreed that the construction of databases is an area of interest to everyone and is ripe for synergistic effort. A distributed database with a single search engine, accessible to everyone, was thought to be a worthy goal." (CFG Consortium Quarterly Newsletter, Vol. 3, No. 2, December 2004, http://www.functionalglycomics.org/static/consortium/news.shtml#newsletter).
Basic prerequisite for this idea is that the organizations maintaining digital collections of glycan-related scientific data make it universally and freely accessible through the Internet (open access philosophy: free availability and unrestricted use) in an easily readable and agreed format without any barrier. Unfortunately this situation has not yet been reached and it will probably take some time to convince database providers to join the open access community and to invest time to convert their internal descriptions to standard formats.
It is obvious that the upcoming high-throughput glyco-proteomics and glycomics projects will produce a large amount of experimental data. It is predictable that especially the availability of glyco-arrays providing data which oligosaccharide binds to which lectin as well as the MS-based profiling of the glycans, which are found in normal and diseased tissues, will attract considerable attention. Because, for example, the CFG has clearly announced that they will provide open access to their data and since most of the information is already available in a digital format on the Internet, it will be a logic expansion of the GLYCOSCIENCES.de portal to cross-link this information with the already available data. A prerequisite for an efficient exchange of data will be the agreement to a generally accepted exchange format as well as to a common software interface. The need to establish a readable exchange format has been recognized by various research groups: CabosML (Kikuchi et al., 2005
), GLYcan Data Exchange (GLYDE) (http://lsdis.cs.uga.edu/projects/glycomics/index.php?page=4) and Bacterial Carbohydrate Structure Data Base (http://www.glyco.ac.ru/bcsdb/start.shtml).
Consequently, several proposals for an eXtensible Markup Language (XML)-based description of glycan structures already exist and it is foreseeable that some form of consensus will be reached within the near future. This agreement may not cover all peculiarities for all types of carbohydrates found in nature. However, it will be sufficiently comprehensive for all glycan structures found in mammalians. It seems that the simple object access protocol (SOAP) (http://www.w3.org/TR/soap/) is now the broadly accepted procedure for communication between applications. Being designed to communicate through Internet, it is well suited to be also used for the exchange of glycan-related data between distributed computers. Taking together, the time seems to be mature to establish an online connection of distributed databases at least between the larger already established projects.
Besides the larger projects, many other initiatives exist where glycoscientists have made available their scientific data on the Internet, using a representation that is often closely related to a particular experimental data format or to a certain biological system (von der Lieth, 2004
). This diversity of data models hampers an efficient cross-linking with other resources and makes comparative glycomics analysis difficult. Currently, the creation of disconnected and incompatible islands of glycomics data continues.
One obvious reason for this undesirable situation is that no well-established procedures and repositories exist, where glycoscientists can deposit their glycan structures and related experimental data (von der Lieth, 2004
). In genomics and proteomics research, it is a standard procedure that scientists submit their sequences before publication to one of the large repositories like GenBank (Benson et al., 2004
) for DNA data or PDB (Berman et al., 2004) for 3D macromolecular structures. No such procedures exist for glycomics data. Although the currently produced amount of scientific data as compared with genomics and proteomics is still modest in size, the foreseeable rapid evolution of glycomics research will result in an increasing number of glycan structures found in various tissues and species. Therefore, it is an obvious demand that a corresponding repository will have to be established for glycomics research as well. However, since glycans exhibit branched structures and the monosaccharide units are connected in several ways, oligosaccharides cannot be encoded using a simple linear code. Therefore, additional software tools have to be provided, which enable all scientists to input glycan structures in an easily manageable way.
Based on the idea that the Internet offers the unique chance to constitute a global and interactive communication for scientific data, the EUROCarbDB (http://www.eurocarbdb.org) project, a design study funded by the sixth framework program of the European Union, develops a network of locally installed distributed databases for glycosciences. The availability of such tools will encourage people to input their recorded experimental data into a local database that may be kept private until it is published. Additionally, the released data will be further annotated, stored and archived in a central database which will be maintained at the European Bioinformatics Institute (EMBL-EBI) (http://www.ebi.ac.uk).
The existence of a broadly accepted open access repository for glyco-related experimental data will hopefully secure that the loss of primary data in glycobiology research will be considerably reduced. The agreement to quality standards and notations will not only raise the scientific usefulness of the stored data, it will open the opportunity to apply various data-mining approaches including multivariate statistics and artificial neural network algorithms to extract new information and to derive new knowledge which until now is hidden in unstructured data.
| Supplementary data |
|---|
Supplementary data are available at Glycobiology online (http://glycob.oxfordjournals.org/).
| Acknowledgments |
|---|
The development of the GLYCOSCIENCES.de portal is funded by a grant from the German Research Council (Deutsche Forschungsgemeinschaft: DFG) within the digital library program.
| Abbreviations |
|---|
3D, three-dimensional; CCSD, Complex Carbohydrate Structure Database; CFG, Consortium for Functional Glycomics; KEGG, Kyoto Encyclopedia of Genes and Genomes; LINUCS, LInear Notation for Unique description of Carbohydrate Sequences; MS, mass spectrometry; NMR, nuclear magnetic resonance; PDB, Protein Database
| References |
|---|
Aoki, K., Yamaguchi, A., Okuno, Y., Akutsu, T., Ueda, N., Kanehisa, M., and Mamitsuka, H. (2003) Efficient tree-matching methods for accurate carbohydrate database queries. Genome Inform., 14, 134143.
Aoki, K., Yamaguchi, A., Ueda, N., Akutsu, T., Mamitsuka, H., Goto, S., and Kanehisa, M. (2004) KCaM (KEGG Carbohydrate Matcher): a software tool for analyzing the structures of carbohydrate sugar chains. Nucleic Acids Res., 32, W267W272.
Aoki, K., Mamitsuka, H., Akutsu, T., and Kanehisa, M. (2005) A score matrix to reveal the hidden links in glycans. Bioinformatics, 21, 14571463.
Benson, D., Karsch-Mizrachi, I., Lipman, D., Ostell, J., and Wheeler, D. (2004) GenBank: update. Nucleic Acids Res., 32, D23D26.
Berman, H.M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T.N., Weissig, H., Shindyalov, I.N., and Bourne, P.E. (2000) The protein data bank. Nucleic Acids Res., 28, 235242.
Blixt, O., Head, S., Mondala, T., Scanlan, C., Huflejt, M., Alvarez, R., Bryan, M., Fazio, F., Calarese, D., Stevens, J., and others. (2004) Printed covalent glycan array for ligand profiling of diverse glycan binding proteins. Proc. Natl. Acad. Sci. U. S. A., 101, 1703317038.
Bohne, A., Lang, E., and von der Lieth, C.W. (1999) SWEET WWW-based rapid 3D construction of oligo- and polysaccharides. Bioinformatics, 15, 767768.
Bohne-Lang, A. and von der Lieth, C.W. (2005) GlyProt: is silico glycosylation of proteins. Nucleic Acids Res., 33, W214W219.
Bohne-Lang, A., Lang, E., Forster, T., and von der Lieth, C.W. (2001) LINUCS: linear notation for unique description of carbohydrate sequences. Carbohydr. Res., 336, 111.[CrossRef][ISI][Medline]
Cooper, C., Joshi, H., Harrison, M., Wilkins, M., and Packer, N. (2003) GlycoSuiteDB: a curated relational database of glycoprotein glycan structures and their biological sources. Nucleic Acids Res., 31, 511513.
Dell, A. and Morris, H. (2001) Glycoprotein structure determination by mass spectrometry. Science, 291, 23512356.
Doubet, S. and Albersheim P. (1992) CarbBank. Glycobiology, 2, 505.
Doubet, S., Bock, K., Smith, D., Darvill, A., and Albersheim, P. (1989) The complex carbohydrate structure database. Trends Biochem. Sci., 14, 475477.[CrossRef][ISI][Medline]
Feizi, T. and Chai, W. (2004) Oligosaccharide microarrays to decipher the glyco-code. Nat. Rev. Mol. Cell Biol., 5, 582588.[CrossRef][ISI][Medline]
Frank, M., Gutbrod, P., Hassayoun, C., and von der Lieth, C. (2003) Dynamic molecules: molecular dynamics for everyone. An internet-based access to molecular dynamic simulations: basic concepts. J. Mol. Model, 9, 308315.
Goetz, T. and von der Lieth, C. (2005) PubFinder: a tool for improving retrieval rate of relevant PubMed. Nucleic Acids Res., 33, W774W778.
Goetz, T., Bohne-Lang, A., Frank, M., Lohmann, K., Loss, A., Lütteke, T., and von der Lieth, C.W. (2004) Glycosciences.de: an Internet portal for glyco-related data from open access resources. In Giegerich R., Stoye, J. (eds.), German Conference on Bioinformatics. Köllen Druck+ Verlag GmbH, Bonn, Germany, pp. 115119.
Hashimoto, K., Goto, S., Kawano, S., Aoki-Kinoshita, K., Ueda, N., Hamajima, M., Kawasaki, T., and Kanehisa, M. (2005) KEGG as a glycome informatics resource. Glycobiology. Epub ahead of print.
Kikuchi, N., Kameyama, A., Nakaya, S., Ito, H., Sato, T., Shikanai, T., Takahashi, Y., and Narimatsu, H. (2005) The carbohydrate sequence markup language (CabosML): an XML description of carbohydrate structures. Bioinformatics, 21, 17171718.
Lohmann, K. and von der Lieth, C. (2003) GLYCO-FRAGMENT. A web tool to support the interpretation of mass spectra of complex carbohydrates. Proteomics, 3, 20282035.[CrossRef][ISI][Medline]
Lohmann, K. and von der Lieth, C. (2004) GlycoFragment and GlycoSearchMS: web tools to support the interpretation of mass spectra of complex carbohydrates. Nucleic Acids Res., 32, W261W266.
Loss, A., Bunsmann, P., Bohne, A., Loss, A., Schwarzer, E., Lang, E., and von der Lieth, C.W. (2002) SWEET-DB: an attempt to create annotated data collections for carbohydrates. Nucleic Acids Res., 30, 405408.
Lütteke, T. and von der Lieth, C.W. (2004) pdb-care (PDB carbohydrate residue check): a program to support annotation of complex carbohydrate structures in PDB files. BMC Bioinformatics, 5, 69.
Lütteke, T., Frank, M., and von der Lieth, C.W. (2004) Data mining the protein data bank: automatic detection and assignment of carbohydrate structures. Carbohydr. Res., 339, 10151020.[CrossRef][ISI][Medline]
Lütteke, T., Frank, M., and von der Lieth, C.W. (2005) Carbohydrate structure suite (CSS): analysis of carbohydrate, 3D structures derived from the PDB. Nucleic Acids Res., 33, D242D246.
Pratt, M.R. and Bertozzi, C.R. (2005) Synthetic glycopeptides and glycoproteins as tools for biology. Chem. Soc. Rev., 34, 5868.
Sayle, R. and Milner-White, E. (1995) RASMOL: biomolecular graphics for all. Trends Biochem. Sci., 20, 374.
Sequencing, I.H.G. (2004) Consortium. Finishing the euchromatic sequence of the human genome. Nature, 431, 931945.[CrossRef][Medline]
van Kuik, J. and Vliegenthart, J.(1992) Databases of complex carbohydrates. Trends Biotechnol., 10, 182185.[CrossRef][ISI][Medline]
van Kuik, J., Hard, K., and Vliegenthart, J.A. (1992) 1H NMR database computer program for the analysis of the primary structure of complex carbohydrates. Carbohydr. Res., 235, 5368.[CrossRef][ISI][Medline]
von der Lieth, C.W. (2004) An endorsement to create open databases for analytical data of complex carbohydrates. J. Carbohydr. Chem., 23, 277297.[CrossRef]
von der Lieth, C.W., Bohne-Lang, A., Lohmann, K., and Frank, M. (2004) Bioinformatics for glycomics: status, methods, requirements and perspectives. Brief Bioinform., 5, 164178.
Wong, C. (2005) Protein glycosylation: new challenges and opportunities. J. Org. Chem., 70, 42194225.[CrossRef][ISI][Medline]
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cit




