Skip Navigation


Bioinformatics Advance Access originally published online on March 14, 2008
Bioinformatics 2008 24(9):1214-1216; doi:10.1093/bioinformatics/btn090
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrowOA All Versions of this Article:
24/9/1214    most recent
btn090v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (1)
Google Scholar
Right arrow Articles by Campbell, M. P.
Right arrow Articles by Rudd, P. M.
PubMed
Right arrow PubMed Citation
Right arrow Articles by Campbell, M. P.
Right arrow Articles by Rudd, P. M.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© 2008 The Author(s)
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

GlycoBase and autoGU: tools for HPLC-based glycan analysis

Matthew P. Campbell *,{dagger}, Louise Royle {ddagger}, Catherine M. Radcliffe {dagger}, Raymond A. Dwek and Pauline M. Rudd {dagger}

Oxford Glycobiology Institute, Department of Biochemistry, University of Oxford, South Parks Road, Oxford OX1 3QU, UK

*To whom correspondence should be addressed.


    ABSTRACT
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 DESIGN AND IMPLEMENTATION
 3 APPLICATIONS AND PERSPECTIVES
 ACKNOWLEDGEMENTS
 REFERENCES
 

Summary: The development of robust high-performance liquid chromatography (HPLC) technologies continues to improve the detailed analysis and sequencing of glycan structures released from glycoproteins. Here, we present a database (GlycoBase) and analytical tool (autoGU) to assist the interpretation and assignment of HPLC-glycan profiles. GlycoBase is a relational database which contains the HPLC elution positions for over 350 2-AB labelled N-glycan structures together with predicted products of exoglycosidase digestions. AutoGU assigns provisional structures to each integrated HPLC peak and, when used in combination with exoglycosidase digestions, progressively assigns each structure automatically based on the footprint data. These tools are potentially very promising and facilitate basic research as well as the quantitative high-throughput analysis of low concentrations of glycans released from glycoproteins.

Availability: http://glycobase.ucd.ie

Contact: matthew.campbell{at}nibrt.ie


    1 INTRODUCTION
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 DESIGN AND IMPLEMENTATION
 3 APPLICATIONS AND PERSPECTIVES
 ACKNOWLEDGEMENTS
 REFERENCES
 
Glycobiology is a distinct field that seeks to determine the roles of sugars in biology; it includes the release, labelling, separation and sequencing of glycan structures that are covalently attached to proteins or lipids. The structure determination of glycans is a difficult task partly due to the large diversity of structures found in nature, the complex biosynthetic structural variations and the physical and chemical similarities of the monosaccharide. The next major challenge in the post-genomic era is to fully determine and understand the implications of post-translational modifications, such as glycosylation.

Bioinformatics solutions for glycobiology and glycomics are still in their infancy compared to those tools available in the proteomic and genomic fields. Several large scale initiatives have been established to develop technologies and resources to advance data handling of large and diverse datasets and to assist data interpretation these include the Consortium for Functional Glycomics (CFG) (Raman et al., 2006), Kyoto Encyclopedia of Genes and Genomes (KEGG) (Hashimoto et al., 2006), Glycosciences.de (Lutteke et al., 2006), CAZy (http://afmb.cnrs-mrs.fr/CAZY/), BCSDB (Toukach et al., 2007), GlycoSuiteDB (Cooper et al., 2003) and CarbBank (Doubet and Albersheim, 1992). The most recent of these projects is the EUROCarbDB initiative (www.eurocarbdb.org) which aims to establish a framework for glycan data deposition obtained by high-performance liquid chromatography (HPLC), MS and NMR techniques and the development of tools to assist data interpretation.

One of the main limiting factors restricting the development and application of glycobioloy is the lack of a well established, rapid and automated high-throughput platform. We have recently developed and validated a HPLC technology based on a 96-well plate format for the detailed analysis of glycans at concentrations required for biomedical applications (low femtomoles of N-linked sugars released from micrograms of glycoproteins). However the interpretation, annotation and assignment of HPLC-glycan data is currently a manual and very time-consuming aspect of glycan analysis performed by experts (Anumula, 2000; Guile et al., 1996; Royle et al., 2006). Consequently, we have built a relational database (GlycoBase) and an analytical tool (autoGU) in conjunction with EUROCarbDB to assist data interpretation to bring glycan analysis within the reach of any well-established laboratory.


    2 DESIGN AND IMPLEMENTATION
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 DESIGN AND IMPLEMENTATION
 3 APPLICATIONS AND PERSPECTIVES
 ACKNOWLEDGEMENTS
 REFERENCES
 
GlycoBase and autoGU are based on open-source technology, developed in Perl, Common Gateway Interface (CGI) and tested on SUSE 10.0 running Apache 2.0. The relational database was set-up in MySQL 5.0 and the schema is based on the EUROCarbDB-HPLC data model which can be easily modified and extended. The html-based web interface has been tested using IE6.0-7.0 and Firefox 1.0-2.0.


    3 APPLICATIONS AND PERSPECTIVES
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 DESIGN AND IMPLEMENTATION
 3 APPLICATIONS AND PERSPECTIVES
 ACKNOWLEDGEMENTS
 REFERENCES
 
GlycoBase and autoGU are novel developments for the interpretation and annotation of glycan sequencing data.

3.1 HPLC strategy for glycan analysis
Normal–Phase (NP)-HPLC using amide-based columns is a robust and reproducible method for high-resolution separation of N-linked glycans released from glycoproteins (Anumula, 2000, 2006; Guile et al., 1996; Tomiya et al., 1988). Released glycans labelled with a fluorophore e.g. 2-aminobenzamide enables detection at the femtomole level. The advantages of NP-HPLC analysis of 2-aminobenzamide (2AB)-labelled glycans are: (i) a total glycan pool including charged and neutral glycans can be analysed at once compared to capillary electrophoresis (CE) where samples are usually desialylated before analysis or matrix-assisted laser desorption/ionization time-of-flight (MALDI-TOF) MS in which sialic acid linkages are unstable; (ii) the 1:1 stoichiometric labelling of the released glycans allows for accurate and quantitative measurement of the relative amounts of individual glycans; (iii) the addition or subtraction of monosaccharides shifts the retention time by predictable amounts (unlike RPHPLC, CE or HPAEC-PAD) enabling detailed structural assignments to be made when HPLC analysis is combined with exoglycosidase digestions and database matching and (iv) NP-HPLC can separate structures with the same composition on the basis of sequence and linkage type (mass-spectrometry analysis can distinguish differences when used in combination with chromatographic separation and/or fragmentation analysis).

Glycan profiles from NP-HPLC are calibrated against a 2AB-labelled dextran ladder (2AB-glucose homopolymer, Ludger Ltd.) and assigned glucose unit (GU) values by fitting a fifth order polynomial distribution curve to allocate GU values from retention times (using Empower GPC software from Waters Ltd). Glycan structures/composition are assigned using the database of GU values and confirmed by a series of exoglyosidase digestions (Royle et al., 2006). GU values can be used as reference standard values because the calibration minimizes day-to-day and/or system variations.

We have developed a high-throughput method for the efficient release and 2AB labelling of 96 glycoprotein samples in 2–3 days (Fig. 1). Preliminary HPLC analysis takes an additional 2 days (using 30 min run) followed by exoglycosidase sequencing steps and structural assignments of the glycan pool by database matching (GlycoBase and autoGU); a similar approach was recently used in a pilot scale to identify glycosylation changes in total serum of patients with advanced ovarian cancer (Saldova et al., 2007). The high-throughput methodology with supporting data has been recently published (Royle et al., 2007).


Figure 1
View larger version (30K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Fig. 1. An overview of the experimental strategy used for releasing, labelling and sequencing glycans using our HPLC platform. GlycoBase and autoGU assist HPLC profile data interpretation by analyzing each integrated profile submitted in a sequentially manner based on the exoglycosidase digestion methods. GlycoBase can be used independently to manually assign profiles by using the well curated GU values.

 
3.2 GlycoBase
GlycoBase contains more than 350 2AB-labelled N-linked glycan structures including 117 identified in the human serum glycome (Fig. 2). All structures were determined at the Oxford Glycobiology Institute by a combination of NP-HPLC with exoglycosidase sequencing and mass spectrometry (MALDI-MS, ESI-MS, ESI-MS/MS, LC-MS, LC-ESI-MS/MS). GlycoBase contains the HPLC elution positions (expressed as glucose unit values) for each individual glycan alongside the products of exoglycosidase digestions (Royle et al., 2006). The GU value for each glycan is related to the number and linkage types of its constituent monosaccharides. The SD of the GU value is ≤0.1 for 91% (≤0.2 for 97%) of neutral structures and ≤0.2 for 95% of charged structures, with more than one reference value. This is calculated for over 800 referenced GU values from the accumulated experimental data in GlycoBase. Each entry is comprehensively annotated and includes the following: a pictorial representation of the structure depicting monosaccharide sequence and linkages; NP-HPLC retention time expressed as an average GU value, with SD (calculated from all listed published data for that structure); the monosaccharide composition; related reference information; links to the identified exoglycosidase digest products and a list of the subgroups in which the glycan can be found. The subgroups eliminate structures which are not found/possible therefore assisting glycan searches; humans do not encode the glycosyltransferase responsible for adding core {alpha}1–3 fucose units unlike plants.


Figure 2
View larger version (28K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Fig. 2. GlycoBase a novel database containing over 350 2AB-labelled glycan structures. Each glycan entry is comprehensively annotated with its NP-HPLC retention time (standardized to GU values); exoglycosidase digestion products; monosaccharide composition and publication references. The displayed entries can be filtered by selecting a selection of structural features including branching complexity.

 
Many of the well-established glycan databases offer a vast amount of information including: glycan biosynthetic pathways; glyco- and microarray data and structural and chemical information for natural and chemically synthesized glycans. We recognize that our dataset is comparatively small but none of these large databases provide standardized experimental information which can be used to interpret HPLC-glycan data with a high degree of accuracy. Consequently, GlycoBase is an important development in the field of glycobiology. The availability of high quality data generated by a highly sensitive, robust and quantitative method has recently been used to assist full glycan characterization (Domann et al., 2007; Saldova et al., 2007).

3.3 autoGU
The interpretation of a complete set of exoglycosidase digestions can be very time consuming and a difficult exercise for an inexperienced glycobiologists. We have developed database matching software (autoGU) which automatically assigns possible glycan structures to each HPLC peak. When used in combination with data from a series of exoglycosidases autoGU will create a refined list of structures based on the digest footprint i.e. shifts in GU values due to cleavage of terminal monosaccharides dependent on enzyme specificity.

Integrated HPLC data in the format of GU values and percentage areas is initially submitted to a search of entries in GlycoBase. Preliminary glycan structures are assigned to each peak where a GU match exists using the mean and standard deviation values for each structure or ±0.3 GU where only one reference source is reported.

Initially, several different glycan structures can be assigned to each peak (undigested sample) however each glycan structure has a unique exoglycosidase digest footprint (all digestion pathways are stored in GlycoBase). The sequential digestion of a glycan with exoglycosidases can be used to accurately and quantitatively sequence a glycan pool (Royle et al., 2006). The removal of monosaccharides from the non-reducing end changes the retention time by a predictable amount, therefore structural assignments can be made when HPLC analysis is combined with exoglycosidase digestions. When the results from a set of glycan exoglycosidase profiles are submitted, autoGU progressively analyses the data, to produce a refined list of final structural assignments that match the digestion data using the preliminary (undigested sample) assignments as the initial dataset (Fig. 3).


Figure 3
View larger version (38K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Fig. 3. autoGU queries GlycoBase to assign preliminary structures to each HPLC peak. The exoglycosidase digestion data is cross-referenced with the digest footprints for each preliminary assignment to produce a final refined assignment which lists those structures that correlate with the entire digest footprint.

 
These glycan tools have been extensively evaluated and shown to assist and improve the accuracy of HPLC-glycan data interpretation (Domann et al., 2007; Saldova et al., 2007). GlycoBase will be updated on a regular basis and will include N- and O-linked glycans from natural sources e.g. immunoglobulins, human serum and modulation of glycosylation in cell culture. We anticipate that the continued growth will further enhance the application of GlycoBase to accurately annotate HPLC-glycan profiles. These tools will be incorporated into the EUROCarbDB framework for the community including the development of databases for additional labelling procedures and HPLC methods.


    ACKNOWLEDGEMENTS
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 DESIGN AND IMPLEMENTATION
 3 APPLICATIONS AND PERSPECTIVES
 ACKNOWLEDGEMENTS
 REFERENCES
 
We are actively working in collaboration with the EUROCarbDB (http://www.eurocarbdb.org) RIDS contract number: 011952. This work was supported by the Oxford Glycobiology Institute endowment. We thank the Wellcome Trust and the Biotechnology and Biological Sciences Research Council for grants to purchase the MALDI-TOF and Q-Tof mass spectrometers that were used in this work.

Conflict of Interest: none declared.


    FOOTNOTES
 
Associate Editor: Limsoon Wong

{dagger}Present address: Dublin-Oxford Glycobiology Laboratory, National Institute for Bioprocessing Research and Training, Conway Institute, University College Dublin, Dublin, Ireland. Back

{ddagger}Present address: Ludger Ltd, Culham Science Centre, Abingdon, Oxfordshire OX14 3EB., UK. Back

Received on February 8, 2008; revised on February 8, 2008; accepted on March 4, 2008

    REFERENCES
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 DESIGN AND IMPLEMENTATION
 3 APPLICATIONS AND PERSPECTIVES
 ACKNOWLEDGEMENTS
 REFERENCES
 

    Anumula KR. High-sensitivity and high-resolution methods for glycoprotein analysis. Anal. Biochem, ( (2000) ) 283, : 17–26.[CrossRef][ISI][Medline].

    Anumula KR. Advances in fluorescence derivatization methods for high-performance liquid chromatographic analysis of glycoprotein carbohydrates. Anal. Biochem, ( (2006) ) 350, : 1–23.[CrossRef][ISI][Medline].

    Cooper CA, et al. GlycoSuiteDB: a curated relational database of glycoprotein glycan structures and their biological sources. 2003 update. Nucleic Acids Res, ( (2003) ) 31, : 511–513.[Abstract/Free Full Text].

    Domann PJ, et al. Separation-based Glycoprofiling approaches using fluorescent labels. Proteomics, ( (2007) ) 7, : 70–76.[CrossRef][Medline].

    Doubet S, Albersheim P. CarbBank. Glycobiology, ( (1992) ) 2, : 505.[Free Full Text].

    Guile GR, et al. A rapid high-resolution high-performance liquid chromatographic method for separating glycan mixtures and analyzing oligosaccharide profiles. Anal. Biochem, ( (1996) ) 240, : 210–226.[CrossRef][ISI][Medline].

    Hashimoto K, et al. KEGG as a glycome informatics resource. Glycobiology, ( (2006) ) 16, : 63R–70R.[Abstract/Free Full Text].

    Lutteke T, et al. GLYCOSCIENCES.de: an Internet portal to support glycomics and glycobiology research. Glycobiology, ( (2006) ) 16, : 71R–81R.[Abstract/Free Full Text].

    Raman R, et al. Advancing glycomics: implementation strategies at the consortium for functional glycomics. Glycobiology, ( (2006) ) 16, : 82R–90R.[Abstract/Free Full Text].

    Royle L, et al. HPLC-based analysis of serum N-glycans on a 96-well plate platform with dedicated database software. In: Anal. Biochem., ( (2007) ) December 23, 2007 [Epub ahead of print]..

    Royle L, et al. Detailed structural analysis of N-glycans released from glycoproteins in SDS-PAGE gel bands using HPLC combined with exoglycosidase array digestions. Methods Mol. Biol, ( (2006) ) 347, : 125–143.[Medline].

    Saldova R, et al. Ovarian cancer is associated with changes in glycosylation in both acute-phase proteins and IgG. Glycobiology, ( (2007) ) 17, : 1344–1356.[Abstract/Free Full Text].

    Tomiya N, et al. Analyses of N-linked oligosaccharides using a two-dimensional mapping technique. Anal. Biochem, ( (1988) ) 171, : 73–90.[CrossRef][ISI][Medline].

    Toukach P, et al. Sharing of worldwide distributed carbohydrate-related digital resources: online connection of the bacterial carbohydrate structure DataBase and GLYCOSCIENCES.de. Nucleic Acids Res, ( (2007) ) 35, : D280–D286.[Abstract/Free Full Text].


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?



This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrowOA All Versions of this Article:
24/9/1214    most recent
btn090v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (1)
Google Scholar
Right arrow Articles by Campbell, M. P.
Right arrow Articles by Rudd, P. M.
PubMed
Right arrow PubMed Citation
Right arrow Articles by Campbell, M. P.
Right arrow Articles by Rudd, P. M.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?