Skip Navigation


Bioinformatics Advance Access originally published online on December 14, 2004
Bioinformatics 2005 21(8):1301-1303; doi:10.1093/bioinformatics/bti206
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow Supplementary data
Right arrow All Versions of this Article:
21/8/1301    most recent
bti206v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (18)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Bateman, A.
Right arrow Articles by Yeats, C.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Bateman, A.
Right arrow Articles by Yeats, C.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2004. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions{at}oupjournals.org

The G5 domain: a potential N-acetylglucosamine recognition domain involved in biofilm formation

Alex Bateman *, Matthew T. G. Holden and Corin Yeats

The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus Hinxton, Cambridge CB10 1SA, UK

*To whom correspondence should be addressed.


    Abstract
 TOP
 Abstract
 INTRODUCTION
 METHODS AND RESULTS
 CONCLUSIONS
 SUPPLEMENTARY DATA
 REFERENCES
 

Summary: Biofilms are complex microbial communities found at surfaces that are often associated with extracellular polysaccharides. Biofilm formation is a complex process that is being understood at the molecular level only recently. We have identified a novel domain that we call the G5 domain (named after its conserved glycine residues), which is found in a variety of enzymes such as Streptococcal IgA peptidases and various glycosyl hydrolases in bacteria. The G5 domain is found in the Accumulation Associated Protein (AAP), which is an important component in biofilm formation in Staphylococcus aureus. A common feature of the proteins containing G5 domains is N-acetylglucosamine binding, and we attribute this function to the G5 domain.

Contact: agb{at}sanger.ac.uk


    INTRODUCTION
 TOP
 Abstract
 INTRODUCTION
 METHODS AND RESULTS
 CONCLUSIONS
 SUPPLEMENTARY DATA
 REFERENCES
 
In nature, a large fraction of bacteria are found to be associated with surfaces. Bacteria attached to a surface form a biofilm (O'toole et al., 2000; Sauer, 2003) that protects the bacteria from harsh physical conditions and forces. Biofilms also provide protection from attack by antibiotics and the immune system. The formation of biofilms occurs in several steps. Initially adherence to a surface (adhesion) is followed by microcolony formation (growth and recruitment) leading to maturation (structures). Biofilms are also medically important since they lead to chronic infection of implants and recalcitrance to antimicrobial therapy. Here, we discover a novel domain found in a wide variety of proteins including some involved in biofilm formation that may help to dissect the function of these proteins.


    METHODS AND RESULTS
 TOP
 Abstract
 INTRODUCTION
 METHODS AND RESULTS
 CONCLUSIONS
 SUPPLEMENTARY DATA
 REFERENCES
 
The G5 domain was first noticed as a family in the Pfam-B database. Pfam-B is an automatically generated clustering of proteins that is supplemental to the Pfam resource of protein family alignments and profile hidden Markov models (HMMs) (Bateman et al., 2004). Pfam-B is directly derived from PRODOM and entry Pfam-B_921 corresponds to PRODOM entry PD005454 in 2002.1 release (Corpet et al., 2000). This Pfam-B family contained 35 members found in a wide variety of different domain architectures in proteins associated with bacterial pathogenesis and antibiotic resistance. To detect further members of the family HMMs were built using the HMMer software (http://hmmer.wustl.edu/) from the Pfam-B alignment in both global (ls) and local (fs) modes and iteratively searched against Swiss-Prot and TrEMBL until no new sequences were found. Inclusion E-value thresholds of 0.1 and 0.01 were set for global and local HMMs, respectively. This resulted in the detection of 90 copies of the domain, which was added to the Pfam database with accession number PF07501. PSI-blast searches were carried out from a number of starting points to validate the alignment and the membership of the original Pfam-B alignment.

This region ranges in size from 76 residues in the beta galactosidase protein from Streptococcus pneumoniae (Q8L3E9) to 84 residues in the Methicillin-resistant surface protein from Staphylococcus aureus (P80544 [GenBank] ). The alignment of the G5 domain contains a few highly conserved residues, see multiple sequence alignment shown in Figure 1. None of these conserved residues is the polar types of amino acids found in active sites, hence it seems unlikely that this region has an enzymatic function. However, in nearly all cases the G5 domain is associated with a known enzymatic domain. Therefore, the G5 domain may confer localization or substrate specificity on the proteins in which it is found. Other alternative functions could be allosteric regulation of the enzymatic domain or co-factor binding.



View larger version (46K):
[in this window]
[in a new window]
 
Fig. 1 An alignment of representative G5 domains. The alignment was generated using MAFFT (Katoh et al., 2002) with manual adjustment. The alignment has been coloured using Chroma with the default colouring scheme (Goodstadt and Ponting, 2001). The Swiss-Prot accession numbers for the proteins in the alignment are as follows: PLS_STAAU (P80544 [GenBank] ), BGAA-STRPN (Q8DQP4), YABE_BACSU (P37546 [GenBank] ), TTE2691-THETE (Q8R6U3), BL0658-BIFLO (Q8G6I2), STRH_STRPN (Q8DRL6), SPR1403-STRPN (Q8CYI8), ENDD-STRPN (Q93HW0), IGA-STRPN (Q97QP7), SCO3150-STRCO (Q8CJX5) and VANW-CLOPE (Q8XKT0).

 
The Pls protein from S.aureus (Swiss: PLS_STAAU, P80544 [GenBank] ) contains five copies of the G5 domain. (Fig. 2). The Pls protein is found to be cleaved between residues 387 and 388 by plasmin (hence the Pls name after Plasmin sensitive) (Savolainen et al., 2001). The pls gene is carried on a mobile genetic element (SCCmec type I) containing the ß-lactam resistance gene, mecA, that are associated with some methicillin resistance S.aureus (MRSA) strains (Ito et al., 2001). Staphylococcus epidermis also contains a number of proteins related to Pls (Hussain et al., 1997) that contain between four and seven G5 domains. These proteins have been called accumulation-associated proteins and they appear to have a role in biofilm formation. It has been suggested that these proteins bind to PIA, the ß-1,6-linked glucosaminoglycan that is the key matrix component of biofilms in S.epidermis. The biofilm-binding capacity of the Pls may therefore be crucial for the ability of S.epidermidis cells to be recruited and integrated into a maturing biofilm. As a corollary to this, the presence of the pls gene on the SCCmec type I element may enhance the antibiotic resistance by promoting biofilm formation in some MRSA strains.



View larger version (53K):
[in this window]
[in a new window]
 
Fig. 2 A schematic view of the domain architectures of G5 domain containing proteins. Domain names are shown in the key with Pfam (Bateman et al., 2002) accessions. See Figure 1 for protein accession numbers. This figure can be viewed in colour on Bioinformatics online.

 
Some strains of S.aureus also contain a homologue of Pls, SasG, that contain seven G5 domains. In addition to the G5 domains, both Pls and SasG contain an N-terminal domain that has been demonstrated to be involved with host ligand-binding (Roche et al., 2003). We have found that this region is likely to be a lectin domain. Adherence assays have demonstrated that both SasG and Pls promote binding to human desquamated nasal epithelium cells, and that it is the N-terminal domain that binds an as yet unidentified ligand on the epithelium cells (Roche et al., 2003). Recombinant protein studies have demonstrated that G5 domain containing region of the SasG is not involved in adherence to the host molecule. In addition to its ability to promote host–cell interactions, the SasG protein also causes bacteria cell–cell interaction. Expression of sasG in recombinant Lactococcus lactis caused aggregation, whereas aggregation was absent in the wild-type cells lacking sasG (Roche et al., 2003).

IgA1 peptidases are important proteins for pathogens that colonize mucosal surfaces. To evade the host immune defences, these bacteria produce a peptidase that cleaves IgA1—the predominant immunoglobulin of mucosal surfaces. This activity has evolved independently multiple times, since IgA1 peptidases from different classes of bacteria are unrelated. The IgA1 peptidase from streptococci are metallopeptidases that belong to family M26 (Rawlings et al., 2002).

The G5 domain is found in a number of S.pneumoniae glycosyl hydrolase enzymes. These enzymes are thought to help the organism break down oligosaccharides in its environment to provide a carbon source. ß-N-Acetylglucosaminidase known as StrH (Swiss:STRH_STRPN, Q8DRL6) contains two G5 domains as well as two domains of the glycosyl hydrolase 20 family, which catalyse the hydrolysis of terminal non-reducing N-acetyl-D-hexosamine residues in N-acetyl-ß-D-hexosaminides (Clarke et al., 1995).

EndoD (endo-ß-N-acetylglucosaminidase) from S.pneumoniae cleaves the di-N-acetylchitobiose structure (a 1–4 linked disaccharide of N-acetylglucosamine) in N-linked oligosaccharides and generally acts on complex oligosaccharides after the removal of external sugars by ß-galactosidase and ß-N-acetylglucosaminidase. The catalytic activity of this protein residues in the N-terminal region that belongs to glycosyl hydrolase family 85. Muramatsu et al. (2001) noted that the C-terminus of EndoD was related to S.pneumoinae ß-galactosidase in the region, which contains the G5 domain. The G5 domain seems likely to bind to the N-acetylglucosamine residues of di-N-chitobiose providing substrate specificity for the enzyme. In other proteins, this binding function could lead to a more general localization to the cell surface and provide an adhesive function.

The G5 domain is found at the C-terminus of some members of the VanW protein family. VanW is found as part of the Vancomycin resistance gene cluster, but the function for these proteins has not yet been identified. Vancomycin is often seen as a drug of last resort because some bacterial strains have become resistant to so many other antibiotics. Vancomycin action occurs by binding to the peptide component of unlinked peptidoglycan and blocking the action of transpeptidase enzymes. If VanW does have a direct or indirect role in Vancomycin resistance then the potential function of the G5 domain in binding NAG could provide localization to the NAG component of peptidoglycan. VanW may bind to other Vancomycin resistance proteins and help to localize them to unlinked peptidoglycan chains. The G5 domain containing VanW proteins are specific to clostridial species and Deinococcus and are not found in the clinically important enterococci strains.


    CONCLUSIONS
 TOP
 Abstract
 INTRODUCTION
 METHODS AND RESULTS
 CONCLUSIONS
 SUPPLEMENTARY DATA
 REFERENCES
 
The G5 domain is currently found widely distributed in both high- and low-GC gram positive eubacterial genomes. It is particularly expanded in the streptococcal genomes, with many of the domain architectures only seen in this group of bacteria. Based on the ability of AAP to bind PIA as well as the appearance of G5 domains in StrH and EndoD that act on N-acetylglucosamine-related substrates, we have suggested that this domain may bind NAG. Thus, we suggest that G5 domain is an important molecular component in a variety of biological situations including cell wall degradation as well as biofilm formation.


    SUPPLEMENTARY DATA
 TOP
 Abstract
 INTRODUCTION
 METHODS AND RESULTS
 CONCLUSIONS
 SUPPLEMENTARY DATA
 REFERENCES
 
Supplementary data for this paper are available on Bioinformatics online.


    Acknowledgments
 
We would also like to thank Neil Rawlings and David Studholme for useful comments on the manuscript. A.B. is supported by the Wellcome Trust.

Received on October 22, 2004; revised on December 2, 2004; accepted on December 2, 2004

    REFERENCES
 TOP
 Abstract
 INTRODUCTION
 METHODS AND RESULTS
 CONCLUSIONS
 SUPPLEMENTARY DATA
 REFERENCES
 

    Bateman, A., Birney, E., Cerruti, L., Durbin, R., Etwiller, L., Eddy, S.R., Griffiths-Jones, S., Howe, K.L., Marshall, M., Sonnhammer, E.L.L. (2002) The Pfam protein families database. Nucleic Acids Res., 30, 276–280[Abstract/Free Full Text].

    Bateman, A., Coin, L., Durbin, R., Finn, R.D., Hollich, V., Griffiths-Jones, S., Khanna, A., Marshall, M., Moxon, S., Sonnhammer, E.L., et al. (2004) The Pfam protein families database. Nucleic Acids Res., 32, D138–D141[Abstract/Free Full Text].

    Clarke, V.A., Platt, N., Butters, T.D. (1995) Cloning and expression of the beta-N-acetylglucosaminidase gene from Streptococcus pneumoniae. Generation of truncated enzymes with modified aglycon specificity. J. Biol. Chem., 270, 8805–8814[Abstract/Free Full Text].

    Corpet, F., Servant, F., Gouzy, J., Kahn, D. (2000) ProDom and ProDom-CG: tools for protein domain analysis and whole genome comparisons. Nucleic Acids Res., 28, 267–269[Abstract/Free Full Text].

    Goodstadt, L. and Ponting, C.P. (2001) CHROMA: consensus-based colouring of multiple alignments for publication. Bioinformatics, 17, 845–846[Abstract/Free Full Text].

    Hussain, M., Herrmann, M., von Eiff, C., Perdreau-Remington, F., Peters, G. (1997) A 140-kilodalton extracellular protein is essential for the accumulation of Staphylococcus epidermidis strains on surfaces. Infect. Immun., 65, 519–524[Abstract/Free Full Text].

    Ito, T., Katayama, Y., Asada, K., Mori, N., Tsutsumimoto, K., Tiensasitorn, C., Hiramatsu, K. (2001) Structural comparison of three types of staphylococcal cassette chromosome mec integrated in the chromosome in methicillin-resistant. Staphylococcus aureus. Antimicrob. Agents Chemother., 45, 1323–1336.

    Katoh, K., Misawa, K., Kuma, K., Miyata, T. (2002) MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res., 30, 3059–3066[Abstract/Free Full Text].

    Muramatsu, H., Tachikui, H., Ushida, H., Song, X., Qiu, Y., Yamamoto, S., Muramatsu, T. (2001) Molecular cloning and expression of endo-beta-N-acetylglucosaminidase D, which acts on the core structure of complex type asparagine-linked oligosaccharides. J. Biochem. (Tokyo), 129, 923–928[Abstract/Free Full Text].

    O'Toole, G., Kaplan, H.B., Kolter, R. (2000) Biofilm formation as microbial development. Annu. Rev. Microbiol., 54, 49–79[CrossRef][Web of Science][Medline].

    Rawlings, N.D., O'Brien, E., Barrett, A.J. (2002) MEROPS: the protease Database. Nucleic Acids Res., 30, 343–346[Abstract/Free Full Text].

    Roche, F.M., Meehan, M., Foster, T.J., Massey, R., Peacock, S.J., Day, N.P., Visai, L., Speziale, P., Lam, A., Pallen, M. (2003) The Staphylococcus aureus surface protein SasG and its homologues promote bacterial adherence to human desquamated nasal epithelial cells. Microbiology, 149, 2759–2767[Abstract/Free Full Text].

    Sauer, K. (2003) The genomics and proteomics of biofilm formation. Genome Biol., 4, 219[CrossRef][Medline].

    Savolainen, K., Paulin, L., Westerlund-Wikstrom, B., Foster, T.J., Korhonen, T.K., Kuusela, P. (2001) Expression of pls, a gene closely associated with the mecA gene of methicillin-resistant Staphylococcus aureus, prevents bacterial adhesion in vitro. Infect. Immun., 69, 3013–3020[Abstract/Free Full Text].


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Infect. Immun.Home page
H. J. Petersen, C. Keane, H. F. Jenkinson, M. M. Vickerman, A. Jesionowski, J. C. Waterhouse, D. Cox, and S. W. Kerrigan
Human Platelets Recognize a Novel Surface Protein, PadA, on Streptococcus gordonii through a Unique Interaction Involving Fibrinogen Receptor GPIIbIIIa
Infect. Immun., January 1, 2010; 78(1): 413 - 422.
[Abstract] [Full Text] [PDF]


Home page
Antimicrob. Agents Chemother.Home page
L. Kalan, S. Ebert, T. Kelly, and G. D. Wright
Noncanonical Vancomycin Resistance Cluster from Desulfitobacterium hafniense Y51
Antimicrob. Agents Chemother., July 1, 2009; 53(7): 2841 - 2845.
[Abstract] [Full Text] [PDF]


Home page
J Med MicrobiolHome page
N. T. Stevens, C. M. Greene, J. P. O'Gara, and H. Humphreys
Biofilm characteristics of Staphylococcus epidermidis isolates associated with device-related meningitis
J. Med. Microbiol., July 1, 2009; 58(7): 855 - 862.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
N. J. Croucher, D. Walker, P. Romero, N. Lennard, G. K. Paterson, N. C. Bason, A. M. Mitchell, M. A. Quail, P. W. Andrew, J. Parkhill, et al.
Role of Conjugative Elements in the Evolution of the Multidrug-Resistant Pandemic Clone Streptococcus pneumoniaeSpain23F ST81
J. Bacteriol., March 1, 2009; 191(5): 1480 - 1489.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
D. G. Conrady, C. C. Brescia, K. Horii, A. A. Weiss, D. J. Hassett, and A. B. Herr
A zinc-dependent adhesion module is responsible for intercellular adhesion in staphylococcal biofilms
PNAS, December 9, 2008; 105(49): 19456 - 19461.
[Abstract] [Full Text] [PDF]


Home page
MicrobiologyHome page
N. Takenouchi-Ohkubo, L. M. Mortensen, K. R. Drasbek, M. Kilian, and K. Poulsen
Horizontal transfer of the immunoglobulin A1 protease gene (iga) from Streptococcus to Gemella haemolysans
Microbiology, July 1, 2006; 152(7): 2171 - 2180.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
A. Bateman
Bioinformatics--The new home for protein sequence motifs
Bioinformatics, January 1, 2006; 22(1): 2 - 2.
[Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow Supplementary data
Right arrow All Versions of this Article:
21/8/1301    most recent
bti206v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (18)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Bateman, A.
Right arrow Articles by Yeats, C.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Bateman, A.
Right arrow Articles by Yeats, C.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?