Bioinformatics Vol. 17 no. 90001 2001
Pages S83-S89
© 2001 Oxford University Press
An insight into domain combinations
1 MRC Laboratory of Molecular Biology, Hills
Road, Cambridge, CB2 2QH, UK
2 Department of Biochemistry & Molecular
Biology, University College London, Darwin Bldg., Gower Street,
London, WC1E 6BT, UK
Received on February 6, 2001
; revised on March 29, 2001
; accepted on March 29, 2001
Domains are the building blocks of all globular proteins, and are units of compact three-dimensional structure as well as evolutionary units. There is a limited repertoire of domain families, so that these domain families are duplicated and combined in different ways to form the set of proteins in a genome. Proteins are gene products. The processes that produce new genes are duplication and recombination as well as gene fusion and fission. We attempt to gain an overview of these processes by studying the structural domains in the proteins of seven genomes from the three kingdoms of life: Eubacteria, Archaea and Eukaryota. We use here the domain and superfamily definitions in Structural Classification of Proteins Database (SCOP) in order to map pairs of adjacent domains in genome sequences in terms of their superfamily combinations. We find 624 out of the 764 superfamilies in SCOP in these genomes, and the 624 families occur in 585 pairwise combinations. Most families are observed in combination with one or two other families, while a few families are very versatile in their combinatorial behaviour. This type of pattern can be described by a scale-free network. Finally, we study domain repeats and we compare the set of the domain combinations in the genomes to those in PDB, and discuss the implications for structural genomics.
Contact: apic{at}mrc-lmb.cam.ac.uk
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
B. Lee and D. Lee DAhunter: a web-based server that identifies homologous proteins by comparing domain architecture Nucleic Acids Res., July 1, 2008; 36(suppl_2): W60 - W64. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. H. B. Maegawa, M. Tropak, J. Buttner, T. Stockley, F. Kok, J. T. R. Clarke, and D. J. Mahuran Pyrimethamine as a Potential Pharmacological Chaperone for Late-onset Forms of GM2 Gangliosidosis J. Biol. Chem., March 23, 2007; 282(12): 9150 - 9161. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Wang and G. Caetano-Anolles Global Phylogeny Determined by the Combination of Protein Domains in Proteomes Mol. Biol. Evol., December 1, 2006; 23(12): 2444 - 2454. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Lin, L. Zhu, and D.-Y. Zhang An initial strategy for comparing proteins at the domain architecture level Bioinformatics, September 1, 2006; 22(17): 2081 - 2086. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Weiner 3rd and E. Bornberg-Bauer Evolution of Circular Permutations in Multidomain Proteins Mol. Biol. Evol., April 1, 2006; 23(4): 734 - 743. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Park, S. Lee, D. Bolser, M. Schroeder, M. Lappe, D. Oh, and J. Bhak Comparative interactomics analysis of protein family interaction networks using PSIMAP (protein structural interactome map) Bioinformatics, August 1, 2005; 21(15): 3234 - 3240. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Liu, S. Wu, H. Walch, and A. Grigoriev Exon-domain correlation and its corollaries Bioinformatics, August 1, 2005; 21(15): 3213 - 3216. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Fernandez, R. Scott, and R. S. Berry The nonconserved wrapping of conserved protein folds reveals a trend toward increasing connectivity in proteomic networks PNAS, March 2, 2004; 101(9): 2823 - 2827. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Fleming, A. Muller, R. M. MacCallum, and M. J. E. Sternberg 3D-GENOMICS: a database to compare structural and functional annotations of proteins between sequenced genomes Nucleic Acids Res., January 1, 2004; 32(90001): D245 - 250. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Caetano-Anolles and D. Caetano-Anolles An Evolutionarily Structured Universe of Protein Architecture Genome Res., July 1, 2003; 13(7): 1563 - 1571. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Pawson and P. Nash Assembly of Cell Regulatory Systems Through Protein Interaction Domains Science, April 18, 2003; 300(5618): 445 - 452. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Coin, A. Bateman, and R. Durbin Enhanced protein domain discovery by using language modeling techniques from speech recognition PNAS, April 15, 2003; 100(8): 4516 - 4520. [Abstract] [Full Text] [PDF] |
||||






