Bioinformatics Vol. 16 no. 7 2000
Pages 613-618
© 2000 Oxford University Press
Original Paper |
Domain size distributions can predict domain boundaries
1 National Center for Biotechnology
Information, National Library of Medicine, National Institutes of
Health, Bethesda, Maryland 20894, USA
2 Department of Molecular Biology and
Genetics, The Johns Hopkins University School of Medicine,
Baltimore, Maryland 21205, USA
Received on July 17, 1999
; revised on January 9, 2000
; accepted on February 10, 2000
Motivation: The sizes of protein domains observed in the 3D-structure database follow a surprisingly narrow distribution. Structural domains are furthermore formed from a single-chain continuous segment in over 80% of instances. These observations imply that some choices of domain boundaries on an otherwise uncharacterized sequence are more likely than others, based solely on the size and segment number of predicted domains. This property might be used to guess the locations of protein domain boundaries.
Results: To test this possibility we enumerate putative domain boundaries and calculate their relative likelihood under a probability model that considers only the size and segment number of predicted domains. We ask, in a cross-validated test using sequences with known 3D structure, whether the most likely guesses agree with the observed domain structure. We find that domain boundary predictions are surprisingly successful for sequences up to 400 residues long and that guessing domain boundaries in this way can improve the sensitivity of threading analysis.
Availability: The DGS algorithm, for Domain Guess by Size, is available as a web service at http://www.ncbi.nlm.nih.gov/dgs. This site also provides the DGS source code.
Contact: bryant{at}ncbi.nlm.nih.gov
* To whom correspondence should be addressed.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
B. W. Neuman, J. S. Joseph, K. S. Saikatendu, P. Serrano, A. Chatterjee, M. A. Johnson, L. Liao, J. P. Klaus, J. R. Yates III, K. Wuthrich, et al. Proteomics Analysis Unravels the Functional Repertoire of Coronavirus Nonstructural Protein 3 J. Virol., June 1, 2008; 82(11): 5279 - 5294. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. N.I. Pang, K. Lin, M. A. Wouters, J. Heringa, and R. A. George Identifying foldable regions in protein sequence from the hydrophobic signal Nucleic Acids Res., February 2, 2008; 36(2): 578 - 588. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Cheng DOMAC: an accurate, hybrid protein domain prediction server Nucleic Acids Res., July 13, 2007; 35(suppl_2): W354 - W356. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. M. Phuong, C. B. Do, R. C. Edgar, and S. Batzoglou Multiple alignment of protein sequences with repeats and rearrangements Nucleic Acids Res., November 6, 2006; 34(20): 5932 - 5942. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Lozada-Chavez, S. C. Janga, and J. Collado-Vides Bacterial regulatory networks are extremely flexible in evolution Nucleic Acids Res., July 13, 2006; 34(12): 3434 - 3445. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. E. Gewehr and R. Zimmer SSEP-Domain: protein domain prediction by alignment of secondary structure elements and profiles Bioinformatics, January 15, 2006; 22(2): 181 - 187. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. A. George, K. Lin, and J. Heringa Scooby-domain: prediction of globular domains in protein sequence Nucleic Acids Res., July 1, 2005; 33(suppl_2): W160 - W163. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Liu and B. Rost Sequence-based prediction of protein domains Nucleic Acids Res., July 7, 2004; 32(12): 3522 - 3530. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. E. Kim, D. Chivian, and D. Baker Protein structure prediction and analysis using the Robetta server Nucleic Acids Res., July 1, 2004; 32(suppl_2): W526 - W531. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. V. Galzitskaya and B. S. Melnik Prediction of protein domain boundaries from sequence alone Protein Sci., April 1, 2003; 12(4): 696 - 701. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Pintar, O. Carugo, and S. Pongor Atom Depth as a Descriptor of the Protein Interior Biophys. J., April 1, 2003; 84(4): 2553 - 2561. [Abstract] [Full Text] [PDF] |
||||
![]() |
J.-t. Guo, D. Xu, D. Kim, and Y. Xu Improving the performance of DomainParser for structural domain partition using neural network Nucleic Acids Res., February 1, 2003; 31(3): 944 - 952. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. J. Rigden Use of covariance analysis for the prediction of structural domain boundaries from multiple protein sequence alignments Protein Eng. Des. Sel., February 1, 2002; 15(2): 65 - 77. [Abstract] [Full Text] [PDF] |
||||





