Bioinformatics Vol. 19 no. 14 2003
Pages 1760-1764
© 2003 Oxford University Press
DDBASE2.0: updated domain database with improved identification of structural domains


1 National Centre for Biological Sciences, Tata Institute of Fundamental Research, UAS-GKVK campus, Bellary Road, Bangalore, Karnataka 560 065, India and 2 Department of Biochemistry, University of Cambridge, Tennis Court Road, Cambridge, CB21GA, UK
Received on October 27, 2002
; revised on March 3, 2003
; accepted on March 28, 2003
Motivation: Although many methods are available for the identification of structural domains from protein three-dimensional structures, accurate definition of protein domains and the curation of such data for a large number of proteins are often possible only after manual intervention. The availability of domain definitions for protein structural entries is useful for the sequence analysis of aligned domains, structure comparison, fold recognition procedures and understanding protein folding, domain stability and flexibility.
Results: We have improved our method of domain identification starting from the concept of clustering secondary structural elements, but with an intention of reducing the number of discontinuous segments in identified domains. The results of our modified and automatic approach have been compared with the domain definitions from other databases. On a test data set of 55 proteins, this method acquires high agreement (88%) in the number of domains with the crystallographers' definition and resources such as SCOP, CATH, DALI, 3Dee and PDP databases. This method also obtains 98% overlap score with the other resources in the definition of domain boundaries of the 55 proteins. We have examined the domain arrangements of 4592 non-redundant protein chains using the improved method to include 5409 domains leading to an update of the structural domain database.
Availability: The latest version of the domain database and online domain identification methods are available from http://www.ncbs.res.in/~faculty/mini/ddbase/ddbase.html
Supplementary information: http://www.ncbs.res.in/~faculty/mini/ddbase/supplementary/supplementary.html
Contact: mini{at}ncbs.res.in
* To whom correspondence should be addressed.
Present address: Department of Molecular Biophysics, German Cancer Research Centre (DKFZ), Im Neuenheimer Feld 280, 69120, Heidelberg, Germany.
Present address: Celltech R&D, Inc., 1631 220th Street SE, Bothell, WA 98021, USA.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
G. Pugalenthi, G. Archunan, and R. Sowdhamini DIAL: a web-based server for the automatic identification of structural domains in proteins Nucleic Acids Res., July 1, 2005; 33(suppl_2): W130 - W132. [Abstract] [Full Text] [PDF] |
||||
