Bioinformatics Vol. 19 no. 11 2003
Pages 1381-1390
© 2003 Oxford University Press
Identifying property based sequence motifs in protein families and superfamilies: application to DNase-1 related endonucleases
1 Sealy Center for Structural Biology, Department of Human Biological Chemistry and Genetics, University of Texas Medical Branch, Galveston, TX 77555-1157, USA
Received on September 23, 2002
; revised on January 2, 2003
; accepted on January 28, 2003
Motivation: Identification of short conserved sequence motifs common to a protein family or superfamily can be more useful than overall sequence similarity in suggesting the function of novel gene products. Locating motifs still requires expert knowledge, as automated methods using stringent criteria may not differentiate subtle similarities from statistical noise.
Results: We have developed a novel automatic method, based on patterns of conservation of 237 physicalchemical properties of amino acids in aligned protein sequences, to find related motifs in proteins with little or no overall sequence similarity. As an application, our web-server MASIA identified 12 property-based motifs in the apurinic/apyrimidinic endonuclease (APE) family of DNA-repair enzymes of the DNase-I superfamily. Searching with these motifs located distantly related representatives of the DNase-I superfamily, such as Inositol 5'-polyphosphate phosphatases in the ASTRAL40 database, using a Bayesian scoring function. Other proteins containing APE motifs had no overall sequence or structural similarity. However, all were phosphatases and/or had a metal ion binding active site. Thus our automated method can identify discrete elements in distantly related proteins that define local structure and aspects of function. We anticipate that our method will complement existing ones to functionally annotate novel protein sequences from genomic projects.
Availability: MASIA WEB site: http://www.scsb.utmb.edu/masia/masia.html
Contact: werner{at}newton.utmb.edu
Supplementary information: The dendrogram of 42 APE sequences used to derive motifs is available on http://www.scsb.utmb.edu/comp_biol.html/DNA_repair/publication.html
* To whom correspondence should be addressed.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
L. Ma, M. Xu, J. R. Forman, J. Clarke, and A. F. Oberhauser Naturally Occurring Mutations Alter the Stability of Polycystin-1 Polycystic Kidney Disease (PKD) Domains J. Biol. Chem., November 20, 2009; 284(47): 32942 - 32949. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Oezguen, S. Kumar, A. Hindupur, W. Braun, B. K. Muralidhara, and J. R. Halpert Identification and Analysis of Conserved Sequence Motifs in Cytochrome P450 Family 2: FUNCTIONAL AND STRUCTURAL ROLE OF A MOTIF 187RFDYKD192 IN CYP2B ENZYMES J. Biol. Chem., August 1, 2008; 283(31): 21808 - 21816. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Vongpunsawad, N. Oezgun, W. Braun, and R. Cattaneo Selectively Receptor-Blind Measles Viruses: Identification of Residues Necessary for SLAM- or CD46-Induced Fusion and Their Localization on a New Hemagglutinin Structural Model J. Virol., January 1, 2004; 78(1): 302 - 313. [Abstract] [Full Text] [PDF] |
||||

