Bioinformatics Advance Access originally published online on January 29, 2004
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Bioinformatics 20(5) © Oxford University Press 2004; all rights reserved.
Predicting protein structure classes from function predictions


1 Department of Computational Biology and Applied Algorithmics, Max-Planck-Institute for Informatics, Stuhlsatzenhausweg 85, Saarbrücken D-66123, Germany and 2 Center for Biological Sequence Analysis, BioCentrum-DTU, The Technical University of Denmark, Building 208, Lyngby DK-2800, Denmark
Received on August 8, 2003
; revised on October 1, 2003
; accepted on October 14, 2003
Advance Access Publication January 29, 2004
Motivation: We introduce a new approach to using the information contained in sequence-to-function prediction data in order to recognize protein template classes, a critical step in predicting protein structure. The data on which our method is based comprise probabilities of functional categories; for given query sequences these probabilities are obtained by a neural net that has previously been trained on a variety of functionally important features. On a training set of sequences we assess the relevance of individual functional categories for identifying a given structural family. Using a combination of the most relevant categories, the likelihood of a query sequence to belong to a specific family can be estimated.
Results: The performance of the method is evaluated using cross-validation. For a fixed structural family and for every sequence, a score is calculated that measures the evidence for family membership. Even for structural families of small size, family members receive significantly higher scores. For some examples, we show that the relevant functional features identified by this method are biologically meaningful. The proposed approach can be used to improve existing sequence-to-structure prediction methods.
Availability: Matlab code is available on request from the authors. The data are available at http://www.mpi-sb.mpg.de/~sommer/Fun2Struc/
Contact: sommer{at}mpi-sb.mpg.de
* To whom correspondence should be addressed.
The authors wish it to be known that, in their opinion, the first two authors should be regarded as joint First Authors.