Skip Navigation

This Article
Right arrow Full Text (Print PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Mamitsuka, H.
Right arrow Articles by Yamanishi, K.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Mamitsuka, H.
Right arrow Articles by Yamanishi, K.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© Oxford University Press

{alpha}-Helix region prediction with stochastic rule learning

Hiroshi Mamitsuka 2 and Kenji Yamanishi 1

C&C Research Laboratories, NEC Corporation, 1-I Miyazaki 4-chome Miyamae-ku, Kawasaki, Kanagawa 216, Japan
1NEC Research Institute, Inc. 4 Independence Way, Princeton, NJ 08540, USA

2 To whom correspondence should be addressed

We propose a new method, based on the theory of stochastic rule learning, for predicting {alpha}-helix regions in a given protein sequence. Our method (hereafter referred to as the SR method) produces stochastic rules, each of which assigns, to any region in an amino acid sequence, the probability that it is an {alpha}-helix region. When learning a stochastic rule from a particular {alpha}-helix region, our method makes use of positive training examples obtained from a number of regions that are homologous to that region. Each stochastic rule is optimized using the minimum description length (MDL) principle, and such optimized stochastic rules are used to predict {alpha}-helix regions of any given protein sequence. In our experiments, using 25 proteins selected from the HSSP database as training examples, we applied the SR method to the problem of predicting {alpha}-helix regions in test examples, which consisted of >5000 residues with 38% {alpha}-helix content. Each of these test examples possesses <25% homology to any proteins in the training and other test examples. Our method achieved 81% average prediction accuracy for the test examples; this compares favorably to Qian and Sejnowski's method, which attains no more than 75% average accuracy, and further which compares to Rost and Sander's method which has proven to be one of the best secondary structure prediction methods.



Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?




Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.