Bioinformatics Advance Access originally published online on September 7, 2009
Bioinformatics 2009 25(22):2913-2920; doi:10.1093/bioinformatics/btp532
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Predicting homologous signaling pathways using machine learning
Department of Computing Science, University of Alberta, Edmonton, AB, Canada T6G 2E8
*To whom correspondence should be addressed.
| Abstract |
|---|
Motivation: In general, each cell signaling pathway involves many proteins, each with one or more specific roles. As they are essential components of cell activity, it is important to understand how these proteins work—and in particular, to determine which of the species' proteins participate in each role. Experimentally determining this mapping of proteins to roles is difficult and time consuming. Fortunately, many pathways are similar across species, so we may be able to use known pathway information of one species to understand the corresponding pathway of another.
Results: We present an automatic approach, Predict Signaling Pathway (PSP), which uses the signaling pathways in well-studied species to predict the roles of proteins in less-studied species. We use a machine learning approach to create a predictor that achieves a generalization F-measure of 78.2% when applied to 11 different pathways across 14 different species. We also show our approach is very effective in predicting the pathways that have not yet been experimentally studied completely.
Availability: The list of predicted proteins for all pathways over all considered species is available at http://www.cs.ualberta.ca/~bioinfo/signaling.
Contact: bioinfo{at}cs.ualberta.ca; duane{at}cs.ualberta.ca
Associate Editor: Alfonso Valencia
Received on May 11, 2009; revised on September 1, 2009; accepted on September 3, 2009