Skip Navigation


Bioinformatics Advance Access originally published online on January 22, 2004
This Article
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow FREE Full Text (Screen PDF)
Right arrow All Versions of this Article:
20/4/547    most recent
btg447v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (89)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Lu, Z.
Right arrow Articles by Eisner, R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Lu, Z.
Right arrow Articles by Eisner, R.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Bioinformatics 20(4) © Oxford University Press 2004; all rights reserved.

Predicting subcellular localization of proteins using machine-learned classifiers

Z. Lu , D. Szafron *, R. Greiner , P. Lu , D.S. Wishart , B. Poulin , J. Anvik , C. Macdonell and R. Eisner

Department of Computing Science, University of Alberta, Edmonton, AB, Canada, T6G 2E8

Received on August 26, 2003 ; accepted on September 25, 2003
Advance Access Publication January 22, 2004

Motivation: Identifying the destination or localization of proteins is key to understanding their function and facilitating their purification. A number of existing computational prediction methods are based on sequence analysis. However, these methods are limited in scope, accuracy and most particularly breadth of coverage. Rather than using sequence information alone, we have explored the use of database text annotations from homologs and machine learning to substantially improve the prediction of subcellular location.

Results: We have constructed five machine-learning classifiers for predicting subcellular localization of proteins from animals, plants, fungi, Gram-negative bacteria and Gram-positive bacteria, which are 81% accurate for fungi and 92–94% accurate for the other four categories. These are the most accurate subcellular predictors across the widest set of organisms ever published. Our predictors are part of the Proteome Analyst web-service.

Availability: http://www.cs.ualberta.ca/~bioinfo/PA/Sub, http://www.cs.ualberta.ca/~bioinfo/PA

Supplementary information: http://www.cs.ualberta.ca/~bioinfo/PA/Subcellular

Contact: bioinfo{at}cs.ualberta.ca

* To whom correspondence should be addressed.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
A. Fyshe, Y. Liu, D. Szafron, R. Greiner, and P. Lu
Improving subcellular localization prediction using text classification and the gene ontology
Bioinformatics, November 1, 2008; 24(21): 2512 - 2517.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
I. Alves-Pereira, J. Canales, A. Cabezas, P. Martin Cordero, M. J. Costas, and J. C. Cameselle
CDP-Alcohol Hydrolase, a Very Efficient Activity of the 5'-Nucleotidase/UDP-Sugar Hydrolase Encoded by the ushA Gene of Yersinia intermedia and Escherichia coli
J. Bacteriol., September 15, 2008; 190(18): 6153 - 6161.
[Abstract] [Full Text] [PDF]


Home page
J Exp BotHome page
G. Y. Busot, B. McClure, C. P. Ibarra-Sanchez, K. Jimenez-Duran, S. Vazquez-Santana, and F. Cruz-Garcia
Pollination in Nicotiana alata stimulates synthesis and transfer to the stigmatic surface of NaStEP, a vacuolar Kunitz proteinase inhibitor homologue
J. Exp. Bot., August 1, 2008; 59(11): 3187 - 3201.
[Abstract] [Full Text] [PDF]


Home page
Infect. Immun.Home page
F. Galka, S. N. Wai, H. Kusch, S. Engelmann, M. Hecker, B. Schmeck, S. Hippenstiel, B. E. Uhlin, and M. Steinert
Proteomic Characterization of the Whole Secretome of Legionella pneumophila and Functional Analysis of Outer Membrane Vesicles
Infect. Immun., May 1, 2008; 76(5): 1825 - 1836.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
S. Kopriva, K. Fritzemeier, G. Wiedemann, and R. Reski
The Putative Moss 3'-Phosphoadenosine-5'-phosphosulfate Reductase Is a Novel Form of Adenosine-5'-phosphosulfate Reductase without an Iron-Sulfur Cluster
J. Biol. Chem., August 3, 2007; 282(31): 22930 - 22938.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. Liu, S. Kang, C. Tang, L. B.M. Ellis, and T. Li
Meta-prediction of protein subcellular localization with reduced voting
Nucleic Acids Res., August 1, 2007; (2007) gkm562v1.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
A. Barsky, J. L. Gardy, R. E. W. Hancock, and T. Munzner
Cerebral: a Cytoscape plugin for layout of and interaction with biological networks using subcellular localization annotation
Bioinformatics, April 15, 2007; 23(8): 1040 - 1042.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
M. Seringhaus, A. Paccanaro, A. Borneman, M. Snyder, and M. Gerstein
Predicting essential genes in fungal genomes
Genome Res., September 1, 2006; 16(9): 1126 - 1135.
[Abstract] [Full Text] [PDF]


Home page
J Exp BotHome page
C. Ricoult, L. O. Echeverria, J.-B. Cliquet, and A. M. Limami
Characterization of alanine aminotransferase (AlaAT) multigene family and hypoxic response in young seedlings of the model legume Medicago truncatula
J. Exp. Bot., September 1, 2006; 57(12): 3079 - 3089.
[Abstract] [Full Text] [PDF]


Home page
Microbiol. Mol. Biol. Rev.Home page
M.-J. Han and S. Y. Lee
The Escherichia coli Proteome: Past, Present, and Future Prospects
Microbiol. Mol. Biol. Rev., June 1, 2006; 70(2): 362 - 439.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
B.-C. Kim, X. Qian, C. Leang, M. V. Coppi, and D. R. Lovley
Two Putative c-Type Multiheme Cytochromes Required for the Expression of OmcB, an Outer Membrane Protein Essential for Optimal Fe(III) Reduction in Geobacter sulfurreducens.
J. Bacteriol., April 1, 2006; 188(8): 3138 - 3142.
[Abstract] [Full Text] [PDF]


Home page
MicrobiologyHome page
S. Brunke and B. Hube
MfLIP1, a gene encoding an extracellular lipase of the lipid-dependent fungus Malassezia furfur
Microbiology, February 1, 2006; 152(2): 547 - 554.
[Abstract] [Full Text] [PDF]


Home page
GlycobiologyHome page
X. Fan, Y.-M. She, R. D. Bagshaw, J. W. Callahan, H. Schachter, and D. J. Mahuran
Identification of the hydrophobic glycoproteins of Caenorhabditis elegans
Glycobiology, October 1, 2005; 15(10): 952 - 964.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
D. Xie, A. Li, M. Wang, Z. Fan, and H. Feng
LOCSVMPSI: a web server for subcellular localization of eukaryotic proteins using SVM and profile of PSI-BLAST
Nucleic Acids Res., July 1, 2005; 33(suppl_2): W105 - W110.
[Abstract] [Full Text] [PDF]


Home page
J BiochemHome page
G. V. HarshaRani, S. J. Vayttaden, and U. S. Bhalla
Electronic Data Sources for Kinetic Models of Cell Signaling
J. Biochem., June 1, 2005; 137(6): 653 - 657.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
M. Boden and J. Hawkins
Prediction of subcellular localization using sequence-biased recurrent networks
Bioinformatics, May 15, 2005; 21(10): 2279 - 2286.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
V. Atalay and R. Cetin-Atalay
Implicit motif distribution based hybrid computational kernel for sequence classification
Bioinformatics, April 15, 2005; 21(8): 1429 - 1436.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. L. Gardy, M. R. Laird, F. Chen, S. Rey, C. J. Walsh, M. Ester, and F. S. L. Brinkman
PSORTb v.2.0: Expanded prediction of bacterial protein subcellular localization and insights gained from comparative proteome analysis
Bioinformatics, March 1, 2005; 21(5): 617 - 623.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
P. Lu, D. Szafron, R. Greiner, D. S. Wishart, A. Fyshe, B. Pearcy, B. Poulin, R. Eisner, D. Ngo, and N. Lamb
PA-GOSUB: a searchable database of model organism protein sequences with their predicted Gene Ontology molecular function and subcellular localization
Nucleic Acids Res., January 1, 2005; 33(suppl_1): D147 - D153.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
S. Rey, M. Acab, J. L. Gardy, M. R. Laird, K. deFays, C. Lambert, and F. S. L. Brinkman
PSORTdb: a protein subcellular localization database for bacteria
Nucleic Acids Res., January 1, 2005; 33(suppl_1): D164 - D168.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. Reumers, J. Schymkowitz, J. Ferkinghoff-Borg, F. Stricher, L. Serrano, and F. Rousseau
SNPeffect: a database mapping molecular phenotypic effects of human non-synonymous coding SNPs
Nucleic Acids Res., January 1, 2005; 33(suppl_1): D527 - D532.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
M. S. Scott, D. Y. Thomas, and M. T. Hallett
Predicting Subcellular Localization via Protein Motif Co-Occurrence
Genome Res., October 1, 2004; 14(10a): 1957 - 1966.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
D. Szafron, P. Lu, R. Greiner, D. S. Wishart, B. Poulin, R. Eisner, Z. Lu, J. Anvik, C. Macdonell, A. Fyshe, et al.
Proteome Analyst: custom predictions with explanations in a web-based tool for high-throughput proteome annotations
Nucleic Acids Res., July 1, 2004; 32(suppl_2): W365 - W371.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.