Vol. 20 no. 1 2004, pages 21-28
Bioinformatics © Oxford University Press 2004; all rights reserved.
Prediction of protein subcellular locations using fuzzy k-NN method
State Key Laboratory of Intelligent Technology and Systems, Department of Automation, Institute of Bioinformatics, Tsinghua University, Beijing 100084, People's Republic of China
Received on December 10, 2002; revised on April 23, 2003; accepted on July 14, 2003
Motivation: Protein localization data are a valuable information resource helpful in elucidating protein functions. It is highly desirable to predict a protein's subcellular locations automatically from its sequence.
Results: In this paper, the fuzzy k-nearest neighbors (k-NN) algorithm is introduced to predict proteins' subcellular locations from their dipeptide composition. The prediction is performed on a new data set derived from version 41.0 of the SWISS-PROT databank, and an overall predictive accuracy of about 80% is achieved in a jackknife test. The result demonstrates the applicability of this relatively simple method and the potential for improved prediction accuracy of protein subcellular locations. We also applied this method to annotate six entirely sequenced proteomes, namely Saccharomyces cerevisiae, Caenorhabditis elegans, Drosophila melanogaster, Oryza sativa, Arabidopsis thaliana and a subset of all human proteins.
Availability: Supplementary information and subcellular location annotations for eukaryotes are available at http://166.111.30.65/hying/fuzzy_loc.htm
Contact: hying99{at}mails.tsinghua.edu.cn
* To whom correspondence should be addressed.
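The pipeline summarized in the Results paragraph (dipeptide composition features, fuzzy k-NN classification, jackknife evaluation) can be illustrated with a minimal Python sketch. The abstract does not state the membership formula, the distance measure, or the parameter values, so everything below is an assumption: it uses the commonly cited fuzzy k-NN weighting with crisp training labels, Euclidean distance, and illustrative defaults `k=3` and `m=2.0`. The function names are hypothetical and not taken from the paper.

```python
from collections import Counter
from itertools import product
import math

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
# All 20 x 20 = 400 ordered residue pairs, in a fixed feature order.
DIPEPTIDES = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]


def dipeptide_composition(seq):
    """Return the 400-dimensional vector of dipeptide frequencies of seq."""
    pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
    # Assumption: pairs containing non-standard residues (e.g. X) are skipped.
    valid = [p for p in pairs if p[0] in AMINO_ACIDS and p[1] in AMINO_ACIDS]
    counts = Counter(valid)
    total = len(valid) or 1  # avoid division by zero for very short input
    return [counts[d] / total for d in DIPEPTIDES]


def fuzzy_knn(query, train_vectors, train_labels, k=3, m=2.0):
    """Return {label: membership} for query (assumed formulation, crisp labels)."""
    nearest = sorted(
        (math.dist(query, v), lab) for v, lab in zip(train_vectors, train_labels)
    )[:k]
    weights = {}
    for d, lab in nearest:
        # Inverse-distance weight; the epsilon guards against an exact match.
        w = 1.0 / (d ** (2.0 / (m - 1.0)) + 1e-12)
        weights[lab] = weights.get(lab, 0.0) + w
    total = sum(weights.values())
    return {lab: w / total for lab, w in weights.items()}


def jackknife_accuracy(vectors, labels, k=3, m=2.0):
    """Leave-one-out accuracy: each sample is predicted from all the others."""
    correct = 0
    for i in range(len(vectors)):
        rest_v = vectors[:i] + vectors[i + 1:]
        rest_l = labels[:i] + labels[i + 1:]
        memberships = fuzzy_knn(vectors[i], rest_v, rest_l, k, m)
        if max(memberships, key=memberships.get) == labels[i]:
            correct += 1
    return correct / len(vectors)
```

In this sketch, each protein sequence would first be converted with `dipeptide_composition`, and the resulting vectors and their location labels passed to `jackknife_accuracy`. The memberships returned by `fuzzy_knn` sum to 1, so the largest value can be read as a rough confidence for the predicted location.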