Bioinformatics Advance Access originally published online on November 5, 2004
Bioinformatics 2005 21(7):961-968; doi:10.1093/bioinformatics/bti126
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
A simple statistical method for discriminating outer membrane proteins with better accuracy
Computational Biology Research Center (CBRC), National Institute of Advanced Industrial Science and Technology (AIST) Aomi Frontier Building 17F, 2-43 Aomi, Koto-ku, Tokyo 135-0064, Japan
*To whom correspondence should be addressed.
Motivation: Discriminating outer membrane proteins from other folding types of globular and membrane proteins is an important task both for identifying outer membrane proteins from genomic sequences and for the successful prediction of their secondary and tertiary structures.
Results: We have systematically analyzed the amino acid composition of globular proteins from different structural classes and outer membrane proteins. We found that the residues, Glu, His, Ile, Cys, Gln, Asn and Ser, show a significant difference between globular and outer membrane proteins. Based on this information, we have devised a statistical method for discriminating outer membrane proteins from other globular and membrane proteins. Our approach correctly picked up the outer membrane proteins with an accuracy of 89% for the training set of 337 proteins. On the other hand, our method has correctly excluded the globular proteins at an accuracy of 79% in a non-redundant dataset of 674 proteins. Furthermore, the present method is able to correctly exclude
-helical membrane proteins up to an accuracy of 80%. These accuracy levels are comparable to other methods in the literature, and this is a simple method, which could be used for dissecting outer membrane proteins from genomic sequences. The influence of protein size, structural class and specific residues for discrimination is discussed.
Availability: A program for the discrimination method is available upon request from the corresponding author. The datasets used in this work are available at http://www.cbrc.jp/~gromiha/omp/dataset.html
Contact: michael-gromiha{at}aist.go.jp
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
A. Randall, J. Cheng, M. Sweredoski, and P. Baldi TMBpro: secondary structure, {beta}-contact and tertiary structure prediction of transmembrane {beta}-barrel proteins Bioinformatics, February 15, 2008; 24(4): 513 - 520. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. M. Gromiha, Y. Yabuki, S. Kundu, S. Suharnan, and M. Suwa TMBETA-GENOME: database for annotated {beta}-barrel membrane proteins in genomic sequences Nucleic Acids Res., January 12, 2007; 35(suppl_1): D314 - D316. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Waldispuhl, B. Berger, P. Clote, and J.-M. Steyaert transFold: a web server for predicting the structure and residue contacts of transmembrane beta-barrels. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W189 - W193. [Abstract] [Full Text] [PDF] |
||||
![]() |
K.-J. Park, M. M. Gromiha, P. Horton, and M. Suwa Discrimination of outer membrane proteins using support vector machines Bioinformatics, December 1, 2005; 21(23): 4223 - 4229. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. M. Gromiha, S. Ahmad, and M. Suwa TMBETA-NET: discrimination and prediction of membrane spanning {beta}-strands in outer membrane proteins Nucleic Acids Res., July 1, 2005; 33(suppl_2): W164 - W167. [Abstract] [Full Text] [PDF] |
||||

