Skip Navigation


Bioinformatics Advance Access originally published online on August 12, 2004
Bioinformatics 2005 21(1):10-19; doi:10.1093/bioinformatics/bth466
This Article
Right arrow Full Text Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow All Versions of this Article:
21/1/10    most recent
bth466v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (81)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Chou, K.-C.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Chou, K.-C.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Bioinformatics vol. 21 issue 1 © Oxford University Press 2005; all rights reserved.

Using amphiphilic pseudo amino acid composition to predict enzyme subfamily classes

Kuo-Chen Chou

Gordon Life Science Institute San Diego, CA 92130, USA

Motivation: With protein sequences entering into databanks at an explosive pace, the early determination of the family or subfamily class for a newly found enzyme molecule becomes important because this is directly related to the detailed information about which specific target it acts on, as well as to its catalytic process and biological function. Unfortunately, it is both time-consuming and costly to do so by experiments alone. In a previous study, the covariant-discriminant algorithm was introduced to identify the 16 subfamily classes of oxidoreductases. Although the results were quite encouraging, the entire prediction process was based on the amino acid composition alone without including any sequence-order information. Therefore, it is worthy of further investigation.

Results: To incorporate the sequence-order effects into the predictor, the ‘amphiphilic pseudo amino acid composition’ is introduced to represent the statistical sample of a protein. The novel representation contains 20 + 2{lambda} discrete numbers: the first 20 numbers are the components of the conventional amino acid composition; the next 2{lambda} numbers are a set of correlation factors that reflect different hydrophobicity and hydrophilicity distribution patterns along a protein chain. Based on such a concept and formulation scheme, a new predictor is developed. It is shown by the self-consistency test, jackknife test and independent dataset tests that the success rates obtained by the new predictor are all significantly higher than those by the previous predictors. The significant enhancement in success rates also implies that the distribution of hydrophobicity and hydrophilicity of the amino acid residues along a protein chain plays a very important role to its structure and function.

Contact: kchou{at}san.rr.com


Received on June 30, 2004; revised on July 20, 2004; accepted on August 2, 2004

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
J. Virol.Home page
P. V. Aguilar, L. W. Leung, E. Wang, S. C. Weaver, and C. F. Basler
A Five-Amino-Acid Deletion of the Eastern Equine Encephalitis Virus Capsid Protein Attenuates Replication in Mammalian Systems but Not in Mosquito Cells
J. Virol., July 15, 2008; 82(14): 6972 - 6983.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
T. Damoulas and M. A. Girolami
Probabilistic multi-class multi-kernel learning: on protein fold recognition and remote homology detection
Bioinformatics, May 15, 2008; 24(10): 1264 - 1270.
[Abstract] [Full Text] [PDF]


Home page
Protein Eng Des SelHome page
H.-B. Shen and K.-C. Chou
Gpos-PLoc: an ensemble classifier for predicting subcellular localization of Gram-positive bacterial proteins
Protein Eng. Des. Sel., January 23, 2007; (2007) gzl053v1.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
H.-B. Shen and K.-C. Chou
Ensemble classifier for protein fold pattern recognition
Bioinformatics, July 15, 2006; 22(14): 1717 - 1722.
[Abstract] [Full Text] [PDF]



Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.