

Bioinformatics Advance Access originally published online on August 25, 2009
Bioinformatics 2009 25(20):2655-2662; doi:10.1093/bioinformatics/btp500

© The Author 2009. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org

A new taxonomy-based protein fold recognition approach based on autocross-covariance transformation

Qiwen Dong 1,2, Shuigeng Zhou 1,2,* and Jihong Guan 3

1 Shanghai Key Lab of Intelligent Information Processing, 2 School of Computer Science, Fudan University and 3 Department of Computer Science and Technology, Tongji University, Shanghai, China

* To whom correspondence should be addressed.


Abstract

Motivation: Fold recognition is an important step in protein structure and function prediction. Traditional sequence comparison methods fail to identify reliable homologies when sequence identity is low. Taxonomic methods are effective alternatives, but their prediction accuracies of around 70% are still too low for practical use.

Results: In this study, a simple and powerful method is presented for taxonomic fold recognition that combines a support vector machine (SVM) with the autocross-covariance (ACC) transformation. The evolutionary information, represented as position-specific score matrices, is converted into a series of fixed-length vectors by the ACC transformation, and these vectors are then input to an SVM classifier for fold recognition. This scheme effectively captures the sequence-order effect. Experiments are performed on the widely used D-B dataset and on the corresponding extended dataset. The proposed method, called ACCFold, achieves an overall accuracy of 70.1% on the D-B dataset, which is 2–14% higher than major existing taxonomic methods. Furthermore, the method achieves an overall accuracy of 87.6% on the extended dataset, surpassing major existing taxonomic methods by 9–17%. Additionally, the method obtains an overall accuracy of 80.9% for 86 folds and 77.2% for 199 folds. These results demonstrate that ACCFold provides state-of-the-art performance for taxonomic fold recognition.
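
The abstract does not give the ACC formulas, so the following is only a minimal sketch under a commonly used formulation: for each lag, the covariance between a score column at position j and a score column at position j + lag, averaged along the sequence. The same column twice gives an autocovariance term and two different columns give a cross-covariance term. The function name `acc_transform`, the parameter `max_lag`, and the toy matrices are illustrative assumptions and are not taken from the authors' package. The point it shows is that matrices of different lengths map to vectors of one fixed length, which is what allows them to be passed to an SVM.

```python
from typing import List


def acc_transform(pssm: List[List[float]], max_lag: int) -> List[float]:
    """Map an L x N score matrix to a vector of N * N * max_lag values.

    Assumed formulation (not confirmed by the abstract): lagged covariance
    between every ordered pair of columns, for lags 1..max_lag.
    """
    length = len(pssm)
    if length <= max_lag:
        raise ValueError("sequence must be longer than max_lag")
    n_cols = len(pssm[0])

    # Mean of each column over the whole sequence.
    means = [sum(row[c] for row in pssm) / length for c in range(n_cols)]
    # Scores centered on their column mean.
    centered = [[row[c] - means[c] for c in range(n_cols)] for row in pssm]

    features: List[float] = []
    for lag in range(1, max_lag + 1):
        span = length - lag  # number of position pairs separated by this lag
        for a in range(n_cols):
            for b in range(n_cols):
                # a == b: autocovariance term; a != b: cross-covariance term.
                total = sum(
                    centered[j][a] * centered[j + lag][b] for j in range(span)
                )
                features.append(total / span)
    return features


# Toy two-column matrices of different lengths (hypothetical values).
short_matrix = [[1.0, 0.0], [2.0, 0.0], [3.0, 0.0], [4.0, 0.0]]
long_matrix = [[float(i % 3), float(i % 2)] for i in range(10)]

short_vec = acc_transform(short_matrix, max_lag=2)
long_vec = acc_transform(long_matrix, max_lag=2)
```

With a real 20-column position-specific score matrix, this sketch would yield 400 values per lag regardless of sequence length. How the lag range is chosen and how the resulting vectors are scaled before SVM training are not stated in the abstract.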

Availability: The source code for ACC transformation is freely available at http://www.iipl.fudan.edu.cn/demo/accpkg.html.

Contact: sgzhou{at}fudan.edu.cn

Supplementary information: Supplementary data are available at Bioinformatics online.

Associate Editor: Thomas Lengauer


Received on March 11, 2009; revised on August 11, 2009; accepted on August 13, 2009




