Skip Navigation

This Article
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow FREE Full Text (Screen PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (23)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Hou, Y.
Right arrow Articles by Bystroff, C.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Hou, Y.
Right arrow Articles by Bystroff, C.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Bioinformatics Vol. 19 no. 17 2003
pages 2294-2301
© 2003 Oxford University Press

Efficient remote homology detection using local structure

Yuna Hou 1,*, Wynne Hsu 1, Mong Li Lee 1 and Christopher Bystroff 2

1 School of Computing, National University of Singapore, Singapore 117543 and 2 Department of Biology, Rensselaer Polytechnic Institute, Troy, NY 12180, USA

Received on February 27, 2003 ; revised on May 23, 2003 ; accepted on June 3, 2003

Motivation: The function of an unknown biological sequence can often be accurately inferred if we are able to map this unknown sequence to its corresponding homologous family. At present, discriminative methods such as SVM-Fisher and SVM-pairwise, which combine support vector machine (SVM) and sequence similarity, are recognized as the most accurate methods, with SVM-pairwise being the most accurate. However, these methods typically encode sequence information into their feature vectors and ignore the structure information. They are also computationally inefficient. Based on these observations, we present an alternative method for SVM-based protein classification. Our proposed method, SVM-I-sites, utilizes structure similarity for remote homology detection.

Result: We run experiments on the Structural Classification of Proteins 1.53 data set. The results show that SVM-I-sites is more efficient than SVM-pairwise. Further, we find that SVM-I-sites outperforms sequence-based methods such as PSI-BLAST, SAM, and SVM-Fisher while achieving a comparable performance with SVM-pairwise.

Availability: I-sites server is accessible through the web at http://www.bioinfo.rpi.edu. Programs are available upon request for academics. Licensing agreements are available for commercial interests. The framework of encoding local structure into feature vector is available upon request.

Contact: houyuna{at}comp.nus.edu.sg

* To whom correspondence should be addressed.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
A. R. Shah, C. S. Oehmen, and B.-J. Webb-Robertson
SVM-HUSTLE--an iterative semi-supervised machine learning approach for pairwise protein remote homology detection
Bioinformatics, March 15, 2008; 24(6): 783 - 790.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
L. Krause, A. C. McHardy, T. W. Nattkemper, A. Puhler, J. Stoye, and F. Meyer
GISMO--gene identification using a support vector machine for ORF classification
Nucleic Acids Res., January 28, 2007; 35(2): 540 - 549.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
Q.-w. Dong, X.-l. Wang, and L. Lin
Application of latent semantic analysis to protein remote homology detection
Bioinformatics, February 1, 2006; 22(3): 285 - 290.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
H. Rangwala and G. Karypis
Profile-based direct kernels for remote homology detection and fold recognition
Bioinformatics, December 1, 2005; 21(23): 4239 - 4247.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
K. Wang and R. Samudrala
FSSA: a novel method for identifying functional signatures from structural alignments
Bioinformatics, July 1, 2005; 21(13): 2969 - 2977.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. Han, B.-c. Lee, S. T. Yu, C.-s. Jeong, S. Lee, and D. Kim
Fold recognition by combining profile-profile alignment and support vector machine
Bioinformatics, June 1, 2005; 21(11): 2667 - 2673.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
X. Yuan and C. Bystroff
Non-sequential structure-based alignments reveal topology-independent core packing arrangements in proteins
Bioinformatics, April 1, 2005; 21(7): 1010 - 1019.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
J. Hou, S.-R. Jun, C. Zhang, and S.-H. Kim
From The Cover: Global mapping of the protein structure space and application in structure-based inference of protein function
PNAS, March 8, 2005; 102(10): 3651 - 3656.
[Abstract] [Full Text] [PDF]



Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.