Skip Navigation

This Article
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow FREE Full Text (Screen PDF)
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (12)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Yang, A.-S.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Yang, A.-S.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Bioinformatics Vol. 18 no. 12 2002
Pages 1658-1665
© 2002 Oxford University Press

Structure–dependent sequence alignment for remotely related proteins

An-Suei Yang

Department of Pharmacology and Columbia Genome Center, Columbia University, 630 West 168th street, PH 7 W Room 318, New York, NY 10032, USA

Received on January 2, 2002 ; revised on April 10, 2002 ; accepted on May 24, 2002

Motivation: The quality of a model structure derived from a comparative modeling procedure is dictated by the accuracy of the predicted sequence–template alignment. As the sequence–template pairs are increasingly remote in sequence relationship, the prediction of the sequence–template alignments becomes increasingly problematic with sequence alignment methods. Structural information of the template, used in connection with the sequence relationship of the sequence–template pair, could significantly improve the accuracy of the sequence–template alignment. In this paper, we describe a sequence–template alignment method that integrates sequence and structural information to enhance the accuracy of sequence–template alignments for distantly related protein pairs.

Results: The structure-dependent sequence alignment (SDSA) procedure was optimized for coverage and accuracy on a training set of 412 protein pairs; the structures for each of the training pairs are similar (RMSD<~4Å) but the sequence relationship is undetectable (average pair-wise sequence identity = 8%). The optimized SDSA procedure was then applied to extend PSI-BLAST local alignments by calculating the global alignments under the constraint of the residue pairs in the local alignments. This composite alignment procedure was assessed with a testing set of 1421 protein pairs, of which the pair-wise structures are similar (RMSD<~4Å) but the sequences are marginally related at best in each pair (average pair-wise sequence identity = 13%). The assessment showed that the composite alignment procedure predicted more aligned residues pairs with an average of 27% increase in correctly aligned residues over the standard PSI-BLAST alignments for the protein pairs in the testing set.

Availability: All the computational and assessment procedures have been implemented in the integrated computational system PrISM.1 (Protein Informatics System for Modeling). The system and associated databases for LINUX systems can be downloaded from the website: http://www.columbia.edu/~ay1/

Contact: ay1{at}columbia.edu


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
H.-P. Peng and A.-S. Yang
Modeling protein loops with knowledge-based prediction of sequence-structure alignment
Bioinformatics, November 1, 2007; 23(21): 2836 - 2842.
[Abstract] [Full Text] [PDF]


Home page
RNAHome page
H.-C. HUANG, U. NAGASWAMY, and G. E. FOX
The application of cluster analysis in the intercomparison of loop structures in RNA
RNA, April 1, 2005; 11(4): 412 - 423.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
C.Z. Cai, L.Y. Han, Z.L. Ji, X. Chen, and Y.Z. Chen
SVM-Prot: web-based support vector machine software for functional classification of a protein from its primary sequence
Nucleic Acids Res., July 1, 2003; 31(13): 3692 - 3697.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.