Bioinformatics Vol. 18 no. 12 2002
Pages 1650-1657
© 2002 Oxford University Press
Local structure-based sequence profile database for local and global protein structure predictions
Department of Pharmacology and Columbia Genome Center, Columbia University, 630 West 168th street, PH 7 W Room 318, New York, NY 10032, USA
Received on January 2, 2002
; revised on April 10, 2002
; accepted on May 24, 2002
Motivation: A large body of evidence suggests that protein structural information is frequently encoded in local sequencessequencestructure relationships derived from local structure/sequence analyses could significantly enhance the capacities of protein structure prediction methods. In this paper, the prediction capacity of a database (LSBSP2) that organizes local sequencestructure relationships encoded in local structures with two consecutive secondary structure elements is tested with two computational procedures for protein structure prediction. The goal is twofold: to test the folding hypothesis that local structures are determined by local sequences, and to enhance our capacity in predicting protein structures from their amino acid sequences.
Results: The LSBSP2 database contains a large set of sequence profiles derived from exhaustive pair-wise structural alignments for local structures with two consecutive secondary structure elements. One computational procedure makes use of the PSI-BLAST alignment program to predict local structures for testing sequence fragments by matching the testing sequence fragments onto the sequence profiles in the LSBSP2 database. The results show that 54% of the test sequence fragments were predicted with local structures that match closely with their native local structures. The other computational procedure is a filter system that is capable of removing false positives as possible from a set of PSI-BLAST hits. An assessment with a large set of non-redundant protein structures shows that the PSI-BLAST + filter system improves the prediction specificity by up to two-fold over the prediction specificity of the PSI-BLAST program for distantly related protein pairs. Tests with the two computational procedures above demonstrate that local sequencestructure relationships can indeed enhance our capacity in protein structure prediction. The results also indicate that local sequences encoded with strong local structure propensities play an important role in determining the native state folding topology.
Availability: All the computational and assessment procedures have been implemented in the integrated computational system PrISM.1 (Protein Informatics System for Modeling). The system and associated databases for LINUX systems can be downloaded from the website:www.columbia.edu/~ay1/
Contact: ay1{at}columbia.edu
* To whom correspondence should be addressed.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
H.-P. Peng and A.-S. Yang Modeling protein loops with knowledge-based prediction of sequence-structure alignment Bioinformatics, November 1, 2007; 23(21): 2836 - 2842. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. K. Motoike, H. Liu, I. W. Glaaser, A.-S. Yang, M. Tateyama, and R. S. Kass The Na+ Channel Inactivation Gate Is a Molecular Complex: A Novel Role of the COOH-terminal Domain J. Gen. Physiol., January 26, 2004; 123(2): 155 - 165. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Espadaler, N. Fernandez-Fuentes, A. Hermoso, E. Querol, F. X. Aviles, M. J. E. Sternberg, and B. Oliva ArchDB: automated protein loop classification as a tool for structural genomics Nucleic Acids Res., January 1, 2004; 32(90001): D185 - 188. [Abstract] [Full Text] [PDF] |
||||


