Bioinformatics Advance Access published online on June 24, 2004
Bioinformatics, doi:10.1093/bioinformatics/bth370
Bioinformatics © Oxford University Press 2004; all rights reserved
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 Language Technologies Institute, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA
* To whom correspondence should be addressed. E-mail: yanliu{at}cs.cmu.edu.
Motivation: Protein secondary structure prediction is an important step towards understanding how proteins fold in three dimensions. Recent analysis by information theory indicates that the correlation between neighboring secondary structures are much stronger than that of neighboring amino acids (Crooks & Brenner, 2004). In this paper, we focus on the combination problem for sequences, i.e. combining the scores or assignments from single or multiple prediction systems under the constraint of a whole sequence, as a target for improvement in protein secondary structure prediction. Results: We apply several graphical chain models to solve the combination problem and show that they are consistently more effective than the traditional window-based methods. In particular, conditional random fields (CRFs) improve moderately the predictions for helices and more importantly, for beta sheets, which are the major bottleneck for protein secondary structure prediction.
Accepted June 12, 2004
Article
Comparison of probabilistic combination methods for protein secondary structure prediction
2 Language Technologies Institute, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA; Center for Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA 15260, USA
3 Center for Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA 15260, USA
![]()
Abstract ![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
M.-H. Li, L. Lin, X.-L. Wang, and T. Liu Protein protein interaction site prediction based on conditional random fields Bioinformatics, March 1, 2007; 23(5): 597 - 604. [Abstract] [Full Text] [PDF] |
||||
