Bioinformatics Vol. 18 no. 2 2002
Pages 306-314
© 2002 Oxford University Press
Predicting reliable regions in protein sequence alignments
1 Center for Biomolecular Science and
Engineering
2 Department of Computer Engineering, Jack
Baskin School of Engineering, University of California, Santa Cruz,
CA 95064, USA
Received on April 19, 2001
; revised on August 1, 2001
; accepted on August 21, 2001
Motivation: Protein sequence alignments have a myriad of applications in bioinformatics, including secondary and tertiary structure prediction, homology modeling, and phylogeny. Unfortunately, all alignment methods make mistakes, and mistakes in alignments often yield mistakes in their application. Thus, a method to identify and remove suspect alignment positions could benefit many areas in protein sequence analysis.
Results: We tested four predictors of alignment position reliability, including near-optimal alignment information, column score, and secondary structural information. We validated each predictor against a large library of alignments, removing positions predicted as unreliable. Near-optimal alignment information was the best predictor, removing 70% of the substantially-misaligned positions and 58% of the over-aligned positions, while retaining 86% of those aligned accurately.
Availability: The shift score alignment comparison algorithm is available online at http://www.soe.ucsc.edu/research/compbio/HMM-apps/compare-align.html and from the authors on request.
Contact: cline{at}soe.ucsc.edu
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
E. Benavides, R. Baum, D. McClellan, and J. W. Sites Molecular Phylogenetics of the Lizard Genus Microlophus (Squamata:Tropiduridae): Aligning and Retrieving Indel Signal from Nuclear Introns Syst Biol, October 1, 2007; 56(5): 776 - 797. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Rangwala and G. Karypis Incremental window-based protein sequence alignment algorithms Bioinformatics, January 15, 2007; 23(2): e17 - e23. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Pei and N. V. Grishin MUMMALS: multiple sequence alignment improved by using hidden Markov models with local structural information Nucleic Acids Res., September 11, 2006; 34(16): 4364 - 4374. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. G. Higgins, G. Blackshields, and I. M. Wallace Mind the gaps: Progress in progressive alignment PNAS, July 26, 2005; 102(30): 10411 - 10412. [Full Text] [PDF] |
||||
![]() |
J. Soding Protein homology detection by HMM-HMM comparison Bioinformatics, April 1, 2005; 21(7): 951 - 960. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Ovcharenko, D. Boffelli, and G. G. Loots eShadow: A Tool for Comparing Closely Related Sequences Genome Res., June 1, 2004; 14(6): 1191 - 1198. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. D. Thompson, V. Prigent, and O. Poch LEON: multiple aLignment Evaluation Of Neighbours Nucleic Acids Res., February 24, 2004; 32(4): 1298 - 1307. [Abstract] [Full Text] [PDF] |
||||




