Bioinformatics Vol. 17 no. 8 2001
Pages 676-685
© 2001 Oxford University Press
Representation of amino acids as five-bit or three-bit patterns for filtering protein databases
1 Department of Genetics
2 Department of Chemistry
3 School of Mathematics, University of
Dublin, Trinity College, Dublin 2, Ireland
Received on February 12, 2001
; revised on April 17, 2001
; accepted on April 25, 2001
Motivation: We propose representing amino acids by bit-patterns so they may be used in a filter algorithm for similarity searches over protein databases, to rapidly eliminate non-homologous regions of database sequences. The filter algorithm would be based on dynamic programming optimization. It would have the advantage over previous filter algorithms that its substitution scoring function distinguishes between conservative and non-conservative amino acid substitutions.
Results: Simulated annealing was used to search for the best five-bit or three-bit patterns to represent amino acids, where similar amino acids were given similar bit-patterns. The similarity between amino acids was estimated from the BLOSUM45 matrix. Representing amino acids by these five-bit and three-bit patterns, the Escherichia coli PhoE precursor and the bacteriophage PA2 LC precursor were aligned. The alignments were nearly the same as that obtained when BLOSUM45 was used to score substitutions.
Availability: The C code of the optimization algorithm for searching for the optimal bit-pattern representation of amino acids is available from the authors upon request.
Contact: dmcdonll{at}tcd.ie
* To whom correspondence should be addressed.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
T. Massingham and N. Goldman Statistics of the Log-Det Estimator Mol. Biol. Evol., October 1, 2007; 24(10): 2277 - 2285. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. RUKNUDIN and D. H. SCHULZE Proteomics Approach to Na+/Ca2+ Exchangers in Prokaryotes Ann. N.Y. Acad. Sci., November 1, 2002; 976(1): 103 - 108. [Full Text] [PDF] |
||||

