Skip Navigation

This Article
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow FREE Full Text (Screen PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (6)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Coghlan, A.
Right arrow Articles by Buttimore, N. H.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Coghlan, A.
Right arrow Articles by Buttimore, N. H.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Bioinformatics Vol. 17 no. 8 2001
Pages 676-685
© 2001 Oxford University Press

Representation of amino acids as five-bit or three-bit patterns for filtering protein databases

Avril Coghlan 1, Dónall A. Mac Dónaill 2,* and Nigel H. Buttimore 3

1 Department of Genetics
2 Department of Chemistry
3 School of Mathematics, University of Dublin, Trinity College, Dublin 2, Ireland

Received on February 12, 2001 ; revised on April 17, 2001 ; accepted on April 25, 2001

Motivation: We propose representing amino acids by bit-patterns so they may be used in a filter algorithm for similarity searches over protein databases, to rapidly eliminate non-homologous regions of database sequences. The filter algorithm would be based on dynamic programming optimization. It would have the advantage over previous filter algorithms that its substitution scoring function distinguishes between conservative and non-conservative amino acid substitutions.

Results: Simulated annealing was used to search for the best five-bit or three-bit patterns to represent amino acids, where similar amino acids were given similar bit-patterns. The similarity between amino acids was estimated from the BLOSUM45 matrix. Representing amino acids by these five-bit and three-bit patterns, the Escherichia coli PhoE precursor and the bacteriophage PA2 LC precursor were aligned. The alignments were nearly the same as that obtained when BLOSUM45 was used to score substitutions.

Availability: The C code of the optimization algorithm for searching for the optimal bit-pattern representation of amino acids is available from the authors upon request.

Contact: dmcdonll{at}tcd.ie

* To whom correspondence should be addressed.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Mol Biol EvolHome page
T. Massingham and N. Goldman
Statistics of the Log-Det Estimator
Mol. Biol. Evol., October 1, 2007; 24(10): 2277 - 2285.
[Abstract] [Full Text] [PDF]


Home page
Ann. N. Y. Acad. Sci.Home page
A. RUKNUDIN and D. H. SCHULZE
Proteomics Approach to Na+/Ca2+ Exchangers in Prokaryotes
Ann. N.Y. Acad. Sci., November 1, 2002; 976(1): 103 - 108.
[Full Text] [PDF]



Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.