Bioinformatics, Vol 14, 407-422, Copyright © 1998 by Oxford University Press
C Notredame, L Holm and DG Higgins
MOTIVATION: In order to increase the accuracy of multiple sequence
alignments, we designed a new strategy for optimizing multiple sequence
alignments by genetic algorithm. We named it COFFEE (Consistency based
Objective Function For alignmEnt Evaluation). The COFFEE score reflects the
level of consistency between a multiple sequence alignment and a library
containing pairwise alignments of the same sequences. RESULTS: We show that
multiple sequence alignments can be optimized for their COFFEE score with
the genetic algorithm package SAGA. The COFFEE function is tested on 11
test cases made of structural alignments extracted from 3D_ali. These
alignments are compared to those produced using five alternative methods.
Results indicate that COFFEE outperforms the other methods when the level
of identity between the sequences is low. Accuracy is evaluated by
comparison with the structural alignments used as references. We also show
that the COFFEE score can be used as a reliability index on multiple
sequence alignments. Finally, we show that given a library of
structure-based pairwise sequence alignments extracted from FSSP, SAGA can
produce high- quality multiple sequence alignments. The main advantage of
COFFEE is its flexibility. With COFFEE, any method suitable for making
pairwise alignments can be extended to making multiple alignments.
AVAILABILITY: The package is available along with the test cases through
the WWW: http://www. ebi.ac.uk/~cedric CONTACT: cedric.notredame@ebi.ac.uk
ARTICLES
COFFEE: an objective function for multiple sequence alignments
EMBL Outstation-The European Bioinformatics Institute, Hinxton Hall, Hinxton, Cambridge CB10 1SD, UK.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
K. Katoh and H. Toh Recent developments in the MAFFT multiple sequence alignment program Brief Bioinform, July 1, 2008; 9(4): 286 - 298. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Heger, E. Korpelainen, T. Hupponen, K. Mattila, V. Ollikainen, and L. Holm PairsDB atlas of protein sequence space Nucleic Acids Res., January 11, 2008; 36(suppl_1): D276 - D280. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Heger, S. Mallick, C. Wilton, and L. Holm The global trace graph, a novel paradigm for searching protein sequence databases Bioinformatics, September 15, 2007; 23(18): 2361 - 2367. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. J. Wheeler and J. D. Kececioglu Multiple alignment by aligning alignments Bioinformatics, July 1, 2007; 23(13): i559 - i568. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. E. Gentle, A. J. Perry, F. H. Alcock, V. A. Likic, P. Dolezal, E. T. Ng, A. W. Purcell, M. McConnville, T. Naderer, A.-L. Chanez, et al. Conserved Motifs Reveal Details of Ancestry and Structure in the Small TIM Chaperones of the Mitochondrial Intermembrane Space Mol. Biol. Evol., May 1, 2007; 24(5): 1149 - 1160. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. M. Wallace, O. O'Sullivan, D. G. Higgins, and C. Notredame M-Coffee: combining multiple sequence alignment methods with T-Coffee Nucleic Acids Res., March 23, 2006; 34(6): 1692 - 1699. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Zhou and Y. Zhou SPEM: improving multiple sequence alignment with sequence profiles and predicted secondary structures Bioinformatics, September 15, 2005; 21(18): 3615 - 3621. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. M. Jarvis and R. Goodacre Genetic algorithm optimization for pre-processing and variable selection of spectroscopic data Bioinformatics, April 1, 2005; 21(7): 860 - 868. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. B. Do, M. S.P. Mahabhashyam, M. Brudno, and S. Batzoglou ProbCons: Probabilistic consistency-based multiple sequence alignment Genome Res., February 1, 2005; 15(2): 330 - 340. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. Poirot, K. Suhre, C. Abergel, E. O'Toole, and C. Notredame 3DCoffee@igs: a web server for combining sequences and structures into a multiple sequence alignment Nucleic Acids Res., July 1, 2004; 32(suppl_2): W37 - W40. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. Poirot, E. O'Toole, and C. Notredame Tcoffee@igs: a web server for computing, evaluating and combining multiple sequence alignments Nucleic Acids Res., July 1, 2003; 31(13): 3503 - 3506. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. D. Thompson, F. Plewniak, J.-C. Thierry, and O. Poch DbClustal: rapid and reliable global multiple alignments of protein sequences detected by database searches Nucleic Acids Res., August 1, 2000; 28(15): 2919 - 2926. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Lovis and R. H. Baud Fast Exact String Pattern-matching Algorithms Adapted to the Characteristics of the Medical Language J. Am. Med. Inform. Assoc., July 1, 2000; 7(4): 378 - 391. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Ogata, S. Audic, C. Abergel, P.-E. Fournier, and J.-M. Claverie Protein Coding Palindromes Are a Unique but Recurrent Feature in Rickettsia Genome Res., May 1, 2002; 12(5): 808 - 816. [Abstract] [Full Text] [PDF] |
||||





