Bioinformatics Advance Access originally published online on January 31, 2007
Bioinformatics 2007 23(7):802-808; doi:10.1093/bioinformatics/btm017
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
PROMALS: towards accurate multiple sequence alignments of distantly related proteins
1Howard Hughes Medical Institute and 2Department of Biochemistry, The University of Texas Southwestern Medical Center at Dallas, 6001 Forest Park Road, Dallas, TX 75390-9050, USA
*To whom correspondence should be addressed.
| Abstract |
|---|
Motivation: Accurate multiple sequence alignments are essential in protein structure modeling, functional prediction and efficient planning of experiments. Although the alignment problem has attracted considerable attention, preparation of high-quality alignments for distantly related sequences remains a difficult task.
Results: We developed PROMALS, a multiple alignment method that shows promising results for protein homologs with sequence identity below 10%, aligning close to half of the amino acid residues correctly on average. This is about three times more accurate than traditional pairwise sequence alignment methods. PROMALS algorithm derives its strength from several sources: (i) sequence database searches to retrieve additional homologs; (ii) accurate secondary structure prediction; (iii) a hidden Markov model that uses a novel combined scoring of amino acids and secondary structures; (iv) probabilistic consistency-based scoring applied to progressive alignment of profiles. Compared to the best alignment methods that do not use secondary structure prediction and database searches (e.g. MUMMALS, ProbCons and MAFFT), PROMALS is up to 30% more accurate, with improvement being most prominent for highly divergent homologs. Compared to SPEM and HHalign, which also employ database searches and secondary structure prediction, PROMALS shows an accuracy improvement of several percent.
Availability: The PROMALS web server is available at: http://prodata.swmed.edu/promals/
Contact: jpei{at}chop.swmed.edu
Supplementary information: Supplementary data are available at Bioinformatics online.
Associate Editor: Alex Bateman
Received on December 4, 2006; revised on January 12, 2007; accepted on January 17, 2007
This article has been cited by other articles:
![]() |
F. Khan, Y. Furuta, M. Kawai, K. H. Kaminska, K. Ishikawa, J. M. Bujnicki, and I. Kobayashi A putative mobile genetic element carrying a novel type IIF restriction-modification system (PluTI) Nucleic Acids Res., January 13, 2010; (2010) gkp1221v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Kemena and C. Notredame Upcoming challenges for multiple sequence alignment methods in the high-throughput era Bioinformatics, October 1, 2009; 25(19): 2455 - 2465. [Abstract] [Full Text] [PDF] |
||||
![]() |
T.-C. Chu, T. Liu, D. T. Lee, G. C. Lee, and A. C.-C. Shih GR-Aligner: an algorithm for aligning pairwise genomic sequences containing rearrangement events Bioinformatics, September 1, 2009; 25(17): 2188 - 2193. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. D. Morgan, E. A. Dwinell, T. K. Bhatia, E. M. Lang, and Y. A. Luyten The MmeI family: type II restriction-modification enzymes that employ single-strand modification for host protection Nucleic Acids Res., August 1, 2009; 37(15): 5208 - 5221. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Loytynoja and N. Goldman Uniting Alignments and Trees Science, June 19, 2009; 324(5934): 1528 - 1529. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. A. Johnston, H.-T. Lee, D. J. Macqueen, K. Paranthaman, C. Kawashima, A. Anwar, J. R. Kinghorn, and T. Dalmay Embryonic temperature affects muscle fibre recruitment in adult zebrafish: genome-wide changes in gene and microRNA expression associated with the transition from hyperplastic to hypertrophic growth phenotypes J. Exp. Biol., June 15, 2009; 212(12): 1781 - 1793. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. A. Morrison Why Would Phylogeneticists Ignore Computerized Sequence Alignment? Syst Biol, March 25, 2009; (2009) syp009v1. [Full Text] [PDF] |
||||
![]() |
Y. Lu and S.-H. Sze Improving accuracy of multiple sequence alignment algorithms based on alignment of neighboring residues Nucleic Acids Res., February 1, 2009; 37(2): 463 - 472. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Bateman, R. D. Finn, P. J. Sims, T. Wiedmer, A. Biegert, and J. Soding Phospholipid scramblases and Tubby-like proteins belong to a new superfamily of membrane tethered transcription factors Bioinformatics, January 15, 2009; 25(2): 159 - 162. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. D. Morgan, T. K. Bhatia, L. Lovasco, and T. B. Davis MmeI: a minimal Type II restriction-modification system that only modifies one DNA strand for host protection Nucleic Acids Res., November 1, 2008; 36(20): 6558 - 6570. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Ahola, T. Aittokallio, M. Vihinen, and E. Uusipaikka Model-based prediction of sequence alignment quality Bioinformatics, October 1, 2008; 24(19): 2165 - 2171. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Katoh and H. Toh Recent developments in the MAFFT multiple sequence alignment program Brief Bioinform, July 1, 2008; 9(4): 286 - 298. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Pei, M. Tang, and N. V. Grishin PROMALS3D web server for accurate multiple protein sequence and structure alignments Nucleic Acids Res., July 1, 2008; 36(suppl_2): W30 - W34. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Roovers, K. H. Kaminska, K. L. Tkaczuk, D. Gigot, L. Droogmans, and J. M. Bujnicki The YqfN protein of Bacillus subtilis is the tRNA: m1A22 methyltransferase (TrmK) Nucleic Acids Res., June 1, 2008; 36(10): 3252 - 3262. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. White, Z. Li, R. Sardana, J. M. Bujnicki, E. M. Marcotte, and A. W. Johnson Bud23 Methylates G1575 of 18S rRNA and Is Required for Efficient Nuclear Export of Pre-40S Subunits Mol. Cell. Biol., May 15, 2008; 28(10): 3151 - 3161. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. F. Bolliger, J. Pei, S. Maxeiner, A. A. Boucard, N. V. Grishin, and T. C. Sudhof Unusually rapid evolution of Neuroligin-4 in mice PNAS, April 29, 2008; 105(17): 6421 - 6426. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Liu, R. Tewari, J. Ning, A. M. Blagborough, S. Garbom, J. Pei, N. V. Grishin, R. E. Steele, R. E. Sinden, W. J. Snell, et al. The conserved plant sterility gene HAP2 functions after attachment of fusogenic membranes in Chlamydomonas and Plasmodium gametes Genes & Dev., April 15, 2008; 22(8): 1051 - 1068. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Pei, B.-H. Kim, and N. V. Grishin PROMALS3D: a tool for multiple protein sequence and structure alignments Nucleic Acids Res., April 1, 2008; 36(7): 2295 - 2300. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. A. Cymerman, I. Chung, B. M. Beckmann, J. M. Bujnicki, and G. Meiss EXOG, a novel paralog of Endonuclease G in higher eukaryotes Nucleic Acids Res., March 27, 2008; 36(4): 1369 - 1379. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. C. Hoopman, W. Wang, C. A. Brautigam, J. L. Sedillo, T. J. Reilly, and E. J. Hansen Moraxella catarrhalis Synthesizes an Autotransporter That Is an Acid Phosphatase J. Bacteriol., February 15, 2008; 190(4): 1459 - 1472. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Tartari, C. Gissi, V. Lo Sardo, C. Zuccato, E. Picardi, G. Pesole, and E. Cattaneo Phylogenetic Comparison of Huntingtin Homologues Reveals the Appearance of a Primitive polyQ in Sea Urchin Mol. Biol. Evol., February 1, 2008; 25(2): 330 - 338. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. M. Szymanski, D. Binns, R. Bartz, N. V. Grishin, W.-P. Li, A. K. Agarwal, A. Garg, R. G. W. Anderson, and J. M. Goodman The lipodystrophy protein seipin is found at endoplasmic reticulum lipid droplet junctions and is important for droplet morphology PNAS, December 26, 2007; 104(52): 20890 - 20895. [Abstract] [Full Text] [PDF] |
||||










