meta-MEME: Motif-based hidden Markov models of protein families
Department of Computer Science and Engineering, University of California, San Diego, La Jolla, CA 92093, USA
1Department of Medicine, University of California, San Diego, La Jolla, CA 92093, USA
2San Diego Supercomputer Center, PO Box 85608, San Diego, CA 92186, USA
3To whom correspondence should be addressed
MOTIVATION: Modeling families of related biological sequences using hidden Markov models (HMMs), although increasingly widespread, faces at least one major problem: because of the complexity of these mathematical models, they require a relatively large training set in order to recognize a given family accurately. For families in which there are few known sequences, a standard linear HMM contains too many parameters to be trained adequately.
RESULTS: This work attempts to solve that problem by generating smaller HMMs that precisely model only the conserved regions of the family. These HMMs are constructed from motif models generated by the EM algorithm using the MEME software. Because motif-based HMMs have relatively few parameters, they can be trained using smaller data sets. Studies of short chain alcohol dehydrogenases and 4Fe-4S ferredoxins support the claim that motif-based HMMs exhibit increased sensitivity and selectivity in database searches, especially when training sets contain few sequences.
AVAILABILITY: http://www.sdsc.edu/MEME
CONTACT: bgrundy{at}cs.ucsd.edu
Received on November 5, 1996; accepted on January 14, 1997
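The parameter-count argument in the abstract can be made concrete with a rough sketch. The code below is illustrative only and is not taken from the paper: it assumes a hypothetical linear profile HMM with one match, one insert and one delete state per position, and a hypothetical motif-based model in which only motif columns carry trainable emissions while each spacer region contributes a single free self-loop probability. The function names, the state layout, the protein length and the motif widths are all assumptions chosen for illustration.

```python
from typing import Sequence

AMINO_ACIDS = 20
# A distribution over 20 amino acids has 19 free values (they sum to 1).
FREE_EMISSIONS = AMINO_ACIDS - 1


def linear_hmm_free_params(length: int) -> int:
    """Free parameters of an assumed linear profile HMM.

    Assumption: each position has a match and an insert state (each with
    its own emission distribution) plus a silent delete state, and each
    of the three states has 3 outgoing transitions (2 free values each).
    """
    emissions = 2 * FREE_EMISSIONS   # match + insert emissions
    transitions = 3 * 2              # 3 states x 2 free transition values
    return length * (emissions + transitions)


def motif_hmm_free_params(motif_widths: Sequence[int]) -> int:
    """Free parameters of an assumed motif-based HMM.

    Assumption: every motif column has one trainable emission
    distribution, and each spacer (before, between and after motifs)
    uses fixed background emissions and one free self-loop probability.
    """
    emissions = sum(motif_widths) * FREE_EMISSIONS
    spacers = len(motif_widths) + 1
    return emissions + spacers


if __name__ == "__main__":
    # Hypothetical family: 250 residues long, three motifs.
    print(linear_hmm_free_params(250))
    print(motif_hmm_free_params([15, 20, 12]))
```

Under these assumptions the motif-based model has roughly an order of magnitude fewer free parameters than the full-length linear model, which is the intuition behind training it on smaller data sets. The exact ratio depends on the architecture actually used.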