Bioinformatics, Vol 14, 772-782, Copyright © 1998 by Oxford University Press


ARTICLES

Weighting hidden Markov models for maximum discrimination

R Karchin and R Hughey
Department of Computer Engineering, Jack Baskin School of Engineering, University of California, Santa Cruz, CA 95064, USA. rph@cse.ucsc.edu

MOTIVATION: Hidden Markov models can efficiently and automatically build statistical representations of related sequences. Unfortunately, training sets are frequently biased toward one subgroup of sequences, leading to an insufficiently general model. This work evaluates sequence weighting methods based on the maximum-discrimination idea.

RESULTS: One good method scales sequence weights by an exponential that ranges between 0.1 for the best scoring sequence and 1.0 for the worst. Experiments with a curated data set show that while training with one or two sequences performed worse than single-sequence Probabilistic Smith-Waterman, training with five or ten sequences reduced errors by 20% and 51%, respectively. This new version of the SAM HMM suite outperforms HMMer (17% reduction over PSW for 10 training sequences), Meta-MEME (28% reduction), and unweighted SAM (31% reduction).

AVAILABILITY: A WWW server, as well as information on obtaining the Sequence Alignment and Modeling (SAM) software suite and additional data from this work, can be found at http://www.cse.ucsc.edu/research/compbio/sam.html
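The exponential weighting mentioned in the results can be illustrated with a small sketch. The abstract gives only the endpoints (0.1 for the best-scoring sequence, 1.0 for the worst), so the interpolation below is an assumption and not necessarily the formula used in SAM: it treats a higher score as a better fit and decays the weight exponentially with the sequence's normalized position between the worst and best scores. The function name `exponential_weights` and its parameters are hypothetical.

```python
import math


def exponential_weights(scores, w_best=0.1, w_worst=1.0):
    """Map per-sequence scores to training weights.

    Assumptions (not taken from the paper): a higher score means the
    sequence already fits the model better, and the weight decays
    exponentially from w_worst (worst score) to w_best (best score).
    """
    best, worst = max(scores), min(scores)
    if best == worst:
        # All sequences score the same, so none is down-weighted.
        return [w_worst] * len(scores)

    # Decay rate chosen so that t = 1 (best score) gives exactly w_best.
    rate = math.log(w_best / w_worst)
    weights = []
    for s in scores:
        t = (s - worst) / (best - worst)  # 0 for worst, 1 for best
        weights.append(w_worst * math.exp(rate * t))
    return weights


# Hypothetical scores for three training sequences.
weights = exponential_weights([10.0, 5.0, 0.0])
```

Under these assumptions, sequences the model already scores well contribute less to the next round of training, which pushes the model toward the under-represented members of the family.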


This article has been cited by other articles:


K. Karplus, R. Karchin, G. Shackelford, and R. Hughey
Calibrating E-values for hidden Markov models using reverse-sequence null models
Bioinformatics, November 15, 2005; 21(22): 4107 - 4115.


C. R. Johnston and D. C. Shields
A sequence sub-sampling algorithm increases the power to detect distant homologues
Nucleic Acids Res., July 8, 2005; 33(12): 3772 - 3778.


H. D. Cho, C. L. Verlinde, and A. M. Weiner
Archaeal CCA-adding Enzymes: CENTRAL ROLE OF A HIGHLY CONSERVED β-TURN MOTIF IN RNA POLYMERIZATION WITHOUT TRANSLOCATION
J. Biol. Chem., March 11, 2005; 280(10): 9555 - 9566.


J. S. Rest and D. P. Mindell
Retroids in Archaea: Phylogeny and Lateral Origins
Mol. Biol. Evol., July 1, 2003; 20(7): 1134 - 1142.


A. Turchin and I. S. Kohane
Gene homology resources on the World Wide Web
Physiol. Genomics, December 3, 2002; 11(3): 165 - 177.


