Bioinformatics Vol. 19 Suppl. 2 2003
pages ii103-ii112
© 2003 Oxford University Press
Modeling sequencing errors by combining Hidden Markov models
1 Swiss Institute of Bioinformatics,
Switzerland
2 Swiss Institute for Experimental Cancer
Research, Switzerland
3 Office of Information Technology, Ludwig
Institute for Cancer Research, chemin des Boveresses 155, CH-1066
Epalinges s/Lausanne, Switzerland
Received on March 17, 2003
; accepted on June 9, 2003
Among the largest resources for biological sequence data is the large amount of expressed sequence tags (ESTs) available in public and proprietary databases. ESTs provide information on transcripts but for technical reasons they often contain sequencing errors. Therefore, when analyzing EST sequences computationally, such errors must be taken into account. Earlier attempts to model error prone coding regions have shown good performance in detecting and predicting these while correcting sequencing errors using codon usage frequencies. In the research presented here, we improve the detection of translation start and stop sites by integrating a more complex mRNA model with codon usage bias based error correction into one hidden Markov model (HMM), thus generalizing this error correction approach to more complex HMMs. We show that our method maintains the performance in detecting coding sequences.
Keywords: coding region prediction, sequencing errors, expressed sequence tags, hidden Markov models.
Contact: Claudio.Lottaz{at}molgen.mpg.de
* To whom correspondence should be addressed. Current address: Max-Planck-Institute for Molecular Genetics, Ihnestr. 73, D-14195 Berlin (Germany)
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
C. H. Pashley, J. R. Ellis, D. E. McCauley, and J. M. Burke EST Databases as a Source for Molecular Markers: Lessons from Helianthus J. Hered., July 1, 2006; 97(4): 381 - 388. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. A. Udall, J. M. Swanson, K. Haller, R. A. Rapp, M. E. Sparks, J. Hatfield, Y. Yu, Y. Wu, C. Dowd, A. B. Arpat, et al. A global assembly of cotton ESTs Genome Res., March 1, 2006; 16(3): 441 - 450. [Abstract] [Full Text] [PDF] |
||||

