Bioinformatics Vol. 15 no. 12 1999
Pages 987-993
© 1999 Oxford University Press
Modeling and predicting transcriptional units of Escherichia coligenes using hidden Markov models
1 Genomic Sciences Center, RIKEN, c/o
Laboratory of Genome Database, Human Genome Center, Institute of
Medical Science, University of Tokyo, 4-6-1 Shirokane-dai
Minato-ku, Tokyo 108-8639
2 Institute for Chemical Research, Kyoto
University, Gokashou, Uji, Kyoto 611-0011
3 Human Genome Center, Institute of Medical
Science, University of Tokyo, 4-6-1 Shirokane-dai, Minato-ku, Tokyo
108-8639, Japan
To whom correspondence should be addressed.
Motivation: The hidden Markov model (HMM) is a valuable technique for gene-finding, especially because its flexibility enables the inclusion of various sequence features. Recent programs for bacterial gene-finding include the information of ribosomal binding site (RBS) to improve the recognition accuracy of the start codon, using this feature. We report here our attempt to extend the model into the total transcriptional unit, enabling the prediction of operon structures.
Results: First, we improved the prediction accuracy of coding sequences (CDSs) by employing the models of typical, atypical and negative (false-positive) classes as well as the models of RBS and its downstream spacer. The sensitivity of exactly predicting the 204 experimentally confirmed CDSs reached 90.2% in an objective test. Based on the prediction result of CDSs, the positions of the promoters and terminators were predicted. Our model could exactly recognize 60% of 390 known transcriptional units. Thus, the accuracy and significance of this prediction problem is far from trivial. We would like to propose this problem as an open theme in bioinformatics because the ongoing or planned post-sequencing projects will produce much data for future improvements.
Availability: The table of predicted transcriptional units of Escherichia coli will be distributed upon request.
Contact: knakai{at}ims.u-tokyo.ac.jp
Received on May 14, 1999
; accepted on August 12, 1999
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
R. W. W. Brouwer, O. P. Kuipers, and S. A. F. T. van Hijum The relative value of operon predictions Brief Bioinform, September 1, 2008; 9(5): 367 - 375. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Naito, H. Hirakawa, A. Yamashita, N. Ohara, M. Shoji, H. Yukitake, K. Nakayama, H. Toh, F. Yoshimura, S. Kuhara, et al. Determination of the Genome Sequence of Porphyromonas gingivalis Strain ATCC 33277 and Genomic Comparison with Strain W83 Revealed Extensive Genome Rearrangements in P. gingivalis DNA Res, June 3, 2008; (2008) dsn013v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Singhal, B. Jayaram, S. B. Dixit, and D. L. Beveridge Prokaryotic Gene Finding Based on Physicochemical Characteristics of Codons Calculated from Molecular Dynamics Simulations Biophys. J., June 1, 2008; 94(11): 4173 - 4183. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Goto, A. Yamashita, H. Hirakawa, M. Matsutani, K. Todo, K. Ohshima, H. Toh, K. Miyamoto, S. Kuhara, M. Hattori, et al. Complete Genome Sequence of Finegoldia magna, an Anaerobic Opportunistic Pathogen DNA Res, February 7, 2008; (2008) dsm030v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Charaniya, S. Mehra, W. Lian, K. P. Jayapal, G. Karypis, and W.-S. Hu Transcriptome dynamics-based operon prediction and verification in Streptomyces coelicolor Nucleic Acids Res., December 18, 2007; 35(21): 7222 - 7236. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. A. Kolchanov, T. I. Merkulova, E. V. Ignatieva, E. A. Ananko, D. Yu. Oshchepkov, V. G. Levitsky, G. V. Vasiliev, N. V. Klimova, V. M. Merkulov, and T. C. Hodgman Combined experimental and computational approaches to study the regulatory elements in eukaryotic genes Brief Bioinform, July 12, 2007; (2007) bbm027v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. H. Bergman, K. D. Passalacqua, P. C. Hanna, and Z. S. Qin Operon Prediction for Sequenced Bacterial Genomes without Experimental Information Appl. Envir. Microbiol., February 1, 2007; 73(3): 846 - 854. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Dam, V. Olman, K. Harris, Z. Su, and Y. Xu Operon prediction using both genome-specific and general genomic information Nucleic Acids Res., January 12, 2007; 35(1): 288 - 298. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Noguchi, J. Park, and T. Takagi MetaGene: prokaryotic gene finding from environmental genome shotgun sequences Nucleic Acids Res., November 14, 2006; 34(19): 5623 - 5630. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Azuma, H. Hirakawa, A. Yamashita, Y. Cai, M. A. Rahman, H. Suzuki, S. Mitaku, H. Toh, S. Goto, T. Murakami, et al. Genome Sequence of the Cat Pathogen, Chlamydophila felis DNA Res, January 1, 2006; 13(1): 15 - 23. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Okuda, T. Katayama, S. Kawashima, S. Goto, and M. Kanehisa ODB: a database of operons accumulating known operons across multiple genomes Nucleic Acids Res., January 1, 2006; 34(suppl_1): D358 - D362. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Jacob, R. Sasikumar, and K. N. R. Nair A fuzzy guided genetic algorithm for operon prediction Bioinformatics, April 15, 2005; 21(8): 1403 - 1407. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. P. Westover, J. D. Buhler, J. L. Sonnenburg, and J. I. Gordon Operon prediction without a training set Bioinformatics, April 1, 2005; 21(7): 880 - 888. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Kuwahara, A. Yamashita, H. Hirakawa, H. Nakayama, H. Toh, N. Okada, S. Kuhara, M. Hattori, T. Hayashi, and Y. Ohnishi Genomic analysis of Bacteroides fragilis reveals extensive DNA inversions regulating cell surface adaptation PNAS, October 12, 2004; 101(41): 14919 - 14924. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Wang, J. D. Trawick, R. Yamamoto, and C. Zamudio Genome-wide operon prediction in Staphylococcus aureus Nucleic Acids Res., July 13, 2004; 32(12): 3689 - 3702. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. B.L. Alkema, B. Lenhard, and W. W. Wasserman Regulog Analysis: Detection of Conserved Regulatory Networks Across Bacteria: Application to Staphylococcus aureus Genome Res., July 1, 2004; 14(7): 1362 - 1373. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. J. Paredes, I. Rigoutsos, and E. T. Papoutsakis Transcriptional organization of the Clostridium acetobutylicum genome Nucleic Acids Res., April 1, 2004; 32(6): 1973 - 1981. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Vogel, I. M. Axmann, H. Herzel, and W. R. Hess Experimental and computational analysis of transcriptional start sites in the cyanobacterium Prochlorococcus MED4 Nucleic Acids Res., June 1, 2003; 31(11): 2890 - 2899. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Zheng, J. D. Szustakowski, L. Fortnow, R. J. Roberts, and S. Kasif Computational Identification of Operons in Microbial Genomes Genome Res., August 1, 2002; 12(8): 1221 - 1230. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. A. Lesnik, R. Sampath, H. B. Levene, T. J. Henderson, J. A. McNeil, and D. J. Ecker Prediction of rho-independent transcriptional terminators in Escherichia coli Nucleic Acids Res., September 1, 2001; 29(17): 3583 - 3594. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Besemer, A. Lomsadze, and M. Borodovsky GeneMarkS: a self-training method for prediction of gene starts in microbial genomes. Implications for finding sequence motifs in regulatory regions Nucleic Acids Res., June 15, 2001; 29(12): 2607 - 2618. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Tan, G. Moreno-Hagelsieb, J. Collado-Vides, and G. D. Stormo A Comparative Genomics Approach to Prediction of New Members of Regulons Genome Res., April 1, 2001; 11(4): 566 - 584. [Abstract] [Full Text] |
||||
![]() |
M. D. Ermolaeva, O. White, and S. L. Salzberg Prediction of operons in microbial genomes Nucleic Acids Res., March 1, 2001; 29(5): 1216 - 1221. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Shimizu, K. Ohtani, H. Hirakawa, K. Ohshima, A. Yamashita, T. Shiba, N. Ogasawara, M. Hattori, S. Kuhara, and H. Hayashi Complete genome sequence of Clostridium perfringens, an anaerobic flesh-eater PNAS, January 22, 2002; 99(2): 996 - 1001. [Abstract] [Full Text] [PDF] |
||||







