Bioinformatics Vol. 15 no. 11 1999
Pages 874-886
© 1999 Oxford University Press
Finding prokaryotic genes by the frame-by-frame algorithm: targeting gene starts and overlapping genes
1 Russian Academy of Science, The Institute
for Problems in Mechanics, Moscow 11526, Russia
2 School of Biology, Georgia Institute of
Technology, Atlanta, USA
Mark Borodovsky
Motivation: Tightly packed prokaryotic genes frequently overlap with each other. This feature, rarely seen in eukaryotic DNA, makes detection of translation initiation sites and, therefore, exact predictions of prokaryotic genes notoriously difficult. Improving the accuracy of precise gene prediction in prokaryotic genomic DNA remains an important open problem.
Results: A software program implementing a new algorithm utilizing a uniform Hidden Markov Model for prokaryotic gene prediction was developed. The algorithm analyzes a given DNA sequence in each of six possible global reading frames independently. Twelve complete prokaryotic genomes were analyzed using the new tool. The accuracy of gene finding, predicting locations of protein-coding ORFs, as well as the accuracy of precise gene prediction, and detecting the whole gene including translation initiation codon were assessed by comparison with existing annotation. It was shown that in terms of gene finding, the program performs at least as well as the previously developed tools, such as GeneMark and GLIMMER. In terms of precise gene prediction the new program was shown to be more accurate, by several percentage points, than earlier developed tools, such as GeneMark.hmm, ECOPARSE and ORPHEUS. The results of testing the program indicated the possibility of systematic bias in start codon annotation in several early sequenced prokaryotic genomes.
Availability: The new gene-finding program can be accessed through the Web site: http://dixie.biology.gatech.edu/GeneMark/fbf.cgi
Contact: mark{at}amber.gatech.edu
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
J. Xiong, C. E. Bauer, and A. Pancholy Insight into the haem d1 biosynthesis pathway in heliobacteria through bioinformatics analysis Microbiology, October 1, 2007; 153(10): 3548 - 3562. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Noguchi, J. Park, and T. Takagi MetaGene: prokaryotic gene finding from environmental genome shotgun sequences Nucleic Acids Res., November 14, 2006; 34(19): 5623 - 5630. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Wang, H.-B. Zhang, G. Chen, L. Chen, and L.-H. Zhang Dual Control of Quorum Sensing by Two TraM-Type Antiactivators in Agrobacterium tumefaciens Octopine Strain A6. J. Bacteriol., April 1, 2006; 188(7): 2435 - 2445. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Walker, V. Pavlovic, and S. Kasif A comparative genomic method for computational identification of prokaryotic translation initiation sites Nucleic Acids Res., July 15, 2002; 30(14): 3181 - 3191. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Shibuya and I. Rigoutsos Dictionary-driven prokaryotic gene finding Nucleic Acids Res., June 15, 2002; 30(12): 2710 - 2725. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Vizcaino, A. Cloeckaert, M. S. Zygmunt, and L. Fernandez-Lago Characterization of a Brucella Species 25-Kilobase DNA Fragment Deleted from Brucella abortus Reveals a Large Gene Cluster Related to the Synthesis of a Polysaccharide Infect. Immun., November 1, 2001; 69(11): 6738 - 6748. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Besemer, A. Lomsadze, and M. Borodovsky GeneMarkS: a self-training method for prediction of gene starts in microbial genomes. Implications for finding sequence motifs in regulatory regions Nucleic Acids Res., June 15, 2001; 29(12): 2607 - 2618. [Abstract] [Full Text] [PDF] |
||||



