Bioinformatics Advance Access published online on January 19, 2007
Bioinformatics, doi:10.1093/bioinformatics/btm009
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Identifying bacterial genes and endosymbiont DNA with Glimmer
1Center for Bioinformatics & Computational Biology, University of Maryland, College Park, MD 20742
2Smurfit Institute of Genetics, University of Dublin, Trinity College Dublin, Dublin 2, Ireland
3Department of Chemical & Biomolecular Engineering, Johns Hopkins University, Baltimore, MD 21218
*To whom correspondence should be addressed. Arthur L. Delcher, E-mail: adelcher{at}umiacs.umd.edu
| Abstract |
|---|
Motivation: The Glimmer gene-finding software has been successfully used for finding genes in bacteria, archæa, and viruses representing hundreds of species. We describe several major changes to the Glimmer system, including improved methods for identifying both coding regions and start codons. We also describe a new module of Glimmer that can distinguish host and endosymbiont DNA. This module was developed in response to the discovery that eukaryotic genome sequencing projects sometimes inadvertently capture the DNA of intracellular bacteria living in the host.
Results: The new methods dramatically reduce the rate of false-positive predictions, while maintaining Glimmers 99% sensitivity rate at detecting genes in most species, and they find substantially more correct start sites, as measured by comparisons to known and well-curated genes. We show that our interpolated Markov model (IMM) DNA discriminator correctly separated 99% of the sequences in a recent genome project that produced a mixture of sequences from the bacterium Prochloron didemni and its sea squirt host, Lissoclinum patella.
Availability: Glimmer is OSI Certified Open Source and available at http://cbcb.umd.edu/software/glimmer
Associate Editor: Alfonso Valencia
Received on August 3, 2006; revised on December 15, 2006; accepted on January 14, 2007
This article has been cited by other articles:
![]() |
J. J. van Aartsen The Klebsiella pheV tRNA locus: a hotspot for integration of alien genomic islands Bioscience Horizons, March 1, 2008; 1(1): 51 - 60. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Starcevic, S. Akthar, W. C. Dunlap, J. M. Shick, D. Hranueli, J. Cullum, and P. F. Long Enzymes of the shikimic acid pathway encoded in the genome of a basal metazoan, Nematostella vectensis, have microbial origins PNAS, February 19, 2008; 105(7): 2533 - 2537. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Tuanyok, R. K. Auerbach, T. S. Brettin, D. C. Bruce, A. C. Munk, J. C. Detter, T. Pearson, H. Hornstra, R. W. Sermswan, V. Wuthiekanun, et al. A Horizontal Gene Transfer Event Defines Two Distinct Groups within Burkholderia pseudomallei That Have Dissimilar Geographic Distributions J. Bacteriol., December 15, 2007; 189(24): 9044 - 9049. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. P. McCutcheon and N. A. Moran Parallel genomic evolution and metabolic interdependence in an ancient symbiosis PNAS, December 4, 2007; 104(49): 19392 - 19397. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. J. Hillson, P. Hu, G. L. Andersen, and L. Shapiro Caulobacter crescentus as a Whole-Cell Uranium Biosensor Appl. Envir. Microbiol., December 1, 2007; 73(23): 7615 - 7621. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Sulakhe, M. D'Souza, M. Syed, A. Rodriguez, Y. Zhang, E. M. Glass, M. F. Romine, and N. Maltsev GNARE--a grid-based server for the analysis of user submitted genomes Nucleic Acids Res., May 25, 2007; (2007) gkm366v1. [Abstract] [Full Text] [PDF] |
||||




