Bioinformatics Advance Access published online on July 12, 2006
Bioinformatics, doi:10.1093/bioinformatics/btl369
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK
* To whom correspondence should be addressed.
Motivation: There is a growing literature on the detection of Horizontal Gene Transfer (HGT) events by means of parametric, non-comparative methods. Such approaches rely only on sequence information and utilize different low and high order indices to capture compositional deviation from the genome backbone; the superiority of the latter over the former has been shown elsewhere. However even high order k-mers may be poor estimators of HGT, when insufficient information is available, e.g. in short sliding windows. Most of the current HGT prediction methods require pre-existing annotation, which may restrict their application on newly sequenced genomes. Results: We introduce a novel computational method, Interpolated Variable Order Motifs (IVOMs), that exploits compositional biases using variable order motif distributions and captures more reliably the local composition of a sequence compared to fixed-order methods. For optimal localization of the boundaries of each predicted region, a 2nd order, 2-state Hidden Markov Model (HMM) is implemented in a change-point detection framework. We applied the IVOM approach to the genome of Salmonella enterica serovar Typhi CT18, a well-studied prokaryote in terms of HGT events, and we show that the IVOMs outperform state-of-the-art low and high order motif methods predicting not only the already characterized Salmonella Pathogenicity Islands (SPI-1 to SPI-10) but also three novel SPIs (SPI-15, SPI-16, SPI-17) and other HGT events. Availability: The software is available under a GPL license as a standalone application at http://www.sanger.ac.uk/Software/analysis/alien_hunter. Supplementary Information: Supplementary data are available at Bioinformatics online.
Received May 3, 2006
Revised June 22, 2006
Accepted July 3, 2006
Article
Interpolated variable order motifs for identification of horizontally acquired DNA: revisiting the Salmonella pathogenicity islands
Georgios S. Vernikos 1 *
and
Julian Parkhill 1
Georgios S. Vernikos, E-mail: gsv{at}sanger.ac.uk
![]()
Abstract
Associate Editor: John Quackenbush
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
M. Letek, A. A. Ocampo-Sosa, M. Sanders, U. Fogarty, T. Buckley, D. P. Leadon, P. Gonzalez, M. Scortti, W. G. Meijer, J. Parkhill, et al. Evolution of the Rhodococcus equi vap Pathogenicity Island Seen through Comparison of Host-Associated vapA and vapB Virulence Plasmids J. Bacteriol., September 1, 2008; 190(17): 5797 - 5805. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. M. Buboltz, T. L. Nicholson, M. R. Parette, S. E. Hester, J. Parkhill, and E. T. Harvill Replacement of Adenylate Cyclase Toxin in a Lineage of Bordetella bronchiseptica J. Bacteriol., August 1, 2008; 190(15): 5502 - 5511. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. M. Pearson, M. Sebaihia, C. Churcher, M. A. Quail, A. S. Seshasayee, N. M. Luscombe, Z. Abdellah, C. Arrosmith, B. Atkin, T. Chillingworth, et al. Complete Genome Sequence of Uropathogenic Proteus mirabilis, a Master of both Adherence and Motility J. Bacteriol., June 1, 2008; 190(11): 4027 - 4037. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. P. Stinear, T. Seemann, P. F. Harrison, G. A. Jenkin, J. K. Davies, P. D.R. Johnson, Z. Abdellah, C. Arrowsmith, T. Chillingworth, C. Churcher, et al. Insights from the complete genome sequence of Mycobacterium marinum on the evolution of Mycobacterium tuberculosis Genome Res., May 1, 2008; 18(5): 729 - 741. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. D. Saroj, R. Shashidhar, M. Karani, and J. R. Bandekar Distribution of Salmonella pathogenicity island (SPI)-8 and SPI-10 among different serotypes of Salmonella J. Med. Microbiol., April 1, 2008; 57(4): 424 - 427. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. S. Vernikos and J. Parkhill Resolving the structural features of genomic islands: A machine learning approach Genome Res., February 1, 2008; 18(2): 331 - 342. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. R. Zaneveld, D. R. Nemergut, and R. Knight Are all horizontal gene transfers created equal? Prospects for mechanism-based studies of HGT patterns Microbiology, January 1, 2008; 154(1): 1 - 15. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. K. Azad and J. G. Lawrence Detecting laterally transferred genes: use of entropic clustering methods and genome position Nucleic Acids Res., July 9, 2007; 35(14): 4629 - 4639. [Abstract] [Full Text] [PDF] |
||||




