Bioinformatics Advance Access originally published online on September 3, 2004
Bioinformatics 2005 21(3):282-292; doi:10.1093/bioinformatics/bti007
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Bioinformatics vol. 21 issue 3 © Oxford University Press 2005; all rights reserved.
Detecting overlapping coding sequences with pairwise alignments
Department of Biochemistry, University of Otago P.O. Box 56, Dunedin, New Zealand
*To whom correspondence should be addressed.
Motivation: Overlapping gene coding sequences (CDSs) are particularly common in viruses but also occur in more complex genomes. Detecting such genes with conventional gene-finding algorithms can be difficult for several reasons. If an overlapping CDS is on the same read-strand as a known CDS, then there may not be a distinct promoter or mRNA. Furthermore, the constraints imposed by double-coding can result in atypical codon biases. However, these same constraints lead to particular mutation patterns that may be detectable in sequence alignments.
Results: In this paper, we investigate several statistics for detecting double-coding sequences with pairwise alignmentsincluding a new maximum-likelihood method. We also develop a model for double-coding sequence evolution. Using simulated sequences generated with the model, we characterize the distribution of each statistic as a function of sequence composition, length, divergence time and double-coding frame. Using these results, we develop several algorithms for detecting overlapping CDSs.
The algorithms were tested on known overlapping CDSs and other overlapping open reading frames (ORFs) in the hepatitis B virus (HBV), Escherichia coli and Salmonella typhimurium genomes. The algorithms should prove useful for detecting novel overlapping genesespecially short coding ORFs in viruses.
Availability: Programs may be obtained from the authors.
Contact: chris.brown{at}otago.ac.nz
Supplementary information: http://biochem.otago.ac.nz/double.html
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
B. Y.-W. Chung, W. A. Miller, J. F. Atkins, and A. E. Firth An overlapping essential gene in the Potyviridae PNAS, April 15, 2008; 105(15): 5897 - 5902. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. McCauley, S. de Groot, T. Mailund, and J. Hein Annotation of selection strengths in viral genomes Bioinformatics, November 15, 2007; 23(22): 2978 - 2986. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Belshaw, O. G. Pybus, and A. Rambaut The evolution of genome compression and genomic novelty in RNA viruses Genome Res., October 1, 2007; 17(10): 1496 - 1504. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. de Groot, T. Mailund, and J. Hein Comparative annotation of viral genomes with non-conserved gene structure Bioinformatics, May 1, 2007; 23(9): 1080 - 1089. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. McCauley and J. Hein Using hidden Markov models and observed evolution to annotate viral genomes Bioinformatics, June 1, 2006; 22(11): 1308 - 1316. [Abstract] [Full Text] [PDF] |
||||


