Bioinformatics Advance Access published online on September 3, 2004
Bioinformatics, doi:10.1093/bioinformatics/bti007
Bioinformatics © Oxford University Press 2004; all rights reserved
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 Department of Biochemistry, University of Otago, PO Box 56, Dunedin, New Zealand
* To whom correspondence should be addressed. E-mail: chris.brown{at}otago.ac.nz.
Motivation: Overlapping gene coding sequences (CDSs) are particularly common in viruses but also occur in more complex genomes. Detecting such genes with conventional gene-finding algorithms can be difficult for several reasons. If an overlapping CDS is on the same read-strand as a known CDS, then there may not be a distinct promoter or mRNA. Furthermore, the constraints imposed by double-coding can result in atypical codon biases. However these same constraints lead to particular mutation patterns that may be detectable in sequence alignments. Results: In this paper we investigate several statistics for detecting double-coding sequences with pairwise alignments - including a new maximum likelihood method. We also develop a model for double-coding sequence evolution. Using simulated sequences generated with the model, we characterize the distribution of each statistic as a function of sequence composition, length, divergence time and double-coding frame. Using these results, we develop several algorithms for detecting overlapping CDSs. The algorithms are tested on known overlapping CDSs and other overlapping open reading frames (ORFs) in the hepatitis B (HBV), Escherichia coli and Salmonella typhimurium genomes. The algorithms should prove useful for detecting novel overlapping genes - especially short coding ORFs in viruses. Availability: Programmes may be obtained from the authors. Supplementary information: http://biochem.otago.ac.nz/double.html.
Revised July 29, 2004
Accepted August 26, 2004
Article
Detecting overlapping coding sequences with pairwise alignments
![]()
Abstract ![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
B. Y.-W. Chung, W. A. Miller, J. F. Atkins, and A. E. Firth An overlapping essential gene in the Potyviridae PNAS, April 15, 2008; 105(15): 5897 - 5902. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. McCauley, S. de Groot, T. Mailund, and J. Hein Annotation of selection strengths in viral genomes Bioinformatics, November 15, 2007; 23(22): 2978 - 2986. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Belshaw, O. G. Pybus, and A. Rambaut The evolution of genome compression and genomic novelty in RNA viruses Genome Res., October 1, 2007; 17(10): 1496 - 1504. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. de Groot, T. Mailund, and J. Hein Comparative annotation of viral genomes with non-conserved gene structure Bioinformatics, May 1, 2007; 23(9): 1080 - 1089. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. McCauley and J. Hein Using hidden Markov models and observed evolution to annotate viral genomes Bioinformatics, June 1, 2006; 22(11): 1308 - 1316. [Abstract] [Full Text] [PDF] |
||||


