Bioinformatics, Vol 14, 367-371, Copyright © 1998 by Oxford University Press
NP Brown, C Sander and P Bork
MOTIVATION: The underlying error rate for genomic sequencing sometimes
results in the introduction of artificial frameshifts and in-frame stop
codons into putative protein encoding genes. Severe errors are then
introduced into the inferred transcripts through mis-translation or
premature termination. RESULTS: We describe a system for screening segments
of DNA for frameshift and in-frame stop errors in coding regions. The
method is based on homology matching using blastx to compare all six
reading frames of the query nucleotide sequence against selected protein
sequence databases. Fragments of protein matching neighbouring regions of
the query DNA are united and extended laterally to define candidate open
reading frames, within which, frameshifts and stops are identified.
Suitable targets include prokaryotic or other intron-free genomic sequence
and complementary DNAs. As an example of its use, we report here two
frameshifted ORFs that deviate from the original TIGR sequence annotations
for the recently released Helicobacter pylori genome. AVAILABILITY: The
tool is accessible via the URL http://www.sander.ebi.ac.uk/frame/. CONTACT:
brown@ebi.ac.uk.
ARTICLES
Frame: detection of genomic sequencing errors
1European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Cambridge, CB10 1SD, UK. brown@ebi.ac.uk
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
N. Gupta, J. Benhamida, V. Bhargava, D. Goodman, E. Kain, I. Kerman, N. Nguyen, N. Ollikainen, J. Rodriguez, J. Wang, et al. Comparative proteogenomics: Combining mass spectrometry and comparative genomics to analyze multiple genomes Genome Res., July 1, 2008; 18(7): 1133 - 1142. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Perrodou, C. Deshayes, J. Muller, C. Schaeffer, A. Van Dorsselaer, R. Ripp, O. Poch, J.-M. Reyrat, and O. Lecompte ICDS database: interrupted CoDing sequences in prokaryotic genomes Nucleic Acids Res., January 1, 2006; 34(suppl_1): D338 - D343. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Schiex, J. Gouzy, A. Moisan, and Y. de Oliveira FrameD: a flexible program for quality check and gene prediction in prokaryotic genomes and noisy matured eukaryotic sequences Nucleic Acids Res., July 1, 2003; 31(13): 3738 - 3741. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Médigue, M. Rose, A. Viari, and A. Danchin Detecting and Analyzing DNA Sequencing Errors: Toward a Higher Quality of the Bacillus subtilis Genome Sequence Genome Res., November 1, 1999; 9(11): 1116 - 1127. [Abstract] [Full Text] |
||||

