Bioinformatics Advance Access originally published online on September 28, 2004
Bioinformatics 2005 21(5):601-607; doi:10.1093/bioinformatics/bti047
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Improving promoter prediction Improving promoter prediction for the NNPP2.2 algorithm: a case study using Escherichia coli DNA sequences
1 Department of Mathematics and Applied Statistics, University of Wollongong Wollongong, NSW 2522, Australia
2 Department of Biological Sciences, University of Wollongong Wollongong, NSW 2522, Australia
*To whom correspondence should be addressed.
Motivation: Although a great deal of research has been undertaken in the area of promoter prediction, prediction techniques are still not fully developed. Many algorithms tend to exhibit poor specificity, generating many false positives, or poor sensitivity. The neural network prediction program NNPP2.2 is one such example.
Results: To improve the NNPP2.2 prediction technique, the distance between the transcription start site (TSS) associated with the promoter and the translation start site (TLS) of the subsequent gene coding region has been studied for Escherichia coli K12 bacteria. An empirical probability distribution that is consistent for all E.coli promoters has been established. This information is combined with the results from NNPP2.2 to create a new technique called TLSNNPP, which improves the specificity of promoter prediction. The technique is shown to be effective using E.coli DNA sequences, however, it is applicable to any organism for which a set of promoters has been experimentally defined.
Availability: The data used in this project and the prediction results for the tested sequences can be obtained from http://www.uow.edu.au/~yanxia/E_Coli_paper/SBurden_Results.xls
Contact: alh98{at}uow.edu.au
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
J. Zeng, S. Zhu, and H. Yan Towards accurate human promoter recognition: a review of currently used sequence features and classification methods Brief Bioinform, September 1, 2009; 10(5): 498 - 508. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Zhang, E. Li, and G. J. Olsen Protein-coding gene promoters in Methanocaldococcus (Methanococcus) jannaschii Nucleic Acids Res., June 1, 2009; 37(11): 3588 - 3601. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Collado-Vides, H. Salgado, E. Morett, S. Gama-Castro, V. Jimenez-Jacinto, I. Martinez-Flores, A. Medina-Rivera, L. Muniz-Rascado, M. Peralta-Gil, and A. Santos-Zavaleta Bioinformatics Resources for the Study of Gene Regulation in Bacteria J. Bacteriol., January 1, 2009; 191(1): 23 - 31. [Full Text] [PDF] |
||||
![]() |
P. S. Hefty and R. S. Stephens Chlamydial Type III Secretion System Is Encoded on Ten Operons Preceded by Sigma 70-Like Promoter Elements J. Bacteriol., January 1, 2007; 189(1): 198 - 206. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Nonaka, M. Blankschien, C. Herman, C. A. Gross, and V. A. Rhodius Regulon and promoter analysis of the E. coli heat-shock factor, {sigma}32, reveals a multifaceted cellular response to heat stress. Genes & Dev., July 1, 2006; 20(13): 1776 - 1789. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. J. Gordon, M. W. Towsey, J. M. Hogan, S. A. Mathews, and P. Timms Improved prediction of bacterial transcription start sites Bioinformatics, January 15, 2006; 22(2): 142 - 148. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. de la Grange, M. Dutertre, N. Martin, and D. Auboeuf FAST DB: a website resource for the study of the expression regulation of human gene products Nucleic Acids Res., July 28, 2005; 33(13): 4276 - 4284. [Abstract] [Full Text] [PDF] |
||||




