Bioinformatics Vol. 17 no. 90001 2001
Pages S90-S96
© 2001 Oxford University Press
Promoter prediction in the human genome
Informatics Research, Celera Genomics, 45 West Gude Drive, Rockville, MD-20850, USA
Received on February 5, 2001
; revised on March 29, 2001
; accepted on March 29, 2001
Computational prediction of eukaryotic polII promoters has been one of the most elusive problems despite considerable effort devoted to the study. Researchers have looked for various types of signals around the transcriptional start site (TSS), viz. oligo-nucleotide statistics, potential binding sites for core factors, clusters of binding sites, proximity to CpG islands etc.. The proximity of CpG islands to gene starts is now a well established fact, although until recently, it was based on very little genomic data. In this work we explore the possibility of enhancing the promoter prediction accuracy by combining CpG island information with a few other, biologically motivated, seemingly independent signals, that cover most of the known knowledge. We benchmarked the method on a much larger genomic datasets compared to previous studies. We were able to improve slightly upon current prediction accuracy. Furthermore, we observe that CpG islands are the most dominant signals and the other signals do not improve the prediction. This suggests that the computational prediction of promoters for genes with no associated CpG-island (typically having tissue-specific expression) looking only at the immediate neighborhood of the TSS may not even be possible. We suggest some biological experiments and studies to better understand the biology of transcription.
Contact: Sridhar.Hannenhalli{at}celera.com; Samuel.Levy{at}celera.com
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
T. Abeel, Y. Saeys, E. Bonnet, P. Rouze, and Y. Van de Peer Generic eukaryotic core promoter prediction using structural features of DNA Genome Res., February 1, 2008; 18(2): 310 - 323. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. C. Frith, J. Ponjavic, D. Fredman, C. Kai, J. Kawai, P. Carninci, Y. Hayshizaki, and A. Sandelin Evolutionary turnover of mammalian transcription start sites Genome Res., June 1, 2006; 16(6): 713 - 722. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Zhang and F. S. Dietrich Mapping of transcription start sites in Saccharomyces cerevisiae using 5' SAGE Nucleic Acids Res., May 19, 2005; 33(9): 2838 - 2851. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. B. Bajic and S. H. Seah Dragon Gene Start Finder: An Advanced System for Finding Approximate Locations of the Start of Gene Transcriptional Units Genome Res., August 1, 2003; 13(8): 1923 - 1929. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. S. Halees, D. Leyfer, and Z. Weng PromoSer: a large-scale mammalian promoter and transcription start site identification service Nucleic Acids Res., July 1, 2003; 31(13): 3554 - 3559. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. B. Bajic and S. H. Seah Dragon Gene Start Finder identifies approximate locations of the 5' ends of genes Nucleic Acids Res., July 1, 2003; 31(13): 3560 - 3563. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Rombauts, K. Florquin, M. Lescot, K. Marchal, P. Rouze, and Y. Van de Peer Computational Approaches to Identify Promoters and cis-Regulatory Elements in Plant Genomes Plant Physiology, July 1, 2003; 132(3): 1162 - 1176. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. S. Weinmann, P. S. Yan, M. J. Oberley, T. H.-M. Huang, and P. J. Farnham Isolating human transcription factor targets by coupling chromatin immunoprecipitation and CpG island microarray analysis Genes & Dev., January 15, 2002; 16(2): 235 - 244. [Abstract] [Full Text] [PDF] |
||||



