Bioinformatics Advance Access published online on September 25, 2006
Bioinformatics, doi:10.1093/bioinformatics/btl482
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 Department of Electronic Engineering, City University of Hong Kong, Hong Kong; Department of Electronic Engineering, Tsinghua University, Beijing, China
* To whom correspondence should be addressed.
Motivation: Promoter prediction is important for the analysis of gene regulations. Although a number of promoter prediction algorithms have been reported in literature, significant improvement in prediction accuracy remains a challenge. In this paper, an effective promoter identification algorithm, which is called PromoterExplorer, is proposed. In our approach, we analyze the different roles of various features, i.e. local distribution of pentamers, positional CpG island features and digitized DNA sequence, and then combine them to build a high-dimensional input vector. A cascade AdaBoost-based learning procedure is adopted to select the most "informative" or "discriminating" features to build a sequence of weak classifiers, which are combined to form a strong classifier so as to achieve a better performance. The cascade structure used for identification can also reduce the false positive. Results: PromoterExplorer is tested based on large-scale DNA sequences from different databases, including the EPD, DBTSS, Genbank and human chromosome 22. Experimental results show that consistent and promising performance can be achieved.
Received July 12, 2006
Revised August 28, 2006
Accepted September 11, 2006
Article
PromoterExplorer: an effective promoter identification method based on the AdaBoost algorithm
Xudong Xie 1, Shuanhu Wu 2, Kin-Man Lam 3, and Hong Yan 4 *
2 Department of Electronic Engineering, City University of Hong Kong, Hong Kong; School of Computer Science and Technology, Yantai University, China
3 Department of Electronic and Information Engineering, The Hong Kong Polytechnic University, Hong Kong
4 Department of Electronic Engineering, City University of Hong Kong, Hong Kong; School of Electronic and Information Engineering, University of Sydney, NSW 2006, Australia
Hong Yan, E-mail: h.yan{at}cityu.edu.hk
![]()
Abstract
Associate Editor: Alex Bateman
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
J. Zeng, S. Zhu, and H. Yan Towards accurate human promoter recognition: a review of currently used sequence features and classification methods Brief Bioinform, September 1, 2009; 10(5): 498 - 508. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Abeel, Y. Van de Peer, and Y. Saeys Toward a gold standard for promoter prediction evaluation Bioinformatics, June 15, 2009; 25(12): i313 - i320. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Nanni, S. Mazzara, L. Pattini, and A. Lumini Protein classification combining surface analysis and primary structure Protein Eng. Des. Sel., April 1, 2009; 22(4): 267 - 272. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Abeel, Y. Saeys, P. Rouze, and Y. Van de Peer ProSOM: core promoter prediction based on unsupervised clustering of DNA physical profiles Bioinformatics, July 1, 2008; 24(13): i24 - i31. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Abeel, Y. Saeys, E. Bonnet, P. Rouze, and Y. Van de Peer Generic eukaryotic core promoter prediction using structural features of DNA Genome Res., February 1, 2008; 18(2): 310 - 323. [Abstract] [Full Text] [PDF] |
||||



