Skip Navigation


Bioinformatics Advance Access originally published online on June 10, 2009
Bioinformatics 2009 25(16):2006-2012; doi:10.1093/bioinformatics/btp359
This Article
Right arrow Full Text
Right arrow Full Text (Print PDF)
Right arrow All Versions of this Article:
25/16/2006    most recent
btp359v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Gan, Y.
Right arrow Articles by Zhou, S.
PubMed
Right arrow PubMed Citation
Right arrow Articles by Gan, Y.
Right arrow Articles by Zhou, S.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2009. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org

A pattern-based nearest neighbor search approach for promoter prediction using DNA structural profiles

Yanglan Gan 1, Jihong Guan 1,* and Shuigeng Zhou 2,3,*

1Department of Computer Science and Technology, Tongji University, Shanghai 201804, 2School of Computer Science and 3Shanghai Key Lab of Intelligent Information Processing, Fudan University, Shanghai 200433, China

*To whom correspondence should be addressed.


   Abstract

Motivation: Identification of core promoters is a key clue in understanding gene regulations. However, due to the diverse nature of promoter sequences, the accuracy of existing prediction approaches for non-CpG island (simply CGI)-related promoters is not as high as that for CGI-related promoters. This consequently leads to a low genome-wide promoter prediction accuracy.

Results: In this article, we first systematically analyze the similarities and differences between the two types of promoters (CGI- and non-CGI-related) from a novel structural perspective, and then devise a unified framework, called PNNP (Pattern-based Nearest Neighbor search for Promoter), to predict both CGI- and non-CGI-related promoters based on their structural features. Our comparative analysis on the structural characteristics of promoters reveals two interesting facts: (i) the structural values of CGI- and non-CGI-related promoters are quite different, but they exhibit nearly similar structural patterns; (ii) the structural patterns of promoters are obviously different from that of non-promoter sequences though the sequences have almost similar structural values. Extensive experiments demonstrate that the proposed PNNP approach is effective in capturing the structural patterns of promoters, and can significantly improve genome-wide performance of promoters prediction, especially non-CGI-related promoters prediction.

Availability: The implementation of the program PNNP is available at http://admis.tongji.edu.cn/Projects/pnnp.aspx.

Contact: jhguan{at}tongji.edu.cn; sgzhou{at}fudan.edu.cn

Supplementary information: Supplementary data are available at Bioinformatics online.

Associate Editor: Martin Bishop


Received on December 19, 2008; revised on June 5, 2009; accepted on June 6, 2009

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?




Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.