Bioinformatics Advance Access published online on October 10, 2006
Bioinformatics, doi:10.1093/bioinformatics/btl499
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 Institute for Genetics, University of Cologne, Zuelpicher Str. 47, 50674 Cologne, Germany; Faculty of Life Sciences, University of Manchester, Michael Smith Building, Oxford Road, M13 9PT Manchester, UK
* To whom correspondence should be addressed.
Motivation: Predicting cis-regulatory modules (CRMs) in higher eukaryotes is a challenging computational task. Commonly used methods to predict CRMs based on the signal of transcription factor binding sites (TFBS) are limited by prior information about transcription factor specificity. More general methods that bypass the reliance on TFBS models are needed for comprehensive CRM prediction. Results: We have developed a method to predict CRMs called CisPlusFinder that identifies high density regions of perfect local ungapped sequences (PLUSs) based on multiple species conservation. By assuming that PLUSs contain core TFBS motifs that are locally overrepresented, the method attempts to capture the expected features of CRM structure and evolution. Applied to a benchmark dataset of CRMs involved in early Drosophila development, CisPlusFinder predicts more annotated CRMs than all other methods tested. Using the REDfly database, we find that some "false positive" predictions in the benchmark dataset correspond to recently annotated CRMs. Our work demonstrates that CRM prediction methods that combine comparative genomic data with statistical properties of DNA may achieve reasonable performance when applied genome-wide in the absence of an a priori set of known TFBS motifs. Availability: The program CisPlusFinder can be downloaded at http://jakob.genetik.uni-koeln.de/bioinformatik/people/nora/nora.html. All software is licensed under the Lesser GNU Public License (LGPL).
Received June 2, 2006
Revised September 8, 2006
Accepted September 28, 2006
Article
Identifying cis-regulatory modules by combining comparative and compositional analysis of DNA
Nora Pierstorff 1 *, Casey M. Bergman 2, and Thomas Wiehe 3
2 Faculty of Life Sciences, University of Manchester, Michael Smith Building, Oxford Road, M13 9PT Manchester, UK
3 Institute for Genetics, University of Cologne, Zuelpicher Str. 47, 50674 Cologne, Germany
Nora Pierstorff, E-mail: nora.pierstorff{at}uni-koeln.de
![]()
Abstract
Associate Editor: Martin Bishop
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
L. Narlikar and I. Ovcharenko Identifying regulatory elements in eukaryotic genomes Brief Funct Genomic Proteomic, July 1, 2009; 8(4): 215 - 230. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Wang, J. Gu, M. Q. Zhang, and Y. Li Identification of phylogenetically conserved microRNA cis-regulatory elements across 12 Drosophila species Bioinformatics, January 15, 2008; 24(2): 165 - 171. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. S. Halfon, S. M. Gallo, and C. M. Bergman REDfly 2.0: an integrated database of cis-regulatory modules and transcription factor binding sites in Drosophila Nucleic Acids Res., January 11, 2008; 36(suppl_1): D594 - D598. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. S.W. Wong and R. Nielsen Finding cis-regulatory modules in Drosophila using phylogenetic hidden Markov models Bioinformatics, August 15, 2007; 23(16): 2031 - 2037. [Abstract] [Full Text] [PDF] |
||||


