Bioinformatics Advance Access originally published online on April 23, 2009
Bioinformatics 2009 25(16):2126-2133; doi:10.1093/bioinformatics/btp278
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
KIRMES: kernel-based identification of regulatory modules in euchromatic sequences

1Friedrich Miescher Laboratory of the Max Planck Society, and 2Max Planck Institute for Developmental Biology, Tübingen, 3Department of Stem Cell Biology, University of Heidelberg and 4Wilhelm Schickard Institute for Computer Science, University of Tübingen, Germany
*To whom correspondence should be addressed.
| Abstract |
|---|
Motivation: Understanding transcriptional regulation is one of the main challenges in computational biology. An important problem is the identification of transcription factor (TF) binding sites in promoter regions of potential TF target genes. It is typically approached by position weight matrix-based motif identification algorithms using Gibbs sampling, or heuristics to extend seed oligos. Such algorithms succeed in identifying single, relatively well-conserved binding sites, but tend to fail when it comes to the identification of combinations of several degenerate binding sites, as those often found in cis-regulatory modules.
Results: We propose a new algorithm that combines the benefits of existing motif finding with the ones of support vector machines (SVMs) to find degenerate motifs in order to improve the modeling of regulatory modules. In experiments on microarray data from Arabidopsis thaliana, we were able to show that the newly developed strategy significantly improves the recognition of TF targets.
Availability: The python source code (open source-licensed under GPL), the data for the experiments and a Galaxy-based web service are available at http://www.fml.mpg.de/raetsch/suppl/kirmes/
Contact: sebi{at}tuebingen.mpg.de
Supplementary information: Supplementary data are available at Bioinformatics online.
Present Address: Biology Department Duke University Box 90338, Durham, NC 27708, USA
Received on October 31, 2008; revised on March 16, 2009; accepted on April 21, 2009