Bioinformatics Advance Access originally published online on December 6, 2005
Bioinformatics 2006 22(4):430-437; doi:10.1093/bioinformatics/bti819
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Evaluating phylogenetic footprinting for humanrodent comparisons
1Department of Bioinformatics, UKG, Georg-August-University of Goettingen Goldschmidtstrasse 1, 37077 Goettingen, Germany
2BIOBASE GmbH, Halchtersche Strasse 33, 38304 Wolfenbuettel, Germany
*To whom correspondence should be addressed.
Motivation: Phylogenetic footprinting is a widely applied approach to identify regulatory regions and potential transcription factor binding sites (TFBSs) using alignments of non-coding orthologous regions from two or more organisms. A systematic evaluation of its validity and usability based on known TFBSs is needed to use phylogenetic footprinting most effectively in the identification of unknown TFBSs.
Results: In this paper we use 2678 human, mouse and rat TFBSs from the TRANSFAC® database for this evaluation. To ensure the retrieval of correct orthologous sequences, we combine gene annotation and sequence homology searches. Demanding a sequence identity of at least 65% is most effective in discriminating TFBSs from non-functional sequence parts, while different alignment algorithms only have a minor influence on TFBS identification by humanrodent comparisons. With this threshold
72% of the known TFBSs are found conserved, a number which varies significantly between different transcription factors and also depends on the function of the regulated gene. TFBSs for certain transcription factors do not require strict sequence conservation but instead may show a high pattern conservation, limiting somewhat the validity of purely sequence-based phylogenetic footprinting.
Availability: Scripts are available from the authors upon request.
Contact: tsa{at}bioinf.med.uni-goettingen.de
Supplementary information: http://www.sybig.de/download
Received on September 27, 2005; revised on November 30, 2005; accepted on December 2, 2005
This article has been cited by other articles:
![]() |
E. Wingender The TRANSFAC project as an example of framework technology that supports the analysis of genomic regulation Brief Bioinform, April 24, 2008; (2008) bbn016v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. W. Tullai, J. Chen, M. E. Schaffer, E. Kamenetsky, S. Kasif, and G. M. Cooper Glycogen Synthase Kinase-3 Represses Cyclic AMP Response Element-binding Protein (CREB)-targeted Immediate Early Genes in Quiescent Cells J. Biol. Chem., March 30, 2007; 282(13): 9482 - 9491. [Abstract] [Full Text] [PDF] |
||||

