Skip Navigation



Bioinformatics Advance Access published online on May 8, 2007

Bioinformatics, doi:10.1093/bioinformatics/btm249
This Article
Right arrow Advance Access manuscript (PDF) Freely available
Right arrow All Versions of this Article:
23/14/1737    most recent
btm249v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Hertzberg, L.
Right arrow Articles by Domany, E.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Hertzberg, L.
Right arrow Articles by Domany, E.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author (2007). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org

STOP: searching for Transcription Factor motifs using gene expression

Libi Hertzberg 1,2, Shai Izraeli 1 and Eytan Domany 2,*

1Department of Pediatric Hemato-Oncology, The Sheba Cancer Research Center, Sheba Medical Center, Tel Hashomer Israel and Sackler School of Medicine, Tel Aviv University, Tel Aviv 69978, Israel
2Department of Physics of Complex Systems, Weizmann Institute of Science, Rehovot 76100, Israel

*To whom correspondence should be addressed. Eytan Domany, E-mail: eytan.domany{at}weizmann.ac.il


   Abstract

Motivation: Existing computational methods that identify transcription factor (TF) binding sites on a gene's promoter are plagued by significant inaccuracies. Binding of a TF to a particular sequence is assessed by comparing its similarity score, obtained from the TF's known position weight matrix (PWM), to a threshold. If the similarity score is above the threshold, the sequence is considered a putative binding site. Determining this threshold is a central part of the problem, for which no satisfactory biologically based solution exists.

Results: We present here a method that integrates gene expression data with sequence-based scoring of TF binding sites, for determining a global score threshold for each TF. We validate our method, STOP (Searching TFs Of Promoters), in several ways: 1) We calculate the average expression values of groups of human putative target genes of each TF, and compare them to similar averages derived for random gene groups. The groups of putative targets show significantly higher relative average expression. 2) We find high consistency between the induced lists of putative targets in human and in mouse. 3) The expression patterns associated with human and mouse genes (ordered by PWM scores for each TF) exhibit high similarity between human and mouse, indicating that our method has firm biological basis. 4) Comparison of results obtained by STOP and PRIMA (Elkon, et al., 2003) suggests that determining the score threshold using gene expression, as is done in STOP, is more biologically tuned.

Availability: Software package will be available for academic users upon request.

Associate Editor: Dr. Limsoon Wong


Received on December 14, 2006; revised on May 2, 2007; accepted on May 2, 2007

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?




Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.