Bioinformatics Advance Access originally published online on January 29, 2004
Bioinformatics 20(5) © Oxford University Press 2004; all rights reserved.
Regression trees for regulatory element identification
1 Department of BioSystems, Korea Advanced Institute of Science and Technology, 373-1 Guseong-dong Yuseong-gu, Daejeon 305-701, Korea and 2 Faculty of Information Technology, Posts & Telecommunications Institute of Technology, Km 10 Nguyen Trai Road, Hatay, Vietnam
Received on July 17, 2003; revised on October 9, 2003; accepted on October 10, 2003
Advance Access Publication January 29, 2004
Motivation: The transcription of a gene is largely determined by short sequence motifs that serve as binding sites for transcription factors. Recent findings suggest direct relationships between these motifs and gene expression levels. In this work, we present a method for identifying regulatory motifs that uses tree-based techniques to recover the relationships between motifs and gene expression levels.
Results: We treat regulatory motifs and gene expression levels as predictor variables and responses, respectively, and use a regression tree model to identify the structural relationships between them. The regression tree methodology is extended to handle responses from multiple experiments by modifying the split function. The significance of regulatory elements is determined by analyzing tree structures and using a variable importance measure. When applied to two data sets of the yeast Saccharomyces cerevisiae, the method successfully identifies most of the regulatory motifs that are known to control gene transcription under the given experimental conditions, and suggests several new putative motifs. Analysis of the tree structures also reconfirms several pairs of motifs that are known to regulate gene transcription in combination.
Availability: http://if.kaist.ac.kr/~phuong/RegTree
Contact: doheon{at}kaist.ac.kr
* To whom correspondence should be addressed.
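The Results paragraph above mentions modifying the split function so that a regression tree can handle responses from multiple experiments. The abstract does not give the exact criterion, so the following is only a minimal sketch under stated assumptions: motifs are encoded as 0/1 presence indicators per gene, and a split is scored by the reduction in squared error summed over all experiments. The names `sse` and `best_split` and the toy data are hypothetical and are not taken from the paper.

```python
def sse(rows):
    """Sum of squared deviations from the per-experiment means,
    added up over all experiments (columns)."""
    if not rows:
        return 0.0
    n = len(rows)
    total = 0.0
    for column in zip(*rows):
        mean = sum(column) / n
        total += sum((value - mean) ** 2 for value in column)
    return total


def best_split(presence, expression):
    """Pick the motif whose presence/absence split most reduces the
    squared error summed over experiments (an assumed criterion).

    presence:   one list per gene of 0/1 motif indicators (hypothetical encoding)
    expression: one list per gene of expression levels, one per experiment
    Returns (motif_index, error_reduction); motif_index is None if no split helps.
    """
    parent_error = sse(expression)
    best_motif, best_gain = None, 0.0
    for j in range(len(presence[0])):
        with_motif = [y for x, y in zip(presence, expression) if x[j] == 1]
        without_motif = [y for x, y in zip(presence, expression) if x[j] != 1]
        # A split that leaves one side empty carries no information.
        if not with_motif or not without_motif:
            continue
        gain = parent_error - sse(with_motif) - sse(without_motif)
        if gain > best_gain:
            best_motif, best_gain = j, gain
    return best_motif, best_gain


# Toy data: 4 genes, 2 candidate motifs, 2 experiments.
presence = [[1, 0], [1, 1], [0, 0], [0, 1]]
expression = [[2.0, 1.0], [2.0, 1.0], [0.0, -1.0], [0.0, -1.0]]
motif, gain = best_split(presence, expression)
```

In this toy case the first motif separates the genes perfectly in both experiments, so it is selected. Accumulating such error reductions per motif over every node of a fully grown tree would give one simple form of variable importance, though the measure used in the paper may differ.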