Bioinformatics Advance Access originally published online on October 28, 2004
Bioinformatics 2005 21(7):1062-1068; doi:10.1093/bioinformatics/bti094
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Extracting relations between promoter sequences and their strengths from microarray data
1Graduate School of Information Sciences, Nara Institute of Science and Technology 8916-5 Takayama-cho, Ikoma, Nara 630-0192, Japan
2Computational Biology Research Center, The National Institute of Advanced Industrial Science and Technology Aomi Frontier Building 2-43 Aomi, 17F, Koto-ku, Tokyo 135-0064, Japan
3Department of Computational Biology, Faculty of Frontier Science, The University of Tokyo 5-1-5 Kashiwanoha, Kashiwa, Chiba 277-8561, Japan
*To whom correspondence should be addressed.
Motivation: The relations between the promoter sequences and their strengths were extensively studied in the 1980s. Although these studies uncovered strong sequence-strength correlations, the cost of their elaborate experimental methods have been too high to be applied to a large number of promoters. On the contrary, a recent increase in the microarray data allows us to compare thousands of gene expressions with their DNA sequences.
Results: We studied the relations between the promoter sequences and their strengths using the Escherichia coli microarray data. We modeled those relations using a simple weight matrix, which was optimized with a novel support vector regression method. It was observed that several non-consensus bases in the 35 and 10 regions of promoter sequences act positively on the promoter strength and that certain consensus bases have a minor effect on the strength. We analyzed outliers for which the observed gene expressions deviate from the promoter strength predictions, and identified several genes with enhanced expressions due to multiple promoters and genes under strong regulation by transcription factors. Our method is applicable to other procaryotes for which both the promoter sequences and the microarray data are available.
Contact: hisano-k{at}is.aist-nara.ac.jp
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
J. Weindl, P. Hanus, Z. Dawy, J. Zech, J. Hagenauer, and J. C. Mueller Modeling DNA-binding of Escherichia coli {sigma}70 exhibits a characteristic energy landscape around strong promoters Nucleic Acids Res., November 29, 2007; 35(20): 7003 - 7010. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. S. Rani, S. D. Bhavani, and R. S. Bapi Analysis of E.coli promoter recognition problem in dinucleotide feature space Bioinformatics, March 1, 2007; 23(5): 582 - 588. [Abstract] [Full Text] [PDF] |
||||

