Bioinformatics Advance Access published online on November 24, 2006
Bioinformatics, doi:10.1093/bioinformatics/btl597
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 UCD Conway Institute, University College Dublin, Belfield, Dublin 4, Ireland
* To whom correspondence should be addressed.
Motivation: Microarrays are widely used to measure gene expression differences between sets of biological samples. Many of these differences will be due to differences in the activities of transcription factors. In principle, these differences can be detected by associating motifs in promoters with differences in gene expression levels between the groups. In practice, this is hard to do. Results: We combine correspondence analysis, between group analysis and co-inertia analysis to determine which motifs, from a database of promoter motifs, are strongly associated with differences in gene expression levels. Given a database of motifs and gene expression levels from a set of arrays, the method produces a ranked list of motifs associated with any specified split in the arrays. We give an example using the Gene Atlas compendium of gene expression levels for human tissues where we search for motifs that are associated with expression in central nervous system (CNS) or muscle tissues. Most of the motifs that we find are known from previous work to be strongly associated with expression in CNS or muscle. We give a second example using a published prostate cancer data set where we can simply and clearly find which transcriptional pathways are associated with differences between benign and metastatic samples. Availability: The source code is freely available upon request from the authors.
Received June 15, 2006
Revised September 12, 2006
Accepted November 21, 2006
Article
Integrating transcription factor binding site information with gene expression datasets
Ian B. Jeffery 1 *, Stephen F. Madden 1, Paul A. McGettigan 1, Guy Perrière 2, Aedín C. Culhane 3, and Desmond G. Higgins 1
2 Laboratoire de Biométrie et de Biologie Évolutive, UMR CNRS 5558, Université Claude Bernard Lyon 1, 43 blvd du 11 Novembre, 1918, 69622 Villeurbanne Cedex, France
3 Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Mayer 232, 44 Binney Street, Boston, MA 02115, USA
Ian B. Jeffery, E-mail: Ian.Jeffery{at}ucd.ie
![]()
Abstract
Associate Editor: David Rocke
![]()
CiteULike
Connotea
Del.icio.us What's this?