Bioinformatics Advance Access published online on September 27, 2006
Bioinformatics, doi:10.1093/bioinformatics/btl492
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 Department of Computer Science, Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ
Motivation: The diverse microarray data sets that have become available over the past several years represent a rich opportunity and challenge for biological data mining. Many supervised and unsupervised methods have been developed for the analysis of individual microarray datasets. However, integrated analysis of multiple datasets can provide a broader insight into genetic regulation of specific biological pathways under a variety of conditions. Results: To aid in the analysis of such large compendia of microarray experiments, we present MEFIT, a scalable Bayesian framework for predicting functional relationships from integrated microarray data sets. Furthermore, MEFIT predicts these functional relationships within the context of specific biological processes. All results are provided in the context of one or more specific biological functions, which can be provided by a biologist or drawn automatically from catalogues such as the Gene Ontology. Using MEFIT, we integrated 40 S. cerevisiae microarray data sets spanning 712 unique conditions. In tests based on 110 biological functions drawn from the GO biological process ontology, MEFIT provided a 5% or greater performance increase for 54 functions, with a 5% or more decrease in performance in only 2 functions. Availability: Supplemental data, a collection of predictions made by MEFIT, and software implementing MEFIT are available online at http://function.princeton.edu/mefit/.
Received July 31, 2006
Revised September 5, 2006
Accepted September 18, 2006
Article
A scalable method for integration and functional analysis of multiple microarray data sets
Curtis Huttenhower 1, Matt Hibbs 1, Chad Myers 1, and Olga G. Troyanskaya 1
![]()
Abstract
Associate Editor: John Quackenbush
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
C. Huttenhower, M. Schroeder, M. D Chikina, and O. G. Troyanskaya The Sleipnir library for computational functional genomics Bioinformatics, July 1, 2008; 24(13): 1559 - 1561. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Huttenhower and O.G. Troyanskaya Assessing the functional structure of genomic data Bioinformatics, July 1, 2008; 24(13): i330 - i338. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Aguilar, L. Skrabanek, S. S. Gross, B. Oliva, and F. Campagne Beyond tissueInfo: functional prediction using tissue expression profile similarity searches Nucleic Acids Res., June 1, 2008; 36(11): 3728 - 3737. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Sykacek, R. Clarkson, C. Print, R. Furlong, and G. Micklem Bayesian modelling of shared gene function Bioinformatics, August 1, 2007; 23(15): 1936 - 1944. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Shi, M. Klustein, I. Simon, T. Mitchell, and Z. Bar-Joseph Continuous hidden process model for time series expression experiments Bioinformatics, July 1, 2007; 23(13): i459 - i467. [Abstract] [Full Text] [PDF] |
||||

