Bioinformatics Advance Access published online on January 28, 2009
Bioinformatics, doi:10.1093/bioinformatics/btp007
A novel meta-analysis method exploiting consistency of high-throughput experiments
Department of Physics, 1110 W. Green Street, University of Illinois at Urbana-Champaign, Urbana, IL 61801-3080, USA
*To whom correspondence should be addressed. Mr. Satwik Rajaram, E-mail: srajaram{at}uiuc.edu
| Abstract |
|---|
Motivation: Large-scale biological experiments provide snapshots into the huge number of processes running in parallel within the organism. These processes depend on a large number of (hidden) (epi)genetic, social, environmental and other factors that are out of experimentalists' control. This makes it extremely difficult to identify the dominant processes and the elements involved in them based on a single experiment. It is therefore desirable to use multiple sets of experiments targeting the same phenomena while differing in some experimental parameters (hidden or controllable). Although such datasets are becoming increasingly common, their analysis is complicated by the fact that the various biological elements could be influenced by different sets of factors.
Results: The central hypothesis of this article is that biologically related elements and processes are affected by changes in similar ways while unrelated ones are affected differently. Thus, the relations between related elements are more consistent across experiments. The method outlined here looks for groups of elements with robust intra-group relationships in the expectation that they are related. The major groups of elements may be identified in this way. The strengths of relationships per se are not valued, just their consistency. This represents a completely novel and unutilized source of information. In the analysis of time course microarray experiments, I found cell cycle- and ribosome-related genes to be the major groups. Despite not looking for these groups in particular, the identification of these genes rivals that of methods designed specifically for this purpose.
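The idea of scoring consistency rather than strength can be illustrated with a small sketch. This is not the author's algorithm or the distributed C++ implementation. It assumes, purely for illustration, that the relationship between two elements within one experiment is measured by Pearson correlation, and that inconsistency is the standard deviation of that correlation across experiments. The function names (`pearson`, `pair_consistency`) and the toy measurements for `g1`, `g2` and `g3` are hypothetical.

```python
from itertools import combinations
from math import sqrt
from statistics import mean, pstdev


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / sqrt(sxx * syy)


def pair_consistency(experiments):
    """Spread of each pair's correlation across experiments (lower = more consistent).

    experiments: list of dicts mapping element name -> list of measurements.
    Only elements measured in every experiment are compared.
    Returns {(a, b): population standard deviation of per-experiment correlations}.
    """
    names = sorted(set.intersection(*(set(e) for e in experiments)))
    return {
        (a, b): pstdev([pearson(e[a], e[b]) for e in experiments])
        for a, b in combinations(names, 2)
    }


# Hypothetical toy data: g2 tracks g1 in the same way in every experiment,
# while the relation of g3 to g1 changes from one experiment to the next.
experiments = [
    {"g1": [1, 2, 3, 4], "g2": [2, 4, 6, 8], "g3": [1, 2, 3, 4]},
    {"g1": [4, 1, 3, 2], "g2": [8, 2, 6, 4], "g3": [1, 4, 2, 3]},
    {"g1": [2, 4, 1, 3], "g2": [5, 9, 3, 7], "g3": [3, 1, 2, 4]},
]

scores = pair_consistency(experiments)
# Pairs sorted from most to least consistent across experiments.
ranked = sorted(scores, key=scores.get)
```

In this toy example the pair (`g1`, `g2`) ranks first because its correlation is the same in every experiment, whereas `g3` correlates strongly with `g1` in some experiments but with changing sign. Under the assumption made here, a pair that was weakly but stably correlated would rank just as well, since only the spread is scored and not the strength.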
Availability: A C++ implementation is available at http://www.rinst.org/ICS/ICS_Programs.tar.gz.
Contact: srajaram{at}uiuc.edu
Supplementary information: Supplementary data are available at Bioinformatics online.
Associate Editor: Dr. Joaquin Dopazo
Received on August 6, 2008; revised on November 2, 2008; accepted on January 1, 2009