Skip Navigation


Bioinformatics Advance Access originally published online on February 24, 2006
Bioinformatics 2006 22(9):1122-1129; doi:10.1093/bioinformatics/btl060
This Article
Right arrow Full Text Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow Supplementary Data
Right arrow All Versions of this Article:
22/9/1122    most recent
btl060v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (27)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Prelic, A.
Right arrow Articles by Zitzler, E.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Prelic, A.
Right arrow Articles by Zitzler, E.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2006. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org

A systematic comparison and evaluation of biclustering methods for gene expression data

Amela Prelic 1, Stefan Bleuler 1,*, Philip Zimmermann 2, Anja Wille 3,4, Peter Bühlmann 4, Wilhelm Gruissem 2, Lars Hennig 2, Lothar Thiele 1 and Eckart Zitzler 1

1 Computer Engineering and Networks Laboratory ETH Zurich, 8092 Zurich, Switzerland
2 Institute for Plant Sciences and Functional Genomics Center Zurich ETH Zurich, 8092 Zurich, Switzerland
3 Colab ETH Zurich, 8092 Zurich, Switzerland
4 Seminar for Statistics ETH Zurich, 8092 Zurich, Switzerland

*To whom correspondence should be addressed.

Motivation: In recent years, there have been various efforts to overcome the limitations of standard clustering approaches for the analysis of gene expression data by grouping genes and samples simultaneously. The underlying concept, which is often referred to as biclustering, allows to identify sets of genes sharing compatible expression patterns across subsets of samples, and its usefulness has been demonstrated for different organisms and datasets. Several biclustering methods have been proposed in the literature; however, it is not clear how the different techniques compare with each other with respect to the biological relevance of the clusters as well as with other characteristics such as robustness and sensitivity to noise. Accordingly, no guidelines concerning the choice of the biclustering method are currently available.

Results: First, this paper provides a methodology for comparing and validating biclustering methods that includes a simple binary reference model. Although this model captures the essential features of most biclustering approaches, it is still simple enough to exactly determine all optimal groupings; to this end, we propose a fast divide-and-conquer algorithm (Bimax). Second, we evaluate the performance of five salient biclustering algorithms together with the reference model and a hierarchical clustering method on various synthetic and real datasets for Saccharomyces cerevisiae and Arabidopsis thaliana. The comparison reveals that (1) biclustering in general has advantages over a conventional hierarchical clustering approach, (2) there are considerable performance differences between the tested methods and (3) already the simple reference model delivers relevant patterns within all considered settings.

Availability: The datasets used, the outcomes of the biclustering algorithms and the Bimax implementation for the reference model are available at http://www.tik.ee.ethz.ch/sop/bimax

Contact: bleuler{at}tik.ee.ethz.ch

Supplementary information: Supplementary data are available at http://www.tik.ee.ethz.ch/sop/bimax


Received on July 27, 2005; revised on January 4, 2006; accepted on February 15, 2006

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
A. Bhattacharya and R. K. De
Bi-correlation clustering algorithm for determining a set of co-regulated genes
Bioinformatics, November 1, 2009; 25(21): 2795 - 2801.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
T. Zeng and J. Li
Maximization of negative correlations in time-course gene expression data for enhancing understanding of molecular pathways
Nucleic Acids Res., October 23, 2009; (2009) gkp822v1.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
K. Kinoshita and T. Obayashi
Multi-dimensional correlations for gene coexpression and application to the large-scale data of Arabidopsis
Bioinformatics, October 15, 2009; 25(20): 2677 - 2684.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
M. Liu, X.-w. Chen, and R. Jothi
Knowledge-guided inference of domain-domain interactions from incomplete protein-protein interaction networks
Bioinformatics, October 1, 2009; 25(19): 2492 - 2499.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
G. Li, Q. Ma, H. Tang, A. H. Paterson, and Y. Xu
QUBIC: a qualitative biclustering algorithm for analyses of gene expression data
Nucleic Acids Res., August 1, 2009; 37(15): e101 - e101.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. Meng, S.-J. Gao, and Y. Huang
Enrichment constrained time-dependent clustering analysis for finding meaningful temporal transcription modules
Bioinformatics, June 15, 2009; 25(12): 1521 - 1527.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
L. French, S. Lane, T. Law, L. Xu, and P. Pavlidis
Application and evaluation of automated semantic annotation of gene expression experiments
Bioinformatics, June 15, 2009; 25(12): 1543 - 1549.
[Abstract] [Full Text] [PDF]


Home page
Plant CellHome page
B. Fode, T. Siemsen, C. Thurow, R. Weigel, and C. Gatz
The Arabidopsis GRAS Protein SCL14 Interacts with Class II TGA Transcription Factors and Is Essential for the Activation of Stress-Inducible Promoters
PLANT CELL, November 1, 2008; 20(11): 3122 - 3135.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. Pu, K. Ronen, J. Vlasblom, J. Greenblatt, and S. J. Wodak
Local coherence in genetic interaction patterns reveals prevalent functional versatility
Bioinformatics, October 15, 2008; 24(20): 2376 - 2383.
[Abstract] [Full Text] [PDF]


Home page
Brief Funct Genomic ProteomicHome page
A. Krishnan and A. Pereira
Integrative approaches for mining transcriptional regulatory programs in Arabidopsis
Brief Funct Genomic Proteomic, July 16, 2008; (2008) eln035v1.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
A. Bhattacharya and R. K. De
Divisive Correlation Clustering Algorithm (DCCA) for grouping of genes: detecting varying patterns in expression profiles
Bioinformatics, June 1, 2008; 24(11): 1359 - 1366.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
R. Santamaria, R. Theron, and L. Quintales
BicOverlapper: A tool for bicluster visualization
Bioinformatics, May 1, 2008; 24(9): 1212 - 1213.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
K. Horan, C. Jang, J. Bailey-Serres, R. Mittler, C. Shelton, J. F. Harper, J.-K. Zhu, J. C. Cushman, M. Gollery, and T. Girke
Annotating Genes of Known and Unknown Function by Large-Scale Coexpression Analysis
Plant Physiology, May 1, 2008; 147(1): 41 - 57.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
H. Lee, S. W. Kong, and P. J. Park
Integrative analysis reveals the direct and indirect interactions between DNA copy number aberrations and gene expression changes
Bioinformatics, April 1, 2008; 24(7): 889 - 896.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. Wang, R. R. Gutell, and D. P. Miranker
Biclustering as a method for RNA local multiple sequence alignment
Bioinformatics, December 15, 2007; 23(24): 3289 - 3296.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
T. Dhollander, Q. Sheng, K. Lemmens, B. De Moor, K. Marchal, and Y. Moreau
Query-driven module discovery in microarray data
Bioinformatics, October 1, 2007; 23(19): 2573 - 2580.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
A. Buness, R. Kuner, M. Ruschhaupt, A. Poustka, H. Sultmann, and A. Tresch
Identification of aberrant chromosomal regions from gene expression microarray studies applied to human breast cancer
Bioinformatics, September 1, 2007; 23(17): 2273 - 2280.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
X. Liu and L. Wang
Computing the maximum similarity bi-clusters of gene expression data
Bioinformatics, January 1, 2007; 23(1): 50 - 56.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. Barkow, S. Bleuler, A. Prelic, P. Zimmermann, and E. Zitzler
BicAT: a biclustering analysis toolbox
Bioinformatics, May 15, 2006; 22(10): 1282 - 1283.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.