Bioinformatics Advance Access originally published online on November 15, 2007
Bioinformatics 2008 24(2):265-271; doi:10.1093/bioinformatics/btm558
Gaining confidence in biological interpretation of the microarray data: the functional consistence of the significant GO categories



1Department of Bioinformatics, Bio-pharmaceutical Key Laboratory of Heilongjiang Province-Incubator of State Key Laboratory, Harbin Medical University, Harbin 150086 and 2Bioinformatics Centre and School of Life Science, University of Electronic Science and Technology of China, Chengdu, 610054, China
*To whom correspondence should be addressed.
| Abstract |
|---|
Motivation: In microarray studies, numerous tools are available for functional enrichment analysis based on GO categories. Most of these tools, due to their requirement of a prior threshold for designating genes as differentially expressed genes (DEGs), are categorized as threshold-dependent methods that often suffer from a major criticism on their changing results with different thresholds.
Results: In the present article, by considering the inherent correlation structure of the GO categories, a continuous measure based on semantic similarity of GO categories is proposed to investigate the functional consistence (or stability) of threshold-dependent methods. The results from several datasets show when simply counting overlapping categories between two groups, the significant category groups selected under different DEG thresholds are seemingly very different. However, based on the semantic similarity measure proposed in this article, the results are rather functionally consistent for a wide range of DEG thresholds. Moreover, we find that the functional consistence of gene lists ranked by SAM metric behaves relatively robust against changing DEG thresholds.
Availability: Source code in R is available on request from the authors.
Contact: guoz{at}ems.hrbmu.edu.cn
Supplementary information: Supplementary data are available at Bioinformatics online.
The authors wish it to be known that, in their opinion, the first three authors should be regarded as joint First Authors.
Associate Editor: Martin Bishop
Received on July 21, 2007; revised on November 3, 2007; accepted on November 4, 2007
This article has been cited by other articles:
![]() |
G. M. Shivaram, C. H. Kim, N. N. Batra, W. Yang, S. E. Harris, and C. R. Jacobs Novel early response genes in osteoblasts exposed to dynamic fluid flow Phil Trans R Soc A, February 13, 2010; 368(1912): 605 - 616. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Schlicker and M. Albrecht FunSimMat update: new features for exploring functional similarity Nucleic Acids Res., January 1, 2010; 38(suppl_1): D244 - D248. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Zhang, L. Zhang, J. Zou, C. Yao, H. Xiao, Q. Liu, J. Wang, D. Wang, C. Wang, and Z. Guo Evaluating reproducibility of differential expression discoveries in microarray studies by considering correlated molecular changes Bioinformatics, July 1, 2009; 25(13): 1662 - 1668. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. W. Huang, B. T. Sherman, and R. A. Lempicki Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists Nucleic Acids Res., January 1, 2009; 37(1): 1 - 13. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Winnenburg, T. Wachter, C. Plake, A. Doms, and M. Schroeder Facts from text: can text mining help to scale-up high-quality manual curation of gene products with ontologies? Brief Bioinform, December 6, 2008; (2008) bbn043v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Zhang, C. Yao, Z. Guo, J. Zou, L. Zhang, H. Xiao, D. Wang, D. Yang, X. Gong, J. Zhu, et al. Apparently low reproducibility of true differential expression discoveries in microarray studies Bioinformatics, September 15, 2008; 24(18): 2057 - 2063. [Abstract] [Full Text] [PDF] |
||||



