Bioinformatics Advance Access originally published online on April 10, 2006
Bioinformatics 2006 22(13):1600-1607; doi:10.1093/bioinformatics/btl140
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Improved scoring of functional groups from gene expression data by decorrelating GO graph structure
Max-Planck-Institute for Informatics Stuhlsatzenhausweg 85, D-66123 Saarbrücken, Germany
*To whom correspondence should be addressed.
Motivation: The result of a typical microarray experiment is a long list of genes with corresponding expression measurements. This list is only the starting point for a meaningful biological interpretation. Modern methods identify relevant biological processes or functions from gene expression data by scoring the statistical significance of predefined functional gene groups, e.g. based on Gene Ontology (GO). We develop methods that increase the explanatory power of this approach by integrating knowledge about relationships between the GO terms into the calculation of the statistical significance.
Results: We present two novel algorithms that improve GO group scoring using the underlying GO graph topology. The algorithms are evaluated on real and simulated gene expression data. We show that both methods eliminate local dependencies between GO terms and point to relevant areas in the GO graph that remain undetected with state-of-the-art algorithms for scoring functional terms. A simulation study demonstrates that the new methods exhibit a higher level of detecting relevant biological terms than competing methods.
Availability: topgo.bioinf.mpi-inf.mpg.de
Contact: alexa{at}mpi-sb.mpg.de
Supplementary Information: Supplementary data are available at Bioinformatics online.
Received on September 28, 2005; revised on March 30, 2006; accepted on April 4, 2006
This article has been cited by other articles:
![]() |
A. V. Antonov, T. Schmidt, Y. Wang, and H. W. Mewes ProfCom: a web tool for profiling the complex functionality of gene groups identified from high-throughput data Nucleic Acids Res., July 1, 2008; 36(suppl_2): W347 - W351. [Abstract] [Full Text] [PDF] |
||||
![]() |
Q. Zheng and X.-J. Wang GOEAST: a web-based software toolkit for Gene Ontology enrichment analysis Nucleic Acids Res., July 1, 2008; 36(suppl_2): W358 - W363. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Baerenfaller, J. Grossmann, M. A. Grobei, R. Hull, M. Hirsch-Hoffmann, S. Yalovsky, P. Zimmermann, U. Grossniklaus, W. Gruissem, and S. Baginsky Genome-Scale Proteomics Reveals Arabidopsis thaliana Gene Models and Proteome Dynamics Science, May 16, 2008; 320(5878): 938 - 941. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Shriner, T. M. Baye, M. A. Padilla, S. Zhang, L. K. Vaughan, and A. E. Loraine Commonality of functional annotation: a method for prioritization of candidate genes from genome-wide linkage studies Nucleic Acids Res., March 27, 2008; 36(4): e26 - e26. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. J. Goeman and U. Mansmann Multiple testing on the directed acyclic graph of gene ontology Bioinformatics, February 15, 2008; 24(4): 537 - 544. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Cordero, M. Botta, and R. A. Calogero Microarray data analysis and mining approaches Brief Funct Genomic Proteomic, January 22, 2008; (2008) elm034v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Yang, Y. Li, H. Xiao, Q. Liu, M. Zhang, J. Zhu, W. Ma, C. Yao, J. Wang, D. Wang, et al. Gaining confidence in biological interpretation of the microarray data: the functional consistence of the significant GO categories Bioinformatics, January 15, 2008; 24(2): 265 - 271. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Schlicker and M. Albrecht FunSimMat: a comprehensive functional similarity database Nucleic Acids Res., January 11, 2008; 36(suppl_1): D434 - D439. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. F. Schwarz, O. Hadicke, J. Erdmann, A. Ziegler, D. Bayer, and S. Moller SNPtoGO: characterizing SNPs by enriched GO terms Bioinformatics, January 1, 2008; 24(1): 146 - 148. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Grossmann, S. Bauer, P. N. Robinson, and M. Vingron Improved detection of overrepresentation of Gene-Ontology annotations with parent child analysis Bioinformatics, November 15, 2007; 23(22): 3024 - 3031. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Lottaz, J. Toedling, and R. Spang Annotation-based distance measures for patient subgroup discovery in clinical microarray studies Bioinformatics, September 1, 2007; 23(17): 2256 - 2264. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Liu, J. M. Hughes-Oliver, and J. A. Menius Jr Domain-enhanced analysis of microarray data using GO annotations Bioinformatics, May 15, 2007; 23(10): 1225 - 1234. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Falcon and R. Gentleman Using GOstats to test gene lists for GO term association Bioinformatics, January 15, 2007; 23(2): 257 - 258. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. W. Kong, W. T. Pu, and P. J. Park A multivariate approach for integrating genome-wide expression data and biological knowledge Bioinformatics, October 1, 2006; 22(19): 2373 - 2380. [Abstract] [Full Text] [PDF] |
||||



