Bioinformatics Advance Access published online on June 30, 2005
Bioinformatics, doi:10.1093/bioinformatics/bti565
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 Department of Computer Science, Wayne State University, 431 State Hall, Detroit, MI, 48202
* To whom correspondence should be addressed.
Independent of the platform and the analysis methods used, the result of a microarray experiment is, in most cases, a list of differentially expressed genes. An automatic ontological analysis approach has been recently proposed to help with the biological interpretation of such results. Currently, this approach is the de facto standard for the secondary analysis of high throughput experiments and a large number of tools have been developed for this purpose. We present a detailed comparison of 14 such tools using the following criteria: scope of the analysis, visualization capabilities, statistical model(s) used, correction for multiple comparisons, reference microarrays available, installation issues and sources of annotation data. This detailed analysis of the capabilities of these tools will help researchers choose the most appropriate tool for a given type of analysis. More importantly, in spite of the fact that this type of analysis has been generally adopted, this approach has several important intrinsic drawbacks. These drawbacks are associated with all tools discussed and represent conceptual limitations of the current state-of-the-art in ontological analysis. We propose these as challenges for the next generation of secondary data analysis tools.
Received April 24, 2005
Revised June 17, 2005
Accepted June 28, 2005
Review
Ontological analysis of gene expression data: current tools, limitations, and open problems
Sorin Draghici, E-mail: sod{at}cs.wayne.edu
![]()
Abstract ![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
A. Schlicker and M. Albrecht FunSimMat update: new features for exploring functional similarity Nucleic Acids Res., November 18, 2009; (2009) gkp979v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. G. Soldatos, S. I. O'Donoghue, V. P. Satagopam, L. J. Jensen, N. P. Brown, A. Barbosa-Silva, and R. Schneider Martini: using literature keywords to compare gene sets Nucleic Acids Res., October 25, 2009; (2009) gkp876v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Matsumura, K. Kano, C. M. de Evsikova, J. A. Young, P. M. Nishina, J. K. Naggert, and K. Naito Transcriptome analysis reveals an unexpected role of a collagen tyrosine kinase receptor gene, Ddr2, as a regulator of ovarian function Physiol Genomics, October 1, 2009; 39(2): 120 - 129. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. L. Gold, J. C. Miecznikowski, and S. Liu Error control variability in pathway-based microarray analysis Bioinformatics, September 1, 2009; 25(17): 2216 - 2221. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Dotan-Cohen, S. Kasif, and A. A. Melkman Seeing the forest for the trees: using the Gene Ontology to restructure hierarchical clustering Bioinformatics, July 15, 2009; 25(14): 1789 - 1795. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. V. Antonov, S. Dietmann, P. Wong, D. Lutter, and H. W. Mewes GeneSet2miRNA: finding the signature of cooperative miRNA activities in the gene lists Nucleic Acids Res., July 1, 2009; 37(suppl_2): W323 - W328. [Abstract] [Full Text] [PDF] |
||||
![]() |
C.-H. Sun, M.-S. Kim, Y. Han, and G.-S. Yi COFECO: composite function annotation enriched by protein complex data Nucleic Acids Res., July 1, 2009; 37(suppl_2): W350 - W355. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Hu, J.-H. Hung, Y. Wang, Y.-C. Chang, C.-L. Huang, M. Huyck, and C. DeLisi VisANT 3.5: multi-scale network visualization, analysis and inference based on the gene ontology Nucleic Acids Res., July 1, 2009; 37(suppl_2): W115 - W121. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Nogales-Cadenas, P. Carmona-Saez, M. Vazquez, C. Vicente, X. Yang, F. Tirado, J. M. Carazo, and A. Pascual-Montano GeneCodis: interpreting gene lists through enrichment analysis and integration of diverse biological information Nucleic Acids Res., July 1, 2009; 37(suppl_2): W317 - W322. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. S. Leong and D. Kipling Text-based over-representation analysis of microarray gene lists with annotation bias Nucleic Acids Res., June 1, 2009; 37(11): e79 - e79. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Aberg, J. Fernandez-Vazquez, J. D. Cabrer-Panes, A. Sanchez, and C. Balsalobre Similar and Divergent Effects of ppGpp and DksA Deficiencies on Transcription in Escherichia coli J. Bacteriol., May 15, 2009; 191(10): 3226 - 3236. [Abstract] [Full Text] [PDF] |
||||
![]() |
C.-A. Tsai and J. J. Chen Multivariate analysis of variance test for gene set analysis Bioinformatics, April 1, 2009; 25(7): 897 - 903. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. M. Gatti, M. Sypa, I. Rusyn, F. A. Wright, and W. T. Barry SAFEGUI: resampling-based tests of categorical significance in gene expression data made easy Bioinformatics, February 15, 2009; 25(4): 541 - 542. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. A. Sartor, G. D. Leikauf, and M. Medvedovic LRpath: a logistic regression approach for identifying enriched biological groups in gene expression data Bioinformatics, January 15, 2009; 25(2): 211 - 217. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Yilmaz, P. Jonveaux, C. Bicep, L. Pierron, M. Smail-Tabbone, and M.D. Devignes Gene-disease relationship discovery based on model-driven data integration and database view definition Bioinformatics, January 15, 2009; 25(2): 230 - 236. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. W. Huang, B. T. Sherman, and R. A. Lempicki Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists Nucleic Acids Res., January 1, 2009; 37(1): 1 - 13. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. L. Tarca, S. Draghici, P. Khatri, S. S. Hassan, P. Mittal, J.-s. Kim, C. J. Kim, J. P. Kusanovic, and R. Romero A novel signaling pathway impact analysis Bioinformatics, January 1, 2009; 25(1): 75 - 82. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Chen, L. Wang, J. D. Smith, and B. Zhang Supervised principal component analysis for gene set enrichment of microarray data with continuous or survival outcomes Bioinformatics, November 1, 2008; 24(21): 2474 - 2481. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Rodenburg, A. G. Heidema, J. M. A. Boer, I. M. J. Bovee-Oudenhoven, E. J. M. Feskens, E. C. M. Mariman, and J. Keijer A framework to identify physiological responses in microarray-based gene expression studies: selection and interpretation of biologically relevant genes Physiol Genomics, October 8, 2008; 33(1): 78 - 90. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Bauer, S. Grossmann, M. Vingron, and P. N. Robinson Ontologizer 2.0--a multifunctional tool for GO term enrichment analysis and data exploration Bioinformatics, July 15, 2008; 24(14): 1650 - 1651. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Reimand, L. Tooming, H. Peterson, P. Adler, and J. Vilo GraphWeb: mining heterogeneous biological networks for gene modules with functional significance Nucleic Acids Res., July 1, 2008; 36(suppl_2): W452 - W459. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. V. Antonov, T. Schmidt, Y. Wang, and H. W. Mewes ProfCom: a web tool for profiling the complex functionality of gene groups identified from high-throughput data Nucleic Acids Res., July 1, 2008; 36(suppl_2): W347 - W351. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Al-Shahrour, J. Carbonell, P. Minguez, S. Goetz, A. Conesa, J. Tarraga, I. Medina, E. Alloza, D. Montaner, and J. Dopazo Babelomics: advanced functional profiling of transcriptomics, proteomics and genomics experiments Nucleic Acids Res., July 1, 2008; 36(suppl_2): W341 - W346. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Romero-Zaliz, C. del Val, J. P. Cobb, and I. Zwir Onto-CC: a web server for identifying Gene Ontology conceptual clusters Nucleic Acids Res., July 1, 2008; 36(suppl_2): W352 - W357. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Gotz, J. M. Garcia-Gomez, J. Terol, T. D. Williams, S. H. Nagaraj, M. J. Nueda, M. Robles, M. Talon, J. Dopazo, and A. Conesa High-throughput functional annotation and data mining with the Blast2GO suite Nucleic Acids Res., June 1, 2008; 36(10): 3420 - 3435. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Hackenberg and R. Matthiesen Annotation-Modules: a tool for finding significant combinations of multisource annotations for gene lists Bioinformatics, June 1, 2008; 24(11): 1386 - 1393. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Nam and S.-Y. Kim Gene-set approach for expression pattern analysis Brief Bioinform, May 1, 2008; 9(3): 189 - 197. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Shriner, T. M. Baye, M. A. Padilla, S. Zhang, L. K. Vaughan, and A. E. Loraine Commonality of functional annotation: a method for prioritization of candidate genes from genome-wide linkage studies Nucleic Acids Res., March 27, 2008; 36(4): e26 - e26. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. J. Goeman and U. Mansmann Multiple testing on the directed acyclic graph of gene ontology Bioinformatics, February 15, 2008; 24(4): 537 - 544. [Abstract] [Full Text] [PDF] |
||||
![]() |
A Bonnet, R Dalbies-Tran, and M A Sirard Opportunities and challenges in applying genomics to the study of oogenesis and folliculogenesis in farm animals Reproduction, February 1, 2008; 135(2): 119 - 128. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Masotti, C. Nardini, S. Rossi, E. Bonora, G. Romeo, S. Volinia, and L. Benini TOM: enhancement and extension of a tool suite for in silico approaches to multigenic hereditary disorders Bioinformatics, February 1, 2008; 24(3): 428 - 429. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Nettleton, J. Recknor, and J. M. Reecy Identification of differentially expressed gene categories in microarray studies using nonparametric multivariate analysis Bioinformatics, January 15, 2008; 24(2): 192 - 201. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Schlicker and M. Albrecht FunSimMat: a comprehensive functional similarity database Nucleic Acids Res., January 11, 2008; 36(suppl_1): D434 - D439. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. L. Rubin, N. H. Shah, and N. F. Noy Biomedical ontologies: a functional perspective Brief Bioinform, January 1, 2008; 9(1): 75 - 90. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Xu, Y. Zhao, and R. Simon Gene Set Expression Comparison kit for BRB-ArrayTools Bioinformatics, January 1, 2008; 24(1): 137 - 139. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Minguez, F. Al-Shahrour, D. Montaner, and J. Dopazo Functional profiling of microarray experiments using text-mining derived bioentities Bioinformatics, November 15, 2007; 23(22): 3098 - 3099. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Draghici, P. Khatri, A. L. Tarca, K. Amin, A. Done, C. Voichita, C. Georgescu, and R. Romero A systems biology approach for pathway level analysis Genome Res., October 1, 2007; 17(10): 1537 - 1545. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Maglietta, A. Piepoli, D. Catalano, F. Licciulli, M. Carella, S. Liuni, G. Pesole, F. Perri, and N. Ancona Statistical assessment of functional categories of genes deregulated in pathological conditions by using microarray data Bioinformatics, August 15, 2007; 23(16): 2063 - 2072. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Kuhn, D. R. Goldstein, A. Hodges, A. D. Strand, T. Sengstag, C. Kooperberg, K. Becanovic, M. A. Pouladi, K. Sathasivam, J.-H. J. Cha, et al. Mutant huntingtin's effects on striatal gene expression in mice recapitulate changes observed in human Huntington's disease brain and do not differ with mutant huntingtin length or wild-type huntingtin dosage Hum. Mol. Genet., August 1, 2007; 16(15): 1845 - 1861. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. F. Burkart, J. D. Wren, J. I. Herschkowitz, C. M. Perou, and H. R. Garner Clustering microarray-derived gene lists through implicit literature relationships Bioinformatics, August 1, 2007; 23(15): 1995 - 2003. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Al-Shahrour, P. Minguez, J. Tarraga, I. Medina, E. Alloza, D. Montaner, and J. Dopazo FatiGO +: a functional profiling tool for genomic data. Integration of functional annotation, regulatory motifs and interaction data with microarray experiments Nucleic Acids Res., July 13, 2007; 35(suppl_2): W91 - W96. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Reimand, M. Kull, H. Peterson, J. Hansen, and J. Vilo g:Profiler--a web-based toolset for functional profiling of gene lists from large-scale experiments Nucleic Acids Res., July 13, 2007; 35(suppl_2): W193 - W200. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. W. Huang, B. T. Sherman, Q. Tan, J. Kir, D. Liu, D. Bryant, Y. Guo, R. Stephens, M. W. Baseler, H. C. Lane, et al. DAVID Bioinformatics Resources: expanded annotation database and novel algorithms to better extract biology from large gene lists Nucleic Acids Res., July 13, 2007; 35(suppl_2): W169 - W175. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Khatri, C. Voichita, K. Kattan, N. Ansari, A. Khatri, C. Georgescu, A. L. Tarca, and S. Draghici Onto-Tools: new additions and improvements in 2006 Nucleic Acids Res., July 13, 2007; 35(suppl_2): W206 - W211. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Shi, M. Klustein, I. Simon, T. Mitchell, and Z. Bar-Joseph Continuous hidden process model for time series expression experiments Bioinformatics, July 1, 2007; 23(13): i459 - i467. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Huang and T. W. S. Chow Identifying the biologically relevant gene categories based on gene expression and biological data: an example on prostate cancer Bioinformatics, June 15, 2007; 23(12): 1503 - 1510. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Liu, J. M. Hughes-Oliver, and J. A. Menius Jr Domain-enhanced analysis of microarray data using GO annotations Bioinformatics, May 15, 2007; 23(10): 1225 - 1234. [Abstract] [Full Text] [PDF] |
||||
![]() |
E.-J. Blom, D. W. J. Bosman, S. A. F. T. van Hijum, R. Breitling, L. Tijsma, R. Silvis, J. B. T. M. Roerdink, and O. P. Kuipers FIVA: Functional Information Viewer and Analyzer extracting biological knowledge from transcriptome data of prokaryotes Bioinformatics, May 1, 2007; 23(9): 1161 - 1163. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. J. Goeman and P. Buhlmann Analyzing gene expression data in terms of gene sets: methodological issues Bioinformatics, April 15, 2007; 23(8): 980 - 987. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Van Vooren, B. Thienpont, B. Menten, F. Speleman, B. D. Moor, J. Vermeesch, and Y. Moreau Mapping biomedical concepts onto the human genome by mining literature on chromosomal aberrations Nucleic Acids Res., April 3, 2007; 35(8): 2533 - 2543. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Rivals, L. Personnaz, L. Taing, and M.-C. Potier Enrichment or depletion of a GO category within a class of genes: which test? Bioinformatics, February 15, 2007; 23(4): 401 - 407. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Misirlioglu, G. P. Page, H. Sagirkaya, A. Kaya, J. J. Parrish, N. L. First, and E. Memili Dynamics of global transcriptome in bovine matured oocytes and preimplantation embryos PNAS, December 12, 2006; 103(50): 18905 - 18910. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Draghici, S. Sellamuthu, and P. Khatri Babel's tower revisited: a universal resource for cross-referencing across annotation databases Bioinformatics, December 1, 2006; 22(23): 2934 - 2939. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Manoli, N. Gretz, H.-J. Grone, M. Kenzelmann, R. Eils, and B. Brors Group testing for pathway analysis improves comparability of different microarray datasets Bioinformatics, October 15, 2006; 22(20): 2500 - 2506. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Friedberg Automated protein function prediction--the genomic challenge Brief Bioinform, September 1, 2006; 7(3): 225 - 242. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Al-Shahrour, P. Minguez, J. Tarraga, D. Montaner, E. Alloza, J. M. Vaquerizas, L. Conde, C. Blaschke, J. Vera, and J. Dopazo BABELOMICS: a systems biology perspective in the functional annotation of genome-scale experiments. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W472 - W476. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Montaner, J. Tarraga, J. Huerta-Cepas, J. Burguet, J. M. Vaquerizas, L. Conde, P. Minguez, J. Vera, S. Mukherjee, J. Valls, et al. Next station in microarray data analysis: GEPAS. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W486 - W491. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Khatri, V. Desai, A. L. Tarca, S. Sellamuthu, D. E. Wildman, R. Romero, and S. Draghici New Onto-Tools: Promoter-Express, nsSNPCounter and Onto-Translate. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W626 - W631. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Alexa, J. Rahnenfuhrer, and T. Lengauer Improved scoring of functional groups from gene expression data by decorrelating GO graph structure Bioinformatics, July 1, 2006; 22(13): 1600 - 1607. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Bauer, J. D. Katzenberger, A. C. Hamm, M. Bonaus, I. Zinke, J. Jaekel, and M. J. Pankratz Purine and folate metabolism as a potential target of sex-specific nutrient allocation in Drosophila and its implication for lifespan-reproduction tradeoff Physiol Genomics, May 16, 2006; 25(3): 393 - 404. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Huang and W. Pan Incorporating biological knowledge into distance-based clustering analysis of microarray gene expression data Bioinformatics, May 15, 2006; 22(10): 1259 - 1268. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Breuiller-Fouche and G. Germain Gene and protein expression in the myometrium in pregnancy and labor. Reproduction, May 1, 2006; 131(5): 837 - 850. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Pan Incorporating gene functions as priors in model-based clustering of microarray gene expression data Bioinformatics, April 1, 2006; 22(7): 795 - 801. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. R. Boverhof and T. R. Zacharewski Toxicogenomics in Risk Assessment: Applications and Needs Toxicol. Sci., February 1, 2006; 89(2): 352 - 360. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Wu, L. Zhu, J. Guo, D.-Y. Zhang, and K. Lin Prediction of yeast protein-protein interaction network: insights from the Gene Ontology and annotations. Nucleic Acids Res., January 1, 2006; 34(7): 2137 - 2150. [Abstract] [Full Text] [PDF] |
||||
![]() |
Gene Ontology Consortium The Gene Ontology (GO) project in 2006 Nucleic Acids Res., January 1, 2006; 34(suppl_1): D322 - D326. [Abstract] [Full Text] [PDF] |
||||









