Bioinformatics Advance Access published online on November 5, 2004
Bioinformatics, doi:10.1093/bioinformatics/bti111
Bioinformatics © Oxford University Press 2004; all rights reserved
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 Genomics Institute of the Novartis Research Foundation, San Diego California, 92121, USA
* To whom correspondence should be addressed.
Motivation: With the emergence of genome-wide expression profiling data sets, the guilt by association (GBA) principle has been a cornerstone for deriving gene functional interpretations in silico. Given the limited success of traditional methods for producing clusters of genes with great amounts of functional similarity, new data-mining algorithms are required to fully exploit the potential of high-throughput genomic approaches. Results: Ontology-based Pattern Identification (OPI) is a novel data-mining algorithm that systematically identifies expression patterns that best represent existing knowledge of gene function. Instead of relying on a universal threshold of expression similarity to define functionally related groups of genes, OPI finds the optimal analysis settings that yield gene expression patterns and gene lists that best predict gene function using the principle of GBA. We applied OPI to a publicly available gene expression data set on the life cycle of the malarial parasite P. falciparum and systematically annotated genes for 320 functional categories based on current Gene Ontology annotations. An ontology-based hierarchical tree of the 320 categories provided a systems-wide biological view of this important malarial parasite. Availability: A web accessible P. falciparum e-annotation database containing the results of this study can be accessed online at http://carrier.gnf.org/publications/OPI.
Revised October 25, 2004
Accepted October 29, 2004
Article
In silico gene function prediction using ontology-based pattern identification
2 Department of Cell Biology, The Scripps Research Institute, La Jolla, California 92037, USA
3 Genomics Institute of the Novartis Research Foundation, San Diego California, 92121, USA; Department of Cell Biology, The Scripps Research Institute, La Jolla, California 92037, USA
Yingyao Zhou, E-mail: zhou{at}gnf.org
![]()
Abstract ![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
T. Obayashi and K. Kinoshita Rank of Correlation Coefficient as a Comparable Measure for Biological Significance of Gene Coexpression DNA Res, October 1, 2009; 16(5): 249 - 260. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Sheremet, D. Tishkovsky, F. Wolter, and M. Zakharyaschev A Logic for Concepts and Similarity J Logic Computation, June 1, 2007; 17(3): 415 - 452. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Saxena, D. Orgill, and I. Kohane Absolute enrichment: gene set enrichment analysis for homeostatic systems Nucleic Acids Res., December 2, 2006; 34(22): e151 - e151. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Huang, A. Wallqvist, and D. G. Covell Targeting changes in cancer: assessing pathway stability by comparing pathway gene expression coherence levels in tumor and normal tissues. Mol. Cancer Ther., September 1, 2006; 5(9): 2417 - 2427. [Abstract] [Full Text] [PDF] |
||||



