Bioinformatics Advance Access first published online on November 5, 2004
This version published online on November 16, 2004
Bioinformatics, doi:10.1093/bioinformatics/bti103
Bioinformatics © Oxford University Press 2004; all rights reserved
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 Department of Physiological Chemistry, University Medical Center Utrecht, P.O. box 85060, 3508 AB Utrecht, the Netherlands
* To whom correspondence should be addressed.
Motivation: Determining gene function is an important challenge arising from the availability of whole genome sequences. Until recently, approaches based on sequence homology were the only high-throughput method for predicting gene function. Use of high-throughput generated experimental datasets for determining gene function has been limited for several reasons. Results: Here a new approach is presented for integration of high-throughput datasets, leading to prediction of function based on relationships supported by multiple types and sources of data. This is achieved with a database containing 125 different high-throughput datasets describing phenotypes, cellular localizations, protein interactions and mRNA expression levels from S. cerevisiae, using a bit-vector representation and information content based ranking. The approach takes characteristic and qualitative differences between the datasets into account, is highly flexible, efficient and scalable. Database queries result in predictions for 543 uncharacterized genes, based on multiple functional relationships each supported by at least three types of experimental data. Some of these are experimentally verified, further demonstrating their reliability. The results also generate insights into the relative merits of different data types and provide a coherent framework for functional genomic datamining. Availability: Free availability over the internet. Supplementary Information: http://www.genomics.med.uu.nl/pub/pk/comb_gen_network.
Revised October 1, 2004
Accepted October 15, 2004
Article
Predicting gene function through systematic analysis and quality assessment of high-throughput data
2 Department of Innovation Studies, Copernicus Institute, Utrecht University, Utrecht, the Netherlands
Frank C. P. Holstege, E-mail: f.c.p.holstege{at}med.uu.nl
![]()
Abstract ![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
Y. Tao, L. Sam, J. Li, C. Friedman, and Y. A. Lussier Information theory applied to the sparse gene ontology annotation network to predict novel gene function Bioinformatics, July 1, 2007; 23(13): i529 - i538. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Schlitt and A. Brazma Modelling in molecular biology: describing transcription regulatory networks at different scales Phil Trans R Soc B, March 29, 2006; 361(1467): 483 - 494. [Abstract] [Full Text] [PDF] |
||||

