Bioinformatics Advance Access published online on January 22, 2004
Bioinformatics, doi:10.1093/bioinformatics/btg420
Bioinformatics © Oxford University Press 2004; all rights reserved
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 Bioinformatics Unit, ISTECH Inc., #704, Hyundai Town Vill 848-1, Janghang-dong, Ilsan-gu, Goyang city, Gyunggido 411-380, Republic of Korea
* To whom correspondence should be addressed. E-mail: yskim{at}istech21.com.
Motivation: With the advent of DNA microarray technologies, the parallel quantification of genome-wide transcriptions has been a great opportunity to systematically understand the complicated biological phenomena. Amidst the enthusiastic investigations into the intricate gene expression data, clustering methods have been useful tools to uncover the meaningful patterns hidden in those data. The mathematical techniques, however, entirely based on the numerical expression data, do not show biologically relevant information on the clustering results. Result: We present a novel methodology for biological interpretation of gene clusters. Our graph theoretic algorithm extracts common biological attributes of the genes within a cluster or a group of interest through the modified structure of Gene Ontology (GO) called GO tree. After genes are annotated with GO terms, the hierarchical nature of GO terms is used to find the representative biological meanings of the gene clusters. In addition, the biological significance of gene clusters can be assessed quantitatively by defining a distance function on the GO tree. Our approach has a complementary meaning to many statistical clustering techniques; we can see clustering problems from a different viewpoint by use of biological ontology. We applied this algorithm to the well-known data set (Eisen et al., 1998) and successfully obtained the biological features of the gene clusters with the quantitative biological assessment of clustering quality through GO Biological Process. Availability: The software is available on request from the authors.
Revised June 4, 2003
Accepted August 9, 2003
Article
A graph-theoretic modeling on GO space for biological interpretation of gene clusters
![]()
Abstract ![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
J. Z. Wang, Z. Du, R. Payattakool, P. S. Yu, and C.-F. Chen A new method to measure the semantic similarity of GO terms Bioinformatics, May 15, 2007; 23(10): 1274 - 1281. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Tuikkala, L. Elo, O. S. Nevalainen, and T. Aittokallio Improving missing value estimation in microarray data with gene ontology Bioinformatics, March 1, 2006; 22(5): 566 - 572. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Mao, T. Cai, J. G. Olyarchuk, and L. Wei Automated genome annotation and pathway identification using the KEGG Orthology (KO) as a controlled vocabulary Bioinformatics, October 1, 2005; 21(19): 3787 - 3793. [Abstract] [Full Text] [PDF] |
||||
