Bioinformatics Advance Access originally published online on April 7, 2005
Bioinformatics 2005 21(19):3787-3793; doi:10.1093/bioinformatics/bti430
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Automated genome annotation and pathway identification using the KEGG Orthology (KO) as a controlled vocabulary


1Center for Bioinformatics, National Laboratory of Protein Engineering and Plant Genetic Engineering, College of Life Sciences, Peking University Beijing 100871, P.R. China
2Biomedical Informatics, Department of Medicine, Stanford University School of Medicine Stanford, CA 94305, USA
*To whom correspondence should be addressed.
Motivation: High-throughput technologies such as DNA sequencing and microarrays have created the need for automated annotation of large sets of genes, including whole genomes, and automated identification of pathways. Ontologies, such as the popular Gene Ontology (GO), provide a common controlled vocabulary for these types of automated analysis. Yet, while GO offers tremendous value, it also has certain limitations such as the lack of direct association with pathways.
Results: We demonstrated the use of the KEGG Orthology (KO), part of the KEGG suite of resources, as an alternative controlled vocabulary for automated annotation and pathway identification. We developed a KO-Based Annotation System (KOBAS) that can automatically annotate a set of sequences with KO terms and identify both the most frequent and the statistically significantly enriched pathways. Results from both whole genome and microarray gene cluster annotations with KOBAS are comparable and complementary to known annotations. KOBAS is a freely available standalone Python program that can contribute significantly to genome annotation and microarray analysis.
Availability: Supplementary data and the KOBAS system are available at http://genome.cbi.pku.edu.cn/download.html
Contact: weilp{at}mail.cbi.pku.edu.cn
Received on January 18, 2005; revised on March 26, 2005; accepted on April 4, 2005
This article has been cited by other articles:
![]() |
C. Li, X. Li, Y. Miao, Q. Wang, W. Jiang, C. Xu, J. Li, J. Han, F. Zhang, B. Gong, et al. SubpathwayMiner: a software package for flexible identification of pathways Nucleic Acids Res., October 1, 2009; 37(19): e131 - e131. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Yang, L. Xu, and L. He A CitationRank algorithm inheriting Google technology designed to highlight genes responsible for serious adverse drug reaction Bioinformatics, September 1, 2009; 25(17): 2244 - 2250. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Wissler, E. Dattolo, A. D. Moore, T. B. H. Reusch, J. L. Olsen, M. Migliaccio, E. Bornberg-Bauer, and G. Procaccini Dr. Zompo: an online data repository for Zostera marina and Posidonia oceanica ESTs Database, August 4, 2009; 2009(0): bap009 - bap009. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Wei, Y. Tao, G. Liu, C. Chen, R. Luo, H. Xia, Q. Gan, H. Zeng, Z. Lu, Y. Han, et al. Inaugural Article: A transcriptomic analysis of superhybrid rice LYP9 and its parents PNAS, May 12, 2009; 106(19): 7695 - 7701. [Abstract] [Full Text] [PDF] |
||||
![]() |
C.-Y. Li, Q.-R. Liu, P.-W. Zhang, X.-M. Li, L. Wei, and G. R. Uhl OKCAM: an ontology-based, human-centered knowledgebase for cell adhesion molecules Nucleic Acids Res., January 1, 2009; 37(suppl_1): D251 - D260. [Abstract] [Full Text] [PDF] |
||||
![]() |
J.-T. Li, Y. Zhang, L. Kong, Q.-R. Liu, and L. Wei Trans-natural antisense transcripts including noncoding RNAs in 10 species: implications for expression regulation Nucleic Acids Res., September 1, 2008; 36(15): 4833 - 4844. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. May, S. Wienkoop, S. Kempa, B. Usadel, N. Christian, J. Rupprecht, J. Weiss, L. Recuenco-Munoz, O. Ebenhoh, W. Weckwerth, et al. Metabolomics- and Proteomics-Assisted Genome Annotation and Analysis of the Draft Metabolic Network of Chlamydomonas reinhardtii Genetics, May 1, 2008; 179(1): 157 - 166. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Novershtern, Z. Itzhaki, O. Manor, N. Friedman, and N. Kaminski A Functional and Regulatory Map of Asthma Am. J. Respir. Cell Mol. Biol., March 1, 2008; 38(3): 324 - 336. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Sha, D. Wu, L. Zhang, X. Chen, M. Lei, H. Sun, S. Lin, and J. Lang Differentially expressed genes in human endometrial endothelial cells derived from eutopic endometrium of patients with endometriosis compared with those from patients without endometriosis Hum. Reprod., December 1, 2007; 22(12): 3159 - 3169. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. T. Tran, P. Dam, Z. Su, F. L. Poole II, M. W. W. Adams, G. T. Zhou, and Y. Xu Operon prediction in Pyrococcus furiosus Nucleic Acids Res., January 12, 2007; 35(1): 11 - 20. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Zhang, Y. Zhang, H. Zheng, C. Zhang, W. Xiong, J. G. Olyarchuk, M. Walker, W. Xu, M. Zhao, S. Zhao, et al. SynDB: a Synapse protein DataBase based on synapse ontology Nucleic Acids Res., January 12, 2007; 35(suppl_1): D737 - D741. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Li, L. Wu, and Z. Zhang Constructing biological networks through combined literature mining and microarray analysis: a LMMA approach Bioinformatics, September 1, 2006; 22(17): 2143 - 2150. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Zhang, X. S. Liu, Q.-R. Liu, and L. Wei Genome-wide in silico identification and analysis of cis natural antisense transcripts (cis-NATs) in ten species Nucleic Acids Res., July 18, 2006; 34(12): 3465 - 3475. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Wu, X. Mao, T. Cai, J. Luo, and L. Wei KOBAS server: a web-based platform for automated annotation and pathway identification. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W720 - W724. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y.-H. Shi, S.-W. Zhu, X.-Z. Mao, J.-X. Feng, Y.-M. Qin, L. Zhang, J. Cheng, L.-P. Wei, Z.-Y. Wang, and Y.-X. Zhu Transcriptome Profiling, Molecular Biological, and Physiological Studies Reveal a Major Role for Ethylene in Cotton Fiber Cell Elongation PLANT CELL, March 1, 2006; 18(3): 651 - 664. [Abstract] [Full Text] [PDF] |
||||







