Bioinformatics Advance Access originally published online on September 5, 2006
Bioinformatics 2006 22(22):2800-2805; doi:10.1093/bioinformatics/btl467
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Discovering disease-genes by topological features in human proteinprotein interaction network
Department of Bioinformatics, Harbin Medical University Harbin 150086, China
*To whom correspondence should be addressed.
Motivation: Mining the hereditary disease-genes from human genome is one of the most important tasks in bioinformatics research. A variety of sequence features and functional similarities between known human hereditary disease-genes and those not known to be involved in disease have been systematically examined and efficient classifiers have been constructed based on the identified common patterns. The availability of human genome-wide proteinprotein interactions (PPIs) provides us with new opportunity for discovering hereditary disease-genes by topological features in PPIs network.
Results: This analysis reveals that the hereditary disease-genes ascertained from OMIM in the literature-curated (LC) PPIs network are characterized by a larger degree, tendency to interact with other disease-genes, more common neighbors and quick communication to each other whereas those properties could not be detected from the network identified from high-throughput yeast two-hybrid mapping approach (EXP) and predicted interactions (PDT) PPIs network. KNN classifier based on those features was created and on average gained overall prediction accuracy of 0.76 in cross-validation test. Then the classifier was applied to 5262 genes on human genome and predicted 178 novel disease-genes. Some of the predictions have been validated by biological experiments.
Contact: jianzxu{at}hotmail.com
Supplementary information: Supplementary data are available at Bioinformatics online.
Received on June 6, 2006; revised on August 3, 2006; accepted on August 29, 2006
This article has been cited by other articles:
![]() |
J. Sun, P. Jia, A. H. Fanous, B. T. Webb, E. J.C.G. van den Oord, X. Chen, J. Bukszar, K. S. Kendler, and Z. Zhao A multi-dimensional evidence-based candidate gene prioritization approach for complex diseases-schizophrenia as a case Bioinformatics, October 1, 2009; 25(19): 2595 - 6602. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Chen, E. E. Bardes, B. J. Aronow, and A. G. Jegga ToppGene Suite for gene list enrichment analysis and candidate gene prioritization Nucleic Acids Res., July 1, 2009; 37(suppl_2): W305 - W311. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Yilmaz, P. Jonveaux, C. Bicep, L. Pierron, M. Smail-Tabbone, and M.D. Devignes Gene-disease relationship discovery based on model-driven data integration and database view definition Bioinformatics, January 15, 2009; 25(2): 230 - 236. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. G. Kann Protein interactions and disease: computational approaches to uncover the etiology of diseases Brief Bioinform, September 1, 2007; 8(5): 333 - 346. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Guo, Y. Li, X. Gong, C. Yao, W. Ma, D. Wang, Y. Li, J. Zhu, M. Zhang, D. Yang, et al. Edge-based scoring and searching method for identifying condition-responsive protein protein interaction sub-network Bioinformatics, August 15, 2007; 23(16): 2121 - 2128. [Abstract] [Full Text] [PDF] |
||||
![]() |
K.-I. Goh, M. E. Cusick, D. Valle, B. Childs, M. Vidal, and A.-L. Barabasi The human disease network PNAS, May 22, 2007; 104(21): 8685 - 8690. [Abstract] [Full Text] [PDF] |
||||



