Bioinformatics Advance Access originally published online on February 21, 2006
Bioinformatics 2006 22(8):967-973; doi:10.1093/bioinformatics/btl042
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Assessing semantic similarity measures for the characterization of human regulatory pathways
1 Windber Research Institute Windber, PA 15963, USA
2 GlaxoSmithKline Pharmaceutical R&D King of Prussia, PA 19420, USA
3 Walter Reed Army Medical Center Washington, DC 20307, USA
*To whom correspondence should be addressed.
Motivation: Pathway modeling requires the integration of multiple data including prior knowledge. In this study, we quantitatively assess the application of Gene Ontology (GO)-derived similarity measures for the characterization of direct and indirect interactions within human regulatory pathways. The characterization would help the integration of prior pathway knowledge for the modeling.
Results: Our analysis indicates information content-based measures outperform graph structure-based measures for stratifying protein interactions. Measures in terms of GO biological process and molecular function annotations can be used alone or together for the validation of protein interactions involved in the pathways. However, GO cellular component-derived measures may not have the ability to separate true positives from noise. Furthermore, we demonstrate that the functional similarity of proteins within known regulatory pathways decays rapidly as the path length between two proteins increases. Several logistic regression models are built to estimate the confidence of both direct and indirect interactions within a pathway, which may be used to score putative pathways inferred from a scaffold of molecular interactions.
Contact: s.guo{at}wriwindber.org
Received on September 20, 2005; revised on January 16, 2006; accepted on February 3, 2006
This article has been cited by other articles:
![]() |
C. Li, X. Li, Y. Miao, Q. Wang, W. Jiang, C. Xu, J. Li, J. Han, F. Zhang, B. Gong, et al. SubpathwayMiner: a software package for flexible identification of pathways Nucleic Acids Res., October 1, 2009; 37(19): e131 - e131. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Boden and T. L. Bailey Associating transcription factor-binding site motifs with target GO terms and target genes Nucleic Acids Res., July 1, 2008; 36(12): 4108 - 4117. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Schlicker and M. Albrecht FunSimMat: a comprehensive functional similarity database Nucleic Acids Res., January 11, 2008; 36(suppl_1): D434 - D439. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Yu, R. Jansen, G. Stolovitzky, and M. Gerstein Total ancestry measure: quantifying the similarity in tree-like classification, with genomic applications Bioinformatics, August 15, 2007; 23(16): 2163 - 2173. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Z. Wang, Z. Du, R. Payattakool, P. S. Yu, and C.-F. Chen A new method to measure the semantic similarity of GO terms Bioinformatics, May 15, 2007; 23(10): 1274 - 1281. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Schlicker, C. Huthmacher, F. Ramirez, T. Lengauer, and M. Albrecht Functional evaluation of domain domain interactions and human protein interaction networks Bioinformatics, April 1, 2007; 23(7): 859 - 865. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Aittokallio and B. Schwikowski Graph-based methods for analysing networks in cell biology Brief Bioinform, September 1, 2006; 7(3): 243 - 255. [Abstract] [Full Text] [PDF] |
||||


