Bioinformatics Advance Access published online on May 12, 2005
Bioinformatics, doi:10.1093/bioinformatics/bti493
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 State Key Laboratory of Intelligent Technology and Systems (LITS), Department of Computer Science and Technology, Tsinghua University, Beijing, 100084, China
* To whom correspondence should be addressed.
Motivation: An enormous number of protein-protein interaction relationships are buried in millions of research articles over the years, and accumulating. Rediscovering them automatically is a challenging bioinformatics task. Solutions to this problem are also far reaching, beyond bioinformatics. Results: We study a new approach that involves automatically discovering English expression patterns, optimizing them and using them to extract protein-protein interactions. In a sister paper [Huang et al, 2004], we have described how to generate English expression patterns related to protein-protein interactions, and it alone already has achieved precision and recall rates significantly higher than other automatic systems. This paper continues to present our theory, focusing on how to improve the patterns. An MDL-based pattern optimization algorithm is designed to reduce and merge patterns. This has significantly increased generalization power, hence the recall and precision rates, as confirmed by our experiments. Availability: http://spies.cs.tsinghua.edu.cn.
Received February 21, 2005
Revised May 6, 2005
Accepted May 6, 2005
Article
Discovering patterns to extract protein-protein interactions from the literature: part II
2 School of Computer Science, University of Waterloo, N2L 3G1, Canada; State Key Laboratory of Intelligent Technology and Systems (LITS), Department of Computer Science and Technology, Tsinghua University, Beijing, 100084, China
Xiaoyan Zhu, E-mail: zxy-dcs{at}tsinghua.edu.cn
![]()
Abstract ![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
J.-j. Kim and D. Rebholz-Schuhmann Categorization of services for seeking information in biomedical literature: a typology for improvement of practice Brief Bioinform, July 26, 2008; (2008) bbn032v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Cheng, C. Knox, N. Young, P. Stothard, S. Damaraju, and D. S. Wishart PolySearch: a web-based text mining system for extracting relationships between human diseases, genes, mutations, drugs and metabolites Nucleic Acids Res., July 1, 2008; 36(suppl_2): W399 - W405. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Kim, J. Yoon, and J. Yang Kernel approaches for genic interaction extraction Bioinformatics, January 1, 2008; 24(1): 118 - 126. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. F. Lopez, A. Mikulskis, S. Kuzdzal, E. Golenko, E. F. Petricoin III, L. A. Liotta, W. F. Patton, G. R. Whiteley, K. Rosenblatt, P. Gurnani, et al. A Novel, High-Throughput Workflow for Discovery and Identification of Serum Carrier Protein-Bound Peptide Biomarker Candidates in Ovarian Cancer Samples Clin. Chem., June 1, 2007; 53(6): 1067 - 1074. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Plake, T. Schiemann, M. Pankalla, J. Hakenberg, and U. Leser ALIBABA: PubMed as a graph Bioinformatics, October 1, 2006; 22(19): 2444 - 2445. [Abstract] [Full Text] [PDF] |
||||



