Bioinformatics Advance Access originally published online on July 15, 2004
Bioinformatics 2004 20(18):3370-3378; doi:10.1093/bioinformatics/bth409
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Bioinformatics vol. 20 issue 18 © Oxford University Press 2004; all rights reserved.
Extracting gene pathway relations using a hybrid grammar: the Arizona Relation Parser
Artificial Intelligence Laboratory MIS Department, University of Arizona, 1130 E. Helen St, Tucson, AZ 85721, USA
Received on April 19, 2004; revised on July 4, 2004; accepted on July 5, 2004
Advance Access Publication July 15, 2004
Motivation: Text-mining research in the biomedical domain has been motivated by the rapid growth of new research findings. Improving the accessibility of findings has potential to speed hypothesis generation.
Results: We present the Arizona Relation Parser that differs from other parsers in its use of a broad coverage syntax-semantic hybrid grammar. While syntax grammars have generally been tested over more documents, semantic grammars have outperformed them in precision and recall. We combined access to syntax and semantic information from a single grammar. The parser was trained using 40 PubMed abstracts and then tested using 100 unseen abstracts, half for precision and half for recall. Expert evaluation showed that the parser extracted biologically relevant relations with 89% precision. Recall of expert identified relations with semantic filtering was 35 and 61% before semantic filtering. Such results approach the higher-performing semantic parsers. However, the AZ parser was tested over a greater variety of writing styles and semantic content.
Availability: Relations extracted from over 600 000 PubMed abstracts are available for retrieval and visualization at http://econport.arizona.edu:8080/NetVis/index.html
Contact: dmm{at}eller.arizona.edu
* To whom correspondence should be addressed.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
J. Li, X. Li, H. Su, H. Chen, and D. W. Galbraith A framework of integrating gene relations from heterogeneous data sources: an experiment on Arabidopsis thaliana Bioinformatics, August 15, 2006; 22(16): 2037 - 2043. [Abstract] [Full Text] [PDF] |
||||
