Skip Navigation



Bioinformatics Advance Access published online on July 15, 2004

Bioinformatics, doi:10.1093/bioinformatics/bth409
Bioinformatics © Oxford University Press 2004; all rights reserved
This Article
Right arrow Advance Access manuscript (PDF) Freely available
Right arrow All Versions of this Article:
20/18/3370    most recent
bth409v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by McDonald, D. M.
Right arrow Articles by Marshall, B. B.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by McDonald, D. M.
Right arrow Articles by Marshall, B. B.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Received April 19, 2004
Revised July 4, 2004
Accepted July 5, 2004

Article

Extracting gene pathway relations using a hybrid grammar: the Arizona relation parser

D. M. McDonald 1*, H. Chen 1, H. Su 1, B. B. Marshall 1

1 Artificial Intelligence Lab, MIS Department, University of Arizona, 1130 E. Helen St., Tucson, AZ 85721, USA

* To whom correspondence should be addressed. E-mail: dmm{at}eller.arizona.edu.


   Abstract

Motivation: Text mining research in the biomedical domain has been motivated by the rapid growth of new research findings. Improving the accessibility of findings has potential to speed hypothesis generation.

Results: We present the Arizona Relation Parser that differs from other parsers in its use of a broad coverage syntax-semantic hybrid grammar. While syntax grammars have generally been tested over more documents, semantic grammars have outperformed them in precision and recall. We combined access to syntax and semantic information from a single grammar. The parser was trained using 40 PubMed abstracts and then tested using 100 unseen abstracts, half for precision and half for recall. Expert evaluation showed that the parser extracted biologically relevant relations with 89 percent precision. Recall of expert identified relations with semantic filtering was 35 percent and 61 percent before semantic filtering. Such results approach the higher-performing semantic parsers. However, the AZ parser was tested over a greater variety of writing styles and semantic content.

Availability: Relations extracted from over 600,000 PubMed abstracts are available for retrieval and visualization at http://econport.arizona.edu:8080/NetVis/index.html.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
J. Li, X. Li, H. Su, H. Chen, and D. W. Galbraith
A framework of integrating gene relations from heterogeneous data sources: an experiment on Arabidopsis thaliana
Bioinformatics, August 15, 2006; 22(16): 2037 - 2043.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.