Skip Navigation

Bioinformatics 2007 23(13):i264-i272; doi:10.1093/bioinformatics/btm196
This Article
Right arrow Full Text Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Google Scholar
Right arrow Articles by Kolárik, C.
Right arrow Articles by Fluck, J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Kolárik, C.
Right arrow Articles by Fluck, J.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© 2007 The Author(s)
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

Identification of new drug classification terms in textual resources

Corinna Kolárik 1,*, Martin Hofmann-Apitius 1,2, Marc Zimmermann 1 and Juliane Fluck 1

1Fraunhofer Institute SCAI, Schloß Birlinghoven, 53754 Sankt Augustin and 2Bonn-Aachen International Center for Information Technology (B-IT), University of Bonn, Dahlmannstrasse 2, 53113 Bonn, Germany

*To whom correspondence should be addressed.


   Abstract

Knowledge about biological effects of small molecules helps in the understanding of biological processes and supports the development of new therapeutic agents. DrugBank is a high quality database providing such information about drugs that contains annotation of drug effects and classification of therapeutic effects. However, to broaden the scope of such a database in classifying and annotating drugs, systems for automatic extraction of classification terms and the corresponding annotation of drugs are needed. We have developed an approach for the identification of new terms used in unstructured text that provide information about drug properties. It is based on the identification and extraction of phrases corresponding to lexico-syntactic patterns - so-called Hearst patterns that contain drug names and directly related drug annotation terms. Such phrases could be identified with a high performance in DrugBank text (0.89 F-score) and in Medline abstracts (0.83 F-score). In comparison to DrugBank annotation terminology, a huge amount of new drug annotation terms could be found. The evaluation of terms extracted from Medline showed that 29–53% of them are new valid drug property terms. They could be assigned to existing and new drug property classes not provided by the DrugBank drug annotation. We come to the conclusion that our system can support database content update by providing additionally drug descriptions of pharmacological effects not yet found in databases like DrugBank. Moreover, we propose that automatic normalization of terms improves the annotation and the retrieval of relevant database entries.

Contact: corinna.kolarik{at}scai.fraunhofer.de

Supplementary information: Supplementary data are available at Bioinformatics online.



Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Brief BioinformHome page
P. Agarwal and D. B. Searls
Literature mining in support of drug discovery
Brief Bioinform, September 27, 2008; (2008) bbn035v1.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
R. Klinger, C. Kolarik, J. Fluck, M. Hofmann-Apitius, and C. M. Friedrich
Detection of IUPAC and IUPAC-like chemical names
Bioinformatics, July 1, 2008; 24(13): i268 - i276.
[Abstract] [Full Text] [PDF]



Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.