Bioinformatics Vol. 16 no. 2 2000
Pages 169-175
© 2000 Oxford University Press
PNAD-CSS: a workbench for constructing a protein name abbreviation dictionary
1 Human Genome Center, Institute of Medical Science, University of Tokyo, 4-6-1 Shirokane-dai, Minato-ku, Tokyo, 108-8639, Japan
Motivation: Since their initial development, integration and construction of databases for molecular-level data have progressed. Though biological molecules are related to each other and form a complex system, the information is stored in the vast archives of the literature or in diverse databases. There is no unified naming convention for biological object, and biological terms may be ambiguous or polysemic. This makes the integration and interaction of databases difficult. In order to eliminate these problems, machine-readable natural language resources appear to be quite promising. We have developed a workbench for protein name abbreviation dictionary (PNAD) building.
Results: We have developed PNAD Construction Support System (PNAD-CSS), which offers various convenient facilities to decrease the construction costs of a protein name abbreviation dictionary of which entries are collected from abstracts in biomedical papers. The system allows the users to concentrate on higher level interpretation by removing some troublesome tasks, e.g. management of abstracts, extracting protein names and their abbreviations, and so on. To extract a pair of protein names and abbreviations, we have developed a hybrid system composed of the PROPER System and the PNAD System. The PNAD System can extract the pairs from parenthetical-paraphrases involved in protein names, the PROPER System identified these pairs, with 98.95% precision, 95.56% recall and 97.58% complete precision.
Availability: PROPER System is freely available from http://www.hgc.ims.u-tokyo.ac.jp/service/tooldoc/KeX/intro.html. The other software are also available on request. Contact the authors.
Contact: mikio{at}ims.u-tokyo.ac.jp
Received on April 1, 1999
; revised on August 18, 1999
; accepted on November 30, 1999
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
W. Zhou, V. I. Torvik, and N. R. Smalheiser ADAM: another database of abbreviations in MEDLINE Bioinformatics, November 15, 2006; 22(22): 2813 - 2818. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Gaudan, H. Kirsch, and D. Rebholz-Schuhmann Resolving abbreviations to their senses in Medline Bioinformatics, September 15, 2005; 21(18): 3658 - 3664. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Ao and T. Takagi ALICE: An Algorithm to Extract Abbreviations from MEDLINE J. Am. Med. Inform. Assoc., September 1, 2005; 12(5): 576 - 586. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. D. Wren, J. T. Chang, J. Pustejovsky, E. Adar, H. R. Garner, and R. B. Altman Biomedical term mapping databases Nucleic Acids Res., January 1, 2005; 33(suppl_1): D289 - D293. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. T. Chang, H. Schutze, and R. B. Altman Creating an Online Dictionary of Abbreviations from MEDLINE J. Am. Med. Inform. Assoc., November 1, 2002; 9(6): 612 - 620. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Yu, G. Hripcsak, and C. Friedman Mapping Abbreviations to Full Forms in Biomedical Articles J. Am. Med. Inform. Assoc., May 1, 2002; 9(3): 262 - 272. [Abstract] [Full Text] [PDF] |
||||


