Bioinformatics Advance Access originally published online on October 18, 2006
Bioinformatics 2006 22(24):3089-3095; doi:10.1093/bioinformatics/btl534
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Building an abbreviation dictionary using a term recognition approach
1 Graduate School of Information Science and Technology, The University of Tokyo 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-8651, Japan
2 Japan Society for the Promotion of Science (JSPS), 8 Ichiban-cho, Chiyoda-ku, Tokyo Japan
3 School of Computer Science, The University of Manchester Oxford Road, Manchester, M13 9PL, UK
4 National Centre for Text Mining (NaCTeM), Manchester Interdisciplinary Biocentre Oxford Road, Manchester, M13 9PL, UK
*To whom correspondence should be addressed.
Motivation: Acronyms result from a highly productive type of term variation and trigger the need for an acronym dictionary to establish associations between acronyms and their expanded forms.
Results: We propose a novel method for recognizing acronym definitions in a text collection. Assuming a word sequence co-occurring frequently with a parenthetical expression to be a potential expanded form, our method identifies acronym definitions in a similar manner to the statistical term recognition task. Applied to the whole MEDLINE (7 811 582 abstracts), the implemented system extracted 886 755 acronym candidates and recognized 300 954 expanded forms in reasonable time. Our method outperformed base-line systems, achieving 99% precision and 8295% recall on our evaluation corpus that roughly emulates the whole MEDLINE.
Availability and Supplementary information: The implementations and supplementary information are available at our web site: http://www.chokkan.org/research/acromine/
Contact: okazaki{at}mi.ci.i.u-tokyo.ac.jp
Received on July 1, 2006; revised on October 10, 2006; accepted on October 12, 2006
This article has been cited by other articles:
![]() |
R. Winnenburg, T. Wachter, C. Plake, A. Doms, and M. Schroeder Facts from text: can text mining help to scale-up high-quality manual curation of gene products with ontologies? Brief Bioinform, December 6, 2008; (2008) bbn043v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. E. Crangle, J. M. Cherry, E. L. Hong, and A. Zbyslaw Mining experimental evidence of molecular function claims from the literature Bioinformatics, December 1, 2007; 23(23): 3232 - 3240. [Abstract] [Full Text] [PDF] |
||||

