Text Mining
EBIMedtext crunching to gather facts for proteins from Medline
European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus Hinxton, Cambridge CB10 1SD, UK
*To whom correspondence should be addressed.
| Abstract |
|---|
Summary: To allow efficient and systematic retrieval of statements from Medline we have developed EBIMed, a service that combines document retrieval with co-occurrence-based analysis of Medline abstracts. Upon keyword query, EBIMed retrieves the abstracts from EMBL-EBI's installation of Medline and filters for sentences that contain biomedical terminology maintained in public bioinformatics resources. The extracted sentences and terminology are used to generate an overview table on proteins, Gene Ontology (GO) annotations, drugs and species used in the same biological context. All terms in retrieved abstracts and extracted sentences are linked to their entries in biomedical databases. We assessed the quality of the identification of terms and relations in the retrieved sentences. More than 90% of the protein names found indeed represented a protein. According to the analysis of four proteinprotein pairs from the Wnt pathway we estimated that 37% of the statements containing such a pair mentioned a meaningful interaction and clarified the interaction of Dkk with LRP. We conclude that EBIMed improves access to information where proteins and drugs are involved in the same biological process, e.g. statements with GO annotations of proteins, proteinprotein interactions and effects of drugs on proteins.
Availability: Available at http://www.ebi.ac.uk/Rebholz-srv/ebimed
Supplementary Data: Supplementary Data are available at Bioinformatics online.
Contact: Rebholz{at}ebi.ac.uk
This article has been cited by other articles:
![]() |
D. Cheng, C. Knox, N. Young, P. Stothard, S. Damaraju, and D. S. Wishart PolySearch: a web-based text mining system for extracting relationships between human diseases, genes, mutations, drugs and metabolites Nucleic Acids Res., May 16, 2008; (2008) gkn296v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Gladki, P. Siedlecki, S. Kaczanowski, and P. Zielenkiewicz e-LiSe--an online tool for finding needles in the '(Medline) haystack' Bioinformatics, April 15, 2008; 24(8): 1115 - 1117. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Rebholz-Schuhmann, M. Arregui, S. Gaudan, H. Kirsch, and A. Jimeno Text processing through Web services: calling Whatizit Bioinformatics, January 15, 2008; 24(2): 296 - 298. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Gunther, M. Kuhn, M. Dunkel, M. Campillos, C. Senger, E. Petsalaki, J. Ahmed, E. G. Urdiales, A. Gewiess, L. J. Jensen, et al. SuperTarget and Matador: resources for exploring drug-target relationships Nucleic Acids Res., January 11, 2008; 36(suppl_1): D919 - D922. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Zweigenbaum, D. Demner-Fushman, H. Yu, and K. B. Cohen Frontiers of biomedical text mining: current progress Brief Bioinform, October 30, 2007; (2007) bbm045v1. [Abstract] [Full Text] [PDF] |
||||


