Bioinformatics Advance Access originally published online on May 6, 2004
Bioinformatics 2004 20(16):2597-2604; doi:10.1093/bioinformatics/bth291
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Bioinformatics vol. 20 issue 16 © Oxford University Press 2004; all rights reserved.
Distribution of information in biomedical abstracts and full-text publications
Department of Medical Informatics, Erasmus University Medical Center Rotterdam, P.O. Box 1738, 3000 DR, Rotterdam, The Netherlands
Received on November 19, 2003; revised on February 27, 2004; accepted on April 23, 2004
Advance Access Publication May 6, 2004
Motivation: Full-text documents potentially hold more information than their abstracts, but require more resources for processing. We investigated the added value of full text over abstracts in terms of information content and occurrences of gene symbolgene name combinations that can resolve gene-symbol ambiguity.
Results: We analyzed a set of 3902 biomedical full-text articles. Different keyword measures indicate that information density is highest in abstracts, but that the information coverage in full texts is much greater than in abstracts. Analysis of five different standard sections of articles shows that the highest information coverage is located in the results section. Still, 3040% of the information mentioned in each section is unique to that section. Only 30% of the gene symbols in the abstract are accompanied by their corresponding names, and a further 8% of the gene names are found in the full text. In the full text, only 18% of the gene symbols are accompanied by their gene names.
Contact: m.schuemie{at}erasmusmc.nl
* To whom correspondence should be addressed.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
J. M. Fernandez, R. Hoffmann, and A. Valencia iHOP web services Nucleic Acids Res., July 13, 2007; 35(suppl_2): W21 - W26. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. I. Torvik and N. R. Smalheiser A quantitative model for linking two disparate sets of articles in MEDLINE Bioinformatics, July 1, 2007; 23(13): 1658 - 1665. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Xu, J.-W. Fan, G. Hripcsak, E. A. Mendonca, M. Markatou, and C. Friedman Gene symbol disambiguation using knowledge-based profiles Bioinformatics, April 15, 2007; 23(8): 1015 - 1022. [Abstract] [Full Text] [PDF] |
||||

