Skip Navigation

Bioinformatics 2007 23(13):i41-i48; doi:10.1093/bioinformatics/btm229
This Article
Right arrow Full Text Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Google Scholar
Right arrow Articles by Baumgartner, W. A.
Right arrow Articles by Hunter, L.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Baumgartner, W. A., Jr
Right arrow Articles by Hunter, L.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© 2007 The Author(s)
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

Manual curation is not sufficient for annotation of genomic databases

William A. Baumgartner, Jr 1,*,{dagger}, K. Bretonnel Cohen 1,{dagger}, Lynne M. Fox 2, George Acquaah-Mensah 3 and Lawrence Hunter 1,*

1Center for Computational Pharmacology, University of Colorado School of Medicine, 2Denison Library, University of Colorado Health Science Center and 3Department of Pharmaceutical Sciences, Massachusetts College of Pharmacy and Health Sciences, USA

*To whom correspondence should be addressed.


   Abstract

Motivation: Knowledge base construction has been an area of intense activity and great importance in the growth of computational biology. However, there is little or no history of work on the subject of evaluation of knowledge bases, either with respect to their contents or with respect to the processes by which they are constructed. This article proposes the application of a metric from software engineering known as the found/fixed graph to the problem of evaluating the processes by which genomic knowledge bases are built, as well as the completeness of their contents.

Results: Well-understood patterns of change in the found/fixed graph are found to occur in two large publicly available knowledge bases. These patterns suggest that the current manual curation processes will take far too long to complete the annotations of even just the most important model organisms, and that at their current rate of production, they will never be sufficient for completing the annotation of all currently available proteomes.

Contact: larry.hunter{at}uchsc.edu

{dagger}The authors wish it to be known that, in their opinion, both the authors should be regarded as joint First Authors.



Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
R. Chowdhary, J. Zhang, and J. S. Liu
Bayesian inference of protein-protein interactions from biological literature
Bioinformatics, June 15, 2009; 25(12): 1536 - 1542.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.