Bioinformatics, Vol 15, 528-535, Copyright © 1999 by Oxford University Press
F Eisenhaber and P Bork
MOTIVATION: Computer-based selection of entries from sequence databases
with respect to a related functional description, e.g. with respect to a
common cellular localization or contributing to the same phenotypic
function, is a difficult task. Automatic semantic analysis of annotations
is not only hampered by incomplete functional assignments. A major problem
is that annotations are written in a rich, non- formalized language and are
meant for reading by a human expert. This person can extract from the text
considerably more information than is immediately apparent due to his
extended biological background knowledge and logical reasoning. APPROACH: A
technique of automated annotation evaluation based on a combination of
lexical analysis and the usage of biological rule libraries has been
developed. The proposed algorithm generates new functional descriptors from
the annotation of a given entry using the semantic units of the annotation
as prepositions for implications executed in accordance with the rule
library. RESULTS: The prototype of a software system, the Meta_A(nnotator)
program, is described and the results of its application to sequence
attribute assignment and sequence selection problems, such as cellular
localization and sequence domain annotation of SWISS-PROT entries, are
presented. The current software version assigns useful subcellular
localization qualifiers to approximately 88% of all SWISS-PROT entries. As
shown by demonstrative examples, the combination of sequence and annotation
analysis is a powerful approach for the detection of mutual
annotation/sequence inconsistencies. AVAILABILITY: Results for the cellular
localization assignment can be viewed at the URL http://www.bork.
embl-heidelberg.de/CELL_LOC/CELL_LOC.html.
ARTICLES
Evaluation of human-readable annotation in biomolecular sequence databases with biological rule libraries
Max-Delbruck-Centrum fur Molekulare Medizin, Robert-Rossle-Strasse 10, 13122 Berlin-Buch, Germany. Frank.Eisenhaber@embl-heidelberg.de
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
I. Roman, J. Figys, G. Steurs, and M. Zizi Hunting Interactomes of a Membrane Protein: Obtaining the Largest Set of Voltage-Dependent Anion Channel-Interacting Protein Epitopes Mol. Cell. Proteomics, September 1, 2006; 5(9): 1667 - 1680. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. S. Scott, D. Y. Thomas, and M. T. Hallett Predicting Subcellular Localization via Protein Motif Co-Occurrence Genome Res., October 1, 2004; 14(10a): 1957 - 1966. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. R. Bigelow, D. S. Petrey, J. Liu, D. Przybylski, and B. Rost Predicting transmembrane beta-barrels in proteomes Nucleic Acids Res., May 11, 2004; 32(8): 2566 - 2577. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Raychaudhuri, J. T. Chang, F. Imam, and R. B. Altman The computational analysis of scientific literature to define and recognize gene expression clusters Nucleic Acids Res., August 1, 2003; 31(15): 4553 - 4560. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Raychaudhuri, H. Schutze, and R. B. Altman Using Text Analysis to Identify Functionally Coherent Gene Groups Genome Res., October 1, 2002; 12(10): 1582 - 1590. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Mott, J. Schultz, P. Bork, and C. P. Ponting Predicting Protein Cellular Localization Using a Domain Projection Method Genome Res., August 1, 2002; 12(8): 1168 - 1174. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Raychaudhuri, J. T. Chang, P. D. Sutphin, and R. B. Altman Associating Genes with Gene Ontology Codes Using a Maximum Entropy Analysis of Biomedical Literature Genome Res., January 1, 2002; 12(1): 203 - 214. [Abstract] [Full Text] [PDF] |
||||


