Bioinformatics Advance Access published online on December 17, 2004
Bioinformatics, doi:10.1093/bioinformatics/bti187
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 Agilent Laboratories, 3500 Deer Creek Road, MS 26U-16, Palo Alto, CA-94304
* To whom correspondence should be addressed.
Motivations: Technological advances in biomedical research are generating a plethora of heterogeneous data at a high rate. There is a critical need for extraction, integration and management tools for information discovery and synthesis from these heterogeneous data. Results: In this paper, we present a general architecture, called ALFA, for information extraction and representation from diverse biological data. The ALFA architecture consists of: (i) a networked, hierarchical, hyper-graph object model for representing information from heterogeneous data sources in a standardized, structured format; and (ii) a suite of integrated, interactive software tools for information extraction and representation from diverse biological data sources. As part of our research efforts to explore this space, we have currently prototyped the ALFA object model and a set of interactive software tools for searching, filtering, and extracting information from scientific text. In particular, we describe BioFerret, a metasearch tool for searching and filtering relevant information from the web, and ALFA Text Viewer, an interactive tool for userguided extraction, disambiguation, and representation of information from scientific text. We further demonstrate the potential of our tools in integrating the extracted information with experimental data and diagrammatic biological models via the common underlying ALFA representation. Vailaya et al. (2004) An architecture for biological information extraction and representation. Symposium on Applied Computing, Proceedings of the 2004 ACM symposium on Applied computing, 103-110; http://doi.acm.org/10.1145/967900.967924 Copyright 2004 Association for Computing Machinery, Inc. Reprinted by permission. Direct permission requests to permissions@acm.org
Accepted November 11, 2004
Article
An architecture for biological information extraction and representation
Aditya Vailaya, E-mail: aditya_vailaya{at}agilent.com
![]()
Abstract ![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
Y. Igarashi, E. Heureux, K. S. Doctor, P. Talwar, S. Gramatikova, K. Gramatikoff, Y. Zhang, M. Blinov, S. S. Ibragimova, S. Boyd, et al. PMAP: databases for analyzing proteolytic events and pathways Nucleic Acids Res., January 1, 2009; 37(suppl_1): D611 - D618. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Kincaid, A. Kuchinsky, and M. Creech VistaClara: an expression browser plug-in for Cytoscape Bioinformatics, September 15, 2008; 24(18): 2112 - 2114. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. X.-F. Deng, A. Tsalenko, A. Vailaya, A. Ben-Dor, R. Kundu, I. Estay, R. Tabibiazar, R. Kincaid, Z. Yakhini, L. Bruhn, et al. Differences in Vascular Bed Disease Susceptibility Reflect Differences in Gene Expression Response to Atherogenic Stimuli Circ. Res., February 3, 2006; 98(2): 200 - 208. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Y. King, R. Ferrara, R. Tabibiazar, J. M. Spin, M. M. Chen, A. Kuchinsky, A. Vailaya, R. Kincaid, A. Tsalenko, D. X.-F. Deng, et al. Pathway analysis of coronary atherosclerosis Physiol Genomics, September 21, 2005; 23(1): 103 - 118. [Abstract] [Full Text] [PDF] |
||||



