Bioinformatics Vol. 19 no. 18 2003
pages 2404-2412
© 2003 Oxford University Press
caCORE: A common infrastructure for cancer informatics
National Cancer Institute Center for Bioinformatics, National Institutes of Health, U.S. Department of Health and Human Services, 6116 Executive Boulevard, Suite 403, Rockville MD 20852, USA
Received on December 20, 2002; revised on April 23, 2003; accepted on June 19, 2003
Motivation: Sites with substantive bioinformatics operations are challenged to build data processing and delivery infrastructure that provides reliable access and enables data integration. Locally generated data must be processed and stored such that relationships to external data sources can be presented. Consistency and comparability across data sets require annotation with controlled vocabularies and, further, metadata standards for data representation. Programmatic access to the processed data should be supported to ensure the maximum possible value is extracted. Confronted with these challenges at the National Cancer Institute Center for Bioinformatics, we decided to develop a robust infrastructure for data management and integration that supports advanced biomedical applications.
Results: We have developed an interconnected set of software and services called caCORE. Enterprise Vocabulary Services (EVS) provide controlled vocabulary, dictionary and thesaurus services. The Cancer Data Standards Repository (caDSR) provides a metadata registry for common data elements. Cancer Bioinformatics Infrastructure Objects (caBIO) implements an object-oriented model of the biomedical domain and provides Java, Simple Object Access Protocol and HTTP-XML application programming interfaces. caCORE has been used to develop scientific applications that bring together data from distinct genomic and clinical science sources.
Availability: caCORE downloads and web interfaces can be accessed from links on the caCORE web site (http://ncicb.nci.nih.gov/core). caBIO software is distributed under an open source license that permits unrestricted academic and commercial use. Vocabulary and metadata content in the EVS and caDSR, respectively, is similarly unrestricted, and is available through web applications and FTP downloads.
Supplementary information: http://ncicb.nci.nih.gov/core/publications contains links to the caBIO 1.0 class diagram and the caCORE 1.0 Technical Guide, which provide detailed information on the present caCORE architecture, data sources and APIs. Updated information appears on a regular basis on the caCORE web site (http://ncicb.nci.nih.gov/core).
Contact: covitzp@mail.nih.gov
* To whom correspondence should be addressed.






