Bioinformatics, Vol 14, 2-13, Copyright © 1998 by Oxford University Press
BA Eckman, JS Aaronson, JA Borkowski, WJ Bailey, KO Elliston, AR Williamson and RA Blevins
MOTIVATION: To make effective use of the vast amounts of expressed sequence
tag (EST) sequence data generated by the Merck-sponsored EST project and
other similar efforts, sequences must be organized into gene classes, and
scientists must be able to 'mine' the gene class data in the context of
related genomic data. RESULTS: This paper presents the Merck Gene Index
browser, an easily extensible, World Wide Web- based system for mining the
Merck Gene Index (MGI) and related genomic data. The MGI is a non-redundant
set of clones and sequences, each representing a distinct gene, constructed
from all high-quality 3' EST sequences generated by the Merck-sponsored EST
project. The MGI browser integrates data from a variety of sources and
storage formats, both local and remote, using an eclectic integration
strategy, including a federation of relational databases, a local data
warehouse and simple hypertext links. Data currently integrated include:
LENS cDNA clone and EST data, dbEST protein and non-EST nucleic acid
similarity data, WashU sequence chromatograms. Entrez sequence and Medline
entries, and UniGene gene clusters. Flatfile sequence data are accessed
using the Bioapps server, an internally developed client-server system that
supports generic sequence analysis applications. Browser data are retrieved
and formatted by means of the Bioinformatics Data Integration Toolkit
(B-DIT), a new suite of Perl routines.
ARTICLES
The Merck Gene Index browser: an extensible data integration system for gene finding, gene characterization and EST data mining
Department of Bioinformatics, Merck Research Laboratories, West Point, PA, USA. barbara_eckman@sbphrd.com
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
I. Abdrakhmanov, D. Lodygin, P. Geroth, H. Arakawa, A. Law, J. Plachy, B. Korn, and J.-M. Buerstedde A Large Database of Chicken Bursal ESTs as a Resource for the Analysis of Vertebrate Gene Function Genome Res., December 1, 2000; 10(12): 2062 - 2069. [Abstract] [Full Text] |
||||
![]() |
J. Burke, D. Davison, and W. Hide d2_cluster: A Validated Method for Clustering EST and Full-Length cDNA Sequences Genome Res., November 1, 1999; 9(11): 1135 - 1142. [Abstract] [Full Text] |
||||
![]() |
P. Deloukas, G. D. Schuler, G. Gyapay, E. M. Beasley, C. Soderlund, P. Rodriguez-Tomé, L. Hui, T. C. Matise, K. B. McKusick, J. S. Beckmann, et al. A Physical Map of 30,000 Human Genes Science, October 23, 1998; 282(5389): 744 - 746. [Abstract] [Full Text] |
||||

