Bioinformatics Advance Access published online on June 16, 2005
Bioinformatics, doi:10.1093/bioinformatics/bti548
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 Mullard Space Science Laboratory, Holmbury St. Mary, Dorking, Surrey RH5 6NT, UK
* To whom correspondence should be addressed.
Motivation: At present, mapping of sequence identifiers across databases is a daunting, time-consuming and computationally expensive process, usually achieved by sequence similarity searches with strict threshold values. Summary: We present a rapid and efficient method to map sequence identifiers across databases. The method uses the MD5 checksum algorithm for message integrity to generate sequence fingerprints and uses these fingerprints as hash strings to map sequences across databases. The program, called MagicMatch, is able to cross-link any of the major sequence databases within a few seconds on a modest desktop computer. Availability: MagicMatch is available at the following URL: http://cgg.ebi.ac.uk/services/magicmatch/, including an interactive service for major databases and binary downloads for widely used platforms.
Received February 8, 2005
Revised June 13, 2005
Accepted June 15, 2005
Applications note
MagicMatch - cross-referencing sequence identifiers across databases
2 Microbial Ecology Program, DoE Joint Genome Institute, 2800 Mitchell Drive, Bldg 400-404, Walnut Creek, CA 94598, USA
3 Computational Genomics Group, The European Bioinformatics Institute, EMBL Cambridge Outstation, Cambridge CB10 1SD, UK
4 Wellcome Trust Sanger Institute, Cambridge CB10 1SA, UK
Christos A. Ouzounis, E-mail: ouzounis{at}ebi.ac.uk
![]()
Abstract ![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
M. D. McDowall, M. S. Scott, and G. J. Barton PIPs: human protein-protein interaction prediction database Nucleic Acids Res., January 1, 2009; 37(suppl_1): D651 - D656. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. D. Karp, C. A. Ouzounis, C. Moore-Kochlacs, L. Goldovsky, P. Kaipa, D. Ahren, S. Tsoka, N. Darzentas, V. Kunin, and N. Lopez-Bigas Expansion of the BioCyc collection of pathway/genome databases to 160 genomes Nucleic Acids Res., October 24, 2005; 33(19): 6083 - 6089. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Goldovsky, P. Janssen, D. Ahren, B. Audit, I. Cases, N. Darzentas, A. J. Enright, N. Lopez-Bigas, J. M. Peregrin-Alvarez, M. Smith, et al. CoGenT++: an extensive and extensible data environment for computational genomics Bioinformatics, October 1, 2005; 21(19): 3806 - 3810. [Abstract] [Full Text] [PDF] |
||||

