A Workbench for large-scale sequence homology analysis
Sanger Centre, Hinxton Hail, Cambridge CB10 1RQ, UK and MRC Laboratory of Molecular Biology Hills Road, Cambridge CB2 2QH, UK
When routinely analysing very long stretches of DNA sequences produced by genome sequencing projects, detailed analysis of database search results becomes exceedingly time consuming. To reduce the tedious browsing of large quantities of protein similarities, two programs, MSPcrunch and Blixem, were developed, which assist in processing the results from the database search programs in the BLAST suite. MSPcrunch removes biased composition and redundant matches while keeping weak matches that are consistent with a larger gapped alignment. This makes BLASTsearching in practice more sensitive and reduces the risk of overlooking distant similarities. Blixem is a multiple sequence alignment viewer for X-windows which makes it significantly easier to scan and evaluate the matches ratified by MSPcrunch. In Blixem, matches to the translated DNA query sequence are simultaneously aligned in three frames. Also, the distribution of matches over the whole DNA query is displayed. Examples of usage are drawn from 36 C.elegans cosmid clones totalling 1.2 megabases, to which these tools were applied.
This article has been cited by other articles:
![]() |
A. Klevan, N. J. Tourasse, F. B. Stabell, A.-B. Kolsto, and O. A. Okstad Exploring the evolution of the Bacillus cereus group repeat element bcr1 by comparative genome analysis of closely related strains Microbiology, November 1, 2007; 153(11): 3894 - 3908. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. C. Abbott, D. M. Aanensen, K. Rutherford, S. Butcher, and B. G. Spratt WebACT--an online companion for the Artemis Comparison Tool Bioinformatics, September 15, 2005; 21(18): 3665 - 3666. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Amadio, M. I. Romano, F. Bigi, I. Etchechoury, T. Kubica, S. Niemann, A. Cataldi, and K. Caimi Identification and Characterization of Genomic Variations between Mycobacterium bovis and M. tuberculosis H37Rv J. Clin. Microbiol., May 1, 2005; 43(5): 2481 - 2484. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. M. Laidlaw and M. A. Skinner Comparison of the genome sequence of FP9, an attenuated, tissue culture-adapted European strain of Fowlpox virus, with those of virulent American and European viruses J. Gen. Virol., February 1, 2004; 85(2): 305 - 322. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. S. Clark, Y. J.K. Edwards, D. Peterson, S. W. Clifton, A. J. Thompson, M. Sasaki, Y. Suzuki, K. Kikuchi, S. Watabe, K. Kawakami, et al. Fugu ESTs: New Resources for Transcription Analysis and Genome Annotation Genome Res., December 1, 2003; 13(12): 2747 - 2753. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Papazisi, T. S. Gorton, G. Kutish, P. F. Markham, G. F. Browning, D. K. Nguyen, S. Swartzell, A. Madan, G. Mahairas, and S. J. Geary The complete genome sequence of the avian pathogen Mycoplasma gallisepticum strain Rlow Microbiology, September 1, 2003; 149(9): 2307 - 2316. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. D. Pleasance, M. A. Marra, and S. J.M. Jones Assessment of SAGE in Transcript Identification Genome Res., June 1, 2003; 13(6): 1203 - 1215. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. M. Lee and E. L.L. Sonnhammer Genomic Gene Clustering Analysis of Pathways in Eukaryotes Genome Res., May 1, 2003; 13(5): 875 - 882. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. E. Collins, M. E. Goward, C. G. Cole, L. J. Smink, E. J. Huckle, S. Knowles, J. M. Bye, D. M. Beare, and I. Dunham Reevaluating Human Gene Annotation: A Second-Generation Analysis of Chromosome 22 Genome Res., January 1, 2003; 13(1): 27 - 36. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Lindroos, S. Sigurdsson, K. Johansson, L. Ronnblom, and A.-C. Syvanen Multiplex SNP genotyping in pooled DNA samples by a four-colour microarray system Nucleic Acids Res., July 15, 2002; 30(14): e70 - e70. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Friedman and A. L. Hughes Gene Duplication and the Structure of Eukaryotic Genomes Genome Res., March 1, 2001; 11(3): 373 - 381. [Abstract] [Full Text] |
||||
![]() |
R. Sudbrak, G. Wieczorek, U. A. Nuber, W. Mann, R. Kirchner, F. Erdogan, C. J. Brown, D. Wohrle, P. Sterk, V. M. Kalscheuer, et al. X chromosome-specific cDNA arrays: identification of genes that escape from X-inactivation and other applications Hum. Mol. Genet., January 1, 2001; 10(1): 77 - 83. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. P. G. D. C. PlasmoDB: An integrative database of the Plasmodium falciparum genome. Tools for accessing and analyzing finished and unfinished sequence data Nucleic Acids Res., January 1, 2001; 29(1): 66 - 69. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. W. Simmen and A. Bird Sequence Analysis of Transposable Elements in the Sea Squirt, Ciona intestinalis Mol. Biol. Evol., November 1, 2000; 17(11): 1685 - 1694. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Jareborg and R. Durbin Alfresco---A Workbench for Comparative Genomic Sequence Analysis Genome Res., August 1, 2000; 10(8): 1148 - 1157. [Abstract] [Full Text] |
||||
![]() |
E. R. Zabarovsky, R. Gizatullin, R. M. Podowski, V. V. Zabarovska, L. Xie, O. V. Muravenko, S. Kozyrev, L. Petrenko, N. Skobeleva, J. Li, et al. NotI clones in the analysis of the human genome Nucleic Acids Res., April 1, 2000; 28(7): 1635 - 1639. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Elgar, M. S. Clark, S. Meek, S. Smith, S. Warner, Y. J.K. Edwards, N. Bouchireb, A. Cottage, G. S.H. Yeo, Y. Umrania, et al. Generation and Analysis of 25 Mb of Genomic DNA from the Pufferfish Fugu rubripes by Sequence Scanning Genome Res., October 1, 1999; 9(10): 960 - 971. [Abstract] [Full Text] |
||||
![]() |
N. Jareborg, E. Birney, and R. Durbin Comparative Analysis of Noncoding Regions of 77 Orthologous Mouse and Human Gene Pairs Genome Res., September 1, 1999; 9(9): 815 - 824. [Abstract] [Full Text] |
||||
![]() |
M. W. Simmen, S. Leitgeb, J. Charlton, S. J. Jones, B. R. Harris, V. H. Clark, and A. Bird Nonmethylated Transposable Elements and Methylated Genes in a Chordate Genome Science, February 19, 1999; 283(5405): 1164 - 1167. [Abstract] [Full Text] |
||||
![]() |
J.-L. Blond, F. Besème, L. Duret, O. Bouton, F. Bedin, H. Perron, B. Mandrand, and F. Mallet Molecular Characterization and Placental Expression of HERV-W, a New Human Endogenous Retrovirus Family J. Virol., February 1, 1999; 73(2): 1175 - 1185. [Abstract] [Full Text] |
||||
![]() |
F. Sterky, S. Regan, J. Karlsson, M. Hertzberg, A. Rohde, A. Holmberg, B. Amini, R. Bhalerao, M. Larsson, R. Villarroel, et al. Gene discovery in the wood-forming tissues of poplar: Analysis of 5,692 expressed sequence tags PNAS, October 27, 1998; 95(22): 13330 - 13335. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. W. Simmen, S. Leitgeb, V. H. Clark, S. J. M. Jones, and A. Bird Gene number in an invertebrate chordate, Ciona intestinalis PNAS, April 14, 1998; 95(8): 4437 - 4440. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. N. Gladyshev, K.-T. Jeang, J. C. Wootton, and D. L. Hatfield A New Human Selenium-containing Protein. PURIFICATION, CHARACTERIZATION, AND cDNA SEQUENCE J. Biol. Chem., April 10, 1998; 273(15): 8910 - 8915. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. L. Harris Genotator: A Workbench for Sequence Annotation Genome Res., July 1, 1997; 7(7): 754 - 762. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Francis, T. M. Strom, S. Hennig, A. Boddrich, B. Lorenz, O. Brandau, K. L. Mohnike, M. Cagnoli, C. Steffens, S. Klages, et al. Genomic Organization of the Human PEX Gene Mutated in X-Linked Dominant Hypophosphatemic Rickets Genome Res., June 1, 1997; 7(6): 573 - 585. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Zhang and T. L. Madden PowerBLAST: A New Network BLAST Application for Interactive or Automated Sequence Analysis and Annotation Genome Res., June 1, 1997; 7(6): 649 - 656. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. M. Thomson, J. J. Lozano, N. Loukili, R. Carrió, F. Serras, B. Cormand, M. Valeri, V. M. Díaz, J. Abril, M. Burset, et al. Fusion of the Human Gene for the Polyubiquitination Coeffector UEV1 with Kua, a Newly Identified Gene Genome Res., November 1, 2000; 10(11): 1743 - 1756. [Abstract] [Full Text] |
||||
![]() |
S. F. Smith, P. Snell, F. Gruetzner, A. J. Bench, T. Haaf, J. A. Metcalfe, A. R. Green, and G. Elgar Analyses of the Extent of Shared Synteny and Conserved Gene Orders between the Genome of Fugu rubripes and Human 20q Genome Res., May 1, 2002; 12(5): 776 - 784. [Abstract] [Full Text] [PDF] |
||||











