Bioinformatics, Vol 15, 471-479, Copyright © 1999 by Oxford University Press
S Henikoff, JG Henikoff and S Pietrokovski
MOTIVATION: As databanks grow, sequence classification and prediction of
function by searching protein family databases becomes increasingly
valuable. The original Blocks Database, which contains ungapped multiple
alignments for families documented in Prosite, can be searched to classify
new sequences. However, Prosite is incomplete, and families from other
databases are now available to expand coverage of the Blocks Database.
RESULTS: To take advantage of protein family information present in several
existing compilations, we have used five databases to construct Blocks+, a
unified database that is built on the PROTOMAT/BLOSUM scoring model and
that can be searched using a single algorithm for consistent sequence
classification. The LAMA blocks- versus-blocks searching program identifies
overlapping protein families, making possible a non-redundant hierarchical
compilation. Blocks+ consists of all blocks derived from PROSITE, blocks
from Prints not present in PROSITE, blocks from Pfam-A not present in
PROSITE or Prints, and so on for ProDom and Domo, for a total of 1995
protein families represented by 8909 blocks, doubling the coverage of the
original Blocks Database. A challenge for any procedure aimed at non-
redundancy is to retain related but distinct families while discarding
those that are duplicates. We illustrate how using multiple compilations
can minimize this potential problem by examining the SNF2 family of
ATPases, which is detectably similar to distinct families of helicases and
ATPases. AVAILABILITY: http://blocks.fhcrc.org/
ARTICLES
Blocks+: a non-redundant database of protein alignment blocks derived from multiple compilations
Howard Hughes Medical Institute, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, Seattle, WA 98109-1024, USA. steveh@fhcrc.org
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
L. A. Kelley, P. J. Shrimpton, S. H. Muggleton, and M. J.E. Sternberg Discovering rules for protein-ligand specificity using support vector inductive logic programming Protein Eng. Des. Sel., September 1, 2009; 22(9): 561 - 567. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Eyal, S. Pietrokovski, and I. Bahar Rapid assessment of correlated amino acids from pair-to-pair (P2P) substitution matrices Bioinformatics, July 15, 2007; 23(14): 1837 - 1839. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. del Val, P. Ernst, M Falkenhahn, C. Fladerer, K. H. Glatting, S. Suhai, and A. Hotz-Wagenblatt ProtSweep, 2Dsweep and DomainSweep: protein analysis suite at DKFZ Nucleic Acids Res., July 13, 2007; 35(suppl_2): W444 - W450. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Przybylski and B. Rost Consensus sequences improve PSI-BLAST through mimicking profile-profile alignments Nucleic Acids Res., April 1, 2007; 35(7): 2238 - 2246. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Portugaly, N. Linial, and M. Linial EVEREST: a collection of evolutionary conserved protein domains Nucleic Acids Res., January 12, 2007; 35(suppl_1): D241 - D246. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Kajan, A. Kertesz-Farkas, D. Franklin, N. Ivanova, A. Kocsor, and S. Pongor Application of a simple likelihood ratio approximant to protein sequence classification Bioinformatics, December 1, 2006; 22(23): 2865 - 2869. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Kuwae, T. Matsuzawa, N. Ishikawa, H. Abe, T. Nonaka, H. Fukuda, S. Imajoh-Ohmi, and A. Abe BopC Is a Novel Type III Effector Secreted by Bordetella bronchiseptica and Has a Critical Role in Type III-dependent Necrotic Cell Death J. Biol. Chem., March 10, 2006; 281(10): 6589 - 6600. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Mullan Pairwise sequence alignment--it's all about us! Brief Bioinform, March 1, 2006; 7(1): 113 - 115. [Full Text] [PDF] |
||||
![]() |
A. Kocsor, A. Kertesz-Farkas, L. Kajan, and S. Pongor Application of compression-based distance measures to protein sequence classification: a methodological study Bioinformatics, February 15, 2006; 22(4): 407 - 412. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Maltsev, E. Glass, D. Sulakhe, A. Rodriguez, M. H. Syed, T. Bompada, Y. Zhang, and M. D'Souza PUMA2--grid-based high-throughput analysis of genomes and metabolic pathways Nucleic Acids Res., January 1, 2006; 34(suppl_1): D369 - D372. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Gajendran, M. D. Gonzales, A. Farmer, E. Archuleta, J. Win, M. E. Waugh, and S. Kamoun Phytophthora functional genomics database (PFGD): functional genomics of phytophthora-plant interactions Nucleic Acids Res., January 1, 2006; 34(suppl_1): D465 - D470. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. J. Siezen, B. Renckens, I. van Swam, S. Peters, R. van Kranenburg, M. Kleerebezem, and W. M. de Vos Complete Sequences of Four Plasmids of Lactococcus lactis subsp. cremoris SK11 Reveal Extensive Adaptation to the Dairy Environment Appl. Envir. Microbiol., December 1, 2005; 71(12): 8371 - 8382. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Zheng, R. J. Roberts, S. Kasif, and C. Guan Characterization of Two New Aminopeptidases in Escherichia coli J. Bacteriol., June 1, 2005; 187(11): 3671 - 3677. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. D. Gonzales, E. Archuleta, A. Farmer, K. Gajendran, D. Grant, R. Shoemaker, W. D. Beavis, and M. E. Waugh The Legume Information System (LIS): an integrated information resource for comparative legume biology Nucleic Acids Res., January 1, 2005; 33(suppl_1): D660 - D665. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. del Val, A. Mehrle, M. Falkenhahn, M. Seiler, K.-H. Glatting, A. Poustka, S. Suhai, and S. Wiemann High-throughput protein analysis integrating bioinformatics and experimental assays Nucleic Acids Res., February 3, 2004; 32(2): 742 - 748. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. S. Williams, D. I. Chasman, D. D. Hau, B. Hui, A. Y. Lau, and J. N. M. Glover Detection of Protein Folding Defects Caused by BRCA1-BRCT Truncation and Missense Mutations J. Biol. Chem., December 26, 2003; 278(52): 53007 - 53016. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Papazisi, T. S. Gorton, G. Kutish, P. F. Markham, G. F. Browning, D. K. Nguyen, S. Swartzell, A. Madan, G. Mahairas, and S. J. Geary The complete genome sequence of the avian pathogen Mycoplasma gallisepticum strain Rlow Microbiology, September 1, 2003; 149(9): 2307 - 2316. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Petrickova and M. Petricek Eukaryotic-type protein kinases in Streptomyces coelicolor: variations on a common theme Microbiology, July 1, 2003; 149(7): 1609 - 1621. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. P. Bennett, L. Lu, and D. L. Brutlag 3MATRIX and 3MOTIF: a protein structure visualization system for conserved sequence motifs Nucleic Acids Res., July 1, 2003; 31(13): 3328 - 3332. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Bossis and J. A. Chiorini Cloning of an Avian Adeno-Associated Virus (AAAV) and Generation of Recombinant AAAV Particles J. Virol., June 15, 2003; 77(12): 6799 - 6810. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Cherepanov, G. Maertens, P. Proost, B. Devreese, J. Van Beeumen, Y. Engelborghs, E. De Clercq, and Z. Debyser HIV-1 Integrase Forms Stable Tetramers and Associates with LEDGF/p75 Protein in Human Cells J. Biol. Chem., January 3, 2003; 278(1): 372 - 381. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Frishman, M. Mokrejs, D. Kosykh, G. Kastenmuller, G. Kolesov, I. Zubrzycki, C. Gruber, B. Geier, A. Kaps, K. Albermann, et al. The PEDANT genome database Nucleic Acids Res., January 1, 2003; 31(1): 207 - 211. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Papazisi, S. Frasca Jr., M. Gladd, X. Liao, D. Yogev, and S. J. Geary GapA and CrmA Coexpression Is Essential for Mycoplasma gallisepticum Cytadherence and Virulence Infect. Immun., December 1, 2002; 70(12): 6839 - 6845. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. K. Cusick, E. Hager, and R. E. Gill Characterization of bcsA Mutations That Bypass Two Distinct Signaling Requirements for Myxococcus xanthus Development J. Bacteriol., September 15, 2002; 184(18): 5141 - 5150. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Nakamura, H. Tanaka, M. Koga, Y. Miyagawa, N. Iguchi, C. E. de Carvalho, K. Yomogida, M. Nozaki, H. Nojima, K. Matsumiya, et al. Molecular Cloning and Characterization of oppo 1: A Haploid Germ Cell-Specific Complementary DNA Encoding Sperm Tail Protein Biol Reprod, July 1, 2002; 67(1): 1 - 7. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Xuan, W. R. McCombie, and M. Q. Zhang GFScan: A Gene Family Search Tool at Genomic DNA Level Genome Res., July 1, 2002; 12(7): 1142 - 1149. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. P. Skaar, M. P. Lazio, and H. S. Seifert Roles of the recJ and recN Genes in Homologous Recombination and DNA Repair Pathways of Neisseria gonorrhoeae J. Bacteriol., February 15, 2002; 184(4): 919 - 927. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Maltsev, E. Marland, G. X. Yu, S. Bhatnagar, and R. Lusk Sentra, a database of signal transduction proteins Nucleic Acids Res., January 1, 2002; 30(1): 349 - 350. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Hoskins, W. E. Alborn Jr., J. Arnold, L. C. Blaszczak, S. Burgett, B. S. DeHoff, S. T. Estrem, L. Fritz, D.-J. Fu, W. Fuller, et al. Genome of the Bacterium Streptococcus pneumoniae Strain R6 J. Bacteriol., October 1, 2001; 183(19): 5709 - 5717. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Pouliot, J. Gao, Q. J. Su, G. G. Liu, and X. B. Ling DIAN: A Novel Algorithm for Genome Ontological Classification Genome Res., October 1, 2001; 11(10): 1766 - 1779. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. G. Valenzuela, Y. Belkaid, M. K. Garfield, S. Mendez, S. Kamhawi, E. D. Rowton, D. L. Sacks, and J. M.C. Ribeiro Toward a Defined Anti-Leishmania Vaccine Targeting Vector Antigens: Characterization of a Protective Salivary Protein J. Exp. Med., August 6, 2001; 194(3): 331 - 342. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Takai, S. A. Hines, T. Sekizaki, V. M. Nicholson, D. A. Alperin, M. Osaki, D. Takamatsu, M. Nakamura, K. Suzuki, N. Ogino, et al. DNA Sequence and Comparison of Virulence Plasmids from Rhodococcus equi ATCC 33701 and 103 Infect. Immun., December 1, 2000; 68(12): 6840 - 6847. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. S.B. Goldstein and S. Gunawardena Flying through the Drosophila Cytoskeletal Genome J. Cell Biol., July 24, 2000; 150(2): 63 - 68. [Full Text] [PDF] |
||||
![]() |
M. G. Reese, G. Hartzell, N. L. Harris, U. Ohler, J. F. Abril, and S. E. Lewis Genome Annotation Assessment in Drosophila melanogaster Genome Res., April 1, 2000; 10(4): 483 - 501. [Abstract] [Full Text] |
||||
![]() |
J. G. Henikoff and S. Henikoff Drosophila Genomic Sequence Annotation Using the BLOCKS+ Database Genome Res., April 1, 2000; 10(4): 543 - 546. [Abstract] [Full Text] |
||||
![]() |
T. K. Attwood, M. D. R. Croning, D. R. Flower, A. P. Lewis, J. E. Mabey, P. Scordis, J. N. Selley, and W. Wright PRINTS-S: the database formerly known as PRINTS Nucleic Acids Res., January 1, 2000; 28(1): 225 - 227. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. G. Henikoff, E. A. Greene, S. Pietrokovski, and S. Henikoff Increased coverage of protein families with the Blocks Database servers Nucleic Acids Res., January 1, 2000; 28(1): 228 - 230. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Porta, A. M. Ramon, and W. A. Fonzi PRR1, a Homolog of Aspergillus nidulans palF, Controls pH-Dependent Gene Expression and Filamentation in Candida albicans J. Bacteriol., December 15, 1999; 181(24): 7516 - 7523. [Abstract] [Full Text] |
||||
![]() |
M. H. F. Sorgine, C. Logullo, R. B. Zingali, G. O. Paiva-Silva, L. Juliano, and P. L. Oliveira A Heme-binding Aspartic Proteinase from the Eggs of the Hard Tick Boophilus microplus J. Biol. Chem., September 8, 2000; 275(37): 28659 - 28665. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Araki, R. Nagarajan, and J. Milbrandt Identification of Genes Induced in Peripheral Nerve after Injury. EXPRESSION PROFILING AND NOVEL GENE DISCOVERY J. Biol. Chem., August 31, 2001; 276(36): 34131 - 34141. [Abstract] [Full Text] [PDF] |
||||













