Bioinformatics, Vol 15, 382-390, Copyright © 1999 by Oxford University Press
GY Srinivasarao, LS Yeh, CR Marzec, BC Orcutt and WC Barker
MOTIVATION: The Protein Information Resource (PIR) maintains a database of
annotated and curated alignments in order to visually represent
interrelationships among sequences in the PIR-International Protein
Sequence Database, to spread and standardize protein names, features and
keywords among members of a family or superfamily, and to aid us in
classifying sequences, in identifying conserved regions, and in defining
new homology domains. RESULTS: Release 22.0, (December 1998), of the
PIR-ALN database contains a total of 3806 alignments, including 1303
superfamily, 2131 family and 372 homology domain alignments. This is an
appropriate dataset to develop and extract patterns, test profiles, train
neural networks or build Hidden Markov Models (HMMs). These alignments can
be used to standardize and spread annotation to newer members by homology,
as well as to understand the modular architecture of multidomain proteins.
PIR-ALN includes 529 alignments that can be used to develop patterns not
represented in PROSITE, Blocks, PRINTS and Pfam databases. The ATLAS
information retrieval system can be used to browse and query the PIR-ALN
alignments. AVAILABILITY: PIR-ALN is currently being distributed as a
single ASCII text file along with the title, member, species, superfamily
and keyword indexes. The quarterly and weekly updates can be accessed via
the WWW at pir.georgetown.edu. The quarterly updates can also be obtained
by anonymous FTP from the PIR FTP site at NBRF.Georgetown.edu, directory
[ANONYMOUS.PIR.ALIGNMENT].
ARTICLES
PIR-ALN: a database of protein sequence alignments
Protein Information Resource (PIR), National Biomedical Research Foundation, 3900 Reservoir Road NW, Washington, DC 20007, USA. Geetha@nbrf.georgetown.edu
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
A. C. W. May Optimal classification of protein sequences and selection of representative sets from multiple alignments: application to homologous families and lessons for structural genomics Protein Eng. Des. Sel., April 1, 2001; 14(4): 209 - 217. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. C. Barker, J. S. Garavelli, Z. Hou, H. Huang, R. S. Ledley, P. B. McGarvey, H.-W. Mewes, B. C. Orcutt, F. Pfeiffer, A. Tsugita, et al. Protein Information Resource: a community resource for expert annotation of protein data Nucleic Acids Res., January 1, 2001; 29(1): 29 - 32. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. H. Haft, B. J. Loftus, D. L. Richardson, F. Yang, J. A. Eisen, I. T. Paulsen, and O. White TIGRFAMs: a protein family resource for the functional identification of proteins Nucleic Acids Res., January 1, 2001; 29(1): 41 - 43. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. H. Wu, C. Xiao, Z. Hou, H. Huang, and W. C. Barker iProClass: an integrated, comprehensive and annotated protein classification database Nucleic Acids Res., January 1, 2001; 29(1): 52 - 54. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. C. Barker, J. S. Garavelli, H. Huang, P. B. McGarvey, B. C. Orcutt, G. Y. Srinivasarao, C. Xiao, L.-S. L. Yeh, R. S. Ledley, J. F. Janda, et al. The Protein Information Resource (PIR) Nucleic Acids Res., January 1, 2000; 28(1): 41 - 44. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. A. Ball, K. Dolinski, S. S. Dwight, M. A. Harris, L. Issel-Tarver, A. Kasarskis, C. R. Scafe, G. Sherlock, G. Binkley, H. Jin, et al. Integrating functional genomic information into the Saccharomyces Genome Database Nucleic Acids Res., January 1, 2000; 28(1): 77 - 80. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Huang, C. Xiao, and C. H. Wu ProClass protein family database Nucleic Acids Res., January 1, 2000; 28(1): 273 - 276. [Abstract] [Full Text] [PDF] |
||||

