Bioinformatics Advance Access originally published online on September 7, 2004
Bioinformatics 2005 21(3):410-412; doi:10.1093/bioinformatics/bti011
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Bioinformatics vol. 21 issue 3 © Oxford University Press 2005; all rights reserved.
iPfam: visualization of proteinprotein interactions in PDB at domain and amino acid resolutions
The Wellcome Trust Sanger Institute The Wellcome Trust Genome Campus, Hinxton CB10 1SA, UK
*To whom correspondence should be addressed.
| Abstract |
|---|
|
|
|---|
Summary: There are many resources that contain information about binary interactions between proteins. However, protein interactions are defined by only a subset of residues in any protein. We have implemented a web resource that allows the investigation of protein interactions in the Protein Data Bank structures at the level of Pfam domains and amino acid residues. This detailed knowledge relies on the fact that there are a large number of multidomain proteins and protein complexes being deposited in the structure databases. The resource called iPfam is hosted within the Pfam UK website. Most resources focus on the interactions between proteins; iPfam includes these as well as interactions between domains in a single protein.
Availability: iPfam is available on the Web for browsing at http://www.sanger.ac.uk/Software/Pfam/iPfam/; the source-data for iPfam is freely available in relational tables via the ftp site ftp://ftp.sanger.ac.uk/pub/databases/Pfam/database_files/
Contact: rdf{at}sanger.ac.uk
Although we often consider proteins in isolation, it now seems likely that the majority of proteins in the cell are parts of large molecular complexes (Gavin et al., 2002; Uetz et al., 2000). Information on these complexes and the proteinprotein interactions that they are composed of are being collected in databases, such as DIP (Salwinski et al., 2004), MINT (Zanzoni et al., 2002), BIND (Bader et al., 2003) and IntAct (Hermjakob et al., 2004). The large majority of data in these databases presents binary information, which states that protein A interacts with protein B. Although this is useful, we would often like to know which specific region of the proteins are interacting, e.g. which domains are interacting. This may be between two different proteins (interchain interaction) or for many cases the interactions is between two domains within a single protein (intrachain interaction). Inference on these domain interactions from binary interactions has been attempted (Deng et al., 2002) but is not yet sensitive or reliable. To understand how protein domains interact at the molecular level, we need to know which residues, and their constituent atoms, in each protein are interacting. These kind of data are available in the Protein Data Bank (PDB) database of protein structures (Bourne et al., 2004) where multiple domains are present in a single structure. However, these residue level interaction data are not readily accessible to a biologist to use, and even a structural biologist would require some time to define them.
To identify interactions at the level of amino acid residues one has to define where the domains lie in the structure. Domains need to be defined on the structure as we need to distinguish between interactions that occur between amino acids within the domains and interactions between amino acids from another domain. To define domains in iPfam, we use the domain definitions from Pfam (Bateman et al., 2004) projected onto the PDB structures using the UniProt (Apweiler et al., 2004) to PDB mapping provided by the MSD (Golovin et al., 2004). To identify the interactions between residues, we calculate all bonds (van der Waals, side chain and main chain H-bonds, salt bridge and disulphide) between all pairs of residues in different domains. These bonds can be between domains within the same chain or between domains in different chains in the structure.
iPfam contains interactions among 2733 different pairs of domains (as of Pfam release 12.0). These domaindomain interactions can readily be browsed via the UK Pfam website (http://www.sanger.ac.uk/Software/Pfam/iPfam/browse/number.shtml). For each domain page in Pfam with a known domaindomain interaction in the PDB, we provide a link to the iPfam resource. For each protein structure we provide a domain interaction network, as shown in Figure 1, which readily conveys information about which domains in a structure are interacting directly. This graph representation is generated using the GraphViz package (http://www.research.att.com/sw/tools/graphviz/). On this page, we also provide the facility to view the interaction in more detail. We allow visualization of the PDB structure with the interacting residues marked up using Rasmol (Sayle and Milner-White, 1995), and we plan to add viewing with Jmol in the near future (http://jmol.sourceforge.net/). A link from this page also allows users to browse the interaction data at the level of amino acid residue. Figure 2 shows how this information is displayed. By zooming in on the interaction picture one can get exact residue interactions. Although we store the bond types in our relational database, we do not display this information currently. Users interested in this fine grained annotation can download the relational tables for local use.
|
|
iPfam also allows one to look at the conservation of domaindomain interactions between different PDB entries of the same protein. This allows users to discern which residues are conserved in the domaindomain interface.
To our knowledge, this is the first comprehensive site that allows non-experts to access detailed domain interactions available in the protein structures. We hope that this growing dataset will improve our understanding of proteinprotein interactions.
Received on May 20, 2004; revised on August 9, 2004; accepted on August 29, 2004
| REFERENCES |
|---|
|
|
|---|
Apweiler, R., Bairoch, A., Wu, C.H., Barker, W.C., Boeckmann, B., Ferro, S., Gasteiger, E., Huang, H., Lopez, R., Magrane, M., et al. (2004) UniProt: the Universal Protein knowledgebase. Nucleic Acids Res., 32, D115D119
Bader, G.D., Betel, D., Hogue, C.W. (2003) BIND: the Biomolecular Interaction Network Database. Nucleic Acids Res., 31, 248250
Bateman, A., Coin, L., Durbin, R., Finn, R.D., Hollich, V., Griffiths-Jones, S., Khanna, A., Marshall, M., Moxon, S., Sonnhammer, E.L., et al. (2004) The Pfam protein families database. Nucleic Acids Res., 32, D138D141
Benini, S., Ciurli, S., Rypniewski, W.R., Wilson, K.S., Mangani, S. (1998) Crystallization and preliminary high-resolution X-ray diffraction analysis of native and beta-mercaptoethanol-inhibited urease from Bacillus pasteurii . Acta Crystallogr. D Biol. Crystallogr., 54, 409412[Medline].
Bourne, P.E., Addess, K.J., Bluhm, W.F., Chen, L., Deshpande, N., Feng, Z., Fleri, W., Green, R., Merino-Ott, J.C., Townsend-Merino, W., et al. (2004) The distribution and query systems of the RCSB Protein Data Bank. Nucleic Acids Res., 32, D223D225
Deng, M., Mehta, S., Sun, F., Chen, T. (2002) Inferring domaindomain interactions from proteinprotein interactions. Genome Res., 12, 15401548
Gavin, A.C., Bosche, M., Krause, R., Grandi, P., Marzioch, M., Bauer, A., Schultz, J., Rick, J.M., Michon, A.M., Cruciat, C.M., et al. (2002) Functional organization of the yeast proteome by systematic analysis of protein complexes. Nature, 415, 141147[CrossRef][Medline].
Golovin, A., Oldfield, T.J., Tate, J.G., Velankar, S., Barton, G.J., Boutselakis, H., Dimitropoulos, D., Fillon, J., Hussain, A., Ionides, J.M., et al. (2004) E-MSD: an integrated data resource for bioinformatics. Nucleic Acids Res., 32, D211D216
Hermjakob, H., Montecchi-Palazzi, L., Lewington, C., Mudali, S., Kerrien, S., Orchard, S., Vingron, M., Roechert, B., Roepstorff, P., Valencia, A., et al. (2004) IntAct: an open source molecular interaction database. Nucleic Acids Res., 32, D452D455
Salwinski, L., Miller, C.S., Smith, A.J., Pettit, F.K., Bowie, J.U., Eisenberg, D. (2004) The Database of Interacting Proteins: 2004 update. Nucleic Acids Res., 32, D449D451
Sayle, R.A.R. and Milner-White, E.J. (1995) RASMOL: biomolecular graphics for all. Trends Biochem. Sci., 20, 374[CrossRef][Web of Science][Medline].
Uetz, P., Giot, L., Cagney, G., Mansfield, T.A., Judson, R.S., Knight, J.R., Lockshon, D., Narayan, V., Srinivasan, M., Pochart, P., et al. (2000) A comprehensive analysis of proteinprotein interactions in Saccharomyces cerevisiae . Nature, 403, 623627[CrossRef][Medline].
Zanzoni, A., Montecchi-Palazzi, L., Quondam, M., Ausiello, G., Helmer-Citterich, M., Cesareni, G. (2002) MINT: a Molecular INTeraction database. FEBS Lett., 513, 135140[CrossRef][Web of Science][Medline].
This article has been cited by other articles:
![]() |
P. Smialowski, P. Pagel, P. Wong, B. Brauner, I. Dunger, G. Fobo, G. Frishman, C. Montrone, T. Rattei, D. Frishman, et al. The Negatome database: a reference set of non-interacting protein pairs Nucleic Acids Res., November 17, 2009; (2009) gkp1026v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Bjorkholm and E. L. L. Sonnhammer Comparative analysis and unification of domain-domain interaction networks Bioinformatics, November 15, 2009; 25(22): 3020 - 3025. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Liu, X.-w. Chen, and R. Jothi Knowledge-guided inference of domain-domain interactions from incomplete protein-protein interaction networks Bioinformatics, October 1, 2009; 25(19): 2492 - 2499. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. A. Encinar, G. Fernandez-Ballester, I. E. Sanchez, E. Hurtado-Gomez, F. Stricher, P. Beltrao, and L. Serrano ADAN: a database for prediction of protein-protein interaction of modular domains mediated by linear motifs Bioinformatics, September 15, 2009; 25(18): 2418 - 2424. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. C. Friedel and R. Zimmer Identifying the topology of protein complexes from affinity purification assays Bioinformatics, August 15, 2009; 25(16): 2140 - 2146. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Blankenburg, F. Ramirez, J. Buch, and M. Albrecht DASMIweb: online integration, analysis and assessment of distributed protein interaction data Nucleic Acids Res., July 1, 2009; 37(suppl_2): W122 - W128. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. Fornes, R. Aragues, J. Espadaler, M. A. Marti-Renom, A. Sali, and B. Oliva ModLink+: improving fold recognition by using protein-protein interactions Bioinformatics, June 15, 2009; 25(12): 1506 - 1512. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Blankenburg, R. D. Finn, A. Prlic, A. M. Jenkinson, F. Ramirez, D. Emig, S.-E. Schelhorn, J. Buch, T. Lengauer, and M. Albrecht DASMI: exchanging, annotating and assessing molecular interaction data Bioinformatics, May 15, 2009; 25(10): 1321 - 1328. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Zhou and Z. Shang 2D molecular graphics: a flattened world of chemistry and biology Brief Bioinform, May 1, 2009; 10(3): 247 - 258. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Higurashi, T. Ishida, and K. Kinoshita PiSite: a database of protein interaction sites using multiple binding states in the PDB Nucleic Acids Res., January 1, 2009; 37(suppl_1): D360 - D364. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Akiva, Z. Itzhaki, and H. Margalit Built-in loops allow versatility in domain-domain interactions: Lessons from self-interacting domains PNAS, September 9, 2008; 105(36): 13292 - 13297. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Goll, S. V. Rajagopala, S. C. Shiau, H. Wu, B. T. Lamb, and P. Uetz MPIDB: the microbial protein interaction database Bioinformatics, August 1, 2008; 24(15): 1743 - 1744. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Michaut, S. Kerrien, L. Montecchi-Palazzi, F. Chauvat, C. Cassier-Chauvat, J.-C. Aude, P. Legrain, and H. Hermjakob InteroPORC: automated inference of highly conserved protein interaction networks Bioinformatics, July 15, 2008; 24(14): 1625 - 1631. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Guo, X. Wu, D.-Y. Zhang, and K. Lin Genome-wide inference of protein interaction sites: lessons from the yeast high-quality negative protein-protein interaction dataset Nucleic Acids Res., April 1, 2008; 36(6): 2002 - 2011. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. K. Basu, L. Carmel, I. B. Rogozin, and E. V. Koonin Evolution of protein domain promiscuity in eukaryotes Genome Res., March 1, 2008; 18(3): 449 - 461. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Raghavachari, A. Tasneem, T. M. Przytycka, and R. Jothi DOMINE: a database of protein domain interactions Nucleic Acids Res., January 11, 2008; 36(suppl_1): D656 - D661. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Pagel, M. Oesterheld, O. Tovstukhina, N. Strack, V. Stumpflen, and D. Frishman DIMA 2.0 predicted and known domain interactions Nucleic Acids Res., January 11, 2008; 36(suppl_1): D651 - D655. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. D. Finn, J. Tate, J. Mistry, P. C. Coggill, S. J. Sammut, H.-R. Hotz, G. Ceric, K. Forslund, S. R. Eddy, E. L. L. Sonnhammer, et al. The Pfam protein families database Nucleic Acids Res., January 11, 2008; 36(suppl_1): D281 - D288. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Brilli, R. Fani, and P. Lio Current trends in the bioinformatic sequence analysis of metabolic pathways in prokaryotes Brief Bioinform, January 1, 2008; 9(1): 34 - 45. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Hernandez-Toro, C. Prieto, and J. De Las Rivas APID2NET: unified interactome graphic analyzer Bioinformatics, September 15, 2007; 23(18): 2495 - 2497. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Schlicker, C. Huthmacher, F. Ramirez, T. Lengauer, and M. Albrecht Functional evaluation of domain domain interactions and human protein interaction networks Bioinformatics, April 1, 2007; 23(7): 859 - 865. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. J. Kundrotas and E. Alexov PROTCOM: searchable database of protein complexes enhanced with domain-domain structures Nucleic Acids Res., January 12, 2007; 35(suppl_1): D575 - D579. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Ceol, A. Chatr-aryamontri, E. Santonico, R. Sacco, L. Castagnoli, and G. Cesareni DOMINO: a database of domain-peptide interactions Nucleic Acids Res., January 12, 2007; 35(suppl_1): D557 - D560. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. M. Kim, L. J. Lu, Y. Xia, and M. B. Gerstein Relating Three-Dimensional Structures to Protein Networks Provides Evolutionary Insights Science, December 22, 2006; 314(5807): 1938 - 1941. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Paarmann, B. Schmitt, B. Meyer, M. Karas, and H. Betz Mass Spectrometric Analysis of Glycine Receptor-associated Gephyrin Splice Variants J. Biol. Chem., November 17, 2006; 281(46): 34918 - 34925. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Li, J. Li, and L. Wong Discovering motif pairs at interaction sites from protein sequences on a proteome-wide scale Bioinformatics, April 15, 2006; 22(8): 989 - 996. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Pagel, M. Oesterheld, V. Stumpflen, and D. Frishman The DIMA web resource--exploring the protein domain network Bioinformatics, April 15, 2006; 22(8): 997 - 998. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Henschel, W. K. Kim, and M. Schroeder Equivalent binding sites reveal convergently evolved interaction motifs Bioinformatics, March 1, 2006; 22(5): 550 - 555. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. D. Finn, J. Mistry, B. Schuster-Bockler, S. Griffiths-Jones, V. Hollich, T. Lassmann, S. Moxon, M. Marshall, A. Khanna, R. Durbin, et al. Pfam: clans, web tools and services Nucleic Acids Res., January 1, 2006; 34(suppl_1): D247 - D251. [Abstract] [Full Text] [PDF] |
||||
![]() |
X.-W. Chen and M. Liu Prediction of protein-protein interactions using random decision forest framework Bioinformatics, December 15, 2005; 21(24): 4394 - 4400. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Pugalenthi, G. Archunan, and R. Sowdhamini DIAL: a web-based server for the automatic identification of structural domains in proteins Nucleic Acids Res., July 1, 2005; 33(suppl_2): W130 - W132. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||








