Skip Navigation


Bioinformatics Advance Access originally published online on March 3, 2005
Bioinformatics 2005 21(10):2541-2543; doi:10.1093/bioinformatics/bti366
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow All Versions of this Article:
21/10/2541    most recent
bti366v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (23)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Gong, S.
Right arrow Articles by Bhak, J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Gong, S.
Right arrow Articles by Bhak, J.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2005. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions{at}oupjournals.org

PSIbase: a database of Protein Structural Interactome map (PSIMAP)

Sungsam Gong 1, Giseok Yoon 2, Insoo Jang 3, Dan Bolser 4, Panos Dafas 5, Michael Schroeder 6, Hansol Choi 1, Yoobok Cho 2, Kyungsook Han 7, Sunghoon Lee 3, Hwanho Choi 1, Michael Lappe 8, Liisa Holm 9, Sangsoo Kim 3, Donghoon Oh 2 and Jonghwa Bhak 1,2,3,10,*

1Biomatics Lab, Department of BioSystems, KAIST Daejeon, Korea
2OITEK Daejeon, Korea
3NGIC, KRIBB Daejeon, Korea
4MRC-DUNN Cambridge, UK
5City University London, UK
6Biotechnologisches Zentrum TU Dresden, Germany
7Inha University Incheon, Korea
8Max Planck Institute for Molecular Genetics Berlin, Germany
9Helsinki University Finland
10BiO Centre, KAIST Daejeon, Korea

*To whom correspondence should be addressed.


    Abstract
 TOP
 Abstract
 INTRODUCTION
 PSIMAP ALGORITHM
 DATABASE ACCESS
 CONCLUDING REMARKS
 REFERENCES
 

Summary: Protein Structural Interactome map (PSIMAP) is a global interaction map that describes domain–domain and protein–protein interaction information for known Protein Data Bank structures. It calculates the Euclidean distance to determine interactions between possible pairs of structural domains in proteins. PSIbase is a database and file server for protein structural interaction information calculated by the PSIMAP algorithm. PSIbase also provides an easy-to-use protein domain assignment module, interaction navigation and visual tools. Users can retrieve possible interaction partners of their proteins of interests if a significant homology assignment is made with their query sequences.

Availability: http://psimap.org and http://psibase.kaist.ac.kr/

Contact: biopark{at}kaist.ac.kr

Supplementary information: Supplementary material is available at http://psibase.kaist.ac.kr/Doc/supplementary_material.htm


    INTRODUCTION
 TOP
 Abstract
 INTRODUCTION
 PSIMAP ALGORITHM
 DATABASE ACCESS
 CONCLUDING REMARKS
 REFERENCES
 
Most proteins function by interacting with other molecules. Therefore, it is important to investigate the interaction partners of proteins. Recently, high-throughput experiments, such as yeast (Uetz et al., 2000) and fly (Giot et al., 2003) proteomes, have enabled us to elucidate the interaction networks on a large scale. These large-scale experiment results are collected and well-curated into interaction databases such as the Database of Interacting Proteins (DIP) (Salwinski et al., 2000), Biomolecular Interaction Network Database (BIND) (Bader et al., 2003) and Molecular INTeraction database (MINT) (Zanzoni et al., 2002). There have also been computational approaches to map and predict the protein interactome in a genomic context using gene fusion and gene neighborhood methods (Huynen et al., 2000).

In parallel with the above methods, PSIMAP (Protein Structural Interactome map) has introduced a new mapping protocol in protein structural interactome study. An underlying concept of PSIMAP is homologous interaction: the interaction among protein structures is conserved as closely as the protein structures themselves (Park et al., 2001; Aloy and Russell, 2002; Aloy et al., 2003). With PSIMAP, we can view protein interactions in terms of family–family interactions, as well as individual protein–protein interactions. PSIMAP covers interaction information from both gene fusion style protein sequence level interaction and physical interaction within complexes or multi-domain proteins.

Here, we introduce PSIbase: the PSIMAP web server and database. It contains (1) domain–domain and protein–protein interaction information from proteins whose 3D-structures are identified, (2) a protein interaction map and its viewer at protein superfamily and family levels, (3) protein interaction interface viewers and (4) structural domain prediction tools for possible interactions by detecting homologous matches in the Protein Data Bank (PDB) from query sequences. Structural interaction data, in flat file format, can be downloaded from PSIbase (http://psibase.kaist.ac.kr/Download/download.shtml) for further analyses. It contains the smallest distance between two domains and the number of residue pairs that is within the threshold distance according to the PSIMAP algorithm. It not only provides raw data files, but it also serves biologists who need to look up the interaction partners of their proteins of interest. Simply putting a protein sequence is enough to search for possible interaction partners (interlogs). As the possible predicted domains of query sequence are based on a structural assignment protocol, users can see the interlogs' 3D structures if they accept the prediction made by PSIbase. For structural domain assignment, we used two databases and two algorithms. They were the SCOP (http://scop.kaist.ac.kr/scop, Murzin et al., 1995) database with an intermediate sequence library ISL, (Teichmann et al., 2000), and PSI-BLAST (http://www.ncbi.nlm.nih.gov/BLAST/) with a hidden Markov model package (HMMER, http://hmmer.wustl.edu/). We believe that PSIbase is useful for those in the fields of structural bioinformatics and molecular biology.


    PSIMAP ALGORITHM
 TOP
 Abstract
 INTRODUCTION
 PSIMAP ALGORITHM
 DATABASE ACCESS
 CONCLUDING REMARKS
 REFERENCES
 
The basic mechanism to check interactions between any two domains or proteins is the calculation of the Euclidean distance in order to see if they are within a certain distance threshold. PSIMAP checks every possible pair of structural domains in a protein to see if there are at least five residue contacts within a 5 Å distance (5–5 rule). The current PSIMAP protocol has three methods. They are the Full Atom Contact (FAC) PSIMAP, Sampled Atom Contact (SAC) PSIMAP and Bounding Box Contact (BBC) PSIMAP (Dafas et al., 2004). (The supplementary material provides in-depth information about the three different PSIMAP algorithms.)

The FAC calculates all the atomic contacts among two or more protein structural domains. FAC PSIMAP is the most accurate of the three, as we take into account all the atoms in domain pairs.

The SAC and BBC algorithms are approximations of FAC. Their main purpose is to reduce the time taken in constructing PSIMAP. The BBC algorithm is a radically different approach, using a bounding box algorithm to dramatically reduce the time of computation. Dafas et al. introduced a bounding box and convex hull algorithm that can reduce the search space.


    DATABASE ACCESS
 TOP
 Abstract
 INTRODUCTION
 PSIMAP ALGORITHM
 DATABASE ACCESS
 CONCLUDING REMARKS
 REFERENCES
 
The PSIbase server is available at http://psibase.kaist.ac.kr/. There are three different query interfaces to access the PSIbase. All queries are funneled into a web page that shows protein domain interactions with their partners.

First, PSIbase provides a simple search interface that looks up keywords or database accession IDs. Figure 1 shows the search result of ‘ligase’ as a query against 12 annotated DB resources (listed on the PSIbase webpage). Out of the 12, multiple matches for the query ‘ligase’ are listed up from the following databases: PDB, SCOP, TIGRFAMs, Swiss-Prot, ProDom, Pfam, Prosite and Interpro. There are three tools to view interaction interface structures: Chime (http://www.mdli.com), Jmol (http://jmol.sourceforge.net) and Interfacer (http://www.interfacer.org). Interfacer is a slow but advanced protein interface viewer with surface representation capability.



View larger version (25K):
[in this window]
[in a new window]
 
Fig. 1 The visualization of interaction information. Search results for ‘ligase’ as a query. ‘RSAC’ button in box a leads to a page b that shows interactions among domains which are defined by SCOP. c and d boxes show the interacting interfaces with different viewers. The interaction network is shown in e.

 
The second PSIbase query interface is a protein structural domain assignment utility that accepts protein sequences from users. There are two domain assignment algorithms available in PSIbase. One is a homology-based sequence search by PSI-BLAST utilizing the ISL (see Introduction) and the other is the HMMER profile search algorithm. These two are complementary in terms of the coverage in the assignment.

The last PSIbase query interface accepts specific domain IDs at SCOP family or superfamily levels. There are several levels to determine interactions among query domains. For example, interacting partners of a specific query domain can be identified within a specified interaction depth (the maximum depth limit is 4). Interactions between two or more input query domains can also be identified. Additionally, PSIbase is equipped with a simple open-source Java applet program that shows the interaction network of each query.


    CONCLUDING REMARKS
 TOP
 Abstract
 INTRODUCTION
 PSIMAP ALGORITHM
 DATABASE ACCESS
 CONCLUDING REMARKS
 REFERENCES
 
There are 1294 superfamilies and 2327 families in SCOP 1.65. On average, PSIbase covers 87% (1136/1294) of SCOP superfamily interactions, indicating that the majority of SCOP superfamilies have interacting partner information. In the supplementary material, Table 2 shows the 20 most interactive superfamilies in PSIbase. These can be regarded as the most central interaction components in interactomes, so we call them the ‘interactome core’. This core contains proteins with energy metabolism, RNA and DNA binding, and other key biological processes that have existed since the very early days of interaction networks (Bolser et al., 2003). The interactions of non-protein molecules in cells are critical in biological functions. In the next version, PSIbase and PSIMAP will cover interactions between proteins and non-proteins such as nucleic acids and small molecules.


    Acknowledgments
 
We thank Mr. Chung MoonSoul for donating $25 million to the Department of Biosystems at KAIST. This project was funded by IMT-2000 C3-4 grants from the Ministry of Information and Communication of Korea and a grant from KRIBB Research Initiative Program. J.B. is supported by Biogreen21. We thank Maryana Bhak for editing and commenting on this manuscript. We also send our loving gratitude to the anonymous reviewers for their precious comments.

Received on November 1, 2004; revised on January 27, 2005; accepted on February 28, 2005

    REFERENCES
 TOP
 Abstract
 INTRODUCTION
 PSIMAP ALGORITHM
 DATABASE ACCESS
 CONCLUDING REMARKS
 REFERENCES
 

    Aloy, P. and Russell, R.B. (2002) Interrogating protein interaction networks through structural biology. Proc. Natl Acad. Sci. USA, 99, 5896–5901[Abstract/Free Full Text].

    Aloy, P., et al. (2003) The relationship between sequence and interaction divergence in proteins. J. Mol. Biol., 332, 989–998[CrossRef][Web of Science][Medline].

    Bader, G.D., et al. (2003) BIND: the Biomolecular Interaction Network Database. Nucleic Acids Res., 31, 48–50.

    Bolser, D.M., et al. (2003) Visualisation and graph-theoretic analysis of a large-scale protein structural interactome. BMC Bioinform., 4, 1471–2105.

    Dafas, P., et al. (2004) Using convex hulls to compute protein interactions from known structures. Bioinformatics, 20, 1–5[Abstract/Free Full Text].

    Giot, L., et al. (2003) A Protein Interaction Map of Drosophila melanogaster. Science, 302, 1727–1736[Abstract/Free Full Text].

    Huynen, M., et al. (2000) Exploitation of gene context. Curr. Opin. Struct. Biol., 10, 366–370[CrossRef][Web of Science][Medline].

    Murzin, A.G., et al. (1995) SCOP: a structural classification of proteins database for the investigation of sequences and structures. J. Mol. Biol., 247, 536–540[CrossRef][Web of Science][Medline].

    Park, J.H., et al. (2001) Mapping protein family interactions: intramolecular and intermolecular protein family interaction repertoires in the PDB and yeast. J. Mol. Biol., 307, 929–938[CrossRef][Web of Science][Medline].

    Salwinski, L., et al. (2000) The Database of Interacting Proteins: 2004 update. Nucleic Acids Res., 32, D449–D451.

    Teichmann, S.A., et al. (2000) Fast assignment of protein structures to sequences using the intermediate sequence library PDB-ISL. Bioinformatics, 16, 117–124[Abstract/Free Full Text].

    Uetz, P., et al. (2000) A comprehensive analysis of protein-protein interactions in Saccharomyces cerevisiae. Nature, 403, 623–627[CrossRef][Medline].

    Zanzoni, A., et al. (2002) MINT: a Molecular INTeraction database. FEBS Lett., 513, 135–140[CrossRef][Web of Science][Medline].


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Nucleic Acids ResHome page
P. Vanhee, J. Reumers, F. Stricher, L. Baeten, L. Serrano, J. Schymkowitz, and F. Rousseau
PepX: a structural database of non-redundant protein-peptide complexes
Nucleic Acids Res., October 30, 2009; (2009) gkp893v1.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. A. Encinar, G. Fernandez-Ballester, I. E. Sanchez, E. Hurtado-Gomez, F. Stricher, P. Beltrao, and L. Serrano
ADAN: a database for prediction of protein-protein interaction of modular domains mediated by linear motifs
Bioinformatics, September 15, 2009; 25(18): 2418 - 2424.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
N. Tuncbag, G. Kar, O. Keskin, A. Gursoy, and R. Nussinov
A survey of available tools and web servers for analysis of protein-protein interactions and interfaces
Brief Bioinform, May 1, 2009; 10(3): 217 - 232.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
D. Park, B.-C. Kim, S.-W. Cho, S.-J. Park, J.-S. Choi, S. I. Kim, J. Bhak, and S. Lee
MassNet: a functional annotation service for protein mass spectrometry data
Nucleic Acids Res., July 1, 2008; 36(suppl_2): W491 - W495.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
G. Dawelbait, C. Winter, Y. Zhang, C. Pilarsky, R. Grutzmann, J.-C. Heinrich, and M. Schroeder
Structural templates predict novel protein interactions and targets from pancreas tumour gene expression data
Bioinformatics, July 1, 2007; 23(13): i115 - i124.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
L. Hakes, S. C. Lovell, S. G. Oliver, and D. L. Robertson
Specificity in protein interactions and its relationship with sequence diversity and coevolution
PNAS, May 8, 2007; 104(19): 7999 - 8004.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
E. R. Jefferson, T. P. Walsh, T. J. Roberts, and G. J. Barton
SNAPPI-DB: a database and API of Structures, iNterfaces and Alignments for Protein-Protein Interactions
Nucleic Acids Res., January 12, 2007; 35(suppl_1): D580 - D589.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
Q. Xu, A. Canutescu, Z. Obradovic, and R. L. Dunbrack Jr
ProtBuD: a database of biological unit structures of protein families and superfamilies
Bioinformatics, December 1, 2006; 22(23): 2876 - 2882.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
C. Winter, A. Henschel, W. K. Kim, and M. Schroeder
SCOPPI: a structural classification of protein-protein interfaces
Nucleic Acids Res., January 1, 2006; 34(suppl_1): D310 - D314.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
D. Park, S. Lee, D. Bolser, M. Schroeder, M. Lappe, D. Oh, and J. Bhak
Comparative interactomics analysis of protein family interaction networks using PSIMAP (protein structural interactome map)
Bioinformatics, August 1, 2005; 21(15): 3234 - 3240.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow All Versions of this Article:
21/10/2541    most recent
bti366v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (23)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Gong, S.
Right arrow Articles by Bhak, J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Gong, S.
Right arrow Articles by Bhak, J.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?