Skip Navigation


Bioinformatics Advance Access originally published online on August 12, 2008
Bioinformatics 2008 24(20):2412-2413; doi:10.1093/bioinformatics/btn427
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow All Versions of this Article:
24/20/2412    most recent
btn427v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Zheng, S.
Right arrow Articles by Li, Y.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Zheng, S.
Right arrow Articles by Li, Y.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2008. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org

MPSQ: a web tool for protein-state searching

Siyuan Zheng 1,2, Jia Sheng 3, Chuan Wang 1, Xiaojing Wang 1, Yao Yu 1, Yun Li 1, Alex Michie 2, Jianliang Dai 2,4, Yang Zhong 4, Pei Hao 1,2,4,*, Lei Liu 1,2,* and Yixue Li 1,2,3,*

1Bioinformatics Center, Key Lab of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, 2SCBIT-Inforsense Joint Lab, Shanghai Center for Bioinformation Technology, 100 Qinzhou Road, Shanghai 200235, 3Life Science and Technology College of Shanghai Jiao Tong University, Shanghai and 4School of Life Sciences, Fudan University, Shanghai 200433, China

*To whom correspondence should be addressed.


    ABSTRACT
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 FUNCTION AND USAGE
 3 CONCLUSION AND PERSPECTIVE
 ACKNOWLEDGEMENTS
 REFERENCES
 

Summary: MPSQ (multi-protein-states query) is a web-based tool for the discovery of protein states (e.g. biological interactions, covalent modifications, cellular localizations). In particular, large sets of genes can be used to search for enriched state transition network maps (NMs) and features facilitating the interpretation of genomic-scale experiments such as microarrays. One NM collects all the catalogued states of a protein as well as the mutual transitions between the states. For the returned NM, graph visualization is provided for easy understanding and to guide further analysis.

Availability: MPSQ is freely available via the web at http://mpsq.biosino.org/.

Contact: phao{at}sibs.ac.cn; liulei{at}scbit.org; yxli{at}sibs.ac.cn


    1 INTRODUCTION
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 FUNCTION AND USAGE
 3 CONCLUSION AND PERSPECTIVE
 ACKNOWLEDGEMENTS
 REFERENCES
 
Currently, the state of a protein is commonly characterized in four ways: (1) small molecule binding information; (2) covalent modifications; (3) interaction with other proteins and (4) sub-cellular localization (Li et al., 2002). Normally, the state transition of a protein induces changes in protein function or properties mainly around the above four aspects. Typically, in certain biological processes, such as cellular signaling, a critical biological switch can be triggered by state transitions of a protein, thus initiating various communication mechanisms.

Compared to an individual state, the protein-state transition events, which can be linked to form a transition network map (NM) integrated with related information, will provide a detailed dynamic information flow about signal transductions among different states. Furthermore, such NM gives a holistic view of how functionally cooperative proteins are mediated by the state transitions.

Recently, a database of the alliance for cellular signaling (AfCS) (Gilman et al., 2002) called Molecule Pages (Li et al., 2002; Saunders et al., 2007) was established to catalogue state information of key proteins in cellular signaling, with the objective of providing building blocks for reconstructing signaling networks. To make full use of this invaluable knowledge resource, especially for the interpretation of the output from high-throughput experiments, e.g. microarrays, we developed a web interface named MPSQ (multi-protein-states query) to assist in the discovery of protein states as well as the enriched transition NMs for a given list of user-specified input genes. We also integrated protein–protein interaction (PPI) data and information from protein biochemical states to permit explicit gene-function linkage queries.


    2 FUNCTION AND USAGE
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 FUNCTION AND USAGE
 3 CONCLUSION AND PERSPECTIVE
 ACKNOWLEDGEMENTS
 REFERENCES
 
MPSQ is a publicly accessible web tool with a user-friendly interface, aimed at presenting protein-state information to the biological research community in the most intuitive way. Multiple types of query identifiers are allowed, e.g. Entrez gene ID (Maglott et al., 2007), and the returned information is a mapping between genes and states which are hyperlinked directly to the detail information.

While a state consists of interacting proteins, covalent modifications, bound small molecules, etc., a NM consists of all curated states of a key protein. The concept of ‘network map’ is inherited from the Molecule Pages (Li et al., 2002). An NM is a directed graph in which the vertex represents one particular state of the key protein and the edge represents the transition between states. In other words, one NM collects all the catalogued states of a key protein. Meanwhile, the transition relationships between the numerous states of the NM link the proteins which serve as constituents of the states together, generating a functional block of these proteins. Therefore, for query genes, checking if they participate in the same NM would be of assistance in discovering the functions and relations between them. For instance, if protein A and B are found in the NM of the key protein C, both of them have interactions with C in same or different states, and thus are functionally linked through C. Similarly if a group of genes are found in one NM, they have cooperative functional linkages through interactions with the key protein of the NM. MPSQ returns the NM search result as a table including four fields, NM ID, core protein ID, genes found in the NM and P-value from the hypergeometric distribution-based enrichment analysis. All the states of the NM are listed, and all query genes involved in a given NM are highlighted when a user clicks a NM ID link.

To help users better understand a NM, an interactive picture was generated for NM visualization. An example NM picture is illustrated in Figure 1: each node represents a state, with the green node representing the states in which any query genes were found. In contrast, other states of the NM are colored red. Directed arrows between nodes represent the state transitions.


Figure 1
View larger version (21K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Fig. 1. An example of transition NM visualization. Green nodes represent the states in which any query genes were found, and arrow lines represent state transitions.

 
While the NM presents states and their mutual transitions, explicit functional linkages, e.g. physical interactions between the query genes reflect their molecular-level relationships. MPSQ uses protein–protein interaction data derived from curated biochemical protein-state information to present such molecular-level linkages. Internally a vertex-induced subgraph strategy is adopted. In order to find a more indicative functional subgraph from the protein interaction network, MPSQ expands the vertex set by including both the query genes and their direct neighbors in the network. Thus, closely related genes will be selected and emerge as a compact subgraph that can be visualized with query genes highlighted as red for better understanding with the software ‘medusa’ (Hooper and Bork, 2005). As a complement and comparison, PPI data from other databases are also available for query. Currently, the PPI networks from the molecular interaction database (Chatr-aryamontri et al., 2007) and IntAct database (Kerrien et al., 2007) are integrated.

To facilitate users wishing to cross-reference existing databases, the Kyoto Encyclopedia of Genes and Genomes database (Kanehisa, 2002) pathway database and Gene Ontology (Ashburner et al., 2000) are stored locally and provided for query.

In addition to gene identifiers, users are able to specify a batch of state IDs to retrieve state information, with the search results being returned according to user's preferences—a feature allowing users to browse the data of most interest to them.


    3 CONCLUSION AND PERSPECTIVE
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 FUNCTION AND USAGE
 3 CONCLUSION AND PERSPECTIVE
 ACKNOWLEDGEMENTS
 REFERENCES
 
Proteins do not function in isolation, but rather as part of molecular networks (Saghatelian and Cravatt, 2005). Currently many databases collect the molecular interaction data, for instance, the STRING database (von Mering et al., 2003) contains protein functional associations for over 1 500 000 proteins. However, in the cellular context, proteins are also regulated by other mechanisms like small molecules, covalent modifications, etc. To give a more dynamic description of proteins in vivo, the pioneer LiveDIP database (Duan et al., 2002) collected states information of yeast proteins and aimed to describe protein interactions using the protein states and state transitions. The Molecule Pages database, first proposed a full definition of protein states and catalogued the state information for over 4000 key proteins which are believed to have important functions in cellular signaling (Li et al., 2002).

MPSQ is a web interface for Molecule Pages and designed to promote the utilization of biochemical protein-state information in the biological research community. Despite batch retrieval, MPSQ allows to search for enriched NMs which imply functional blocks. Compared to the most often used functional modules constructed from protein interaction networks (Segal et al., 2003), protein-state NM which includes all the states of the key protein is a dynamic outline of the protein in vivo. On the other hand, protein interaction data are integrated in MPSQ for protein functional linkage query.

MPSQ updates every month and has been run robustly for over 1 year. In the future more biological data will be integrated and analyzed in parallel with the states information. With the continuing accumulation of available state data and ongoing integration with other biological resources, MPSQ will provide more functions in systems biology research.


    ACKNOWLEDGEMENTS
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 FUNCTION AND USAGE
 3 CONCLUSION AND PERSPECTIVE
 ACKNOWLEDGEMENTS
 REFERENCES
 
We thank Haiwei Fan for technical help. We also thank the anonymous reviewers for their helpful suggestions.

Funding: National High-Tech R&D Program of China (863) (grant 2006AA02Z334, 2007AA02Z304, 2006AA020406, 2007DFA31040); Shanghai Committee of Science and Technology (grant 07dz22004, 07ZR14085); the National Basic Research Program of China (grant 2006CB910700, 2003CB715901); the Key Research Program of Chinese Academy of Sciences (grant KSCX2-YW-R-112).

Conflict of Interest: none declared.


    FOOTNOTES
 
Associate Editor: Jonathan Wren

Received on May 22, 2008; revised on July 13, 2008; accepted on August 12, 2008

    REFERENCES
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 FUNCTION AND USAGE
 3 CONCLUSION AND PERSPECTIVE
 ACKNOWLEDGEMENTS
 REFERENCES
 

    Ashburner M, et al. Gene ontology: tool for the unification of biology. The gene ontology consortium. Nat. Genet. (2000) 25:25–29.[CrossRef][Web of Science][Medline]

    Chatr-aryamontri A, et al. MINT: the Molecular INTeraction database. Nucleic Acids Res. (2007) 35:D572–D574.[Abstract/Free Full Text]

    Duan XJ, et al. Describing biological protein interactions in terms of protein states and state transitions: the LiveDIP database. Mol. Cell Proteomics (2002) 1:104–116.[Abstract/Free Full Text]

    Gilman AG, et al. Overview of the alliance for cellular signaling. Nature (2002) 420:703–706.[CrossRef][Web of Science][Medline]

    Hooper SD, Bork P. Medusa: a simple tool for interaction graph analysis. Bioinformatics (2005) 21:4432–4433.[Abstract/Free Full Text]

    Kanehisa M. The KEGG database. Novartis Found Symp (2002) 247:91–101. discussion 101–103, 119–128, 244–152.[Web of Science][Medline]

    Kerrien S, et al. IntAct – open source resource for molecular interaction data. Nucleic Acids Res (2007) 35:D561–D565.[Abstract/Free Full Text]

    Li J, et al. The Molecule Pages database. Nature (2002) 420:716–717.[CrossRef][Web of Science][Medline]

    Maglott D, et al. Entrez Gene: gene-centered information at NCBI. Nucleic Acids Res. (2007) 35:D26–D31.[Abstract/Free Full Text]

    Saghatelian A, Cravatt BF. Assignment of protein function in the postgenomic era. Nat. Chem. Biol. (2005) 1:130–142.[CrossRef][Web of Science][Medline]

    Saunders B, et al. The Molecule Pages database. Nucleic Acids Res (2007) 36:700–706.[CrossRef]

    Segal E, et al. Discovering molecular pathways from protein interaction and gene expression data. Bioinformatics (2003) 19:i264–i271. (Suppl. 1).[Abstract]

    von Mering C, et al. STRING: a database of predicted functional associations between proteins. Nucleic Acids Res. (2003) 31:258–261.[Abstract/Free Full Text]


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?



This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow All Versions of this Article:
24/20/2412    most recent
btn427v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Zheng, S.
Right arrow Articles by Li, Y.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Zheng, S.
Right arrow Articles by Li, Y.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?