Skip Navigation

This Article
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow FREE Full Text (Screen PDF)
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (18)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Bader, J. S.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Bader, J. S.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Bioinformatics Vol. 19 no. 15 2003
pages 1869-1874
© 2003 Oxford University Press

Greedily building protein networks with confidence

Joel S. Bader *

CuraGen Corporation, 555 Long Wharf Drive, New Haven, CT 06511, USA

Received on May 1, 2003 ; revised on July 15, 2003 ; accepted on July 15, 2003

Motivation: With genome sequences complete for human and model organisms, it is essential to understand how individual genes and proteins are organized into biological networks. Much of the organization is revealed by proteomics experiments that now generate torrents of data. Extracting relevant complexes and pathways from high-throughput proteomics data sets has posed a challenge, however, and new methods to identify and extract networks are essential. We focus on the problem of building pathways starting from known proteins of interest.

Results: We have developed an efficient, greedy algorithm, SEEDY, that extracts biologically relevant biological networks from protein–protein interaction data, building out from selected seed proteins. The algorithm relies on our previous study establishing statistical confidence levels for interactions generated by two-hybrid screens and inferred from mass spectrometric identification of protein complexes. We demonstrate the ability to extract known yeast complexes from high-throughput protein interaction data with a tunable parameter that governs the trade-off between sensitivity and selectivity. DNA damage repair pathways are presented as a detailed example. We highlight the ability to join heterogeneous data sets, in this case protein–protein interactions and genetic interactions, and the appearance of cross-talk between pathways caused by re-use of shared components.

Significance and comparison: The significance of the SEEDY algorithm is that it is fast, running time O[(E + V) log V] for V proteins and E interactions, a single adjustable parameter controls the size of the pathways that are generated, and an associated P-value indicates the statistical confidence that the pathways are enriched for proteins with a coherent function. Previous approaches have focused on extracting sub-networks by identifying motifs enriched in known biological networks. SEEDY provides the complementary ability to perform a directed search based on proteins of interest.

Availability: SEEDY software (Perl source), data tables and confidence score models (R source) are freely available from the author.

Contact: jbader{at}bme.jhu.edu

* Present Address: Department of Biomedical Engineering, Johns Hopkins University, 3400 N. Charles St, Baltimore, MD 21218, USA


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
V. Pihur, S. Datta, and S. Datta
Reconstruction of genetic association networks from microarray data: a partial least squares approach
Bioinformatics, February 15, 2008; 24(4): 561 - 568.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
C. L. Myers and O. G. Troyanskaya
Context-sensitive data integration and prediction of biological networks
Bioinformatics, September 1, 2007; 23(17): 2322 - 2330.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
C. Prieto and J. De Las Rivas
APID: Agile Protein Interaction DataAnalyzer.
Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W298 - W302.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
M. Krauthammer, C. A. Kaufmann, T. C. Gilliam, and A. Rzhetsky
Molecular triangulation: Bridging linkage and molecular-network information for identifying candidate genes in Alzheimer's disease
PNAS, October 19, 2004; 101(42): 15148 - 15153.
[Abstract] [Full Text] [PDF]


Home page
Cold Spring Harb Symp Quant BiolHome page
P. JORGENSEN, B.-J. BREITKREUTZ, K. BREITKREUTZ, C. STARK, G. LIU, M. COOK, J. SHAROM, J.L. NISHIKAWA, T. KETELA, D. BELLOWS, et al.
Harvesting the Genome's Bounty: Integrative Genomics
Cold Spring Harb Symp Quant Biol, January 1, 2003; 68(0): 431 - 444.
[Abstract] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.