Skip Navigation


Bioinformatics Advance Access originally published online on November 15, 2007
Bioinformatics 2008 24(2):282-284; doi:10.1093/bioinformatics/btm554
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow Supplementary Data
Right arrowOA All Versions of this Article:
24/2/282    most recent
btm554v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (12)
Google Scholar
Right arrow Articles by Assenov, Y.
Right arrow Articles by Albrecht, M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Assenov, Y.
Right arrow Articles by Albrecht, M.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© 2007 The Author(s)
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

Computing topological parameters of biological networks

Yassen Assenov , Fidel Ramírez , Sven-Eric Schelhorn , Thomas Lengauer and Mario Albrecht *

Department of Computational Biology and Applied Algorithmics, Max Planck Institute for Informatics, Stuhlsatzenhausweg 85, 66123 Saarbrücken, Germany

*To whom correspondence should be addressed.


    ABSTRACT
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 PROGRAM OVERVIEW
 3 CONCLUSIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 

Summary: Rapidly increasing amounts of molecular interaction data are being produced by various experimental techniques and computational prediction methods. In order to gain insight into the organization and structure of the resultant large complex networks formed by the interacting molecules, we have developed the versatile Cytoscape plugin NetworkAnalyzer. It computes and displays a comprehensive set of topological parameters, which includes the number of nodes, edges, and connected components, the network diameter, radius, density, centralization, heterogeneity, and clustering coefficient, the characteristic path length, and the distributions of node degrees, neighborhood connectivities, average clustering coefficients, and shortest path lengths. NetworkAnalyzer can be applied to both directed and undirected networks and also contains extra functionality to construct the intersection or union of two networks. It is an interactive and highly customizable application that requires no expert knowledge in graph theory from the user.

Availability: NetworkAnalyzer can be downloaded via the Cytoscape web site: http://www.cytoscape.org

Contact: mario.albrecht{at}mpi-inf.mpg.de

Supplementary information: Supplementary data are available at Bioinformatics online.


    1 INTRODUCTION
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 PROGRAM OVERVIEW
 3 CONCLUSIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
In recent years, high-throughput experiments have produced large networks of interacting molecules, which are represented as nodes linked by edges in complex graphs (Albrecht et al., 2005; Ramírez et al., 2007; Zhu et al., 2007). In this context, the characterization of biological networks by means of graph-topological properties has become very popular for gaining insight into the global network structure (Albert, 2005; Almaas, 2007; Barabasi and Oltvai, 2004; Dong and Horvath, 2007; Zhu et al., 2007). However, general software libraries for graph analysis such as JUNG (http://jung.sourceforge.net/), LEDA (http://algorithmic-solutions.com/enleda.htm), NetworkX (https://networkx.lanl.gov/) and yFiles (http://www.yworks.com/) are not easily applied by the biological user. Other applications like Pajek (Batagelj and Mrvar, 1998) require expert knowledge in graph theory on the user side. Specialized tools for the analysis of biological networks like CentiBiN (Junker et al., 2006), tYNA/TopNet (Yip et al., 2006; Yu et al., 2004) and VisANT (Hu et al., 2005) calculate only a limited set of topological parameters.

Therefore, we have developed NetworkAnalyzer, a user-friendly Java plugin for Cytoscape (Shannon et al., 2003), which is an established free open-source software platform for the visualization and analysis of molecular interaction networks (Shannon et al., 2003). An initial release of NetworkAnalyzer was made available in January 2006. In the following, we describe the basic functionality of NetworkAnalyzer and numerous extensions and improvements of the next major release.


    2 PROGRAM OVERVIEW
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 PROGRAM OVERVIEW
 3 CONCLUSIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
NetworkAnalyzer efficiently computes a large number of topological network parameters for directed and undirected networks loaded into Cytoscape. The user can decide whether directed edges should be treated as undirected for the analysis. The computed simple and complex topology parameters are represented as single values and distributions, respectively. Simple parameters are the number of nodes, edges, self-loops, and connected components, the average number of neighbors, the network diameter, radius, density, centralization, heterogeneity, and clustering coefficient, the number of shortest paths, and the characteristic path length. Complex parameters are distributions of node degrees, neighborhood connectivities, average clustering coefficients, topological coefficients, shortest path lengths, and shared neighbors of two nodes. NetworkAnalyzer utilizes the free Java libraries JFreeChart (http://jfree.org/jfreechart/) and Batik (http://xmlgraphics.apache.org/batik/) to display the distributions as histograms or scatter plots (Fig. 1) and to export them as chart images in the formats JPG/PNG/SVG or as tables in plain text files. Details on the formal definitions of all topological parameters are given in the online help page of the plugin. To ensure the validity of the calculations performed by NetworkAnalyzer, the computed parameters were compared with those obtained from Pajek, TopNet, and using the Python graph library NetworkX.


Figure 1
View larger version (37K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Fig. 1. Analysis of a human protein interaction network for neurodegenerative diseases with 3607 nodes and 7093 edges (Lim et al., 2006). The shortest path length distribution (left) indicates that the network possesses small-world property. The decreasing trend of the neighborhood connectivity (right) shows that the network is dominated by edges between low and highly connected nodes.

 
While the majority of the topological parameters included in NetworkAnalyzer is already well known and frequently used in the literature, our plugin additionally computes some novel network properties. In particular, we have extended the original definition of neighborhood connectivity (Maslov and Sneppen, 2002) to directed networks by introducing three types of related connectivity parameters, see Supplementary Data for more details. NetworkAnalyzer is also capable of enumerating the shared neighbors of all node pairs in a network. As an application of that, the Supplementary Data describes the use of the shared neighbors distribution to detect bias in the topology of predicted human networks of protein–protein interactions in comparison to experimentally derived networks.

Further unique features of NetworkAnalyzer comprise various visual settings of the obtained diagrams (Fig. 1). The user has the option of switching between histogram or scatter plot of the computed distributions and between linear or logarithmic scales for any of the two displayed diagram axes. Gridlines can be enabled or disabled, and a power law can be fitted to resultant distributions. Additionally, the title of the chart diagram, the labels of the axes, and the colors of the scatter points and gridlines can be configured.

Topology parameters computed for network nodes are stored as node attributes in the Cytoscape data structure. Thus, users can easily apply the visual mapping settings of Cytoscape to highlight any parameter on the screen (see online tutorial). For example, the clustering coefficient may be visualized proportional to the node size, and the node color may be related to its degree. Another useful application of NetworkAnalyzer is the selection of nodes based on any of the calculated attributes. This enables Cytoscape users to examine, for instance, structural perturbations in a network caused by the removal of nodes with high degrees.

In NetworkAnalyzer, the complete set of simple and complex parameters is referred to as network statistics. Once calculated and displayed, the network statistics can be saved into and reloaded from a text file in order to avoid recomputation. The comparison of multiple network topologies can easily be achieved by the parallel inspection of the computed statistics for different networks. Optional user settings can be stored and reloaded. Users can customize the appearance of the results by choosing between two alternative dialog interfaces, the compact one shown in Fig. 1 and an expandable interface. Aside from parameter computations, NetworkAnalyzer offers a useful set of network modifications and supports the construction of the intersection, union, and difference of two networks, the extraction of connected components as new separate networks, and the removal of self-loops.


    3 CONCLUSIONS
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 PROGRAM OVERVIEW
 3 CONCLUSIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
NetworkAnalyzer is a versatile and user-friendly tool for the analysis of biological and other networks. This plugin is well integrated into Cytoscape and computes a comprehensive list of simple and complex topology parameters using efficient graph algorithms. It incorporates useful visualization settings to display and export the resulting distributions and adds node attributes for the results.


    ACKNOWLEDGEMENTS
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 PROGRAM OVERVIEW
 3 CONCLUSIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
Part of this work has been financially supported by the German National Genome Research Network (NGFN) and the German Research Foundation (DFG), contract number KFO 129/1-1. The research has been conducted in the context of the BioSapiens Network of Excellence funded by the European Commission under grant number LSHG-CT-2003-503265.

Conflict of Interest: none declared.


    FOOTNOTES
 
Associate Editor: Martin Bishop

Received on June 16, 2007; revised on October 8, 2007; accepted on November 1, 2007

    REFERENCES
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 PROGRAM OVERVIEW
 3 CONCLUSIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 

    Albert R. Scale-free networks in cell biology. J. Cell Sci. (2005) 118:4947–4957.[Abstract/Free Full Text]

    Albrecht M, et al. Decomposing protein networks into domain-domain interactions. Bioinformatics (2005) 21(Suppl. 2):ii220–ii221.[Abstract]

    Almaas E. Biological impacts and context of network theory. J. Exp. Biol. (2007) 210:1548–1558.[Abstract/Free Full Text]

    Barabasi AL, Oltvai ZN. Network biology: understanding the cell's functional organization. Nat. Rev. Genet. (2004) 5:101–113.[CrossRef][Web of Science][Medline]

    Batagelj V, Mrvar A. Pajek – program for large network analysis. Connections (1998) 21:47–57.

    Dong J, Horvath S. Understanding network concepts in modules. BMC Syst. Biol. (2007) 1:24.[CrossRef][Medline]

    Hu Z, et al. VisANT: data-integrating visual framework for biological networks and modules. Nucleic Acids Res. (2005) 33:W352–W357.[Abstract/Free Full Text]

    Junker BH, et al. Exploration of biological network centralities with CentiBiN. BMC Bioinformatics (2006) 7:219.[CrossRef][Medline]

    Lim J, et al. A protein-protein interaction network for human inherited ataxias and disorders of purkinje cell degeneration. Cell (2006) 125:801–814.[CrossRef][Web of Science][Medline]

    Maslov S, Sneppen K. Specificity and stability in topology of protein networks. Science (2002) 296:910–913.[Abstract/Free Full Text]

    Ramírez F, et al. Computational analysis of human protein interaction networks. Proteomics (2007) 7:2541–2552.[CrossRef][Web of Science][Medline]

    Shannon P, et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. (2003) 13:2498–2504.[Abstract/Free Full Text]

    Yip KY, et al. The tYNA platform for comparative interactomics: a web tool for managing, comparing and mining multiple networks. Bioinformatics (2006) 22:2968–2970.[Abstract/Free Full Text]

    Yu H, et al. TopNet: a tool for comparing biological sub-networks, correlating protein properties with topological statistics. Nucleic Acids Res. (2004) 32:328–337.[Abstract/Free Full Text]

    Zhu X, et al. Getting connected: analysis and principles of biological networks. Genes Dev. (2007) 21:1010–1024.[Abstract/Free Full Text]


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
G. Scardoni, M. Petterlini, and C. Laudanna
Analyzing biological network parameters with CentiScaPe
Bioinformatics, November 1, 2009; 25(21): 2857 - 2859.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
J. Gu, Y. Wang, and T. Lilburn
A Comparative Genomics, Network-Based Approach to Understanding Virulence in Vibrio cholerae
J. Bacteriol., October 15, 2009; 191(20): 6262 - 6272.
[Abstract] [Full Text] [PDF]


Home page
GENES CELLSHome page
Y. Hayashi, T. Senda, N. Sano, and M. Horikoshi
Theoretical framework for the histone modification network: modifications in the unstructured histone tails form a robust scale-free network
Genes Cells, July 1, 2009; 14(7): 789 - 806.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
P. Minguez, S. Gotz, D. Montaner, F. Al-Shahrour, and J. Dopazo
SNOW, a web-based tool for the statistical analysis of protein-protein interaction networks
Nucleic Acids Res., July 1, 2009; 37(suppl_2): W109 - W114.
[Abstract] [Full Text] [PDF]


Home page
Clin. Cancer Res.Home page
A. H. Beck, I. Espinosa, B. Edris, R. Li, K. Montgomery, S. Zhu, S. Varma, R. J. Marinelli, M. van de Rijn, and R. B. West
The Macrophage Colony-Stimulating Factor 1 Response Signature in Breast Carcinoma
Clin. Cancer Res., February 1, 2009; 15(3): 778 - 787.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow Supplementary Data
Right arrowOA All Versions of this Article:
24/2/282    most recent
btm554v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (12)
Google Scholar
Right arrow Articles by Assenov, Y.
Right arrow Articles by Albrecht, M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Assenov, Y.
Right arrow Articles by Albrecht, M.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?