Skip Navigation



Bioinformatics Advance Access published online on February 22, 2005

Bioinformatics, doi:10.1093/bioinformatics/bti339
This Article
Right arrow Advance Access manuscript (PDF) Freely available
Right arrow All Versions of this Article:
21/10/2488    most recent
bti339v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Kato, T.
Right arrow Articles by Asai, K.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Kato, T.
Right arrow Articles by Asai, K.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author (2005). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oupjournals.org
Received November 30, 2004
Revised February 15, 2005
Accepted February 17, 2005

Article

Selective integration of multiple biological data for supervised network inference

Tsuyoshi Kato 1*, Koji Tsuda 2, and Kiyoshi Asai 3

1 Computational Biology Research Center, National Institute of Advanced Industrial Science and Technology (AIST), 2-43 Aomi Koto-ku, Tokyo, Japan
2 Computational Biology Research Center, National Institute of Advanced Industrial Science and Technology (AIST), 2-43 Aomi Koto-ku, Tokyo, Japan; Max Planck Institute for Biological Cybernetics, Spemannstr. 38, 72076 Tübingen, Germany
3 Computational Biology Research Center, National Institute of Advanced Industrial Science and Technology (AIST), 2-43 Aomi Koto-ku, Tokyo, Japan; Graduate School of Frontier Sciences, University of Tokyo, 5-1-5, Kashiwanoha, Kashiwa, 277-8562, Japan

* To whom correspondence should be addressed.
Tsuyoshi Kato, E-mail: kato-tsuyoshi{at}aist.go.jp


   Abstract

Motivation: Inferring networks of proteins from biological data is a central issue of computational biology. Most network inference methods, including Bayesian networks, take unsupervised approaches in which the network is totally unknown in the beginning, and all the edges have to be predicted. Yamanishi et al. [2004] recently proposed a more realistic supervised framework that assumes that a substantial part of the network is known. We propose a new kernel-based method for supervised graph inference based on multiple types of biological datasets such as gene expression, phylogenetic profiles, and amino acid sequences. Notably, our method assigns a weight to each type of dataset and thereby selects informative ones. Data selection is useful to reduce data collection costs. For example, when a similar network inference problem must be solved for other organisms, the dataset excluded by our algorithm need not to be collected.

Results: First, we formulate supervised network inference as a kernel matrix completion problem, where the inference of edges boils down to estimation of missing entries of a kernel matrix. Then, an EM algorithm is proposed to simultaneously infer the missing entries of the kernel matrix and the weights of multiple datasets. By introducing the weights, we can integrate multiple datasets selectively and thereby exclude irrelevant and noisy datasets. Our approach is favorably tested in two biological networks: a metabolic network and a protein interaction network.

Availability: A supplementary report including mathematical details is available at www.cbrc.jp/~kato/faem/faem.html. Software is available on request.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
H. Kashima, Y. Yamanishi, T. Kato, M. Sugiyama, and K. Tsuda
Simultaneous inference of biological networks of multiple species from genome-wide data and evolutionary information: a semi-supervised approach
Bioinformatics, November 15, 2009; 25(22): 2962 - 2968.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. Anjum, A. Doucet, and C. C. Holmes
A boosting approach to structure learning of graphs with and without prior knowledge
Bioinformatics, November 15, 2009; 25(22): 2929 - 2936.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. Cui, P. Li, G. Li, F. Xu, C. Zhao, Y. Li, Z. Yang, G. Wang, Q. Yu, Y. Li, et al.
AtPID: Arabidopsis thaliana protein interactome database an integrative platform for plant systems biology
Nucleic Acids Res., January 11, 2008; 36(suppl_1): D999 - D1008.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
K. Bleakley, G. Biau, and J.-P. Vert
Supervised reconstruction of biological networks with local models
Bioinformatics, July 1, 2007; 23(13): i57 - i65.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.