Skip Navigation



Bioinformatics Advance Access published online on July 28, 2006

Bioinformatics, doi:10.1093/bioinformatics/btl401
This Article
Right arrow Advance Access manuscript (PDF) Freely available
Right arrow All Versions of this Article:
22/19/2373    most recent
btl401v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Kong, S. W.
Right arrow Articles by Park, P. J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Kong, S. W.
Right arrow Articles by Park, P. J.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author (2006). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org
Received February 9, 2006
Revised June 27, 2006
Accepted July 18, 2006

Article

A multivariate approach for integrating genome-wide expression data and biological knowledge

Sek Won Kong 1, William T. Pu 2, and Peter J. Park 3 *

1 Department of Cardiology, Children's Hospital Boston, 300 Longwood Avenue, Boston, MA 02115, USA; Informatics Program, Children's Hospital Boston, 300 Longwood Avenue, Boston, MA 02115, USA
2 Department of Cardiology, Children's Hospital Boston, 300 Longwood Avenue, Boston, MA 02115, USA
3 Informatics Program, Children's Hospital Boston, 300 Longwood Avenue, Boston, MA 02115, USA; Harvard-Partners Center for Genetics and Genomics, 77 Avenue Louis Pasteur, Boston, MA 02115, USA

* To whom correspondence should be addressed.
Peter J. Park, E-mail: peter_park{at}harvard.edu


   Abstract

Motivation: Several statistical methods that combine analysis of differential gene expression with biological knowledge databases have been proposed for a more rapid interpretation of expression data. However, most such methods are based on a series of univariate statistical tests and do not properly account for the complex structure of gene interactions.

Results: We present a simple yet effective multivariate statistical procedure for assessing the correlation between a subspace defined by a group of genes and a binary phenotype. A subspace is deemed significant if the samples corresponding to different phenotypes are well separated in that subspace. The separation is measured using Hotelling's T2 statistic, which captures the covariance structure of the subspace. When the dimension of the subspace is larger than that of the sample space, we project the original data to a smaller orthonormal subspace. We use this method to search through functional pathway subspaces defined by Reactome, KEGG, BioCarta, and Gene Ontology. To demonstrate its performance, we apply this method to the data from two published studies, and we visualize the results in the principal component space.


Associate Editor: Alvis Brazma
Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
V. Zuber and K. Strimmer
Gene ranking and biomarker discovery under correlation
Bioinformatics, October 15, 2009; 25(20): 2700 - 2707.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
G. V. Glazko and F. Emmert-Streib
Unite and conquer: univariate and multivariate approaches for finding differentially expressed gene sets
Bioinformatics, September 15, 2009; 25(18): 2348 - 2354.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. Ma and M. R. Kosorok
Identification of differential gene pathways with principal component analysis
Bioinformatics, April 1, 2009; 25(7): 882 - 889.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
C.-A. Tsai and J. J. Chen
Multivariate analysis of variance test for gene set analysis
Bioinformatics, April 1, 2009; 25(7): 897 - 903.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
I. Dinu, J. D. Potter, T. Mueller, Q. Liu, A. J. Adewale, G. S. Jhangri, G. Einecke, K. S. Famulski, P. Halloran, and Y. Yasui
Gene-set analysis and reduction
Brief Bioinform, January 1, 2009; 10(1): 24 - 34.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
A. P. Oron, Z. Jiang, and R. Gentleman
Gene set enrichment analysis using linear models and diagnostics
Bioinformatics, November 15, 2008; 24(22): 2586 - 2591.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
D. Nam and S.-Y. Kim
Gene-set approach for expression pattern analysis
Brief Bioinform, May 1, 2008; 9(3): 189 - 197.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
D. Shriner, T. M. Baye, M. A. Padilla, S. Zhang, L. K. Vaughan, and A. E. Loraine
Commonality of functional annotation: a method for prioritization of candidate genes from genome-wide linkage studies
Nucleic Acids Res., March 27, 2008; 36(4): e26 - e26.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
M. Hummel, R. Meister, and U. Mansmann
GlobalANCOVA: exploration and assessment of gene group effects
Bioinformatics, January 1, 2008; 24(1): 78 - 85.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
X. Xu, Y. Zhao, and R. Simon
Gene Set Expression Comparison kit for BRB-ArrayTools
Bioinformatics, January 1, 2008; 24(1): 137 - 139.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
N. Goffard and G. Weiller
PathExpress: a web-based tool to identify relevant pathways in gene expression data
Nucleic Acids Res., July 13, 2007; 35(suppl_2): W176 - W181.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
D. M. Kemp, N. R. Nirmala, and J. D. Szustakowski
Extending the pathway analysis framework with a test for transcriptional variance implicates novel pathway modulation during myogenic differentiation
Bioinformatics, June 1, 2007; 23(11): 1356 - 1362.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.