Skip Navigation



Bioinformatics Advance Access published online on July 2, 2009

Bioinformatics, doi:10.1093/bioinformatics/btp406
This Article
Right arrow Advance Access manuscript (PDF)
Right arrow All Versions of this Article:
25/18/2348    most recent
btp406v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Glazko, G. V.
Right arrow Articles by Emmert-Streib, F.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Glazko, G. V.
Right arrow Articles by Emmert-Streib, F.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author (2009). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org

Unite and conquer: univariate and multivariate approaches for finding differentially expressed gene sets

Galina V. Glazko 1,* and Frank Emmert-Streib 2

1Department of Biostatistics and Computational Biology, University of Rochester Medical Center, Rochester, NY 14642
2 Center for Cancer Research and Cell Biology School of Medicine, Dentistry and Biomedical Science, Queen's University Belfast, 97 Lisburn Road, Belfast BT9 7BL, UK

*To whom correspondence should be addressed. Dr. Galina V. Glazko, E-mail: Galina_Glazko{at}urmc.rochester.edu


   Abstract

Motivation: Recently, many univariate and several multivariate approaches have been suggested for testing differential expression of gene sets between different phenotypes. However, despite a wealth of literature studying their performance on simulated and real biological data, still there is a need to quantify their relative performance when they are testing different null hypotheses.

Results: In this paper we compare the performance of univariate and multivariate tests on both simulated and biological data. In the simulation study we demonstrate that high correlations equally affect the power of both, univariate as well as multivariate tests. In addition, for most of them the power is similarly affected by the dimensionality of the gene set and by the percentage of genes in the set, for which expression is changing between two pheno-types. The application of different test statistics to biological data reveals that three statistics (sum of squared t-tests, Hotelling's T2, N-statistic), testing different null hypotheses, find some common but also some complementing differentially expressed gene sets under specific settings. This demonstrates that due to comple-menting null hypotheses each test projects on different aspects of the data and for the analysis of biological data it is beneficial to use all three tests simultaneously instead of focusing exclusively on just one.

Associate Editor: Dr. Joaquin Dopazo


Received on January 30, 2009; revised on April 2, 2009; accepted on June 29, 2009

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?




Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.