Bioinformatics Advance Access published online on October 30, 2006
Bioinformatics, doi:10.1093/bioinformatics/btl548
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 The Jackson Laboratory, Bar Harbor, Maine 04609, USA
* To whom correspondence should be addressed.
Motivation: Microarray data typically have small numbers of observations per gene which can result in low power for statistical tests. Test statistics that borrow information from data across all of the genes can improve power, but these statistics have non-standard distributions, and their significance must be assessed using permutation analysis. When sample sizes are small, the number of distinct permutations can be severely limited, and pooling the permutation-derived test statistics across all genes has been proposed. However, the null distribution of the test statistics under permutation is not the same for equally and differentially expressed genes. This can have a negative impact on both p-value estimation and the power of information borrowing statistics. Results: We investigate permutation based methods for estimating p-values. One of methods that uses pooling from a selected subset of the data is shown to have the correct type I error rate and to provide accurate estimates of the false discovery rate (FDR). We provide guidelines to select an appropriate subset. We also demonstrate that information borrowing statistics have substantially increased power compared to the t-test in small experiments.
Received May 1, 2006
Revised October 12, 2006
Accepted October 22, 2006
Article
Estimating p-values in small microarray experiments
Hyuna Yang 1 and Gary Churchill 1 *
Gary Churchill, E-mail: garyc{at}jax.org
![]()
Abstract
Associate Editor: John Quackenbush
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
K. R. Shockley, D. Witmer, S. L. Burgess-Herbert, B. Paigen, and G. A. Churchill Effects of atherogenic diet on hepatic gene expression across mouse strains Physiol Genomics, November 1, 2009; 39(3): 172 - 182. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. F. Kerr Comments on the analysis of unbalanced microarray data Bioinformatics, August 15, 2009; 25(16): 2035 - 2041. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Zhu, G.-J. Li, L. Ding, X. Cui, H. Berg, S. M. Assmann, and Y. Xia Arabidopsis Extra Large G-Protein 2 (XLG2) Interacts with the G{beta} Subunit of Heterotrimeric G Protein and Functions in Disease Resistance Mol Plant, May 1, 2009; 2(3): 513 - 525. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. L. Ackert-Bicknell, K. R. Shockley, L. G. Horton, B. Lecka-Czernik, G. A. Churchill, and C. J. Rosen Strain-Specific Effects of Rosiglitazone on Bone Mass, Body Composition, and Serum Insulin-Like Growth Factor-I Endocrinology, March 1, 2009; 150(3): 1330 - 1340. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Merbl and M. W. Kirschner Large-scale detection of ubiquitination substrates using cell extracts and protein microarrays PNAS, February 24, 2009; 106(8): 2543 - 2548. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Qin, T. Feng, S. A. Harding, C.-J. Tsai, and S. Zhang An efficient method to identify differentially expressed genes in microarray experiments Bioinformatics, July 15, 2008; 24(14): 1583 - 1589. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. M. Stylianou, J. P. Affourtit, K. R. Shockley, R. Y. Wilpan, F. A. Abdi, S. Bhardwaj, J. Rollins, G. A Churchill, and B. Paigen Applying Gene Expression, Proteomics and Single-Nucleotide Polymorphism Analysis for Complex Trait Gene Identification Genetics, March 1, 2008; 178(3): 1795 - 1805. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Zhou, C. Cras-Meneur, M. Ohsugi, G. D. Stormo, and M. Alan. Permutt A global approach to identify differentially expressed genes in cDNA (two-color) microarray experiments Bioinformatics, August 15, 2007; 23(16): 2073 - 2079. [Abstract] [Full Text] [PDF] |
||||





