Skip Navigation



Bioinformatics Advance Access published online on November 25, 2004

Bioinformatics, doi:10.1093/bioinformatics/bti162
Bioinformatics © Oxford University Press 2004; all rights reserved
This Article
Right arrow Advance Access manuscript (PDF) Freely available
Right arrow All Versions of this Article:
21/8/1502    most recent
bti162v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Tsai, C.-A.
Right arrow Articles by Chen, J. J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Tsai, C.-A.
Right arrow Articles by Chen, J. J.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Received October 28, 2004
Revised November 16, 2004
Accepted November 17, 2004

Article

Sample size for gene expression microarray experiments*

Chen-An Tsai 1, Sue-Jane Wang 2, Dung-Tsa Chen 3, and James J. Chen 1*

1 Division of Biometry and Risk Assessment, National Center for Toxicological Research, Food and Drug Administration, Jefferson, Arkansas 72079
2 Division of Biometrics II, Office of Biostatistics, Center for Drug Evaluation and Research, Food and Drug Administration, Rockville, Maryland 20857
3 Biostatistics and Bioinformatics Unit, University of Alabama at Birmingham, 153 Wallace Tumor Institute, Birmingham, Alabama 35294

* To whom correspondence should be addressed.
James J. Chen, E-mail: jchen{at}nctr.fda.gov


   Abstract

Motivation: Microarray experiments often involve hundreds or thousands of genes. In a typical experiment, only a fraction of genes is expected to be differentially expressed; in addition, the measured intensities among different genes may be correlated. Depending on the experimental objectives, sample size calculations can be based on one of the three specified measures: sensitivity, true discovery, and accuracy rates. Sample size problem is formulated as: the number of arrays needed in order to achieve the desired fraction of the specified measure at the desired family-wise power at the given type I error and (standardized) effect size.

Results: We present a general approach for estimating sample size under independent and equally-correlated models using binomial and beta-binomial models, respectively. The sample sizes needed for a two-sample z-test are computed; the computed theoretical numbers agree well with the Monte Carlo simulation results. But, under more general correlation structures, the beta-binomial model can underestimate the needed samples by about 1-5 arrays.


* The views presented in this paper are those of authors and not necessarily representing those of the U.S. Food and Drug Administration


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
BiostatisticsHome page
T. Oura, S. Matsui, and K. Kawakami
Sample size calculations for controlling the distribution of false discovery proportion in microarray experiments
Biostat., October 1, 2009; 10(4): 694 - 705.
[Abstract] [Full Text] [PDF]


Home page
BiostatisticsHome page
P. de Valpine, H.-M. Bitter, M. P. S. Brown, and J. Heller
A simulation-approximation approach to sample size planning for high-dimensional classification studies
Biostat., July 1, 2009; 10(3): 424 - 435.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
K.-S. Lynn, L.-L. Li, Y.-J. Lin, C.-H. Wang, S.-H. Sheng, J.-H. Lin, W. Liao, W.-L. Hsu, and W.-H. Pan
A neural network model for constructing endophenotypes of common complex diseases: an application to male young-onset hypertension microarray data
Bioinformatics, April 15, 2009; 25(8): 981 - 988.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. Seo, H. Gordish-Dressman, and E. P. Hoffman
An interactive power analysis tool for microarray hypothesis testing and generation
Bioinformatics, April 1, 2006; 22(7): 808 - 814.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
S. B. Pounds
Estimation and control of multiple testing error rates for microarray studies
Brief Bioinform, March 1, 2006; 7(1): 25 - 36.



Home page
BioinformaticsHome page
S. Pounds and C. Cheng
Sample size determination for the false discovery rate
Bioinformatics, December 1, 2005; 21(23): 4263 - 4271.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.