Bioinformatics Advance Access published online on March 29, 2005
Bioinformatics, doi:10.1093/bioinformatics/bti407
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 Center for Bioinformatics, University of Pennsylvania, 1429 Blockley Hall, 423 Guardian Drive, Philadelphia, PA 19104-6021
* To whom correspondence should be addressed.
Searching for differentially expressed genes is one of the most common applications for microarrays, yet statistically there are difficult hurdles to achieving adequate rigor and practicality. False Discovery Rate (FDR) approaches have become relatively standard, however, how to define and control the FDR has been hotly debated. Permutation estimation approaches such as SAM (Tusher et al. (2001)) and PaGE (Manduchi et al. (1999)) can be effective, however leave much room for improvement. We pursue the permutation estimation method and describe a convenient definition for the FDR that can be estimated in a straightforward manner. We then discuss issues regarding the choice of statistic and data transformation. It is impossible to optimize the power of any statistic for thousands of genes simultaneously, and we look at the practical consequences of this. For example, the log transform can both help and hurt at the same time, depending on the gene. We examine issues surrounding the "fudge factor" parameter introduced by Tusher et al. (2001), and how to handle these issues by optimizing with respect to power. Java and Perl implementations are available at www.cbil.upenn.edu/PaGE.
Received December 15, 2004
Revised March 16, 2005
Accepted March 22, 2005
Article
A practical false discovery rate approach to identifying patterns of differential expression in microarray data
Gregory R. Grant, E-mail: ggrant{at}pcbi.upenn.edu
![]()
Abstract ![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
P. White, C. Lee May, R. N. Lamounier, J. E. Brestelli, and K. H. Kaestner Defining Pancreatic Endocrine Precursors and Their Descendants Diabetes, March 1, 2008; 57(3): 654 - 668. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Kidd, B. Nadler, S. Mane, G. Eick, M. Malfertheiner, M. Champaneria, R. Pfragner, and I. Modlin GeneChip, geNorm, and gastrointestinal tumors: novel reference genes for real-time PCR Physiol Genomics, August 20, 2007; 30(3): 363 - 370. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Sahoo, D. L. Dill, R. Tibshirani, and S. K. Plevritis Extracting binary signals from microarray time-course data Nucleic Acids Res., June 28, 2007; 35(11): 3705 - 3712. [Abstract] [Full Text] [PDF] |
||||
![]() |
S.-D. Zhang and T. W. Gant Effect of pooling samples on the efficiency of comparative studies using microarrays Bioinformatics, December 15, 2005; 21(24): 4378 - 4383. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Pawitan, K. R. K. Murthy, S. Michiels, and A. Ploner Bias in the estimation of false discovery rate in microarray studies Bioinformatics, October 15, 2005; 21(20): 3865 - 3872. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. A. Mazzella, M. V. Arana, R. J. Staneloni, S. Perelman, M. J. Rodriguez Batiller, J. Muschietti, P. D. Cerdan, K. Chen, R. A. Sanchez, T. Zhu, et al. Phytochrome Control of the Arabidopsis Transcriptome Anticipates Seedling Exposure to Light PLANT CELL, September 1, 2005; 17(9): 2507 - 2516. [Abstract] [Full Text] [PDF] |
||||




