Bioinformatics Advance Access originally published online on May 14, 2004
Bioinformatics 2004 20(16):2694-2701; doi:10.1093/bioinformatics/bth310
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Bioinformatics vol. 20 issue 16 © Oxford University Press 2004; all rights reserved.
A mixture model for estimating the local false discovery rate in DNA microarray analysis
1 Biometrics Division, School of Public Health and The Cancer Institute of New Jersey, University of Medicine and Dentistry of New Jersey, 683 Hoes Lane West, Piscataway, NJ 08854, USA and 2 Department of Pediatrics, UMDNJRobert Wood Johnson Medical School, One Robert Wood Johnson Place, MEB 3rd Floor, P.O. Box 19, New Brunswick, NJ 08903-0019, USA
Received on January 26, 2004; accepted on April 22, 2004
Advance Access Publication May 14, 2004
Motivation: Statistical methods based on controlling the false discovery rate (FDR) or positive false discovery rate (pFDR) are now well established in identifying differentially expressed genes in DNA microarray. Several authors have recently raised the important issue that FDR or pFDR may give misleading inference when specific genes are of interest because they average the genes under consideration with genes that show stronger evidence for differential expression. The paper proposes a flexible and robust mixture model for estimating the local FDR which quantifies how plausible each specific gene expresses differentially.
Results: We develop a special mixture model tailored to multiple testing by requiring the P-value distribution for the differentially expressed genes to be stochastically smaller than the P-value distribution for the non-differentially expressed genes. A smoothing mechanism is built in. The proposed model gives robust estimation of local FDR for any reasonable underlying P-value distributions. It also provides a single framework for estimating the proportion of differentially expressed genes, pFDR, negative predictive values, sensitivity and specificity. A cervical cancer study shows that the local FDR gives more specific and relevant quantification of the evidence for differential expression that can be substantially different from pFDR.
Availability: An R function implementing the proposed model is available at http://www.geocities.com/jg_liao/software
Contact: liaojg{at}umdnj.edu
* To whom correspondence should be addressed.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
W.-J. Hong, R. Tibshirani, and G. Chu Local false discovery rate facilitates comparison of different microarray experiments Nucleic Acids Res., December 1, 2009; 37(22): 7483 - 7497. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Bukszar, J. L. McClay, and E. J. C. G. van den Oord Estimating the posterior probability that genome-wide association findings are true or false Bioinformatics, July 15, 2009; 25(14): 1807 - 1813. [Abstract] [Full Text] [PDF] |
||||
![]() |
J.G. Liao and K.-V. Chin Logistic regression for disease classification using microarray data: model selection in a large p and small n case Bioinformatics, August 1, 2007; 23(15): 1945 - 1951. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. B. Pounds Estimation and control of multiple testing error rates for microarray studies Brief Bioinform, March 1, 2006; 7(1): 25 - 36. |
||||
![]() |
S. Pounds and C. Cheng Sample size determination for the false discovery rate Bioinformatics, December 1, 2005; 21(23): 4263 - 4271. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Hu, F. Zou, and F. A. Wright Practical FDR-based sample size calculations in microarray experiments Bioinformatics, August 1, 2005; 21(15): 3264 - 3272. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Scheid and R. Spang twilight; a Bioconductor package for estimating the local false discovery rate Bioinformatics, June 15, 2005; 21(12): 2921 - 2922. [Abstract] [Full Text] [PDF] |
||||


