Skip Navigation

Bioinformatics 2005 21(Suppl 1):i423-i430; doi:10.1093/bioinformatics/bti1020
This Article
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Soukup, M.
Right arrow Articles by Lee, J. K.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Soukup, M.
Right arrow Articles by Lee, J. K.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2005. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions{at}oupjournals.org

Robust classification modeling on microarray data using misclassification penalized posterior

Mat Soukup 1, HyungJun Cho 2 and Jae K. Lee 2,*

1Division of Biometrics III, Food and Drug Administration 9201 Corporate Blvd, Rm. N-250, Rockville, MD 20850, USA
2Division of Biostatistics and Epidemiology, University of Virginia PO Box 800717, Charlottesville, VA 22908, USA

*To whom correspondence should be addressed.

Motivation: Genome-wide microarray data are often used in challenging classification problems of clinically relevant subtypes of human diseases. However, the identification of a parsimonious robust prediction model that performs consistently well on future independent data has not been successful due to the biased model selection from an extremely large number of candidate models during the classification model search and construction. Furthermore, common criteria of prediction model performance, such as classification error rates, do not provide a sensitive measure for evaluating performance of such astronomic competing models. Also, even though several different classification approaches have been utilized to tackle such classification problems, no direct comparison on these methods have been made.

Results: We introduce a novel measure for assessing the performance of a prediction model, the misclassification-penalized posterior (MiPP), the sum of the posterior classification probabilities penalized by the number of incorrectly classified samples. Using MiPP, we implement a forward step-wise cross-validated procedure to find our optimal prediction models with different numbers of features on a training set. Our final robust classification model and its dimension are determined based on a completely independent test dataset. This MiPP-based classification modeling approach enables us to identify the most parsimonious robust prediction models only with two or three features on well-known microarray datasets. These models show superior performance to other models in the literature that often have more than 40–100 features in their model construction.

Availability: Our MiPP software program is available at the Bioconductor website (http://www.bioconductor.org).

Contact: jaeklee{at}virginia.edu


Received on January 15, 2005; accepted on March 27, 2005

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Proc. Natl. Acad. Sci. USAHome page
J. K. Lee, D. M. Havaleshko, H. Cho, J. N. Weinstein, E. P. Kaldjian, J. Karpovich, A. Grimshaw, and D. Theodorescu
A strategy for predicting the chemosensitivity of human cancers and its application to drug discovery
PNAS, August 7, 2007; 104(32): 13086 - 13091.
[Abstract] [Full Text] [PDF]


Home page
Molecular Cancer TherapeuticsHome page
D. M. Havaleshko, H. Cho, M. Conaway, C. R. Owens, G. Hampton, J. K. Lee, and D. Theodorescu
Prediction of drug combination chemosensitivity in human bladder cancer
Mol. Cancer Ther., February 1, 2007; 6(2): 578 - 586.
[Abstract] [Full Text] [PDF]



Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.