Bioinformatics Advance Access originally published online on March 24, 2007
Bioinformatics 2007 23(8):972-979; doi:10.1093/bioinformatics/btm046
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Improved centroids estimation for the nearest shrunken centroid classifier
1Department of Biostatistics and 2Department of Statistics, University of Michigan, Ann Arbor, MI, 48109, USA
*To whom correspondence should be addressed.
| Abstract |
|---|
Motivation: The nearest shrunken centroid (NSC) method has been successfully applied in many DNA-microarray classification problems. The NSC uses shrunken centroids as prototypes for each class and identifies subsets of genes that best characterize each class. Classification is then made to the nearest (shrunken) centroid. The NSC is very easy to implement and very easy to interpret, however, it has drawbacks.
Results: We show that the NSC method can be interpreted in the framework of LASSO regression. Based on that, we consider two new methods, adaptive L
-norm penalized NSC (ALP-NSC) and adaptive hierarchically penalized NSC (AHP-NSC), with two different penalty functions for microarray classification, which improve over the NSC. Unlike the L1-norm penalty used in LASSO, the penalty terms that we consider make use of the fact that parameters belonging to one gene should be treated as a natural group. Numerical results indicate that the two new methods tend to remove irrelevant genes more effectively and provide better classification results than the L1-norm approach.
Availability: R code for the ALP-NSC and the AHP-NSC algorithms are available from authors upon request.
Contact: jizhu{at}umich.edu
Supplementary information: Supplementary data are available at Bioinformatics online.
Associate Editor: John Quackenbush
Received on November 17, 2006; revised on January 19, 2007; accepted on February 4, 2007
This article has been cited by other articles:
![]() |
F. Tai and W. Pan Incorporating prior knowledge of gene functional groups into regularized discriminant analysis of microarray data Bioinformatics, December 1, 2007; 23(23): 3170 - 3177. [Abstract] [Full Text] [PDF] |
||||
