Skip Navigation


Bioinformatics Advance Access originally published online on September 20, 2005
Bioinformatics 2005 21(22):4148-4154; doi:10.1093/bioinformatics/bti681
This Article
Right arrow Full Text Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow All Versions of this Article:
21/22/4148    most recent
bti681v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (10)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Dabney, A. R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Dabney, A. R.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2005. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions{at}oxfordjournals.org

Classification of microarrays to nearest centroids

Alan R. Dabney

Department of Biostatistics, University of Washington Seattle 98195, USA

Motivation: Classification of biological samples by microarrays is a topic of much interest. A number of methods have been proposed and successfully applied to this problem. It has recently been shown that classification by nearest centroids provides an accurate predictor that may outperform much more complicated methods. The ‘Prediction Analysis of Microarrays’ (PAM) approach is one such example, which the authors strongly motivate by its simplicity and interpretability. In this spirit, I seek to assess the performance of classifiers simpler than even PAM.

Results: I surprisingly show that the modified t-statistics and shrunken centroids employed by PAM tend to increase misclassification error when compared with their simpler counterparts. Based on these observations, I propose a classification method called ‘Classification to Nearest Centroids’ (ClaNC). ClaNC ranks genes by standard t-statistics, does not shrink centroids and uses a class-specific gene-selection procedure. Because of these modifications, ClaNC is arguably simpler and easier to interpret than PAM, and it can be viewed as a traditional nearest centroid classifier that uses specially selected genes. I demonstrate that ClaNC error rates tend to be significantly less than those for PAM, for a given number of active genes.

Availability: Point-and-click software is freely available at http://students.washington.edu/adabney/clanc

Contact: adabney{at}u.washington.edu

Supplementary Information: http://students.washington.edu/adabney/clanc/supplement.pdf


Received on August 15, 2005; revised on September 15, 2005; accepted on September 17, 2005

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
F. Tai and W. Pan
Incorporating prior knowledge of predictors into penalized classifiers with multiple penalty terms
Bioinformatics, July 15, 2007; 23(14): 1775 - 1782.
[Abstract] [Full Text] [PDF]


Home page
Clin. Chem.Home page
M. Mullins, L. Perreard, J. F. Quackenbush, N. Gauthier, S. Bayer, M. Ellis, J. Parker, C. M. Perou, A. Szabo, and P. S. Bernard
Agreement in Breast Cancer Classification between Microarray and Quantitative Reverse Transcription PCR from Fresh-Frozen and Formalin-Fixed, Paraffin-Embedded Tissues
Clin. Chem., July 1, 2007; 53(7): 1273 - 1279.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
I. A. Wood, P. M. Visscher, and K. L. Mengersen
Classification based upon gene expression data: bias and precision of error rates
Bioinformatics, June 1, 2007; 23(11): 1363 - 1370.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. Wang and J. Zhu
Improved centroids estimation for the nearest shrunken centroid classifier
Bioinformatics, April 15, 2007; 23(8): 972 - 979.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
R. Shen, D. Ghosh, A. Chinnaiyan, and Z. Meng
Eigengene-based linear discriminant model for tumor classification using gene expression microarray data
Bioinformatics, November 1, 2006; 22(21): 2635 - 2642.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
A. R. Dabney
ClaNC: point-and-click software for classifying microarrays to nearest centroids
Bioinformatics, January 1, 2006; 22(1): 122 - 123.
[Abstract] [Full Text] [PDF]



Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.