Skip Navigation


Bioinformatics Advance Access originally published online on August 16, 2005
Bioinformatics 2005 21(20):3896-3904; doi:10.1093/bioinformatics/bti631
This Article
Right arrow Full Text Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow All Versions of this Article:
21/20/3896    most recent
bti631v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (32)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Tan, A. C.
Right arrow Articles by Geman, D.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Tan, A. C.
Right arrow Articles by Geman, D.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2005. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions{at}oxfordjournals.org

Simple decision rules for classifying human cancers from gene expression profiles

Aik Choon Tan 1,*, Daniel Q. Naiman 1,2, Lei Xu 1, Raimond L. Winslow 1 and Donald Geman 1,2

1Center for Cardiovascular Bioinformatics and Modeling, Whitaker Biomedical Engineering Institute 3400 N. Charles Street, Baltimore, MD 21218, USA
2Department of Applied Mathematics and Statistics, Johns Hopkins University 3400 N. Charles Street, Baltimore, MD 21218, USA

*To whom correspondence should be addressed.

Motivation: Various studies have shown that cancer tissue samples can be successfully detected and classified by their gene expression patterns using machine learning approaches. One of the challenges in applying these techniques for classifying gene expression data is to extract accurate, readily interpretable rules providing biological insight as to how classification is performed. Current methods generate classifiers that are accurate but difficult to interpret. This is the trade-off between credibility and comprehensibility of the classifiers. Here, we introduce a new classifier in order to address these problems. It is referred to as k-TSP (k–Top Scoring Pairs) and is based on the concept of ‘relative expression reversals’. This method generates simple and accurate decision rules that only involve a small number of gene-to-gene expression comparisons, thereby facilitating follow-up studies.

Results: In this study, we have compared our approach to other machine learning techniques for class prediction in 19 binary and multi-class gene expression datasets involving human cancers. The k-TSP classifier performs as efficiently as Prediction Analysis of Microarray and support vector machine, and outperforms other learning methods (decision trees, k-nearest neighbour and naïve Bayes). Our approach is easy to interpret as the classifier involves only a small number of informative genes. For these reasons, we consider the k-TSP method to be a useful tool for cancer classification from microarray gene expression data.

Availability: The software and datasets are available at http://www.ccbm.jhu.edu

Contact: actan{at}jhu.edu


Received on May 9, 2005; revised on July 28, 2005; accepted on August 14, 2005

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Mol. Cell. ProteomicsHome page
C.-S. Chen, S. Sullivan, T. Anderson, A. C. Tan, P. J. Alex, S. R. Brant, C. Cuffari, T. M. Bayless, M. V. Talor, C. L. Burek, et al.
Identification of Novel Serological Biomarkers for Inflammatory Bowel Disease Using Escherichia coli Proteome Chip
Mol. Cell. Proteomics, August 1, 2009; 8(8): 1765 - 1776.
[Abstract] [Full Text] [PDF]


Home page
Clin. Cancer Res.Home page
N.V. Rajeshkumar, A. C. Tan, E. De Oliveira, C. Womack, H. Wombwell, S. Morgan, M. V. Warren, J. Walker, T. P. Green, A. Jimeno, et al.
Antitumor Effects and Biomarkers of Activity of AZD0530, a Src Inhibitor, in Pancreatic Cancer
Clin. Cancer Res., June 15, 2009; 15(12): 4138 - 4146.
[Abstract] [Full Text] [PDF]


Home page
Molecular Cancer TherapeuticsHome page
W. A. Messersmith, N.V. Rajeshkumar, A. C. Tan, X. F. Wang, V. Diesl, S. E. Choe, M. Follettie, C. Coughlin, F. Boschelli, E. Garcia-Garcia, et al.
Efficacy and pharmacodynamic effects of bosutinib (SKI-606), a Src/Abl inhibitor, in freshly generated human pancreas cancer xenografts
Mol. Cancer Ther., June 1, 2009; 8(6): 1484 - 1493.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. T. Leek
The tspair package for finding top scoring pair classifiers in R
Bioinformatics, May 1, 2009; 25(9): 1203 - 1204.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
R. R. Weichselbaum, H. Ishwaran, T. Yoon, D. S. A. Nuyten, S. W. Baker, N. Khodarev, A. W. Su, A. Y. Shaikh, P. Roach, B. Kreike, et al.
An interferon-related gene signature for DNA damage resistance is a predictive marker for chemotherapy and radiation for breast cancer
PNAS, November 25, 2008; 105(47): 18490 - 18495.
[Abstract] [Full Text] [PDF]


Home page
Physiol. GenomicsHome page
R. Kuner, A. S. Barth, M. Ruschhaupt, A. Buness, L. Zwermann, E. Kreuzer, G. Steinbeck, A. Poustka, H. Sultmann, and M. Nabauer
Genomic analysis reveals poor separation of human cardiomyopathies of ischemic and nonischemic etiologies
Physiol Genomics, June 1, 2008; 34(1): 88 - 94.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
N. D. Price, J. Trent, A. K. El-Naggar, D. Cogdell, E. Taylor, K. K. Hunt, R. E. Pollock, L. Hood, I. Shmulevich, and W. Zhang
Highly accurate two-gene classifier for differentiating gastrointestinal stromal tumors and leiomyosarcomas
PNAS, February 27, 2007; 104(9): 3414 - 3419.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
G. C. Cawley and N. L. C. Talbot
Gene selection in cancer classification using sparse logistic regression with Bayesian regularization
Bioinformatics, October 1, 2006; 22(19): 2348 - 2355.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.