Skip Navigation



Bioinformatics Advance Access published online on February 1, 2006

Bioinformatics, doi:10.1093/bioinformatics/btl029
This Article
Right arrow Advance Access manuscript (PDF) Freely available
Right arrowOA All Versions of this Article:
22/8/950    most recent
btl029v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Google Scholar
Right arrow Articles by Koo, J.-Y.
Right arrow Articles by Lee, J. W.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Koo, J.-Y.
Right arrow Articles by Lee, J. W.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author (2006). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org
Received December 21, 2005
Revised January 26, 2006
Accepted January 26, 2006

Article

Structured polychotomous machine diagnosis of multiple cancer types using gene expression

Ja-Yong Koo 1 *, Insuk Sohn 1, Sujong Kim 2, and Jae Won Lee 1

1 Department of Statistics, Korea University, Seoul 136-701, Korea
2 Department of Biochemistry, College of Medicine, Hanyang University, Seoul 133-791, Korea; Current address: Skin Research Institute, AmorePacific R&D Center, Yongin 449-729, Korea

* To whom correspondence should be addressed.
Ja-Yong Koo, E-mail: jykoo{at}korea.ac.kr


   Abstract

Motivation: The problem of class prediction has received a tremendous amount of attention in the literature recently. In the context of DNA microarrays, where the task is to classify and predict the diagnostic category of a sample on the basis of its gene expression profile, a problem of particular importance is the diagnosis of cancer type based on microarray data. One method of classification which has been very successful in cancer diagnosis is the support vector machine. The latter has been shown (through simulations) to be superior in comparison to other methods, such as classical discriminant analysis, however, support vector machine suffers from the drawback that the solution is implicit and therefore is difficult to interpret. In order to remedy this difficulty, an analysis of variance decomposition using structured kernels is proposed and is referred to as the structured polychotomous machine. This technique utilizes Newton-Raphson to find estimates of coefficients followed by the Rao and Wald tests, respectively, for addition and deletion of import vectors.

Results: The proposed method is applied to microarray data and simulation data. The major breakthrough of our method is efficiency in that only a minimal number of genes that accurately predict the classes are selected. It has been verified that the selected genes serve as legitimate markers for cancer classification from a biological point of view.

Availability: All source codes used are available on request from the authors.


Associate Editor: Martin Bishop
Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?




Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.