Bioinformatics Advance Access originally published online on January 22, 2004
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Bioinformatics 20(5) © Oxford University Press 2004; all rights reserved.
Optimization models for cancer classification: extracting gene interaction information from microarray expression data
1 GSF National Research Center for Environment and Health, Institute for Bioinformatics, Ingolstädter Landstraße 1, D-85764 Neuherberg, Germany and 2 Department of Genome-Oriented Bioinformatics, Wissenschaftszentrum Weihenstephan, Technische Universität München, 85350 Freising, Germany
Received on August 7, 2003
; accepted on October 6, 2003
Advance Access Publication January 22, 2004
Motivation: Microarray data appear particularly useful to investigate mechanisms in cancer biology and represent one of the most powerful tools to uncover the genetic mechanisms causing loss of cell cycle control. Recently, several different methods to employ microarray data as a diagnostic tool in cancer classification have been proposed. These procedures take changes in the expression of particular genes into account but do not consider disruptions in certain gene interactions caused by the tumor. It is probable that some genes participating in tumor development do not change their expression level dramatically. Thus, they cannot be detected by simple classification approaches used previously. For these reasons, a classification procedure exploiting information related to changes in gene interactions is needed.
Results: We propose a MAximal MArgin Linear Programming (MAMA) method for the classification of tumor samples based on microarray data. This procedure detects groups of genes and constructs models (features) that strongly correlate with particular tumor types. The detected features include genes whose functional relations are changed for particular cancer types. The proposed method was tested on two publicly available datasets and demonstrated a prediction ability superior to previously employed classification schemes.
Availability: The MAMA system was developed using the linear programming system LINDO http://www.lindo.com. A Perl script that specifies the optimization problem for this software is available upon request from the authors.
Contact: antonov{at}gsf.de
* To whom correspondence should be addressed.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
C. Zhang, C. Wu, E. Blanzieri, Y. Zhou, Y. Wang, W. Du, and Y. Liang Methods for labeling error detection in microarrays based on the effect of data perturbation on the regression model Bioinformatics, October 15, 2009; 25(20): 2708 - 2714. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Baek, C.-A. Tsai, and J. J. Chen Development of biomarker classifiers from high-dimensional data Brief Bioinform, September 1, 2009; 10(5): 537 - 546. [Abstract] [Full Text] [PDF] |
||||
![]() |
J.-Y. Koo, I. Sohn, S. Kim, and J. W. Lee Structured polychotomous machine diagnosis of multiple cancer types using gene expression Bioinformatics, April 15, 2006; 22(8): 950 - 958. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Zhang, R. Rekaya, and K. Bertrand A method for predicting disease subtypes in presence of misclassification among training samples using gene expression: application to human breast cancer Bioinformatics, February 1, 2006; 22(3): 317 - 325. [Abstract] [Full Text] [PDF] |
||||

