Skip Navigation

Bioinformatics 2009 25(5):643-649; doi:10.1093/bioinformatics/btn662
This Article
Right arrow Full Text Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow Supplementary Data
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Google Scholar
Right arrow Articles by Alexandrov, T.
Right arrow Articles by Thiele, H.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Alexandrov, T.
Right arrow Articles by Thiele, H.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© 2009 The Author(s)
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

Biomarker discovery in MALDI-TOF serum protein profiles using discrete wavelet transformation

Theodore Alexandrov 1,*, Jens Decker 2, Bart Mertens 3, Andre M. Deelder 4, Rob A. E. M. Tollenaar 5, Peter Maass 1 and Herbert Thiele 2

1Center for Industrial Mathematics, University of Bremen, D-28334 Bremen, 2Bruker Daltonik GmbH, D-28359 Bremen, Germany, 3Department of Medical Statistics and Bioinformatics, Leiden University Medical Center (LUMC), 2300 RC Leiden, The Netherlands, 4Department of Parasitology and 5Department of Surgery, Leiden University Medical Center, 2300 RC Leiden, The Netherlands

*To whom correspondence should be addressed.


   Abstract

Motivation: Automatic classification of high-resolution mass spectrometry proteomic data has increasing potential in the early diagnosis of cancer. We propose a new procedure of biomarker discovery in serum protein profiles based on: (i) discrete wavelet transformation of the spectra; (ii) selection of discriminative wavelet coefficients by a statistical test and (iii) building and evaluating a support vector machine classifier by double cross-validation with attention to the generalizability of the results. In addition to the evaluation results (total recognition rate, sensitivity and specificity), the procedure provides the biomarker patterns, i.e. the parts of spectra which discriminate cancer and control individuals. The evaluation was performed on matrix-assisted laser desorption ionization time-of-flight (MALDI-TOF) serum protein profiles of 66 colorectal cancer patients and 50 controls.

Results: Our procedure provided a high recognition rate (97.3%), sensitivity (98.4%) and specificity (95.8%). The extracted biomarker patterns mostly represent the peaks expressing mean differences between the cancer and control spectra. However, we showed that the discriminative power of a peak is not simply expressed by its mean height and cannot be derived by comparison of the mean spectra. The obtained classifiers have high generalization power as measured by the number of support vectors. This prevents overfitting and contributes to the reproducibility of the results, which is required to find biomarkers differentiating cancer patients from healthy individuals.

Availability: The data and scripts used in this study are available at http://www.math.uni-bremen.de/~theodore/MALDIDWT.

Contact: theodore{at}math.uni-bremen.de

Supplementary information: Supplementary data are available at Bioinformatics online.

Associate Editor: Burkhard Rost


Received on July 1, 2008; revised on December 22, 2008; accepted on December 23, 2008

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?




Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.