Bioinformatics Advance Access published online on March 22, 2005
Bioinformatics, doi:10.1093/bioinformatics/bti370
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 School of Electronics Engineering and Computer Science, Peking University, China; Institute for Genomics and Bioinformatics Graz University of Technology, 8010 Graz, Austria
* To whom correspondence should be addressed.
Motivation: High-throughput and high-resolution mass spectrometry instruments are increasingly used for disease classification and therapeutic guidance. However, the analysis of immense amount of data poses considerable challenges. We have therefore developed a novel method for dimensionality reduction and tested on a published ovarian high-resolution SELDI-TOF data set. Results: We have developed a four-step strategy for data preprocessing based on: 1) binning, 2) Kolmogorov-Smirnov test, 3) restriction of coefficient of variation, and 4) wavelet analysis. Subsequently, Support Vector Machines (SVM) were used for classification. The developed method achieves an average sensitivity of 97.38% (sd=0.0125) and an average specificity of 93.30% (sd=0.0174) in 1,000 independent k-fold cross validations, where k = 2, ..., 10. Availability: The software is available for academic and non-commercial institutions.
Received September 14, 2004
Revised March 1, 2005
Accepted March 1, 2005
Article
Ovarian cancer identification based on dimensionality reduction for high-throughput mass spectrometry data
2 Department of Information Engineering, University of Padova, Italy; Institute for Genomics and Bioinformatics, Graz University of Technology, 8010 Graz, Austria
3 Institute for Genomics and Bioinformatics Graz University of Technology, 8010 Graz, Austria
4 Electrical Engineering and Computer Science Department, Information and Telecommunication Center, University of Kansas, USA
5 Department of Information Engineering, University of Padova, Italy
6 Institute for Genomics and Bioinformatics, Christian-Doppler Laboratory for Genomics and Bioinformatics Graz University of Technology, 8010 Graz, Austria
Z. Trajanoski, E-mail: zlatko.trajanoski{at}tugraz.at
![]()
Abstract ![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
J. H. Oh, Y. B. Kim, P. Gurnani, K. P. Rosenblatt, and J. X. Gao Biomarker selection and sample prediction for multi-category disease on MALDI-TOF data Bioinformatics, August 15, 2008; 24(16): 1812 - 1818. [Abstract] [PDF] |
||||
![]() |
M. Hilario and A. Kalousis Approaches to dimensionality reduction in proteomic biomarker studies Brief Bioinform, March 1, 2008; 9(2): 102 - 118. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Saeys, I. Inza, and P. Larranaga A review of feature selection techniques in bioinformatics Bioinformatics, October 1, 2007; 23(19): 2507 - 2517. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Baumgartner and D. Baumgartner Biomarker Discovery, Disease Classification, and Similarity Query Processing on High-Throughput MS/MS Data of Inborn Errors of Metabolism J Biomol Screen, February 1, 2006; 11(1): 90 - 99. [Abstract] [PDF] |
||||


