Bioinformatics Advance Access originally published online on January 26, 2005
Bioinformatics 2005 21(9):1764-1775; doi:10.1093/bioinformatics/bti254
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Feature extraction and quantification for mass spectrometry in biomedical applications using the mean spectrum


1Department of Biostatistics and Applied Mathematics, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
2Department of Molecular Pathology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
*To whom correspondence should be addressed.
Motivation: Mass spectrometry yields complex functional data for which the features of scientific interest are peaks. A common two-step approach to analyzing these data involves first extracting and quantifying the peaks, then analyzing the resulting matrix of peak quantifications. Feature extraction and quantification involves a number of interrelated steps. It is important to perform these steps well, since subsequent analyses condition on these determinations. Also, it is difficult to compare the performance of competing methods for analyzing mass spectrometry data since the true expression levels of the proteins in the population are generally not known.
Results: In this paper, we introduce a new method for feature extraction in mass spectrometry data that uses translation-invariant wavelet transforms and performs peak detection using the mean spectrum. We examine the method's performance through examples and simulation, and demonstrate the advantages of using the mean spectrum to detect peaks. We also describe a new physics-based computer model of mass spectrometry and demonstrate how one may design simulation studies based on this tool to systematically compare competing methods.
Availability: MATLAB scripts to implement the methods described in this paper and R code for the virtual mass spectrometer are available at http://bioinformatics.mdanderson.org/software.html
Contact: jefmorris{at}mdanderson.org
Supplementary information: http://bioinformatics.mdanderson.org/supplements.html
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
Y. Wang, X. Zhou, H. Wang, K. Li, L. Yao, and S. T.C. Wong Reversible jump MCMC approach for peak identification for stroke SELDI mass spectrometry using mixture model Bioinformatics, July 1, 2008; 24(13): i407 - i413. [Abstract] [PDF] |
||||
![]() |
A. Barla, G. Jurman, S. Riccadonna, S. Merler, M. Chierici, and C. Furlanello Machine learning methods for predictive proteomics Brief Bioinform, March 1, 2008; 9(2): 119 - 128. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. S. Morris, B. N. Clark, and H. B. Gutstein Pinnacle: a fast, automatic and accurate method for detecting and quantifying protein spots in 2-dimensional gel electrophoresis data Bioinformatics, February 15, 2008; 24(4): 529 - 536. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Noy and D. Fasulo Improved model-based, platform-independent feature extraction for mass spectrometry Bioinformatics, October 1, 2007; 23(19): 2528 - 2535. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Park, S. R. Downing, D. Kim, W. C. Hahn, C. Li, P. W. Kantoff, and L.J. Wei Simultaneous and exact interval estimates for the contrast of two groups based on an extremely high dimensional variable: application to mass spec data Bioinformatics, June 15, 2007; 23(12): 1451 - 1458. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. V. Karpievitch, E. G. Hill, A. J. Smolka, J. S. Morris, K. R. Coombes, K. A. Baggerly, and J. S. Almeida PrepMS: TOF MS data graphical preprocessing tool Bioinformatics, January 15, 2007; 23(2): 264 - 265. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. S. Cornett, J. A. Mobley, E. C. Dias, M. Andersson, C. L. Arteaga, M. E. Sanders, and R. M. Caprioli A Novel Histology-directed Strategy for MALDI-MS Tissue Profiling That Improves Throughput and Cellular Specificity in Human Breast Cancer Mol. Cell. Proteomics, October 1, 2006; 5(10): 1975 - 1983. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Bellew, M. Coram, M. Fitzgibbon, M. Igra, T. Randolph, P. Wang, D. May, J. Eng, R. Fang, C. Lin, et al. A suite of algorithms for the comprehensive analysis of complex protein mixtures using high-resolution LC-MS Bioinformatics, August 1, 2006; 22(15): 1902 - 1909. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. H. Atkins and J. S. Johansson Technologies to shape the future: proteomics applications in anesthesiology and critical care medicine. Anesth. Analg., April 1, 2006; 102(4): 1207 - 1216. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. W. Randolph, B. L. Mitchell, D. F. McLerran, P. D. Lampe, and Z. Feng Quantifying Peptide Signal in MALDI-TOF Mass Spectrometry Data Mol. Cell. Proteomics, December 1, 2005; 4(12): 1990 - 1999. [Abstract] [Full Text] [PDF] |
||||



