Bioinformatics Advance Access originally published online on July 4, 2006
Bioinformatics 2006 22(17):2059-2065; doi:10.1093/bioinformatics/btl355
© The Author 2006. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org

Improved peak detection in mass spectrum by incorporating continuous wavelet transform-based pattern matching

Pan Du 1, Warren A. Kibbe 1 and Simon M. Lin 1,*

1 Robert H. Lurie Comprehensive Cancer Center, Northwestern University, Chicago, IL 60611, USA

*To whom correspondence should be addressed.

Motivation: A major problem for current peak detection algorithms is that noise in mass spectrometry (MS) spectra gives rise to a high rate of false positives. The false positive rate is especially problematic when detecting low-amplitude peaks. Usually, baseline correction and smoothing are applied before peak detection is attempted. The results of this approach are very sensitive to the amount of smoothing and the aggressiveness of the baseline correction, which makes peak detection inconsistent between runs, instruments and analysis methods.

Results: Most peak detection algorithms identify peaks by amplitude alone, ignoring the additional information present in the shape of the peaks in a spectrum. In our experience, ‘true’ peaks have characteristic shapes, and a shape-matching function that returns a ‘goodness of fit’ coefficient should give a more robust peak identification method. Based on these observations, a continuous wavelet transform (CWT)-based peak detection algorithm has been devised that identifies peaks with different scales and amplitudes. Transforming the spectrum into wavelet space simplifies the pattern-matching problem and also provides a powerful technique for identifying the signal and separating it from spike noise and colored noise. This transformation, together with the additional information provided by the 2D CWT coefficients, can greatly enhance the effective signal-to-noise ratio. Furthermore, the technique requires no baseline removal or peak smoothing before peak detection, which improves the robustness of peak detection under a variety of conditions. The algorithm was evaluated on SELDI-TOF spectra with known polypeptide positions and compared with two other popular algorithms. The results show that the CWT-based algorithm can identify both strong and weak peaks while keeping the false positive rate low.
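
The idea of matching peak shape across scales can be illustrated with a minimal Python sketch. It is not the authors' R implementation: the choice of a Mexican hat wavelet, the scale set, the thresholds, the synthetic spectrum and all function names (`mexican_hat`, `cwt_row`, `detect_peaks`, `synthetic_spectrum`) are assumptions made for illustration, and the per-position maximum over scales is a simplified stand-in for whatever ridge analysis the full algorithm performs on the 2D CWT coefficients.

```python
import math

def mexican_hat(t):
    """Mexican hat mother wavelet (unnormalized); an assumed wavelet choice."""
    return (1.0 - t * t) * math.exp(-0.5 * t * t)

def cwt_row(signal, scale):
    """CWT coefficients at one integer scale, by direct correlation.

    Positions where the wavelet would overhang the ends are left at 0.0.
    """
    half = 5 * scale  # the wavelet is negligible beyond 5 scale units
    kernel = [mexican_hat(k / scale) / math.sqrt(scale)
              for k in range(-half, half + 1)]
    row = [0.0] * len(signal)
    for i in range(half, len(signal) - half):
        row[i] = sum(w * signal[i - half + j] for j, w in enumerate(kernel))
    return row

def detect_peaks(signal, scales=(1, 2, 4, 8, 12), min_scale=4, min_coef=5.0):
    """Return indices whose strongest CWT response is a local maximum,
    is large enough, and occurs at a scale too wide to be a spike."""
    rows = [cwt_row(signal, s) for s in scales]
    ridge, best = [], []
    for i in range(len(signal)):
        # Strongest coefficient over all scales, and the scale producing it.
        coef, scale = max((row[i], s) for row, s in zip(rows, scales))
        ridge.append(coef)
        best.append(scale)
    peaks = []
    for i in range(1, len(signal) - 1):
        is_local_max = ridge[i - 1] < ridge[i] >= ridge[i + 1]
        if is_local_max and ridge[i] >= min_coef and best[i] >= min_scale:
            peaks.append(i)
    return peaks

def synthetic_spectrum(n=400):
    """Hypothetical test signal: decaying baseline, a strong and a weak
    Gaussian peak (indices 100 and 250), and a one-point spike (index 180)."""
    spectrum = []
    for i in range(n):
        baseline = 50.0 * math.exp(-i / 200.0)
        strong = 30.0 * math.exp(-((i - 100) ** 2) / (2 * 4.0 ** 2))
        weak = 5.0 * math.exp(-((i - 250) ** 2) / (2 * 4.0 ** 2))
        spike = 20.0 if i == 180 else 0.0
        spectrum.append(baseline + strong + weak + spike)
    return spectrum

peaks = detect_peaks(synthetic_spectrum())
print(peaks)
```

Because the wavelet has near-zero mean, the slowly varying baseline contributes almost nothing to the coefficients, so no baseline removal is applied in this sketch. The one-point spike responds most strongly at the smallest scale and is rejected by the `min_scale` test, whereas the weak peak, which sits below the raw amplitude of both the baseline and the spike, responds most strongly at a scale comparable to its width.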

Availability: The algorithm is implemented in R and will be included as an open-source module in the Bioconductor project.

Contact: s-lin2{at}northwestern.edu

Supplementary material: http://basic.northwestern.edu/publications/peakdetection/. Colour versions of the figures in this article can be found at Bioinformatics Online.


Received on April 24, 2006; revised on June 22, 2006; accepted on June 23, 2006



