Bioinformatics Advance Access published online on August 16, 2005
Bioinformatics, doi:10.1093/bioinformatics/bti620
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894
* To whom correspondence should be addressed.
Motivation: The key to mass-spectrometry-based proteomics is peptide sequencing. The major challenge in peptide sequencing, whether library search or de novo, is to better infer statistical significance and better attain noise reduction. Because the noise in a spectrum depends on experimental conditions, the instrument used, and many other factors, it cannot be predicted even if the peptide sequence is known. The characteristics of the noise can only be uncovered once a spectrum is given. We wish to overcome such issues. Results: We design RAId to identify peptides from their associated tandem mass spectrometry data. RAId performs a novel de novo sequencing followed by a search in a peptide library that we created. Through de novo sequencing, we establish the spectrum-specific background score statistics for the library search. When the database search fails to return significant hits, the top-ranking de novo sequences become potential candidates for new peptides that are not yet in the database. The use of spectrum-specific background statistics seems to enable RAId to perform well even when the spectral quality is marginal. Other important features of RAId include its potential in de novo sequencing alone and the ease of incorporating post-translational modifications. Availability: Programs implementing the methods described are available from the authors upon request.
Received June 15, 2005
Revised July 26, 2005
Accepted August 8, 2005
Article
Robust Accurate Identification of Peptides (RAId): deciphering MS2 data using a structured library search with de novo based statistics
Yi-Kuo Yu, E-mail: yyu{at}ncbi.nlm.nih.gov
![]()
Abstract ![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
S. Kim, N. Bandeira, and P. A. Pevzner Spectral Profiles, a Novel Representation of Tandem Mass Spectra and Their Applications for de Novo Peptide Sequencing and Identification Mol. Cell. Proteomics, June 1, 2009; 8(6): 1391 - 1400. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Kim, N. Gupta, N. Bandeira, and P. A. Pevzner Spectral Dictionaries: Integrating de novo Peptide Sequencing with Database Search of Tandem Mass Spectra Mol. Cell. Proteomics, January 1, 2009; 8(1): 53 - 69. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Bandeira, J. V. Olsen, M. Mann, and P. A. Pevzner Multi-spectra peptide sequencing and its applications to multistage mass spectrometry Bioinformatics, July 1, 2008; 24(13): i416 - i423. [Abstract] [Full Text] [PDF] |
||||

