Bioinformatics Advance Access published online on December 13, 2005
Bioinformatics, doi:10.1093/bioinformatics/bti829
1 Department of Information Technology and Turku Centre for Computer Science, University of Turku, Finland
* To whom correspondence should be addressed.
Motivation: Peptide identification by tandem mass spectrometry is an important tool in proteomic research. Powerful identification programs exist, such as SEQUEST, ProICAT and Mascot, which can relate experimental spectra to the theoretical ones derived from protein databases, thus removing much of the manual input needed in the identification process. However, the time-consuming validation of the peptide identifications is still the bottleneck of many proteomic studies. One way to further streamline this process is to remove those spectra that are unlikely to provide a confident or valid peptide identification, and in this way to reduce the labour required in the validation phase.
Results: We propose a prefiltering scheme for evaluating the quality of spectra before the database search. The spectra are classified into two classes: spectra that contain valuable information for peptide identification, and spectra that are not derived from peptides or contain insufficient information for interpretation. The different spectral features developed for the classification are tested on real-life material originating from human lymphoblast samples and on a standard mixture of 9 proteins, both labelled with the ICAT reagent. The results show that the prefiltering scheme efficiently separates the two spectra classes.
Availability: The software tools are available on request from the authors.
Supplementary information: The Mascot ion score distributions and the C4.5 classification rules can be found at http://staff.cs.utu.fi/staff/jussi.salmi/Supplementary_material.pdf.
Received July 21, 2005
Revised December 7, 2005
Accepted December 8, 2005
Article
Quality classification of tandem mass spectrometry data
Jussi Salmi 1 *, Robert Moulder 2, Jan-Jonas Filén 3, Olli S. Nevalainen 1, Tuula A. Nyman 4, Riitta Lahesmaa 2 and Tero Aittokallio 5
2 Turku Centre for Biotechnology, University of Turku and Åbo Akademi University, Finland
3 Turku Centre for Biotechnology, University of Turku and Åbo Akademi University, Finland; The National Graduate School in Informational and Structural Biology, Finland
4 Finnish Institute of Occupational Health, Helsinki, Finland
5 Turku Centre for Biotechnology, University of Turku and Åbo Akademi University, Finland; Department of Mathematics, University of Turku, Finland
Jussi Salmi, E-mail: jussi.salmi{at}it.utu.fi