Skip Navigation



Bioinformatics Advance Access published online on May 3, 2008

Bioinformatics, doi:10.1093/bioinformatics/btn218
This Article
Right arrow Advance Access manuscript (PDF) Freely available
Right arrow Supplementary Data
Right arrow All Versions of this Article:
24/13/1503    most recent
btn218v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Webb-Robertson, B.-J. M.
Right arrow Articles by Waters, K. M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Webb-Robertson, B.-J. M.
Right arrow Articles by Waters, K. M.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author (2008). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org

A Support Vector Machine model for the prediction of proteotypic peptides for accurate mass and time proteomics

Bobbie-Jo M. Webb-Robertson 1,*, William R. Cannon 1, Christopher S. Oehmen 1, Anuj R. Shah 2, Vidhya Gurumoorthi 3, Mary S. Lipton 4 and Katrina M. Waters 1

1Computational Biology & Bioinformatics, Pacific Northwest National Laboratory
2Scientific Data Management, Pacific Northwest National Laboratory
3Applied Computer Science, Pacific Northwest National Laboratory
4Biological Separations and Mass Spectrometry, Pacific Northwest National Laboratory

*To whom correspondence should be addressed. Dr. Bobbie-Jo M. Webb-Robertson, E-mail: bj{at}pnl.gov


   Abstract

Motivation: The standard approach to identifying peptides based on accurate mass and elution time (AMT) compares these profiles obtained from a high resolution mass spectrometer to a database of peptides previously identified from tandem mass spectrometry (MS/MS) studies. It would be advantageous, with respect to both accuracy and cost, to only search for those peptides that are detectable by MS (proteotypic).

Results: We present a Support Vector Machine (SVM) model that uses a simple descriptor space based on 35 properties of amino acid content, charge, hydrophilicity, and polarity for the quantitative prediction of proteotypic peptides. Using three independently derived AMT databases (Shewanella oneidensis, Salmonella typhimurium, Yersinia pestis) for training and validation within and across species, the SVM resulted in an average accuracy measure of ~0.8 with a standard deviation of less than 0.025. Furthermore, we demonstrate that these results are achievable with a small set of 12 variables and can achieve high proteome coverage.

Availability: http://omics.pnl.gov/software/STEPP.php

Associate Editor: Prof. Quackenbush


Received on March 18, 2008; revised on April 18, 2008; accepted on April 29, 2008

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?




Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.