Bioinformatics Advance Access published online on April 3, 2006
Bioinformatics, doi:10.1093/bioinformatics/btl118
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 Department of Computer Sciences, The University of Texas at Austin
* To whom correspondence should be addressed.
Motivation: We reformulate the problem of comparing mass-spectra by mapping spectra to a vector space model. Our search method leverages a metric space indexing algorithm to produce an initial candidate set, which can be followed by any fine ranking scheme. Results: We consider three distance measures integrated into a multi-vantage point index structure. Of these, a semi-metric fuzzy-cosine distance using peptide precursor mass constraints performs the best. The index acts as a coarse, lossless filter with respect to the SEQUEST [Yates III et al., 1995] and ProFound [Zhang and Chait, 2000] scoring schemes, reducing the number of distance computations and returned candidates for fine filtering to about 0.5% and 0.02% of the database respectively. The fuzzy cosine distance term improves specificity over a peptide precursor mass filter, reducing the number of returned candidates by an order of magnitude. Run time measurements suggest proportional speedups in overall search times. Using an implementation of ProFound's Bayesian score as an example of a fine filter on a test set of E.coli protein fragmentation spectra, the top results of our sample system are consistent with that of SEQUEST. Supplementary Information: Available at Bioinformatics online.
Received August 17, 2005
Revised March 24, 2006
Accepted March 25, 2006
Article
A fast coarse filtering method for peptide identification by mass spectrometry
Smriti R. Ramakrishnan 1 *,
Rui Mao 1,
Aleksey A. Nakorchevskiy 2,
John T. Prince 3,
Willard S. Willard 1,
Weijia Xu 1,
Edward M. Marcotte 4,
and
Daniel P. Miranker 5
2 Department of Chemistry and Biochemistry, The University of Texas at Austin, Austin, Texas 78712
3 Institute for Cellular and Molecular Biology, The University of Texas at Austin
4 Institute for Cellular and Molecular Biology, The University of Texas at Austin; Department of Chemistry and Biochemistry, The University of Texas at Austin, Austin, Texas 78712
5 Department of Computer Sciences, The University of Texas at Austin; Institute for Cellular and Molecular Biology, The University of Texas at Austin
Smriti R. Ramakrishnan, E-mail: smriti{at}cs.utexas.edu
![]()
Abstract
Associate Editor: Satoru Miyano
![]()
CiteULike
Connotea
Del.icio.us What's this?