Bioinformatics Advance Access originally published online on January 10, 2008
Bioinformatics 2008 24(6):882-884; doi:10.1093/bioinformatics/btn012
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
OutlierD: an R package for outlier detection using quantile regression on mass spectrometry data
1Department of Statistics, 2Department of Biostatistics, 3Institute of Statistics and 4Department of Chemistry, Korea University, Seoul, Korea
*To whom correspondence should be addressed.
| Abstract |
|---|
Summary: It is important to preprocess high-throughput data generated from mass spectrometry experiments in order to obtain a successful proteomics analysis. Outlier detection is an important preprocessing step. A naive outlier detection approach may miss many true outliers and instead select many non-outliers because of the heterogeneity of the variability observed commonly in high-throughput data. Because of this issue, we developed a outlier detection software program accounting for the heterogeneous variability by utilizing linear, non-linear and non-parametric quantile regression techniques. Our program was developed using the R computer language. As a consequence, it can be used interactively and conveniently in the R environment.
Availability: An R package, OutlierD, is available at the Bioconductor project at http://www.bioconductor.org
Contact: jael{at}korea.ac.kr
Supplementary information: Supplementary Data are available at Bioinformatics online.
Associate Editor: Limsoon Wong
Received on August 10, 2007; revised on January 4, 2008; accepted on January 4, 2008