Bioinformatics Advance Access originally published online on August 10, 2009
Bioinformatics 2009 25(19):2609-2610; doi:10.1093/bioinformatics/btp477
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
PineSAP—sequence alignment and SNP identification pipeline
1Department of Plant Sciences and 2Department of Evolution and Ecology, Address One Shields Ave. University of California, Davis, CA 95616, USA
*To whom correspondence should be addressed.
| Abstract |
|---|
Summary: The Pine Alignment and SNP Identification Pipeline (PineSAP) provides a high-throughput solution to single nucleotide polymorphism (SNP) prediction using multiple sequence alignments from re-sequencing data. This pipeline integrates a hybrid of customized scripting, existing utilities and machine learning in order to increase the speed and accuracy of SNP calls. The implementation of this pipeline results in significantly improved multiple sequence alignments and SNP identifications when compared with existing solutions. The use of machine learning in the SNP identifications extends the pipeline's application to any eukaryotic species where full genome sequence information is unavailable.
Availability: All code used for this pipeline is freely available at the Dendrome project website (http://dendrome.ucdavis.edu/adept2/resequencing.html)
Contact: jlwegrzyn{at}ucdavis.edu
Associate Editor: John Quackenbush
Received on May 22, 2009; revised on July 31, 2009; accepted on August 2, 2009