Bioinformatics Advance Access published online on May 14, 2008
Bioinformatics, doi:10.1093/bioinformatics/btn203
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
An overview of the wcd EST Clustering Tool
aWits Bioinformatics, University of the Witwatersrand, Johannesburg, Private Bag 3, 2050 Wits, South Africa; bSouth African National Bioinformatics Institute, University of the Western Cape, Private Bag X17, Bellville 7535, South Africa; cUniversität Bielefeld, AG Genominformatik, Technische Fakultät, Postfach 100131, 33501Bielefeld, Germany
*To whom correspondence should be addressed. Dr. Scott Hazelhurst, E-mail: scott.hazelhurst{at}wits.ac.za
| Abstract |
|---|
Summary: The wcd system is an open source tool for clustering Expressed Sequence Tags (EST) and other DNA and RNA sequences. wcd allows efficient all-versus-all comparison of ESTs using either the d2 distance function or edit distance, improving existing implementations of d2. It supports merging, refinement and reclustering of clusters. It is drop in' compatible with the StackPack clustering package. wcd supports parallelisation under both SMP and cluster architectures. It is distributed with an EMBOSS wrapper allowing wcd to be installed as part of an EMBOSS installation (and so provided by a web server).
Availability: wcd is distributed under a GPL licence and is available from http://code.google.com/p/wcdest.
Contact: scott.hazelhurst{at}wits.ac.za
Supplementary information: Additional experimental results. The wcd manual, a companion paper describing underlying algorithms, and all data sets used for experimentation can also be found at www.bioinf.wits.ac.za/~scott/wcdsupp.html
Associate Editor: Prof. David Rocke
Received on October 30, 2007; revised on March 21, 2008; accepted on April 20, 2008