A consensus program for molecular sequences
Department of Computer Science, Memorial University of Newfoundland St John's, NF AlC 5S7, Canada
1Department of Mathemancs, University of Louisville Louisville, KY 40292, USA
Although molecular biologists often calculate consensus sequences from aligned DNA or protein sequences, relatively little is known about the properties of many of the consensus methods being used. Consequently, we wrote a program, CONSENSUS, to analyze and compare methods of calculating a consensus result (a base, an ambiguity code or a subset of codes) at a position in an aligned set of molecular sequences. The program supports alphabets of up to four symbols (e.g. (R, Y) or A, C, G, T). The program's output makes it suitable for exploratory data analysis or for selecting values of thresholds or confidence levels in consensus methods having such parameters.
Received on December 1, 1992; accepted on June 1, 1993