Bioinformatics Advance Access originally published online on October 20, 2005
Bioinformatics 2005 21(24):4420-4422; doi:10.1093/bioinformatics/bti719
ConFind: a robust tool for conserved sequence identification
Department of Chemistry and Biochemistry, The University of Colorado at Boulder, UCB #215, Boulder, CO 80309, USA
*To whom correspondence should be addressed.
Summary: ConFind (conserved region finder) identifies regions of conservation in multiple sequence alignments that can serve as diagnostic targets. Designed to work with a large number of closely related, highly variable sequences, ConFind provides robust handling of alignments containing partial sequences and ambiguous characters. Conserved regions are defined in terms of four parameters: the minimum region length, the maximum informational entropy (variability) per position, the number of exceptions allowed to the maximum entropy criterion, and the minimum number of sequences that must contain a non-ambiguous character at a position for that position to be considered for inclusion in a conserved region. In a comparison of the entropy calculated for an alignment of 95 influenza A hemagglutinin sequences with random deletions, ConFind reduced the average error by 98% relative to the Find Conserved Regions option in BioEdit.
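The four criteria named in the summary can be illustrated with a minimal Python sketch. This is not ConFind's source code: the function names, the nucleotide alphabet, the use of Shannon entropy in bits, and the greedy left-to-right scan that trims trailing exceptions are all assumptions made for illustration. The key idea it tries to show is that ambiguous characters and gaps are excluded from the entropy calculation rather than counted as variation, and that a position with too few unambiguous characters cannot belong to a conserved region.

```python
import math
from collections import Counter

# Assumption: only these characters count as unambiguous (nucleotide alignment).
UNAMBIGUOUS = set("ACGT")


def column_entropy(column, min_sequences):
    """Shannon entropy (bits) of the unambiguous characters in one column.

    Returns None when fewer than min_sequences sequences carry an
    unambiguous character at this position.
    """
    counts = Counter(c for c in column.upper() if c in UNAMBIGUOUS)
    total = sum(counts.values())
    if total == 0 or total < min_sequences:
        return None
    return sum((n / total) * math.log2(total / n) for n in counts.values())


def find_conserved_regions(sequences, min_length, max_entropy,
                           max_exceptions, min_sequences):
    """Hypothetical greedy scan; returns (start, end) pairs, 0-based, end exclusive."""
    if not sequences:
        return []
    width = len(sequences[0])
    if any(len(s) != width for s in sequences):
        raise ValueError("aligned sequences must all have the same length")

    entropies = [
        column_entropy("".join(s[i] for s in sequences), min_sequences)
        for i in range(width)
    ]

    def conserved(i):
        return entropies[i] is not None and entropies[i] <= max_entropy

    regions = []
    i = 0
    while i < width:
        if not conserved(i):
            i += 1
            continue
        start, last_good, exceptions = i, i, 0
        j = i + 1
        # Extend until a position lacks enough data or exceptions run out.
        while j < width and entropies[j] is not None:
            if conserved(j):
                last_good = j
            elif exceptions < max_exceptions:
                exceptions += 1
            else:
                break
            j += 1
        end = last_good + 1  # trim trailing exceptions
        if end - start >= min_length:
            regions.append((start, end))
        i = end
    return regions


# Toy alignment: '-' marks a missing position, 'N' an ambiguous base.
alignment = [
    "ACGTACGT",
    "ACGTTCGT",
    "ACG-GCGT",
    "ACGNCCGT",
]
strict = find_conserved_regions(alignment, min_length=3, max_entropy=0.5,
                                max_exceptions=0, min_sequences=3)
relaxed = find_conserved_regions(alignment, min_length=3, max_entropy=0.5,
                                 max_exceptions=1, min_sequences=2)
```

In this toy alignment, the fourth column has only two unambiguous characters, so it breaks a region when three are required but is accepted when two suffice. The fifth column is fully variable and is bridged only when one exception is allowed.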
Requirements: ConFind requires Python 2.3, but Python 2.4 or an upgrade of the optparse module to Optik 1.5 is suggested. The program is known to run under Linux and DOS.
Availability: ConFind is licensed under the GNU General Public License (GPL). Source code, documentation, and a precompiled DOS executable are available for download at http://www.colorado.edu/chemistry/RGHP/software/
Contact: rowlen{at}colorado.edu
Received on August 10, 2005; revised on September 28, 2005; accepted on October 14, 2005

