Bioinformatics Advance Access published online on October 20, 2005
Bioinformatics, doi:10.1093/bioinformatics/bti719
1 Department of Chemistry and Biochemistry, The University of Colorado at Boulder, UCB #215, Boulder, CO, 80309, USA
* To whom correspondence should be addressed.
Summary: ConFind (CONserved region FINDer) identifies regions of conservation in multiple sequence alignments that can serve as diagnostic targets. Designed to work with a large number of closely related, highly variable sequences, ConFind provides robust handling of alignments containing partial sequences and ambiguous characters. Conserved regions are defined in terms of minimum region length, maximum informational entropy (variability) per position, number of exceptions allowed to the maximum entropy criterion, and the minimum number of sequences that must contain a non-ambiguous character at a position to be considered for inclusion in a conserved region. Comparison of the calculated entropy for an alignment of 95 influenza A hemagglutinin sequences with random deletions results in a 98% reduction in the average error in ConFind relative to the Find Conserved Regions option in BioEdit.

Requirements: ConFind requires Python 2.3, but Python 2.4 or an upgrade of the optparse module to Optik 1.5 is suggested. The program is known to run under Linux and DOS.

Availability: ConFind is licensed under the GNU General Public License (GPL). Source code, documentation, and a precompiled DOS executable are available for download at http://www.colorado.edu/chemistry/RGHP/software/.
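The four criteria named in the Summary (minimum region length, maximum entropy per position, allowed exceptions, and minimum number of non-ambiguous characters per position) can be illustrated with a minimal sketch. This is not ConFind's source code. The function names, the default parameter values, the nucleotide alphabet `ACGT`, the use of base-2 Shannon entropy, and the greedy left-to-right scan are all assumptions made for illustration, and the sketch is written for modern Python 3 rather than the Python 2.3 the program targets.

```python
import math
from collections import Counter

# Assumption: nucleotide alignments; anything outside ACGT (gaps, N, other
# IUPAC codes) is treated as ambiguous and ignored when computing entropy.
UNAMBIGUOUS = set("ACGT")


def column_entropy(column, min_seqs):
    """Shannon entropy (bits) of one alignment column.

    Only unambiguous characters are counted. Returns None when fewer than
    min_seqs sequences have an unambiguous character at this position.
    """
    counts = Counter(ch for ch in column.upper() if ch in UNAMBIGUOUS)
    n = sum(counts.values())
    if n == 0 or n < min_seqs:
        return None
    return sum((c / n) * math.log2(n / c) for c in counts.values())


def find_conserved_regions(alignment, min_length=4, max_entropy=0.2,
                           max_exceptions=0, min_seqs=2):
    """Return half-open, 0-based (start, end) column ranges.

    A region starts and ends on a column whose entropy is <= max_entropy,
    contains at most max_exceptions columns above that limit, and never
    includes a column with too few unambiguous characters.
    """
    n_cols = len(alignment[0])  # assumes all rows have equal length
    entropies = [
        column_entropy("".join(seq[i] for seq in alignment), min_seqs)
        for i in range(n_cols)
    ]

    def is_good(i):
        return entropies[i] is not None and entropies[i] <= max_entropy

    regions = []
    start = 0
    while start < n_cols:
        if not is_good(start):
            start += 1
            continue
        last_good = start
        exceptions = 0
        for j in range(start + 1, n_cols):
            if entropies[j] is None:
                break  # too few informative sequences: region cannot continue
            if entropies[j] > max_entropy:
                if exceptions == max_exceptions:
                    break
                exceptions += 1
            else:
                last_good = j  # trailing exceptions are trimmed off
        if last_good - start + 1 >= min_length:
            regions.append((start, last_good + 1))
            start = last_good + 1
        else:
            start += 1
    return regions
```

For the hypothetical four-sequence alignment `["ACGTACGTAC", "ACGTTCGTAC", "ACGTGCGTNN", "ACGT-CGTAC"]`, calling `find_conserved_regions(alignment, min_length=4, max_entropy=0.0, max_exceptions=0, min_seqs=3)` returns `[(0, 4), (5, 10)]`: the variable fifth column splits the alignment, while the two `N` characters in the last columns are ignored because three sequences still carry an unambiguous base there. Allowing one exception merges the two regions into `(0, 10)`.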
Received August 10, 2005
Revised September 28, 2005
Accepted October 14, 2005
Applications note
ConFind: a robust tool for conserved sequence identification
Kathy L. Rowlen, E-mail: rowlen{at}colorado.edu

