ConFind: a robust tool for conserved sequence identification.

Bioinformatics

Department of Chemistry and Biochemistry, The University of Colorado at Boulder, UCB #215, Boulder, CO 80309, USA.

Published: December 2005

Summary: ConFind (conserved region finder) identifies regions of conservation in multiple sequence alignments that can serve as diagnostic targets. Designed to work with a large number of closely related, highly variable sequences, ConFind provides robust handling of alignments containing partial sequences and ambiguous characters. Conserved regions are defined in terms of minimum region length, maximum informational entropy (variability) per position, number of exceptions allowed to the maximum entropy criterion and the minimum number of sequences that must contain a non-ambiguous character at a position to be considered for inclusion in a conserved region. Comparison of the calculated entropy for an alignment of 95 influenza A hemagglutinin sequences with random deletions results in a 98% reduction in the average error in ConFind relative to the 'Find Conserved Regions' option in BioEdit.

Requirements: ConFind requires Python 2.3, but Python 2.4 or an upgrade of the optparse module to Optik 1.5 is suggested. The program is known to run under Linux and DOS.

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/bti719DOI Listing

Publication Analysis

Top Keywords

confind robust
8
conserved region
8
confind
5
conserved
5
robust tool
4
tool conserved
4
conserved sequence
4
sequence identification
4
identification summary
4
summary confind
4

Similar Publications

Summary: ConFind (conserved region finder) identifies regions of conservation in multiple sequence alignments that can serve as diagnostic targets. Designed to work with a large number of closely related, highly variable sequences, ConFind provides robust handling of alignments containing partial sequences and ambiguous characters. Conserved regions are defined in terms of minimum region length, maximum informational entropy (variability) per position, number of exceptions allowed to the maximum entropy criterion and the minimum number of sequences that must contain a non-ambiguous character at a position to be considered for inclusion in a conserved region.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!