Strategies for sequence similarity database searches.

CSH Protoc

Published: July 2007

INTRODUCTIONDatabase similarity search programs tend to produce large volumes of output. It can become difficult to screen this volume of material and to assess whether the more remotely related sequences are really related to the query sequence. Thus, it is important to limit the sequence output; there are some relatively simple procedures that may be followed for each program, as described in this article. For searches of protein databases, avoid repetitive alignments with the same sequence by limiting searches to the protein sequence databases that are well curated, such as SwissProt and PIR, or to a specific genome, as opposed to the entire set of translated GenBank sequences (the GenPept database).

Download full-text PDF	Source
http://dx.doi.org/10.1101/pdb.top15	DOI Listing

Publication Analysis

Top Keywords

searches protein

strategies sequence

sequence similarity

similarity database

database searches

searches introductiondatabase

introductiondatabase similarity

similarity search

search programs

programs tend

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!