INTRODUCTIONDatabase similarity search programs tend to produce large volumes of output. It can become difficult to screen this volume of material and to assess whether the more remotely related sequences are really related to the query sequence. Thus, it is important to limit the sequence output; there are some relatively simple procedures that may be followed for each program, as described in this article. For searches of protein databases, avoid repetitive alignments with the same sequence by limiting searches to the protein sequence databases that are well curated, such as SwissProt and PIR, or to a specific genome, as opposed to the entire set of translated GenBank sequences (the GenPept database).

Download full-text PDF

Source
http://dx.doi.org/10.1101/pdb.top15DOI Listing

Publication Analysis

Top Keywords

searches protein
8
strategies sequence
4
sequence similarity
4
similarity database
4
database searches
4
searches introductiondatabase
4
introductiondatabase similarity
4
similarity search
4
search programs
4
programs tend
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!