INTRODUCTIONDatabase similarity search programs tend to produce large volumes of output. It can become difficult to screen this volume of material and to assess whether the more remotely related sequences are really related to the query sequence. Thus, it is important to limit the sequence output; there are some relatively simple procedures that may be followed for each program, as described in this article. For searches of protein databases, avoid repetitive alignments with the same sequence by limiting searches to the protein sequence databases that are well curated, such as SwissProt and PIR, or to a specific genome, as opposed to the entire set of translated GenBank sequences (the GenPept database).
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1101/pdb.top15 | DOI Listing |
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!