J Bioinform Comput Biol
June 2021
Discovering motifs and repeats in data sequences is of great importance in biology, and many efficient tools for finding them have been developed. Because the number of results can be very large, our goal is to provide a tool that, on a mathematical basis, precisely finds all motifs and repeats, filters them according to input arguments, and outputs the results in a convenient way. RepeatsPlus is a program that provides statistical filtering based on input sequence length and the number of repeat occurrences, motif mask filtering, filtering related to ambiguous letters in the input sequence, and a large number of other options.
DNA repeats are of great importance for biological research, and many tools for determining repeats have been developed. Herein we define a method for extracting a statistically significant subset of a determined set of repeats. Our aim was to identify the subset of repeats in the input sequences that would not be expected to occur as many times as they do in a random sequence of the same length.
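The idea of comparing observed repeat counts against a random sequence of the same length can be illustrated with a minimal sketch. This is not the authors' method: it assumes a uniform, independent four-letter model and uses a simple ratio threshold rather than a formal significance test. The function names and the `min_ratio` parameter are hypothetical and chosen only for illustration.

```python
from collections import Counter


def count_kmers(sequence: str, k: int) -> Counter:
    """Count every overlapping substring of length k in the sequence."""
    return Counter(sequence[i:i + k] for i in range(len(sequence) - k + 1))


def expected_occurrences(n: int, k: int, alphabet_size: int = 4) -> float:
    """Expected count of one fixed k-mer in a random sequence of length n.

    Assumption: letters are independent and uniformly distributed, so each
    of the n - k + 1 positions matches with probability alphabet_size ** -k.
    """
    if n < k:
        return 0.0
    return (n - k + 1) / alphabet_size ** k


def overrepresented_repeats(sequence: str, k: int, min_ratio: float = 2.0) -> dict:
    """Keep k-mers that repeat and exceed min_ratio times the expected count.

    min_ratio is an illustrative cutoff, not a statistical significance level.
    """
    expected = expected_occurrences(len(sequence), k)
    counts = count_kmers(sequence, k)
    return {
        kmer: count
        for kmer, count in counts.items()
        if count >= 2 and count > min_ratio * expected
    }


if __name__ == "__main__":
    # Toy input; real genomic sequences are far longer.
    print(overrepresented_repeats("ACGTACGTACGTTTGA", 4))
```

A real filter would replace the ratio cutoff with a probability of seeing at least the observed count under the chosen background model, and would account for the many k-mers being tested at once.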