A disk-aware algorithm for time series motif discovery.

Data Min Knowl Discov

Swartz Center for Computational Neuroscience, University of California, San Diego, CA, USA.

Published: January 2011

Time series motifs are sets of very similar subsequences of a long time series. They are of interest in their own right, and are also used as inputs in several higher-level data mining algorithms including classification, clustering, rule-discovery and summarization. In spite of extensive research in recent years, finding time series motifs in massive databases is an open problem. Previous efforts either found motifs or considered relatively small datasets residing in . In this work, we leverage off previous work on pivot-based indexing to introduce a disk-aware algorithm to find time series motifs exactly in multi-gigabyte databases which contain on the order of of time series. We have evaluated our algorithm on datasets from diverse areas including medicine, anthropology, computer networking and image processing and show that we can find interesting and meaningful motifs in datasets that are many orders of magnitude larger than anything considered before.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7062370PMC
http://dx.doi.org/10.1007/s10618-010-0176-8DOI Listing

Publication Analysis

Top Keywords

time series
24
series motifs
12
disk-aware algorithm
8
time
6
series
6
motifs
5
algorithm time
4
series motif
4
motif discovery
4
discovery time
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!