PRESEE: an MDL/MML algorithm to time-series stream segmenting.

ScientificWorldJournal

College of Computer Science & Technology, Chengdu University of Information Technology, Chengdu 610225, China.

Published: February 2014

Time-series stream is one of the most common data types in data mining field. It is prevalent in fields such as stock market, ecology, and medical care. Segmentation is a key step to accelerate the processing speed of time-series stream mining. Previous algorithms for segmenting mainly focused on the issue of ameliorating precision instead of paying much attention to the efficiency. Moreover, the performance of these algorithms depends heavily on parameters, which are hard for the users to set. In this paper, we propose PRESEE (parameter-free, real-time, and scalable time-series stream segmenting algorithm), which greatly improves the efficiency of time-series stream segmenting. PRESEE is based on both MDL (minimum description length) and MML (minimum message length) methods, which could segment the data automatically. To evaluate the performance of PRESEE, we conduct several experiments on time-series streams of different types and compare it with the state-of-art algorithm. The empirical results show that PRESEE is very efficient for real-time stream datasets by improving segmenting speed nearly ten times. The novelty of this algorithm is further demonstrated by the application of PRESEE in segmenting real-time stream datasets from ChinaFLUX sensor networks data stream.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3706014PMC
http://dx.doi.org/10.1155/2013/386180DOI Listing

Publication Analysis

Top Keywords

time-series stream
20
stream segmenting
12
stream
8
real-time stream
8
stream datasets
8
presee
6
time-series
6
segmenting
6
presee mdl/mml
4
algorithm
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!