A Frequent Pattern Based Time Series Classification Framework
Wan Li①③,Liao Jian-xin①②,Zhu Xiao-min①②,Ni Ping①③
①State Key Laboratory of Networking and Switching Technology Beijing University of Posts and Telecommunications, Beijing 100876, China; ②EBUPT Information Technology Co., Ltd, Beijing 100191, China; ③Carnegie Mellon University, Pittsburgh, US 15213, USA
Abstract:How to extract and select features from time series are two important topics in time series classification. In this paper, a MNOE (Mining Non-Overlap Episode) algorithm is presented to find non-overlap frequent patterns in time series and these non-overlap frequent patterns are considered as features of the time series. Based on these non-overlap episodes, an EGMAMC (Episode Generated Mixed memory Aggregation Markov Chain) model is presented to describe time series. According to the principle of likelihood ratio test, the connection between the support of episode and whether EGMAMC could describe the time series significantly is induced. Based on the definition of information gain, significant frequent patterns are selected as the features of time series for classification. The experiments on UCI (University of California Irvine) datasets and smart building datasets demonstrate that the classification model trained with selecting significant frequent patterns as features outperforms the one trained without selecting them on precision, recall and F-Measure. The time series classification models can be improved by efficiently extracting and effectively selecting non-overlap frequent patterns as features of time series.
万里,廖建新,朱晓民,倪萍. 一种基于频繁模式的时间序列分类框架[J]. 电子与信息学报, 2010, 32(2): 261-266 .
Wan Li①③,Liao Jian-xin①②,Zhu Xiao-min①②Ni Ping①③. A Frequent Pattern Based Time Series Classification Framework. , 2010, 32(2): 261-266 .