首页> 外文会议>International conference on advanced data mining and applications;ADMA 2010 >Discretization of Time Series Dataset Using Relative Frequency and K-Nearest Neighbor Approach
【24h】

Discretization of Time Series Dataset Using Relative Frequency and K-Nearest Neighbor Approach

机译:使用相对频率和K最近邻方法离散化时间序列数据集

获取原文

摘要

In this work, we propose an improved approach of time series data discretization using the Relative Frequency and K- nearest Neighbor functions called the RFknn method. The main idea of the method is to improve the process of determining the sufficient number of intervals for discretization of time series data. The proposed approach improved the time series data representation by integrating it with the Piecewise Aggregate Approximation (PAA) and the Symbolic Aggregate Approximation (SAX) representation. The intervals are represented as a symbol and can ensure efficient mining process where better knowledge model can be obtained without major loss of knowledge. The basic idea is not to minimize or maximize the number of intervals of the temporal patterns over their class labels. The performance of RFknn is evaluated using 22 temporal datasets and compared to the original time series discretization SAX method with similar representation. We show that RFknn can improve representation preciseness without losing symbolic nature of the original SAX representation. The experimental results showed that RFknn gives better term of representation with lower and comparable error rates.
机译:在这项工作中,我们提出了一种改进的时间序列数据离散化方法,该方法使用了称为RFknn方法的相对频率和K最近邻函数。该方法的主要思想是改进确定足够数量的间隔以离散化时间序列数据的过程。所提出的方法通过将其与分段聚合近似(PAA)和符号聚合近似(SAX)表示相集成,改进了时间序列数据的表示。间隔表示为一个符号,可以确保有效的挖掘过程,在此过程中可以获得更好的知识模型而不会大量丢失知识。基本思想不是最小化或最大化其类别标签上的时间模式的间隔数。使用22个时间数据集评估RFknn的性能,并与具有类似表示形式的原始时间序列离散化SAX方法进行比较。我们表明RFknn可以提高表示精度,而不会丢失原始SAX表示的符号性质。实验结果表明,RFknn可以提供更好的表示条件,并且具有更低且可比的错误率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号