首页> 外文OA文献 >A study of the use of complexity measures in the similarity search process adopted by kNN algorithm for time series prediction
【2h】

A study of the use of complexity measures in the similarity search process adopted by kNN algorithm for time series prediction

机译:kNN算法用于时间序列预测的相似度搜索过程中复杂度度量的使用研究

摘要

In the last two decades, with the rise of the Data Mining process, there is an increasing interest in the adaptation of Machine Learning methods to support Time Series non-parametric modeling and prediction. The non-parametric temporal data modeling can be performed according to local and global approaches. The most of the local prediction data strategies are based on the k-Nearest Neighbor (kNN) learning method. In this paper we propose a modification of the kNN algorithm for Time Series prediction. Our proposal differs from the literature by incorporating three techniques for obtaining amplitude and offset invariance, complexity invariance, and treatment of trivial matches. We evaluate the proposed method with six complexity measures, in order to verify the impact of these measures in the projection of the future values. Besides, we face our method with two Machine Learning regression algorithms. The experimental comparisons were performed using 55 data sets, which are available at the ICMC-USP Time Series Prediction Repository. Our results indicate that the developed method is competitive and the use of a complexity-invariant distance measure generally improves the predictive performance.
机译:在过去的二十年中,随着数据挖掘过程的兴起,人们对适应于支持时间序列非参数建模和预测的机器学习方法的兴趣日益浓厚。非参数时间数据建模可以根据局部和全局方法来执行。大多数本地预测数据策略都基于k最近邻(kNN)学习方法。在本文中,我们提出了对kNN算法进行时间序列预测的改进。我们的建议与文献不同,它采用了三种技术来获得幅度和偏移不变性,复杂性不变性以及琐碎匹配的处理。为了验证这些方法对未来价值预测的影响,我们使用六个复杂性度量来评估所提出的方法。此外,我们通过两种机器学习回归算法来面对我们的方法。使用55个数据集进行了实验比较,这些数据集可在ICMC-USP时间序列预测存储库中获得。我们的结果表明,所开发的方法具有竞争力,并且使用复杂度不变的距离度量通常可以提高预测性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号