A methodology for training set instance selection using mutual information in time series prediction

Milos B. Stojanovic; Milos M. Bozic; Milena M. Stankovic; Zoran P. Stajic

首页> 外文期刊>Neurocomputing >A methodology for training set instance selection using mutual information in time series prediction

【24h】

A methodology for training set instance selection using mutual information in time series prediction

机译：在时间序列预测中使用互信息训练集合实例的方法

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

团队文献服务 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Training set instance selection is an important preprocessing step in many machine learning problems, including time series prediction, and has to be considered in practice in order to increase the quality of the predictions and possibly reduce training time. Recently, the usage of mutual information (MI) has been proposed in regression tasks, mostly for feature selection and for identifying the real data from data sets that contain noise and outliers. This paper proposes a new methodology for training set instance selection for long-term time series prediction. The proposed methodology combines a recursive prediction strategy and advanced instance selection criterion-the nearest neighbor based MI estimator. An application of the concept of MI is presented for the selection of training instances based on MI computation between initial training set instances and the current forecasting instance, for every prediction step. The novelty of the approach lies in the fact that it fits the initial training subset with the current forecasting instance, and consequently reduces the uncertainty of the prediction. In this way, by selecting instances which share a large amount of MI with the current forecasting instance in every prediction step, error propagation and accumulation can be reduced, both of which are well known shortcomings of the recursive prediction strategy, thus leading to better forecasting quality. Another element which sets this approach apart from others is that it is not proposed as an outlier detector, but for the instance selection of data which do not necessarily have to contain noise and outliers. The results obtained from the data sets from NN5 competition in time series prediction indicate that the proposed method increases the quality of long-term time series prediction, as well as reduces the amount of instances needed for building the model.

机译：训练集实例选择是许多机器学习问题（包括时间序列预测）中重要的预处理步骤，必须在实践中加以考虑，以提高预测的质量并可能减少训练时间。最近，已经提出在回归任务中使用互信息（MI），主要用于特征选择以及从包含噪声和异常值的数据集中识别真实数据。本文提出了一种用于长期时间序列预测的训练集实例选择的新方法。所提出的方法结合了递归预测策略和高级实例选择准则-基于最近邻的MI估计器。提出了MI概念的应用，用于针对每个预测步骤，基于初始训练集实例与当前预测实例之间的MI计算来选择训练实例。该方法的新颖之处在于，它使初始训练子集与当前的预测实例相适应，因此减少了预测的不确定性。这样，通过在每个预测步骤中选择与当前预测实例共享大量MI的实例，可以减少错误传播和累积，这都是递归预测策略的众所周知的缺点，因此可以更好地进行预测质量。将这种方法与其他方法区分开的另一个元素是，它不建议用作离群值检测器，但是对于实例数据的选择并不一定要包含噪声和离群值。从时间序列预测中NN5竞争的数据集获得的结果表明，该方法提高了长期时间序列预测的质量，并且减少了构建模型所需的实例数量。

著录项

来源
《Neurocomputing》 |2014年第2期|236-245|共10页
作者
Milos B. Stojanovic; Milos M. Bozic; Milena M. Stankovic; Zoran P. Stajic;
展开▼
作者单位

College of Applied Technical Sciences,Aleksandra Medvedeva 20,18000 Nis,Serbia,Bulevar doktora Zorana Dindica 29/5,18000 Nis,Serbia,Aleksandra Medvedeva 20,18000 Nis,Serbia,College of Applied Technical Sciences,Nis,Serbia;

Faculty of Electronic Engineering,University of Nis,Aleksandra Medvedeva 14,18000 Nis,Serbia;

Faculty of Electronic Engineering,University of Nis,Aleksandra Medvedeva 14,18000 Nis,Serbia;

Faculty of Electronic Engineering,University of Nis,Aleksandra Medvedeva 14,18000 Nis,Serbia;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Instance selection; Mutual information; Time-series prediction;

机译：实例选择;相互信息;时间序列预测;

相似文献

外文文献
中文文献
专利

1. New method for instance or prototype selection using mutual information in time series prediction [J] . A. Guillen, L.J. Herrera, G. Rubio, Neurocomputing . 2010,第10a12期

机译：在时序预测中使用互信息进行实例或原型选择的新方法
2. Forecasting method for global radiation time series without training phase: Comparison with other well-known prediction methodologies [J] . Voyant Cyril, Motte Fabrice, Fouilloy Alexis, Energy . 2017,第FEBa1期

机译：没有训练阶段的全球辐射时间序列的预测方法：与其他著名的预测方法的比较
3. Forecasting the accuracy of genomic prediction with different selection targets in the training and prediction set as well as truncation selection [J] . Schopp Pascal, Riedelsheimer Christian, Utz H. Friedrich, Theoretical and Applied Genetics: International Journal of Breeding Research and Cell Genetics . 2015,第11期

机译：预测训练和预测集中具有不同选择目标的基因组预测的准确性以及截断选择
4. Time series prediction of retirement mutual fund values using optimal window size selection and support vector regression [C] . Piyawadee Sukkachart, Chotiros Surapholchai, Rajalida Lipikorn International Conference on Information Technology Systems and Innovation . 2017

机译：使用最佳窗口大小选择和支持向量回归的退休共同基金价值的时间序列预测
5. Instance selection for simplified decision trees through the generation and selection of instance candidate subsets. [D] . Bennette, Walter Dean. 2011

机译：通过实例候选子集的生成和选择，简化决策树的实例选择。
6. Estimating Conditional Transfer Entropy in Time Series Using Mutual Information and Nonlinear Prediction [O] . Payam Shahsavari Baboukani, Carina Graversen, Emina Alickovic, 2020

机译：使用相互信息和非线性预测估算时间序列的条件转移熵
7. Direct and Recursive Prediction of Time Series Using Mutual Information Selection [O] . Yongnan Ji, Jin Hao, Nima Reyhani, 2005

机译：基于互信息选择的时间序列直接递归预测

A methodology for training set instance selection using mutual information in time series prediction

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅