...
首页> 外文期刊>IEEE Transactions on Pattern Analysis and Machine Intelligence >Dynamic Structure Embedded Online Multiple-Output Regression for Streaming Data
【24h】

Dynamic Structure Embedded Online Multiple-Output Regression for Streaming Data

机译:动态结构嵌入式在线多输出数据流回归

获取原文
获取原文并翻译 | 示例
           

摘要

Online multiple-output regression is an important machine learning technique for modeling, predicting, and compressing multi-dimensional correlated data streams. In this paper, we propose a novel online multiple-output regression method, called MORES, for streaming data. MORES can dynamically learn the structure of the regression coefficients to facilitate the model's continuous refinement. Considering that limited expressive ability of regression models often leading to residual errors being dependent, MORES intends to dynamically learn and leverage the structure of the residual errors to improve the prediction accuracy. Moreover, we introduce three modified covariance matrices to extract necessary information from all the seen data for training, and set different weights on samples so as to track the data streams' evolving characteristics. Furthermore, an efficient algorithm is designed to optimize the proposed objective function, and an efficient online eigen value decomposition algorithm is developed for the modified covariance matrix. Finally, we analyze the convergence of MORES in certain ideal condition. Experiments on two synthetic datasets and three real-world datasets validate the effectiveness and efficiency of MORES. In addition, MORES can process at least 2,000 instances per second (including training and testing) on the three real-world datasets, more than 12 times faster than the state-of-the-art online learning algorithm.
机译:在线多输出回归是用于建模,预测和压缩多维相关数据流的重要机器学习技术。在本文中,我们提出了一种新颖的在线多输出回归方法,称为MORES,用于流数据。 MORES可以动态地学习回归系数的结构,以促进模型的不断完善。考虑到回归模型的有限表达能力通常会导致残差依赖,因此MORES旨在动态学习和利用残差结构来提高预测准确性。此外,我们引入了三种改进的协方差矩阵,从所有可见数据中提取必要的信息以进行训练,并对样本设置不同的权重,以跟踪数据流的演化特征。此外,设计了一种有效的算法来优化所提出的目标函数,并为改进的协方差矩阵开发了一种有效的在线特征值分解算法。最后,我们分析了在理想条件下MORES的收敛性。在两个合成数据集和三个现实数据集上进行的实验验证了MORES的有效性和效率。此外,MORES每秒可以在三个真实数据集上每秒至少处理2,000个实例(包括训练和测试),比最新的在线学习算法快12倍以上。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号