首页> 外文会议> >Temporal-Difference learning an online support vector regression approach

【24h】

Temporal-Difference learning an online support vector regression approach

机译：时差学习在线支持向量回归方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper proposes a new algorithm for Temporal-Difference (TD) learning using online support vector regression. It benefits from the good generalization properties support vector regression (SVR) has, and also can do incremental learning and automatically track variation of environment with time-varying characteristics. Using the online SVR we can obtain good estimation of value function in TD learning in linear and nonlinear prediction problems. Experimental results demonstrate the effectiveness of the proposed method by comparison with others methods.

机译：本文提出了一种使用在线支持向量回归的时差学习算法。它得益于支持向量回归（SVR）的良好泛化特性，并且还可以进行增量学习并自动跟踪具有时变特征的环境变化。使用在线SVR，我们可以在线性和非线性预测问题中的TD学习中获得良好的价值函数估计。实验结果通过与其他方法的比较证明了该方法的有效性。

著录项

来源
《》|2015年|318-323|共6页
会议地点
作者
Teixeira Hugo Tanzarella; Bottura Celso Pascoli;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Approximation algorithms; Function approximation; Kernel; Markov processes; Prediction algorithms; Support vector machines; Machine Learning; Online Support Vector Machine; Reinforcement Learning; Temporal Difference Learning; Value Function Approximation;

机译：近似算法;函数逼近;核心;马尔可夫过程;预测算法;支持向量机;机器学习;在线支持向量机;强化学习;时间差异学习;值函数近似;

相似文献

外文文献
中文文献
专利

1. An adaptive online learning approach for Support Vector Regression: Online-SVR-FID [J] . Jie Liu, Enrico Zio Mechanical systems and signal processing . 2016,第auga期

机译：支持向量回归的自适应在线学习方法：Online-SVR-FID
2. Online learning for quantile regression and support vector regression [J] . Hu T., Xiang D.-H., Zhou D.-X. Journal of Statistical Planning and Inference . 2012,第12期

机译：在线学习，用于分位数回归和支持向量回归
3. A Soft Sensor Modelling of Biomass Concentration during Fermentation using Accurate Incremental Online v-Support Vector Regression Learning Algorithm [J] . Binjie Cu, Feng Pan American Journal of Biochemistry and Biotechnology . 2015,第3期

机译：使用精确增量在线v-Support向量回归学习算法的发酵过程中生物质浓度的软传感器建模
4. Temporal-Difference Learning An Online Support Vector Regression Approach [C] . Hugo Tanzarella Teixeira, Celso Pascoli Bottura International Conference on Informatics in Control, Automation and Robotics . 2015

机译：时间差异学习在线支持向量回归方法
5. Machine Learning: Several Advances in Linear Discriminant Analysis, Multi-View Regression and Support Vector Machine [D] . Zheng, Shuai 2017

机译：机器学习：线性判别分析，多视图回归和支持向量机的若干进展
6. A Hybrid Approach of Stepwise Regression Logistic Regression Support Vector Machine and Decision Tree for Forecasting Fraudulent Financial Statements [O] . Suduan Chen, Yeong-Jia James Goo, Zone-De Shen -1

机译：逐步欺诈逻辑回归支持向量机和决策树的混合方法用于预测欺诈性财务报表
7. An adaptive online learning approach for Support Vector Regression: Online-SVR-FID [O] . Jie Liu, Enrico Zio 2016

机译：支持向量回归的自适应在线学习方法：在线SVR-FID

Temporal-Difference learning an online support vector regression approach

摘要

著录项

相似文献

相关主题

期刊订阅