Improving Gradient Estimation by Incorporating Sensor Data

机译：通过合并传感器数据来改善梯度估计

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

An efficient policy search algorithm should estimate the local gradient of the objective function, with respect to the policy parameters, from as few trials as possible. Whereas most policy search methods estimate this gradient by observing the rewards obtained during policy trials, we show, both theoretically and empirically, that taking into account the sensor data as well gives better gradient estimates and hence faster learning. The reason is that rewards obtained during policy execution vary from trial to trial due to noise in the environment; sensor data, which correlates with the noise, can be used to partially correct for this variation, resulting in an estimator with lower variance.

机译：一个有效的策略搜索算法应该根据尽可能少的试验来估计目标函数相对于策略参数的局部梯度。尽管大多数策略搜索方法都是通过观察策略试验期间获得的奖励来估计此梯度的，但我们在理论和经验上均表明，将传感器数据也考虑在内也可以提供更好的梯度估计，从而可以更快地学习。原因是由于环境中的噪音，在执行政策期间获得的报酬因试验而异。与噪声相关的传感器数据可用于部分校正此变化，从而使估算器具有较低的方差。

著录项

来源
《Reliability and Quality in Design》|2008年|P.375-382|共8页
会议地点 OrlandoFL(US)
作者
Gregory Lawrence; Stuart Russell;
展开▼
作者单位

International Society of Science and Applied Technologies(ISSAT);

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类工业设计;
关键词

相似文献

外文文献
中文文献
专利

1. An alternative method to improve gravity field models by incorporating GOCE gradient data [J] . Wan Xiaoyun, Ran Jiangjun Earth sciences research journal . 2018,第3期

机译：通过合并GOCE梯度数据来改善重力场模型的另一种方法
2. Incorporation of scheduling and adaptive historical data in the Sensor-Utility-Network method for occupancy estimation [J] . Tim Ryan, Jeffrey S. Vipperman Energy and Buildings . 2013,第juna期

机译：将调度和自适应历史数据合并到Sensor-Utility-Network方法中以进行占用估算
3. Improving juvenile stature estimation by incorporating maturational data [J] . Lenover Makenna B., Seselj Maja American Journal of Physical Anthropology . 2020,第S69期

机译：通过纳入良好数据来改善少年身高估计
4. Improving Gradient Estimation by Incorporating Sensor Data [C] . Gregory Lawrence, Stuart Russell Conference on Uncertainty in Artificial Intelligence . 2008

机译：通过合并传感器数据来改善梯度估计
5. A Low-Signal-to-Noise-Ratio Sensor Framework Incorporating Improved Nighttime Capabilities in DIRSIG. [D] . Rizzuto, Anthony P. 2009

机译：一种低信号噪声比的传感器框架，在DIRSIG中整合了改进的夜间功能。
6. Improved Convolutional Pose Machines for Human Pose Estimation Using Image Sensor Data [O] . Baohua Qiang, Shihao Zhang, Yongsong Zhan, 2019

机译：使用图像传感器数据进行人体姿态估计的改进卷积姿态机
7. An alternative method to improve gravity field models by incorporating GOCE gradient data [O] . Xiaoyun Wan, Jiangjun Ran 2018

机译：通过结合鼠标梯度数据来改进重力场模型的替代方法

Improving Gradient Estimation by Incorporating Sensor Data

摘要

著录项

相似文献

相关主题

期刊订阅