首页> 外国专利> METHOD OF GENERATING TRAINING DATA FOR TRAINING NEURAL NETWORK, METHOD OF TRAINING NEURAL NETWORK AND USING NEURAL NETWORK FOR AUTONOMOUS OPERATIONS

METHOD OF GENERATING TRAINING DATA FOR TRAINING NEURAL NETWORK, METHOD OF TRAINING NEURAL NETWORK AND USING NEURAL NETWORK FOR AUTONOMOUS OPERATIONS

机译:用于训练神经网络的训练数据的生成方法,用于自主操作的神经网络的训练方法和使用神经网络的方法

摘要

A method of generating training data for training a neural network, method of training a neural network and using a neural network for autonomous operations, related devices and systems. In one aspect, a neural network for autonomous operation of an object in an environment is trained. Policy values are generated based a sample data set. An approximate action-value function is generated from the policy values. A set of approximated policy values is generated using the approximate action-value function for all states in the sample data set for all possible actions. Attaining target for the neural network is calculated based on the approximated policy values. A training error is calculated as the difference between the training target and the policy value for the corresponding state-action pair in the sample data set. At least some of the parameters of the neural network are updated to minimize the training error.
机译:产生用于训练神经网络的训练数据的方法,训练神经网络并将神经网络用于自主操作的方法,相关设备和系统。一方面,训练了用于环境中对象的自主操作的神经网络。策略值是基于样本数据集生成的。根据策略值生成一个近似的行动值函数。使用近似操作值函数为样本数据集中的所有可能操作的所有状态生成一组近似策略值。根据近似的策略值计算神经网络的达到目标。计算出训练误差,作为样本数据集中相应状态-动作对的训练目标与策略值之间的差异。神经网络的至少一些参数被更新以最小化训练误差。

著录项

  • 公开/公告号WO2019141197A1

    专利类型

  • 公开/公告日2019-07-25

    原文格式PDF

  • 申请/专利权人 HUAWEI TECHNOLOGIES CO. LTD.;

    申请/专利号WO2019CN72049

  • 发明设计人 YAO HENGSHUAI;

    申请日2019-01-16

  • 分类号G06N3/08;

  • 国家 WO

  • 入库时间 2022-08-21 11:53:54

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号