首页> 外国专利> PRE-TRAINING NEURAL NETWORKS WITH HUMAN DEMONSTRATIONS FOR DEEP REINFORCEMENT LEARNING

PRE-TRAINING NEURAL NETWORKS WITH HUMAN DEMONSTRATIONS FOR DEEP REINFORCEMENT LEARNING

机译:带有人类演示的预训练神经网络,用于深度强化学习

摘要

Disclosed herein are a system and method for providing a machine learning architecture based on monitored demonstrations. The system may include: a non-transitory computer-readable memory storage; at least one processor configured for dynamically training a machine learning architecture for performing one or more sequential tasks, the at least one processor configured to provide: a data receiver for receiving one or more demonstrator data sets, each demonstrator data set including a data structure representing the one or more state-action pairs; a neural network of the machine learning architecture, the neural network including a group of nodes in one or more layers; and a pre-training engine configured for processing the one or more demonstrator data sets to extract one or more features, the extracted one or more features used to pre-train the neural network based on the one or more state-action pairs observed in one or more interactions with the environment.
机译:本文公开了一种用于基于监视的示范来提供机器学习架构的系统和方法。该系统可以包括:非暂时性计算机可读存储器存储;以及至少一个处理器,其被配置为动态地训练用于执行一个或多个顺序任务的机器学习架构,所述至少一个处理器被配置为:提供用于接收一个或多个演示器数据集的数据接收器,每个演示器数据集包括代表一个或多个状态动作对;机器学习架构的神经网络,该神经网络包括一层或多层中的一组节点;以及预训练引擎,其被配置用于处理一个或多个演示器数据集以提取一个或多个特征,所提取的一个或多个特征用于基于在一个观察到的一个或多个状态动作对来预训练神经网络。或与环境的更多互动。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号