首页> 外国专利> TEXT-BASED RESPONSE ENVIRONMENT ACTION SELECTION

TEXT-BASED RESPONSE ENVIRONMENT ACTION SELECTION

机译:基于文本的响应环境动作选择

摘要

In an approach, a processor trains a model, via a reinforcement learning process, to produce a first action function for relating states of a natural language based response environment to actions applicable to the natural language based response environment. A processor retrains the model, via the reinforcement learning process, to produce a second action function, including iterations of: applying the first action function to a current state representation of the natural language based response environment to obtain a ground-truth action representation, emphasizing a word of the current state representation based on relevancy to the ground-truth action representation to obtain a modified state representation, applying a model to the modified state representation to obtain an untrained action representation, and submitting the untrained action representation to a natural language based response environment to obtain a subsequent state representation, where the subsequent state representation becomes the current state representation for a subsequent iteration.
机译:在一种方法中,处理器通过增强学习过程培训模型,以产生用于将基于自然语言的响应环境的状态相关的第一动作功能,以适用于基于自然语言的响应环境的动作。处理器通过增强学习过程检索模型以产生第二动作功能,包括:将第一动作函数应用于基于自然语言的响应环境的当前状态表示,以获得地面真理动作表示,强调基于与地面真理动作表示的相关性的当前状态表示的词,以获得修改状态表示,将模型应用于修改状态表示以获得未训练的动作表示,并将未培训的动作表示提交给基于自然语言响应环境获得后续状态表示,其中后续状态表示成为随后迭代的当前状态表示。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号