首页> 外国专利> TEXT-BASED RESPONSE ENVIRONMENT ACTION SELECTION

TEXT-BASED RESPONSE ENVIRONMENT ACTION SELECTION

机译：基于文本的响应环境动作选择

页面导航

摘要
著录项
相似文献

摘要

In an approach, a processor trains a model, via a reinforcement learning process, to produce a first action function for relating states of a natural language based response environment to actions applicable to the natural language based response environment. A processor retrains the model, via the reinforcement learning process, to produce a second action function, including iterations of: applying the first action function to a current state representation of the natural language based response environment to obtain a ground-truth action representation, emphasizing a word of the current state representation based on relevancy to the ground-truth action representation to obtain a modified state representation, applying a model to the modified state representation to obtain an untrained action representation, and submitting the untrained action representation to a natural language based response environment to obtain a subsequent state representation, where the subsequent state representation becomes the current state representation for a subsequent iteration.

机译：在一种方法中，处理器通过增强学习过程培训模型，以产生用于将基于自然语言的响应环境的状态相关的第一动作功能，以适用于基于自然语言的响应环境的动作。处理器通过增强学习过程检索模型以产生第二动作功能，包括：将第一动作函数应用于基于自然语言的响应环境的当前状态表示，以获得地面真理动作表示，强调基于与地面真理动作表示的相关性的当前状态表示的词，以获得修改状态表示，将模型应用于修改状态表示以获得未训练的动作表示，并将未培训的动作表示提交给基于自然语言响应环境获得后续状态表示，其中后续状态表示成为随后迭代的当前状态表示。

著录项

公开/公告号US2021390387A1

专利类型
公开/公告日2021-12-16

原文格式PDF
申请/专利权人 INTERNATIONAL BUSINESS MACHINES CORPORATION;
展开▼

申请/专利号US202016901040
发明设计人 SUBHAJIT CHAUDHURY;DAIKI KIMURA;MICHIAKI TATSUBORI;ASIM MUNAWAR;
展开▼

申请日2020-06-15
分类号G06N3/08;H04L12/58;
国家 US
入库时间 2022-08-24 22:51:31

相似文献

专利
外文文献
中文文献