SYSTEM AND METHOD FOR REINFORCEMENT LEARNING BASED CONTROLLED NATURAL LANGUAGE GENERATION

Abstract

A system for reinforcement-learning-based controlled natural language generation is disclosed. The system includes a token generator subsystem that generates an initial output phrase comprising a sequence of output tokens. The system also includes trained models, each associated with a corresponding predefined task. Each trained model includes an attention layer that computes attention-based weights for each output token, a scoring layer that generates a phrase-sequence-level score for the output phrase, and a reward generation layer that generates dense rewards for each output token from the attention-based weights and the phrase-sequence-level score. The trained models further include a feedback score generation layer that generates a feedback score from the dense rewards and from the reward weights assigned to the dense rewards of the corresponding trained models. The feedback score generation layer provides the feedback score iteratively to the token generator subsystem.
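The reward flow described in the abstract can be illustrated with a minimal sketch. The abstract does not say how the attention-based weights and the phrase-level score are combined, or whether the feedback score is per token or a single number, so the choices below are assumptions made only for illustration: the phrase score is spread over tokens in proportion to normalized attention, and the feedback is a per-token weighted sum across task models. The function names `dense_rewards` and `feedback_score` are hypothetical and do not come from the patent.

```python
from typing import List, Sequence


def dense_rewards(attention_weights: Sequence[float], phrase_score: float) -> List[float]:
    """Spread one phrase-level score over tokens in proportion to attention.

    Assumption: reward_i = phrase_score * attention_i / sum(attention).
    """
    total = sum(attention_weights)
    if total <= 0:
        raise ValueError("attention weights must sum to a positive value")
    return [phrase_score * w / total for w in attention_weights]


def feedback_score(
    per_model_rewards: Sequence[Sequence[float]],
    reward_weights: Sequence[float],
) -> List[float]:
    """Combine dense rewards from several task models into per-token feedback.

    Assumption: feedback_i = sum over models m of reward_weight_m * reward_{m,i}.
    """
    if not per_model_rewards or len(per_model_rewards) != len(reward_weights):
        raise ValueError("need exactly one reward weight per model")
    n_tokens = len(per_model_rewards[0])
    if any(len(r) != n_tokens for r in per_model_rewards):
        raise ValueError("all models must score the same number of tokens")
    return [
        sum(w * rewards[i] for w, rewards in zip(reward_weights, per_model_rewards))
        for i in range(n_tokens)
    ]


if __name__ == "__main__":
    # Made-up numbers for a four-token phrase and two hypothetical task models.
    task_a = dense_rewards([0.1, 0.2, 0.1, 0.6], phrase_score=0.8)
    task_b = dense_rewards([0.25, 0.25, 0.25, 0.25], phrase_score=0.4)
    print(feedback_score([task_a, task_b], reward_weights=[0.7, 0.3]))
```

Under these assumptions each model's dense rewards sum to its phrase-level score, so the attention layer only decides how credit is split among tokens. In an iterative loop, the resulting per-token feedback would be passed back to the token generator after each generated phrase.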

Bibliographic details

Publication number: WO2021247231A1
Publication date: 2021-12-09
Original format: PDF
Applicant/assignee: PM LABS INC.
Application number: WO2021US32848
Inventors: MAHESWARAN ARJUN; SUDHAKAR AKHILESH; UPADHYAY BHARGAV
Filing date: 2021-05-18
Classification: G06F40; G06N5; G06N5/02
Country: US
Date indexed: 2022-08-24 22:44:59
