首页>
外国专利>
SYSTEM AND METHOD FOR REINFORCEMENT LEARNING BASED CONTROLLED NATURAL LANGUAGE GENERATION
SYSTEM AND METHOD FOR REINFORCEMENT LEARNING BASED CONTROLLED NATURAL LANGUAGE GENERATION
展开▼
机译:基于钢筋学习自然语言生成的系统和方法
展开▼
页面导航
摘要
著录项
相似文献
摘要
A system for reinforcement learning based controlled natural language generation is disclosed. The system includes a token generator subsystem to generate an initial output phrase including a sequence of output tokens. The system includes trained models associated with corresponding predefined tasks. Each trained model includes an attention layer to compute attention-based weights for each output token. The trained models include a scoring layer to generate a phrase sequence level score for the output phrase. The trained models include a reward generation layer to generate dense rewards for each output token based on the attention- based weights and the phrase sequence level score. The trained models include a feedback score generation layer to generate a feedback score based on the dense rewards and reward weights assigned to the dense rewards of the corresponding trained models. The feedback score generation layer provides the feedback score iteratively to the token generator subsystem.
展开▼