首页> 外国专利> SYSTEM AND METHODS FOR TRAINING TASK-ORIENTED DIALOGUE (TOD) LANGUAGE MODELS

SYSTEM AND METHODS FOR TRAINING TASK-ORIENTED DIALOGUE (TOD) LANGUAGE MODELS

机译:面向任务的对话(TOD)语言模型训练系统和方法

摘要

Embodiments described herein provide methods and systems for training task-oriented dialogue (TOD) language models. In some embodiments, a TOD language model may receive a TOD dataset including a plurality of dialogues and a model input sequence may be generated from the dialogues using a first token prefixed to each user utterance and a second token prefixed to each system response of the dialogues. In some embodiments, the first token or the second token may be randomly replaced with a mask token to generate a masked training sequence and a masked language modeling (MLM) loss may be computed using the masked training sequence. In some embodiments, the TOD language model may be updated based on the MLM loss.
机译:本文描述的实施例提供了用于训练面向任务对话(TOD)语言模型的方法和系统。在一些实施例中,TOD语言模型可以接收包括多个对话的TOD数据集,并且可以使用前缀为每个用户话语的第一标记和前缀为对话的每个系统响应的第二标记从对话生成模型输入序列。在一些实施例中,可以用掩码令牌随机替换第一令牌或第二令牌以生成掩码训练序列,并且可以使用掩码训练序列计算掩码语言建模(MLM)损失。在一些实施例中,可以基于传销损失更新TOD语言模型。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号