首页> 外文会议>Workshop on Arabic Natural Language Processing >Empathetic BERT2BERT Conversational Model: Learning Arabic Language Generation with Little Data
【24h】

Empathetic BERT2BERT Conversational Model: Learning Arabic Language Generation with Little Data

机译:Impathetic Bert2bert会话模型:学习用少数数据学习阿拉伯语

获取原文

摘要

Enabling empathetic behavior in Arabic dialogue agents is an important aspect of building human-like conversational models. While Arabic Natural Language Processing has seen significant advances in Natural Language Understanding (NLU) with language models such as AraBERT, Natural Language Generation (NLG) remains a challenge. The shortcomings of NLG encoder-decoder models are primarily due to the lack of Arabic datasets suitable to train NLG models such as conversational agents. To overcome this issue, we propose a transformer-based encoder-decoder initialized with AraBERT parameters. By initializing the weights of the encoder and decoder with AraBERT pre-trained weights, our model was able to leverage knowledge transfer and boost performance in response generation. To enable empathy in our conversational model, we train it using the ArabicEmpatheticDialogues dataset and achieve high performance in empathetic response generation. Specifically, our model achieved a low perplexity value of 17.0 and an increase in 5 BLEU points compared to the previous state-of-the-art model. Also, our proposed model was rated highly by 85 human evaluators, validating its high capability in exhibiting empathy while generating relevant and fluent responses in open-domain settings.
机译:在阿拉伯语对话代理中实现同情性的行为是建立像人类的会话模型的一个重要方面。虽然阿拉伯语自然语言处理已经看出了自然语言理解(NLU)的重要进展,语言模型如阿拉伯语,自然语言生成(NLG)仍然是一个挑战。 NLG编码器 - 解码器模型的缺点主要是由于缺乏适用于培训诸如会话代理的NLG模型的阿拉伯语数据集。为了克服这个问题,我们提出了一个基于变压器的编码器解码器,初始化了Arabert参数。通过初始化具有Arabert预训练的权重的编码器和解码器的权重,我们的模型能够利用知识转移和促进响应生成的性能。为了在我们的会话模型中启用同理心,我们使用ArabiceMPatheticDialogues数据集培训并在Impathetic响应生成中实现高性能。具体而言,我们的模型实现了17.0的低困惑值,而与以前的最先进的模型相比,5个BLEU点增加。此外,我们的拟议模型高度评价了85人评估员,验证其高能力在表现出在开放域设置中产生相关和流畅的反应时。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号