首页> 外文会议>AAAI Conference on Artificial Intelligence >Re-Evaluating ADEM: A Deeper Look at Scoring Dialogue Responses
【24h】

Re-Evaluating ADEM: A Deeper Look at Scoring Dialogue Responses

机译:重新评估Adem:更深层次的观看分解对话响应

获取原文

摘要

Automatically evaluating the quality of dialogue responses for unstructured domains is a challenging problem. ADEM (Lowe et al. 2017) formulated the automatic evaluation of dialogue systems as a learning problem and showed that such a model was able to predict responses which correlate significantly with human judgements, both at utterance and system level. Their system was shown to have beaten word-overlap metrics such as BLEU with large margins. We start with the question of whether an adversary can game the ADEM model. We design a battery of targeted attacks at the neural network based ADEM evaluation system and show that automatic evaluation of dialogue systems still has a long way to go. ADEM can get confused with a variation as simple as reversing the word order in the text! We report experiments on several such adversarial scenarios that draw out counterintuitive scores on the dialogue responses. We take a systematic look at the scoring function proposed by ADEM and connect it to linear system theory to predict the shortcomings evident in the system. We also devise an attack that can fool such a system to rate a response generation system as favorable. Finally, we allude to future research directions of using the adversarial attacks to design a truly automated dialogue evaluation system.
机译:自动评估非结构化域对话响应的质量是一个具有挑战性的问题。 Adem(Lowe等人2017)制定了对话系统的自动评估作为学习问题,并表明这种模型能够在话语和系统水平上预测与人类判断相关的反应。他们的系统被证明已经击败了具有大边缘的Bleu等词汇量标。我们首先是对敌人可以游戏Adem模型的问题。我们在基于神经网络的ADEM评估系统中设计了一种有针对性攻击的电池,并显示对话系统的自动评估仍然有很长的路要走。 Adem可能会与变型混淆,尽可能简单地颠倒文本中的单词顺序!我们向几个这样的对抗方案报告实验,这些情况在对话反应上阐述了对抗直接评分。我们对Adem提出的评分功能进行了系统的,并将其连接到线性系统理论,以预测系统中明显的缺点。我们还设计了一个可以欺骗这样一个系统来评估响应生成系统的攻击。最后,我们提到了未来的研究方向,使用对抗攻击来设计真正自动化的对话评估系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号