【24h】

Learning non-cooperative dialogue behaviours

机译:学习非合作对话行为

获取原文

摘要

Non-cooperative dialogue behaviour has been identified as important in a variety of application areas, including education, military operations, video games and healthcare. However, it has not been addressed using statistical approaches to dialogue management, which have always been trained for co-operative dialogue. We develop and evaluate a statistical dialogue agent which learns to perform non-cooperative dialogue moves in order to complete its own objectives in a stochastic trading game. We show that, when given the ability to perform both cooperative and non-cooperative dialogue moves, such an agent can learn to bluff and to lie so as to win games more often - against a variety of adversaries, and under various conditions such as risking penalties for being caught in deception. For example, we show that a non-cooperative dialogue agent can learn to win an additional 15.47% of games against a strong rule-based adversary, when compared to an optimised agent which cannot perform non-cooperative moves. This work is the first to show how an agent can learn to use non-cooperative dialogue to effectively meet its own goals.
机译:非合作对话行为已被认为在包括教育,军事行动,视频游戏和医疗保健在内的许多应用领域中都很重要。但是,尚未使用统计方法进行对话管理来解决该问题,而对话管理始终经过训练以进行合作对话。我们开发和评估一个统计对话代理,该代理学习执行非合作对话动作,以完成其在随机交易游戏中的目标。我们证明,只要具备执行合作和非合作对话动作的能力,这样的特工就可以学会虚张声势和撒谎,以便更频繁地赢得比赛-对抗各种对手,并在各种条件下(例如冒险)被欺骗的处罚。例如,我们显示,与不能执行非合作动作的优化代理相比,非合作对话代理可以学会赢得更多的15.47%的游戏,以对抗强大的基于规则的对手。这项工作是第一个展示代理如何学习非合作对话以有效实现其目标的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号