【24h】

Beyond Task Success: A Closer Look at Jointly Learning to See, Ask, and GuessWhat

机译:超越任务成功:仔细看看共同学习,看,询问和猜测

获取原文

摘要

We propose a grounded dialogue state encoder which addresses a foundational issue on how to integrate visual grounding with dialogue system components. As a test-bed, we focus on the GuessWhat?! game, a two-player game where the goal is to identify an object in a complex visual scene by asking a sequence of yes/no questions. Our visually-grounded encoder leverages synergies between guessing and asking questions, as it is trained jointly using multi-task learning. We further enrich our model via a cooperative learning regime. We show that the introduction of both the joint architecture and cooperative learning lead to accuracy improvements over the baseline system. We compare our approach to an alternative system which extends the baseline with reinforcement learning. Our in-depth analysis shows that the linguistic skills of the two models differ dramatically, despite approaching comparable performance levels. This points at the importance of analyzing the linguistic output of competing systems beyond numeric comparison solely based on task success.
机译:我们提出了一个接地的对话状态编码器,该编码器解决了如何将可视接地与对话系统组件集成的基础问题。作为一个测试床,我们专注于猜测?!游戏,一个双人游戏,目标是通过询问一系列/否问题来识别复杂的视觉场景中的对象。我们视觉上接地的编码器利用了猜测和提出问题之间的协同作用,因为它通过多任务学习共同培训。我们通过合作学习制度进一步丰富了我们的模型。我们表明,引入联合架构和合作学习的引入导致基线系统的准确性改进。我们将我们的方法与替代系统进行比较,延伸了钢筋学习的基线。我们深入的分析表明,尽管接近可比性水平,但两种模型的语言技能急剧差异。本指数仅基于任务成功,分析竞争系统的语言产量超出数字比较的重要性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号