【24h】

TIMEDIAL: Temporal Commonsense Reasoning in Dialog

机译:Timedial:对话中的时间致辞推理

获取原文

摘要

Everyday conversations require understanding everyday events, which in turn, requires understanding temporal commonsense concepts interwoven with those events. Despite recent progress with massive pre-trained language models (LMs) such as T5 and GPT-3, their capability of temporal reasoning in dialogs remains largely under-explored. In this paper, we present the first study to investigate pre-trained LMs for their temporal reasoning capabilities in dialogs by introducing a new task and a crowd-sourced English challenge set, TIMEDIAL. We formulate TIMEDIAL as a multiple choice cloze task with over 1.1K carefully curated dialogs. Empirical results demonstrate that even the best performing models struggle on this task compared to humans, with 23 absolute points of gap in accuracy. Furthermore, our analysis reveals that the models fail to reason about dialog context correctly; instead, they rely on shallow cues based on existing temporal patterns in context, motivating future research for modeling temporal concepts in text and robust contextual reasoning about them.
机译:日常对话需要了解日常事件,这反过来需要了解与这些事件交织的时间致辞概念。尽管最近具有大规模预训练的语言模型(LMS)如T5和GPT-3的进展,但它们在对话中的时间推理能力仍然很大程度上仍未探讨。在本文中,我们通过引入新的任务和众所周知的英语挑战,定时,首次研究第一项研究来调查训练预先训练的LMS在对话中的时间推理能力。我们将TimeDial标准为多选,CLOZE任务超过1.1K仔细策划对话框。实证结果表明,与人类相比,即使是最佳表演模型也争取这项任务,准确性的23个绝对间隙。此外,我们的分析表明,模型无法正确推理对话框;相反,它们根据上下文中的现有时间模式依赖浅线,激励未来的文本中的时间概念和关于它们的强大上下文推理的未来研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号