TIMEDIAL: Temporal Commonsense Reasoning in Dialog

机译：Timedial：对话中的时间致辞推理

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Everyday conversations require understanding everyday events, which in turn, requires understanding temporal commonsense concepts interwoven with those events. Despite recent progress with massive pre-trained language models (LMs) such as T5 and GPT-3, their capability of temporal reasoning in dialogs remains largely under-explored. In this paper, we present the first study to investigate pre-trained LMs for their temporal reasoning capabilities in dialogs by introducing a new task and a crowd-sourced English challenge set, TIMEDIAL. We formulate TIMEDIAL as a multiple choice cloze task with over 1.1K carefully curated dialogs. Empirical results demonstrate that even the best performing models struggle on this task compared to humans, with 23 absolute points of gap in accuracy. Furthermore, our analysis reveals that the models fail to reason about dialog context correctly; instead, they rely on shallow cues based on existing temporal patterns in context, motivating future research for modeling temporal concepts in text and robust contextual reasoning about them.

机译：日常对话需要了解日常事件，这反过来需要了解与这些事件交织的时间致辞概念。尽管最近具有大规模预训练的语言模型（LMS）如T5和GPT-3的进展，但它们在对话中的时间推理能力仍然很大程度上仍未探讨。在本文中，我们通过引入新的任务和众所周知的英语挑战，定时，首次研究第一项研究来调查训练预先训练的LMS在对话中的时间推理能力。我们将TimeDial标准为多选，CLOZE任务超过1.1K仔细策划对话框。实证结果表明，与人类相比，即使是最佳表演模型也争取这项任务，准确性的23个绝对间隙。此外，我们的分析表明，模型无法正确推理对话框;相反，它们根据上下文中的现有时间模式依赖浅线，激励未来的文本中的时间概念和关于它们的强大上下文推理的未来研究。

著录项

来源
《International Joint Conference on Natural Language Processing;Annual Meeting of the Association for Computational Linguistics》|2021年|7066-7076|共11页
会议地点
作者
Lianhui Qin; Aditya Gupta; Shyam Upadhyay; Luheng He; Yejin Choi; Manaal Faruqui;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Geospatial Narratives and Their Spatio-Temporal Dynamics: Commonsense Reasoning for High-Level Analyses in Geographic Information Systems [J] . Jan Oliver Wallgr#x000FC, n, Mehul Bhatt ISPRS International Journal of Geo-Information . 2013,第1期

机译：地理空间叙事及其时空动态：地理信息系统中高级分析的常识性推理
2. Commonsense reasoning and commonsense knowledge in artificial intelligence [J] . Lalit Saxena Computing reviews . 2016,第8期

机译：人工智能中的常识推理和常识知识
3. Commonsense Reasoning and Commonsense Knowledge in Artificial Intelligence [J] . Davis Ernest, Marcus Gary Communications of the ACM . 2015,第9期

机译：人工智能中的常识推理和常识知识
4. Leveraging commonsense reasoning and multimodal perception for robot spoken dialog systems [C] . Dongcai Lu, Shiqi Zhang, Peter Stone, IEEE/RSJ International Conference on Intelligent Robots and Systems . 2017

机译：利用常识推理和多模式感知来进行机器人语音对话系统
5. Understanding Natural Language with Commonsense Knowledge Representation, Reasoning, and Simulation [D] . Bosselut, Antoine. 2020

机译：了解具有型号知识表示，推理和模拟的自然语言
6. Producing or reproducing reasoning? Socratic dialog is very effective, but only for a few [O] . Andrea Paula Goldin, Olivia Pedroncini, Mariano Sigman 2012

机译：生产还是重现推理？苏格拉底式对话非常有效，但仅适用于少数情况
7. Geospatial Narratives and their Spatio-Temporal Dynamics ˚ Commonsense Reasoning for High-level Analyses in Geographic Information Systems [O] . Duccio Rocchini, Mehul Bhatt, Jan Oliver Wallgruen 2016

机译：地理空间叙事及其时空动态˚地理信息系统高层次分析的常识推理

TIMEDIAL: Temporal Commonsense Reasoning in Dialog

摘要

著录项

相似文献

相关主题

期刊订阅