International Conference on Computational Linguistics

Similarity or deeper understanding? Analyzing the TED-Q dataset of evoked questions



Abstract

We take a close look at a recent dataset of TED-talks annotated with the questions they implicitly evoke, TED-Q (Westera et al., 2020). We test to what extent the relation between a discourse and the questions it evokes is merely one of similarity or association, as opposed to deeper semantic/pragmatic interpretation. We do so by turning the TED-Q dataset into a binary classification task, constructing an analogous task from explicit questions we extract from the BookCorpus (Zhu et al., 2015), and fitting a BERT-based classifier alongside models based on different notions of similarity. The BERT-based classifier, achieving close to human performance, outperforms all similarity-based models, suggesting that there is more to identifying true evoked questions than plain similarity.
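The abstract describes a binary set-up: a (discourse context, question) pair is labeled positive if the question was actually evoked at that point, and a fine-tuned BERT pair classifier is compared against models based only on similarity. As a rough illustration of what such a contrast can look like, here is a minimal sketch using Hugging Face transformers; it is not the authors' actual pipeline, and the model name, threshold, and pooling choices are assumptions.

```python
# Minimal sketch of the two kinds of models contrasted in the paper:
# (1) a BERT pair classifier and (2) a plain embedding-similarity baseline.
# Hyperparameters and the cosine threshold are illustrative assumptions,
# not the TED-Q authors' actual configuration.
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2  # evoked vs. not evoked
)

def classify_pair(context: str, question: str) -> int:
    """Score a (discourse context, candidate question) pair with the BERT classifier."""
    enc = tokenizer(context, question, truncation=True, return_tensors="pt")
    with torch.no_grad():
        logits = model(**enc).logits
    return int(logits.argmax(dim=-1))  # 1 = evoked, 0 = not evoked

def similarity_baseline(context: str, question: str, threshold: float = 0.5) -> int:
    """Similarity-only baseline: mean-pooled BERT embeddings plus cosine similarity."""
    def embed(text: str) -> torch.Tensor:
        enc = tokenizer(text, truncation=True, return_tensors="pt")
        with torch.no_grad():
            hidden = model.bert(**enc).last_hidden_state  # (1, seq_len, dim)
        return hidden.mean(dim=1).squeeze(0)
    sim = torch.nn.functional.cosine_similarity(embed(context), embed(question), dim=0)
    return int(sim.item() > threshold)
```

In the paper's setting, the pair classifier would be fine-tuned on positive pairs (questions annotated as evoked at that point in the talk) and sampled negatives, whereas the similarity baseline has no trainable notion of "evoked-ness"; that gap is exactly what the reported results turn on.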
