Conference on Empirical Methods in Natural Language Processing

LUKE: Deep Contextualized Entity Representations with Entity-aware Self-attention

Abstract

Entity representations are useful in natural language tasks involving entities. In this paper, we propose new pretrained contextualized representations of words and entities based on the bidirectional transformer (Vaswani et al., 2017). The proposed model treats words and entities in a given text as independent tokens, and outputs contextualized representations of them. Our model is trained using a new pretraining task based on the masked language model of BERT (Devlin et al., 2019). The task involves predicting randomly masked words and entities in a large entity-annotated corpus retrieved from Wikipedia. We also propose an entity-aware self-attention mechanism that is an extension of the self-attention mechanism of the transformer, and considers the types of tokens (words or entities) when computing attention scores. The proposed model achieves impressive empirical performance on a wide range of entity-related tasks. In particular, it obtains state-of-the-art results on five well-known datasets: Open Entity (entity typing), TACRED (relation classification), CoNLL-2003 (named entity recognition), ReCoRD (cloze-style question answering), and SQuAD 1.1 (extractive question answering).
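
The abstract notes that the entity-aware self-attention mechanism takes the type of each token (word or entity) into account when computing attention scores. As a rough, single-head PyTorch sketch of that idea (not the paper's exact implementation), one can use a separate query projection for each combination of query-token type and key-token type while sharing the key and value projections; the class name EntityAwareSelfAttention, the single-head simplification, and the shapes below are illustrative assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class EntityAwareSelfAttention(nn.Module):
    """Single-head sketch: attention scores use a different query projection
    for each (query type, key type) pair -- word-to-word, word-to-entity,
    entity-to-word, entity-to-entity -- while keys and values are shared."""

    def __init__(self, hidden_size: int):
        super().__init__()
        self.hidden_size = hidden_size
        # One query projection per (query type, key type) combination.
        self.q_w2w = nn.Linear(hidden_size, hidden_size)
        self.q_w2e = nn.Linear(hidden_size, hidden_size)
        self.q_e2w = nn.Linear(hidden_size, hidden_size)
        self.q_e2e = nn.Linear(hidden_size, hidden_size)
        # Key and value projections are shared across token types.
        self.key = nn.Linear(hidden_size, hidden_size)
        self.value = nn.Linear(hidden_size, hidden_size)

    def forward(self, hidden: torch.Tensor, is_entity: torch.Tensor) -> torch.Tensor:
        # hidden: (seq_len, hidden_size); is_entity: (seq_len,) bool, True for entity tokens.
        k = self.key(hidden)
        v = self.value(hidden)

        # Score matrices for all four type combinations: (4, seq_len, seq_len).
        scores = torch.stack([
            self.q_w2w(hidden) @ k.T,   # query = word,   key = word
            self.q_w2e(hidden) @ k.T,   # query = word,   key = entity
            self.q_e2w(hidden) @ k.T,   # query = entity, key = word
            self.q_e2e(hidden) @ k.T,   # query = entity, key = entity
        ]) / self.hidden_size ** 0.5

        # For each (i, j) pair, keep the score computed with the projection
        # matching the types of position i (query) and position j (key).
        q_type = is_entity.long().unsqueeze(1)         # (seq_len, 1)
        k_type = is_entity.long().unsqueeze(0)         # (1, seq_len)
        selector = (2 * q_type + k_type).unsqueeze(0)  # values in {0, 1, 2, 3}
        picked = scores.gather(0, selector).squeeze(0)

        attn = F.softmax(picked, dim=-1)
        return attn @ v                                # contextualized representations

# Hypothetical usage: 4 word tokens followed by 2 entity tokens.
layer = EntityAwareSelfAttention(hidden_size=768)
hidden = torch.randn(6, 768)
is_entity = torch.tensor([0, 0, 0, 0, 1, 1], dtype=torch.bool)
out = layer(hidden, is_entity)   # -> shape (6, 768)
```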
