European Conference on IR Research

Contextualized Embeddings in Named-Entity Recognition: An Empirical Study on Generalization



Abstract

Contextualized embeddings use unsupervised language model pretraining to compute word representations that depend on their context. This is intuitively useful for generalization, especially in Named-Entity Recognition, where it is crucial to detect mentions never seen during training. However, standard English benchmarks overestimate the importance of lexical over contextual features because of an unrealistic lexical overlap between train and test mentions. In this paper, we perform an empirical analysis of the generalization capabilities of state-of-the-art contextualized embeddings by separating mentions by novelty and with out-of-domain evaluation. We show that they are particularly beneficial for unseen mention detection, especially out-of-domain. For models trained on CoNLL03, language model contextualization leads to a +1.2% maximal relative micro-F1 score increase in-domain, against +13% out-of-domain on the WNUT dataset.
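A minimal sketch (not from the paper) of the kind of evaluation the abstract describes: test mentions are partitioned by novelty, i.e. whether their surface form occurs among training mentions, and model scores are compared as relative micro-F1 changes. The function names and toy data below are illustrative assumptions, not the authors' code.

```python
# Illustrative sketch: split test mentions by novelty with respect to the
# training set, and compute a relative (not absolute) micro-F1 change.

def split_by_novelty(train_mentions, test_mentions):
    """Partition test mentions by whether their surface form was seen in training."""
    seen_forms = {m.lower() for m in train_mentions}
    seen = [m for m in test_mentions if m.lower() in seen_forms]
    unseen = [m for m in test_mentions if m.lower() not in seen_forms]
    return seen, unseen

def relative_f1_increase(f1_baseline, f1_contextual):
    """Relative micro-F1 change in percent, e.g. the +13% reported out-of-domain."""
    return (f1_contextual - f1_baseline) / f1_baseline * 100

if __name__ == "__main__":
    # Toy mentions for illustration only.
    train = ["Germany", "United Nations", "Peter Blackburn"]
    test = ["Germany", "WNUT", "Peter Blackburn", "SoundCloud"]
    seen, unseen = split_by_novelty(train, test)
    print(seen)    # ['Germany', 'Peter Blackburn']
    print(unseen)  # ['WNUT', 'SoundCloud']
    # Hypothetical baseline vs. contextualized scores: a 10.4-point gain on
    # a baseline of 80.0 is a +13.0% relative increase.
    print(f"{relative_f1_increase(80.0, 90.4):+.1f}%")  # +13.0%
```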
