首页> 外文会议>Nordic conference of computational Linguistics >May I Check Again? — A simple but efficient way to generate and use contextual dictionaries for Named Entity Recognition. Application to French Legal Texts
【24h】

May I Check Again? — A simple but efficient way to generate and use contextual dictionaries for Named Entity Recognition. Application to French Legal Texts

机译:我可以再检查一次吗? —生成和使用上下文词典进行命名实体识别的简单而有效的方法。适用于法国法律文本

获取原文

摘要

In this paper we present a new method to learn a model robust to typos for a Named Entity Recognition task. Our improvement over existing methods helps the model to take into account the context of the sentence inside a court decision in order to recognize an entity with a typo. We used state-of-the-art models and enriched the last layer of the neural network with high-level information linked with the potential of the word to be a certain type of entity. More precisely, we utilized the similarities between the word and the potential entity candidates in the tagged sentence context. The experiments on a dataset of French court decisions show a reduction of the relative F1-score error of 32%, upgrading the score obtained with the most competitive fine-tuned state-of-the-art system from 94.85% to 96.52%.
机译:在本文中,我们提出了一种新方法,用于为命名实体识别任务学习对拼写错误具有鲁棒性的模型。我们对现有方法的改进有助于模型在法院判决中考虑句子的上下文,从而识别出带有错字的实体。我们使用了最先进的模型,并在高级信息中丰富了神经网络的最后一层,这些信息与单词可能成为某种特定类型的实体有关。更准确地说,我们利用了标记句子上下文中单词与潜在实体候选者之间的相似性。在法国法院判决数据集上进行的实验表明,相对F1得分误差降低了32%,将竞争最激烈的,经过微调的最新技术系统所获得的得分从94.85%提高到96.52%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号