首页> 外文OA文献 >Transfer Learning for Named Entity Recognition in Financial and Biomedical Documents

【2h】

Transfer Learning for Named Entity Recognition in Financial and Biomedical Documents

机译：在金融和生物医学文件中转移学习的名称实体识别

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

Recent deep learning approaches have shown promising results for named entity recognition (NER). A reasonable assumption for training robust deep learning models is that a sufficient amount of high-quality annotated training data is available. However, in many real-world scenarios, labeled training data is scarcely present. In this paper we consider two use cases: generic entity extraction from financial and from biomedical documents. First, we have developed a character based model for NER in financial documents and a word and character based model with attention for NER in biomedical documents. Further, we have analyzed how transfer learning addresses the problem of limited training data in a target domain. We demonstrate through experiments that NER models trained on labeled data from a source domain can be used as base models and then be fine-tuned with few labeled data for recognition of different named entity classes in a target domain. We also witness an interest in language models to improve NER as a way of coping with limited labeled data. The current most successful language model is BERT. Because of its success in state-of-the-art models we integrate representations based on BERT in our biomedical NER model along with word and character information. The results are compared with a state-of-the-art model applied on a benchmarking biomedical corpus.

机译：最近的深入学习方法已经显示了命名实体识别（NER）的有希望的结果。培训强大的深度学习模型的合理假设是有足够的高质量注释培训数据。但是，在许多真实世界的情景中，几乎没有存在标记的训练数据。在本文中，我们考虑两种用例：从金融和生物医学文件中提取通用实体提取。首先，我们在生物医学文档中为新的金融文档和基于字符和字符的模型开发了基于字符的型号。此外，我们已经分析了转移学习如何解决目标域中有限培训数据的问题。我们通过实验证明了从源域标记数据训练的NER模型可以用作基础模型，然后用几个标记的数据进行微调，以识别目标域中的不同命名实体类。我们还目睹了对语言模型的兴趣，以改善NER作为应对有限标记数据的方式。目前最成功的语言模型是伯特。由于它在最先进的模型中取得了成功，我们将基于BERT的伯特与单词和字符信息相结合。将结果与应用于基准生物医学语料库的最新模型进行了比较。

著录项

作者
Sumam Francis; Jordy Van Landeghem; Marie-Francine Moens;
展开▼
作者单位

展开▼
年度 2019
总页数
原文格式 PDF
正文语种 eng
中图分类

相似文献

外文文献
中文文献
专利

1. Transfer Learning for Named Entity Recognition in Financial and Biomedical Documents [J] . Sumam Francis, Jordy Van Landeghem, Marie-Francine Moens Information . 2019,第8期

机译：在财务和生物医学文档中进行转移学习以进行命名实体识别
2. Combining Multi-task Learning with Transfer Learning for Biomedical Named Entity Recognition [J] . Tahir Mehmood, Alfonso E. Gerevini, Alberto Lavelli, Procedia Computer Science . 2020,第5期

机译：将多任务学习与生物医学命名实体识别的转移学习相结合
3. Transfer learning for biomedical named entity recognition with neural networks [J] . Giorgi John M., Bader Gary D. Bioinformatics . 2018,第23期

机译：与神经网络的生物医学命名实体识别的转移学习
4. Transfer Learning in Biomedical Named Entity Recognition: An Evaluation of BERT in the PharmaCoNER task [C] . Cong Sun, Zhihao Yang Workshop on bioNLP open shared tasks . 2019

机译：生物医学命名实体识别中的转移学习：PharmaCoNER任务中BERT的评估
5. Named entity recognition and an application to document clustering [D] . Wei, Gang 2004

机译：命名实体识别及其在文档聚类中的应用
6. Transfer learning for biomedical named entity recognition with neural networks [O] . John M Giorgi, Gary D Bader -1

机译：利用神经网络进行生物医学命名实体识别的转移学习
7. Transfer Learning in Biomedical Named Entity Recognition: An Evaluation of BERT in the PharmaCoNER task [O] . Cong Sun, Zhihao Yang 2019

机译：在生物医学命名实体识别中转移学习：Pharmaconer任务中伯特的评估

Transfer Learning for Named Entity Recognition in Financial and Biomedical Documents

摘要

著录项

相似文献

相关主题

期刊订阅