Multitask learning for biomedical named entity recognition with cross-sharing structure

Xi Wang; Jiagao Lyu; Li Dong; Ke Xu

首页> 外文期刊>BMC Bioinformatics >Multitask learning for biomedical named entity recognition with cross-sharing structure

【24h】

Multitask learning for biomedical named entity recognition with cross-sharing structure

机译：具有交叉共享结构的生物医学命名实体识别的多任务学习

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Biomedical named entity recognition (BioNER) is a fundamental and essential task for biomedical literature mining, which affects the performance of downstream tasks. Most BioNER models rely on domain-specific features or hand-crafted rules, but extracting features from massive data requires much time and human efforts. To solve this, neural network models are used to automatically learn features. Recently, multi-task learning has been applied successfully to neural network models of biomedical literature mining. For BioNER models, using multi-task learning makes use of features from multiple datasets and improves the performance of models. In experiments, we compared our proposed model with other multi-task models and found our model outperformed the others on datasets of gene, protein, disease categories. We also tested the performance of different dataset pairs to find out the best partners of datasets. Besides, we explored and analyzed the influence of different entity types by using sub-datasets. When dataset size was reduced, our model still produced positive results. We propose a novel multi-task model for BioNER with the cross-sharing structure to improve the performance of multi-task models. The cross-sharing structure in our model makes use of features from both datasets in the training procedure. Detailed analysis about best partners of datasets and influence between entity categories can provide guidance of choosing proper dataset pairs for multi-task training. Our implementation is available at https://github.com/JogleLew/bioner-cross-sharing .

机译：生物医学命名实体识别（BioNER）是生物医学文献挖掘的一项基本且必不可少的任务，它会影响下游任务的性能。大多数BioNER模型都依赖于特定于域的功能或手工制定的规则，但是从海量数据中提取功能需要大量时间和人力。为了解决这个问题，使用神经网络模型来自动学习特征。最近，多任务学习已成功应用于生物医学文献挖掘的神经网络模型。对于BioNER模型，使用多任务学习可利用多个数据集中的特征并提高模型的性能。在实验中，我们将我们提出的模型与其他多任务模型进行了比较，发现在基因，蛋白质，疾病类别的数据集上，我们的模型优于其他模型。我们还测试了不同数据集对的性能，以找出最佳的数据集合作伙伴。此外，我们使用子数据集探索和分析了不同实体类型的影响。当数据集大小减小时，我们的模型仍然产生了积极的结果。我们使用交叉共享结构为BioNER提出了一种新颖的多任务模型，以提高多任务模型的性能。我们模型中的交叉共享结构在训练过程中利用了两个数据集的特征。有关数据集的最佳合作伙伴以及实体类别之间的影响的详细分析可以为选择合适的数据集对进行多任务训练提供指导。我们的实现可在https://github.com/JogleLew/bioner-cross-sharing上获得。

著录项

来源
《BMC Bioinformatics》 |2019年第1期|共13页
作者
Xi Wang; Jiagao Lyu; Li Dong; Ke Xu;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类生物科学;
关键词
Multi-task learningNamed entity recognitionCross-sharing structure;

机译：多任务学习命名实体识别交叉共享结构;

相似文献

外文文献
中文文献
专利

1. Multitask learning for biomedical named entity recognition with cross-sharing structure [J] . Xi Wang, Jiagao Lyu, Li Dong, BMC Bioinformatics . 2019,第1期

机译：具有交叉共享结构的生物医学命名实体识别的多任务学习
2. A Multichannel Biomedical Named Entity Recognition Model Based on Multitask Learning and Contextualized Word Representations [J] . Hao Wei, Mingyuan Gao, Ai Zhou, Wireless communications & mobile computing . 2020,第1期

机译：基于多任务学习和上下文化字表示的多通道生物医学名为实体识别模型
3. Combining Multi-task Learning with Transfer Learning for Biomedical Named Entity Recognition [J] . Tahir Mehmood, Alfonso E. Gerevini, Alberto Lavelli, Procedia Computer Science . 2020,第5期

机译：将多任务学习与生物医学命名实体识别的转移学习相结合
4. Multitask Learning for Chinese Named Entity Recognition [C] . Qun Zhang, Zhenzhen Li, Dawei Feng, Pacific-Rim conference on multimedia . 2018

机译：用于中文命名实体识别的多任务学习
5. Semi-supervised Named Entity Recognition: Learning to recognize 100 entity types with little supervision [D] . Nadeau, David. 2007

机译：半监督的命名实体识别：在很少的监督下学习识别100种实体类型
6. Multitask learning for biomedical named entity recognition with cross-sharing structure [O] . Xi Wang, Jiagao Lyu, Li Dong, 2019

机译：具有交叉共享结构的生物医学命名实体识别的多任务学习
7. Multitask learning for biomedical named entity recognition with cross-sharing structure [O] . Xi Wang, Jiagao Lyu, Li Dong, 2019

机译：与交叉共享结构的生物医学命名实体识别的多址学习

Multitask learning for biomedical named entity recognition with cross-sharing structure

摘要

著录项

相似文献

相关主题

期刊订阅