Uncertainty-based Self-training for Biomedical Keyphrase Extraction

Abstract

To keep pace with the increased generation and digitization of documents, automated methods that can improve search, discovery and mining of the vast body of literature are essential. Keyphrases provide a concise representation by identifying salient concepts in a document. Various supervised approaches model keyphrase extraction using local context to predict the label for each token and perform much better than their unsupervised counterparts. However, existing supervised datasets have too few annotated examples to train better deep learning models. In contrast, many domains have large amounts of unannotated data that can be leveraged to improve model performance in keyphrase extraction. We introduce a self-learning based model that incorporates uncertainty estimates to select instances from large-scale unlabeled data to augment the small labeled training set. Performance evaluation on a publicly available biomedical dataset demonstrates that our method improves performance of keyphrase extraction over state-of-the-art models.
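
The selection step described in the abstract can be illustrated with a minimal sketch. The abstract does not say which uncertainty estimator or selection rule the authors use, so everything here is an assumption: the sketch supposes that a token-level tagger has been run several times with stochastic forward passes (for example, with dropout left on), scores each unlabeled sentence by the mean entropy of its averaged token predictions, and keeps the `k` least uncertain sentences as pseudo-labeled training data. The function names `predictive_entropy` and `select_confident`, the array layout, and the entropy criterion are hypothetical, and padding tokens are ignored for brevity.

```python
import numpy as np


def predictive_entropy(prob_samples):
    """Score each sentence by the uncertainty of its token predictions.

    prob_samples has shape (T, N, L, C): T stochastic forward passes,
    N unlabeled sentences, L tokens per sentence, C token labels.
    Returns an array of shape (N,), one uncertainty score per sentence.
    """
    # Average the label distributions over the T passes.
    mean_probs = prob_samples.mean(axis=0)  # (N, L, C)
    # Entropy of the averaged distribution at every token.
    token_entropy = -(mean_probs * np.log(mean_probs + 1e-12)).sum(axis=-1)  # (N, L)
    # Assumed aggregation: mean token entropy per sentence.
    return token_entropy.mean(axis=-1)


def select_confident(prob_samples, k):
    """Pick the k least uncertain sentences and their pseudo-labels.

    Returns (indices, pseudo_labels), where pseudo_labels has shape (k, L)
    and holds the argmax label of the averaged prediction for each token.
    """
    uncertainty = predictive_entropy(prob_samples)
    chosen = np.argsort(uncertainty)[:k]
    pseudo_labels = prob_samples.mean(axis=0).argmax(axis=-1)[chosen]
    return chosen, pseudo_labels
```

In a self-training loop under these assumptions, the selected sentences and their pseudo-labels would be added to the small labeled set, the tagger retrained, and the process repeated on the remaining unlabeled data.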

Bibliographic details

Source
IEEE EMBS International Conference on Biomedical and Health Informatics | 2021 | pp. 1-4 | 4 pages
Authors
Zelalem Gero; Joyce C. Ho
Original format
PDF
Keywords
Training; Performance evaluation; Uncertainty; Monte Carlo methods; Biological system modeling; Training data; Predictive models

Similar literature

1. Active self-training based on fault detection for biomedical event extraction [J]. Basic & clinical pharmacology & toxicology. 2019, Issue S10
2. Active self-training based on fault detection for biomedical event extraction [J]. Lu Yang, Ma Xiaolei, Jiang Mingyang. Basic & clinical pharmacology & toxicology. 2019, Issue S1
3. Single-Document Keyphrase Extraction for Multi-Document Keyphrase Extraction [J]. Gábor Berend, Richárd Farkas. Computacion y Sistemas. 2013, Issue 2
4. Unsupervised Keyphrase Extraction: Introducing New Kinds of Words to Keyphrases [C]. Tho Thi Ngoc Le, Minh Le Nguyen, Akira Shimazu. Australasian joint conference on artificial intelligence. 2016
5. Keyphrase Extraction and Its Applications to Digital Libraries [D]. Patel, Krutarth Indubhai. 2021
6. Deep neural model with self-training for scientific keyphrase extraction [O]. Xun Zhu, Chen Lyu, Donghong Ji. 2020
