首页> 外文会议>International Conference on Big Data and Smart Computing >Bagging-based active learning model for named entity recognition with distant supervision
【24h】

Bagging-based active learning model for named entity recognition with distant supervision

机译:基于袋的主动学习模型在远程监督下的命名实体识别

获取原文

摘要

Named entity recognition (NER) is a preliminary step to performing information extraction and question answering. Most previous studies on NER have been based on supervised machine learning methods that need a large amount of human-annotated training corpus. In this paper, we propose a semi-supervised NER model to minimize the time-consuming and labor-intensive task for constructing the training corpus. The proposed model generates weakly labeled training corpus using a distant supervision method. Then, it improves NER accuracy by refining the weakly labeled training corpus using a bagging-based active learning method. In the experiments, the proposed model outperformed the previous semi-supervised model. It showed F1-measure of 0.764 after 15 times of bagging-based active learning.
机译:命名实体识别(NER)是执行信息提取和问题解答的初步步骤。以前有关NER的大多数研究都是基于需要大量人工注释训练语料的有监督的机器学习方法。在本文中,我们提出了一种半监督的NER模型,以最大程度地减少构建训练语料库的耗时且劳动强度大的任务。提出的模型使用远程监督方法生成标记较弱的训练语料库。然后,通过使用基于套袋的主动学习方法完善弱标记的训练语料库,可以提高NER准确性。在实验中,提出的模型优于先前的半监督模型。经过15次基于套袋的主动学习后,其F1测度为0.764。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号