Semi-supervised learning for named entity recognition using weakly labeled training data

机译：使用弱标签的训练数据进行半监督学习以进行命名实体识别

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

The shortage of the annotated training data is still an important challenge to building many Natural Language Process (NLP) tasks such as Named Entity Recognition. NER requires a large amount of training data with a high degree of human supervision whereas there is not enough labeled data for every language. In this paper, we use an unlabeled bilingual corpora to extract useful features from transferring information from resource-rich language toward resource-poor language and by using these features and a small training data, make a NER supervised model. Then we utilize a graph-based semi-supervised learning method that trains a CRF-based supervised classifier using that labeled data and uses high-confidence predictions on the unlabeled data to expand the training set and improve efficiency of NER model with the new training set.

机译：注释培训数据的短缺仍然是建立许多自然语言过程（NLP）任务（例如命名实体识别）的重要挑战。 NER需要大量的培训数据，并且需要高度的人工监督，而每种语言的标签数据不足。在本文中，我们使用未标记的双语语料库，从将信息从资源丰富的语言向资源贫乏的语言传递的信息中提取有用的特征，并利用这些特征和少量的训练数据，建立NER监督模型。然后，我们使用基于图的半监督学习方法，该方法使用标记的数据训练基于CRF的监督分类器，并对未标记的数据使用高置信度预测来扩展训练集并通过新的训练集提高NER模型的效率。

著录项

来源
《International Symposium on Artificial Intelligence amp; Signal Processing》|2015年|129-135|共7页
会议地点 Mashhad(IR)
作者
Zafarian Atefeh; Rokni Ali; Khadivi Shahram; Ghiasifard Sonia;
展开▼
作者单位

Dept. of Comput. Eng. IT, Amirkabir Univ. of Technol., Tehran, Iran;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
Bilingual parallel corpora; Named entity Recognition; graph-based semi-supervised learning;

机译：双语并行语料库命名实体识别基于图的半监督学习;

相似文献

外文文献
中文文献
专利

1. Learning to select pseudo labels:a semi-supervised method for named entity recognition [J] . Zhen-zhen LI, Da-wei FENG, Dong-sheng LI, 浙江大学学报（英文版）（C辑：计算机与电子） . 2020,第006期

机译：学习选择伪标签：一个用于命名实体识别的半监督方法
2. GNER: A Generative Model for Geological Named Entity Recognition Without Labeled Data Using Deep Learning [J] . Qinjun Qiu, Zhong Xie, Liang Wu, Earth and Space Science . 2019,第6期

机译：GNER：使用深度学习的无标记数据的地质命名实体识别的生成模型
3. Named entity recognition: a semi-supervised learning approach [J] . H. Sintayehu, G. S. Lehal International Journal of Information Technology . 2021,第4期

机译：命名实体识别：半监督学习方法
4. Semi-supervised learning for named entity recognition using weakly labeled training data [C] . Zafarian Atefeh, Rokni Ali, Khadivi Shahram, International Symposium on Artificial Intelligence Signal Processing . 2015

机译：使用弱标记培训数据的分半监督学习命名实体识别
5. Semi-supervised Named Entity Recognition: Learning to recognize 100 entity types with little supervision [D] . Nadeau, David. 2007

机译：半监督的命名实体识别：在很少的监督下学习识别100种实体类型
6. DTranNER: biomedical named entity recognition with deep learning-based label-label transition model [O] . S. K. Hong, Jae-Gil Lee 2020

机译：DTranNER：具有基于深度学习的标签-标签转换模型的生物医学命名实体识别
7. Semi-Supervised Noisy Label Learning for Chinese Medical Named Entity Recognition [O] . Zhucong Li, Zhen Gan, Baoli Zhang, 2021

机译：用于中国医疗名为实体认可的半监督嘈杂的标签学习

Semi-supervised learning for named entity recognition using weakly labeled training data

摘要

著录项

相似文献

相关主题

期刊订阅