Investigation on Data Adaptation Techniques for Neural Named Entity Recognition

机译：神经名称实体识别数据适应技术的调查

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Data processing is an important step in various natural language processing tasks. As the commonly used datasets in named entity recognition contain only a limited number of samples, it is important to obtain additional labeled data in an efficient and reliable manner. A common practice is to utilize large monolingual unla-beled corpora. Another popular technique is to create synthetic data from the original labeled data (data augmentation). In this work, we investigate the impact of these two methods on the performance of three different named entity recognition tasks.

机译：数据处理是各种自然语言处理任务的重要步骤。由于命名实体识别中的常用数据集仅包含有限数量的样本，因此重要的是以有效可靠的方式获得其他标记数据。一个常见的做法是利用大型单机Ulbled Corpora。另一种流行的技术是从原始标记数据（数据增强）创建合成数据。在这项工作中，我们调查这两种方法对三种不同命名实体识别任务的性能的影响。

著录项

来源
《Annual Meeting of the Association for Computational Linguistics;International Joint Conference on Natural Language Processing》|2021年|1-15|共15页
会议地点
作者
Evgeniia Tokarchuk; David Thulke; Weiyue Wang; Christian Dugast; Hermann Ney;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
入库时间 2022-08-26 13:58:31

相似文献

外文文献
中文文献
专利

1. Data Augmentation Techniques on Arabic Data for Named Entity Recognition [J] . Caroline Sabty, Islam Omar, Fady Wasfalla, Procedia Computer Science . 2021,第a期

机译：用于命名实体识别的阿拉伯语数据的数据增强技术
2. Myanmar named entity corpus and its use in syllable-based neural named entity recognition [J] . Hsu Myat Mo, Khin Mar Soe International Journal of Electrical and Computer Engineering . 2020,第2期

机译：缅甸名为实体语料库及其在基于音节的神经名为实体识别中的用途
3. Interlinking SciGraph and DBpedia Datasets Using Link Discovery and Named Entity Recognition Techniques [J] . Beyza Yaman, Michele Pasin, Markus Freudenberg OASIcs : OpenAccess Series in Informatics . 2019,第1期

机译：使用链接发现和命名实体识别技术互连SciGraph和DBpedia数据集
4. Named Entity Chunking Techniques in Supervised Learning for Japanese Named Entity Recognition [C] . Manabu Sassano, Takehito Utsuro International conference on computational linguistics;COLING 2000 . 2000

机译：日语命名实体识别的监督学习中的命名实体分块技术
5. A data-intensive approach to named entity recognition using domain and language independent methods [D] . Osesina, Olukayode Isaac. 2010

机译：使用领域和语言无关的方法进行的数据密集型命名实体识别方法
6. CollaboNet: collaboration of deep neural networks for biomedical named entity recognition [O] . Wonjin Yoon, Chan Ho So, Jinhyuk Lee, 2019

机译：CollaboNet：用于生物医学命名实体识别的深度神经网络协作
7. Named Entity Chunking Techniques in Supervised Learning for Japanese Named Entity Recognition [O] . Manabu Sassano, Takehito Utsuro 2000

机译：日语命名实体识别监督学习中的命名实体分块技术

Investigation on Data Adaptation Techniques for Neural Named Entity Recognition

摘要

著录项

相似文献

相关主题

期刊订阅