Updating a Name Tagger Using Contemporary Unlabeled Data

Abstract

For many NLP tasks, including named entity tagging, semi-supervised learning has been proposed as a reasonable alternative to methods that require annotating large amounts of training data. In this paper, we address the problem of analyzing new data given a semi-supervised NE tagger trained on data from an earlier time period. We will show that updating the unlabeled data is sufficient to maintain quality over time, and outperforms updating the labeled data. Furthermore, we will also show that augmenting the unlabeled data with older data in most cases does not result in better performance than simply using a smaller amount of current unlabeled data.

Bibliographic information

Source: Joint conference of the annual meeting of the Association for Computational Linguistics; International joint conference on natural language processing of the Asian Federation of Natural Language Processing; ACL 2009; IJCNLP 2009. 2009, pp. 353-356 (4 pages)
Authors: Cristina Mota; Ralph Grishman
Original format: PDF
Chinese Library Classification: programming languages, algorithmic languages
