Coner: A Collaborative Approach for Long-Tail Named Entity Recognition in Scientific Publications

机译：Coner：科学出版物中长尾命名实体识别的协作方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Named Entity Recognition (NER) for rare long-tail entities as e.g., often found in domain-specific scientific publications is a challenging task, as typically the extensive training data and test data for fine-tuning NER algorithms is lacking. Recent approaches presented promising solutions relying on training NER algorithms in an iterative weakly-supervised fashion, thus limiting human interaction to only providing a small set of seed terms. Such approaches heavily rely on heuristics in order to cope with the limited training data size. As these heuristics are prone to failure, the overall achievable performance is limited. In this paper, we therefore introduce a collaborative approach which incrementally incorporates human feedback on the relevance of extracted entities into the training cycle of such iterative NER algorithms. This approach, called Coner, allows to still train new domain specific rare long-tail NER extractors with low costs, but with ever increasing performance while the algorithm is actively used in an application.

机译：例如，通常在特定领域的科学出版物中经常见到的稀有长尾实体的命名实体识别（NER）是一项艰巨的任务，因为通常缺少用于微调NER算法的大量训练数据和测试数据。最近的方法提出了有希望的解决方案，该解决方案依赖于以迭代的弱监督方式训练NER算法，从而将人机交互限制为仅提供少量种子项。为了应对有限的训练数据量，这种方法严重依赖于启发法。由于这些试探法容易失败，因此可实现的总体性能受到限制。因此，在本文中，我们引入了一种协作方法，该方法将有关提取的实体的相关性的人类反馈逐步纳入这种迭代NER算法的训练周期中。这种称为Coner的方法仍允许以低成本训练新的领域特定的稀有长尾NER提取器，但在算法被积极地应用到应用程序时，性能却不断提高。

著录项

来源
《International conference on theory and practice of digital libraries》|2019年|3-17|共15页
会议地点
作者
Daniel Vliegenthart; Sepideh Mesbah; Christoph Lofi; Akiko Aizawa; Alessandro Bozzon;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Arabic Named Entity Recognition Using Artificial Neural Network | Science Publications [J] . Naji F. Mohammed, Nazlia Omar Journal of computer sciences . 2012,第8期

机译：人工神经网络的阿拉伯命名实体识别科学出版物
2. Arabic Named Entity Recognition Using Artificial Neural Network | Science Publications [J] . Naji F. Mohammed, Nazlia Omar Journal of computer sciences . 2012,第8期

机译：人工神经网络的阿拉伯命名实体识别科学出版物
3. Automatic Recognition of Chemical Entity Mentions in Texts of Scientific Publications [J] . Biziukova N. Yu, Tarasova O. A., Rudik A. V, Automatic Documentation and Mathematical Linguistics . 2020,第6期

机译：在科学出版物文本中自动识别化学实体提到
4. Coner: A Collaborative Approach for Long-Tail Named Entity Recognition in Scientific Publications [C] . Daniel Vliegenthart, Sepideh Mesbah, Christoph Lofi, International conference on theory and practice of digital libraries . 2019

机译：锥形：科学出版物中的长尾名称实体识别的协作方法
5. A data-intensive approach to named entity recognition using domain and language independent methods [D] . Osesina, Olukayode Isaac. 2010

机译：使用领域和语言无关的方法进行的数据密集型命名实体识别方法
6. A Weakly-Supervised Named Entity Recognition Machine Learning Approach for Emergency Medical Services Clinical Audit [O] . Han Wang, Wesley Lok Kin Yeung, Qin Xiang Ng, 2021

机译：紧急医疗服务临床审计的弱监督名为实体识别机器学习方法
7. TSE-NER: An Iterative Approach for Long-Tail Entity Extraction in Scientific Publications [O] . Sepideh Mesbah, Christoph Lofi, Manuel Valle Torre, 2018

机译：TSE-ner：科学出版物中的长尾实体提取迭代方法

Coner: A Collaborative Approach for Long-Tail Named Entity Recognition in Scientific Publications

摘要

著录项

相似文献

相关主题

期刊订阅