A Named Entity Recognition Shootout for German

机译：德语的命名实体识别大战

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We ask how to practically build a model for German named entity recognition (NER) that performs at the state of the art for both contemporary and historical texts, i.e., a big-data and a small-data scenario. The two best-performing model families are pitted against each other (linear-chain CRFs and BiLSTM) to observe the trade-off between expressiveness and data requiremenis. BiLSTM outperforms the CRF when large datasets are available and performs inferior for the smallest dataset. BiLSTMs profit substantially from transfer learning, which enables them to be trained on multiple corpora, resulting in a new state-of-the-art model for German NER on two contemporary German corpora (CoNLL 2003 and GermEval 2014) and two historic corpora.

机译：我们询问如何为德国命名实体识别（NER）建立一个模型，该模型在当代和历史文本（即大数据和小数据方案）方面都处于最新状态。将两个性能最佳的模型系列相互抗衡（线性链CRF和BiLSTM），以观察表达能力和数据需求之间的权衡。当可获得较大的数据集时，BiLSTM的效果优于CRF，而对于最小的数据集，BiLSTM的效果却逊色于CRF。 BiLSTM从迁移学习中获得了可观的收益，这使他们能够接受多种语料库的培训，从而在两个当代德国语料库（CoNLL 2003和GermEval 2014）和两个历史性语料库上为德国NER建立了新的最新模型。

著录项

来源
《Annual meeting of the Association for Computational Linguistics》|2018年|120-125|共6页
会议地点
作者
Martin Riedl; Sebastian Pado;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Myanmar named entity corpus and its use in syllable-based neural named entity recognition [J] . Hsu Myat Mo, Khin Mar Soe International Journal of Electrical and Computer Engineering . 2020,第2期

机译：缅甸名为实体语料库及其在基于音节的神经名为实体识别中的用途
2. Massive parallel sequencing uncovers actionable FGFR2–PPHLN1 fusion and ARAF mutations in intrahepatic cholangiocarcinoma [J] . Daniela Sia, Bojan Losic, Agrin Moeini, Nature Communications . 2015,第1期

机译：大规模并行测序发现可行的 FGFR2 – PPHLN1 融合和 <肝内胆管癌的named-entity> ARAF 突变
3. Dppa3 expression is critical for generation of fully reprogrammed iPS cells and maintenance of Dlk1-Dio3 imprinting [J] . Xingbo Xu, Lukasz Smorag, Toshinobu Nakamura, Nature Communications . 2015,第2016期

机译： Dppa3 表达对于生成完全重新编程的iPS细胞和维护 Dlk1 - Dio3 印记
4. A Named Entity Recognition Shootout for German [C] . Martin Riedl, Sebastian Pado Annual meeting of the Association for Computational Linguistics . 2018

机译：一个名为实体识别枪战的德语
5. Semi-supervised Named Entity Recognition: Learning to recognize 100 entity types with little supervision [D] . Nadeau, David. 2007

机译：半监督的命名实体识别：在很少的监督下学习识别100种实体类型
6. Precursor-induced conditional random fields: connecting separate entities by induction for improved clinical named entity recognition [O] . Wangjin Lee, Jinwook Choi 2019

机译：前体诱导的条件随机场：通过诱导连接单独的实体以改善临床命名实体的识别
7. Multilingual Language Models for Named Entity Recognition in German and English [O] . Antonia Baumann 2019

机译：德语和英语中命名实体识别的多语言语言模型

A Named Entity Recognition Shootout for German

摘要

著录项

相似文献

相关主题

期刊订阅