...
首页> 外文期刊>Journal of intelligent & fuzzy systems: Applications in Engineering and Technology >Probabilistic vs deep learning based approaches for narrow domain NER in Spanish
【24h】

Probabilistic vs deep learning based approaches for narrow domain NER in Spanish

机译:西班牙语窄域网的概率与深度学习的方法

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

This work presents an experimental study on the task of Named Entity Recognition (NER) for a narrow domain in Spanish language. This study considers two approaches commonly used in this kind of problem, namely, a Conditional Random Fields (CRF) model and Recurrent Neural Network (RNN). For the latter, we employed a bidirectional Long Short-Term Memory with ELMO's pre-trained word embeddings for Spanish. The comparison between the probabilistic model and the deep learning model was carried out in two collections, the Spanish dataset from CoNLL-2002 considering four classes under the IOB tagging schema, and aMexican Spanish news dataset with seventeen classes under IOBES schema. The paper presents an analysis about the scalability, robustness, and common errors of both models. This analysis indicates in general that the BiLSTM-ELMo model is more suitable than the CRF model when there is "enough" training data, and also that it is more scalable, as its performance was not significantly affected in the incremental experiments (by adding one class at a time). On the other hand, results indicate that the CRF model is more adequate for scenarios having small training datasets and many classes.
机译:本作品对西班牙语中窄域的命名实体识别(ner)的任务提供了一个实验研究。本研究考虑了在这种问题中常用的两种方法,即条件随机字段(CRF)模型和经常性神经网络(RNN)。对于后者,我们聘请了一款双向长期内记忆,与Elmo的预训练Word Embeddings为西班牙语。概率模型与深学习模型之间的比较是在两个集合中进行的,来自Conll-2002的西班牙数据集考虑了IOB标记架构下的四个类,以及在IOBES模式下的Amexican Spanish新闻数据集。本文提出了对两种模型的可扩展性,稳健性和常见误差的分析。该分析一般表示,当有“足够的”训练数据时,Bilstm-Elmo模型比CRF模型更适合,并且它更加可扩展,因为它的性能在增量实验中没有显着影响(通过添加一个)一段时间)。另一方面,结果表明CRF模型更适合具有小型训练数据集的场景和许多课程。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号