【24h】

Named Entity Recognition for Nepali Language

机译:尼泊尔语言命名实体识别

获取原文

摘要

Named Entity Recognition (NER) has been studied for many languages like English, German, Spanish, and others but virtually no studies have focused on the Nepali language. One key reason is the lack of an appropriate, annotated dataset. In this paper, we describe a Nepali NER dataset that we created. We discuss and compare the performance of various machine learning models on this dataset. We also propose a novel NER scheme for Nepali and show that this scheme, based on grapheme-level representations, outperforms character-level representations when combined with BiLSTM models. Our best models obtain an overall F1 score of 86.89, which is a significant improvement on previously reported performance in literature.
机译:已对英语,德语,西班牙语等多种语言进行了命名实体识别(NER)的研究,但实际上没有针对尼泊尔语言的研究。关键原因之一是缺少适当的带注释的数据集。在本文中,我们描述了我们创建的尼泊尔NER数据集。我们讨论并比较了该数据集上各种机器学习模型的性能。我们还为尼泊尔语提出了一种新颖的NER方案,并表明该方案基于字素级表示,与BiLSTM模型结合使用时,性能优于字符级表示。我们最好的模型获得的F1总体得分为86.89,这是对先前报道的文献表现的重大改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号