Named Entity Recognition for Nepali Language

机译：尼泊尔语言命名实体识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Named Entity Recognition (NER) has been studied for many languages like English, German, Spanish, and others but virtually no studies have focused on the Nepali language. One key reason is the lack of an appropriate, annotated dataset. In this paper, we describe a Nepali NER dataset that we created. We discuss and compare the performance of various machine learning models on this dataset. We also propose a novel NER scheme for Nepali and show that this scheme, based on grapheme-level representations, outperforms character-level representations when combined with BiLSTM models. Our best models obtain an overall F1 score of 86.89, which is a significant improvement on previously reported performance in literature.

机译：已对英语，德语，西班牙语等多种语言进行了命名实体识别（NER）的研究，但实际上没有针对尼泊尔语言的研究。关键原因之一是缺少适当的带注释的数据集。在本文中，我们描述了我们创建的尼泊尔NER数据集。我们讨论并比较了该数据集上各种机器学习模型的性能。我们还为尼泊尔语提出了一种新颖的NER方案，并表明该方案基于字素级表示，与BiLSTM模型结合使用时，性能优于字符级表示。我们最好的模型获得的F1总体得分为86.89，这是对先前报道的文献表现的重大改进。

著录项

来源
《IEEE International Conference on Collaboration and Internet Computing》|2019年|184-190|共7页
会议地点
作者
Oyesh Mann Singh; Ankur Padia; Anupam Joshi;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Hidden Markov models; Task analysis; Training; Encoding; Artificial neural networks; Support vector machines;

机译：隐马尔可夫模型;任务分析;训练;编码;人工神经网络;支持向量机;

相似文献

外文文献
中文文献
专利

1. Named Entity Recognition for Nepali Text Using Support Vector Machines [J] . Surya Bahadur Bam, Tej Bahadur Shahi Intelligent Information Management . 2014,第2期

机译：支持向量机对尼泊尔文字的命名实体识别
2. A multiobjective simulated annealing approach for classifier ensemble: Named entity recognition in Indian languages as case studies [J] . AsifEkbal, SriparnaSaha Expert Systems with Application . 2011,第12期

机译：分类器集成的多目标模拟退火方法：以印度语言中的命名实体识别为案例研究
3. Named Entity Recognition in Indian Languages Using Maximum Entropy Approach [J] . Asif Ekbal, Sivaji Bandyopadhyay International journal of computer processing of languages . 2008,第3期

机译：使用最大熵方法的印度语言中的命名实体识别
4. Named Entity Recognition for Nepali Language [C] . Oyesh Mann Singh, Ankur Padia, Anupam Joshi IEEE International Conference on Collaboration and Internet Computing . 2019

机译：命名为尼泊尔语的实体识别
5. Improving Search via Named Entity Recognition in Morphologically Rich Languages: A Case Study in Urdu [D] . Riaz, Kashif H. 2018

机译：通过形态丰富的语言中的命名实体识别来改善搜索：以乌尔都语为例
6. Semi-Supervised Bidirectional Long Short-Term Memory and Conditional Random Fields Model for Named-Entity Recognition Using Embeddings from Language Models Representations [O] . Min Zhang, Guohua Geng, Jing Chen 2020

机译：使用语言模型表示的嵌入式识别命名实体识别的半监控双向短期内存和条件随机字段模型
7. The First Cross-Lingual Challenge on Recognition, Normalization and Matching of Named Entities in Slavic Languages [O] . Piskorski, Jakub, Pivovarova, Lidia, Šnajder, Jan, 2017

机译：斯拉夫语言中命名实体的识别，规范化和匹配的第一个跨语言挑战

Named Entity Recognition for Nepali Language

摘要

著录项

相似文献

相关主题

期刊订阅