Urdu Named Entity Recognition and Classification System Using Artificial Neural Network

MUHAMMAD KAMRAN MALIK

首页> 外文期刊>ACM transactions on Asian language information processing >Urdu Named Entity Recognition and Classification System Using Artificial Neural Network

【24h】

Urdu Named Entity Recognition and Classification System Using Artificial Neural Network

机译：基于人工神经网络的乌尔都语命名实体识别与分类系统

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Named Entity Recognition and Classification (NERC) is a process of identifying words and classifying them into person names, location names, organization names, and so on. In this article, we discuss the development of an Urdu Named Entity (NE) corpus, called the Kamran-PU-NE (KPU-NE) corpus, for three entity types, that is, Person, Organization, and Location, and marking the remaining tokens as Others (O). We use two supervised learning algorithms, Hidden Markov Model (HMM) and Artificial Neural Network (ANN), for the development of the Urdu NERC system. We annotate the 652852-token corpus taken from 15 different genres with a total of 44480 NEs. The inter-annotator agreement between the two annotators in terms of Kappa k statistic is 73.41%. With HMM, the highest recorded precision, recall, and f-measure values are 55.98%, 83.11%, and 66.90%, respectively, and with ANN, they are 81.05%, 87.54%, and 84.17%, respectively.

机译：命名实体识别和分类（NERC）是识别单词并将其分类为人员名称，位置名称，组织名称等的过程。在本文中，我们讨论了针对三种实体类型（人，组织和位置）的乌尔都语命名实体（NE）语料库（称为Kamran-PU-NE（KPU-NE）语料库）的开发，并标记了其余标记为其他（O）。我们使用两种监督学习算法，即隐马尔可夫模型（HMM）和人工神经网络（ANN），来开发Urdu NERC系统。我们注释了来自15种不同流派的652852令牌语料，总共有44480个NE。根据Kappa k统计，两个注释者之间的注释者之间的一致性为73.41％。使用HMM时，记录的最高精度，召回率和f测量值分别为55.98％，83.11％和66.90％，而使用ANN时，分别为81.05％，87.54％和84.17％。

著录项

来源
《ACM transactions on Asian language information processing》 |2018年第1期|2.1-2.13|共13页
作者
MUHAMMAD KAMRAN MALIK;
展开▼
作者单位

Punjab University College of Information Technology (PUCIT), University of the Punjab, Lahore Pakistan;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Resource Poor Languages; Deep Learning; NER using Deep Learning; Urdu POS tagged Data; NER Data; Urdu word2vec;

机译：资源贫乏的语言;深度学习;NER使用深度学习;Urdu POS标记的数据;NER数据;乌尔都语word2vec;
入库时间 2022-08-18 04:03:41

相似文献

外文文献
中文文献
专利

1. Deep recurrent neural networks with word embeddings for Urdu named entity recognition [J] . Wahab Khan, Ali Daud, Fahd Alotaibi, ETRI journal . 2020,第1期

机译：具有Word Embeddings的深度经常性神经网络，用于URDU命名实体识别
2. Arabic Named Entity Recognition using Artificial Neural Network [J] . Naji F. Mohammed, Nazlia Omar Journal of computer sciences . 2012,第8期

机译：使用人工神经网络的阿拉伯命名实体识别
3. Arabic Named Entity Recognition Using Artificial Neural Network | Science Publications [J] . Naji F. Mohammed, Nazlia Omar Journal of computer sciences . 2012,第8期

机译：人工神经网络的阿拉伯命名实体识别科学出版物
4. Named Entity Recognition System for Urdu [C] . UmrinderPal Singh, Vishal Goyal, Gurpreet Singh Lenal International conference on computational linguistics . 2012

机译：乌尔都语命名实体识别系统
5. Improving Search via Named Entity Recognition in Morphologically Rich Languages: A Case Study in Urdu [D] . Riaz, Kashif H. 2018

机译：通过形态丰富的语言中的命名实体识别来改善搜索：以乌尔都语为例
6. CollaboNet: collaboration of deep neural networks for biomedical named entity recognition [O] . Wonjin Yoon, Chan Ho So, Jinhyuk Lee, 2019

机译：CollaboNet：用于生物医学命名实体识别的深度神经网络协作
7. Named Entity Recognition and Classification using Artificial Neural Network [O] . Luka Bašek, Borko Boškovič 2010

机译：使用人工神经网络命名实体识别和分类

Urdu Named Entity Recognition and Classification System Using Artificial Neural Network

摘要

著录项

相似文献

相关主题

期刊订阅