Neural Computing & Applications

A deep neural network-based model for named entity recognition for Hindi language



Abstract

The aim of this work is to develop efficient named entity recognition (NER) for a given text, which in turn improves the performance of systems that rely on natural language processing (NLP). The performance of IoT-based assistants such as Alexa and Cortana depends significantly on an efficient NLP model, and NER tools play an important role in improving the ability of such smart devices to comprehend natural language. In general, NER is a two-step process: proper nouns are first identified in the text and then classified into predefined entity categories such as person, location, measure, organization and time. NER is often performed as a subtask of natural language processing and increases the accuracy of the overall NLP task. In this paper, we propose a deep neural network architecture for named entity recognition in the resource-scarce language Hindi, based on a convolutional neural network (CNN), a bidirectional long short-term memory (Bi-LSTM) network and a conditional random field (CRF). In the proposed approach, we first use the skip-gram word2vec model and the GloVe model to represent words as semantic vectors, which are then used in different deep neural network-based architectures. We represent the text with both character- and word-level embeddings, which capture information at a fine-grained level; the character-level embeddings make the proposed model robust to out-of-vocabulary words. Experimental results show that the combination of Bi-LSTM, CNN and CRF performs better than baseline methods such as a recurrent neural network, LSTM and Bi-LSTM used individually.
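The abstract describes a character-CNN + word-embedding + Bi-LSTM encoder whose per-token scores are decoded by a CRF. The sketch below is a minimal illustration of that kind of architecture, not the authors' released code: the class name, dimensions and tag set are assumptions, and the CRF decoding layer is omitted (it could be added on top of the emission scores, for example with the third-party pytorch-crf package).

```python
# Illustrative sketch of a char-CNN + word-embedding + Bi-LSTM token encoder
# for NER, with a linear emission layer. All names and sizes are assumptions.
import torch
import torch.nn as nn


class CharCNNBiLSTMTagger(nn.Module):
    def __init__(self, word_vocab, char_vocab, num_tags,
                 word_dim=300, char_dim=30, char_filters=30, lstm_hidden=200):
        super().__init__()
        # Word-level embeddings; could be initialised from skip-gram
        # word2vec or GloVe vectors, as the paper describes.
        self.word_emb = nn.Embedding(word_vocab, word_dim, padding_idx=0)
        # Character-level embeddings feed a 1-D CNN that builds a sub-word
        # representation, which helps with out-of-vocabulary words.
        self.char_emb = nn.Embedding(char_vocab, char_dim, padding_idx=0)
        self.char_cnn = nn.Conv1d(char_dim, char_filters, kernel_size=3, padding=1)
        # Bi-LSTM over the concatenated word + character features.
        self.bilstm = nn.LSTM(word_dim + char_filters, lstm_hidden,
                              batch_first=True, bidirectional=True)
        # Per-token emission scores over the entity tag set
        # (person, location, organization, measure, time, O, ...).
        self.emissions = nn.Linear(2 * lstm_hidden, num_tags)

    def forward(self, word_ids, char_ids):
        # word_ids: (batch, seq_len); char_ids: (batch, seq_len, max_word_len)
        b, s, c = char_ids.shape
        chars = self.char_emb(char_ids).view(b * s, c, -1).transpose(1, 2)
        char_feat = self.char_cnn(chars).max(dim=2).values.view(b, s, -1)
        tokens = torch.cat([self.word_emb(word_ids), char_feat], dim=-1)
        hidden, _ = self.bilstm(tokens)
        # These scores would be passed to a CRF layer for structured decoding.
        return self.emissions(hidden)
```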


