首页> 外文会议>Novel Intelligent and Leading Emerging Sciences Conference >A Multi-Embeddings Approach Coupled with Deep Learning for Arabic Named Entity Recognition
【24h】

A Multi-Embeddings Approach Coupled with Deep Learning for Arabic Named Entity Recognition

机译:一种多嵌入方法与深度学习相结合的阿拉伯命名实体识别

获取原文

摘要

Named Entity Recognition (NER) is an important task in many natural language processing applications. There are several studies that have focused on NER for the English language. However, there are some limitations when applying the current methodologies directly on the Arabic language text. Recent studies have shown the effectiveness of pooled contextual embedding representations and significant improvements in English NER tasks. This work investigates the performance of pooled contextual embeddings and bidirectional encoder representations from Transformers (BERT) model when used for NER on the Arabic language while addressing Arabic specific issues. The proposed method is an end-to-end deep learning model that utilizes a combination of pre-trained word embeddings, pooled contextual embeddings, and BERT model. Embeddings are then fed into bidirectional long-short term memory networks with a conditional random field. Different types of classical and contextual embeddings were experimented to pool for the best model. The proposed method achieves an F1 score of 77.62% on the AQMAR dataset, outperforming all previously published results of deep learning, and non-deep learning models on the same dataset. The presented results also surpass those of the wining system for the same task on the same data in the Topcoder website competition.
机译:在许多自然语言处理应用程序中,命名实体识别(NER)是一项重要任务。有一些针对NER的英语研究。但是,在阿拉伯语文本上直接应用当前方法时存在一些限制。最近的研究表明,合并上下文嵌入表示的有效性和对英语NER任务的显着改进。这项工作调查了变形器(BERT)模型用于阿拉伯语NER时,合并的上下文嵌入和来自Transformers(BERT)模型的双向编码器表示的性能,同时解决了阿拉伯语的特定问题。所提出的方法是一种端到端深度学习模型,该模型利用了预训练的单词嵌入,合并的上下文嵌入和BERT模型的组合。然后将嵌入信息带入带有条件随机字段的双向长期短期存储网络中。实验了不同类型的经典嵌入和上下文嵌入,以汇集最佳模型。所提出的方法在AQMAR数据集上的F1分数达到77.62%,优于之前发布的所有深度学习结果和同一数据集上的非深度学习模型。在Topcoder网站竞赛中,针对相同数据执行相同任务的结果也超过了胜出系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号