首页> 外文期刊>Egyptian Informatics Journal >A simple Galois Power-of-Two real time embedding scheme for performing Arabic morphology deep learning tasks
【24h】

A simple Galois Power-of-Two real time embedding scheme for performing Arabic morphology deep learning tasks

机译:一种简单的伽罗技力量 - 两个实时嵌入方案,用于执行阿拉伯语形态深入学习任务

获取原文
           

摘要

This paper describes how a simple novel Galois Power-of-Two (GPOW2) real-time embedding scheme is used to improve the performance and accuracy of downstream NLP tasks. GPOW2 computes embeddings live on the fly (real time) in the context of target NLP tasks without the need for tabulated pre-embeddings. One excellent feature of the method is the ability to capture multilevel embeddings in the same pass. It simultaneously computes character, word and sentence embeddings on the fly. GPOW2 has been derived in the context of attempts to improve the performance of the SWAM Arabic morphological engine, which is a multipurpose tool that supports segmentation, classification, POS tagging, spell checking, word embeddings, sematic search, among other tasks. SWAM is a pattern-oriented algorithm that relies on morphological patterns and POS tagging to perform NLP tasks. The paper demonstrates how GPOW2 led to improvements in the accuracy of POS tagging and pattern matching, and accordingly the performance of the whole engine. The accuracy for pattern prediction is 99.47% and is 98.80% for POS tagging.
机译:本文介绍了一种简单的新颖Galois电源 - 两种(GPOW2)实时嵌入方案用于提高下游NLP任务的性能和准确性。 GPOW2在目标NLP任务的上下文中计算嵌入式(实时),无需列表预嵌入。该方法的一个很好的特征是能够在同一通行证中捕获多级嵌入的能力。它同时使用飞行计算字符,单词和句子嵌入。 GPOW2已经在尝试上导出了改进SAMAM阿拉伯语形态引擎的性能的背景下,这是一种支持分割,分类,POS标记,拼写检查,Word Embeddings,Sematic Search等的多功能工具。 SAVAM是一种以模式为导向的算法,依赖于形态模式和POS标记来执行NLP任务。本文演示了GPOW2如何导致POS标签和模式匹配的准确性改善,并因此进行整个发动机的性能。 POS标记的图案预测的精度为99.47%,为98.80%。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号