Journal: The International Arab Journal of Information Technology

Generating Sense Inventories for Ambiguous Arabic Words

Abstract

The process of selecting the appropriate meaning of an ambiguous word according to its context is known as word sense disambiguation. In this research, we generate a number of Arabic sense inventories based on an unsupervised approach and different pre-trained embeddings, such as Aravec, Fasttext, and Arabic-News embeddings. The resulting inventories are evaluated to investigate their efficiency in Arabic word sense disambiguation and sentence similarity. The sense inventories are generated using an unsupervised, graph-based word sense induction algorithm. Results show that the Aravec-Twitter inventory achieves the best accuracy, 0.47, for 50 neighbors; for 200 neighbors its accuracy is close to that of the Fasttext inventory, and for 100 neighbors it is similar to that of the Arabic-News inventory. Replacing ambiguous words with their sense vectors is also tested for sentence similarity using all sense inventories, and the results show that the Aravec-Twitter sense inventory provides a better correlation value.
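
The abstract does not spell out the induction algorithm, so the following is only a minimal sketch of one common graph-based scheme, assuming that senses are the connected components of a similarity graph built over a word's nearest neighbors. The function names (`induce_senses`, `disambiguate`), the `edge_threshold` parameter, and the toy English vectors are all hypothetical and stand in for real pre-trained Arabic embeddings.

```python
from math import sqrt


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = sqrt(sum(a * a for a in u))
    nv = sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0


def induce_senses(target, embeddings, k=50, edge_threshold=0.5):
    """Cluster the k nearest neighbors of `target` into senses.

    Assumption: an edge joins two neighbors when their cosine similarity
    is at least `edge_threshold`; each connected component is one sense.
    """
    # 1. Take the k nearest neighbors of the target word.
    neighbors = sorted(
        (w for w in embeddings if w != target),
        key=lambda w: cosine(embeddings[target], embeddings[w]),
        reverse=True,
    )[:k]

    # 2. Build the neighbor graph (the target itself is left out).
    adj = {w: set() for w in neighbors}
    for i, a in enumerate(neighbors):
        for b in neighbors[i + 1:]:
            if cosine(embeddings[a], embeddings[b]) >= edge_threshold:
                adj[a].add(b)
                adj[b].add(a)

    # 3. Each connected component becomes one sense.
    dim = len(embeddings[target])
    senses, seen = [], set()
    for start in neighbors:
        if start in seen:
            continue
        seen.add(start)
        stack, component = [start], []
        while stack:
            word = stack.pop()
            component.append(word)
            for other in adj[word]:
                if other not in seen:
                    seen.add(other)
                    stack.append(other)
        # 4. The sense vector is the mean of the component's word vectors.
        vector = [
            sum(embeddings[w][d] for w in component) / len(component)
            for d in range(dim)
        ]
        senses.append({"words": sorted(component), "vector": vector})
    return senses


def disambiguate(context_words, senses, embeddings):
    """Pick the sense whose vector is closest to the mean context vector."""
    known = [embeddings[w] for w in context_words if w in embeddings]
    if not known:
        return None
    dim = len(known[0])
    context = [sum(v[d] for v in known) / len(known) for d in range(dim)]
    return max(senses, key=lambda s: cosine(context, s["vector"]))


# Toy 3-dimensional vectors, invented purely for illustration.
toy_embeddings = {
    "bank": [1.0, 1.0, 0.0],
    "money": [1.0, 0.0, 0.1],
    "loan": [1.0, 0.0, 0.0],
    "river": [0.0, 1.0, 0.1],
    "shore": [0.0, 1.0, 0.0],
    "cat": [0.0, 0.0, 1.0],
}

senses = induce_senses("bank", toy_embeddings, k=4)
chosen = disambiguate(["river"], senses, toy_embeddings)
```

Here `k` plays the role of the neighbor count (50, 100, 200) that the abstract varies. The returned sense vector is the kind of vector that could replace an ambiguous word when computing sentence similarity.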