首页> 外文会议>Annual International Conference on Computational Linguistics and Intelligent Text Processing >High Quality Arabic Lexical Ontology Based on MUHIT, WordNet, SUMO and DBpedia
【24h】

High Quality Arabic Lexical Ontology Based on MUHIT, WordNet, SUMO and DBpedia

机译:基于Muhit,Wordnet,Sumo和DBPedia的高质量阿拉伯词汇本体论

获取原文

摘要

In this paper, we aim to move ontology-based Arabic NLP forward by experimenting with the generation of a comprehensive Arabic lexical ontology using multiple language resources. We recommend a combination of MUHIT, WordNet and SUMO and use a simple method to link them, which results in the generation of an Arabic-lexicalized version of the SUMO ontology. Then, we evaluate the generated ontology, and propose a method for increasing its named entity coverage using DBpedia, English-to-Arabic Transliteration, and Named Entity Recognition. We end up with an Arabic lexical ontology that has 228K Arabic synsets, linked to 7.8K concepts and 143K instances. This ontology achieves a precision of 96.9% and recall of 75.5% for NLU scenarios.
机译:在本文中,我们的目的是通过使用多语言资源来试验全面的阿拉伯语词法本体学,通过尝试来移动基于本体的阿拉伯语NLP。我们建议使用Muhit,Wordnet和Sumo的组合,并使用简单的方法将它们链接,这导致生成Arabic-Lexicalized版本的Sumo本体。然后,我们评估生成的本体,并提出了一种使用DBPedia,英语到阿拉伯语音译和命名实体识别增加其命名实体覆盖的方法。我们最终获得了一个拥有228k的阿拉伯语的词组本体,与7.8K概念和143K实例相关联。这种本体论实现了96.9%的精确度,并考虑了NLU情景的75.5%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号