首页> 外文期刊>Procedia Computer Science >A Comparative Study of Linguistic and Computational Features Based on a Machine Learning for Arabic Anaphora Resolution
【24h】

A Comparative Study of Linguistic and Computational Features Based on a Machine Learning for Arabic Anaphora Resolution

机译:基于Arabophora分辨率的机器学习的语言和计算特征对比较研究

获取原文
           

摘要

Anaphora resolution is one of the problems in natural language processing. It is the process of disambiguating the antecedent of a referring expression from the set of entities in a discourse. The correct interpretation of pronouns plays an important role in the construction of meaning. Thus, the resolution of pronominal anaphors remains a very important task for many natural language processing applications. Additionally, it plays an increasingly significant role in computational linguistics. However, a significant amount of work on anaphora resolution is focused on English; anaphora resolution for other languages, including Arabic, is still limited. In this paper, we present a new set of computational and linguistic features to resolve Arabic anaphors using a machine learning approach. In this paper, an in-depth study was conducted on a set of computational and linguistic features to exploit their effectiveness and investigate their effect on anaphora resolution. The aim was to efficiently integrate different feature sets and classification algorithms to synthesize a more accurate classification procedure. Four well-known machine learning algorithms k-nearest neighbor, maximum entropy, decision tree and meta-classifier, were employed as base-classifiers for each of the feature sets. A wide range of comparative experiments on Quran datasets was conducted, the discussion presented, and conclusions were drawn. The experimental results show that our approach gives satisfactory results.
机译:Anaphora解决方案是自然语言处理中的问题之一。它是歧义话语中的一组实体的引用表达的前进的过程。代词的正确解释在含义的构建中起着重要作用。因此,多种语态阴影的分辨率仍然是许多自然语言处理应用的非常重要的任务。此外,它在计算语言学中起着越来越重要的作用。但是,关于APAPHORA决议的大量工作重点是英语; Anaphora为其他语言(包括阿拉伯语)的分辨率仍然有限。在本文中,我们展示了一系列新的计算和语言特征,可以使用机器学习方法解决阿拉伯语宣言。在本文中,对一系列计算和语言特征进行了深入研究,以利用其有效性并调查它们对视性分辨率的影响。目的是有效地整合不同的特征集和分类算法,以合成更准确的分类过程。四个众所周知的机器学习算法K-最近邻居,最大熵,决策树和元分类器被用作每个特征集的基础分类器。对古兰经数据集进行了广泛的比较实验,提出的讨论和得出结论。实验结果表明,我们的方法提供了令人满意的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号