首页> 外文期刊>Journal of computer sciences >QUERY TRANSLATION USING CONCEPTS SIMILARITY BASED ON QURAN ONTOLOGYFOR CROSS-LANGUAGE INFORMATION RETRIEVAL
【24h】

QUERY TRANSLATION USING CONCEPTS SIMILARITY BASED ON QURAN ONTOLOGYFOR CROSS-LANGUAGE INFORMATION RETRIEVAL

机译:使用基于Quran本体的概念相似性进行查询翻译以进行跨语言信息检索

获取原文
获取原文并翻译 | 示例
       

摘要

In Cross-Language Information Retrieval (CLIR) process, the translation effects have a direct impact on the accuracy of follow-up retrieval results. In dictionary-based approach, we are dealing with the words that have more than one meaning which can decrease the retrieval performance if the query translation return an incorrect translations. These issues need to be overcome using efficient technique. In this study we proposed a Cross-Language Information Retrieval (CLIR) method based on domain ontology using Quran concepts for disambiguating translation of the query and to improve the dictionary-based query translation. For experimentation, we use Quran ontology written in English and Malay languages as a bilingual parallel-corpora and Quran concepts as a resource for cross-language query translation along with dictionary-based translation. For evaluation, we measure the performance of three IR systems. IR-(1) is natural language query IR, IR-(2) is natural language query CLIR based on dictionary (as a Baseline) and IR-(3) is the retrieval of this research proposed method using Mean Average Precision (MAP) and average precision at 11 points of recall. The experimental result shows that our proposed method brings significant improvement in retrieval accuracy for English document collections, but deficient for Malay document collections. The proposed CLIR method can obtain query expansion effect and improve retrieval performance in certain language.
机译:在跨语言信息检索(CLIR)过程中,翻译效果直接影响后续检索结果的准确性。在基于字典的方法中,我们要处理具有多个含义的单词,如果查询翻译返回不正确的翻译,则会降低检索性能。这些问题需要使用有效的技术来克服。在这项研究中,我们提出了一种基于领域本体的跨语言信息检索(CLIR)方法,该方法使用Quran概念消除查询的歧义并改善基于字典的查询翻译。为了进行实验,我们使用以英语和马来语编写的古兰经本体作为双语并行语料库,而古兰经概念则用作跨语言查询翻译以及基于字典的翻译的资源。为了进行评估,我们测量了三个IR系统的性能。 IR-(1)是自然语言查询IR,IR-(2)是基于字典(作为基准)的自然语言查询CLIR,IR-(3)是使用均值平均精度(MAP)对该研究提出的方法的检索召回点平均精度为11。实验结果表明,本文提出的方法对英语文献集的检索准确度有显着提高,但对马来文文献集却不足。所提出的CLIR方法可以获得一定的语言查询扩展效果,提高了检索性能。

著录项

  • 来源
    《Journal of computer sciences》 |2013年第7期|889-897|共9页
  • 作者单位

    Department of Multimedia,Faculty of Computer Science and Information Technology,Universiti Putra Malaysia, 43400 UPM Serdang, Selangor, Malaysia;

    Department of Multimedia,Faculty of Computer Science and Information Technology,Universiti Putra Malaysia, 43400 UPM Serdang, Selangor, Malaysia;

    Department of Multimedia,Faculty of Computer Science and Information Technology,Universiti Putra Malaysia, 43400 UPM Serdang, Selangor, Malaysia;

    Department of Computer Science,Faculty of Computer Science and Information Technology,Universiti Putra Malaysia, 43400 UPM Serdang, Selangor, Malaysia;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    English Language; Malay Language; Bilingual Dictionary; Quran Concepts; Quran Ontology;

    机译:英语;马来语双语词典;古兰经概念;古兰经本体论;
  • 入库时间 2022-08-17 13:48:13

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号