首页> 外文期刊>International journal of web information systems >CASONTO: AN EFFICIENT AND SCALABLE ARABIC SEMANTIC SEARCH ENGINE BASED ON A DOMAIN SPECIFIC ONTOLOGY AND QUESTION ANSWERING
【24h】

CASONTO: AN EFFICIENT AND SCALABLE ARABIC SEMANTIC SEARCH ENGINE BASED ON A DOMAIN SPECIFIC ONTOLOGY AND QUESTION ANSWERING

机译:CASONTO:一种基于领域特定本体和问题回答的高效且可缩放的阿拉伯语语义搜索引擎

获取原文
获取原文并翻译 | 示例
           

摘要

Purpose - The purpose of this paper is to present an efficient and scalable Arabic semantic search engine based on a domain-specific ontological graph for Colleges of Applied Science, Sultanate of Oman (CASOnto). It also supports the factorial question answering and uses two types of searching: the keyword-based search and the semantics-based search in both languages Arabic and English. This engine is built on variety of technologies such as resource description framework data and ontological graph. Furthermore, two experimental results are conducted; the first is a comparison among entity-search and the classical-search in the system itself. The second compares the CASOnto with well-known semantic search engines such as Kngine, Wolfram Alpha and Google to measure their performance and efficiency. Design/methodology/approach - The design and implementation of the system comprises the following phases, namely, designing inference, storing, indexing, searching, query processing and the user's friendly interface, where it is designed based on a specific domain of the IBRI CAS (College of Applied Science) to highlight the academic and nonacademic departments. Furthermore, it is ontological inferred data stored in the tuple data base (TDB) and MySQL to handle the keyword-based search as well as entity-based search. The indexing and searching processes are built based on the Lucene for the keyword search, while TDB is used for the entity search. Query processing is a very important component in the search engines that helps to improve the user's search results and make the system efficient and scalable. CASOnto handles the Arabic issues such as spelling correction, query completion, stop words' removal and diacritics removal. It also supports the analysis of the factorial question answering. Findings - In this paper, an efficient and scalable Arabic semantic search engine is proposed. The results show that the semantic search that built on the SPARQL is better than the classical search in both simple and complex queries. Clearly, the accuracy of semantic search equals to 100 per cent in both types of queries. On the other hand, the comparison of CASOnto with the Wolfram Alpha, Kngine and Google refers to better results by CASOnto. Consequently, it seems that our proposed engine retrieved better and efficient results than other engines. Thus, it is built according to the ontological domain-specific, highly scalable performance and handles the complex queries well by understanding the context behind the query. Research limitations/implications - The proposed engine is built on a specific domain (CAS Ibri -Oman), and in the future vision, it will highlight the nonfactorial question answering and expand the domain of CASOnto to involve more integrated different domains. Originality/value - The main contribution of this paper is to build an efficient and scalable Arabic semantic search engine. Because of the widespread use of search engines, a new dimension of challenge is created to keep up with the evolution of the semantic Web. Whereas, catering to the needs of users has become a matter of paramount importance in the light of artificial intelligence and technological development to access the accurate and the efficient information in less possible time. However, the research challenges still in its infancy due to lack of research engine that supports the Arabic language. It could be traced back to the complexity of the Arabic language morphological and grammar rules.
机译:目的-本文的目的是为阿曼苏丹国应用科学大学(CASOnto)提供一种基于领域特定本体图的高效且可扩展的阿拉伯语语义搜索引擎。它还支持阶乘问答,并使用两种类型的搜索:基于关键字的搜索和基于语义的搜索,同时使用阿拉伯语和英语。该引擎基于多种技术构建,例如资源描述框架数据和本体图。此外,进行了两个实验结果。首先是对系统本身中的实体搜索和经典搜索进行比较。第二种将CASOnto与著名的语义搜索引擎(例如Kngine,Wolfram Alpha和Google)进行比较,以衡量其性能和效率。设计/方法/方法-系统的设计和实现包括以下几个阶段,即设计推断,存储,建立索引,搜索,查询处理以及用户友好的界面,其中基于IBRI CAS的特定领域进行设计(应用科学学院)以突出学术和非学术部门。此外,它是存储在元组数据库(TDB)和MySQL中的本体论推断数据,用于处理基于关键字的搜索以及基于实体的搜索。索引和搜索过程基于Lucene构建,用于关键字搜索,而TDB用于实体搜索。查询处理是搜索引擎中非常重要的组件,它有助于改善用户的搜索结果并使系统高效且可扩展。 CASOnto处理阿拉伯语问题,例如拼写更正,查询完成,去除停用词和变音符号。它还支持析因问答的分析。发现-本文提出了一种高效且可扩展的阿拉伯语语义搜索引擎。结果表明,在简单和复杂查询中,基于SPARQL的语义搜索都比经典搜索更好。显然,在两种查询中,语义搜索的准确性都等于100%。另一方面,CASOnto与Wolfram Alpha,Kngine和Google的比较表明CASOnto的结果更好。因此,似乎我们提出的引擎比其他引擎检索出更好,更有效的结果。因此,它是根据特定于本体论领域的,高度可扩展的性能构建的,并通过了解查询背后的上下文来很好地处理复杂的查询。研究局限性/含义-拟议的引擎建立在特定领域(CAS Ibri -Aman)上,在未来的愿景中,它将突出非因果问题的回答,并将CASOn的领域扩展到涉及更多集成的不同领域。原创性/价值-本文的主要贡献是建立一个高效且可扩展的阿拉伯语语义搜索引擎。由于搜索引擎的广泛使用,创建了一个新的挑战维度,以跟上语义Web的发展。鉴于人工智能和技术发展,以尽可能少的时间访问准确和高效的信息,满足用户的需求已成为最重要的问题。然而,由于缺乏支持阿拉伯语言的研究引擎,研究挑战仍处于起步阶段。可以追溯到阿拉伯语言形态和语法规则的复杂性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号