首页> 外文期刊>International journal of web information systems >Effective keyword query structuring using NER for XML retrieval
【24h】

Effective keyword query structuring using NER for XML retrieval

机译:使用NER进行XML检索的有效关键字查询结构

获取原文
获取原文并翻译 | 示例
           

摘要

Purpose - The purpose of this paper is to propose and evaluate XKQSS, a query structuring method that relegates the task of generating structured queries from a user to a search engine while retaining the simple keyword search query interface. A more effective way for searching XML database is to use structured queries. However, using query languages to express queries prove to be difficult for most users since this requires learning a query language and knowledge of the underlying data schema. On the other hand, the success of Web search engines has made many users to be familiar with keyword search and, therefore, they prefer to use a keyword search query interface to search XML data. Design/methodology/approach - Existing query structuring approaches require users to provide structural hints in their input keyword queries even though their interface is keyword base. Other problems with existing systems include their inability to put keyword query ambiguities into consideration during query structuring and how to select the best generated structure query that best represents a given keyword query. To address these problems, this study allows users to submit a schema independent keyword query, use named entity recognition (NER) to categorize query keywords to resolve query ambiguities and compute semantic information for a node from its data content. Algorithms were proposed that find user search intentions and convert the intentions into a set of ranked structured queries. Findings - Experiments with Sigmod and IMDB datasets were conducted to evaluate the effectiveness of the method. The experimental result shows that the XKQSS is about 20 per cent more effective than XReal in terms of return nodes identification, a state-of-art systems for XML retrieval. Originality/value - Existing systems do not take keyword query ambiguities into account. XKSS consists of two guidelines based on NER that help to resolve these ambiguities before converting the submitted query. It also include a ranking function computes a score for each generated query by using both semantic information and data statistic, as opposed to data statistic only approach used by the existing approaches.
机译:目的-本文的目的是提出和评估XKQSS,这是一种查询结构化方法,可将从用户生成结构化查询的任务委派给搜索引擎,同时保留简单的关键字搜索查询界面。搜索XML数据库的一种更有效的方法是使用结构化查询。但是,对于大多数用户而言,使用查询语言来表达查询被证明是困难的,因为这需要学习查询语言和有关基础数据模式的知识。另一方面,Web搜索引擎的成功使许多用户熟悉关键字搜索,因此,他们更喜欢使用关键字搜索查询界面来搜索XML数据。设计/方法/方法-现有的查询结构化方法要求用户在其输入关键字查询中提供结构提示,即使他们的界面是基于关键字的。现有系统的其他问题包括在查询结构化过程中无法考虑关键字查询的歧义性以及如何选择最能代表给定关键字查询的最佳生成结构查询。为了解决这些问题,本研究允许用户提交与模式无关的关键字查询,使用命名实体识别(NER)对查询关键字进行分类,以解决查询歧义并从节点的数据内容计算语义信息。提出了找到用户搜索意图并将该意图转换为一组排名的结构化查询的算法。结果-使用Sigmod和IMDB数据集进行了实验,以评估该方法的有效性。实验结果表明,在返回节点识别方面,XKQSS的有效性比XReal高出20%,这是XML检索的最新系统。创意/价值-现有系统未考虑关键字查询的歧义性。 XKSS包含两个基于NER的准则,可帮助您在转换提交的查询之前解决这些歧义。它还包括一个排序功能,该功能通过使用语义信息和数据统计信息为每个生成的查询计算得分,这与现有方法仅使用数据统计信息的方法不同。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号