...
首页> 外文期刊>International journal of web information systems >Effective keyword search on graph data using limited root redundancy of answer trees
【24h】

Effective keyword search on graph data using limited root redundancy of answer trees

机译:使用有限的答案树根冗余对图形数据进行有效的关键字搜索

获取原文
获取原文并翻译 | 示例
           

摘要

Purpose - This paper aims to propose a new keyword search method on graph data to improve the relevance of search results and reduce duplication of content nodes in the answer trees obtained by previous approaches based on distinct root semantics. The previous approaches are restricted to find answer trees having different root nodes and thus often generate a result consisting of answer trees with low relevance to the query or duplicate content nodes. The method allows limited redundancy in the root nodes of top-k answer trees to produce more effective query results. Design/methodology/approach - A measure for redundancy in a set of answer trees regarding their root nodes is defined, and according to the metric, a set of answer trees with limited root redundancy is proposed for the result of a keyword query on graph data. For efficient query processing, an index on the useful paths in the graph using inverted lists and a hash map is suggested. Then, based on the path index, a top-k query processing algorithm is presented to find most relevant and diverse answer trees given a maximum amount of root redundancy allowed for a set of answer trees. Findings - The results of experiments using real graph datasets show that the proposed approach can produce effective query answers which are more diverse in the content nodes and more relevant to the query than the previous approach based on distinct root semantics. Originality/value - This paper first takes redundancy in the root nodes of answer trees into account to improve the relevance and content nodes redundancy of query results over the previous distinct root semantics. It can satisfy the users' various information need on a large and complex graph data using a keyword-based query.
机译:目的-本文旨在提出一种新的基于图数据的关键字搜索方法,以提高搜索结果的相关性,并减少以前的方法基于不同的根语义获得的答案树中内容节点的重复。先前的方法仅限于查找具有不同根节点的答案树,因此通常会生成由与查询或副本内容节点的相关性低的答案树组成的结果。该方法允许在前k个答案树的根节点中进行有限的冗余以产生更有效的查询结果。设计/方法/方法-定义了针对其根节点的一组答案树中的冗余度量,并根据度量标准,针对图数据中的关键字查询结果,提出了一组具有有限根冗余的答案树。 。为了进行有效的查询处理,建议使用反向列表和哈希图在图形中的有用路径上建立索引。然后,基于路径索引,提出了top-k查询处理算法,以在给定一组答案树所允许的最大根冗余的情况下,找到最相关且变化最大的答案树。发现-使用实际图形数据集进行的实验结果表明,与以前的基于独特的根语义的方法相比,所提出的方法可以产生有效的查询答案,这些查询答案在内容节点中更加多样化,并且与查询更加相关。原创性/价值-本文首先考虑了答案树的根节点中的冗余,以提高查询结果的相关性和内容节点的冗余度,而这些结果与先前的独特根语义无关。它可以使用基于关键字的查询来满足用户对大型复杂图形数据的各种信息需求。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号