首页> 外文期刊>Information Sciences: An International Journal >SAIL: Structure-aware indexing for effective and progressive top-k keyword search over XML documents
【24h】

SAIL: Structure-aware indexing for effective and progressive top-k keyword search over XML documents

机译:SAIL:结构感知索引,用于对XML文档进行有效且渐进的top-k关键字搜索

获取原文
获取原文并翻译 | 示例
       

摘要

Keyword search in XML documents has recently gained a lot of research attention. Given a keyword query, existing approaches first compute the lowest common ancestors (LCAs) or their variants of XML elements that contain the input keywords. and then identify the sub-trees rooted at the LCAs as the answer. In this the paper we study how to use the rich structural relationships embedded in XML documents to facilitate the processing of keyword queries. We develop a novel method, called SAIL, to index such structural relationships for efficient XML keyword search. We propose the concept of minimal-cost trees to answer keyword queries and devise structure-aware indices to maintain the structural relationships for efficiently identifying the minimal-cost trees. For effectively and progressively identifying the top-k answers, we develop techniques using link-based relevance ranking and keyword-pair-based ranking. To reduce the index size, we incorporate a numbering scheme, namely schema-aware dewey code, into our structure-aware indices. Experimental results on real data sets show that our method outperforms state-of-the-art approaches significantly, in both answer quality and search efficiency.
机译:XML文档中的关键字搜索最近引起了很多研究关注。给定关键字查询,现有方法首先会计算出包含输入关键字的最低公共祖先(LCA)或其XML元素的变体。然后确定以LCA为根的子树作为答案。在本文中,我们研究了如何使用XML文档中嵌入的丰富结构关系来促进关键字查询的处理。我们开发了一种称为SAIL的新颖方法来为有效的XML关键字搜索建立这种结构关系的索引。我们提出了最小代价树的概念来回答关键字查询并设计结构感知索引,以维护有效识别最小代价树的结构关系。为了有效,逐步地确定前k个答案,我们开发了使用基于链接的相关性排名和基于关键字对的排名的技术。为了减少索引的大小,我们在我们的结构感知索引中加入了编号方案,即模式感知杜威代码。在真实数据集上的实验结果表明,我们的方法在回答质量和搜索效率方面均明显优于最新方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号