首页> 外文会议>Australasian Database Conference on Databases theory and applications >XEdge: An Efficient Method for Returning Meaningful Clustered Results for XML Keyword Search
【24h】

XEdge: An Efficient Method for Returning Meaningful Clustered Results for XML Keyword Search

机译:XEdge:一种有效的方法,可为XML关键字搜索返回有意义的聚类结果

获取原文

摘要

In this paper, we investigate the problem of returning meaningful clustered results for XML keyword search. We begin by presenting a multi-granularity computing methodology, in order to make full use of the structural information of XML trees to extract features. In this method, we first propose the concept of Cluster Compactness Granularity (CCG) to partition the search results into different clusters, which enable users to precisely and quickly seek their desired answers, according to the connection compactness between LCA nodes. We then propose the concept of Subtree Compactness Granularity (SCG) to rank individual results within clusters and measure the query result relevance. Furthermore, we define a novel semantics of Compact LCA (CLCA), which not only improves the accuracy by eliminating redundant LCAs that do not contribute to meaningful answers, but also overcomes the shielding effects of SLCA-based methods. Using the proposed CCG and SCG features and the CLCA semantics, we finally implement an efficient algorithm called XEdge for generating meaningful clustered results. Comparing with the existing methods such as XSeek and XK-LUSTER, the experimental results demonstrate the effectiveness of the proposed multi-granularity clustering methodology and validity of the complemented ranking strategy, as well as the meaningfulness of CLCA semantics.
机译:在本文中,我们研究了为XML关键字搜索返回有意义的聚类结果的问题。我们首先介绍一种多粒度计算方法,以便充分利用XML树的结构信息来提取特征。在这种方法中,我们首先提出了群集紧凑度粒度(CCG)的概念,将搜索结果划分为不同的群集,从而使用户能够根据LCA节点之间的连接紧凑性快速准确地寻找所需的答案。然后,我们提出了子树压缩粒度(SCG)的概念,以对聚类中的各个结果进行排名并衡量查询结果的相关性。此外,我们定义了一种紧凑LCA(CLCA)的新颖语义,它不仅通过消除对有意义的答案无用的冗余LCA来提高准确性,而且还克服了基于SLCA的方法的屏蔽效果。使用建议的CCG和SCG功能以及CLCA语义,我们最终实现了一种称为XEdge的高效算法,用于生成有意义的聚类结果。与现有的XSeek和XK-LUSTER方法相比,实验结果证明了所提出的多粒度聚类方法的有效性和互补排序策略的有效性,以及CLCA语义的意义。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号