首页> 外文会议>Australasian Database Conference >XEdge: An Efficient Method for Returning Meaningful Clustered Results for XML Keyword Search
【24h】

XEdge: An Efficient Method for Returning Meaningful Clustered Results for XML Keyword Search

机译:xedge:返回有意义的聚类结果的有效方法,用于XML关键字搜索

获取原文

摘要

In this paper, we investigate the problem of returning meaningful clustered results for XML keyword search. We begin by presenting a multi-granularity computing methodology, in order to make full use of the structural information of XML trees to extract features. In this method, we first propose the concept of Cluster Compactness Granularity (CCG) to partition the search results into different clusters, which enable users to precisely and quickly seek their desired answers, according to the connection compactness between LCA nodes. We then propose the concept of Subtree Compactness Granularity (SCG) to rank individual results within clusters and measure the query result relevance. Furthermore, we define a novel semantics of Compact LCA (CLCA), which not only improves the accuracy by eliminating redundant LCAs that do not contribute to meaningful answers, but also overcomes the shielding effects of SLCA-based methods. Using the proposed CCG and SCG features and the CLCA semantics, we finally implement an efficient algorithm called XEdge for generating meaningful clustered results. Comparing with the existing methods such as XSeek and XK-LUSTER, the experimental results demonstrate the effectiveness of the proposed multi-granularity clustering methodology and validity of the complemented ranking strategy, as well as the meaningfulness of CLCA semantics.
机译:在本文中,我们调查返回XML关键字搜索的有意义聚类结果的问题。我们首先呈现多粒度计算方法,以充分利用XML树的结构信息来提取特征。在此方法中,我们首先提出了群集紧凑粒度的概念(CCG),将搜索结果分配到不同的集群中,这使用户能够根据LCA节点之间的连接紧凑性精确地快速地寻求所需的答案。然后,我们提出了子树紧凑性粒度(SCG)的概念,以在集群内排名个体结果并测量查询结果相关性。此外,我们定义了一种小型LCA(CLCA)的新颖语义,这不仅通过消除没有贡献有意义答案的冗余LCA来提高精度,而且还克服了基于SLCA的方法的屏蔽效应。使用所提出的CCG和SCG功能和CLCA语义,我们最终实现了一个称为XEDGE的有效算法,以生成有意义的聚类结果。与Xseek和XK光泽等现有方法相比,实验结果证明了所提出的多粒度聚类方法和补充排名策略的有效性的有效性,以及CLCA语义的有意义。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号