Finding and ranking compact connected trees for effective keyword proximity search in XML documents

Abstract

In this paper, we study the problem of keyword proximity search in XML documents. We take the disjunctive semantics among the keywords into consideration and find the top-k relevant compact connected trees (CCTrees) as the answers to keyword proximity queries. We first introduce the notions of compact lowest common ancestor (CLCA) and maximal CLCA (MCLCA), and then propose compact connected trees and maximal CCTrees (MCCTrees) to efficiently and effectively answer keyword proximity queries. We give the theoretical upper bounds of the numbers of CLCAs, MCLCAs, CCTrees and MCCTrees, respectively. We devise an efficient algorithm to generate all MCCTrees, and propose a ranking mechanism to rank MCCTrees. Our extensive experimental study shows that our method achieves both high efficiency and effectiveness, and outperforms existing state-of-the-art approaches significantly.

Bibliographic details

  • Source
    Information Systems | 2010, Issue 2 | pp. 186-203 | 18 pages
  • Author affiliations

    Department of Computer Science and Technology, Tsinghua National Laboratory for Information Science and Technology, Tsinghua University, Beijing 10084, China;

    Department of Computer Science and Technology, Tsinghua National Laboratory for Information Science and Technology, Tsinghua University, Beijing 10084, China;

    Department of Computer Science and Technology, Tsinghua National Laboratory for Information Science and Technology, Tsinghua University, Beijing 10084, China;

    Department of Computer Science and Technology, Tsinghua National Laboratory for Information Science and Technology, Tsinghua University, Beijing 10084, China;

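  • Indexed in: Science Citation Index (SCI), USA; Engineering Index (EI), USA
  • Original format: PDF
  • Language of text: English
  • Chinese Library Classification
  • Keywords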

    lowest common ancestor (LCA); compact LCA (CLCA); maximal CLCA (MCLCA); compact connected trees (CCTrees); maximal CCTrees (MCCTrees);
