首页> 外文会议>第十七届国际万维网大会(the 17th International World Wide Web Conference)(WWW08)论文集 >SAILER: An Effective Search Engine for Unified Retrieval of Heterogeneous XML and Web Documents
【24h】

SAILER: An Effective Search Engine for Unified Retrieval of Heterogeneous XML and Web Documents

机译:SAILER:一个有效的搜索引擎,用于统一检索异构XML和Web文档

获取原文

摘要

This paper studies the problem of unified ranked retrieval of heterogeneous XML documents and Web data. We propose an effective search engine called Sailer to adaptively and versatilely answer keyword queries over the heterogenous data. We model the Web pages and XML documents as graphs. We propose the concept of pivotal trees to effectively answer keyword queries and present an effective method to identify the top-k pivotal trees with the highest ranks from the graphs. Moreover, we propose effective indexes to facilitate the effective unified ranked retrieval. We have conducted an extensive experimental study using real datasets, and the experimental results show that Sailer achieves both high search efficiency and accuracy, and outperforms the existing approaches significantly.
机译:本文研究了异构XML文档和Web数据的统一排名检索问题。我们提出了一种有效的搜索引擎,称为Sailer,可以自适应地通用地回答异构数据中的关键字查询。我们将网页和XML文档建模为图形。我们提出了枢纽树的概念来有效地回答关键字查询,并提出了一种有效的方法来从图中识别出排名最高的前k个枢纽树。此外,我们提出了有效的索引,以促进有效的统一排名检索。我们使用真实的数据集进行了广泛的实验研究,实验结果表明Sailer达到了很高的搜索效率和准确性,并且明显优于现有方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号