Journal: Tsinghua Science and Technology

Keyword searches in data-centric XML documents using tree partitioning


Abstract

This paper presents an effective keyword search method for data-centric extensible markup language (XML) documents. The method divides an XML document into compact, connected, integral subtrees, called self-integral trees (SI-Trees), to capture the document's structural information. The SI-Trees are generated based on a schema guide. Meaningful self-integral trees (MSI-Trees), those containing all or some of the input keywords, are then identified. Indexing accelerates the retrieval of the MSI-Trees related to the input keywords, and the MSI-Trees are ranked to return the top-k results. Extensive tests show that the method takes 10–100 ms to answer a keyword query and outperforms existing approaches by 1–2 orders of magnitude.
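
The pipeline described in the abstract (partition, index, rank) can be illustrated with a minimal Python sketch. It is not the paper's algorithm: the abstract does not specify how the schema guide defines SI-Trees or how MSI-Trees are scored. The sketch assumes a hypothetical schema guide given as a set of tag names (`SI_ROOT_TAGS`) that root a subtree, and it ranks subtrees simply by how many distinct query keywords they contain. The names `partition`, `build_index`, and `search` are illustrative only.

```python
import heapq
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical schema guide (an assumption, not from the paper):
# elements with these tags each root their own subtree.
SI_ROOT_TAGS = {"book", "author"}


def partition(root, si_root_tags):
    """Split a document into subtrees rooted at schema-designated tags.

    Returns a list of (root_element, tokens) pairs. The document root always
    starts a subtree. Text inside a nested subtree is credited to that nested
    subtree only, not to its ancestors.
    """
    subtrees = []

    def start(elem):
        tokens = []
        subtrees.append((elem, tokens))
        collect(elem, tokens)

    def collect(elem, tokens):
        if elem.text:
            tokens.extend(elem.text.lower().split())
        for child in elem:
            if child.tag in si_root_tags:
                start(child)            # child begins a new subtree
            else:
                collect(child, tokens)  # child stays in the current subtree
            if child.tail:              # tail text belongs to the parent
                tokens.extend(child.tail.lower().split())

    start(root)
    return subtrees


def build_index(subtrees):
    """Inverted index: keyword -> set of subtree ids containing it."""
    index = defaultdict(set)
    for subtree_id, (_, tokens) in enumerate(subtrees):
        for token in tokens:
            index[token].add(subtree_id)
    return index


def search(index, keywords, k):
    """Return up to k (subtree_id, matched_keyword_count) pairs.

    Subtrees matching more distinct keywords rank higher; ties are broken
    in favour of the lower subtree id. This scoring is a stand-in for the
    paper's ranking, which the abstract does not describe.
    """
    counts = defaultdict(int)
    for keyword in {kw.lower() for kw in keywords}:
        for subtree_id in index.get(keyword, ()):
            counts[subtree_id] += 1
    return heapq.nlargest(k, counts.items(), key=lambda kv: (kv[1], -kv[0]))


DOC = """<library>
  <book><title>XML Search</title><author><name>Li</name></author></book>
  <book><title>Tree Indexing</title><author><name>Wang</name></author></book>
</library>"""

if __name__ == "__main__":
    subtrees = partition(ET.fromstring(DOC), SI_ROOT_TAGS)
    index = build_index(subtrees)
    for subtree_id, matched in search(index, ["XML", "search"], k=3):
        print(subtree_id, subtrees[subtree_id][0].tag, matched)
```

Because the index maps each keyword directly to subtree ids, a query touches only the subtrees that contain at least one keyword rather than scanning the whole document. That is the general reason indexing speeds up retrieval, though the timings reported in the abstract refer to the authors' own implementation.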
