首页> 外文会议>Advances in Databases: Concepts, Systems and Applications; Lecture Notes in Computer Science; 4443 >An Original Semantics to Keyword Queries for XML Using Structural Patterns
【24h】

An Original Semantics to Keyword Queries for XML Using Structural Patterns

机译:使用结构模式的XML关键字查询的原始语义

获取原文
获取原文并翻译 | 示例

摘要

XML is by now the de facto standard for exporting and exchanging data on the web. The need for querying XML data sources whose structure is not fully known to the user and the need to integrate multiple data sources with different tree structures have motivated recently the suggestion of keyword-based techniques for querying XML documents. The semantics adopted by these approaches aims at restricting the answers to meaningful ones. However, these approaches suffer from low precision, while recent ones with improved precision suffer from low recall.In this paper, we introduce an original approach for assigning semantics to keyword queries for XML documents. We exploit index graphs (a structural summary of data) to extract tree patterns that return meaningful answers. In contrast to previous approaches that operate locally on the data to compute meaningful answers (usually by computing lowest common ancestors), our approach operates globally on index graphs to detect and exploit meaningful tree patterns. We implemented and experimentally evaluated our approach on DBLP-based data sets with irregularities. Its comparison to previous ones shows that it succeeds in finding all the meaningful answers when the others fail (perfect recall). Further, it outperforms approaches with similar recall in excluding meaningless answers (better precision). Since our approach is based on tree-pattern query evaluation, it can be easily implemented on top of an XQuery engine.
机译:到目前为止,XML是事实上的标准,用于在Web上导出和交换数据。最近,对查询其结构不为用户所知的XML数据源的需求以及将多个数据源与不同树结构集成在一起的需求促使人们提出了基于关键字的技术来查询XML文档的建议。这些方法采用的语义旨在将答案限制为有意义的答案。但是,这些方法的精度较低,而最近的精度较高的方法则具有较低的查全率。本文介绍了一种为XML文档的关键字查询分配语义的原始方法。我们利用索引图(数据的结构摘要)来提取返回有意义答案的树型。与以前在数据上本地操作以计算有意义的答案(通常通过计算最低的共同祖先)的方法相反,我们的方法在索引图上全局操作以检测和利用有意义的树模式。我们在有异常情况的基于DBLP的数据集上实施和实验评估了我们的方法。与之前的比较表明,当其他失败时,它可以成功找到所有有意义的答案(完美召回)。此外,它在排除无意义的答案(更好的精度)方面优于具有类似回忆的方法。由于我们的方法基于树型查询评估,因此可以在XQuery引擎之上轻松实现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号