...
首页> 外文期刊>Knowledge and Data Engineering, IEEE Transactions on >Materialization and Decomposition of Dataspaces for Efficient Search
【24h】

Materialization and Decomposition of Dataspaces for Efficient Search

机译:实现有效搜索的数据空间的实现和分解

获取原文
获取原文并翻译 | 示例
           

摘要

Dataspaces consist of large-scale heterogeneous data. The query interface of accessing tuples should be provided as a fundamental facility by practical dataspace systems. Previously, an efficient index has been proposed for queries with keyword neighborhood over dataspaces. In this paper, we study the materialization and decomposition of dataspaces, in order to improve the query efficiency. First, we study the views of items, which are materialized in order to be reused by queries. When a set of views are materialized, it leads to select some of them as the optimal plan with the minimum query cost. Efficient algorithms are developed for query planning and view generation. Second, we study the partitions of tuples for answering top-k queries. Given a query, we can evaluate the score bounds of the tuples in partitions and prune those partitions with bounds lower than the scores of top-k answers. We also provide theoretical analysis of query cost and prove that the query efficiency cannot be improved by increasing the number of partitions. Finally, we conduct an extensive experimental evaluation to illustrate the superior performance of proposed techniques.
机译:数据空间由大规模的异构数据组成。实际数据空间系统应提供访问元组的查询接口作为基本工具。以前,已经提出了一种有效的索引,用于在数据空间上具有关键字邻域的查询。在本文中,我们研究了数据空间的实现和分解,以提高查询效率。首先,我们研究项目的视图,这些视图被具体化以便被查询重用。当实现一组视图时,它将导致选择其中一些视图作为具有最小查询成本的最佳计划。开发了用于查询计划和视图生成的高效算法。其次,我们研究元组的分区以回答前k个查询。给定一个查询,我们可以评估分区中元组的分数边界,并修剪其边界低于前k个答案的分数的那些分区。我们还提供了查询成本的理论分析,并证明无法通过增加分区数来提高查询效率。最后,我们进行了广泛的实验评估,以说明所提出技术的卓越性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号