首页> 外文学位 >Efficient query processing for modern data management.
【24h】

Efficient query processing for modern data management.

机译:用于现代数据管理的高效查询处理。

获取原文
获取原文并翻译 | 示例

摘要

Efficient query processing in any data management system typically relies on; (a) A profiling component that gathers statistics used to evaluate possible query execution plans, and (b) A planning component that picks the plan with the best predicted performance. For query processing in a range of new data management scenarios, e.g., query processing over data streams, and web services, traditional profiling and planning techniques developed for conventional relational database management systems are inadequate. This thesis develops several novel profiling and planning techniques to enable efficient query processing in these new scenarios.; When data is arriving rapidly in the form of streams, and many registered queries must be continuously executed over this data, system resources such as memory and processing power may be stretched to their limit. First, for a class of computation-intensive queries, we describe how system throughput can be increased by exploiting sharing of computation among the registered queries. Then, for a class of memory-intensive queries, we consider the case when system memory is insufficient for obtaining exact answers, and give techniques for maximizing result accuracy under the given memory constraints. We then consider a distributed setting such as that of a sensor network, and give techniques for deciding the placement of query operators at network nodes in order to minimize system-wide consumption of resources.; We then consider the scenario of web services, which have been emerging as a popular standard for sharing data and functionality among loosely-coupled systems. For queries involving multiple web services, we give algorithms for finding the optimal execution plan. Finally, we turn to the profiling component, and describe new techniques for gathering statistics by not looking at the data but only at the query results. Such a technique is required when data access for collecting statistics is infeasible, as for web services, but can also be useful in traditional databases.
机译:任何数据管理系统中有效的查询处理通常都依赖于此; (a)一个分析组件,该组件收集用于评估可能的查询执行计划的统计信息,以及(b)一个计划组件,该组件选择具有最佳预测性能的计划。对于一系列新数据管理方案中的查询处理,例如,对数据流和Web服务的查询处理,为常规关系数据库管理系统开发的传统概要分析和计划技术是不够的。本文开发了几种新颖的概要分析和计划技术,以在这些新场景中实现高效的查询处理。当数据以流的形式快速到达,并且必须对该数据连续执行许多注册查询时,系统资源(例如内存和处理能力)可能会达到极限。首先,对于一类计算密集型查询,我们描述了如何通过利用已注册查询之间的计算共享来提高系统吞吐量。然后,对于一类内存密集型查询,我们考虑了系统内存不足以获取准确答案的情况,并给出了在给定内存约束下使结果准确性最大化的技术。然后,我们考虑分布式设置(例如传感器网络的设置),并给出用于确定查询运算符在网络节点上的位置的技术,以最大程度地减少系统范围内的资源消耗。然后,我们考虑Web服务的场景,它已成为在松耦合系统之间共享数据和功能的流行标准。对于涉及多个Web服务的查询,我们提供了用于查找最佳执行计划的算法。最后,我们转到分析组件,并通过不查看数据而是仅查看查询结果来描述收集统计信息的新技术。当用于收集统计信息的数据访问不可行时(例如对于Web服务),就需要这种技术,但在传统数据库中它也很有用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号