Efficient query processing for modern data management.

Abstract

Efficient query processing in any data management system typically relies on (a) a profiling component that gathers statistics used to evaluate possible query execution plans, and (b) a planning component that picks the plan with the best predicted performance. For query processing in a range of new data management scenarios, such as query processing over data streams and web services, the traditional profiling and planning techniques developed for conventional relational database management systems are inadequate. This thesis develops several novel profiling and planning techniques to enable efficient query processing in these new scenarios.

When data arrives rapidly in the form of streams, and many registered queries must be executed continuously over this data, system resources such as memory and processing power may be stretched to their limit. First, for a class of computation-intensive queries, we describe how system throughput can be increased by sharing computation among the registered queries. Then, for a class of memory-intensive queries, we consider the case in which system memory is insufficient for obtaining exact answers, and give techniques for maximizing result accuracy under the given memory constraints. We then consider a distributed setting, such as a sensor network, and give techniques for deciding the placement of query operators at network nodes so as to minimize system-wide resource consumption.

We then consider the scenario of web services, which have been emerging as a popular standard for sharing data and functionality among loosely coupled systems. For queries involving multiple web services, we give algorithms for finding the optimal execution plan. Finally, we turn to the profiling component and describe new techniques for gathering statistics by examining only query results rather than the underlying data. Such a technique is required when accessing the data to collect statistics is infeasible, as with web services, but it can also be useful in traditional databases.
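
The split between a profiling component and a planning component described in the first paragraph can be illustrated with a small sketch. This is not the method developed in the thesis; it is a textbook-style illustration under stated assumptions. The function names (`profile`, `expected_cost`, `pick_plan`), the three example predicates, and the per-row costs are all hypothetical, and the cost model assumes that the predicates are statistically independent.

```python
from itertools import permutations


def profile(predicates, sample):
    """Profiling: estimate each predicate's selectivity on a sample of rows."""
    return {
        name: sum(1 for row in sample if pred(row)) / len(sample)
        for name, pred in predicates.items()
    }


def expected_cost(order, costs, selectivity):
    """Predicted per-row cost of applying the filters in the given order,
    assuming the predicates are statistically independent."""
    total, surviving = 0.0, 1.0
    for name in order:
        total += surviving * costs[name]  # only surviving rows reach this filter
        surviving *= selectivity[name]
    return total


def pick_plan(costs, selectivity):
    """Planning: enumerate every ordering and keep the cheapest predicted one."""
    return min(
        permutations(costs),
        key=lambda order: expected_cost(order, costs, selectivity),
    )


# Hypothetical workload: three filters over integer rows.
predicates = {
    "is_multiple_of_10": lambda x: x % 10 == 0,
    "is_even": lambda x: x % 2 == 0,
    "is_below_50": lambda x: x < 50,
}
# Assumed per-row evaluation costs, in arbitrary units.
costs = {"is_multiple_of_10": 1.0, "is_even": 2.0, "is_below_50": 10.0}

selectivity = profile(predicates, sample=range(100))
best = pick_plan(costs, selectivity)
print(best, expected_cost(best, costs, selectivity))
```

Exhaustive enumeration is only practical for a handful of operators; it is used here to keep the two roles visible, with `profile` producing the statistics and `pick_plan` consuming them to compare predicted costs.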

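Bibliographic record

Author: Srivastava, Utkarsh Hriday
Author affiliation: Stanford University
Degree-granting institution: Stanford University
Discipline: Computer Science
Degree: Ph.D.
Year: 2006
Pages: 187 p.
Total pages: 187
Original format: PDF
Language: English
Chinese Library Classification: Automation technology, computer technology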

