首页> 外文学位 >Join processing in non-conventional databases.
【24h】

Join processing in non-conventional databases.

机译:非常规数据库中的联接处理。

获取原文
获取原文并翻译 | 示例

摘要

The join operation combines information from multiple data sources. Efficient processing of join queries is a pivotal issue in most database systems. My PhD research focuses on joins in two categories of novel applications. The first is continuous joins in data streams. Specifically, we exploit two key properties of the streaming join. First, the initial plan of a long query may gradually become inefficient due to changes in data characteristics. This necessitates dynamic plan migration, an online transition from the old plan to a more efficient one generated based on current statistics. The only known solutions MS and PT have some serious shortcomings. Hence, we propose HybMig, which combines their merits, and outperforms them on every aspect.;Another important property is that an output tuple from an upstream join (called the producer) may never generate any result in downstream operators (the consumers) during its entire lifespan. Motivated by this, we propose just-in-time (JIT) processing, a novel methodology that enables a producer to selectively generate outputs based on feedback returned from consumers that express their current demand. Extensive experiments show that JIT achieves significant savings in terms of both CPU time and memory consumption.;The second class of joins studied in this thesis are authenticated ones in outsourced databases. In particular, database outsourcing requires that the query server constructs a proof of result correctness, which can be verified by the client using the data owner's signature. Addressing such queries, we propose a comprehensive set of new solutions that cover the entire spectrum of index availability. Furthermore, we extend them to authenticate complex queries, involving multi-way joins and other relational operators. Our experiments demonstrate that, the proposed methods outperform two existing benchmark solutions, often by orders of magnitude.
机译:连接操作合并了来自多个数据源的信息。在大多数数据库系统中,有效处理联接查询是一个关键问题。我的博士研究专注于两类新颖应用程序的连接。首先是数据流中的连续联接。具体来说,我们利用流连接的两个关键属性。首先,由于数据特性的变化,长时间查询的初始计划可能会逐渐变得效率低下。这需要动态计划迁移,这是从旧计划到基于当前统计信息生成的更有效计划的在线过渡。唯一已知的解决方案MS和PT有一些严重的缺点。因此,我们提出了HybMig,它结合了它们的优点,并在各个方面都胜过它们。另一个重要特性是,上游联接(称为生产者)的输出元组在其运行期间可能永远不会对下游操作员(消费者)产生任何结果整个寿命。因此,我们提出了即时(JIT)处理方法,这是一种新颖的方法,它使生产者能够根据表示当前需求的消费者返回的反馈有选择地产生输出。大量的实验表明,JIT在CPU时间和内存消耗方面都实现了显着的节省。;本文研究的第二类联接是外包数据库中经过身份验证的联接。特别是,数据库外包要求查询服务器构造结果正确性的证明,客户端可以使用数据所有者的签名对其进行验证。针对此类查询,我们提出了一套全面的新解决方案,涵盖了整个索引可用性范围。此外,我们将其扩展为对复杂查询进行身份验证,其中涉及多路联接和其他关系运算符。我们的实验表明,所提出的方法通常比两个现有的基准解决方案好几个数量级。

著录项

  • 作者

    Yang, Yin.;

  • 作者单位

    Hong Kong University of Science and Technology (Hong Kong).;

  • 授予单位 Hong Kong University of Science and Technology (Hong Kong).;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2009
  • 页码 108 p.
  • 总页数 108
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号