首页> 外文会议>IEEE International Conference on Data Engineering Workshops >PatchIndex - Exploiting Approximate Constraints in Self-managing Databases
【24h】

PatchIndex - Exploiting Approximate Constraints in Self-managing Databases

机译:PatchIndex-在自我管理数据库中利用近似约束

获取原文

摘要

In the cloud environment, data warehouse solutions need to be self-managing in order to be usable without prior database administration knowledge. Additionally, data is typically not clean in these environments, as it is imported from various sources. As a consequence, automatic schema optimization as an important task of self-management becomes difficult without human interaction and data cleaning steps. Within this paper, we focus on constraint discovery as a subtask of schema optimization. Real world datasets with unclean data may not contain perfect constraints, as a minor part of the values hampers the definition of them. Therefore, we introduce the PatchIndex structure, which handles these exceptions to column constraints and enables self-management tools to discover and define approximate constraints on unclean data. We present “nearly unique column” and nearly sorted column” constraints, both managed by the generic PatchIndex structure. Furthermore, we provide mechanisms to discover these constraints and show how query performance can benefit from them for different use cases by integrating them into query optimization. Our evaluation shows that the PatchIndex structure offers opportunities for a significant performance boost in different use cases while enabling self-management tools to define constraints on unclean data.
机译:在云环境中,数据仓库解决方案需要自我管理才能在没有事先数据库管理知识的情况下使用。此外,在这些环境中,数据通常是不干净的,因为它们是从各种来源导入的。结果,如果没有人工交互和数据清理步骤,作为自我管理的重要任务的自动模式优化将变得困难。在本文中,我们将约束发现作为模式优化的子任务。包含不干净数据的现实世界数据集可能不包含完美约束,因为这些值的一小部分妨碍了它们的定义。因此,我们引入了PatchIndex结构,该结构处理列约束的这些异常,并使自我管理工具能够发现并定义对不干净数据的近似约束。我们提出了“几乎唯一的列”和“几乎排序的列”约束,它们都由通用的PatchIndex结构管理。此外,我们提供了发现这些约束的机制,并展示了如何通过将其集成到查询优化中来针对不同的用例从中受益于查询性能。我们的评估表明,PatchIndex结构提供了在不同用例中显着提高性能的机会,同时使自我管理工具能够定义对不干净数据的约束。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号