首页> 外文会议>International conference on management of data >Jigsaw: Efficient Optimization Over Uncertain Enterprise Data
【24h】

Jigsaw: Efficient Optimization Over Uncertain Enterprise Data

机译:拼图:在不确定的企业数据上有效优化

获取原文

摘要

Probabilistic databases, in particular ones that allow users to externally define models or probability distributions so called VG-Fimctions are an ideal tool for constructing, simulating and analyzing hypothetical business scenarios. Enterprises often use such tools with parameterized models and need to explore a large parameter space in order to discover parameter values that optimize for a given goal. Parameter space is usually very large, making such exploration extremely expensive. We present Jigsaw, a probabilistic database-based simulation framework that addresses this performance problem. In Jigsaw, users define what-if style scenarios as parameterized probabilistic database queries and identify parameter values that achieve desired properties. Jigsaw uses a novel "fingerprinting" technique that efficiently identifies correlations between a query's output distribution for different parameter values. Using fingerprints, Jigsaw is able to reuse work performed for different parameter values, and obtain speedups of as much as 2 orders of magnitude for several real business scenarios.
机译:概率数据库,特别是允许用户从外部定义模型或概率分布的数据库是一个理想的工具,用于构建,模拟和分析假设的业务场景。企业通常使用这些工具使用参数化模型,需要探索大参数空间,以便发现优化给定目标的参数值。参数空间通常非常大,使得这种探索非常昂贵。我们呈现拼图,一个基于概率的基于数据库的仿真框架,解决了这个性能问题。在拼图中,用户定义了什么样式方案作为参数化的概率数据库查询,并识别实现所需属性的参数值。拼图使用新颖的“指纹”技术,可有效地识别查询的输出分布与不同参数值之间的相关性。使用指纹,拼图可以重复使用为不同参数值执行的工作,并获得多个实际业务场景的2个级别的加速。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号