...
首页> 外文期刊>ACM Transactions on Programming Languages and Systems >Debugging Large-scale Datalog: A Scalable Provenance Evaluation Strategy
【24h】

Debugging Large-scale Datalog: A Scalable Provenance Evaluation Strategy

机译:调试大规模数据记录:可扩展的出处评估策略

获取原文
获取原文并翻译 | 示例
           

摘要

Logic programming languages such as Datalog have become popular as Domain Specific Languages (DSLs) for solving large-scale, real-world problems, in particular, static program analysis and network analysis. The logic specifications that model analysis problems process millions of tuples of data and contain hundreds of highly recursive rules. As a result, they are notoriously difficult to debug. While the database community has proposed several data provenance techniques that address the Declarative Debugging Challenge for Databases, in the cases of analysis problems, these state-of-the-art techniques do not scale.In this article, we introduce a novel bottom-up Datalog evaluation strategy for debugging: Our provenance evaluation strategy relies on a new provenance lattice that includes proof annotations and a new fixed-point semantics for semi-naive evaluation. A debugging query mechanism allows arbitrary provenance queries, constructing partial proof trees of tuples with minimal height. We integrate our technique into Souffle, a Datalog engine that synthesizes C++ code, and achieve high performance by using specialized parallel data structures. Experiments are conducted with DooP/DaCapo, producing proof annotations for tens of millions of output tuples. We show that our method has a runtime overhead of 1.31x on average while being more flexible than existing state-of-the-art techniques.
机译:Datalog等逻辑编程语言已成为域特定语言(DSL),以解决大规模,实际问题,特别是静态程序分析和网络分析。模型分析问题的逻辑规范处理数百万元数据并包含数百个高递归规则。结果,他们难以调试。虽然数据库社区提出了几种数据出处技术,用于解决数据库的陈述调试挑战,在分析问题的情况下,这些最先进的技术不扩展。在本文中,我们介绍了一部小说自下而上Datalog调试评估策略:我们的出处评估策略依赖于一个新的出处格子,包括证明注释和新的半天真评价的新的定点语义。调试查询机制允许任意种子查询,构建具有最小高度的组成部分的部分证明树。我们将技术集成到Souffle中,这是一种用于合成C ++代码的Datalog引擎,通过使用专门的并行数据结构来实现高性能。实验是用DOOP / DACAPO进行的,生产用于数百万个输出元组的证明注释。我们表明我们的方法平均具有1.31倍的运行时开销,而不是比现有的最先进技术更灵活。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号