首页> 外文会议>Datalog in academia and industry >Optimizing Large-Scale Semi-Naieve Datalog Evaluation in Hadoop
【24h】

Optimizing Large-Scale Semi-Naieve Datalog Evaluation in Hadoop

机译:在Hadoop中优化大型半天真数据日志评估

获取原文
获取原文并翻译 | 示例

摘要

We explore the design and implementation of a scalable Datalog system using Hadoop as the underlying runtime system. Observing that several successful projects provide a relational algebra-based programming interface to Hadoop, we argue that a natural extension is to add recursion to support scalable social network analysis, internet traffic analysis, and general graph query. We implement semi-naive evaluation in Hadoop, then apply a series of optimizations spanning fundamental changes to the Hadoop infrastructure to basic configuration guidelines that collectively offer a 10x improvement in our experiments. This work lays the foundation for a more comprehensive cost-based algebraic optimization framework for parallel recursive Datalog queries.
机译:我们探索了使用Hadoop作为底层运行时系统的可伸缩Datalog系统的设计和实现。观察到几个成功的项目为Hadoop提供了基于关系代数的编程接口,我们认为自然的扩展是添加递归以支持可扩展的社交网络分析,互联网流量分析和常规图查询。我们在Hadoop中实施半天真评估,然后将涵盖Hadoop基础结构基本变化的一系列优化应用到基本配置准则,这些准则在我们的实验中共同提高了10倍。这项工作为并行递归Datalog查询的更全面的基于成本的代数优化框架奠定了基础。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号