首页> 外文会议>The semantic web - ISWC 2009 >RAPID: Enabling Scalable Ad-Hoc Analytics on the Semantic Web
【24h】

RAPID: Enabling Scalable Ad-Hoc Analytics on the Semantic Web

机译:RAPID:在语义网上启用可扩展的即席分析

获取原文
获取原文并翻译 | 示例

摘要

As the amount of available RDF data continues to increase steadily, there is growing interest in developing efficient methods for analyzing such data. While recent efforts have focused on developing efficient methods for traditional data processing, analytical processing which typically involves more complex queries has received much less attention. The use of cost effective parallelization techniques such as Google's Map-Reduce offer significant promise for achieving Web scale analytics. However, currently available implementations are designed for simple data processing on structured data.rnIn this paper, we present a language, RAPID, for scalable ad-hoc analytical processing of RDF data on Map-Reduce frameworks. It builds on Yahoo's Pig Latin by introducing primitives based on a specialized join operator, the MD-join, for expressing analytical tasks in a manner that is more amenable to parallel processing, as well as primitives for coping with semi-structured nature of RDF data. Experimental evaluation results demonstrate significant performance improvements for analytical processing of RDF data over existing Map-Reduce based techniques.
机译:随着可用RDF数据量的持续稳定增长,对开发用于分析此类数据的有效方法的兴趣日益浓厚。尽管最近的工作集中在开发用于传统数据处理的有效方法上,但是通常涉及更复杂查询的分析处理却很少受到关注。具有成本效益的并行化技术(例如Google的Map-Reduce)的使用为实现Web规模分析提供了重大希望。但是,当前可用的实现被设计为对结构化数据进行简单的数据处理。在本文中,我们提出一种语言RAPID,用于在Map-Reduce框架上对RDF数据进行可扩展的即席分析处理。它建立在Yahoo的Pig Latin的基础上,通过引入基于专用联接运算符MD-join的基元来表达分析任务,该方式更适合并行处理,以及用于处理RDF数据的半结构化性质的基元。实验评估结果表明,与现有的基于Map-Reduce的技术相比,RDF数据的分析处理可显着提高性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号