Scalable SAPRQL Querying Processing on Large RDF Data in Cloud Computing Environment

机译：云计算环境中大型RDF数据的可扩展SAPRQL查询处理

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Recently the flexibility of RDF data model makes increasing number of organizations and communities keep their data available in the RDF format. There is a growing need for querying these data in scalable and efficient way. MapReduce is a parallel data processing solution for processing large data-intensive workloads, which is not supported directly for join-intensive workloads. In this paper, we present a schema based hybrid partitioning technique for RDF triples placement according to the relationships between them, and reduce the necessary number of MR cycles in each SAPRQL query job. Then we propose a lightweight sideways information passing techniques which pass the join information across MR jobs to decrease the intermediate results involved in join operations. The experimental results show that our approaches achieve a substantial performance improvement, and outperform the previous system by a factor of 2-20 using LUBM benchmark.

机译：最近，RDF数据模型的灵活性使得越来越多的组织和社区以RDF格式保持其数据可用。越来越需要以可扩展和高效的方式查询这些数据。 MapReduce是用于处理大型数据密集型工作负载的并行数据处理解决方案，而对于连接密集型工作负载则不直接支持。在本文中，我们根据RDF三元组之间的关系提出了一种基于模式的混合分区技术，用于RDF三元组放置，并减少了每个SAPRQL查询作业中所需的MR循环数。然后，我们提出了一种轻量级的横向信息传递技术，该技术可跨MR作业传递联接信息，以减少联接操作中涉及的中间结果。实验结果表明，使用LUBM基准测试，我们的方法可实现显着的性能改进，并且性能比以前的系统好2到20倍。

著录项

来源
《Joint international conference on pervasive computing and the networked world》|2013年|631-646|共16页
会议地点
作者
Buwen Wu; Hai Jin; Pingpeng Yuan;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
RDF Data; Partitioning; MapReduce; Cloud Computing;

机译：RDF数据;分区; MapReduce;云计算;

相似文献

外文文献
中文文献
专利

1. xStore: Federated temporal query processing for large scale RDF triples on a cloud environment [J] . Ahn Jinhyun, Eom Jae-Hong, Nam Sejin, Neurocomputing . 2017,第sepa20期

机译：xStore：在云环境中针对大型RDF三元组的联合时间查询处理
2. Heuristics-Based Query Processing for Large RDF Graphs Using Cloud Computing [J] . Husain Mohammad, McGlothlin James, Masud Mohammad M., Knowledge and Data Engineering, IEEE Transactions on . 2011,第9期

机译：使用云计算的大型RDF图基于启发式的查询处理
3. Adaptive mechanism for distributed query processing and data loading using the RDF data in the cloud [J] . Dharmaraj Chandrasekaran Ranichandra, Tripathy BalaKrushna International journal of communication systems . 2018,第15期

机译：使用云中的RDF数据进行分布式查询处理和数据加载的自适应机制
4. Scalable SAPRQL Querying Processing on Large RDF Data in Cloud Computing Environment [C] . Buwen Wu, Hai Jin, Pingpeng Yuan ICPCA 2012 . 2013

机译：云计算环境中大型RDF数据的可扩展SAPRQL查询处理
5. Scalable parallel computing on clouds: Efficient and scalable architectures to perform pleasingly parallel, MapReduce and iterative data intensive computations on cloud environments. [D] . Gunarathne, Thilina. 2014

机译：云上的可伸缩并行计算：高效且可伸缩的架构，可在云环境上执行令人满意的并行，MapReduce和迭代式数据密集型计算。
6. Processing SPARQL queries with regular expressions in RDF databases [O] . Jinsoo Lee, Minh-Duc Pham, Jihwan Lee, 2011

机译：使用RDF数据库中的正则表达式处理SPARQL查询
7. Elastic Spatial Query Processing in OpenStack Cloud Computing Environment for Time-Constraint Data Analysis [O] . Wei Huang, Wen Zhang, Dongying Zhang, 2017

机译：用于时间约束数据分析的Openstack云计算环境中的弹性空间查询处理

Scalable SAPRQL Querying Processing on Large RDF Data in Cloud Computing Environment

摘要

著录项

相似文献

相关主题

期刊订阅