Adaptive Distributed RDF Graph Fragmentation and Allocation based on Query Workload

Peng Peng; Zou Lei; Chen Lei; Zhao Dongyan

Abstract

As massive volumes of Resource Description Framework (RDF) data are growing, designing a distributed RDF database system to manage them is necessary. In designing this system, it is very common to partition the RDF data into some parts, called fragments, which are then distributed. Thus, the distribution design comprises two steps: fragmentation and allocation. In this study, we explore the workload for fragmentation and allocation, which aims to reduce the communication cost during SPARQL query processing. Specifically, we adaptively maintain some frequent access patterns (FAPs) to reflect the characteristics of the workload while ensuring the data integrity and approximation ratio. Based on these frequent access patterns, we propose three fragmentation strategies, namely vertical, horizontal, and mixed fragmentation, to divide RDF graphs while meeting different types of query processing objectives. After fragmentation, we discuss how to allocate these fragments to various sites while balancing the fragments. Finally, we discuss how to process queries based on the results of fragmentation and allocation. Experiments over large RDF datasets confirm the superior performance of our proposed solutions.
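
The two-step design described in the abstract (fragmentation, then allocation) can be illustrated with a toy sketch. This is not the authors' algorithm: it assumes that each query is reduced to the set of predicates it touches, that a "frequent access pattern" is simply a predicate set seen at least `min_support` times, and that allocation is a greedy largest-first assignment to the least-loaded site. All names and data below are hypothetical.

```python
from collections import Counter

# Toy RDF triples (subject, predicate, object); hypothetical data.
TRIPLES = [
    ("alice", "knows", "bob"),
    ("bob", "knows", "carol"),
    ("alice", "worksAt", "acme"),
    ("bob", "worksAt", "acme"),
    ("carol", "livesIn", "paris"),
    ("alice", "livesIn", "rome"),
]

# Toy workload: each query is simplified to the predicates it accesses.
WORKLOAD = [
    frozenset({"knows", "worksAt"}),
    frozenset({"knows", "worksAt"}),
    frozenset({"livesIn"}),
    frozenset({"knows"}),
]


def frequent_patterns(workload, min_support):
    """Return predicate sets that occur at least min_support times."""
    counts = Counter(workload)
    return [p for p, c in counts.most_common() if c >= min_support]


def vertical_fragments(triples, patterns):
    """Group triples so that data matching one pattern stays together.

    Each triple goes to the first pattern containing its predicate;
    uncovered triples form a final catch-all fragment, so no triple is lost.
    """
    fragments = [[] for _ in patterns]
    rest = []
    for triple in triples:
        for i, pattern in enumerate(patterns):
            if triple[1] in pattern:
                fragments[i].append(triple)
                break
        else:
            rest.append(triple)
    if rest:
        fragments.append(rest)
    return fragments


def allocate(fragments, num_sites):
    """Greedy balancing: place the largest fragment on the least-loaded site."""
    sites = [[] for _ in range(num_sites)]
    loads = [0] * num_sites
    for frag in sorted(fragments, key=len, reverse=True):
        target = loads.index(min(loads))
        sites[target].append(frag)
        loads[target] += len(frag)
    return sites, loads


patterns = frequent_patterns(WORKLOAD, min_support=2)
fragments = vertical_fragments(TRIPLES, patterns)
sites, loads = allocate(fragments, num_sites=2)
```

Keeping the triples of one frequent pattern in a single fragment means a query matching that pattern can be answered at one site, which is the intuition behind reducing communication cost. A horizontal variant would instead spread the matches of one pattern across several fragments to allow parallel evaluation.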

Bibliographic details

Source
IEEE Transactions on Knowledge and Data Engineering | 2019, Issue 4 | pp. 670-685 | 16 pages
Authors
Peng Peng; Zou Lei; Chen Lei; Zhao Dongyan
Author affiliations

Hunan Univ Changsha 410006 Hunan Peoples R China;

Peking Univ Inst Comp Sci & Technol Beijing 100080 Peoples R China;

Hong Kong Univ Sci & Technol Dept Comp Sci & Engn Hong Kong Peoples R China;

Peking Univ Inst Comp Sci & Technol Beijing 100080 Peoples R China;

Indexing information
Original format: PDF
Language: English
Keywords
Distributed RDF database; data fragmentation; data allocation; query workload;

Similar literature

1. Adaptive Distributed RDF Graph Fragmentation and Allocation based on Query Workload [J]. Peng Peng, Zou Lei, Chen Lei. IEEE Transactions on Knowledge and Data Engineering. 2019, Issue 4.

2. Distributed Pregel-based provenance-aware regular path query processing on RDF knowledge graphs [J]. Xin Wang, Simiao Wang, Yueqi Xin. World Wide Web. 2020, Issue 3.

3. A Novel Query-Driven Clustering-Based Technique for Vertical Fragmentation and Allocation in Distributed Database Systems [J]. Adel A. Sewisy, Ali Abdullah Amer, Hassan I. Abdalla. International Journal on Semantic Web and Information Systems. 2017, Issue 2.

4. Adaptive Workload-Based Partitioning and Replication for RDF Graphs [C]. Ahmed Al-Ghezi, Lena Wiese. International conference on database and expert systems applications; International workshop on big data management in cloud systems; International workshop on biological knowledge discovery; International workshop on technologies for information retrieval. 2018.

5. A graph based cache system for efficient querying in distributed triplestores [D]. Devadithya, Tharaka. 2008.

6. SPANG: a SPARQL client supporting generation and reuse of queries for distributed RDF databases [O]. Hirokazu Chiba, Ikuo Uchiyama. 2017.

7. Towards Load Balancing and Parallelizing of RDF Query Processing in P2P Based Distributed RDF Data Stores [O]. Liaquat Ali, Thomas Janson, Christian Schindelhauer. 2015.
