Adaptive Distributed RDF Graph Fragmentation and Allocation based on Query Workload

Peng Peng; Zou Lei; Chen Lei; Zhao Dongyan

首页> 外文期刊>IEEE Transactions on Knowledge and Data Engineering >Adaptive Distributed RDF Graph Fragmentation and Allocation based on Query Workload

【24h】

Adaptive Distributed RDF Graph Fragmentation and Allocation based on Query Workload

机译：基于查询工作量的自适应分布式RDF图分段与分配

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

As massive volumes of Resource Description Framework (RDF) data are growing, designing a distributed RDF database system to manage them is necessary. In designing this system, it is very common to partition the RDF data into some parts, called fragments, which are then distributed. Thus, the distribution design comprises two steps: fragmentation and allocation. In this study, we explore the workload for fragmentation and allocation, which aims to reduce the communication cost during SPARQL query processing. Specifically, we adaptively maintain some frequent access patterns (FAPs) to reflect the characteristics of the workload while ensuring the data integrity and approximation ratio. Based on these frequent access patterns, we propose three fragmentation strategies, namely vertical, horizontal, and mixed fragmentation, to divide RDF graphs while meeting different types of query processing objectives. After fragmentation, we discuss how to allocate these fragments to various sites while balancing the fragments. Finally, we discuss how to process queries based on the results of fragmentation and allocation. Experiments over large RDF datasets confirm the superior performance of our proposed solutions.

机译：随着大量的资源描述框架（RDF）数据的增长，有必要设计一个分布式RDF数据库系统来管理它们。在设计该系统时，将RDF数据划分为一些部分（称为片段）然后分发，这是很常见的。因此，分发设计包括两个步骤：分段和分配。在这项研究中，我们探索了碎片和分配的工作量，目的是减少SPARQL查询处理期间的通信成本。具体来说，我们在确保数据完整性和近似率的同时，自适应地维护一些频繁访问模式（FAP）以反映工作负载的特征。基于这些频繁访问模式，我们提出了三种分段策略，即垂直，水平和混合分段，以在满足不同类型的查询处理目标的同时划分RDF图。碎片之后，我们讨论如何在平衡碎片的同时将这些碎片分配到各个站点。最后，我们讨论如何根据碎片和分配的结果处理查询。在大型RDF数据集上进行的实验证实了我们提出的解决方案的出色性能。

著录项

来源
《IEEE Transactions on Knowledge and Data Engineering》 |2019年第4期|670-685|共16页
作者
Peng Peng; Zou Lei; Chen Lei; Zhao Dongyan;
展开▼
作者单位

Hunan Univ, Changsha 410006, Hunan, Peoples R China;

Peking Univ, Inst Comp Sci & Technol, Beijing 100080, Peoples R China;

Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Peoples R China;

Peking Univ, Inst Comp Sci & Technol, Beijing 100080, Peoples R China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Distributed RDF database; data fragmentation; data allocation; query workload;

机译：分布式RDF数据库;数据碎片;数据分配;查询工作负载;

相似文献

外文文献
中文文献
专利

1. Adaptive Distributed RDF Graph Fragmentation and Allocation based on Query Workload [J] . Peng Peng, Zou Lei, Chen Lei, IEEE Transactions on Knowledge and Data Engineering . 2019,第4期

机译：基于查询工作负载的自适应分布式RDF图碎片和分配
2. Distributed Pregel-based provenance-aware regular path query processing on RDF knowledge graphs [J] . Xin Wang, Simiao Wang, Yueqi Xin, World Wide Web . 2020,第3期

机译：基于PREGER的基于求购感知常规路径查询处理RDF知识图表
3. A Novel Query-Driven Clustering-Based Technique for Vertical Fragmentation and Allocation in Distributed Database Systems [J] . Adel A. Sewisy, Ali Abdullah Amer, Hassan I. Abdalla International journal on Semantic Web and information systems . 2017,第2期

机译：一种新的基于查询驱动的基于聚类技术，用于分布式数据库系统中的垂直碎片和分配
4. Adaptive Workload-Based Partitioning and Replication for RDF Graphs [C] . Ahmed Al-Ghezi, Lena Wiese International conference on database and expert systems applications;International workshop on big data mamagement in cloud systems;International workshop on biological knowledge discovery;International workshop on technologies for information retrieval . 2018

机译：RDF图的基于工作负载的自适应分区和复制
5. A graph based cache system for efficient querying in distributed triplestores. [D] . Devadithya, Tharaka. 2008

机译：基于图的缓存系统，可在分布式三元存储中进行有效查询。
6. SPANG: a SPARQL client supporting generation and reuse of queries for distributed RDF databases [O] . Hirokazu Chiba, Ikuo Uchiyama 2017

机译：SPANG：SPARQL客户端支持生成和重用分布式RDF数据库的查询
7. Towards Load Balancing and Parallelizing of RDF Query Processing in P2P Based Distributed RDF Data Stores [O] . Liaquat Ali, Thomas Janson, Christian Schindelhauer 2015

机译：基于p2p的分布式RDF数据存储中RDF查询处理的负载均衡与并行化

Adaptive Distributed RDF Graph Fragmentation and Allocation based on Query Workload

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅