Query processing on large graphs: Approaches to scalability and response time trade offs

首页> 外文期刊>Data & Knowledge Engineering >Query processing on large graphs: Approaches to scalability and response time trade offs

【24h】

Query processing on large graphs: Approaches to scalability and response time trade offs

机译：大型图上的查询处理：可伸缩性和响应时间折衷的方法

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Graphs, being an expressive data structure, have become increasingly important for modeling real-world applications, such as collaboration, different kinds of transactions, social networks, to name a few. With the advent of social networks and the web, the graph sizes have grown too large to fit in main memory precipitating the need for alternative approaches for an efficient, scalable evaluation of queries on graphs of any size.In this paper, we use the time-tested "divide and conquer" approach by partitioning a graph into desired number of partitions (and possibly with appropriate characteristics) and process queries over those partitions to obtain all or specified number of answers. This entails correctly computing answers that span multiple partitions or even need the same partition more than once. Given a set of partitions, there are a number of approaches using which a query can be evaluated: (i) One Partition At a Time (OPAT) approach, (ii) Traditional use of Multiple Processors (TraditionalMP), and (iii) using the Map/Reduce Multi-Processor approach (MapReduceMP) approach. The first approach, detailed in this paper, has established scalability through independent processing of partitions. The other two approaches address response time in addition to scalability. For the OPAT query evaluation approach, necessary minimal book keeping has been identified and its correctness established in this paper. Query answering on partitioned graphs also requires analyzing partitioning schemes for their impact on query processing and determining the number as well as the sequence in which partitions need to be loaded to reduce the response time for processing queries. We correlate query properties and partition characteristics to reduce query processing time in terms of the resources available.We also identify a set of quantitative metrics and use them for formulating heuristics to determine the order of loading partitions for efficient query processing. For OPAT approach, extensive experiments on large graphs (synthetic and real-world) using different partitioning schemes analyze the proposed heuristics on a variety of query types. The other two approaches are fleshed out, analyzed, and contrasted with the OPAT approach. An existing graph querying system has been extended to evaluate queries on partitioned graphs. Finally all three approaches are compared for their strengths and weaknesses.

机译：图形作为一种具有表现力的数据结构，对于建模现实应用程序（例如协作，各种交易，社交网络等）已变得越来越重要。随着社交网络和网络的出现，图的大小已经变得太大而无法容纳在主内存中，因此需要使用其他方法来对任何大小的图进行高效，可扩展的查询评估。在本文中，我们使用了时间经过测试的“分而治之”方法，将图形划分为所需数量的分区（并可能具有适当的特征），并处理这些分区上的查询以获得全部或指定数量的答案。这需要正确计算跨越多个分区甚至多次需要同一分区的答案。给定一组分区，可以使用多种方法来评估查询：（i）一次分区（OPAT）方法;（ii）传统使用多处理器（TraditionalMP）;以及（iii）使用Map / Reduce多处理器方法（MapReduceMP）方法。本文详细介绍的第一种方法已通过独立处理分区建立了可伸缩性。除可伸缩性外，其他两种方法还解决了响应时间。对于OPAT查询评估方法，已确定了必要的最小簿记并确定了其正确性。对分区图的查询答复还需要分析分区方案对查询处理的影响，并确定需要加载分区的数量和顺序，以减少处理查询的响应时间。我们将查询属性和分区特性关联起来，以减少可用资源方面的查询处理时间。我们还确定了一组定量指标，并使用它们来制定启发式方法以确定加载分区的顺序以进行有效的查询处理。对于OPAT方法，使用不同的分区方案在大型图（合成图和实际图）上进行的大量实验分析了各种查询类型上的启发式算法。对其他两种方法进行了充实，分析，并与OPAT方法进行了对比。现有的图查询系统已扩展为评估分区图上的查询。最后，比较了这三种方法的优缺点。

著录项

来源
《Data & Knowledge Engineering》 |2020年第3期|101736.1-101736.16|共16页
作者

展开▼
作者单位

UT Arlington IT Lab Arlington TX 76019 USA;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Graph query processing; Plan generation; Query evaluation on partitioned graphs; Scalability; Map/Reduce;

机译：图查询处理;计划生成;查询分区图的评估;可扩展性;映射/缩小;

相似文献

外文文献
中文文献
专利

1. Constant versus variable response signal delays in speed-accuracy trade-offs: Effects of advance preparation for processing time [J] . Miller Jeff, Sproesser Gudrun, Ulrich Rolf Perception & psychophysics . 2008,第5期

机译：速度与精度之间的恒定与可变响应信号延迟：提前准备处理时间的影响
2. A bicriterion approach to time/cost trade-offs in scheduling with convex resource-dependent job processing times and release dates [J] . Moshe Kaspi, Dvir Shabtay Computers & operations research . 2006,第10期

机译：调度中的时间/成本权衡的双标准方法，具有凸出的依赖资源的作业处理时间和发布日期
3. Space-time trade-offs for some ranking and searching queries [J] . Adrian Dumitrescu, William Steiger Information Processing Letters . 2001,第5期

机译：某些排名和搜索查询的时空权衡
4. Time-completeness trade-offs in record linkage using adaptive query processing [C] . Roald Lengu, Paolo Missier, Alvaro A. A. Fernandes, International Conference on Extending Database Technology . 2009

机译：使用自适应查询处理的记录链接中的时间完整性权衡
5. MR_QP: A Scalable Approach to Query Processing on Arbitrary-Size Graphs Using the Map/Reduce Framework [D] . ?Modi, Harshit 2020

机译：MR_QP ：可扩展的方法来查询处理任意尺寸图使用的Map / Reduce 框架
6. Evolutionary trade-offs at two time-scales: competition versus persistence. [O] . M Keeling 2000

机译：在两个时间尺度上的进化权衡：竞争与持久性。
7. TIME-COMPLETENESS TRADE-OFFS IN RECORD LINKAGE USING ADAPTIVE QUERY PROCESSING [O] . LENGU R, MISSIER P, FERNANDES A, 2009

机译：自适应查询处理在记录链接中完成时间的权衡

Query processing on large graphs: Approaches to scalability and response time trade offs

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅