首页> 外文会议>IEEE international conference on data engineering >Evaluating multi-way joins over discounted hitting time
【24h】

Evaluating multi-way joins over discounted hitting time

机译:评估多路接入折扣的时间

获取原文

摘要

The discounted hitting time (DHT), which is a random-walk similarity measure for graph node pairs, is useful in various applications, including link prediction, collaborative recommendation, and reputation ranking. We examine a novel query, called the multi-way join (or n-way join), on DHT scores. Given a graph and n sets of nodes, the n-way join retrieves a set of n-tuples with the k highest scores, according to some aggregation function of DHT values. This query enables analysis and prediction of complex relationship among n sets of nodes. Since an n-way join is expensive to compute, we develop the Partial Join algorithm (or PJ). This solution decomposes an n-way join into a number of top-m 2-way joins, and combines their results to construct the answer of the n-way join. Since PJ may necessitate the computation of top-(m+ 1) 2-way joins, we study an incremental solution, which allows the top-(m+ 1) 2-way join to be derived quickly from the top-m 2-way join results earlier computed. We further examine fast processing and pruning algorithms for 2-way joins. An extensive evaluation on three real datasets shows that PJ accurately evaluates n-way joins, and is four orders of magnitude faster than basic solutions.
机译:折扣的击球时间(DHT)是图形节点对的随机步道相似度量,可用于各种应用,包括链路预测,协作推荐和声誉排名。我们检查DHT分数的新型查询,称为多路连接(或N-Way连接)。根据DHT值的一些聚合函数,给定图形和N个节点,N-Way加入用K最高分的分数检索一组N元组。该查询能够分析和预测N个节点中的复杂关系。由于N-Way加入昂贵,因此开发了部分连接算法(或PJ)。该解决方案将N-Way连接分解为多个Top-M的2路连接,并结合其结果构造N-Way连接的答案。由于PJ可能需要计算顶部(M + 1)2路联接,我们研究增量解决方案,允许从顶部-M 2路上快速导出顶部(M + 1)2路加入结果早期计算。我们进一步检查了用于双向连接的快速处理和修剪算法。对三个实时数据集的广泛评估显示PJ准确地评估了N-Way连接,并且比基本解决方案快四个数量级。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号