...
首页> 外文期刊>Journal of Information Science >A flexible aggregation framework on large-scale heterogeneous information networks
【24h】

A flexible aggregation framework on large-scale heterogeneous information networks

机译:大规模异构信息网络上的灵活聚合框架

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

OLAP (On-line Analytical Processing) can provide users with aggregate results from different perspectives and granularities. With the advent of heterogeneous information networks that consist of multi-type, interconnected nodes, such as bibliographic networks and knowledge graphs, it is important to study flexible aggregation in such networks. The aggregation results by existing work are limited to one type of node, which cannot be applied to aggregation on multi-type nodes, and relations in large-scale heterogeneous information networks. In this paper, we investigate the flexible aggregation problem on large-scale heterogeneous information networks, which is defined on multi-type nodes and relations. Moreover, by considering both attributes and structures, we propose a novel function based on graph entropy to measure the similarities of nodes. Further, we prove that the aggregation problem based on the function is NP-hard. Therefore, we develop an efficient heuristic algorithm for aggregation in two phases: informational aggregation and structural aggregation. The algorithm has linear time and space complexity. Extensive experiments on real-world datasets demonstrate the effectiveness and efficiency of the proposed algorithm.
机译:OLAP(在线分析处理)可以从不同的角度和粒度为用户提供汇总结果。随着由多种类型,相互连接的节点组成的异构信息网络(如书目网络和知识图)的出现,研究此类网络中的灵活聚合非常重要。现有工作的聚合结果仅限于一种类型的节点,不能应用于多类型节点上的聚合,以及大规模异构信息网络中的关系。在本文中,我们研究了在大型异构信息网络上的灵活聚合问题,该问题定义在多类型节点和关系上。此外,通过考虑属性和结构,我们提出了一种基于图熵的新颖函数来度量节点的相似性。此外,我们证明了基于函数的聚集问题是NP难的。因此,我们在两个阶段开发了一种高效的启发式聚合算法:信息聚合和结构聚合。该算法具有线性时间和空间复杂度。在现实世界数据集上的大量实验证明了该算法的有效性和效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号