【24h】

Continuous Similarity Computation over Streaming Graphs

机译:流图的连续相似度计算

获取原文

摘要

Large network analysis is a very important topic in data mining. A significant body of work in the area studies the problem of node similarity. One way to express node similarity is to associate with each node the set of 1-hop neighbors and compute the Jaccard similarity between these sets. This information can be used subsequently for more complex operations like link prediction, clustering or dense subgraph discovery. In this work, we study algorithms to monitor the result of a similarity join between nodes continuously, assuming a sliding window accommodating graph edges. Since the arrival of a new edge or the expiration of an existing one may change the similarity between several node pairs, the challenge is to maintain the similarity join result as efficiently as possible. Our theoretical study is validated by a thorough experimental evaluation, based on real-world as well as synthetically generated graphs, demonstrating the superiority of the proposed technique in comparison to baseline approaches.
机译:大型网络分析是数据挖掘中非常重要的主题。该领域的一项重要工作研究了节点相似性问题。表达节点相似性的一种方法是与每个节点关联1跳邻居集合,并计算这些集合之间的Jaccard相似性。此信息可随后用于更复杂的操作,例如链接预测,聚类或密集子图发现。在这项工作中,我们假设连续滑动窗口可容纳图形边缘,我们将研究算法来连续监视节点之间的相似性连接结果。由于新边缘的到来或现有边缘的到期可能会更改多个节点对之间的相似性,因此面临的挑战是尽可能高效地保持相似性连接结果。我们的理论研究通过基于真实世界以及合成生成的图形的全面实验评估得到了验证,证明了与基线方法相比,所提出的技术的优越性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号