首页> 外文期刊>ACM transactions on knowledge discovery from data >Social Influence Based Clustering and Optimization over Heterogeneous Information Networks
【24h】

Social Influence Based Clustering and Optimization over Heterogeneous Information Networks

机译:基于社会影响力的异构信息网络聚类与优化

获取原文

摘要

Social influence analysis has shown great potential for strategic marketing decision. It is well known that people influence one another based on both their social connections and the social activities that they have engaged in the past. In this article, we develop an innovative and high-performance social influence based graph clustering framework with four unique features. First, we explicitly distinguish social connection based influence (self-influence) and social activity based influence (co-influence). We compute the self-influence similarity between two members based on their social connections within a single collaboration network, and compute the co-influence similarity by taking into account not only the set of activities that people participate but also the semantic association between these activities. Second, we define the concept of influence-based similarity by introducing a unified influence-based similarity matrix that employs an iterative weight update method to integrate self-influence and co-influence similarities. Third, we design a dynamic learning algorithm, called SI-Cluster, for social influence based graph clustering. It iteratively partitions a large social collaboration network into K clusters based on both the social network itself and the multiple associated activity information networks, each representing a category of activities that people have engaged. To make the SI-Cluster algorithm converge fast, we transform sophisticated nonlinear fractional programming problem with respect to multiple weights into a straightforward nonlinear parametric programming problem of single variable. Finally, we develop an optimization technique of diagonalizable-matrix approximation to speed up the computation of self-influence similarity and co-influence similarities. Our SI-Cluster-Opt significantly improves the efficiency of SI-Cluster on large graphs while maintaining high quality of clustering results. Extensive experimental evaluation on three real-world graphs shows that, compared to existing representative graph clustering algorithms, our SI-Cluster-Opt approach not only achieves a very good balance between self-influence and co-influence similarities but also scales extremely well for clustering large graphs in terms of time complexity while meeting the guarantee of high density, low entropy and low Davies-Bouldin Index.
机译:社会影响力分析显示了战略性营销决策的巨大潜力。众所周知,人们基于他们的社交关系和过去从事的社交活动而相互影响。在本文中,我们将开发一种具有四个独特功能的,基于创新和高性能,具有社会影响力的图聚类框架。首先,我们明确区分基于社交联系的影响(自我影响)和基于社交活动的影响(共同影响)。我们基于两个成员在单个协作网络中的社交联系来计算他们之间的自我影响相似度,并不仅考虑人们参与的活动集,还考虑这些活动之间的语义关联来计算共同影响相似度。其次,我们通过引入统一的基于影响的相似性矩阵来定义基于影响的相似性的概念,该矩阵采用迭代权重更新方法来整合自影响和共影响相似性。第三,我们设计了一种动态学习算法,称为SI-Cluster,用于基于社会影响力的图聚类。它基于社交网络本身和多个关联的活动信息网络,将一个大型的社交协作网络迭代地划分为K个群集,每个代表着人们参与的活动类别。为了使SI-Cluster算法快速收敛,我们将关于多个权重的复杂的非线性分数规划问题转化为单变量的直接非线性参数规划问题。最后,我们开发了对角化矩阵逼近的优化技术,以加快自影响相似度和共影响相似度的计算。我们的SI-Cluster-Opt可显着提高大型图上SI-Cluster的效率,同时保持高质量的聚类结果。在三个真实世界的图形上进行的广泛实验评估表明,与现有的代表性图形聚类算法相比,我们的SI-Cluster-Opt方法不仅可以在自影响和共影响相似性之间实现很好的平衡,而且可以很好地扩展聚类时间复杂度较大的图表,同时满足高密度,低熵和低Davies-Bouldin指数的保证。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号