首页> 外文会议>International Conference on Advanced Computer Science and Information Systems >Combining topological and topical features for community detection
【24h】

Combining topological and topical features for community detection

机译:结合拓扑和主题功能进行社区检测

获取原文

摘要

Community detection is an important approach to identify community's structure in a network and can also be considered as graph clustering. This paper conducted a research about community detection using combined topological and topical features in Twitter. The combined features were compared to topological only and topical only. The topological features that were used are following-follower relationship and retweet-favorite ratio while topical features are hashtags, mentions, links and tweets. This research proposed a new node weight using retweet-favorite ratio to build topological matrix and it has been proved to have higher purity value by 30–40% and higher rand index value by 10–20%. The purity value of combining topological and topical features is also improved by 30% compared to using following-follower relationship as topological features. The highest rand index and purity values are achieved by matrix of combinied topological and topical features with multilevel community detection as clustering algorithm with 0.89 and 0.77.
机译:社区检测是识别网络中社区结构的重要方法,也可以视为图聚类。本文对Twitter中结合使用拓扑和主题功能的社区检测进行了研究。将组合的功能与仅拓扑和仅局部比较。所使用的拓扑功能是关注者跟随关系和转发-收藏夹比率,而主题功能是标签,提及,链接和推文。这项研究提出了一种新的节点权重,它使用转推-收藏夹比率构建拓扑矩阵,并被证明具有30–40%的更高纯度值和10–20%的更高兰德指数值。与使用跟随跟随者关系作为拓扑特征相比,组合拓扑特征和局部特征的纯度值也提高了30%。最高的兰德指数和纯度值是通过组合拓扑和主题特征的矩阵以及具有0.89和0.77的聚类算法的多级社区检测来实现的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号