首页> 外文期刊>Future generation computer systems >Probabilistic data structure-based community detection and storage scheme in online social networks
【24h】

Probabilistic data structure-based community detection and storage scheme in online social networks

机译:在线社交网络中基于概率数据结构的社区检测和存储方案

获取原文
获取原文并翻译 | 示例

摘要

With the wide popularity of various social network sites such as - Facebook, Twitter, and Instagram; processing, storage, and analysis of a large volume of data are becoming challenging issues. In general, social networks are assumed to be graphs having nodes for representation of a group of persons in order to explore the relationship between them for Social Network Analysis (SNA). Analysts claim that interconnections in these networks are the reflection of social structure of individual where personalities with common attributes often occupy similar positions. Such similarities are caused by the prospects, opportunities, sensitivities and perceptions created by these similar network positions. Thus, clustering of these individuals is necessary to analyse their common characteristics. However, most of the existing clustering algorithms considered for community detection in SNA have high memory requirements especially in online social networks. So, to mitigate these issues, this paper proposes a novel clustering scheme for community detection for fast access, storage and retrieval of data using Probabilistic Data Structures (PDS). In the proposed scheme, Bloom filter has been used for clustering and Quotient filter has been used for storage and access of cluster nodes. It has been experimentally proved that the proposed scheme provides significant improvement in computational time which is reduced by 64% and 79% respectively in comparison to the linked list and adjacency matrix. Moreover, Quotient filter based storage schema significantly enhances the effectiveness of the proposed scheme over conventional storage methods. (C) 2018 Elsevier B.V. All rights reserved.
机译:随着各种社交网站的广泛普及,例如-Facebook,Twitter和Instagram;处理,存储和分析大量数据正在成为具有挑战性的问题。通常,社交网络被假定为具有用于代表一组人的节点的图,以便为社交网络分析(SNA)探索他们之间的关系。分析人士认为,这些网络中的相互联系是个人社会结构的反映,具有共同属性的人通常占据相似的位置。这些相似性是由这些相似的网络位置所产生的前景,机会,敏感性和感知引起的。因此,将这些个体聚在一起是分析他们共同特征的必要条件。但是,在SNA中考虑用于社区检测的大多数现有聚类算法对内存的要求都很高,尤其是在在线社交网络中。因此,为缓解这些问题,本文提出了一种新的集群方案,用于使用概率数据结构(PDS)快速访问,存储和检索数据的社区检测。在提出的方案中,布隆过滤器已用于聚类,商过滤器已用于聚类节点的存储和访问。实验证明,该方案显着改善了计算时间,与链表和邻接矩阵相比,分别减少了64%和79%。此外,基于商过滤器的存储方案比常规存储方法显着增强了所提出方案的有效性。 (C)2018 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号