Finding User Clusters in Sina Microblog

机译：在新浪微博中查找用户群

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Sina microblog has been a very popular social microblog service in recent years. However it's difficult to analyze the network structure of Sina microblog because of the huge amount of users. The emergence of cloud computing gives us a new approach to analyze large-scale social networks. Hadoop is a widely used cloud computing platform, several clustering algorithms such as K-means and Canopy have already been implemented on it. However, the initial cluster centers of K-means are hard to select. Canopy provides a way to choose initial centers, but it is not suitable for very large data sets, and both traditional K-means and Canopy K-means converge very slowly. This paper proposes an improved method to cluster microblog users based on their relationship. We name our method "Weight Partitioned Canopy K-means" (WPCK), implement it on Hadoop cluster, and test it along with existing methods. Experimental results show that WPCK can reduce the number of iterations by about 1/3 of traditional K-means and Canopy K-means, while their performance are almost the same.

机译：近年来，新浪微博已成为非常流行的社交微博服务。但是由于用户数量巨大，很难分析新浪微博的网络结构。云计算的出现为我们提供了一种分析大型社交网络的新方法。 Hadoop是一个广泛使用的云计算平台，已经在其上实现了多种聚类算法，例如K-means和Canopy。但是，很难选择K均值的初始聚类中心。 Canopy提供了一种选择初始中心的方法，但是它不适用于非常大的数据集，并且传统的K均值和Canopy K均值都非常缓慢地收敛。本文提出了一种基于微博用户关系的聚类方法。我们将方法命名为“ Weight Partitioned Canopy K-means”（WPCK），在Hadoop集群上实现该方法，并与现有方法一起对其进行测试。实验结果表明，WPCK可以将迭代次数减少传统K均值和Canopy K均值的1/3，而它们的性能几乎相同。

著录项

来源
《International Symposium on Computational Intelligence and Design》|2013年|406-409|共4页
会议地点
作者
Pei Kang; Niu Kai; He Zhiqiang; He Xuan;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Clustering; Hadoop; Microblog; Social Network;

机译：集群; Hadoop;微博;社交网络;
入库时间 2022-08-26 15:12:57

相似文献

外文文献
中文文献
专利

1. Celebrity and ordinary users： A comparative study of microblog user behaviors on Sina Weibo [J] . Xingjun LIU123, Weijun WANG23, Jinke WU1 中国文献情报：英文版 . 2015,第002期

机译：名人与普通用户：新浪微博上微博用户行为的比较研究
2. Modeling the heterogeneity of human dynamics based on the measurements of influential users in Sina Microblog [J] . Wang Chenxu, Guan Xiaohong, Qin Tao, Physica, A. Statistical mechanics and its applications . 2015,第Null期

机译：基于新浪微博中有影响力用户的测量，对人类动力学的异质性进行建模
3. Recommending Mobile Microblog Users via a Tensor Factorization Based on User Cluster Approach [J] . Liao Xiangwen, Zhang Lingying, Wei Jingjing, Wireless communications & mobile computing . 2018,第1期

机译：基于用户群方法的张量分解推荐移动微博用户
4. Finding User Clusters in Sina Microblog [C] . Pei Kang, Niu Kai, He Zhiqiang, International Symposium on Computational Intelligence and Design . 2013

机译：在新浪微博找到用户群集
5. Microblog search and word clouds: The impact of word clouds on user satisfaction during microblog searches. [D] . Haber, Jonathan. 2010

机译：微博客搜索和词云：在微博客搜索期间，词云对用户满意度的影响。
6. Suicide Communication on Social Media and Its Psychological Mechanisms: An Examination of Chinese Microblog Users [O] . Qijin Cheng, Chi Leung Kwok, Tingshao Zhu, 2015

机译：社交媒体上的自杀交流及其心理机制：对中国微博用户的考察
7. A Method of Finding Hidden Key Users Based on Transfer Entropy in Microblog Network [O] . 2020

机译：一种基于微博网络传输熵的隐藏密钥用户的方法
8. PKUICST at TREC 2014 Microblog Track: Feature Extraction for Effective Microblog Search and Adaptive Clustering Algorithms for TTG. [R] . Lv, C., Fan, F., Qiang, R., 2014

机译：2014年TREC上的pKUICsT微博跟踪：TTG的有效微博搜索和自适应聚类算法的特征提取。

Finding User Clusters in Sina Microblog

摘要

著录项

相似文献

相关主题

期刊订阅