Microblog Hotspot Discovery Method Based on Improved K-Means Algorithm

机译：基于改进的K均值算法的MicroBlog热点发现方法

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

The K-means algorithm is one of the most frequently used clustering algorithms in hot topic discovery. However, due to its shortcomings such as the number of clusters K value and easy to fall into local optimum, the clustering accuracy is not high, which directly affects the quality of hotspot discovery. This paper proposes an improved K-means algorithm to achieve fast clustering of microblog texts. Combining the high-frequency words and similarities of the microblog texts to perform single-pass clustering, the K number of clusters and the initial clustering center are obtained, which solves the problem that the K-means algorithm is too sensitive to the K value and the initial center. Through experimental comparison and analysis, it makes up for the shortcomings of K-means algorithm, and effectively improves the efficiency and accuracy of clustering. Applying it to the hot topic discovery model, the effectiveness of the hot spot discovery model based on the improved K-means algorithm is verified by experiments, and it has a high accuracy.

机译：在K-means算法是在热门话题发现最常用的聚类算法之一。然而，由于它的缺点，例如簇K值和容易陷入局部最优的数量，聚类精度不高，这直接影响热点发现的质量。本文提出了一种改进的K-means算法来实现微博文本的快速聚类。组合的微博文本的高频词和相似性进行单遍聚类，获得簇的K个和初始聚类中心，解决了问题，即K-means算法是将K值过于敏感和初始中心。通过实验的比较和分析，它弥补了K-means算法的缺点，并有效地提高了效率和聚类的准确度。它应用到的热点话题发现模型的基础上，改进的K-means算法的热点发现模型的有效性进行了实验验证，并具有较高的精度。

著录项

来源
《IEEE International Conference on High Performance Computing and Communications》|2019年|lxxi705 p. :|共6页
会议地点
作者
Qiang Gao; Jing Feng;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词
Clustering algorithms; Internet; Data models; Semantics; High frequency; Mathematical model; Data collection;

机译：聚类算法;互联网;数据模型;语义;高频;数学模型;数据收集;

相似文献

外文文献
中文文献
专利

1. Research on Hotspot Discovery in Internet Public Opinions Based on Improved K-Means [J] . Gensheng Wang Computational intelligence and neuroscience . 2013,第Null期

机译：基于改进的K均值的互联网舆论热点发现研究
2. A Hotspot Discovery Method Based on Improved FIHC Clustering Algorithm [J] . Lin Lina, Wei Dezhi Technical Gazette . 2021,第5期

机译：一种基于改进FIHC聚类算法的热点发现方法
3. Topic Analysis of Microblog About "Didi Taxi" Based on K-means Algorithm [J] . Yonghe Lu, Xin Xiong American Journal of Information Science and Technology . 2019,第3期

机译：基于K-means算法的“滴滴出租车”微博主题分析
4. Microblog Hotspot Discovery Method Based on Improved K-Means Algorithm [C] . Qiang Gao, Jing Feng IEEE International Conference on High Performance Computing and Communications;IEEE International Conference on Smart City;IEEE International Conference on Data Science and Systems . 2019

机译：基于改进的K均值算法的微博热点发现方法
5. Content-Based Earth Observation Data Discovery Methods Based on Intelligent Algorithms [D] . Cui, Kejin. 2020

机译：基于智能算法的基于内容的地球观测数据发现方法
6. Research on Hotspot Discovery in Internet Public Opinions Based on Improved K-Means [O] . Gensheng Wang 2013

机译：基于改进的K均值的互联网舆论热点发现研究
7. A Rapid and High-Precision Mountain Vertex Extraction Method Based on Hotspot Analysis Clustering and Improved Eight-Connected Extraction Algorithms for Digital Elevation Models [O] . Zhenqi Zheng, Xiongwu Xiao, Zhi-Chao Zhong, 2020

机译：一种基于热点分析聚类的快速和高精度的山顶顶点提取方法，并改进了数字高度模型的八连接提取算法

Microblog Hotspot Discovery Method Based on Improved K-Means Algorithm

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅