...
首页> 外文期刊>Decision support systems >Using text mining and sentiment analysis for online forums hotspot detection and forecast
【24h】

Using text mining and sentiment analysis for online forums hotspot detection and forecast

机译:使用文本挖掘和情感分析进行在线论坛热点检测和预测

获取原文
获取原文并翻译 | 示例

摘要

Text sentiment analysis, also referred to as emotional polarity computation, has become a flourishing frontier in the text mining community. This paper studies online forums hotspot detection and forecast using sentiment analysis and text mining approaches. First, we create an algorithm to automatically analyze the emotional polarity of a text and to obtain a value for each piece of text. Second, this algorithm is combined with K-means clustering and support vector machine (SVM) to develop unsupervised text mining approach. We use the proposed text mining approach to group the forums into various clusters, with the center of each representing a hotspot forum within the current time span. The data sets used in our empirical studies are acquired and formatted from Sina sports forums, which spans a range of 31 different topic forums and 220,053 posts. Experimental results demonstrate that SVM forecasting achieves highly consistent results with K-means clustering. The top 10 hotspot forums listed by SVM forecasting resembles 80% of K-means clustering results. Both SVM and K-means achieve the same results for the top 4 hotspot forums of the year.
机译:文本情感分析(也称为情感极性计算)已成为文本挖掘社区中蓬勃发展的前沿。本文使用情感分析和文本挖掘方法研究在线论坛的热点检测和预测。首先,我们创建一种算法来自动分析文本的情感极性并获取每个文本的值。其次,将该算法与K-means聚类和支持向量机(SVM)相结合,以开发无监督的文本挖掘方法。我们使用建议的文本挖掘方法将论坛分为不同的集群,每个集群的中心代表当前时间范围内的热点论坛。我们的实证研究中使用的数据集是从新浪体育论坛获取并格式化的,该论坛涵盖了31个不同的主题论坛和220,053个帖子。实验结果表明,通过K-means聚类,SVM预测获得了高度一致的结果。 SVM预测列出的前10个热点论坛与K均值聚类结果的80%相似。 SVM和K-means在本年度的前4个热点论坛上均取得了相同的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号