首页> 外文期刊>Journal of computer sciences >Online Forums Hotspot Prediction Based on Sentiment Analysis | Science Publications
【24h】

Online Forums Hotspot Prediction Based on Sentiment Analysis | Science Publications

机译:基于情感分析的在线论坛热点预测科学出版物

获取原文
获取外文期刊封面目录资料

摘要

> Problem statement: Online forums hotspot prediction is one of the significant research areas in web mining, which can help people make proper decision in daily life. Online forums, news reports and blogs, are containing large volume of public opinion information. Rapid growth of network arouses much attention on public opinion, it is important to analyse the public opinion in time and understands the trends of their opinion correctly. Approach: The sentiment analysis and text mining are important key elements for forecasting the hotspots in online forums. Most of the traditional text mining work on static data sets, while the online hotspot forecasts works on the web information dynamically and timely. The earlier work on text information processing focuses in the factual domain rather than opinion domain. Due to the semi structured or unstructured characteristics of online public opinion, we introduce traditional Vector Space Model (VSM) to express them and then use K-means to perform hotspot detection, then we use J48 classifier to perform hotspot forecast. Results: The experimentation is conducted by Rapid Miner tool and performance of proposed method J48 is compared with other method, such as Naive Bayes. The consistency between K-means and J48 is validated using three metrics. They are accuracy, sensitivity and specificity. Conclusion: The experiment helps to identify that K-means and J48 together to predict forums hotspot. The results that have been obtained using J48 present a noticeable consistency with the results achieved by K-means clustering.
机译: > 问题陈述:在线论坛热点预测是网络挖掘中的重要研究领域之一,可以帮助人们在日常生活中做出正确的决策。在线论坛,新闻报道和博客都包含大量的舆论信息。网络的迅猛发展引起了舆论的广泛关注,及时分析舆情并正确理解其趋势是很重要的。 方法:情绪分析和文本挖掘是预测在线论坛热点的重要关键元素。大多数传统的文本挖掘都是在静态数据集上进行的,而在线热点预测则是动态,及时地对Web信息进行处理。文本信息处理的早期工作集中在事实领域而不是意见领域。由于在线舆论的半结构化或非结构化特征,我们引入传统的向量空间模型(VSM)来表达它们,然后使用K-means进行热点检测,然后使用J48分类器进行热点预测。 结果:通过Rapid Miner工具进行了实验,并将所提出的方法J48的性能与其他方法(如Naive Bayes)进行了比较。使用三个度量标准来验证K均值和J48之间的一致性。它们是准确性,敏感性和特异性。 结论:该实验有助于识别K-means和J48一起预测论坛热点。使用J48获得的结果与通过K-均值聚类获得的结果具有明显的一致性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号