首页> 外文会议>Pacific-Asia conference on knowledge discovery and data mining >Detecting and Analyzing Influenza Epidemics with Social Media in China
【24h】

Detecting and Analyzing Influenza Epidemics with Social Media in China

机译:利用社交媒体检测和分析流行性感冒

获取原文

摘要

In recent years, social media has become important and omnipresent for social network and information sharing. Researchers and scientists have begun to mine social media data to predict varieties of social, economic, health and entertainment related real-world phenomena. In this paper, we exhibit how social media data can be used to detect and analyze real-world phenomena with several data mining techniques. Specifically, we use posts from TencentWeibo to detect influenza and analyze influenza trends. We build a support vector machine (SVM) based classifier to classify influenza posts. In addition, we use association rule mining to extract strongly associated features as additional features of posts to overcome the limitation of 140 words for posts. We also use sentimental analysis to classify the reposts without feature and uncommented reposts. The experimental results show that by combining those techniques, we can improve the precision and recall by at least ten percent. Finally, we analyze the spatial and temporal patterns for positive influenza posts and tell when and where influenza epidemic is more likely to occur.
机译:近年来,社交媒体已成为社交网络和信息共享的重要和无所不在的地方。研究人员和科学家已开始挖掘社交媒体数据,以预测与社会,经济,健康和娱乐相关的现实世界现象的种类。在本文中,我们展示了如何使用几种数据挖掘技术将社交媒体数据用于检测和分析现实世界中的现象。具体来说,我们使用来自腾讯微博的帖子来检测流感并分析流感趋势。我们构建了一个基于支持向量机(SVM)的分类器,对流感帖子进行分类。此外,我们使用关联规则挖掘来提取强关联的特征作为帖子的其他特征,从而克服了帖子中140个单词的局限性。我们还使用情感分析对无功能和无注释的转发进行分类。实验结果表明,通过结合使用这些技术,我们可以将精度和召回率提高至少10%。最后,我们分析了流感阳性职位的时空格局,并指出了何时和何地更可能发生流感流行。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号