首页> 外文期刊>Journal of medical Internet research >Modeling Spatiotemporal Factors Associated With Sentiment on Twitter: Synthesis and Suggestions for Improving the Identification of Localized Deviations
【24h】

Modeling Spatiotemporal Factors Associated With Sentiment on Twitter: Synthesis and Suggestions for Improving the Identification of Localized Deviations

机译:与Twitter情绪相关的模拟时空因素:改善局部偏差识别的合成和建议

获取原文
获取外文期刊封面目录资料

摘要

Background Studies examining how sentiment on social media varies depending on timing and location appear to produce inconsistent results, making it hard to design systems that use sentiment to detect localized events for public health applications. Objective The aim of this study was to measure how common timing and location confounders explain variation in sentiment on Twitter. Methods Using a dataset of 16.54 million English-language tweets from 100 cities posted between July 13 and November 30, 2017, we estimated the positive and negative sentiment for each of the cities using a dictionary-based sentiment analysis and constructed models to explain the differences in sentiment using time of day, day of week, weather, city, and interaction type (conversations or broadcasting) as factors and found that all factors were independently associated with sentiment. Results In the full multivariable model of positive (Pearson r in test data 0.236; 95% CI 0.231-0.241) and negative (Pearson r in test data 0.306; 95% CI 0.301-0.310) sentiment, the city and time of day explained more of the variance than weather and day of week. Models that account for these confounders produce a different distribution and ranking of important events compared with models that do not account for these confounders. Conclusions In public health applications that aim to detect localized events by aggregating sentiment across populations of Twitter users, it is worthwhile accounting for baseline differences before looking for unexpected changes.
机译:背景技术研究了社交媒体的情感如何根据定时和位置而异,似乎产生不一致的结果,使得难以设计使用情绪的系统来检测公共卫生应用程序的本地化事件。目的本研究的目的是衡量共同的时序和位置混淆如何解释推特情绪的变化。使用DataSet的方法来自2017年7月13日至11月30日之间的100个城市的1654万英语推文,我们估计了使用基于字典的情绪分析和构造模型来解释差异的各个城市的积极和消极情绪在情绪中使用一天,一周,天气,城市和互动类型(对话或广播)作为因素的时间,发现所有因素都与情绪独立相关。结果在0.236试验数据中的Pearson R的全部多变量模型0.236; 95%CI 0.231-0.241)和否定(Pearson r在测试数据中0.306; 95%CI 0.301-0.310)情绪,城市和一天中的时间比天气和一周的差异。与这些混乱的模型相比,这些混乱的模型与不考虑这些混乱者的模型产生了不同的分布和排名。结论在公共卫生应用中,旨在通过在Twitter用户的群体中聚合情绪来检测本地化事件的旨在检测本地化事件,在寻找意外变化之前,这是基线差异的值得核算。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号