首页> 美国卫生研究院文献>other >What are we ‘tweeting’ about obesity? Mapping tweets with Topic Modeling and Geographic Information System
【2h】

What are we ‘tweeting’ about obesity? Mapping tweets with Topic Modeling and Geographic Information System

机译:我们对肥胖发推特是什么?使用主题建模和地理信息系统映射推文

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Public health related tweets are difficult to identify in large conversational datasets like . Even more challenging is the visualization and analyses of the spatial patterns encoded in tweets. This study has the following objectives: How can topic modeling be used to identify relevant public health topics such as obesity on ? What are the common obesity related themes? What is the spatial pattern of the themes? What are the research challenges of using large conversational datasets from social networking sites? Obesity is chosen as a test theme to demonstrate the effectiveness of topic modeling using Latent Dirichlet Allocation (LDA) and spatial analysis using Geographic Information System (GIS). The dataset is constructed from tweets (originating from the United States) extracted from on obesity-related queries. Examples of such queries are ‘food deserts’, ‘fast food’, and ‘childhood obesity’. The tweets are also georeferenced and time stamped. Three cohesive and meaningful themes such as ‘childhood obesity and schools’, ‘obesity prevention’, and ‘obesity and food habits’ are extracted from the LDA model. The GIS analysis of the extracted themes show distinct spatial pattern between rural and urban areas, northern and southern states, and between coasts and inland states. Further, relating the themes with ancillary datasets such as US census and locations of fast food restaurants based upon the location of the tweets in a GIS environment opened new avenues for spatial analyses and mapping. Therefore the techniques used in this study provide a possible toolset for computational social scientists in general and health researchers in specific to better understand health problems from large conversational datasets.
机译:与公共卫生相关的推文很难在大型会话数据集中(例如)识别出来。更具挑战性的是可视化和分析推文中编码的空间模式。这项研究的目标如下:如何使用主题模型来识别肥胖等相关的公共健康主题?常见的肥胖相关主题是什么?主题的空间模式是什么?使用社交网站中的大型对话数据集的研究挑战是什么?选择肥胖作为测试主题,以证明使用潜在狄利克雷分配(LDA)和使用地理信息系统(GIS)进行空间分析的主题建模的有效性。该数据集由从肥胖相关查询中提取的推文(源自美国)构建而成。例如“食荒”,“快餐”和“儿童肥胖”等查询。这些推文还经过地理定位和时间戳记。从LDA模型中提取了“儿童肥胖与学校”,“预防肥胖”和“肥胖与饮食习惯”这三个具有凝聚力和意义的主题。对提取的主题进行的GIS分析显示,城乡之间,北部和南部州以及沿海和内陆州之间的空间格局截然不同。此外,基于GIS环境中推文的位置,将主题与辅助数据集(例如美国人口普查和快餐店的位置)相关联,为空间分析和制图开辟了新途径。因此,本研究中使用的技术为一般的计算社会科学家和健康研究人员提供了一种可能的工具集,以更好地从大型对话数据集中了解健康问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号