What are we 'tweeting' about obesity? Mapping tweets with topic modeling and Geographic Information System

Debarchana (Debs) Ghosh; Rajarshi Guha

首页> 外文期刊>Cartography and geographic information science >What are we 'tweeting' about obesity? Mapping tweets with topic modeling and Geographic Information System

【24h】

What are we 'tweeting' about obesity? Mapping tweets with topic modeling and Geographic Information System

机译：我们对肥胖“发推特”是什么？使用主题建模和地理信息系统映射推文

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Public health related tweets are difficult to identify in large conversational datasets like Twitter.com. Even more challenging is the visualization and analyses of the spatial patterns encoded in tweets. This study has the following objectives: how can topic modeling be used to identify relevant public health topics such as obesity on Twitter.com? What are the common obesity related themes? What is the spatial pattern of the themes? What are the research challenges of using large conversational datasets from social networking sites? Obesity is chosen as a test theme to demonstrate the effectiveness of topic modeling using Latent Dirichlet Allocation (LDA) and spatial analysis using Geographic Information System (GIS). The dataset is constructed from tweets (originating from the United States) extracted from Twitter.com on obesity-related queries. Examples of such queries are 'food deserts', 'fast food', and 'childhood obesity'. The tweets are also georeferenced and time stamped. Three cohesive and meaningful themes such as 'childhood obesity and schools', 'obesity prevention', and 'obesity and food habits' are extracted from the LDA model. The GIS analysis of the extracted themes show distinct spatial pattern between rural and urban areas, northern and southern states, and between coasts and inland states. Further, relating the themes with ancillary datasets such as US census and locations of fast food restaurants based upon the location of the tweets in a GIS environment opened new avenues for spatial analyses and mapping. Therefore the techniques used in this study provide a possible toolset for computational social scientists in general, and health researchers in specific, to better understand health problems from large conversational datasets.

机译：在诸如Twitter.com之类的大型会话数据集中，很难识别与公共卫生相关的推文。更具挑战性的是可视化和分析推文中编码的空间模式。这项研究的目标如下：如何使用主题模型来识别相关公共卫生主题，例如Twitter.com上的肥胖症？常见的肥胖相关主题是什么？主题的空间模式是什么？使用社交网站中的大型对话数据集的研究挑战是什么？选择肥胖作为测试主题，以证明使用潜在狄利克雷分配（LDA）和使用地理信息系统（GIS）进行空间分析的主题建模的有效性。该数据集是根据与肥胖相关的查询从Twitter.com提取的推文（源自美国）构建的。这样的查询的例子是“食荒”，“快餐”和“儿童肥胖”。这些推文还经过地理定位和时间戳记。从LDA模型中提取了三个连贯且有意义的主题，例如“儿童肥胖与学校”，“肥胖预防”和“肥胖与饮食习惯”。对提取的主题进行的GIS分析显示，城乡之间，北部和南部州以及沿海和内陆州之间的空间格局截然不同。此外，基于GIS环境中推文的位置，将主题与辅助数据集（例如美国人口普查和快餐店的位置）相关联，为空间分析和制图开辟了新途径。因此，本研究中使用的技术为一般的计算社会科学家（尤其是健康研究人员）提供了一种可能的工具集，以便从大型对话数据集中更好地了解健康问题。

著录项

来源
《Cartography and geographic information science》 |2013年第2期|90-102|共13页
作者
Debarchana (Debs) Ghosh; Rajarshi Guha;
展开▼
作者单位

Department of Geography, University of Connecticut, Storrs, CT 06040, USA;

NIH Center for Advancing Translational Science,Rockville, MD 20850, USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
mapping; social media; topic models; GIS; text mining; obesity;

机译：映射社交媒体;主题模型;地理信息系统文本挖掘;肥胖;

相似文献

外文文献
中文文献
专利

1. What is your tweet worldview? Mapping the topic structure of tweets on the Wikipedia [J] . Yu Suzuki, Hiromitsu Ohara, Akiyo Nadamoto International journal of pervasive computing and communications . 2018,第1期

机译：您的推特世界观是什么？在Wikipedia上映射推文的主题结构
2. Sentiment topic sarcasm mixture model to distinguish sarcasm prevalent topics based on the sentiment bearing words in the tweets [J] . Nimala K., Jebakumar R., Saravanan M. Journal of ambient intelligence and humanized computing . 2021,第6期

机译：情绪讽刺讽刺混合模型，以基于推文中的情绪与情词区分讽刺普遍主题
3. Comparing News Articles and Tweets About COVID-19 in Brazil: Sentiment Analysis and Topic Modeling Approach [J] . Tiago de Melo, Carlos M S Figueiredo JMIR public health and surveillance. . 2021,第2期

机译：比较新闻文章和推文关于巴西Covid-19：情绪分析和主题建模方法
4. Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using Tweets and Matrix Factorization [C] . Akira Sasaki, Kazuaki Hanawa, Naoaki Okazaki, Annual meeting of the Association for Computational Linguistics;Conference of the European Chapter of the Association for Computational Linguistics . 2017

机译：您可能也同意或不同意的其他主题：使用推文和矩阵分解对主题间偏好设置建模
5. Integrating Community Volcanic Hazard Mapping, Geographic Information Systems, and Modeling to Reduce Volcanic Hazard Vulnerability. [D] . Bajo Sanchez, Jorge V. 2016

机译：集成社区火山灾害地图，地理信息系统和建模，以减少火山灾害的脆弱性。
6. What are we ‘tweeting’ about obesity? Mapping tweets with Topic Modeling and Geographic Information System [O] . Debarchana (Debs) Ghosh, Rajarshi Guha -1

机译：我们对肥胖发推特是什么？使用主题建模和地理信息系统映射推文
7. Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using Tweets and Matrix Factorization [O] . Sasaki, Akira, Hanawa, Kazuaki, Okazaki, Naoaki, 2017

机译：您可能也同意或不同意的其他主题：建模主题间使用推文和矩阵分解的首选项

What are we 'tweeting' about obesity? Mapping tweets with topic modeling and Geographic Information System

摘要

著录项

相似文献

相关主题

期刊订阅