首页> 外文期刊>JMIR Medical Informatics >Data Analysis and Visualization of Newspaper Articles on Thirdhand Smoke: A Topic Modeling Approach
【24h】

Data Analysis and Visualization of Newspaper Articles on Thirdhand Smoke: A Topic Modeling Approach

机译:二手烟报纸文章的数据分析和可视化:主题建模方法

获取原文
       

摘要

Background Thirdhand smoke has been a growing topic for years in China. Thirdhand smoke (THS) consists of residual tobacco smoke pollutants that remain on surfaces and in dust. These pollutants are re-emitted as a gas or react with oxidants and other compounds in the environment to yield secondary pollutants. Objective Collecting media reports on THS from major media outlets and analyzing this subject using topic modeling can facilitate a better understanding of the role that the media plays in communicating this health issue to the public. Methods The data were retrieved from the Wiser and Factiva news databases. A preliminary investigation focused on articles dated between January 1, 2013, and December 31, 2017. Use of Latent Dirichlet Allocation yielded the top 10 topics about THS. The use of the modified LDAvis tool enabled an overall view of the topic model, which visualizes different topics as circles. Multidimensional scaling was used to represent the intertopic distances on a two-dimensional plane. Results We found 745 articles dated between January 1, 2013, and December 31, 2017. The United States ranked first in terms of publications (152 articles on THS from 2013-2017). We found 279 news reports about THS from the Chinese media over the same period and 363 news reports from the United States. Given our analysis of the percentage of news related to THS in China, Topic 1 (Cancer) was the most popular among the topics and was mentioned in 31.9% of all news stories. Topic 2 (Control of quitting smoking) was related to roughly 15% of news items on THS. Conclusions Data analysis and the visualization of news articles can generate useful information. Our study shows that topic modeling can offer insights into understanding news reports related to THS. This analysis of media trends indicated that related diseases, air and particulate matter (PMsub2.5/sub), and control and restrictions are the major concerns of the Chinese media reporting on THS. The Chinese press still needs to consider fuller reports on THS based on scientific evidence and with less focus on sensational headlines. We recommend that additional studies be conducted related to sentiment analysis of news data to verify and measure the influence of THS-related topics.
机译:背景技术多年来,二手烟一直是中国日益增长的话题。二手烟(THS)由残留在表面和灰尘中的残留烟草烟雾污染物组成。这些污染物会重新排放为气体,或者与环境中的氧化剂和其他化合物发生反应,从而产生二次污染物。客观地从主要媒体收集关于THS的媒体报道,并使用主题模型分析该主题,可以帮助人们更好地理解媒体在向公众传播此健康问题方面的作用。方法从Wiser和Factiva新闻数据库中检索数据。初步调查的重点是日期在2013年1月1日至2017年12月31日期间的文章。使用潜在狄利克雷分配方法得出了THS的前10个主题。修改后的LDAvis工具的使用启用了主题模型的整体视图,该视图将不同的主题可视化为圆圈。多维标度用于表示二维平面上的主题间距。结果我们发现有745篇文章的日期为2013年1月1日至2017年12月31日。美国在出版物方面排名第一(2013-2017年有152篇关于THS的文章)。我们发现同期有279篇来自中国媒体的关于THS的新闻报道和363篇来自美国的新闻报道。根据我们对与THS相关的新闻在中国的百分比的分析,主题1(癌症)在主题中最受欢迎,在所有新闻故事中占31.9%。主题2(控制吸烟)与THS中大约15%的新闻有关。结论数据分析和新闻报道的可视化可以产生有用的信息。我们的研究表明,主题建模可以提供洞察力,以了解与THS相关的新闻报道。对媒体趋势的分析表明,相关疾病,空气和颗粒物(PM 2.5 )以及控制和限制是中国媒体报道THS的主要问题。中国新闻界仍需要根据科学证据并考虑较少的耸人听闻的头条新闻来考虑更全面的THS报道。我们建议对新闻数据的情感分析进行其他研究,以验证和衡量与THS相关主题的影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号