...
首页> 外文期刊>Procedia Computer Science >Towards an Infodemiological Algorithm for Classification of Filipino Health Tweets
【24h】

Towards an Infodemiological Algorithm for Classification of Filipino Health Tweets

机译:迈向菲律宾健康推文分类的信息流行病学算法

获取原文
           

摘要

Finding innovative ICT solutions to enhance the Philippines’ health sector is part and parcel of the Philippine eHealth Strategic Framework and Plan 2020 program . This study sees the opportunity of using collected Twitter data to create a model that processes tweets to produce a dataset that may be relevant in the field of epidemiology and infodemiology . Through the collection of relevant tweets, future studies may make use of the output of this research for various purposes, such as the improvement of epidemiological systems of the Department of Health in support of the eHealth strategy. In this study, we used the Na?ve-Bayes classification model, an efficient text classifier, to create a model that determines whether a tweet is “infodemiological” or not. From the collected 18,044 tweets, we have narrowed it down to 1,090 tweets (6.04%) that can be used in epidemiology. Using this as a dataset for training and testing, the model was able to classify 79.91% of tweets correctly. This research shows that it is indeed feasible to collect and classify enough infodemiological tweets in the Filipino language, which in turn can be used for future infodemiological studies.
机译:寻找创新的ICT解决方案以增强菲律宾的卫生部门,这是菲律宾eHealth战略框架和2020计划计划的重要组成部分。这项研究发现了使用收集到的Twitter数据创建处理推文的模型的机会,以产生可能与流行病学和信息流行病学领域相关的数据集。通过收集相关推文,未来的研究可能会出于各种目的而利用这项研究的成果,例如为了支持eHealth战略而改进卫生部的流行病学系统。在这项研究中,我们使用了Naveve-Bayes分类模型(一种有效的文本分类器)来创建一个模型,该模型确定一条推文是否为“信息流行病学”。从收集的18,044条推文中,我们将其范围缩小到1,090条(可用于流行病学)推文(6.04%)。使用此数据集进行培训和测试,该模型能够正确分类79.91%的推文。这项研究表明,用菲律宾语收集和分类足够的信息流行病推文确实是可行的,而这些推文又可用于将来的信息流行病学研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号