首页> 外文期刊>International journal of medical informatics >A methodology to enhance spatial understanding of disease outbreak events reported in news articles
【24h】

A methodology to enhance spatial understanding of disease outbreak events reported in news articles

机译:一种新闻报道中增强对疾病暴发事件的空间理解的方法

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Purpose: The emergence and re-emergence of disease outbreaks of international concern in the last several years has raised the importance of health surveillance systems that exploit the open media for their timely and precise detection of events. However, one of the key barriers faced by current event-based health surveillance systems is in identifying finegrained terms for an outbreak's geographical location. In this article, we present a method to tackle this problem by associating each reported event with the most specific spatial information available in a news report. This would be useful not only for health surveillance systems, but also for other event-centered processing systems.rnMethods: To develop an automated spatial attribute annotation system, we first created a gold standard corpus for training a machine learning model. Since the qualitative analysis on data suggested that the event class might have an impact on the spatial attribute annotation, we also developed an event classification system to incorporate event class information into the spatial attribute annotation model. To automatically recognize the spatial attribute of events, several approaches, ranging from a simple heuristic technique to a more sophisticated approach based on a state-of-the-art Conditional Random Fields (CRFs) model were explored. Different feature sets were incorporated into the model and compared. Results: The evaluations were conducted on 100 outbreak news articles. Spatial attribute recognition performance was evaluated based on three metrics; precision, recall and the harmonic mean of precision and recall (F-score). Among three strategies proposed in this article, the CRF model appeared to be the most promising for spatial attribute recognition with a best performance of 85.5% F-score (86.3% precision and 84.7% recall). Conclusion: We presented a methodology for associating each event in media outbreak reports with their spatial attribute at the finest level of granularity. Our goal has been to provide a means for enhancing the spatial understanding of outbreak-related events. Evaluation studies showed promising results for automatic spatial attribute annotation. In the future, we plan to explore more features, such as semantic correlation between words, that maybe useful for the spatial attribute annotation task.
机译:目的:过去几年中,国际关注的疾病暴发的出现和再出现提高了健康监测系统的重要性,这些系统利用开放媒体及时准确地检测事件。但是,当前基于事件的健康监视系统面临的主要障碍之一是确定爆发地理位置的细粒度术语。在本文中,我们提出了一种通过将每个报告的事件与新闻报道中可用的最具体的空间信息相关联来解决此问题的方法。这不仅对健康监视系统有用,而且对其他以事件为中心的处理系统也很有用。方法:为了开发自动化的空间属性注释系统,我们首先创建了一个用于训练机器学习模型的黄金标准语料库。由于对数据的定性分析表明事件类别可能会对空间属性注释产生影响,因此,我们还开发了一个事件分类系统,将事件类别信息纳入空间属性注释模型。为了自动识别事件的空间属性,探索了几种方法,从简单的启发式技术到基于最新条件随机场(CRF)模型的更复杂的方法。将不同的功能集合并到模型中并进行比较。结果:对100篇爆发性新闻文章进行了评估。空间属性识别性能基于以下三个指标进行了评估:精度,查全率和精度和查全率的谐波均值(F分数)。在本文提出的三种策略中,CRF模型似乎是最有前途的空间属性识别方法,其最佳性能为85.5%F分数(准确度为86.3%,召回率为84.7%)。结论:我们提出了一种方法,用于以最佳粒度将媒体爆发报告中的每个事件与其空间属性相关联。我们的目标是提供一种手段,以增强对爆发相关事件的空间理解。评估研究显示了自动空间属性注释的有希望的结果。将来,我们计划探索更多可能对空间属性注释任务有用的功能,例如单词之间的语义相关性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号