首页> 外文期刊>International journal of digital Earth >An ontology-based framework for extracting spatio-temporal influenza data using Twitter
【24h】

An ontology-based framework for extracting spatio-temporal influenza data using Twitter

机译:基于本体的基于本体框架,用于使用Twitter提取时空逐时流感数据

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Early detection of influenza outbreaks is one of the key priorities on a national level for preparedness and planning. This study presents the design and implementation of Fluwitter, which is a spatio-temporal web-based prototype framework for pseudo real-time detection of influenza outbreaks from Twitter. Specifically, the framework integrates PostgreSQL database server with PostGIS spatial extension, Twitter streaming client, pre-processor, tagger and similarity calculator for semantic information extraction (IE). The IE of tagged terms is supported by Natural Language Processing (NLP) techniques, DBpediaSpotlight and WordNet Similarity for Java (WS4J), while data analytics, visualization, and mapping are supported by GeoServer and other GIS Free Open Source Software (FOSS). The prototype was calibrated to maximize detection of influenza using rules developed from ontology-based semantic similarity scores. The Twitter-generated influenza cases were validated by weekly hospitalization records issued by Ohio Department of Health (ODH). The optimized rule produced a final F-measure value of 0.72 and accuracy (ACC) value of 94.4%. The validation suggested the existence of moderate correlations for the beginning of the time period Southeast region (r = 0.52), the Northwestern region (r = 0.38), and the Central region (r = 0.33) and weak correlations for the entire time period. The potential strengths and benefits of the prototype are shown through spatio-temporal assessment and visualization of influenza potential in Ohio.
机译:利用流感爆发的早期发现是国家一级的准备和规划的关键优先事项之一。本研究介绍了流液的设计和实施,这是一种用于伪实时检测流感爆发的几种时空Web的原型框架。具体而言,该框架将PostgresQL数据库服务器与Postgis Spatial扩展,Twitter流客户端,预处理器,标签和相似度计算器进行了集成,用于语义信息提取(即)。标记项的IE由自然语言处理(NLP)技术,DBPediSPotlight和Java(WS4J)的WordNet相似度支持,而GeoServer和其他GIS免费开源软件(FOSS)支持数据分析,可视化和映射。原型被校准,以最大限度地利用从基于本体的语义相似性分数开发的规则来最大限度地检测流感。通过俄亥俄州卫生部(ODH)发布的每周住院记录验证了Twitter生成的流感病例。优化规则产生了最终F测量值0.72,精度(ACC)值为94.4%。验证表明,在东南区(R = 0.52),西北部(r = 0.38)和中央区域(r = 0.33)和整个时间段的弱相关性时,验证存在适中的相关性。通过在俄亥俄州的流感潜力的时空评估和可视化显示原型的潜在优势和益处。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号