【24h】

A Rule-Based System for Monitoring of Microblogging Disease Reports

机译:基于规则的微博疾病报告监控系统

获取原文

摘要

Real-time microblogging messages are an interesting data source for the realization of early warning systems that track the outbreaks of epidemic diseases. Microblogging monitoring systems might be able to detect disease outbreaks in communities faster than the traditional public health services. The realization of such systems requires a message classification approach that can distinguish the messages which concern diseases from other unrelated messages. The existing machine learning classification approaches have some difficulties due to the lack of a longer history-based learning curve and the short length of the messages. In this paper, we present a demonstration of our rule-based approach for classification of disease reports. Our system is built based on the extraction of disease-related named entities. The type identification of the recognized named entities using the existing knowledge bases helps our system to classify a message as a disease report. We combine our approach with further text processing approaches like term frequency calculation. Our experimental results show that the presented approach is capable of classifying the disease report messages with acceptable precision and recall.
机译:实时微博消息是用于实现跟踪流行病暴发的预警系统的有趣数据源。与传统的公共卫生服务相比,微博监视系统可能能够更快地发现社区中的疾病暴发。这种系统的实现需要一种消息分类方法,该方法可以将涉及疾病的消息与其他不相关的消息区分开。由于缺乏基于历史的较长学习曲线以及消息长度较短,因此现有的机器学习分类方法存在一些困难。在本文中,我们演示了基于规则的疾病报告分类方法。我们的系统基于与疾病相关的命名实体的提取而构建。使用现有知识库对识别出的命名实体进行类型识别有助于我们的系统将消息分类为疾病报告。我们将我们的方法与进一步的文本处理方法(例如词频计算)结合在一起。我们的实验结果表明,所提出的方法能够以可接受的精度和召回率对疾病报告消息进行分类。

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号