首页> 外文会议>International Work-Conference on Artificial Neural Networks >Attention-Based Recurrent Neural Networks (RNNs) for Short Text Classification: An Application in Public Health Monitoring
【24h】

Attention-Based Recurrent Neural Networks (RNNs) for Short Text Classification: An Application in Public Health Monitoring

机译:基于注意力的递归神经网络(RNN)用于短文本分类:在公共卫生监测中的应用

获取原文

摘要

In this paper, we propose an attention-based approach to short text classification, which we have created for the practical application of Twitter mining for public health monitoring. Our goal is to automatically filter Tweets which are relevant to the syndrome of asthma/difficulty breathing. We describe a bi-directional Recurrent Neural Network architecture with an attention layer (termed ABRNN) which allows the network to weigh words in a Tweet differently based on their perceived importance. We further distinguish between two variants of the ABRNN based on the Long Short Term Memory and Gated Recurrent Unit architectures respectively, termed the ABLSTM and ABGRU. We apply the ABLSTM and ABGRU, along with popular deep learning text classification models, to a Tweet relevance classification problem and compare their performances. We find that the ABLSTM outperforms the other models, achieving an accuracy of 0.906 and an F1-score of 0.710. The attention vectors computed as a by-product of our models were also found to be meaningful representations of the input Tweets. As such, the described models have the added utility of computing document embeddings which could be used for other tasks besides classification. To further validate the approach, we demonstrate the ABLSTM's performance in the real world application of public health surveillance and compare the results with real-world syndromic surveillance data provided by Public Health England (PHE). A strong positive correlation was observed between the ABLSTM surveillance signal and the real-world asthma/difficulty breathing syndromic surveillance data. The ABLSTM is a useful tool for the task of public health surveillance.
机译:在本文中,我们提出了一种基于注意力的短文本分类方法,该方法是为Twitter挖掘在公共卫生监控中的实际应用而创建的。我们的目标是自动过滤与哮喘/呼吸困难综合征相关的推文。我们描述了一种具有注意力层(称为ABRNN)的双向递归神经网络体系结构,该体系可使网络根据其感知的重要性以不同的方式衡量推文中的单词。我们进一步根据长期短期记忆和门控循环单元架构分别区分了ABRNN的两个变体,分别称为ABLSTM和ABGRU。我们将ABLSTM和ABGRU以及流行的深度学习文本分类模型应用于Tweet相关性分类问题,并比较它们的性能。我们发现,ABLSTM的性能优于其他模型,其精度为0.906,F1得分为0.710。还发现,作为我们模型的副产品计算出的注意力向量是输入推文的有意义的表示。这样,所描述的模型具有计算文档嵌入的附加效用,该文档嵌入可用于分类以外的其他任务。为了进一步验证该方法,我们展示了ABLSTM在现实世界中公共卫生监视应用中的性能,并将结果与​​英国公共卫生(PHE)提供的真实世界的症状监视数据进行了比较。在ABLSTM监测信号与现实世界哮喘/呼吸困难综合症监测数据之间观察到强烈的正相关性。 ABLSTM是用于公共卫生监视任务的有用工具。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号