...
首页> 外文期刊>Sensors >Using Twitter Data to Monitor Natural Disaster Social Dynamics: A Recurrent Neural Network Approach with Word Embeddings and Kernel Density Estimation
【24h】

Using Twitter Data to Monitor Natural Disaster Social Dynamics: A Recurrent Neural Network Approach with Word Embeddings and Kernel Density Estimation

机译:使用Twitter数据监视自然灾害的社会动态:带有词嵌入和内核密度估计的递归神经网络方法

获取原文
   

获取外文期刊封面封底 >>

       

摘要

In recent years, Online Social Networks (OSNs) have received a great deal of attention for their potential use in the spatial and temporal modeling of events owing to the information that can be extracted from these platforms. Within this context, one of the most latent applications is the monitoring of natural disasters. Vital information posted by OSN users can contribute to relief efforts during and after a catastrophe. Although it is possible to retrieve data from OSNs using embedded geographic information provided by GPS systems, this feature is disabled by default in most cases. An alternative solution is to geoparse specific locations using language models based on Named Entity Recognition (NER) techniques. In this work, a sensor that uses Twitter is proposed to monitor natural disasters. The approach is intended to sense data by detecting toponyms (named places written within the text) in tweets with event-related information, e.g., a collapsed building on a specific avenue or the location at which a person was last seen. The proposed approach is carried out by transforming tokenized tweets into word embeddings: a rich linguistic and contextual vector representation of textual corpora. Pre-labeled word embeddings are employed to train a Recurrent Neural Network variant, known as a Bidirectional Long Short-Term Memory (biLSTM) network, that is capable of dealing with sequential data by analyzing information in both directions of a word (past and future entries). Moreover, a Conditional Random Field (CRF) output layer, which aims to maximize the transition from one NER tag to another, is used to increase the classification accuracy. The resulting labeled words are joined to coherently form a toponym, which is geocoded and scored by a Kernel Density Estimation function. At the end of the process, the scored data are presented graphically to depict areas in which the majority of tweets reporting topics related to a natural disaster are concentrated. A case study on Mexico’s 2017 Earthquake is presented, and the data extracted during and after the event are reported.
机译:近年来,由于可以从这些平台中提取信息,在线社交网络(OSN)在事件的空间和时间建模中的潜在用途受到了广泛关注。在这种情况下,最潜在的应用之一是自然灾害的监视。 OSN用户发布的重要信息可以在灾难期间和之后为救援工作做出贡献。尽管可以使用GPS系统提供的嵌入式地理信息从OSN检索数据,但是在大多数情况下默认情况下禁用此功能。一种替代解决方案是使用基于命名实体识别(NER)技术的语言模型对特定位置进行地理解析。在这项工作中,提出了使用Twitter的传感器来监视自然灾害。该方法旨在通过检测带有事件相关信息的推文中的地名(在文本中书写的命名地点)来感知数据,例如特定大街上的倒塌建筑物或最后一次见到某人的位置。所提出的方法是通过将标记化的推文转换为词嵌入来实现的:文本语料库的丰富语言和上下文向量表示形式。使用预先标记的单词嵌入来训练递归神经网络变体,称为双向长期短期记忆(biLSTM)网络,该变体能够通过分析单词两个方向(过去和将来)的信息来处理顺序数据条目)。此外,旨在最大化从一个NER标签到另一个NER标签的过渡的条件随机场(CRF)输出层用于提高分类精度。将生成的带标签单词合并在一起,以连贯地形成地名,并通过内核密度估计功能对其进行地理编码和评分。在此过程的最后,将对评分数据进行图形化显示,以描述与自然灾害相关的大部分推文报告主题集中的区域。报告介绍了墨西哥2017年地震的案例,并报告了事件发生期间和事件发生后提取的数据。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号