首页> 外文会议>Pacific-Asia Conference on Knowledge Discovery and Data Mining >Image Analysis Enhanced Event Detection from Geo-Tagged Tweet Streams
【24h】

Image Analysis Enhanced Event Detection from Geo-Tagged Tweet Streams

机译:图像分析增强了来自带有地理标签的推文流的事件检测

获取原文

摘要

Events detected from social media streams often include early signs of accidents, crimes or disasters. Therefore, they can be used by related parties for timely and efficient response. Although significant progress has been made on event detection from tweet streams, most existing methods have not considered the posted images in tweets, which provide richer information than the text, and potentially can be a reliable indicator of whether an event occurs or not. In this paper, we design an event detection algorithm that combines textual, statistical and image information, following an unsupervised machine learning approach. Specifically, the algorithm starts with semantic and statistical analyses to obtain a list of tweet clusters, each of which corresponds to an event candidate, and then performs image analysis to separate events from non-events-a convolutional autoencoder is trained for each cluster as an anomaly detector, where a part of the images are used as the training data and the remaining images are used as the test instances. Our experiments on multiple datasets verify that when an event occurs, the mean reconstruction errors of the training and test images are much closer, compared with the case where the candidate is a non-event cluster. Based on this finding, the algorithm rejects a candidate if the difference is larger than a threshold. Experimental results over millions of tweets demonstrate that this image analysis enhanced approach can significantly increase the precision with minimum impact on the recall.
机译:从社交媒体流中检测到的事件通常包括事故,犯罪或灾难的早期迹象。因此,相关方可以使用它们进行及时有效的响应。尽管在通过推文流进行事件检测方面已经取得了重大进展,但是大多数现有方法并未考虑推文中发布的图像,这些图像提供的信息比文本还丰富,并且可能可以可靠地指示事件是否发生。在本文中,我们设计了一种事件检测算法,该算法结合了无监督机器学习方法,将文本,统计信息和图像信息结合在一起。具体来说,该算法从语义和统计分析开始,以获得一系列tweet簇,每个簇对应于一个事件候选者,然后执行图像分析以将事件与非事件分开-卷积自动编码器针对每个簇训练为异常检测器,其中一部分图像用作训练数据,其余图像用作测试实例。我们在多个数据集上进行的实验证明,与候选者是非事件聚类的情况相比,当事件发生时,训练图像和测试图像的平均重建误差要小得多。基于此发现,如果差异大于阈值,该算法将拒绝候选对象。数百万条推文的实验结果表明,这种图像分析增强方法可以显着提高精度,同时对召回的影响最小。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号