首页> 外文会议>IEEE International Conference on Big Data >Multi-spatial Scale Event Detection from Geo-tagged Tweet Streams via Power-law Verification
【24h】

Multi-spatial Scale Event Detection from Geo-tagged Tweet Streams via Power-law Verification

机译:通过幂律验证从地理标记的推文流中的多空间尺度事件检测

获取原文

摘要

Compared with traditional news media, social media nowadays provides a richer and more timely source of news. We are interested in multi-spatial level event detection from geo-tagged tweet streams. Specifically, in this paper we (1) examine the statistical characteristic for the time series of the number of geo-tagged tweets posted from specific regions during a short time interval, e.g., one minute; (2) verify from over thirty datasets that while almost all such time series exhibit self-similarity, those that correspond to events, especially short-term and unplanned outbursts, follow a power-law distribution; (3) demonstrate that these findings can be applied to facilitate event detection from tweet streams. We propose two algorithms—Power-law basic and Power-law advanced, where Power-law basic only checks the existence of power-law distributions in the time series from tweet streams at multi-spatial scales, without looking into the content of each tweet, and Power-law advanced integrates power-law verification with semantic analysis via word embedding. Our experiments on multiple datasets show that when combined with a Quad-tree, the seemingly naive algorithm of Power-law basic achieves comparable results with more advanced event detection methods, while the semantic analysis enhanced version, Power-law advanced, can significantly increase both the precision and the recall.
机译:与传统新闻媒体相比,现在社交媒体现在提供更丰富,更及时的新闻来源。我们对从地理标记的推文流中的多空间级事件检测感兴趣。具体地,在本文中,我们(1)检查在短时间间隔期间从特定区域发布的地理标记推文的数量的时间序列的统计特征,例如,一分钟; (2)从30多个数据集中验证,虽然几乎所有这样的时间序列都表现出自我相似性,那些对应于事件的人,特别是短期和计划外爆发,遵循权法分布; (3)证明可以应用这些发现以促进来自推文流的事件检测。我们提出了两种算法 - 幂律基本和幂律先进,其中幂律基本只检查从多空间尺度的Tweet流中的时间序列中的幂律分布的存在,而不调查每个推文的内容,幂律高级通过Word Embedding与语义分析集成了电力法验证。我们在多个数据集上的实验表明,当与四平树相结合时,看似天真的幂律基本算法达到了可比的结果,具有更先进的事件检测方法,而语义分析增强版本,电力法先进,可以显着增加两者精度和召回。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号