【24h】

Characterizing Infrastructure Damage After Earthquake: A Split-Query Based IR Approach

机译:表征地震后基础设施的损坏:基于拆分查询的IR方法

获取原文

摘要

Retrieving relevant information from social media based on specific requirements has become a focus area for researchers. In this paper, we propose a framework for online retrieval of tweets providing information about possible infrastructure damages, caused due to earthquakes and use the same to determine a damage score for the possibly affected locations. Identifying such tweets would not only provide a holistic view of the affected areas but would also help in taking necessary relief actions. Existing works on this topic fail to effectively capture the semantic variation in the tweets, possibly due to poor content quality, thereby providing scopes for further improvement in the mechanisms involved. Our proposed technique relies on a novel split-query based mechanism along with a pseudo-relevance feedback approach to identify the relevant tweets. The pseudo-relevance feedback approach expands on an initial set of seed tweets obtained using a semi-automatic query generation mechanism that couples topic based clustering with human annotation. Empirical validation of our proposed method on a manually annotated ground truth data reveals a considerable improvement in precision, recall and mean average precision over several baseline methods.
机译:根据特定要求从社交媒体检索相关信息已成为研究人员的重点领域。在本文中,我们提出了一种在线检索推文的框架,以提供有关地震可能造成的基础设施损坏的信息,并使用该框架来确定可能受影响的位置的损坏评分。识别此类推文不仅可以全面了解受灾地区,还有助于采取必要的救济行动。有关此主题的现有作品可能无法有效地捕获推文中的语义变化,这可能是由于内容质量差所致,从而为进一步改进所涉及的机制提供了空间。我们提出的技术基于新颖的基于拆分查询的机制以及伪相关反馈方法来识别相关推文。伪相关性反馈方法扩展了使用半自动查询生成机制获得的种子推文的初始集合,该机制将基于主题的聚类与人类注释结合在一起。通过人工注释的地面真实数据对我们提出的方法进行的经验验证表明,与几种基线方法相比,精度,召回率和平均平均精度有了显着提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号