首页> 外文会议>International conference on computational linguistics >Inverse Document Density: A Smooth Measure for Location-Dependent Term Irregularities
【24h】

Inverse Document Density: A Smooth Measure for Location-Dependent Term Irregularities

机译:反向文档密度:位置相关项不规则性的平滑度量

获取原文

摘要

The advent and recent popularity of location-enabled social media services like Twitter and Foursquare has brought a dataset of immense value to researchers in several domains ranging from theory validation in computational sociology, over market analysis, to situation awareness in disaster management. Many of these applications, however, require evaluating the a priori relevance of trends, topics and terms in given regions of interest. Inspired by the well-known notion of the tf-idf weight combined with kernel density methods we present a smooth measure that utilizes large corpora of social media data to facilitate scalable, real-time and highly interactive analysis of geolocated text. We describe the implementation specifics of our measure, which are grounded in aggregation and preprocessing strategies, and we demonstrate its practical usefulness with two case studies within a sophisticated visual analysis system.
机译:诸如Twitter和Foursquare之类的基于位置的社交媒体服务的出现和最近的流行,为从计算社会学的理论验证,市场分析到灾难管理的态势感知等多个领域的研究人员带来了巨大的价值。然而,这些应用中的许多都需要评估给定感兴趣区域中趋势,主题和术语的先验相关性。受著名的tf-idf权重概念与内核密度方法相结合的启发,我们提出了一种平滑的测量方法,该方法利用大量的社交媒体数据集来促进对地理位置文本的可伸缩性,实时性和高度交互性分析。我们描述了基于聚合和预处理策略的措施的具体实施方式,并通过在复杂的视觉分析系统中进行的两个案例研究证明了其措施的实用性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号