首页> 外文会议>International conference on computational linguistics >Inverse Document Density: A Smooth Measure for Location-Dependent Term Irregularities
【24h】

Inverse Document Density: A Smooth Measure for Location-Dependent Term Irregularities

机译:逆文档密度:位置依赖项不规则的平滑度量

获取原文

摘要

The advent and recent popularity of location-enabled social media services like Twitter and Foursquare has brought a dataset of immense value to researchers in several domains ranging from theory validation in computational sociology, over market analysis, to situation awareness in disaster management. Many of these applications, however, require evaluating the a priori relevance of trends, topics and terms in given regions of interest. Inspired by the well-known notion of the tf-idf weight combined with kernel density methods we present a smooth measure that utilizes large corpora of social media data to facilitate scalable, real-time and highly interactive analysis of geolocated text. We describe the implementation specifics of our measure, which are grounded in aggregation and preprocessing strategies, and we demonstrate its practical usefulness with two case studies within a sophisticated visual analysis system.
机译:Twitter和Foursquare等位置的社交媒体服务的出现和近来的人气已经为几个域的研究人员带来了巨大价值的数据集,从理论验证在计算社会学中,在市场分析中,灾害管理的意识。然而,许多这些应用程序要求评估趋势,主题和术语在鉴于兴趣区域中的先验相关性。灵感来自于TF-IDF重量的众所周知的概念,结合内核密度方法,我们提出了一种平滑的措施,利用社交媒体数据的大型语料库来促进对Geolocated文本的可扩展,实时和高度交互分析。我们描述了我们衡量标准的实施细节,它基于聚集和预处理策略,我们展示了其在复杂的视觉分析系统中的两种案例研究的实际用途。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号