首页> 外文会议>IEEE international conference on data engineering >Effective location identification from microblogs
【24h】

Effective location identification from microblogs

机译:通过微博有效识别位置

获取原文

摘要

The rapid development of social networks has resulted in a proliferation of user-generated content (UGC). The UGC data, when properly analyzed, can be beneficial to many applications. For example, identifying a user's locations from microblogs is very important for effective location-based advertisement and recommendation. In this paper, we study the problem of identifying a user's locations from microblogs. This problem is rather challenging because the location information in a microblog is incomplete and we cannot get an accurate location from a local microblog. To address this challenge, we propose a global location identification method, called Glitter. Glitter combines multiple microblogs of a user and utilizes them to identify the user's locations. Glitter not only improves the quality of identifying a user's location but also supplements the location of a microblog so as to obtain an accurate location of a microblog. To facilitate location identification, GLITTER organizes points of interest (POIs) into a tree structure where leaf nodes are POIs and non-leaf nodes are segments of POIs, e.g., countries, states, cities, districts, and streets. Using the tree structure, Glitter first extracts candidate locations from each microblog of a user which correspond to some tree nodes. Then Glitter aggregates these candidate locations and identifies top-k locations of the user. Using the identified top-k user locations, Glitter refines the candidate locations and computes top-k locations of each microblog. To achieve high recall, we enable fuzzy matching between locations and microblogs. We propose an incremental algorithm to support dynamic updates of microblogs. Experimental results on real-world datasets show that our method achieves high quality and good performance, and scales very well.
机译:社交网络的迅速发展导致用户生成内容(UGC)的激增。如果对UGC数据进行了适当的分析,则对许多应用程序都可能有益。例如,从微博中识别用户的位置对于有效的基于位置的广告和推荐非常重要。在本文中,我们研究了从微博中识别用户位置的问题。这个问题颇具挑战性,因为微博中的位置信息不完整,我们无法从本地微博中获得准确的位置。为了应对这一挑战,我们提出了一种称为“闪光器”的全球位置识别方法。 Glitter组合了用户的多个微博,并利用它们来标识用户的位置。闪光不仅提高了识别用户位置的质量,而且补充了微博的位置,从而获得了微博的准确位置。为了便于位置识别,GLITTER将兴趣点(POI)组织到树结构中,其中叶节点是POI,非叶节点是POI的片段,例如国家,州,城市,地区和街道。使用树结构,Glitter首先从用户的每个微博中提取与某些树节点相对应的候选位置。然后,Glitter汇总这些候选位置并标识用户的前k个位置。使用识别出的前k个用户位置,Glitter会优化候选位置并计算每个微博的前k个位置。为了实现较高的召回率,我们启用位置和微博客之间的模糊匹配。我们提出一种增量算法来支持微博客的动态更新。在真实数据集上的实验结果表明,我们的方法实现了高质量和良好的性能,并且可以很好地扩展。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号