首页> 外文会议>CIKM 10;ACM conference on information and knowledge management >You Are Where You Tweet: A Content-Based Approach to Geo-locating Twitter Users
【24h】

You Are Where You Tweet: A Content-Based Approach to Geo-locating Twitter Users

机译:您在哪里发推文:一种基于内容的Twitter用户地理定位方法

获取原文

摘要

We propose and evaluate a probabilistic framework for estimating a Twitter user's city-level location based purely on the content of the user's tweets, even in the absence of any other geospatial cues. By augmenting the massive human-powered sensing capabilities of Twitter and related microblogging services with content-derived location information, this framework can overcome the sparsity of geo-enabled features in these services and enable new location-based personalized information services, the targeting of regional advertisements, and so on. Three of the key features of the proposed approach are: (i) its reliance purely on tweet content, meaning no need for user IP information, private login information, or external knowledge bases; (ii) a classification component for automatically identifying words in tweets with a strong local geo-scope; and (iii) a lattice-based neighborhood smoothing model for refining a user's location estimate. The system estimates k possible locations for each user in descending order of confidence. On average we find that the location estimates converge quickly (needing just 100s of tweets), placing 51% of Twitter users within 100 miles of their actual location.
机译:我们提出并评估了一个概率框架,即使在没有任何其他地理空间提示的情况下,也仅基于用户推文的内容来估算Twitter用户在城市级别的位置。通过使用内容派生的位置信息增强Twitter和相关微博服务的大规模人力感知功能,此框架可以克服这些服务中启用地理功能的稀疏性,并启用新的基于位置的个性化信息服务(针对区域广告等等。提议的方法的三个关键特征是:(i)仅依靠推文内容,这意味着不需要用户IP信息,私人登录信息或外部知识库; (ii)分类组件,用于通过强大的本地地理范围自动识别推文中的单词; (iii)基于网格的邻域平滑模型,用于细化用户的位置估计。系统以置信度降序为每个用户估计k个可能的位置。平均而言,我们发现位置估算值可以快速收敛(仅需发送数百条推文),即可将51%的Twitter用户置于其实际位置的100英里范围内。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号