...
首页> 外文期刊>IEEE Transactions on Knowledge and Data Engineering >Location Inference for Non-Geotagged Tweets in User Timelines
【24h】

Location Inference for Non-Geotagged Tweets in User Timelines

机译:在用户时间表中的非地理标记推文的位置推断

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Social media like Twitter have become globally popular in the past decade. Thanks to the high penetration of smartphones, social media users are increasingly going mobile. This trend has contributed to foster various location based services deployed on social media, the success of which heavily depends on the availability and accuracy of users' location information. However, only a very small fraction of tweets in Twitter are geo-tagged. Therefore, it is necessary to infer locations for tweets in order to attain the purpose of those location based services. In this paper, we tackle this problem by scrutinizing Twitter user timelines in a novel fashion. First of all, we split each user's tweet timeline temporally into a number of clusters, each tending to imply a distinct location. Subsequently, we adapt two machine learning models to our setting and design classifiers that classify each tweet cluster into one of the pre-defined location classes at the city level. The Bayes based model focuses on the information gain of words with location implications in the user-generated contents. The convolutional LSTM model treats user-generated contents and their associated locations as sequences and employs bidirectional LSTM and convolution operation to make location inferences. The two models are evaluated on a large set of real Twitter data. The experimental results suggest that our models are effective at inferring locations for non-geotagged tweets and the models outperform the state-of-the-art and alternative approaches significantly in terms of inference accuracy.
机译:像Twitter这样的社交媒体在过去十年中已经全球流行。由于智能手机的高渗透,社交媒体用户越来越多地移动。这一趋势有助于培养部署社交媒体上的各种位置的服务,其成功大量取决于用户位置信息的可用性和准确性。但是,在Twitter中只有一小部分推文是地理标记的。因此,需要推断推文的位置,以便获得基于位置的服务的目的。在本文中,我们通过以小说方式仔细审查Twitter用户时间表来解决这个问题。首先,我们将每个用户的推文时间线拆分为多个群集,每个趋势都暗示了一个不同的位置。随后,我们将两台机器学习模型调整到我们的设置和设计分类器,将每个推文集群分类为城市级别的预定义位置类之一。基于贝叶斯的模型侧重于具有位置含义的单词的信息增益,在用户生成的内容中。卷积LSTM模型将用户生成的内容及其关联的位置视为序列,并采用双向LSTM和卷积操作以使位置推断。这两种模型在大量的真实推特数据上进行评估。实验结果表明,我们的模型在推断出非地理推送的推断的位置,并且模型在推理准确性方面显着优于最先进的方法和替代方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号