...
首页> 外文期刊>ACM transactions on intelligent systems >An Unsupervised Approach to Inferring the Localness of People Using Incomplete Geotemporal Online Check-In Data
【24h】

An Unsupervised Approach to Inferring the Localness of People Using Incomplete Geotemporal Online Check-In Data

机译:使用不完整的时空在线值机数据推断人的本地性的无监督方法

获取原文
获取原文并翻译 | 示例
           

摘要

Inferring the localness of people is to classify people who are local residents in a city from people who visit the city by analyzing online check-in points that are contributed by online users. This information is critical for the urban planning, user profiling, and localized recommendation systems. Supervised learning approaches have been developed to infer the location of people in a city by assuming the availability of high-quality training datasets with complete geotemporal information. In this article, we develop an unsupervised model to accurately identify local people in a city by using the incomplete online check-in data that are publicly available. In particular, we develop an incomplete geotemporal expectation maximization (IGT-EM) scheme, which incorporates a set of hidden variables to represent the localness of people and a set of estimation parameters to represent the likelihood of venues to attract local and nonlocal people, respectively. Our solution can accurately classify local people from nonlocal nones without requiring any training data. We also implement a parallel IGT-EM algorithm by leveraging the computing power of a graphic processing unit (GPU) that consists of 2,496 cores. In the evaluation, we compare our new approach with the existing solutions through four real-world case studies using data from the New York City, Chicago, Boston, and Washington, DC. The results show that our approach can identify the local people and significantly outperform the compared baselines in estimation accuracy and execution time.
机译:推断人的本地性是通过分析由在线用户贡献的在线登机点,将访问城市的人与城市中的居民区分开。此信息对于城市规划,用户配置文件和本地化推荐系统至关重要。通过假设可获得具有完整地时信息的高质量培训数据集,已开发出有监督的学习方法来推断城市中人们的位置。在本文中,我们开发了一种无监督模型,可以使用公开提供的不完整的在线值机数据来准确识别城市中的本地人。特别是,我们开发了一个不完整的地时期望最大化(IGT-EM)方案,该方案包含一组代表人的本地性的隐藏变量和一组代表场所分别吸引本地人和非本地人的可能性的估计参数。我们的解决方案可以准确地将本地人与非本地人分类,而无需任何培训数据。我们还通过利用由2496个核组成的图形处理单元(GPU)的计算能力来实现并行IGT-EM算法。在评估中,我们通过使用来自纽约市,芝加哥,波士顿和华盛顿特区的数据进行的四个实际案例研究,将我们的新方法与现有解决方案进行了比较。结果表明,我们的方法可以识别本地人,并且在估计准确性和执行时间方面明显优于比较基准。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号