首页> 外文会议>SIAM International Conference on Data Mining >Fast Unsupervised Location Category Inference from Highly Inaccurate Mobility Data
【24h】

Fast Unsupervised Location Category Inference from Highly Inaccurate Mobility Data

机译:快速无监督的位置类别推断来自高度不准确的移动数据

获取原文

摘要

Understanding a mobile user's behavior, e.g., to infer if she is exercising in a gym or dining in a restaurant, is the key to a variety of applications. However, in many real-world scenarios, precisely determining user visitation is extremely challenging due to the uncertainty present in mobile location updates, where errors can be hundreds of meters or even more. We consider the location uncertainty circle determined by the reported location coordinates as the center and the associated location error as the radius. Such a location uncertainty circle is likely to cover multiple location categories, especially in densely populated areas. Worse still, in many cases, mobile users are anonymous, and we have no access to their personal information or other labeled data, which compels us to develop an unsupervised learning approach to solve this problem. Using a user-time-location category tensor, we capture the user behavior and propose a novel tensor factorization framework to accurately infer the location categories visited by mobile users. This framework leverages several key observations including the negative-unlabeled nature of the data and the intrinsic correlations between users. Also, the proposed algorithm can predict where users are even in the absence of location information. To efficiently solve the proposed framework, we propose a parameter-free and scalable optimization algorithm by effectively exploring the sparse and low-rank structure of the tensor. Our empirical studies show that the proposed algorithm is both effective and scalable: it can solve problems with millions of users and billions of location updates, and also provide superior prediction accuracies on real-world location update and check-in datasets.
机译:了解移动用户的行为,例如,如果她在健身房锻炼或在餐厅用餐时,是各种应用的关键。然而,在许多真实世界的情景中,由于移动位置更新中存在的不确定性,精确确定用户探视非常具有挑战性,错误可以是数百米甚至更多的错误。我们考虑由报告的位置确定的位置不确定性圆圈作为中心和相关的位置误差作为半径的相关位置误差。这种位置不确定性圆圈可能会涵盖多个位置类别,特别是在密集的地区。更糟糕的是,在许多情况下,移动用户都是匿名的,我们无法访问他们的个人信息或其他标签数据,这迫使我们开发一个无监督的学习方法来解决这个问题。使用用户时位置类别Tensor,我们捕获用户行为并提出一种新颖的张量因子框架,以准确地推断移动用户访问的位置类别。该框架利用了几个关键观察,包括数据的负面未标记性质和用户之间的内在相关性。此外,所提出的算法可以预测用户甚至在没有位置信息的情况下。为了有效解决所提出的框架,我们通过有效地探索了张量的稀疏和低级结构来提出无参数和可扩展的优化算法。我们的实证研究表明,该算法既有效又可缩放:它可以解决数百万用户和数十亿个位置更新的问题,也可以在实际位置更新和登记到数据集中提供卓越的预测精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号