Fast Unsupervised Location Category Inference from Highly Inaccurate Mobility Data

机译：快速无监督的位置类别推断来自高度不准确的移动数据

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Understanding a mobile user's behavior, e.g., to infer if she is exercising in a gym or dining in a restaurant, is the key to a variety of applications. However, in many real-world scenarios, precisely determining user visitation is extremely challenging due to the uncertainty present in mobile location updates, where errors can be hundreds of meters or even more. We consider the location uncertainty circle determined by the reported location coordinates as the center and the associated location error as the radius. Such a location uncertainty circle is likely to cover multiple location categories, especially in densely populated areas. Worse still, in many cases, mobile users are anonymous, and we have no access to their personal information or other labeled data, which compels us to develop an unsupervised learning approach to solve this problem. Using a user-time-location category tensor, we capture the user behavior and propose a novel tensor factorization framework to accurately infer the location categories visited by mobile users. This framework leverages several key observations including the negative-unlabeled nature of the data and the intrinsic correlations between users. Also, the proposed algorithm can predict where users are even in the absence of location information. To efficiently solve the proposed framework, we propose a parameter-free and scalable optimization algorithm by effectively exploring the sparse and low-rank structure of the tensor. Our empirical studies show that the proposed algorithm is both effective and scalable: it can solve problems with millions of users and billions of location updates, and also provide superior prediction accuracies on real-world location update and check-in datasets.

机译：了解移动用户的行为，例如，如果她在健身房锻炼或在餐厅用餐时，是各种应用的关键。然而，在许多真实世界的情景中，由于移动位置更新中存在的不确定性，精确确定用户探视非常具有挑战性，错误可以是数百米甚至更多的错误。我们考虑由报告的位置确定的位置不确定性圆圈作为中心和相关的位置误差作为半径的相关位置误差。这种位置不确定性圆圈可能会涵盖多个位置类别，特别是在密集的地区。更糟糕的是，在许多情况下，移动用户都是匿名的，我们无法访问他们的个人信息或其他标签数据，这迫使我们开发一个无监督的学习方法来解决这个问题。使用用户时位置类别Tensor，我们捕获用户行为并提出一种新颖的张量因子框架，以准确地推断移动用户访问的位置类别。该框架利用了几个关键观察，包括数据的负面未标记性质和用户之间的内在相关性。此外，所提出的算法可以预测用户甚至在没有位置信息的情况下。为了有效解决所提出的框架，我们通过有效地探索了张量的稀疏和低级结构来提出无参数和可扩展的优化算法。我们的实证研究表明，该算法既有效又可缩放：它可以解决数百万用户和数十亿个位置更新的问题，也可以在实际位置更新和登记到数据集中提供卓越的预测精度。

著录项

来源
《SIAM International Conference on Data Mining》|2019年|809p|共9页
会议地点
作者
Jinfeng Yi; Qi Lei; Wesley M. Gifford; Ji Liu; Junchi Yan; Bowen Zhou;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP274-53;
关键词

相似文献

外文文献
中文文献
专利

1. Mobility-supporting data management for location-based mobile systems: the special case of single category location repositories [J] . James E. Wyse International Journal of Web Engineering and Technology . 2012,第2期

机译：基于位置的移动系统的支持移动性的数据管理：单类别位置存储库的特殊情况
2. Extracting accurate location information from a highly inaccurate traffic accident dataset: A methodology based on a string matching technique [J] . Miler Mario, Todic Filip, Sevrovic Marko Transportation research . 2016,第jula期

机译：从高度不准确的交通事故数据集中提取准确的位置信息：一种基于字符串匹配技术的方法
3. Aggregated mobility-based topology inference for fast sensor data collection [J] . Constantinos Marios Angelopoulos, Sotiris Nikoletseas Computer Communications . 2011,第13期

机译：基于移动性的聚合拓扑推断，可快速收集传感器数据
4. Fast Unsupervised Location Category Inference from Highly Inaccurate Mobility Data [C] . Jinfeng Yi, Qi Lei, Wesley M. Gifford, SIAM International Conference on Data Mining . 2019

机译：快速无监督的位置类别推断来自高度不准确的移动数据
5. Android Wi-Fi Location Awareness and Data Inference Heuristic. [D] . Wu, Leon. 2013

机译：Android Wi-Fi位置感知和数据推断启发式。
6. Inference from Inadequate and Inaccurate Data II [O] . George Backus 1970

机译：数据不足和不准确的推断II
7. Fast Unsupervised Location Category Inference from Highly Inaccurate Mobility Data [O] . Jinfeng Yi, Qi Lei, Wesley M. Gifford, 2019

机译：快速无监督的位置类别推断来自高度不准确的移动数据
8. Validation of the AMC-71Mobility Model. Appendix A:Vehicle Data. Appendix B:Location and Description of Test Sites. Appendix C:Definitions of Terrain Terms and Procedures Used to Collect Terrain Data for Validation Tests. Appendix D:Basic Terrain Data. [R] . schreiner,barton g. willoughby,william e. 1976

机译：验证amC-71mobility模型。附录a：车辆数据。附录B：测试站点的位置和描述。附录C：用于收集地形数据以进行验证测试的地形术语和程序的定义。附录D：基本地形数据。

Fast Unsupervised Location Category Inference from Highly Inaccurate Mobility Data

摘要

著录项

相似文献

相关主题

期刊订阅