【24h】

Spatial extension of the Reality Mining Dataset

机译:现实采矿数据集的空间扩展

获取原文

摘要

Data captured from a live cellular network with the real users during their common daily routine help to understand how the users move within the network. Unlike the simulations with limited potential or expensive experimental studies, the research in user-mobility or spatio-temporal user behavior can be conducted on publicly available datasets such as the Reality Mining Dataset. These data have been for many years a source of valuable information about social interconnection between users and user-network associations. However, an important, spatial dimension is missing in this dataset. In this paper, we present a methodology for retrieving geographical locations matching the GSM cell identifiers in the Reality Mining Dataset, an approach base on querying the Google Location API. A statistical analysis of the measure of success of locations retrieval is provided. Further, we present the LAC-clustering method for detecting and removing outliers, a heuristic extension of general agglomerative hierarchical clustering. This methodology enables further, previously impossible analysis of the Reality Mining Dataset, such as studying user mobility patterns, describing spatial trajectories and mining the spatio-temporal data.
机译:在实际用户的日常日常活动中与实际用户一起从实时蜂窝网络捕获的数据有助于了解用户在网络中的移动方式。与具有有限潜力或昂贵实验研究的模拟不同,可以在诸如Reality Mining Dataset之类的公共数据集上进行用户移动性或时空用户行为的研究。多年来,这些数据一直是有关用户与用户网络关联之间的社会互连的有价值信息的来源。但是,此数据集中缺少重要的空间维度。在本文中,我们提出了一种在Reality Mining Dataset中检索与GSM小区标识符匹配的地理位置的方法,该方法基于查询Google Location API。提供了位置检索成功度量的统计分析。此外,我们提出了用于检测和消除异常值的LAC聚类方法,这是一般聚类分层聚类的一种启发式扩展。这种方法可以对现实挖掘数据集进行进一步的,以前不可能的分析,例如研究用户移动性模式,描述空间轨迹并挖掘时空数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号