首页> 外文期刊>EPJ Data Science >Travelers or locals? Identifying meaningful sub-populations from human movement data in the absence of ground truth
【24h】

Travelers or locals? Identifying meaningful sub-populations from human movement data in the absence of ground truth

机译:旅行者或当地人?在没有地面真理的情况下,从人类运动数据中识别有意义的子人群

获取原文
           

摘要

As users of mobile devices make phone calls, browse the web, or use an app, large volumes of data are routinely generated that are a potentially useful source for investigating human behavior in space. However, as such data are usually collected only as a by-product, they often lack stringent experimental design and ground truth, which makes interpretation and derivation of valid behavioral conclusions challenging. Here, we propose an unsupervised, data-driven approach to identify different user types based on high-resolution human movement data collected from a smartphone navigation app, in the absence of ground truth. We capture spatio-temporal footprints of users, characterized by meaningful summary statistics, which are then used in an unsupervised step to identify user types. Based on an extensive dataset of users of the mobile navigation app Sygic in Australia, we show how the proposed methodology allows to identify two distinct groups of users: ‘travelers’, visiting different areas with distinct, salient characteristics, and ‘locals’, covering shorter distances and revisiting many of their locations. We verify our approach by relating user types to space use: we find that travelers and locals prefer to visit distinct, different locations in the Australian cities Sydney and Melbourne, as suggested independently by other studies. Although we use high-resolution GPS data, the proposed methodology is potentially transferable to low-resolution movement data (e.g. Call Detail Records), since we rely only on summary statistics.
机译:随着移动设备的用户,浏览网络或使用应用程序,通常产生大量的数据,这是调查空间中的人类行为的潜在有用的来源。然而,由于这些数据通常仅作为副产品收集,因此它们通常缺乏严格的实验设计和地面真理,这使得有效行为结论的解释和推导出现。在这里,我们提出了一种无监督的数据驱动方法来基于从智能手机导航应用程序收集的高分辨率人体运动数据来识别不同的用户类型,在没有地面真理的情况下。我们捕获用户的时空足迹,其特征在于有意义的汇总统计,然后在无监督的步骤中用于识别用户类型。基于澳大利亚移动导航应用程序Sygic的广泛数据集,我们展示了所提出的方法如何识别两个不同的用户组:“旅行者”,访问具有不同,突出的特征和“当地人”的不同领域,覆盖距离较短,重新审视其许多地点。我们通过将用户类型与空间使用相关来验证我们的方法:我们发现旅行者和当地人更喜欢在澳大利亚城市悉尼和墨尔本的独特不同地点,如其他研究所建议的。虽然我们使用高分辨率GPS数据,但是所提出的方法可能可转换为低分辨率移动数据(例如呼叫详细记录),因为我们依赖于汇总统计信息。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号