首页> 外文会议>IEEE international conference on data engineering >Region sampling and estimation of geosocial data with dynamic range calibration
【24h】

Region sampling and estimation of geosocial data with dynamic range calibration

机译:动态范围校准的地域社会数据区域采样和估计

获取原文

摘要

Location based social networks (LBSNs) are becoming increasingly popular with the fast deployment of broadband mobile networks and the growing prevalence of versatile mobile devices. This success has attracted great interest in studying and measuring the characteristics of LBSNs, such as Facebook Places, Yelp, and Google+ Local. However, it is often prohibitive, and sometimes too costly, to obtain a detailed and complete snapshot of a LBSN due to its usually massive scale. In this work, taking Foursquare as an example, we focus on sampling and estimating restricted geographic regions in LBSNs, such as a city or a country. By exploiting the application programming interfaces (APIs) provided by Foursquare for geographic search, we first introduce how to obtain the “ground truth”, namely, a complete set of all venues (i.e., places) in a specified region. Then, we propose random region sampling algorithms that allow us to draw representative samples of venues, and design unbiased estimators of regional characteristics of venues. We validate the efficiency of our sampling algorithms on Foursquare using complete datasets obtained from 12 regions, such as Switzerland, New York City and Los Angeles. Our results are applicable to perform sampling and estimation in all GeoDatabases, such as Facebook Places, Yelp, and Google+ Local, which have similar venue search APIs as Foursquare. These location service providers can also benefit from our results to enable efficient online statistic estimation.
机译:随着宽带移动网络的快速部署以及多功能移动设备的普及,基于位置的社交网络(LBSN)变得越来越流行。这项成功吸引了人们对研究和衡量LBSN(例如Facebook Places,Yelp和Google+ Local)的特征的极大兴趣。但是,由于LBSN通常规模庞大,因此获得详细而完整的LBSN快照通常是令人望而却步的,有时甚至代价很高。在本文中,以Foursquare为例,我们专注于采样和估计LBSN中受限的地理区域,例如城市或国家。通过利用Foursquare提供的应用程序编程接口(API)进行地理搜索,我们首先介绍如何获取“地面真相”,即指定区域中所有场所(即场所)的完整集合。然后,我们提出了随机区域采样算法,该算法允许我们绘制场所的代表性样本,并设计场所区域特征的无偏估计量。我们使用从12个地区(例如瑞士,纽约市和洛杉矶)获得的完整数据集验证了Foursquare上采样算法的效率。我们的结果适用于在所有GeoDatabase中执行采样和估计,例如Facebook Places,Yelp和Google+ Local,它们具有与Foursquare类似的场所搜索API。这些定位服务提供商也可以从我们的结果中受益,以实现有效的在线统计估算。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号