首页> 外文会议>International Multi-Conference on Computing in the Global Information Technology >Smartphone-based Data Collection with Stunner Using Crowdsourcing: Lessons Learnt while Cleaning the Data
【24h】

Smartphone-based Data Collection with Stunner Using Crowdsourcing: Lessons Learnt while Cleaning the Data

机译:基于智能手机的数据收集,使用众包使用STUNNING:在清洁数据时学习的经验教训

获取原文

摘要

The increasing popularity of smartphones makes them popular tools for various big data collecting crowdsourcing campaigns, but there are still many open questions about the proper methodology of these campaigns. Beyond this, despite the growing popularity of this type of research, there are familiar difficulties and challenges in handling a wide range of uploads, maintaining the quality of the datasets, cleaning the data sets containing noisy, incorrect data, motivating the participants, and providing support for data collecting regardless of the remoteness of the device. In order to collect information about the Network Address Translation (NAT) related environment of mobile phones, we utilized a crowdsourcing approach. We collected more than 70 million data records from over 100 countries measuring the NAT characteristics of more than 1300 carriers and over 35000 WiFi environments during the three year project. Here, we introduce our data collecting architecture, some of the most prominent problems we have encountered since its launch, some of the solutions and proposed solutions to handle difficulties.
机译:智能手机的越来越越来越大,为收集众群运动的各种大数据提供流行的工具,但仍有许多关于这些运动的适当方法的开放性问题。除此之外,尽管这种类型的研究越来越受欢迎,但在处理广泛的上传方面存在熟悉的困难和挑战,维护数据集的质量,清洁包含嘈杂,不正确的数据,激励参与者的数据集,并提供无论设备的远程性如何,都支持数据收集。为了收集有关移动电话的网络地址转换(NAT)相关环境的信息,我们利用了一种众群方法。我们从超过100个国家收集了超过7000万个数据记录,这些国家测量了超过1300多个载体的NAT特征,三年的计划中超过35000个WiFi环境。在这里,我们介绍了我们的数据收集架构,自推出以来我们遇到的一些最突出的问题,一些解决方案和提出的解决方案来处理困难。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号