首页> 外文期刊>Communications, China >Parallelized Jaccard-based learning method and MapReduce implementation for mobile devices recognition from massive network data
【24h】

Parallelized Jaccard-based learning method and MapReduce implementation for mobile devices recognition from massive network data

机译:基于并行Jaccard的学习方法和MapReduce实现,可从海量网络数据中识别移动设备

获取原文
获取原文并翻译 | 示例
       

摘要

The ability of accurate and scalable mobile device recognition is critically important for mobile network operators and ISPs to understand their customers' behaviours and enhance their user experience. In this paper, we propose a novel method for mobile device model recognition by using statistical information derived from large amounts of mobile network traffic data. Specifically, we create a Jaccard-based coefficient measure method to identify a proper keyword representing each mobile device model from massive unstructured textual HTTP access logs. To handle the large amount of traffic data generated from large mobile networks, this method is designed as a set of parallel algorithms, and is implemented through the MapReduce framework which is a distributed parallel programming model with proven low-cost and high-efficiency features. Evaluations using real data sets show that our method can accurately recognise mobile client models while meeting the scalability and producer-independency requirements of large mobile network operators. Results show that a 91.5% accuracy rate is achieved for recognising mobile client models from 2 billion records, which is dramatically higher than existing solutions.
机译:准确和可扩展的移动设备识别能力对于移动网络运营商和ISP了解客户的行为并增强其用户体验至关重要。在本文中,我们提出了一种使用从大量移动网络流量数据中获得的统计信息进行移动设备模型识别的新方法。具体来说,我们创建了一个基于Jaccard的系数测量方法,以从大量的非结构化文本HTTP访问日志中识别代表每个移动设备模型的适当关键字。为了处理从大型移动网络生成的大量流量数据,此方法被设计为一组并行算法,并通过MapReduce框架实现,该框架是一种分布式并行编程模型,具有经过验证的低成本和高效率功能。使用实际数据集进行的评估表明,我们的方法可以准确识别移动客户端模型,同时满足大型移动网络运营商的可伸缩性和生产者独立性的要求。结果表明,从20亿条记录中识别移动客户端模型的准确率达到了91.5%,大大高于现有解决方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号