Conference: IEEE International Conference on Computer and Communications

A fast multi-patterns parallel matching algorithm for massive HTTP data processing


Abstract

The development of data services in wireless mobile networks has driven rapid growth in the number of network users and in the volume of user behavior data. This gives researchers a great opportunity to analyze user behavior through large-scale network traffic, which not only helps Internet Service Providers (ISPs) optimize resource allocation but can also provide users with more customized services. The analysis of user behavior is based on the extraction of user characteristics, for which multi-pattern URL matching is the foundation. However, extracting user behavior efficiently from massive network traffic data remains a major challenge. This paper focuses on the efficiency of extracting user characteristics and proposes a novel algorithm, Multi-Patterns Parallel Matching on HTTP Traffic (MPPM), which takes advantage of hash-map lookups in data searching. It can extract user behavior from massive HTTP traffic more effectively and faster than conventional methods while achieving the same accuracy. Experiments are conducted on real-world HTTP traffic data collected from ISP networks. The results show that the proposed algorithm outperforms known methods and is capable of handling massive HTTP traffic data. An implementation of the MPPM algorithm can serve as a solid base for building a high-performance user behavior analysis engine for massive HTTP data processing.
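The abstract does not describe MPPM's internals, so the following is only a minimal sketch of the general idea it names: matching URLs against many patterns through hash-map lookups, with the input split into chunks that are matched in parallel. The pattern table, the host and path-prefix pattern format, and all function names are assumptions made for illustration; none of them come from the paper.

```python
from concurrent.futures import ThreadPoolExecutor
from urllib.parse import urlsplit

# Hypothetical pattern table: (host, path prefix) -> behavior label.
PATTERNS = {
    ("video.example.com", "/watch"): "video",
    ("shop.example.com", "/cart"): "shopping",
    ("news.example.com", ""): "news",
}


def build_index(patterns):
    """Group patterns by host so each URL needs one hash lookup on its host."""
    index = {}
    for (host, prefix), label in patterns.items():
        index.setdefault(host, []).append((prefix, label))
    # Try longer prefixes first so the most specific pattern wins.
    for rules in index.values():
        rules.sort(key=lambda rule: len(rule[0]), reverse=True)
    return index


def match_url(index, url):
    """Return the label of the first matching pattern, or None."""
    # Traffic logs often omit the scheme; add one so urlsplit finds the host.
    parts = urlsplit(url if "://" in url else "http://" + url)
    for prefix, label in index.get(parts.hostname or "", ()):
        if parts.path.startswith(prefix):
            return label
    return None


def match_chunk(index, urls):
    """Match one chunk of URLs sequentially."""
    return [match_url(index, url) for url in urls]


def match_parallel(index, urls, workers=4, chunk_size=1000):
    """Split the URLs into chunks and match the chunks concurrently.

    Threads keep this sketch self-contained; for CPU-bound matching in
    CPython, a process pool or a distributed framework would be needed
    to obtain a real speedup.
    """
    chunks = [urls[i:i + chunk_size] for i in range(0, len(urls), chunk_size)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        results = pool.map(lambda chunk: match_chunk(index, chunk), chunks)
        # pool.map preserves chunk order, so labels line up with the input.
        return [label for chunk in results for label in chunk]
```

In this sketch the cost of matching one URL depends on the number of patterns registered for its host rather than on the total number of patterns, which is the usual motivation for a hash-based index over a linear scan of all patterns.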
