首页> 外文会议>IEEE International Conference on Network Infrastructure and Digital Content >User behavior analysis of automobile websites based on distributed computing and sequential pattern mining
【24h】

User behavior analysis of automobile websites based on distributed computing and sequential pattern mining

机译:基于分布式计算和顺序模式挖掘的汽车网站的用户行为分析

获取原文

摘要

Nowadays Internet user behavior becomes more and more complicated due to application diversity. It is important to analyze user behavior on specific websites such as e-commerce, education, and healthcare in order for personalized recommendation or targeted advertisement. In this paper, based on the large-scale traffic flow data of real network and crawling data from websites, we focus on the analysis of user browsing behavior on automobile websites. First of all, data pre-processing and statistical analysis based on MapReduce framework are designed and implemented, which is mainly to transform the flow data type to sequential dataset. By improving regular expressions matching method in distributed computing, the running time is reduced from O(N) to O(1). Secondly, we apply the sequential pattern mining algorithm AprioriAll to analyze the sequential dataset. The analysis result reflects the preference of the users when browsing automobile websites to acquire their wanted information.
机译:如今,由于应用程序分集,现在互联网用户行为变得越来越复杂。重要的是要分析特定网站上的用户行为,例如电子商务,教育和医疗保健,以便为个性化推荐或有针对性的广告。本文基于真实网络的大规模交通流量数据和来自网站的爬行数据,我们专注于汽车网站上用户浏览行为的分析。首先,设计并实现了基于MapReduce框架的数据预处理和统计分析,主要是将流数据类型转换为顺序数据集。通过改进分布式计算中的正则表达式匹配方法,从O(n)到O(1)减少运行时间。其次,我们应用顺序模式挖掘算法ApriorialL来分析顺序数据集。分析结果反映了用户在浏览汽车网站获取其所需信息时的偏好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号