Conference: International Symposium on Distributed Computing and Applications for Business Engineering and Science

Analysis and Implementation of Abnormal User Data for Large Scale Communication Based on Spark

Abstract

Abnormal communication refers to anomalous user behavior in call traffic, business management, and other daily consumption. Communication operators hold large-scale user data sets, and using these data sets sensibly to guide and recommend services to businesses can bring better economic benefits. However, for large-scale user feature data sets, serial machine learning and analysis methods spend a great deal of time on feature processing, and model training carries a huge time cost. To process and train on abnormal user data more efficiently, this paper uses Spark to implement feature engineering and to analyze large-scale anomalous user data sets, highlighting the efficiency of Spark in analyzing feature data, and implements distributed training to accelerate the algorithm model. Training takes a distributed SVM as the example and compares it with a stand-alone serial SVM and the scikit-learn SVM. The experimental results show the advantages of distributed computing as well as good training results. Finally, the training results are compared with those of other distributed models, including common logistic regression and Bayesian algorithms.
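The abstract does not describe how the distributed SVM is implemented, so the following is only a minimal sketch of the general data-parallel idea, simulated on one machine in plain Python rather than with Spark. It assumes a linear SVM trained by subgradient descent on the hinge loss: each partition computes a partial gradient (the "map" step), and the partial results are summed before one shared update (the "reduce" step). All names (`make_demo_data`, `partition_gradient`, `train_svm`, `accuracy`) and the synthetic two-feature data are hypothetical and are not taken from the paper.

```python
import random


def make_demo_data(n, seed=0):
    """Synthetic, well-separated 2-D data: label +1 near (2, 2), -1 near (-2, -2).

    Purely illustrative; it stands in for real user features.
    """
    rng = random.Random(seed)
    data = []
    for i in range(n):
        y = 1 if i % 2 == 0 else -1
        x = [2.0 * y + rng.gauss(0, 0.5), 2.0 * y + rng.gauss(0, 0.5)]
        data.append((x, y))
    return data


def partition_gradient(w, b, partition):
    """'Map' step: sum of hinge-loss subgradients over one partition."""
    gw = [0.0] * len(w)
    gb = 0.0
    for x, y in partition:
        margin = y * (sum(wj * xj for wj, xj in zip(w, x)) + b)
        if margin < 1.0:  # only margin violators contribute to the subgradient
            for j, xj in enumerate(x):
                gw[j] -= y * xj
            gb -= y
    return gw, gb, len(partition)


def train_svm(partitions, dim, lam=0.01, lr=0.1, epochs=200):
    """'Reduce' step per epoch: combine partial gradients, then update once."""
    w = [0.0] * dim
    b = 0.0
    for t in range(1, epochs + 1):
        # In a real cluster each partition would be processed on a worker.
        partials = [partition_gradient(w, b, p) for p in partitions]
        n = sum(count for _, _, count in partials)
        gw = [sum(p[0][j] for p in partials) / n + lam * w[j] for j in range(dim)]
        gb = sum(p[1] for p in partials) / n
        step = lr / t ** 0.5  # decaying step size
        w = [wj - step * gj for wj, gj in zip(w, gw)]
        b -= step * gb
    return w, b


def accuracy(w, b, data):
    """Fraction of points whose predicted sign matches the label."""
    hits = 0
    for x, y in data:
        score = sum(wj * xj for wj, xj in zip(w, x)) + b
        hits += (1 if score >= 0 else -1) == y
    return hits / len(data)


if __name__ == "__main__":
    data = make_demo_data(200)
    partitions = [data[i::4] for i in range(4)]  # four simulated workers
    w, b = train_svm(partitions, dim=2)
    print("weights:", w, "bias:", b, "accuracy:", accuracy(w, b, data))
```

Because the partial gradients are summed before each update, splitting the data across more partitions should leave the learned model essentially unchanged (up to floating-point rounding); in this scheme the potential gain is wall-clock time, not a different model.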
