首页> 外文会议>2016 International Computer Symposium >Improving User's Quality of Experience in Imbalanced Dataset
【24h】

Improving User's Quality of Experience in Imbalanced Dataset

机译:在不平衡的数据集中提高用户的体验质量

获取原文
获取原文并翻译 | 示例

摘要

Currently, traditional algorithm performs not well in terms of predicting the user's complaint in imbalanced IPTV dataset. To solve this problem, we combine status data from the set-top box with data of user's complaints and select the appropriate model to predict user's quality of experience (QoE). Concretely, we firstly perform data cleaning and select suitable attributes from the original dataset. Then, we apply random under-sampling and synthetic over-sampling to the preprocessed dataset. In order to get better performance, we improves the Synthetic Minority Over-sampling Technique (SMOTE) algorithm and combine it with K-means algorithm to generate a new dataset. After these procedures, we use the Naïve Bayes (NB) model in user's complaint dataset. Through the rigorous modeling and prediction, extensive experimental results show that this integrated algorithm performs better than the Borderline-SMOTE algorithm in predicting user's complaints.
机译:当前,传统算法在不平衡IPTV数据集中预测用户投诉方面表现不佳。为解决此问题,我们将机顶盒中的状态数据与用户投诉数据结合在一起,并选择适当的模型来预测用户的体验质量(QoE)。具体而言,我们首先执行数据清理并从原始数据集中选择合适的属性。然后,我们将随机欠采样和合成过采样应用于预处理后的数据集。为了获得更好的性能,我们改进了综合少数族裔过采样技术(SMOTE)算法,并将其与K-means算法结合以生成新的数据集。完成这些步骤后,我们在用户投诉数据集中使用朴素贝叶斯(NB)模型。通过严格的建模和预测,广泛的实验结果表明,该集成算法在预测用户投诉方面要比Borderline-SMOTE算法更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号