首页> 中文期刊> 《计算机工程与应用 》 >组合与概率的连续特征权衡量化方法

组合与概率的连续特征权衡量化方法

             

摘要

Quantization methods of continuous features are a necessary preprocess of data mining methods. This paper presents a trade-off discrimination method for continuous features based on minimum description length, combination and probability theories. It proposes a quantizative trade-off criterion for continuous features which reasonably balances classification errors and interval information generated by quantization. It proposes an effective dynamic programming quantization algorithm with the aim to find the best quantization result based on the trade-off criteria. The quantized data will be sent to naive bayes classifier to establish classification and prediction model. Contrastive experimental results show that the new method achieves higher mean learning accuracy than other quantization methods.%连续特征量化方法是数据挖掘方法中必要的预处理过程.呈现一种组合与概率的连续特征权衡量化方法.基于最小描述长度以及组合与概率理论,提出连续特征量化的权衡标准,能够在量化所导致的分类错误与量化区间信息之间得到合理的权衡;基于该权衡标准提出一种有效的动态规划量化算法,以找到最好的量化结果;量化后的数据采用naive贝叶斯分类器进行分类预测,与其他连续特征量化方法的对比实验结果表明,新方法得到了较高的平均学习精度.

著录项

相似文献

  • 中文文献
  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号