首页> 外文会议>International Conference on Digital Information and Communication Technology and it's Applications >A novel discretization technique using Class Attribute Interval Average
【24h】

A novel discretization technique using Class Attribute Interval Average

机译:使用类属性间隔平均的新离散化技术

获取原文

摘要

Discretization algorithm is important for data mining preprocessing because it will help the user to easily understand the data, reduce the complexity of data, reduce processing time, and increase efficiency and accuracy of the data. This paper proposes the new discretization algorithm called Class Attribute Interval Average (CAIA). The algorithm uses 2D-quanta matrix table to calculate each of class individual interval's average and merge the best adjacent intervals to form the new interval. The experimental design uses four-UCI data sets (Iris, Breast Cancer, Heart Diseases, Glass) and four-classification algorithms (J48, RBF, MLP, NB). The comparisons of experimental result with the other six discretization algorithms (EW, EF, ChiMerge, IEM, CAIM, CACC) show that the proposed CAIA has the best mean rank for both of the accuracy and the number of intervals.
机译:离散化算法对于数据挖掘预处理非常重要,因为它将帮助用户轻松理解数据,减少数据的复杂性,减少处理时间并提高数据的效率和准确性。本文提出了一种新的离散化算法,称为类属性间隔平均值(CAIA)。该算法使用2D量化矩阵表来计算每个类的单个区间的平均值,并合并最佳相邻区间以形成新区间。实验设计使用四个UCI数据集(虹膜,乳腺癌,心脏病,玻璃)和四个分类算法(J48,RBF,MLP,NB)。与其他六种离散化算法(EW,EF,ChiMerge,IEM,CAIM,CACC)的实验结果比较表明,所提出的CAIA在准确性和间隔数方面均具有最佳的平均等级。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号