首页> 中文期刊> 《计算机工程与应用》 >一种区间型数据的离散化方法

一种区间型数据的离散化方法

     

摘要

The area of knowledge discovery and data mining are growing rapidly. A large number of methods are employed to discrete data, however, most of the existing discretion methods are applied in the case of attributes with real-value. In the practical application, the attribute value is interval number in many cases. Aiming at this problem, a new discretization algorithm applied to interval numbers is proposed. Similarity degree of interval number is used to describe the similar relation of two interval numbers. Threshold degree is defined to ensure discrete relationship between the data to implement algorithm. A new variable-associated degree is proposed through analysing action of similarity degree in the algorithm, and associated degree is used to improve algorithm. A group of data set is applied to testing the performance of the algorithm and the experiment result is compared with other discretization algorithms. The experiment result shows that the algorithm is effective.%随着数据挖掘和知识发现等技术的迅速发展,出现了很多数据离散的算法,但是,已有的离散化方法大多是针对固定点上的连续属性值的情况,实际应用中大量存在着连续区间属性值的情况.针对这一问题,提出了一种连续区间属性值离散化的新方法.通过区间数的相似度来描述对象问的相似关系,定义相似度阈度确定离散关系,来实现对区间数据的离散化,经过分析相似度在算法中的作用,提出了一种新的变量——关联度,改进了算法.采用多组数据对此算法的性能进行了检验,与其他算法做了对比试验,试验结果表明此算法是有效的.

著录项

相似文献

  • 中文文献
  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号