首页> 外文期刊>系统工程与电子技术(英文版) >Half-global discretization algorithm based on rough set theory
【24h】

Half-global discretization algorithm based on rough set theory

机译:基于粗糙集理论的半全局离散化算法

获取原文
获取原文并翻译 | 示例
       

摘要

It is being widely studied how to extract knowledge from a decision table based on rough set theory. The novel problem is how to discretize a decision table having continuous attribute. In order to obtain more reasonable discretization results, a discretization algorithm is proposed, which arranges half-global discretization based on the correlational coefficient of each continuous attribute while considering the uniqueness of rough set theory. When choosing heuristic information, stability is combined with rough entropy. In terms of stability, the possibility of classifying objects belonging to certain sub-interval of a given attribute into neighbor sub-intervals is minimized. By doing this, rational discrete intervals can be determined. Rough entropy is employed to decide the optimal cut-points while guaranteeing the consistency of the decision table after discretization. Thought of this algorithm is elaborated through Iris data and then some experiments by comparing outcomes of four discritized datasets are also given, which are calculated by the proposed algorithm and four other typical algorithms for discritization respectively. After that, classification rules are deduced and summarized through rough set based classifiers. Results show that the proposed discretization algorithm is able to generate optimal classification accuracy while minimizing the number of discrete intervals. It displays superiority especially when dealing with a decision table having a large attribute number.
机译:人们正在广泛研究如何基于粗糙集理论从决策表中提取知识。新颖的问题是如何使具有连续属性的决策表离散化。为了获得更合理的离散化结果,提出了一种离散化算法,该算法在考虑粗糙集理论的唯一性的基础上,基于每个连续属性的相关系数,进行半全局离散化。选择启发式信息时,稳定性与粗糙熵结合在一起。就稳定性而言,将属于给定属性的特定子间隔的对象分类为相邻子间隔的可能性被最小化。通过这样做,可以确定合理的离散间隔。在保证离散化后决策表的一致性的同时,使用粗糙熵来确定最佳切入点。通过虹膜数据阐述了该算法的思想,然后通过比较四个离散化数据集的结果给出了一些实验,分别由提出的算法和其他四个典型的离散化算法计算得到。之后,通过基于粗糙集的分类器推导并归纳分类规则。结果表明,提出的离散化算法能够在最小化离散区间数量的同时,产生最佳的分类精度。特别是在处理具有较大属性数的决策表时,它显示出优越性。

著录项

  • 来源
    《系统工程与电子技术(英文版)》 |2009年第2期|339-347|共9页
  • 作者

    Tan Xu; Chen Yingwu;

  • 作者单位

    School of Information Systems & Management, National Univ. of Defense Technology,Changsha 410073, P. R. China;

    School of Information Systems & Management, National Univ. of Defense Technology,Changsha 410073, P. R. China;

  • 收录信息 中国科学引文数据库(CSCD);
  • 原文格式 PDF
  • 正文语种 chi
  • 中图分类 计算技术、计算机技术;
  • 关键词

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号