首页> 外文期刊>International Journal of Business Intelligence and Data Mining >Data dimensionality reduction with application to improving classification performance and explaining concepts of data sets
【24h】

Data dimensionality reduction with application to improving classification performance and explaining concepts of data sets

机译:减少数据维数,以提高分类性能并解释数据集的概念

获取原文
获取原文并翻译 | 示例
       

摘要

Data dimensionality reduction is usually carried out before patterns are input to classifiers. In order to obtain good results in data mining, selecting relevant data is desirable. In many cases, irrelevant or redundant attributes are included in data sets, which interfere with knowledge discovery from data sets. In this paper, we propose a rule-extraction method based on a novel separability-correlation measure (SCM) ranking the importance of attributes. According to the attribute ranking results, the attribute subsets that lead to the best classification results are selected and used as inputs to a classifier, such as an RBF neural network in our paper. The complexity of the classifier can thus be reduced and its classification performance improved. Our method uses the classification results with reduced attribute sets to extract rules. Computer simulations show that our method leads to smaller rule sets with higher accuracies compared with other methods.
机译:数据降维通常在将模式输入到分类器之前执行。为了在数据挖掘中获得良好的结果,需要选择相关数据。在许多情况下,数据集中包含无关或冗余的属性,这会干扰从数据集中发现知识。在本文中,我们提出了一种基于新的可分离性-相关性度量(SCM)的规则提取方法,该度量对属性的重要性进行了排序。根据属性排序结果,选择导致最佳分类结果的属性子集,并将其用作分类器的输入,例如本文的RBF神经网络。从而可以降低分类器的复杂度并提高其分类性能。我们的方法使用具有简化属性集的分类结果来提取规则。计算机仿真表明,与其他方法相比,我们的方法可生成具有较高准确性的较小规则集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号