首页> 外文期刊>International journal of intelligent engineering informatics >A clustering-based hybrid approach for dual data reduction
【24h】

A clustering-based hybrid approach for dual data reduction

机译:基于集群的混合方法用于双重数据缩减

获取原文
获取原文并翻译 | 示例
       

摘要

The research on data reduction techniques has become important to enhance the efficacy and efficiency of data mining algorithms which may otherwise be compromised in the presence of a large number of irrelevant attributes and redundant instances. Data can be reduced by selecting either a subset of attributes or instances. Dual selection treats the problem of feature and instance selection together as a single optimisation problem. The problem of dual selection is relatively difficult as it involves an enormously large search space. In this paper, we propose a hybrid instance feature selection; HIFS-CHC method using heterogeneous recombination and cataclysmic mutation; CHC adaptive search genetic algorithm to solve the problem of dual selection. The proposed approach works in two stages. In the first stage, K-means clustering algorithm is used to reduce the search space. The second stage incorporates stratified prototype selection and CHC algorithm for data reduction. The clustering based hybrid scheme is experimentally tested on sixteen benchmark datasets and compared with the other similar data reduction algorithms with respect to the predictive accuracy, reduction rate and execution time. Experimental results show that the proposed method outperforms the other methods in terms of reduction rate and execution time while preserving the predictive accuracy almost at the same level.
机译:数据缩减技术的研究对于提高数据挖掘算法的效率和效率已经变得很重要,否则在存在大量无关属性和冗余实例的情况下,数据挖掘算法可能会受到损害。可以通过选择属性或实例的子集来减少数据。双重选择将特征和实例选择问题视为一个优化问题。双重选择问题相对困难,因为它涉及极大的搜索空间。在本文中,我们提出了一个混合实例特征选择。 HIFS-CHC方法使用异质重组和催化突变; CHC自适应搜索遗传算法解决了双重选择问题。拟议的方法分两个阶段进行。在第一阶段,使用K均值聚类算法来减少搜索空间。第二阶段结合分层原型选择和CHC算法进行数据缩减。在16个基准数据集上对基于聚类的混合方案进行了实验测试,并就预测准确性,减少率和执行时间与其他类似的数据减少算法进行了比较。实验结果表明,该方法在减少率和执行时间上均优于其他方法,同时保持了几乎相同的预测精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号