首页> 外文期刊>Expert Systems with Application >Adaptive pairing of classifier and imputation methods based on the characteristics of missing values in data sets
【24h】

Adaptive pairing of classifier and imputation methods based on the characteristics of missing values in data sets

机译:基于数据集中缺失值特征的分类器和归类方法的自适应配对

获取原文
获取原文并翻译 | 示例
           

摘要

Classifiers and imputation methods have played crucial parts in the field of big data analytics. Especially, when using data sets characterized by horizontal scattering, vertical scattering, level of spread, compound metric, imbalance ratio and missing ratio, how to combine those classifiers and imputation methods will lead to significantly different performance. Therefore, it is essential that the characteristics of data sets must be identified in advance to facilitate selection of the optimal combination of imputation methods and classifiers. However, this is a very costly process. The purpose of this paper is to propose a novel method of automatic, adaptive selection of the optimal combination of classifier and imputation method on the basis of features of a given data set. The proposed method turned out to successfully demonstrate the superiority in performance evaluations with multiple data sets. The decision makers in big data analytics could greatly benefit from the proposed method when it comes to dealing with data set in which the distribution of missing data varies in real time. (C) 2015 Elsevier Ltd. All rights reserved.
机译:分类器和插补方法在大数据分析领域起着至关重要的作用。特别是,当使用以水平散射,垂直散射,扩展程度,复合度量,不平衡率和缺失率为特征的数据集时,如何结合使用这些分类器和插补方法将导致明显不同的性能。因此,至关重要的是必须事先确定数据集的特征,以利于选择插补方法和分类器的最佳组合。但是,这是一个非常昂贵的过程。本文的目的是根据给定数据集的特征,提出一种自动,自适应地选择分类器和归类方法的最佳组合的新方法。事实证明,所提出的方法成功地证明了在具有多个数据集的性能评估中的优越性。大数据分析中的决策者可以从所提出的方法中受益,该方法涉及处理丢失数据的分布实时变化的数据集。 (C)2015 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号