...
首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >Automatic recommendation of classification algorithms based on data set characteristics
【24h】

Automatic recommendation of classification algorithms based on data set characteristics

机译:根据数据集特征自动推荐分类算法

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Choosing appropriate classification algorithms for a given data set is very important and useful in practice but also is full of challenges. In this paper, a method of recommending classification algorithms is proposed. Firstly the feature vectors of data sets are extracted using a novel method and the performance of classification algorithms on the data sets is evaluated. Then the feature vector of a new data set is extracted, and its k nearest data sets are identified. Afterwards, the classification algorithms of the nearest data sets are recommended to the new data set. The proposed data set feature extraction method uses structural and statistical information to characterize data sets, which is quite different from the existing methods. To evaluate the performance of the proposed classification algorithm recommendation method and the data set feature extraction method, extensive experiments with the 17 different types of classification algorithms, the three different types of data set characterization methods and all possible numbers of the nearest data sets are conducted upon the 84 publicly available UCI data sets. The results indicate that the proposed method is effective and can be used in practice.
机译:为给定的数据集选择适当的分类算法在实践中非常重要和有用,但也充满了挑战。本文提出了一种推荐分类算法的方法。首先使用一种新颖的方法提取数据集的特征向量,并评估分类算法在数据集上的性能。然后提取新数据集的特征向量,并确定其k个最近的数据集。然后,将最接近的数据集的分类算法推荐给新数据集。所提出的数据集特征提取方法使用结构和统计信息来表征数据集,这与现有方法大不相同。为了评估所提出的分类算法推荐方法和数据集特征提取方法的性能,使用17种不同类型的分类算法,三种不同类型的数据集表征方法以及所有可能的最近数据集进行了广泛的实验。根据84个公开可用的UCI数据集。结果表明,该方法是有效的,可以在实践中使用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号