首页> 外文会议>International Neural Network Society Conference on Big Data >Novel Automatic Filter-Class Feature Selection for Machine Learning Regression
【24h】

Novel Automatic Filter-Class Feature Selection for Machine Learning Regression

机译:机器学习回归的新型自动滤波器类功能选择

获取原文

摘要

With the increased focus on application of Big Data in all sectors of society, the performance of machine learning becomes essential. Efficient machine learning depends on efficient feature selection algorithms. Filter feature selection algorithms are model-free and therefore very fast, but require a threshold to function. We have created a novel meta-filter automatic feature selection, Ranked Distinct Elitism Selection Filter (RDESF) which is fully automatic and is composed of five common filters and a distinct selection process. To test the performance and speed of RDESF it will be benchmarked against 4 other common automatic feature selection algorithms: Backward selection, forward selection, NLPCA and PCA as well as using no algorithms at all. The benchmarking will be performed through two experiments with two different data sets that are both time-series regression-based problems. The prediction will be performed by a Multilayer Perceptron (MLP). Our results show that RDESF is a strong competitor and allows for a fully automatic feature selection system using filters. RDESF was only outperformed by forward selection, which was expected as it is a wrapper which includes the prediction model in the feature selection process. PCA is often used in machine learning litterature and can be considered the default feature selection method. RDESF outperformed PCA in both experiments in both prediction error and computational speed. RDESF is a new step into filter-based automatic feature selection algorithms that can be used for many different applications.
机译:随着焦点对社会各界大数据的应用,机器学习的性能变得至关重要。高效的机器学习取决于有效的特征选择算法。过滤器功能选择算法是无模型的,因此非常快,但需要阈值来运行。我们创建了一种新颖的元过滤器自动特征选择,排名不同的精油选择滤波器(RDESF),该滤波器(RDESF)完全自动,由五个常见的滤波器和不同的选择过程组成。为了测试RDESF的性能和速度,它将与4个其他常见的自动特征选择算法进行基准测试:向后选择,转发选择,NLPCA和PCA以及根本不使用算法。基准将通过两个实验进行,其中两个不同的数据集是时间序列回归的问题。预测将由多层的感知(MLP)执行。我们的结果表明,RDESF是一个强大的竞争对手,允许使用过滤器的全自动特征选择系统。 RDESF仅通过前向选择优先表现,这是预期的,因为它是一种包装器,它包括特征选择过程中的预测模型。 PCA常用于机器学习辅用工具,可被视为默认的特征选择方法。在预测误差和计算速度的两种实验中,RDESF优于PCA。 RDESF是进入基于过滤器的自动特征选择算法的新步骤,可用于许多不同的应用程序。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号