首页> 外文期刊>Pattern recognition letters >Efficient feature selection filters for high-dimensional data
【24h】

Efficient feature selection filters for high-dimensional data

机译:高维数据的高效特征选择过滤器

获取原文
获取原文并翻译 | 示例
           

摘要

Feature selection is a central problem in machine learning and pattern recognition. On large datasets (in terms of dimension and/or number of instances), using search-based or wrapper techniques can be computationally prohibitive. Moreover, many filter methods based on relevance/redundancy assessment also take a prohibitively long time on high-dimensional datasets. In this paper, we propose efficient unsupervised and supervised feature selection/ranking filters for high-dimensional datasets. These methods use low-complexity relevance and redundancy criteria, applicable to supervised, semi-supervised, and unsupervised learning, being able to act as pre-processors for computationally intensive methods to focus their attention on smaller subsets of promising features. The experimental results, with up to 105 features, show the time efficiency of our methods, with lower generalization error than state-of-the-art techniques, while being dramatically simpler and faster.
机译:特征选择是机器学习和模式识别中的核心问题。在大型数据集(就维度和/或实例数而言)上,使用基于搜索的技术或包装技术可能会在计算上令人望而却步。此外,许多基于相关性/冗余度评估的过滤方法在高维数据集上的花费也非常长。在本文中,我们为高维数据集提出了有效的非监督和监督特征选择/排序过滤器。这些方法使用低复杂度相关性和冗余标准,适用于有监督,半监督和无监督学习,能够充当计算密集型方法的预处理器,从而将注意力集中在有希望的特征的较小子集上。具有多达105个功能的实验结果显示了我们方法的时间效率,与最先进的技术相比,具有更低的泛化误差,同时显着更简单,更快速。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号