Efficient feature selection filters for high-dimensional data

Artur J. Ferreira; Mario A.T. Figueiredo

首页> 外文期刊>Pattern recognition letters >Efficient feature selection filters for high-dimensional data

【24h】

Efficient feature selection filters for high-dimensional data

机译：高维数据的高效特征选择过滤器

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Feature selection is a central problem in machine learning and pattern recognition. On large datasets (in terms of dimension and/or number of instances), using search-based or wrapper techniques can be computationally prohibitive. Moreover, many filter methods based on relevance/redundancy assessment also take a prohibitively long time on high-dimensional datasets. In this paper, we propose efficient unsupervised and supervised feature selection/ranking filters for high-dimensional datasets. These methods use low-complexity relevance and redundancy criteria, applicable to supervised, semi-supervised, and unsupervised learning, being able to act as pre-processors for computationally intensive methods to focus their attention on smaller subsets of promising features. The experimental results, with up to 105 features, show the time efficiency of our methods, with lower generalization error than state-of-the-art techniques, while being dramatically simpler and faster.

机译：特征选择是机器学习和模式识别中的核心问题。在大型数据集（就维度和/或实例数而言）上，使用基于搜索的技术或包装技术可能会在计算上令人望而却步。此外，许多基于相关性/冗余度评估的过滤方法在高维数据集上的花费也非常长。在本文中，我们为高维数据集提出了有效的非监督和监督特征选择/排序过滤器。这些方法使用低复杂度相关性和冗余标准，适用于有监督，半监督和无监督学习，能够充当计算密集型方法的预处理器，从而将注意力集中在有希望的特征的较小子集上。具有多达105个功能的实验结果显示了我们方法的时间效率，与最先进的技术相比，具有更低的泛化误差，同时显着更简单，更快速。

著录项

来源
《Pattern recognition letters》 |2012年第13期|p.1794-1804|共11页
作者
Artur J. Ferreira; Mario A.T. Figueiredo;
展开▼
作者单位

Instituto Superior de Engenharia de Lisboa, Lisboa, Portugal,Instituto de Telecomunicacoes, Lisboa, Portugal;

Instituto Superior Tecnico, Lisboa, Portugal,Instituto de Telecomunicacoes, Lisboa, Portugal;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
feature selection; filters; dispersion measures; similarity measures; high-dimensional data;

机译：特征选择;过滤器分散措施;相似性度量;高维数据;

相似文献

外文文献
中文文献
专利

1. Ensemble learning-based filter-centric hybrid feature selection framework for high-dimensional imbalanced data [J] . Kim Jongmo, Kang Jaewoong, Sohn Mye Knowledge-Based Systems . 2021,第MAYa23期

机译：基于学习的默认的过滤的默认混合功能选择框架，用于高维不平衡数据
2. A new improved filter-based feature selection model for high-dimensional data [J] . Munirathinam Deepak Raj, Ranganadhan Mohanasundaram Journal of supercomputing . 2020,第8期

机译：用于高维数据的新改进的基于滤波器的特征选择模型
3. Benchmark for filter methods for feature selection in high-dimensional classification data [J] . Bommert Andrea, Sun Xudong, Bischl Bernd, Computational statistics & data analysis . 2020,第1期

机译：用于在高维分类数据中的特征选择的过滤方法的基准测试
4. Efficient feature selection for high-dimensional data using two-level filter [C] . Yun Li, Zhong-Fu Wu, Jia-Min Liu, . 2004

机译：使用两级滤波器对高维数据进行有效的特征选择
5. Robust and efficient feature selection for high-dimensional datasets. [D] . Mo, Dengyao. 2011

机译：高维数据集的稳健而高效的特征选择。
6. An efficient approach for feature construction of high-dimensional microarray data by random projections [O] . Hassan Tariq, Elf Eldridge, Ian Welch -1

机译：通过随机投影构建高维微阵列数据特征的有效方法
7. Efficient feature selection filters for high-dimensional data [O] . Ferreira Artur J., Figueiredo Mário A. T. 2012

机译：高维数据的高效特征选择过滤器

Efficient feature selection filters for high-dimensional data

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅