Theoretical foundations of forward feature selection methods based on mutual information

Macedo Francisco; Rosario Oliveira M.; Pacheco Antonio; Valadas Rui


Abstract

Feature selection problems arise in a variety of applications, such as microarray analysis, clinical prediction, text categorization, image classification and face recognition, multi-label learning, and classification of internet traffic. Among the various classes of methods, forward feature selection methods based on mutual information have become very popular and are widely used in practice. However, comparative evaluations of these methods have been limited by being based on specific datasets and classifiers. In this paper, we develop a theoretical framework that allows evaluating the methods based on their theoretical properties. Our framework is grounded on the properties of the target objective function that the methods try to approximate, and on a novel categorization of features, according to their contribution to the explanation of the class; we derive upper and lower bounds for the target objective function and relate these bounds with the feature types. Then, we characterize the types of approximations taken by the methods, and analyze how these approximations cope with the good properties of the target objective function. Additionally, we develop a distributional setting designed to illustrate the various deficiencies of the methods, and provide several examples of wrong feature selections. Based on our work, we identify clearly the methods that should be avoided, and the methods that currently have the best performance. (C) 2018 Elsevier B.V. All rights reserved.
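As a rough illustration of the forward greedy search mentioned in the abstract, the Python sketch below adds one feature at a time, each time picking the candidate that maximizes an estimated mutual information with the class. It is a minimal sketch under stated assumptions, not the procedure of any specific method analysed in the paper: features and class are assumed discrete, mutual information is estimated by plug-in frequencies, and the score is taken to be the joint mutual information between the class and the already selected features together with the candidate. The names `mutual_information`, `forward_select`, and the toy data are hypothetical.

```python
from collections import Counter
from itertools import product
from math import log


def mutual_information(xs, ys):
    """Plug-in estimate of I(X;Y) in nats from paired discrete samples."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    px = Counter(xs)
    py = Counter(ys)
    # sum over observed (x, y) pairs of p(x,y) * log(p(x,y) / (p(x) p(y)))
    return sum((c / n) * log(c * n / (px[x] * py[y]))
               for (x, y), c in joint.items())


def forward_select(features, target, k):
    """Greedy forward selection (assumed score: joint MI with the class)."""
    selected = []
    remaining = list(features)
    while remaining and len(selected) < k:
        def score(name):
            # Treat the selected features plus the candidate as one joint variable.
            cols = [features[s] for s in selected + [name]]
            return mutual_information(list(zip(*cols)), target)

        best = max(remaining, key=score)  # ties go to the first candidate
        selected.append(best)
        remaining.remove(best)
    return selected


# Toy data (hypothetical): the class is determined by a and b;
# a_copy is redundant given a, and noise is irrelevant.
rows = list(product([0, 1], repeat=3))  # columns: a, b, noise
features = {
    "a": [r[0] for r in rows],
    "a_copy": [r[0] for r in rows],
    "b": [r[1] for r in rows],
    "noise": [r[2] for r in rows],
}
target = [2 * r[0] + r[1] for r in rows]

selected = forward_select(features, target, k=2)
print(selected)
```

On this toy data the redundant copy and the irrelevant feature add nothing once `a` is selected, so the second pick is `b`. With many features the joint estimate becomes unreliable on finite samples, which is one reason practical methods replace it with lower-dimensional approximations.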


Bibliographic record

Source
Neurocomputing | 2019, Issue 24 | pp. 67-89 | 23 pages
Authors
Macedo Francisco; Rosario Oliveira M.; Pacheco Antonio; Valadas Rui
Affiliations

Univ Lisbon, CEMAT, Inst Super Tecn, Av Rovisco Pais, P-1049001 Lisbon, Portugal;

Univ Lisbon, CEMAT, Inst Super Tecn, Av Rovisco Pais, P-1049001 Lisbon, Portugal;

Univ Lisbon, CEMAT, Inst Super Tecn, Av Rovisco Pais, P-1049001 Lisbon, Portugal;

Univ Lisbon, Inst Super Tecn, IT, Av Rovisco Pais, P-1049001 Lisbon, Portugal;

Indexed in: Science Citation Index (SCI); Engineering Index (EI)
Original format: PDF
Language: English
Keywords
Mutual information; Feature selection methods; Forward greedy search; Performance measure; Minimum Bayes risk;


Similar literature

1. Theoretical evaluation of feature selection methods based on mutual information [J]. Pascoal Claudia, Oliveira M. Rosario, Pacheco Antonio, Neurocomputing. 2017, Issue FEBa22
2. A support vector machine-recursive feature elimination feature selection method based on artificial contrast variables and mutual information [J]. Lin X., Yang F., Zhou L., Journal of chromatography, B. Analytical technologies in the biomedical and life sciences. 2012
3. A review of feature selection methods based on mutual information [J]. Jorge R. Vergara, Pablo A. Estevez, Neural computing & applications. 2014, Issue 1
4. Mutual Information Based Initialization of Forward-Backward Search for Feature Selection in Regression Problems [C]. Alberto Guillen, Antti Sorjamaa, Gines Rubio, ICANN 2009; International conference on artificial neural networks. 2009
5. Statistical model-based methods for observation selection in wireless sensor networks and for feature selection in classification [D]. Qi, Qi. 2012
6. Parameter Selection in Mutual Information-Based Feature Selection in Automated Diagnosis of Multiple Epilepsies Using Scalp EEG [O]. Wesley T. Kerr, Ariana Anderson, Hongjing Xia
7. Theoretical Evaluation of Feature Selection Methods based on Mutual Information [O]. Pascoal, Cláudia, Oliveira, M. Rosário, Pacheco, António, 2016
