首页> 外文期刊>Neurocomputing >Theoretical and empirical study on the potential inadequacy of mutual information for feature selection in classification
【24h】

Theoretical and empirical study on the potential inadequacy of mutual information for feature selection in classification

机译:互信息不足对分类特征选择的理论和实证研究

获取原文
获取原文并翻译 | 示例

摘要

Mutual information is a widely used performance criterion for filter feature selection. However, despite its popularity and its appealing properties, mutual information is not always the most appropriate criterion. Indeed, contrary to what is sometimes hypothesized in the literature, looking for a feature subset maximizing the mutual information does not always guarantee to decrease the misclassification probability, which is often the objective one is interested in. The first objective of this paper is thus to clearly illustrate this potential inadequacy and to emphasize the fact that the mutual information remains a heuristic, coming with no guarantee in terms of classification accuracy. Through extensive experiments, a deeper analysis of the cases for which the mutual information is not a suitable criterion is then conducted. This analysis allows us to confirm the general interest of the mutual information for feature selection. It also helps us better apprehending the behaviour of mutual information throughout a feature selection process and consequently making a better use of it as a feature selection criterion.
机译:互信息是过滤器特征选择中广泛使用的性能标准。但是,尽管相互信息广受欢迎并具有吸引人的特性,但它们并不总是最合适的标准。确实,与文献中有时所假设的相反,寻找使互信息最大化的特征子集并不能总保证降低误分类的概率,这常常是人们感兴趣的目标。因此,本文的首要目标是清楚地说明这种潜在的不足,并强调一个事实,即相互信息仍然是一种启发式方法,无法保证分类的准确性。通过广泛的实验,然后对互信息不是合适标准的情况进行了更深入的分析。通过这种分析,我们可以确认共同的信息对特征选择的普遍兴趣。它还可以帮助我们更好地理解整个信息选择过程中相互信息的行为,从而更好地利用它作为特征选择标准。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号