...
首页> 外文期刊>Expert systems with applications >Minimizing neighborhood evidential decision error for feature evaluation and selection based on evidence theory
【24h】

Minimizing neighborhood evidential decision error for feature evaluation and selection based on evidence theory

机译:基于证据理论的邻域证据决策误差最小化特征评估与选择

获取原文
获取原文并翻译 | 示例
           

摘要

Feature selection is an important preprocessing step in pattern recognition and machine learning, and feature evaluation arises as key issues in the construction of feature selection algorithms. In this study, we introduce a new concept of neighborhood evidential decision error to evaluate the quality of candidate features and construct a greedy forward algorithm for feature selection. This technique considers both the Bayes error rate of classification and spatial information of samples in the decision boundary regions.Within the decision boundary regions, each sample x, in the neighborhood of x provides a piece of evidence reflecting the decision of x so as to separate the decision boundary regions into two subsets: recognizable and misclassified regions. The percentage of misclassified samples is viewed as the Bayes error rate of classification in the corresponding feature subspaces. By minimizing the neighborhood evidential decision error (i.e., Bayes error rate), the optimal feature subsets of raw data set can be selected. Some numerical experiments were conducted to validate the proposed technique by using nine UCI classification datasets. The experimental results showed that this technique is effective in most of the cases, and is insensitive to the size of neighborhood comparing with other feature evaluation functions such as the neighborhood dependency.
机译:特征选择是模式识别和机器学习中的重要预处理步骤,特征评估是构建特征选择算法中的关键问题。在这项研究中,我们引入了邻域证据决策误差的新概念,以评估候选特征的质量,并构建用于特征选择的贪婪前进算法。该技术同时考虑了分类边界的贝叶斯错误率和决策边界区域内样本的空间信息,在决策边界区域内,x附近的每个样本x提供了反映x决策的证据,从而可以将x分离决策边界区域分为两个子集:可识别区域和错误分类区域。错误分类的样本的百分比被视为相应特征子空间中的贝叶斯分类错误率。通过最小化邻域证据决策误差(即贝叶斯误差率),可以选择原始数据集的最佳特征子集。通过使用9个UCI分类数据集,进行了一些数值实验,以验证所提出的技术。实验结果表明,该技术在大多数情况下是有效的,并且与其他特征评估功能(例如邻域依赖性)相比,对邻域的大小不敏感。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号