...
【24h】

Wrapper feature selection for small sample size data driven by complete error estimates

机译:针对由完整误差估计驱动的小样本数据的包装器功能选择

获取原文
获取原文并翻译 | 示例

摘要

This paper focuses on wrapper-based feature selection for a 1-nearest neighbor classifier. We consider in particular the case of a small sample size with a few hundred instances, which is common in biomedical applications. We propose a technique for calculating the complete bootstrap for a 1-nearest-neighbor classifier (i.e., averaging over all desired test/train partitions of the data). The complete bootstrap and the complete cross-validation error estimate with lower variance are applied as novel selection criteria and are compared with the standard bootstrap and cross-validation in combination with three optimization techniques - sequential forward selection (SFS), binary particle swarm optimization (BPSO) and simplified social impact theory based optimization (SSITO). The experimental comparison based on ten datasets draws the following conclusions: for all three search methods examined here, the complete criteria are a significantly better choice than standard 2-fold cross-validation, 10-fold cross-validation and bootstrap with 50 trials irrespective of the selected output number of iterations. All the complete criterion-based 1NN wrappers with SFS search performed better than the widely-used FILTER and SIMBA methods. We also demonstrate the benefits and properties of our approaches on an important and novel real-world application of automatic detection of the subthalamic nucleus.
机译:本文着重于针对1个最近邻居分类器的基于包装器的特征选择。我们特别考虑小样本量和数百个实例的情况,这在生物医学应用中很常见。我们提出了一种用于计算1-最近邻分类器的完整引导程序的技术(即,对数据的所有所需测试/训练分区进行平均)。具有较低方差的完整引导程序和完整的交叉验证误差估计被用作新的选择标准,并与标准的引导程序和交叉验证相结合并结合了三种优化技术-顺序前向选择(SFS),二进制粒子群优化( BPSO)和简化的基于社会影响理论的优化(SSITO)。基于十个数据集的实验比较得出以下结论:对于此处检查的所有三种搜索方法,与50项试验无关的2倍交叉验证,10倍交叉验证和自举的标准相比,完整标准是更好的选择。选定的输出迭代次数。具有SFS搜索功能的所有基于标准的完整1NN包装器的性能均优于广泛使用的FILTER和SIMBA方法。我们还展示了我们的方法在丘脑下丘脑核自动检测的重要且新颖的实际应用中的优势和特性。

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号