首页> 外文会议>IEEE conference on Sensors >Feature Selection for High Dimensionality Data in Chemical Sensing
【24h】

Feature Selection for High Dimensionality Data in Chemical Sensing

机译:化学传感中高维数据的特征选择

获取原文

摘要

We present the results obtained by applying different Feature Selection (FS) methods to the analysis of chemical sensing data. Four datasets have been considered: a mass spectrometer (MS) dataset with 505 peaks (features) and three datasets from metal oxide sensor based E-Noses with 30 features. For the MS data we first filter features individually and then apply optimal and suboptimal FS search strategies. The three E-Nose datasets present different discrimination hardness and different number of data. For every classification problem three classifiers are tested (3NN, LDA, QDA). FS performance is calculated by two nested cross-validation cycles in order to prevent selection bias. FS always increases the correct test set classification ratio (sometimes the increase is substantial) and the discriminative features are recognized. To our knowledge, this is the first benchmarking study evaluating several FS options (classifiers and search strategies, including suboptimal ones) for the classification of different chemical sensing datasets.
机译:我们介绍了通过将不同的特征选择(FS)方法应用于化学传感数据的分析来获得的结果。已经考虑了四个数据集:具有505个峰值(特征)的质谱仪(MS)数据集和来自金属氧化物传感器的基于E-NOSE的三个数据集,具有30个特征。对于MS数据,我们首先单独过滤功能,然后应用最佳和次优的FS搜索策略。三个电子鼻数据集呈现不同的辨别硬度和不同数量的数据。对于每个分类问题,测试了三个分类器(3nn,lda,qda)。 FS性能由两个嵌套交叉验证周期计算,以防止选择偏差。 FS总是增加正确的测试设定分类率(有时增加是实质性的),并且识别鉴别特征。为我们的知识,这是第一个评估几个FS选项(分类器和搜索策略,包括次优)的基准测试,用于分类不同化学传感数据集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号