首页> 外文会议> >Efficient feature selection for high-dimensional data using two-level filter
【24h】

Efficient feature selection for high-dimensional data using two-level filter

机译:使用两级滤波器对高维数据进行有效的特征选择

获取原文

摘要

Feature selection is a key problem to pattern recognition and machine learning, and it is difficult to get the optimal feature subset for its NP-hard. Currently, the dimensionality of feature set or instance set is very high in many applications, such as information retrieval, so the feature selection from high-dimensional data is also an urgent task for researchers. This paper presents a new approach, which is a two-level filter model system integrating the relief and a newly developed algorithm of feature cluster, to reduce the dimensionality of large-scale feature set via the feature correlation (relevance) including the feature-feature correlation and feature-class correlation. Our major contributions are: (1) to present a system to perform feature selection from high-dimensional data; (2) to analyze the change of system architecture according to the time cost of the parts in the system; (3) to summarize and comment on the calculations of feature correlation; (4) to perform experiments to show the effective of the proposed approach, which has shown that the system can efficiently get a better compromise between dimensionality reduction and accuracy rate of classification than just part of the system. In many cases, it can improve the accuracy rate and dimensionality reduction.
机译:特征选择是模式识别和机器学习的关键问题,很难为其NP-hard获得最佳特征子集。当前,特征集或实例集的维数在许多应用中都很高,例如信息检索,因此从高维数据中选择特征也是研究人员的紧迫任务。本文提出了一种新的方法,该方法是一种结合了浮雕的两级过滤器模型系统和一种新开发的特征簇算法,可通过包括特征-特征的特征相关性(相关性)来降低大规模特征集的维数。相关性和要素类相关性。我们的主要贡献是:(1)提出一种从高维数据执行特征选择的系统; (2)根据系统中各部分的时间成本来分析系统架构的变化; (3)对特征相关性的计算进行总结和评论; (4)进行实验以证明所提方法的有效性,这表明该系统可以有效地在降维和分类准确率之间取得更好的折衷,而不仅仅是该系统的一部分。在许多情况下,它可以提高准确率和降维。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号