...
首页> 外文期刊>Pattern Analysis and Applications >An approach of feature selection using graph-theoretic heuristic and hill climbing
【24h】

An approach of feature selection using graph-theoretic heuristic and hill climbing

机译:一种基于图论启发式和爬山的特征选择方法

获取原文
获取原文并翻译 | 示例
           

摘要

Search-based methods that use matrix- or vector-based representations of the dataset are commonly employed to solve the problem of feature selection. These methods are more generalized and easy to apply. Recently, a set of algorithms have started using graph-based representation of the dataset instead of the traditional representations. These methods require additional modelling as the dataset needs to be represented as a graph. However, graph-based methods help in visualizing inter-feature relationship based on which graph-theoretic principles can be applied to identify good-quality feature subsets. A combination of the graph-based representation with traditional search techniques has the potential to increase model performance as well as interpretability. As per literature study, there is hardly any method which combines these approaches. In this paper, we have proposed a feature selection algorithm, which represents the dataset as a graph and then uses maximal independent sets and minimal vertex covers to improve traditional hill climbing search. The proposed method produces statistically significant improvement over (i) hill climbing, (ii) standard search-based methods and (iii) pure graph-based methods.
机译:通常使用使用数据集的基于矩阵或矢量的表示形式的基于搜索的方法来解决特征选择的问题。这些方法更加通用并且易于应用。最近,一组算法已开始使用数据集的基于图形的表示形式代替传统表示形式。这些方法需要额外的建模,因为数据集需要以图形表示。但是,基于图的方法有助于可视化要素间关系,基于该关系,可以将图论原理应用于识别高质量特征子集。基于图的表示形式与传统搜索技术的结合具有提高模型性能和可解释性的潜力。根据文献研究,几乎没有任何方法可以将这些方法结合起来。在本文中,我们提出了一种特征选择算法,该算法将数据集表示为图形,然后使用最大独立集和最小顶点覆盖率来改进传统的爬山搜索。相对于(i)爬坡,(ii)基于标准搜索的方法和(iii)基于纯图形的方法,该方法产生了统计学上的显着改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号