首页> 外文会议>IAPR-TC-15 international workshop on graph-based representations in pattern recognition >Adaptive Feature Selection Based on the Most Informative Graph-Based Features
【24h】

Adaptive Feature Selection Based on the Most Informative Graph-Based Features

机译:基于最翔实的基于图的特征的自适应特征选择

获取原文

摘要

In this paper, we propose a novel method to adaptively select the most informative and least redundant feature subset, which has strong discriminating power with respect to the target label. Unlike most traditional methods using vectorial features, our proposed approach is based on graph-based features and thus incorporates the relationships between feature samples into the feature selection process. To efficiently encapsulate the main characteristics of the graph-based features, we probe each graph structure using the steady state random walk and compute a probability distribution of the walk visiting the vertices. Furthermore, we propose a new information theoretic criterion to measure the joint relevance of different pairwise feature combinations with respect to the target feature, through the Jensen-Shannon divergence measure between the probability distributions from the random walk on different graphs. By solving a quadratic programming problem, we use the new measure to automatically locate the subset of the most informative features, that have both low redundancy and strong discriminating power. Unlike most existing state-of-the-art feature selection methods, the proposed information theoretic feature selection method can accommodate both continuous and discrete target features. Experiments on the problem of P2P lending platforms in China demonstrate the effectiveness of the proposed method.
机译:在本文中,我们提出了一种新的方法来自适应地选择信息量最大和冗余最少的特征子集,该方法对于目标标签具有很强的区分能力。与大多数使用矢量特征的传统方法不同,我们提出的方法基于基于图的特征,因此将特征样本之间的关系纳入了特征选择过程。为了有效地封装基于图的特征的主要特征,我们使用稳态随机游动来探查每个图结构,并计算游动到顶点的概率分布。此外,我们提出了一种新的信息理论标准,通过对不同图上随机游走的概率分布之间的詹森-香农散度进行度量,以测量不同成对特征组合相对于目标特征的联合相关性。通过解决二次编程问题,我们使用新方法来自动定位信息量最大的子集,这些子集具有低冗余性和强大的区分能力。与大多数现有的最新特征选择方法不同,所提出的信息理论特征选择方法可以同时容纳连续和离散目标特征。在中国的P2P借贷平台问题上的实验证明了该方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号