AAAI Conference on Artificial Intelligence

IVFS: Simple and Efficient Feature Selection for High Dimensional Topology Preservation

Abstract

Feature selection is an important tool for dealing with high-dimensional data. In the unsupervised case, many popular algorithms aim to maintain the structure of the original data. In this paper, we propose a simple and effective feature selection algorithm that enhances sample similarity preservation from a new perspective, topology preservation, which is represented by persistence diagrams from computational topology. The method is built on a unified feature selection framework called IVFS, which is inspired by the random subset method. The scheme is flexible and can handle cases where the problem is analytically intractable. The proposed algorithm preserves both the pairwise distances and the topological patterns of the full data well. We demonstrate that our algorithm provides satisfactory performance under a sharp sub-sampling rate, which supports efficient application of the proposed method to large-scale datasets. Extensive experiments validate the effectiveness of the proposed feature selection scheme.
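
The abstract does not spell out the procedure, so the following is only a minimal Python sketch of the general random-subset idea it alludes to, not the authors' implementation. It rests on several assumptions: each round draws a random subset of features and a random sub-sample of rows, scores the subset by how well it preserves pairwise distances, and credits that score to every feature in the subset. The loss used here (largest absolute gap between the max-normalized distance matrices) is a simple stand-in for a comparison of persistence diagrams. The names `pairwise_distances`, `inclusion_scores`, `select_features` and all parameters are hypothetical.

```python
import numpy as np


def pairwise_distances(X):
    """Euclidean distance matrix between the rows of X."""
    sq = np.sum(X * X, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * (X @ X.T)
    return np.sqrt(np.maximum(d2, 0.0))  # clip tiny negatives from round-off


def inclusion_scores(X, n_rounds=500, n_feat_sub=None, n_samp_sub=None, seed=0):
    """Per-feature average loss over the random subsets that contained it.

    Assumed stand-in loss: max absolute difference between the
    max-normalized distance matrices of the full and reduced data.
    Lower is better.
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    if n_feat_sub is None:
        n_feat_sub = max(1, d // 2)   # hypothetical default subset size
    if n_samp_sub is None:
        n_samp_sub = min(n, 100)      # hypothetical default sub-sample size
    loss_sum = np.zeros(d)
    counts = np.zeros(d)
    for _ in range(n_rounds):
        rows = rng.choice(n, size=n_samp_sub, replace=False)  # sub-sample rows
        cols = rng.choice(d, size=n_feat_sub, replace=False)  # random features
        Xs = X[rows]
        D_full = pairwise_distances(Xs)
        D_sub = pairwise_distances(Xs[:, cols])
        D_full = D_full / max(D_full.max(), 1e-12)
        D_sub = D_sub / max(D_sub.max(), 1e-12)
        loss = np.abs(D_full - D_sub).max()
        loss_sum[cols] += loss        # credit the loss to every drawn feature
        counts[cols] += 1
    avg = np.full(d, np.inf)          # features never drawn rank last
    seen = counts > 0
    avg[seen] = loss_sum[seen] / counts[seen]
    return avg


def select_features(X, k, **kwargs):
    """Indices of the k features with the lowest average subset loss."""
    return np.argsort(inclusion_scores(X, **kwargs))[:k]
```

A call such as `select_features(X, k=10, n_rounds=1000)` would return the column indices of the 10 features whose subsets distorted the distance structure least on average. Scoring on a row sub-sample keeps each round cheap, which is the property the abstract points to when it mentions sub-sampling for large-scale datasets.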
