首页> 外文期刊>JMLR: Workshop and Conference Proceedings >Scalable Greedy Feature Selection via Weak Submodularity
【24h】

Scalable Greedy Feature Selection via Weak Submodularity

机译:通过弱次模块性进行可扩展的贪婪特征选择

获取原文
           

摘要

Greedy algorithms are widely used for problems in machine learning such as feature selection and set function optimization. Unfortunately, for large datasets, the running time of even greedy algorithms can be quite high. This is because for each greedy step we need to refit a model or calculate a function using the previously selected choices and the new candidate. Two algorithms that are faster approximations to the greedy forward selection were introduced recently?[Mirzasoleiman et al., 2013, 2015]. They achieve better performance by exploiting stochastic evaluation and distributed computation respectively. Both algorithms have provable performance guarantees for submodular functions. In this paper we show that divergent from previously held opinion, submodularity is not required to obtain approximation guarantees for these two algorithms. Specifically, we show that a generalized concept of weak submodularity suffices to give multiplicative approximation guarantees. Our result extends the applicability of these algorithms to a larger class of functions. Furthermore, we show that a bounded submodularity ratio can be used to provide data dependent bounds that can sometimes be tighter also for submodular functions. We empirically validate our work by showing superior performance of fast greedy approximations versus several established baselines on artificial and real datasets.
机译:贪婪算法广泛用于机器学习中的问题,例如特征选择和集合函数优化。不幸的是,对于大型数据集,即使是贪婪算法的运行时间也可能很高。这是因为对于每个贪婪的步骤,我们都需要使用先前选择的选项和新候选者来重新拟合模型或计算函数。最近引入了两种更快速近似贪婪前向选择的算法?[Mirzasoleiman等,2013,2015]。它们分别利用随机评估和分布式计算来获得更好的性能。两种算法对于子模块功能都有可证明的性能保证。在本文中,我们表明与先前持有的观点有所不同,对于这两种算法,不需要子模数来获得近似保证。具体来说,我们证明了弱次模量的广义概念足以给出乘法近似保证。我们的结果将这些算法的适用性扩展到更大的功能类别。此外,我们表明,有界子模数比可用于提供数据相关的界线,对于子模态函数,界线有时也会更严格。我们通过在人工和真实数据集上显示快速贪婪近似优于几个已建立的基线的性能,来从经验上验证我们的工作。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号