Scalable Greedy Feature Selection via Weak Submodularity

Rajiv Khanna; Ethan Elenberg; Alex Dimakis; Sahand Negahban; Joydeep Ghosh

首页> 外文期刊>JMLR: Workshop and Conference Proceedings >Scalable Greedy Feature Selection via Weak Submodularity

【24h】

Scalable Greedy Feature Selection via Weak Submodularity

机译：通过弱次模块性进行可扩展的贪婪特征选择

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Greedy algorithms are widely used for problems in machine learning such as feature selection and set function optimization. Unfortunately, for large datasets, the running time of even greedy algorithms can be quite high. This is because for each greedy step we need to refit a model or calculate a function using the previously selected choices and the new candidate. Two algorithms that are faster approximations to the greedy forward selection were introduced recently?[Mirzasoleiman et al., 2013, 2015]. They achieve better performance by exploiting stochastic evaluation and distributed computation respectively. Both algorithms have provable performance guarantees for submodular functions. In this paper we show that divergent from previously held opinion, submodularity is not required to obtain approximation guarantees for these two algorithms. Specifically, we show that a generalized concept of weak submodularity suffices to give multiplicative approximation guarantees. Our result extends the applicability of these algorithms to a larger class of functions. Furthermore, we show that a bounded submodularity ratio can be used to provide data dependent bounds that can sometimes be tighter also for submodular functions. We empirically validate our work by showing superior performance of fast greedy approximations versus several established baselines on artificial and real datasets.

机译：贪婪算法广泛用于机器学习中的问题，例如特征选择和集合函数优化。不幸的是，对于大型数据集，即使是贪婪算法的运行时间也可能很高。这是因为对于每个贪婪的步骤，我们都需要使用先前选择的选项和新候选者来重新拟合模型或计算函数。最近引入了两种更快速近似贪婪前向选择的算法？[Mirzasoleiman等，2013，2015]。它们分别利用随机评估和分布式计算来获得更好的性能。两种算法对于子模块功能都有可证明的性能保证。在本文中，我们表明与先前持有的观点有所不同，对于这两种算法，不需要子模数来获得近似保证。具体来说，我们证明了弱次模量的广义概念足以给出乘法近似保证。我们的结果将这些算法的适用性扩展到更大的功能类别。此外，我们表明，有界子模数比可用于提供数据相关的界线，对于子模态函数，界线有时也会更严格。我们通过在人工和真实数据集上显示快速贪婪近似优于几个已建立的基线的性能，来从经验上验证我们的工作。

著录项

来源
《JMLR: Workshop and Conference Proceedings》 |2017年第3期|共9页
作者
Rajiv Khanna; Ethan Elenberg; Alex Dimakis; Sahand Negahban; Joydeep Ghosh;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类人工智能理论;
关键词

相似文献

外文文献
中文文献
专利

1. Randomized Greedy Sensor Selection: Leveraging Weak Submodularity [J] . Abolfazl Hashemi, Mahsa Ghasemi, Haris Vikalo, Automatic Control, IEEE Transactions on . 2021,第1期

机译：随机贪婪传感器选择：利用弱潜水线
2. Weakly Submodular Maximization Beyond Cardinality Constraints: Does Randomization Help Greedy? [J] . Lin Chen, Moran Feldman, Amin Karbasi JMLR: Workshop and Conference Proceedings . 2018,第2010期

机译：超越基数约束的弱次模块化最大化：随机化是否有助于贪婪？
3. Cost-Constrained feature selection in binary classification: adaptations for greedy forward selection and genetic algorithms [J] . Rudolf Jagdhuber, Michel Lang, Arnulf Stenzl, BMC Bioinformatics . 2020,第1期

机译：二进制分类中的成本约束特征选择：对贪婪前进选择和遗传算法的适应
4. Large-scale Submodular Greedy Exemplar Selection with Structured Similarity Matrices [C] . Dmitry Malioutov, Abhishek Kumar, Ian E. H. Yen Conference on Uncertainty in Artificial Intelligence . 2016

机译：具有结构相似性矩阵的大型子模块贪婪示例选择
5. Stability and Scalability of Feature Subset Selection Using Particle Swarm Optimization in Bioinformatics [D] . ?Dhrif, Hassen 2019

机译：生物信息学中粒子群优化特征子集选择的稳定性和可扩展性
6. Cost-Constrained feature selection in binary classification: adaptations for greedy forward selection and genetic algorithms [O] . Rudolf Jagdhuber, Michel Lang, Arnulf Stenzl, 2020

机译：二元分类中受成本约束的特征选择：贪婪前向选择和遗传算法的改编
7. Randomized Greedy Sensor Selection: Leveraging Weak Submodularity [O] . Abolfazl Hashemi, Mahsa Ghasemi, Haris Vikalo, 2021

机译：随机贪婪传感器选择：利用弱潜水线

Scalable Greedy Feature Selection via Weak Submodularity

摘要

著录项

相似文献

相关主题

期刊订阅