
Predicting Rare Classes: Can Boosting Make Any Weak Learner Strong?


Abstract

Boosting is a strong ensemble-based learning algorithm with the promise of iteratively improving classification accuracy using any base learner, as long as the base learner satisfies the condition of yielding weighted accuracy > 0.5. In this paper, we analyze boosting with respect to this basic condition on the base learner, to see whether boosting ensures prediction of rarely occurring events with high recall and precision. First, we show that a base learner can satisfy the required condition even at poor recall or precision levels, especially for very rare classes. Furthermore, we show that the intelligent weight-updating mechanism in boosting, even in its strong cost-sensitive form, does not prevent cases where the base learner always achieves high precision but poor recall, or high recall but poor precision, when mapped to the original distribution. In either of these cases, we show that the voting mechanism of boosting fails to achieve good overall recall and precision for the ensemble. In effect, our analysis indicates that one cannot be blind to the base learner's performance and simply rely on the boosting mechanism to compensate for its weakness. We validate our arguments empirically on a variety of real and synthetic rare class problems. In particular, using AdaCost as the boosting algorithm, and variations of PNrule and RIPPER as the base learners, we show that if algorithm A achieves a better recall-precision balance than algorithm B, then using A as the base learner in AdaCost yields significantly better performance than using B as the base learner.
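The abstract's first claim can be illustrated with a minimal numerical sketch (not taken from the paper's experiments): on a dataset where the positive class is 1% of the examples, a trivial base learner that always predicts the majority class already satisfies boosting's weighted-accuracy > 0.5 condition on the initial uniform weights, while achieving zero recall on the rare class. The class sizes below are assumptions chosen purely for illustration.

```python
# Hypothetical rare-class dataset: 10 positives among 1000 examples (1% rare).
n_pos, n_neg = 10, 990
labels = [1] * n_pos + [0] * n_neg
weights = [1.0 / (n_pos + n_neg)] * (n_pos + n_neg)  # uniform initial weights

# A trivial base learner that always predicts the majority (negative) class.
preds = [0] * len(labels)

# Weighted accuracy: total weight of correctly classified examples.
weighted_acc = sum(w for w, y, p in zip(weights, labels, preds) if y == p)

# Recall on the rare (positive) class.
tp = sum(1 for y, p in zip(labels, preds) if y == 1 and p == 1)
recall = tp / n_pos

print(f"weighted accuracy = {weighted_acc:.3f}")  # 0.990, well above 0.5
print(f"recall on rare class = {recall:.1f}")     # 0.0
```

This shows why the paper's condition is weak for rare classes: the "weak learner" requirement is met by a model that is useless for detecting the minority class, motivating the paper's argument that the base learner's own recall-precision behavior cannot be ignored.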

