FilterBoost: Regression and Classification on LargeDatasets

机译：FilterBoost：大数据集的回归和分类

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We study boosting in the filtering setting, where the booster draws examples from an oracle instead of using a fixed training set and so may train efficiently on very large datasets. Our algorithm, which is based on a logistic regression technique proposed by Collins, Schapire, & Singer, requires fewer assumptions to achieve bounds equivalent to or better than previous work. Moreover, we give the first proof that the algorithm of Collins et al. is a strong PAC learner, albeit within the filtering setting. Our proofs demonstrate the algorithm's strong theoretical properties for both classification and conditional probability estimation, and we validate these results through extensive experiments. Empirically, our algorithm proves more robust to noise and overfitting than batch boosters in conditional probability estimation and proves competitive in classification.

机译：我们研究了过滤设置中的增强，其中增强器从oracle中提取示例，而不是使用固定的训练集，因此可以在非常大的数据集上进行有效的训练。我们的算法基于Collins，Schapire和Singer提出的逻辑回归技术，需要较少的假设才能达到与先前的工作相当或更好的界限。此外，我们给出了Collins等人的算法的第一个证明。尽管在过滤设置内，但他还是PAC学习者中的佼佼者。我们的证明证明了该算法在分类和条件概率估计方面的强大理论特性，并通过大量实验验证了这些结果。从经验上讲，我们的算法在条件概率估计方面比批处理增强器对噪声和过度拟合的鲁棒性更强，并且在分类方面具有竞争力。

著录项

来源
《Annual Conference on Neural Information Processing Systems》|2007年|57-64|共8页
会议地点
作者
Joseph K. Bradlev; Robert E. Schapire;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类信息处理（信息加工）;
关键词

相似文献

外文文献
中文文献
专利

1. Identification of insulin resistance in Asian Indian adolescents: classification and regression tree (CART) and logistic regression based classification rules. [J] . Goel R, Misra A, Kondal D, Clinical Endocrinology . 2009,第5期

机译：鉴定亚洲印度裔青少年的胰岛素抵抗：分类和回归树（CART）和基于逻辑回归的分类规则。
2. Mapping landslide susceptibility with logistic regression, multiple adaptive regression splines, classification and regression trees, and maximum entropy methods: A comparative study [J] . Felicísimo á.M., Cuartero A., Remondo J., Landslides . 2013,第2期

机译：用逻辑回归，多种自适应回归样条，分类和回归树以及最大熵方法绘制滑坡敏感性图：一项比较研究
3. A Novel Backward Stepwise Logistic Regression and Classification and Regression Tree Model to Predict 180-day Clinical Outcomes in Hepatitis B Virus-acute-on-chronic Liver Failure Patients [J] . Shima Ghavimi 临床与转化肝病杂志（英文版） . 2021,第004期

机译：一种小说逐步逐步逻辑回归和分类和回归树模型，以预测乙型肝炎病毒 - 急性对慢性肝功能衰竭患者的180天临床结果
4. FilterBoost: Regression and Classification on LargeDatasets [C] . Annual Conference on Neural Information Processing Systems . 2007

机译：过滤艇：在Largedatasets上回归和分类
5. Rapid Classification of NifH Protein Sequences using Classification and Regression Trees [D] . Frank, Ildiko 2014

机译：使用分类和回归树对NifH蛋白序列进行快速分类
6. Factors Influencing Drug Injection History among Prisoners: A Comparison between Classification and Regression Trees and Logistic Regression Analysis [O] . Azam Rastegari, Ali Akbar Haghdoost, Mohammad Reza Baneshi 2013

机译：影响囚犯注射毒品史的因素：分类树和回归树的比较以及Logistic回归分析
7. A Comparative Assessment of the Influences of Human Impacts on Soil Cd Concentrations Based on Stepwise Linear Regression, Classification and Regression Tree, and Random Forest Models. [O] . Lefeng Qiu, Kai Wang, Wenli Long, 2016

机译：基于逐步线性回归，分类和回归树及随机森林模型的人类影响对土壤镉含量影响的比较评价。

FilterBoost: Regression and Classification on LargeDatasets

摘要

著录项

相似文献

相关主题

期刊订阅