Journal: Machine Learning

Learning noisy linear classifiers via adaptive and selective sampling



Abstract

We introduce efficient margin-based algorithms for selective sampling and filtering in binary classification tasks. Experiments on real-world textual data reveal that our algorithms perform significantly better than popular and similarly efficient competitors. Using the so-called Mammen-Tsybakov low noise condition to parametrize the instance distribution, and assuming linear label noise, we show bounds on the convergence rate to the Bayes risk of a weaker adaptive variant of our selective sampler. Our analysis reveals that, excluding logarithmic factors, the average risk of this adaptive sampler converges to the Bayes risk at rate N^{−(1+α)(2+α)/(2(3+α))}, where N denotes the number of queried labels and α > 0 is the exponent in the low noise condition. For all α > √3 − 1 ≈ 0.73 this convergence rate is asymptotically faster than the rate N^{−(1+α)/(2+α)} achieved by the fully supervised version of the base selective sampler, which queries all labels. Moreover, for α → ∞ (hard margin condition) the gap between the semi- and fully-supervised rates becomes exponential.
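The margin-based selective sampling idea described above can be sketched as follows: maintain a linear predictor, and query the (noisy) label only when the current margin is small, i.e. when the prediction is uncertain. This is an illustrative simplification, not the paper's exact algorithm — the fixed `threshold`, the perceptron-style update, and the toy `oracle` are all hypothetical choices made for the sketch.

```python
import numpy as np

rng = np.random.default_rng(0)

def selective_sampling_sketch(X, oracle, threshold=0.2, lr=0.1):
    """Margin-based selective sampler (illustrative sketch only).

    Maintains a linear model w and queries the oracle for a label only
    when the current margin |w·x| falls below `threshold`; otherwise the
    example is predicted without spending a label query.
    """
    w = np.zeros(X.shape[1])
    queries = 0
    for x in X:
        margin = w @ x
        if abs(margin) <= threshold:      # low confidence: query the label
            y = oracle(x)
            queries += 1
            if y * margin <= 0:           # perceptron-style update on mistakes
                w += lr * y * x
        # high-confidence examples are classified but never queried
    return w, queries

# Toy usage: 2-D instances with linear label noise, i.e. the flip
# probability of the label grows as the true margin shrinks.
X = rng.normal(size=(500, 2))
w_true = np.array([1.0, -1.0])

def oracle(x):
    p_correct = 0.5 + 0.5 * min(1.0, abs(w_true @ x))
    y = np.sign(w_true @ x) or 1.0
    return y if rng.random() < p_correct else -y

w_hat, n_queries = selective_sampling_sketch(X, oracle)
print(n_queries, "labels queried out of", len(X))
```

The point of the sketch is the query rule: only a fraction of the stream's labels are requested, while the predictor is still trained on exactly the uncertain examples where a label is most informative.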
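The rate comparison in the abstract can be checked numerically. The exponents below are taken directly from the stated rates N^{−(1+α)(2+α)/(2(3+α))} (adaptive sampler) and N^{−(1+α)/(2+α)} (fully supervised baseline); a larger exponent means faster convergence. Setting the two exponents equal gives (2+α)² = 2(3+α), whose positive root is α = √3 − 1 ≈ 0.73, the crossover point quoted in the abstract.

```python
import math

def semi_exp(a):
    # exponent e in the adaptive sampler's rate N^{-e}
    return (1 + a) * (2 + a) / (2 * (3 + a))

def full_exp(a):
    # exponent e in the fully supervised rate N^{-e}
    return (1 + a) / (2 + a)

crossover = math.sqrt(3) - 1  # positive root of a^2 + 2a - 2 = 0
for a in (0.5, crossover, 1.0, 3.0):
    print(f"alpha={a:.3f}  semi={semi_exp(a):.3f}  full={full_exp(a):.3f}")
```

Below the crossover the fully supervised rate wins; above it the adaptive sampler's exponent is strictly larger, and it grows linearly in α while the supervised exponent stays below 1, which is the "exponential gap" claim for α → ∞.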
