首页> 外文期刊>Statistical Methods and Applications >On boosting kernel density methods for multivariate data: density estimation and classification
【24h】

On boosting kernel density methods for multivariate data: density estimation and classification

机译:关于提高多元数据的核密度方法:密度估计和分类

获取原文
获取原文并翻译 | 示例
       

摘要

Statistical learning is emerging as a promising field where a number of algorithms from machine learning are interpreted as statistical methods and vice-versa. Due to good practical performance, boosting is one of the most studied machine learning techniques. We propose algorithms for multivariate density estimation and classification. They are generated by using the traditional kernel techniques as weak learners in boosting algorithms. Our algorithms take the form of multistep estimators, whose first step is a standard kernel method. Some strategies for bandwidth selection are also discussed with regard both to the standard kernel density classification problem, and to our 'boosted' kernel methods. Extensive experiments, using real and simulated data, show an encouraging practical relevance of the findings. Standard kernel methods are often outperformed by the first boosting iterations and in correspondence of several bandwidth values. In addition, the practical effectiveness of our classification algorithm is confirmed by a comparative study on two real datasets, the competitors being trees including AdaBoosting with trees.
机译:统计学习正在成为一个有前途的领域,其中机器学习的许多算法被解释为统计方法,反之亦然。由于良好的实用性能,提升是研究最多的机器学习技术之一。我们提出用于多元密度估计和分类的算法。它们是通过使用传统的内核技术作为增强算法中的弱学习者而生成的。我们的算法采用多步估算器的形式,其第一步是标准核方法。还讨论了一些带宽选择策略,涉及标准内核密度分类问题和我们的“增强型”内核方法。使用真实和模拟数据进行的大量实验表明,这些发现具有令人鼓舞的实际意义。标准的内核方法通常在第一次增强迭代时表现不佳,并且与多个带宽值相对应。此外,通过对两个真实数据集的比较研究,我们的分类算法的实际有效性得到了证实,竞争者是树木,包括AdaBoosting与树木。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号