Nonparametric Density Estimation: Toward Computational Tractability

机译：非参数密度估计：朝向计算途径

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Density estimation is a core operation of virtually all probabilistic learning methods (as opposed to discriminative methods). Approaches to density estimation can be divided into two principal classes, parametric methods, such as Bayesian networks, and nonparametric methods such as kernel density estimation and smoothing splines. While neither choice should be universally preferred for all situations, a well-known benefit of nonparametric methods is their ability to achieve estimation optimality for ANY input distribution as more data are observed, a property that no model with a parametric assumption can have, and one of great importance in exploratory data analysis and mining where the underlying distribution is decidedly unknown. To date, however, despite a wealth of advanced underlying statistical theory, the use of nonparametric methods has been limited by their computational intractibility for all but the smallest datasets. In this paper, we present an algorithm for kernel density estimation, the chief nonparametric approach, which is dramatically faster than previous algorithmic approaches in terms of both dataset size and dimensionality. Furthermore, the algorithm provides arbitrarily tight accuracy guarantees, provides anytime convergence, works for all common kernel choices, and requires no parameter tuning. The algorithm is an instance of a new principle of algorithm design: multi-recursion, or higher-order divide-and-conquer.

机译：密度估计是几乎所有的概率学习方法核心操作（相对于辨别方法）。方法密度估计可以分为两个主要的类别，参数的方法，例如贝叶斯网络，和非参数的方法，例如核密度估计和平滑样条。虽然没有选择应普遍首选的所有情况下，非参数方法的公知的好处是它们作为多个数据中观察到，以实现估计最优对于任何输入分配能力，属性，与一个参数假设没有模型可以具有，和一个在其中底层分布是决定性的未知探索性数据分析和挖掘具有重要意义。到目前为止，然而，尽管有丰富先进的底层统计理论，运用非参数方法已被其计算intractibility为所有，但最小数据集的限制。在本文中，我们提出了核密度估计，主要非参数方法，这种方法大大加快比以前的算法方法在这两个数据集的大小和维方面的算法。此外，该算法提供任意紧精度的质量担保，提供随时随地收敛，适用于所有的通用内核的选择，无需参数整定。该算法的算法设计的新原理的实例：多递归或高阶的分而治之。

著录项

来源
《SIAM International Conference on Data Mining》|2003年|xiv 347 p.|共9页
会议地点
作者
Alexander G. Gray; Andrew W. Moore;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP3-53;
关键词
Kernel density estimation; Nonparametric statistics; Algorithms; Divide-and-conquer; Space-partitioning trees;

机译：内核密度估计;非参数统计;算法;分开和征服;空间分区树木;

相似文献

外文文献
中文文献
专利

1. Nonparametric density estimation for randomly perturbed elliptic problems III: Convergence, computational cost, and generalizations [J] . Estep D., Holst M.J., M?lqvist A. Journal of Applied Mathematics and Computing . 2012,第1a2期

机译：随机扰动椭圆问题的非参数密度估计III：收敛性，计算成本和一般化
2. Nonparametric density estimation for randomly perturbed elliptic problems III: convergence, computational cost, and generalizations [J] . Donald Estep, Michael J. Holst, Axel Målqvist Journal of Applied Mathematics and Computing . 2012,第1a2期

机译：随机扰动椭圆问题的非参数密度估计III：收敛性，计算成本和一般化
3. Simplified Computation for Nonparametric Windows Method of Probability Density Function Estimation [J] . Joshi Niranjan, Kadir Timor, Brady Sir Michael Pattern Analysis and Machine Intelligence, IEEE Transactions on . 2011,第8期

机译：概率密度函数估计的非参数Windows方法的简化计算
4. Nonparametric Density Estimation: Toward Computational Tractability [C] . Alexander G. Gray, Andrew W. Moore SIAM International Conference on Data Mining . 2003

机译：非参数密度估计：朝向计算途径
5. Nonparametric Regression and Density Estimation on a Network [D] . Liu, Yang. 2020

机译：网络上的非参数回归和密度估计
6. High throughput nonparametric probability density estimation [O] . Jenny Farmer, Donald Jacobs 2012

机译：高通量非参数概率密度估计
7. Nonparametric Density Estimation: Toward Computational Tractability [O] . Alexander G. Gray, Andrew W. Moore 2003

机译：非参数密度估计：计算可行性

Nonparametric Density Estimation: Toward Computational Tractability

摘要

著录项

相似文献

相关主题

期刊订阅