首页> 外文会议>International Conference on Machine Learning >Uniform Convergence Rate of the Kernel Density Estimator Adaptive to Intrinsic Volume Dimension
【24h】

Uniform Convergence Rate of the Kernel Density Estimator Adaptive to Intrinsic Volume Dimension

机译:核密度估计器的均匀收敛速率适应固有体积尺寸

获取原文

摘要

We derive concentration inequalities for the supremum norm of the difference between a kernel density estimator (KDE) and its point-wise expectation that hold uniformly over the selection of the bandwidth and under weaker conditions on the kernel and the data generating distribution than previously used in the literature. We first propose a novel concept, called the volume dimension, to measure the intrinsic dimension of the support of a probability distribution based on the rates of decay of the probability of vanishing Euclidean balls. Our bounds depend on the volume dimension and generalize the existing bounds derived in the literature. In particular, when the data-generating distribution has a bounded Lebesgue density or is supported on a sufficiently well-behaved lower-dimensional manifold, our bound recovers the same convergence rate depending on the intrinsic dimension of the support as ones known in the literature. At the same time, our results apply to more general cases, such as the ones of distribution with unbounded densities or supported on a mixture of manifolds with different dimensions. Analogous bounds are derived for the derivative of the KDE, of any order. Our results are generally applicable but are especially useful for problems in geometric inference and topological data analysis, including level set estimation, density-based clustering, modal clustering and mode hunting, ridge estimation and persistent homology.
机译:我们导出核心密度估计(KDE)之间的差异的最高规范的浓度不平等,并且其统一在核心选择的带宽和较弱条件下均匀地保持和数据产生分布而不是先前使用的文献。我们首先提出一种称为体积尺寸的新颖概念,测量基于消失欧几里德球的概率衰减率的概率分布支持的内在尺寸。我们的界限取决于卷维度,并概括了文献中派生的现有范围。特别地,当数据产生分布具有有界的lebesgue密度或者被支撑在足够良好良好的低维歧管上时,我们的绑定取决于支持的固有尺寸作为文献中已知的那些相同的收敛速度。与此同时,我们的结果适用于更多的通用病例,例如具有无限性密度的分布,或者支持具有不同尺寸的歧管的混合物。对于KDE的衍生物,任何顺序导出类似的界限。我们的结果通常适用,但对于几何推断和拓扑数据分析中的问题特别有用,包​​括级别设定估计,基于级别的聚类,模态聚类和模式狩猎,脊估计和持续同源性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号