首页> 外文会议>Proceedings of the Seventeenth annual ACM-SIAM symposium on Discrete algorithm >Analysis of incomplete data and an intrinsic-dimension Helly theorem
【24h】

Analysis of incomplete data and an intrinsic-dimension Helly theorem

机译:不完整数据分析和内维Helly定理

获取原文
获取原文并翻译 | 示例

摘要

The analysis of incomplete data is a long-standing challenge in practical statistics. When, as is typical, data objects are represented by points in Rd, incomplete data objects correspond to affine subspaces (lines or Δ-flats). With this motivation we study the problem of finding the minimum intersection radius r(L) of a set of lines or Δ-flats L: the least r such that there is a ball of radius r intersecting every flat in L. Known algorithms for finding the minimum enclosing ball for a point set (or clustering by several balls) do not easily extend to higher-dimensional flats, primarily because "distances" between flats do not satisfy the triangle inequality. In this paper we show how to restore geometry (i.e., a substitute for the triangle inequality) to the problem, through a new analog of Helly's theorem. This "intrinsic-dimension" Helly theorem states: for any family L of Δ-dimensional convex sets in a Hilbert space, there exist Δ + 2 sets L'L such that r(L) ≤ 2r(L'). Based upon this we present an algorithm that computes a (1 + ε)-core set L'L,|L'| = O42), such that the ball centered at a point c with radius (1 + ε)r(L') intersects every element of L. The running time of the algorithm is O(nΔ+1dpoly(1/ε)). For the case of lines or line segments (Δ = 1), the (expected) running time of the algorithm can be improved to O(nd poly(1/ε)). We note that the size of the core set depends only on the dimension of the input objects and is independent of the input size n and the dimension d of the ambient space.
机译:不完整数据的分析是实际统计中的长期挑战。通常,当数据对象由R d 中的点表示时,不完整的数据对象对应于仿射子空间(线或Δ展平)。以此动机,我们研究了找到一组线或Δ平面 L 最小相交半径r L )的问题:至少 r ,以便有一个半径为 r 的球与 L 中的每个平面相交。用于找到点集的最小包围球(或由多个球聚类)的已知算法不容易扩展到高维平面,主要是因为平面之间的“距离”不满足三角形不等式。在本文中,我们展示了如何通过Helly定理的新模拟来恢复几何形状(即三角形不等式的替代形式)。这个“固有维数” Helly定理指出:对于希尔伯特空间中Δ维凸集的任何族 L ,都存在Δ+ 2集 L' L ,使得 r L )≤2 r L')。基于此,我们提出了一种算法,该算法计算(1 +ε)核集 L' L ,| L' | = O (Δ 4 2 ),这样球就以半径为< 1 +ε) r L')与 L 的每个元素相交。该算法的运行时间为 O n Δ+ 1 d poly(1 /ε))。对于线或线段(Δ= 1),可以将算法的(预期)运行时间改进为 O nd poly(1 /ε) )。我们注意到,核心集的大小仅取决于输入对象的大小,并且与输入大小 n 和环境空间的大小 d 无关。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号