...
首页> 外文期刊>Information Processing Letters >Instability results for Euclidean distance, nearest neighbor search on high dimensional Gaussian data
【24h】

Instability results for Euclidean distance, nearest neighbor search on high dimensional Gaussian data

机译:欧几里德距离的不稳定结果,高维高斯数据的最近邻搜索

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

In 1998, Beyer et al. described a nearest neighbor query as unstable if the query point has nearly identical distance from all points in the dataset. Subsequently, researchers have proven that, as data dimensionality goes to infinity, the probability of query instability approaches one for various kinds of data distributions, dataset size functions, and distance metrics. This paper addresses the problem of characterizing query instability behavior over centered Gaussian data generation distributions and Euclidean distance. Sufficient conditions are established on the covariance matrices and dataset size function under which the probability of query instability approaches one. Furthermore, conditions are also established under which the query instability probability is strictly bounded away from one for a non-vanishing set of query points. (c) 2021 Elsevier B.V. All rights reserved.
机译:1998年,Beyer等。 如果查询点与数据集中的所有点几乎相同的距离,则将最近的邻居查询视为不稳定。 随后,研究人员已经证明,随着数据维度到无限的,查询不稳定性的概率接近各种数据分布,数据集大小函数和距离度量。 本文讨论了以中心高斯数据生成分布和欧几里德距离为中心的查询不稳定行为的问题。 在协方差矩阵和数据集大小函数上建立了足够的条件,其中查询不稳定性的概率接近一个。 此外,还建立了条件,在该条件下,查询不稳定性概率被严格偏向于一个用于非消失的查询点。 (c)2021 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号