【24h】

Choosing the Metric in High-Dimensional Spaces Based on Hub Analysis

机译:基于Hub分析的高维空间度量

获取原文

摘要

To avoid the undesired effects of distance concentration in high-dimensional spaces, previous work has already advocated the use of fractional ℓ~p norms instead of the ubiquitous Euclidean norm. Closely related to concentration is the emergence of hub and anti-hub objects. Hub objects have a small distance to an exceptionally large number of data points while anti-hubs lie far from all other data points. The contribution of this work is an empirical examination of concentration and hubness, resulting in an unsupervised approach for choosing an ℓ~p norm by minimizing hubs while simultaneously maximizing nearest neighbor classification.
机译:为了避免在高维空间中距离集中的不良影响,先前的工作已经提倡使用分数ℓ〜p范数来代替普遍存在的欧几里得范数。与集中度密切相关的是轮毂和反轮毂物体的出现。集线器对象到大量数据点的距离很小,而反集线器则远离所有其他数据点。这项工作的贡献在于对集中度和中心度进行了实证检验,从而得出了一种无监督的方法,即通过最小化中心线同时最大化最近邻分类来选择ℓ〜p范数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号