【24h】

Determining the Cause of Negative Dissimilar Eigenvalues

机译:确定负相似特征值的原因

获取原文

摘要

Pairwise dissimilarity representations are frequently used as an alternative to feature vectors in pattern recognition. One of the problems encountered in the analysis of such data, is that the dissimilarities are rarely Euclidean, and are sometimes non-metric too. As a result the objects associated with the dissimilarities can not be embedded into a Euclidean space without distortion. One way of gauging the extent of this problem is to compute the total mass associated with the negative eigenvalues of the dissimilarity matrix. However,this test does not reveal the origins of non-Euclidean or non-metric artefacts in the data. The aim in this paper is to provide simple empirical tests that can be used to determine the origins of the negative dissimilarity eigenvalues. We consider three sources of the negative dissimilarity eigenvalues, namely a) that the data resides on a manifold (here for simplicity we consider a sphere), b) that the objects may be extended and c) that there is Gaussian error. We develop three measures based on the non-metricity and the negative spectrum to characterize the possible causes of non-Euclidean data. We then experimentally test our measures on various real-world dissimilarity datasets.
机译:在模式识别中,成对的不相似表示经常被用作特征向量的替代。分析此类数据时遇到的问题之一是,相异性很少是欧几里得的,有时也是非度量的。结果,与不相似相关的对象不能被嵌入到欧几里得空间中而不会失真。衡量此问题范围的一种方法是计算与相异矩阵的负特征值关联的总质量。但是,该测试并未揭示数据中非欧几里德或非度量伪像的起源。本文的目的是提供可用于确定负相异特征值起源的简单经验检验。我们考虑了负不相似特征值的三个来源,即a)数据驻留在流形上(在这里为简单起见,我们考虑一个球体),b)可以扩展对象和c)存在高斯误差。我们基于非度量和负谱图开发了三种度量,以表征非欧几里得数据的可能原因。然后,我们在各种现实世界的不相似性数据集上实验性地测试我们的度量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号