首页> 外文期刊>Artificial Intelligence Review: An International Science and Engineering Journal >On metricity of two heterogeneous measures in the presence of missing values
【24h】

On metricity of two heterogeneous measures in the presence of missing values

机译:存在缺失值时两种异类测度的度量

获取原文
获取原文并翻译 | 示例
           

摘要

Heterogeneous Euclidean-overlap metric and heterogeneous value difference metric given in machine learning literature are useful for the consideration of mixed-type data for machine learning, pattern recognition and data mining tasks. Mixed-type variables are quite common in practical problems, but this property has been taken into account only seldom in pattern recognition, data mining and decision making algorithms. We observed that these two distance measures are not actually metrics after having found a special situation when they are not metric, but pseudometric, a feature to be noted while using them. Nevertheless, by changing their definitions somewhat, it is possible to meet the metricity. Especially in medical applications, the redefinition of the two measures might be important, since otherwise it is possible in theory that, for example, two identical cases would be classified differently. Nearest neighbor searching tests with medical data were run to illustrate the behavior of these measures. Notwithstanding the violation of the metricity their original forms yielded slightly better classification results. The reason was that in real data sets tested there were very few almost similar cases according to these distance measures, and the original forms based on more separating distances than the redefinitions were slightly better in the classification.
机译:机器学习文献中给出的异构欧氏重叠度量和异构值差异度量对于考虑用于机器学习,模式识别和数据挖掘任务的混合类型数据很有用。混合类型变量在实际问题中很常见,但是在模式识别,数据挖掘和决策算法中很少考虑到此属性。我们观察到这两种距离量度在不是度量值时已发现特殊情况,实际上不是度量值,而是伪度量,这是使用它们时要注意的一个特征。但是,通过稍微更改其定义,可以满足度量标准。特别是在医疗应用中,这两种措施的重新定义可能很重要,因为否则在理论上可能会例如对两个相同的病例进行不同的分类。进行了具有医学数据的最近邻居搜索测试,以说明这些措施的行为。尽管违反了度量标准,但它们的原始形式还是产生了更好的分类结果。原因是,根据这些距离测度,在测试的真实数据集中几乎没有相似的案例,并且基于比重新定义更长的分隔距离的原始形式在分类中要好一些。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号