首页> 外文期刊>International Journal of Data Warehousing and Mining >An Empirical Evaluation of Similarity Coefficients for Binary Valued Data
【24h】

An Empirical Evaluation of Similarity Coefficients for Binary Valued Data

机译:二元值数据相似系数的实证评估

获取原文
获取原文并翻译 | 示例
       

摘要

In this paper, the authors present an empirical evaluation of similarity coefficients for binary valued data. Similarity coefficients provide a means to measure the similarity or distance between two binary valued objects in a dataset such that the attributes qualifying each object have a 0-1 value. This is useful in several domains, such as similarity of feature vectors in sensor networks, document search, router network mining, and web mining. The authors survey 35 similarity coefficients used in various domains and present conclusions about the efficacy of the similarity computed in (1) labeled data to quantify the accuracy of the similarity coefficients, (2) varying density of the data to evaluate the effect ofsparsity of the values, and (3) varying number of attributes to see the effect of high dimensionality in the data on the similarity computed.
机译:在本文中,作者对二值数据的相似系数进行了实证评估。相似系数提供了一种手段来测量数据集中两个二进制值对象之间的相似性或距离,以使限定每个对象的属性具有0-1值。这在多个领域很有用,例如传感器网络中特征向量的相似性,文档搜索,路由器网络挖掘和Web挖掘。作者调查了在各个领域中使用的35个相似系数,并提出了有关以下方面的结论:在(1)标记数据中计算出的相似性以量化相似系数的准确性,(2)改变数据的密度以评估稀疏性的影响值;以及(3)改变属性数量,以查看数据中高维数对计算出的相似度的影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号