...
首页> 外文期刊>Data mining and knowledge discovery >Generalized Gini Correlation and its Application in Data-Mining
【24h】

Generalized Gini Correlation and its Application in Data-Mining

机译:广义基尼相关度及其在数据挖掘中的应用

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

An asymmetric correlation measure commonly used in social economics, called the Gini correlation, is defined between a numerical response and a rank. We generalize the definition of this correlation so that it can be applied to data mining. The new definition, called the generalized Gini correlation, is found to include special cases that are equivalent to common evaluation measures used in data mining, for example, the LIFT measures for a binary response and the expected profit measure for a monetary response. We consider estimation and inference regarding this generalized Gini correlation. The asymptotic distribution of the estimated correlation is derived with the help of some empirical process theory. We consider several ways of constructing confidence intervals and demonstrate their performance numerically. Our paper is interdisciplinary and makes contributions to both the Gini literature and the literature of statistical inference of performance measures in data mining.
机译:在数值响应和等级之间定义了一种不对称的相关度量,该度量通常在社会经济学中使用,称为基尼相关。我们概括了这种相关性的定义,以便可以将其应用于数据挖掘。发现新的定义称为广义基尼相关性,它包含与数据挖掘中使用的常见评估度量等效的特殊情况,例如,用于二进制响应的LIFT度量和用于货币响应的预期利润度量。我们考虑关于这种广义基尼相关性的估计和推断。估计相关性的渐近分布是借助一些经验过程理论得出的。我们考虑了构造置信区间的几种方法,并通过数值证明了它们的性能。我们的论文是跨学科的,为基尼文献和数据挖掘中性​​能度量的统计推断文献做出了贡献。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号