首页> 外文会议>International conference on combinatorial optimization and applications >TNT: An Effective Method for Finding Correlations Between Two Continuous Variables
【24h】

TNT: An Effective Method for Finding Correlations Between Two Continuous Variables

机译:TNT:查找两个连续变量之间的相关性的有效方法

获取原文

摘要

Determining whether two continuous variables are relevant, either linearly or non-linearly correlated, is a fundamental problem in data science. To test whether two continuous variables have a linear correlation is simple and has a much complete solution, but to judge whether they are in nonlinear correlation is far more difficult. Here, we propose a novel method. Tight Nearest-neighbor prediction correlation Test (TNT), to determine whether two continuous variables are nonlinearly correlated. TNT first use the values of one variable to construct a tight neighborhood structure to predict the value of the other variable and then use the sum of squared errors to measure how well the prediction is. A permutation test based on the sum of squared errors is employed to determine whether two continuous variables are relevant. To evaluate the performance of TNT, we performed extensive simulations comparing with seven existing methods. The results on both simulation and real data demonstrate that TNT is an efficient method to test nonlinear correlations, particularly for some nonlinear correlation which existing methods cannot solve, such as "ring".
机译:确定两个连续变量是否线性或非线性相关是数据科学中的一个基本问题。测试两个连续变量是否具有线性相关性很简单,并且具有非常完整的解决方案,但是判断它们是否处于非线性相关性要困难得多。在这里,我们提出了一种新颖的方法。紧密最近邻居预测相关性测试(TNT),以确定两个连续变量是否非线性相关。 TNT首先使用一个变量的值来构建紧密的邻域结构,以预测另一个变量的值,然后使用平方误差的总和来衡量预测的效果。基于平方和之和的排列检验用于确定两个连续变量是否相关。为了评估TNT的性能,我们与7种现有方法进行了广泛的仿真。仿真和实际数据的结果表明,TNT是测试非线性相关性的有效方法,特别是对于某些现有方法无法解决的非线性相关性,例如“环”。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号