首页> 外文期刊>Knowledge-Based Systems >ApproxCCA: An approximate correlation analysis algorithm for multidimensional data streams
【24h】

ApproxCCA: An approximate correlation analysis algorithm for multidimensional data streams

机译:ApproxCCA:多维数据流的近似相关分析算法

获取原文
获取原文并翻译 | 示例
           

摘要

Correlation analysis is regarded as a significant challenge in the mining of multidimensional data streams. Great emphasis is generally placed on one-dimensional data streams with the existing correlation analysis methods for the mining of data streams. Therefore, the identification of underlying correlation among multivariate arrays (e.g. Sensor data) has long been ignored. The technique of canonical correlation analysis (CCA) has rarely been applied in multidimensional data streams. In this study, a novel correlation analysis algorithm based on CCA, called ApproxCCA, is proposed to explore the correlations between two multidimensional data streams in the environment with limited resources. By introducing techniques of unequal probability sampling and low-rank approximation to reduce the dimensionality of the product matrix composed by the sample covariance matrix and sample variance matrix, ApproxCCA successfully improves computational efficiency while ensuring the analytical precision. Experimental results of synthetic and real data sets have indicated that the computational bottleneck of traditional CCA can be overcome with ApproxCCA, and the correlations between two multidimensional data streams can also be detected accurately.
机译:在多维数据流的挖掘中,相关性分析被认为是一项重大挑战。通常使用现有的用于数据流挖掘的相关性分析方法将重点放在一维数据流上。因此,长期以来就忽略了对多元数组(例如传感器数据)之间的基础相关性的识别。规范相关分析(CCA)技术很少用于多维数据流中。在这项研究中,提出了一种新的基于CCA的相关分析算法,称为ApproxCCA,以探索资源有限的环境中两个多维数据流之间的相关性。通过引入不等概率抽样和低秩逼近技术来减少由样本协方差矩阵和样本方差矩阵组成的乘积矩阵的维数,ApproxCCA成功地提高了计算效率,同时确保了分析精度。综合和真实数据集的实验结果表明,ApproxCCA可以克服传统CCA的计算瓶颈,并且还可以准确检测两个多维数据流之间的相关性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号