首页> 外文会议>The 2010 International Joint Conference on Neural Networks >Incomplete Cholesky decomposition based kernel principal component analysis for large-scale data set
【24h】

Incomplete Cholesky decomposition based kernel principal component analysis for large-scale data set

机译:基于不完全Cholesky分解的大规模数据集核主成分分析

获取原文
获取外文期刊封面目录资料

摘要

Kernel principal component analysis (KPCA) is a popular nonlinear feature extraction method. It generally uses eigen-decomposition technique to extract the principal components. But the method is infeasible for large-scale data set because of the storage and computational problem. To overcome these disadvantages, an efficient iterative method of computing kernel principal components is proposed. First, the Gram matrix is transformed into the two triangular matrices using incomplete Cholesky decomposition. Then each column of the triangular matrix is treated as the input sample for the covariance-free algorithm. Thus, the kernel principal components can be iteratively computed without the eigen-decomposition. The proposed method uses less than half of original storage capacity and also greatly reduces the time complexity. More important, it still can be used even if traditional eigen-decomposition technique cannot be applied when faced with the extremely large-scale data set. The effectiveness of proposed method is validated from experimental results.
机译:核主成分分析(KPCA)是一种流行的非线性特征提取方法。它通常使用特征分解技术来提取主成分。但是由于存储和计算问题,该方法不适用于大规模数据集。为了克服这些缺点,提出了一种计算内核主成分的有效迭代方法。首先,使用不完整的Cholesky分解将Gram矩阵转换为两个三角形矩阵。然后将三角矩阵的每一列都视为无协方差算法的输入样本。因此,可以在没有特征分解的情况下迭代地计算内核主成分。所提出的方法使用不到原始存储容量的一半,并且还大大降低了时间复杂度。更重要的是,即使在面对超大规模数据集时无法应用传统的特征分解技术,它仍然可以使用。实验结果验证了所提方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号