Incomplete Cholesky decomposition based kernel principal component analysis for large-scale data set


Abstract

Kernel principal component analysis (KPCA) is a popular nonlinear feature extraction method. It generally extracts the principal components by eigen-decomposition, but this is infeasible for large-scale data sets because of the storage and computational cost. To overcome these disadvantages, an efficient iterative method for computing kernel principal components is proposed. First, the Gram matrix is factored into two triangular matrices using incomplete Cholesky decomposition. Then each column of the triangular matrix is treated as an input sample for the covariance-free algorithm. Thus, the kernel principal components can be computed iteratively without eigen-decomposition. The proposed method uses less than half of the original storage and also greatly reduces the time complexity. More importantly, it can still be used on extremely large-scale data sets for which traditional eigen-decomposition cannot be applied. The effectiveness of the proposed method is validated by experimental results.
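The two steps described in the abstract can be sketched roughly as follows. This is an illustration under stated assumptions, not the authors' implementation: the RBF kernel, the function names (`rbf_column`, `incomplete_cholesky`, `covariance_free_components`), the stopping tolerance, and the iteration count are all hypothetical choices, and a simple batch fixed-point iteration with deflation stands in for the covariance-free algorithm, whose exact update rule the abstract does not give. The sketch builds a factor G with K ≈ G Gᵀ one kernel column at a time, so the full Gram matrix is never stored, and then treats the rows of G (the columns of Gᵀ) as input samples.

```python
import numpy as np


def rbf_column(X, j, gamma):
    """Column j of the RBF Gram matrix, computed on demand (K is never stored)."""
    return np.exp(-gamma * np.sum((X - X[j]) ** 2, axis=1))


def incomplete_cholesky(X, gamma, tol=1e-8, max_rank=None):
    """Pivoted incomplete Cholesky. Returns G (n x m) with K ~= G @ G.T."""
    n = X.shape[0]
    max_rank = n if max_rank is None else min(max_rank, n)
    diag = np.ones(n)  # residual diagonal; an RBF kernel has k(x, x) = 1
    G = np.zeros((n, max_rank))
    m = 0
    while m < max_rank and diag.sum() > tol:
        j = int(np.argmax(diag))  # pivot on the largest residual diagonal entry
        residual_col = rbf_column(X, j, gamma) - G[:, :m] @ G[j, :m]
        G[:, m] = residual_col / np.sqrt(diag[j])
        diag = np.maximum(diag - G[:, m] ** 2, 0.0)
        diag[j] = 0.0  # a pivot is never selected twice
        m += 1
    return G[:, :m]


def covariance_free_components(S, k, iters=1000, seed=0):
    """Leading k eigenpairs of C = S.T @ S / n without ever forming C.

    Assumption: a batch fixed-point update v <- mean_i (x_i . v) x_i with
    deflation, used here as a stand-in for the covariance-free algorithm.
    Rows of S are the input samples.
    """
    n, m = S.shape
    rng = np.random.default_rng(seed)
    V = np.zeros((m, k))
    lams = np.zeros(k)
    for c in range(k):
        v = rng.standard_normal(m)
        v /= np.linalg.norm(v)
        for _ in range(iters):
            w = S.T @ (S @ v) / n             # equals C @ v, computed from samples
            w -= V[:, :c] @ (V[:, :c].T @ w)  # remove components already found
            lam = np.linalg.norm(w)           # converges to the eigenvalue
            v = w / lam
        V[:, c] = v
        lams[c] = lam
    return lams, V
```

A possible usage, again only as a sketch: compute `G = incomplete_cholesky(X, gamma)`, center it in feature space with `Gc = G - G.mean(axis=0)` (which corresponds to centering the approximated Gram matrix), call `lams, V = covariance_free_components(Gc, k)`, and take `Gc @ V` as the projections of the training points onto the first `k` kernel principal components. Storage is then driven by the n x m factor rather than the n x n Gram matrix.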


Bibliographic record

Source: The 2010 International Joint Conference on Neural Networks | 2010 | pp. 1-6 | 6 pages
Authors: Shi Weiya; Guo Yue-Fei
Original format: PDF
Chinese Library Classification: Artificial neural networks and computing

Similar literature


1. Compound dimensionality reduction based multi-dynamic kernel principal component analysis monitoring method for batch process with large-scale data sets [J] . Journal of intelligent & fuzzy systems: Applications in Engineering and Technology . 2020, Issue 1aPta1

2. Compound dimensionality reduction based multi-dynamic kernel principal component analysis monitoring method for batch process with large-scale data sets [J] . Wang Yajun, Sun Fuming, Li Xiaohui Ecological restoration . 2020, Issue 1

3. Iterative Kernel Principal Component for Large-Scale Data Set [J] . Shi Weiya Journal of testing and evaluation . 2018, Issue 5

4. Incomplete Cholesky decomposition based kernel principal component analysis for large-scale data set [C] . Shi Weiya, Guo Yue-Fei The 2010 International Joint Conference on Neural Networks . 2010

5. Using principal component analysis (PCA) to obtain auxiliary variables for missing data in large data sets. [D] . Howard, Waylon J. 2012

6. Association Test Based on SNP Set: Logistic Kernel Machine Based Test vs. Principal Component Analysis [O] . Yang Zhao, Feng Chen, Rihong Zhai, -1

7. An efficient Kernel Principal Component Analysis Algorithm for large-scale data set * [O] . Shi Weiya, Guo Yue-fei, Xue Xiangyang 2014

