Stability Based Sparse LSI/PCA: Incorporating Feature Selection in LSI and PCA



Abstract

The stability of sample-based algorithms is a concept commonly used for parameter tuning and validity assessment. In this paper we focus on two well-studied algorithms, LSI and PCA, and propose a feature selection process that provably guarantees the stability of their outputs. The feature selection is performed so that the level of (statistical) accuracy of the LSI/PCA input matrices is adequate for computing meaningful (stable) eigenvectors. The feature selection process "sparsifies" LSI/PCA: the instances are projected onto the eigenvectors of a principal submatrix of the original input matrix, which produces sparse factor loadings that are linear combinations solely of the selected features. We use bootstrap confidence intervals to assess the statistical accuracy of the input sample matrices, and matrix perturbation theory to relate that statistical accuracy to the stability of the eigenvectors. Experiments on several UCI datasets empirically verify our approach.
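The ingredients named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' algorithm: the abstract does not state the selection criterion, so the feature subset here is simply given by hand, and the stability check (eigengap larger than twice a bootstrap estimate of the perturbation norm) is an assumed heuristic in the spirit of Davis-Kahan-type perturbation bounds. All function names, the factor 2, and the synthetic data are hypothetical.

```python
import numpy as np


def bootstrap_cov_perturbation(X, n_boot=200, quantile=0.95, seed=0):
    """Bootstrap quantile of the spectral-norm deviation of the sample covariance.

    Assumption: this quantile stands in for the "statistical accuracy" of the
    input matrix; the paper's exact confidence-interval construction is not
    given in the abstract.
    """
    rng = np.random.default_rng(seed)
    n = X.shape[0]
    C = np.atleast_2d(np.cov(X, rowvar=False))
    norms = np.empty(n_boot)
    for b in range(n_boot):
        idx = rng.integers(0, n, size=n)  # resample rows with replacement
        Cb = np.atleast_2d(np.cov(X[idx], rowvar=False))
        norms[b] = np.linalg.norm(Cb - C, ord=2)
    return float(np.quantile(norms, quantile))


def pca_on_principal_submatrix(X, selected, k=1):
    """PCA restricted to `selected` features.

    Returns eigenvalues (descending), sparse loadings of shape (d, k) that are
    zero on unselected features, and the projected instances.
    """
    selected = np.asarray(selected)
    C = np.atleast_2d(np.cov(X, rowvar=False))
    C_sub = C[np.ix_(selected, selected)]  # principal submatrix
    evals, evecs = np.linalg.eigh(C_sub)   # eigh returns ascending order
    order = np.argsort(evals)[::-1]
    evals, evecs = evals[order], evecs[:, order]
    loadings = np.zeros((X.shape[1], k))
    loadings[selected] = evecs[:, :k]
    scores = (X - X.mean(axis=0)) @ loadings
    return evals, loadings, scores


def looks_stable(evals, eps, k=1, factor=2.0):
    """Assumed heuristic: gap after the k-th eigenvalue exceeds factor * eps."""
    gap = evals[k - 1] - evals[k]
    return bool(gap > factor * eps)


# Synthetic data (hypothetical): two correlated features plus four noise features.
rng = np.random.default_rng(0)
n = 300
signal = 3.0 * rng.normal(size=(n, 1))
X = np.hstack([signal + 0.3 * rng.normal(size=(n, 2)), rng.normal(size=(n, 4))])

selected = [0, 1]  # hand-picked subset, for illustration only
evals, loadings, scores = pca_on_principal_submatrix(X, selected, k=1)
eps = bootstrap_cov_perturbation(X[:, selected])
stable = looks_stable(evals, eps, k=1)
```

In this sketch the loadings are zero outside `selected`, so each component is a linear combination of the selected features only, which is the sense in which restricting PCA to a principal submatrix yields sparse factor loadings.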


Bibliographic details

Source
European Conference on Machine Learning (ECML 2007); 17-21 September 2007; Warsaw (PL) | 2007 | pp. 226-237 | 12 pages
Conference location: Warsaw (PL)
Authors
Dimitrios Mavroeidis; Michalis Vazirgiannis
Author affiliations

Department of Informatics, Athens University of Economics and Business, Greece;

GEMO Team, INRIA/FUTURS, France;

Original format: PDF
Text language: English
Chinese Library Classification: Artificial intelligence theory
Date indexed: 2022-08-26 14:09:17

Similar literature

1. An Informative Feature Selection Method Based on Sparse PCA for VHR Scene Classification [J]. Chaib Souleyman, Gu Yanfeng, Yao Hongxun. Geoscience and Remote Sensing Letters, IEEE. 2016, No. 2
2. Information-theoretic feature selection with segmentation-based folded principal component analysis (PCA) for hyperspectral image classification [J]. Uddin Md. Palash, Mamun Md. Al, Afjal Masud Ibn. International Journal of Remote Sensing. 2021, No. 1a2
3. Enhanced-PCA based Dimensionality Reduction and Feature Selection for Real-Time Network Threat Detection [J]. P. More, P. Mishra. Engineering Technology and Applied Science Research. 2020, No. 5
4. Stability Based Sparse LSI/PCA: Incorporating Feature Selection in LSI and PCA [C]. Dimitrios Mavroeidis, Michalis Vazirgiannis. European Conference on Machine Learning. 2007
5. Incremental Sparse-PCA Feature Extraction For Data Streams [D]. Nziga, Jean-Pierre. 2015
6. RF-PCA: A New Solution for Rapid Identification of Breast Cancer Categorical Data Based on Attribute Selection and Feature Extraction [O]. Kai Bian, Mengran Zhou, Feng Hu. 2020
7. Informative Feature Selection for Object Recognition via Sparse PCA [O]. Nikhil Naikal, Allen Y. Yang, S. Shankar Sastry. 2012
8. Informative Feature Selection for Object Recognition via Sparse PCA [R]. Naikal, N., Yang, A., Sastry, S. S. 2011
