Correlation-assisted nearest shrunken centroid classifier with applications for high dimensional spectral data

Xu Jian; Xu Qingsong; Yi Lunzhao; Chan Chi-On; Mok Daniel Kam-Wah

首页> 外文期刊>Journal of Chemometrics >Correlation-assisted nearest shrunken centroid classifier with applications for high dimensional spectral data

【24h】

Correlation-assisted nearest shrunken centroid classifier with applications for high dimensional spectral data

机译：相关辅助的最近收缩质心分类器及其在高维光谱数据中的应用

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

High throughput data are frequently observed in contemporary chemical studies. Classification through spectral information is an important issue in chemometrics. Linear discriminant analysis (LDA) fails in the large-p-small-n situation for two main reasons: (1) the sample covariance matrix is singular when p > n and (2) there is an accumulation of noise in the estimation of the class centroid in high dimensional feature space. The Independence Rule is a class of methods used to overcome these drawbacks by ignoring the correlation information between spectral variables. However, a strong correlation is an essential characteristic of spectral data. We proposed a new correlation-assisted nearest shrunken centroid classifier (CA-NSC) to incorporate correlation information into the classification. CA-NSC combines two sources of information [class centroid (mean) and correlation structure (variance)] to generate the classification. We used two real data analyses and a simulation study to verify our CA-NSC method. In addition to NSC, we also performed a comparison with the soft independent modeling of class analogy (SIMCA) approach, which uses only correlation structure information for classification. The results show that CA-NSC consistently improves on NSC and SIMCA. The misclassification rate of CA-NSC is reduced by almost half compared with NSC in one of the real data analyses. Generally, correlation among variables will worsen the performance of NSC, even though the discriminatory information contained in the class centroid remains unchanged. If only correlation structure information is used (as in the case of SIMCA), the result will be satisfactory only when the correlation structure alone can provide sufficient information for classification. Copyright (C) 2015 John Wiley & Sons, Ltd.

机译：在当代化学研究中经常观察到高通量数据。通过光谱信息进行分类是化学计量学中的重要问题。线性判别分析（LDA）在大-小-小-n情况下失败的主要原因有两个：（1）当p> n时样本协方差矩阵是奇异的;（2）在估计的p时存在噪声累积高维特征空间中的类质心。独立规则是一类用于通过忽略频谱变量之间的相关信息来克服这些缺点的方法。但是，强相关性是光谱数据的基本特征。我们提出了一种新的相关辅助最近收缩质心分类器（CA-NSC），以将相关信息纳入分类。 CA-NSC结合了两种信息来源[类质心（均值）和相关结构（方差）]以生成分类。我们使用两次真实数据分析和一次模拟研究来验证我们的CA-NSC方法。除了NSC，我们还与类比的软独立建模（SIMCA）方法进行了比较，该方法仅使用相关结构信息进行分类。结果表明，CA-NSC在NSC和SIMCA上持续改进。在一项实际数据分析中，与NSC相比，CA-NSC的错误分类率降低了近一半。通常，即使类质心中包含的歧视性信息保持不变，变量之间的相关性也会恶化NSC的性能。如果仅使用相关结构信息（如SIMCA的情况），则仅当相关结构本身可以提供足够的分类信息时，结果才会令人满意。版权所有（C）2015 John Wiley＆Sons，Ltd.

著录项

来源
《Journal of Chemometrics》 |2016年第1期|共9页
作者
Xu Jian; Xu Qingsong; Yi Lunzhao; Chan Chi-On; Mok Daniel Kam-Wah;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类化学;
关键词
classification; soft independent modeling of class analogy; principal component analysis;

机译：分类;类比的软独立建模;主成分分析;

相似文献

外文文献
中文文献
专利

1. Correlation-assisted nearest shrunken centroid classifier with applications for high dimensional spectral data [J] . Xu Jian, Xu Qingsong, Yi Lunzhao, Journal of Chemometrics . 2016,第1期

机译：相关辅助的最近收缩质心分类器及其在高维光谱数据中的应用
2. Improved centroids estimation for the nearest shrunken centroid classifier [J] . Sijian Wang, Ji Zhu Bioinformatics . 2007,第8期

机译：最近收缩的质心分类器的改进质心估计
3. Improved centroids estimation for the nearest shrunken centroid classifier [J] . Sijian Wang, and Ji Zhu Bioinformatics . 2007,第8期

机译：最近收缩的质心分类器的改进质心估计
4. Nearest Shrunken Centroid as Feature Selection of Microarray Data [C] . Myungsook Klassen, Nyunsu Kim 24th international conference on computers and their applications 2009 . 2009

机译：最近收缩质心作为微阵列数据的特征选择
5. An application of artificial intelligence techniques in classifying tree species with LiDAR and multi-spectral scanner data. [D] . Posadas, Benedict Kit A., Jr. 2008

机译：人工智能技术在LiDAR和多光谱扫描仪数据分类树种中的应用。
6. Improved shrunken centroid classifiers for high-dimensional class-imbalanced data [O] . Rok Blagus, Lara Lusa 2013

机译：改进的收缩质心分类器用于处理高维类不平衡数据
7. Correlation-assisted nearest shrunken centroid classifier with applications for high dimensional spectral data [O] . Xu J, Xu Q, Yi L, 2016

机译：相关辅助的最近收缩质心分类器及其在高维光谱数据中的应用
8. Nearest Neighbor - a New Non-Parametric Test Used for Classifying Spectral Data [R] . Chapman, W. E., Nadeau, J. J., Switzer, P. 1968

机译：最近邻 - 一种用于光谱数据分类的非参数检验

Correlation-assisted nearest shrunken centroid classifier with applications for high dimensional spectral data

摘要

著录项

相似文献

相关主题

期刊订阅