人工知能学会論文誌 (Transactions of the Japanese Society for Artificial Intelligence)

Improving Performance of the k-Nearest Neighbor Classifier by Combining Feature Selection with Feature Weighting


Abstract

The k-nearest neighbor (k-NN) classifier is a simple and effective classification approach, but it suffers from over-sensitivity to irrelevant and noisy features. There are two ways to reduce this sensitivity: one is to assign each feature a weight, and the other is to select a subset of relevant features. Existing research has shown that both approaches can improve generalization accuracy, but it is impossible to predict which one is better for a specific dataset. In this paper, we propose an algorithm that improves the effectiveness of k-NN by combining the two approaches: we first select all relevant features, and then assign a weight to each of them. Experiments were conducted on 14 datasets from the UCI Machine Learning Repository, and the results show that our algorithm achieves the highest accuracy, or comes close to it, on all test datasets, increasing generalization accuracy by 8.68% on average. It also achieves higher generalization accuracy than the well-known machine learning algorithms IB1-4 and C4.5.
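The abstract describes a two-stage pipeline: select the relevant features, then weight the survivors before running k-NN. Below is a minimal Python sketch of that general select-then-weight idea, not the paper's algorithm: the relevance score (mutual information), the 0.05 selection threshold, and the score-as-weight scheme are all illustrative assumptions, and a built-in scikit-learn dataset stands in for the UCI datasets used in the paper.

    from sklearn.datasets import load_wine              # stand-in dataset
    from sklearn.feature_selection import mutual_info_classif
    from sklearn.model_selection import train_test_split
    from sklearn.neighbors import KNeighborsClassifier
    from sklearn.preprocessing import StandardScaler

    X, y = load_wine(return_X_y=True)
    X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

    # Standardize so no feature dominates the Euclidean distance by scale alone.
    scaler = StandardScaler().fit(X_tr)
    X_tr, X_te = scaler.transform(X_tr), scaler.transform(X_te)

    # Stage 1 (feature selection): keep features whose relevance score clears
    # a threshold. Mutual information and the 0.05 cutoff are assumptions;
    # the paper defines its own selection criterion.
    scores = mutual_info_classif(X_tr, y_tr, random_state=0)
    keep = scores > 0.05
    X_tr_sel, X_te_sel = X_tr[:, keep], X_te[:, keep]

    # Stage 2 (feature weighting): weight each surviving feature by its score.
    w = scores[keep]
    knn = KNeighborsClassifier(n_neighbors=5)
    knn.fit(X_tr_sel * w, y_tr)
    print("test accuracy: %.3f" % knn.score(X_te_sel * w, y_te))

Scaling column j by w[j] is a standard trick for feature-weighted k-NN: the plain Euclidean distance on the scaled data equals a weighted Euclidean distance with per-feature weight w[j]**2 on the original data, so no custom distance metric is needed.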
