首页> 外文会议>Proceedings of the 2007 International Conference on Machine Learning and Cybernetics >A NEW DENSITY-BASED METHOD FOR REDUCING THE AMOUNT OF TRAINING DATA IN K-NN TEXT CLASSIFICATION
【24h】

A NEW DENSITY-BASED METHOD FOR REDUCING THE AMOUNT OF TRAINING DATA IN K-NN TEXT CLASSIFICATION

机译:一种新的基于密度的K-NN文本分类中减少训练数据量的方法

获取原文

摘要

With the rapid development of WWW, text classification has become the key technology in organizing and processing large amount of text data.As a simple, effective and nonparametric classification method, k-NN method is widely used in text classification.But k-NN clasifier not only has large computational demands, but also may decrease the precision of classification because of uneven density of training data.In this paper, a new density-based method for reducing the amount of training data is presented, which not only reduces the computational demands of k-NN classifier, but also improves the classification precision.The experiments show that the new method has better performance than the traditional k-NN method.
机译:随着WWW的飞速发展,文本分类已成为组织和处理大量文本数据的关键技术.k-NN方法是一种简单,有效且非参数的分类方法,广泛用于文本分类。由于训练数据的密度不均匀,不仅计算量大,而且可能降低分类的精度。本文提出了一种新的基于密度的减少训练数据量的方法,不仅减少了计算量实验表明,该方法比传统的k-NN方法具有更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号