首页> 外文会议>International Conference on Industrial and Information Systems >Study of text classification methods for data sets with huge features
【24h】

Study of text classification methods for data sets with huge features

机译:具有巨大功能的数据集文本分类方法研究

获取原文

摘要

Text classification has gained booming interest over the past few years. In this paper we look at the main approaches that have been taken towards text classification. The key text classification techniques including text model, feature selection methods and text classification algorithms are discussed. This work focus on the implementation of a text classification system based on Mutual Information and K-Nearest Neighbor algorithm and Support Vector Machine. The experimental results on Reuters collection are also presented. It shows that Mutual Information is a kind of efficient dimension reduction method for text data sets with huge features.
机译:文本分类在过去几年中获得了蓬勃发展的兴趣。在本文中,我们研究了对文本分类所采取的主要方法。讨论了包括文本模型,特征选择方法和文本分类算法的关键文本分类技术。这项工作侧重于基于互信息和k最近邻算法的文本分类系统的实现和支持向量机。还提出了路透社收集的实验结果。它表明,相互信息是具有巨大特征的文本数据集的一种有效的维度减少方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号