Comparison and Improvements of Feature Extraction Methods for Text Categorization

Abstract

Feature extraction is a key step in text categorization, and its accuracy directly affects the accuracy of classification. This paper introduces and compares four commonly used text feature extraction methods: information gain (IG), mutual information (MI), the χ² statistic (CHI), and document frequency (DF). It then proposes an improved method based on CHI. Experimental results show that the proposed method can improve the accuracy of text categorization.
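As a rough illustration of two of the methods named in the abstract, the sketch below computes DF and the textbook χ² term-category score from a 2×2 contingency table. This is the standard CHI formulation, not the improved variant the paper proposes, which this record does not describe. The function names and the toy corpus are assumptions made for the example.

```python
def contingency(docs, labels, term, category):
    """Count documents by term presence and category membership.

    A: term present, in category     B: term present, other category
    C: term absent, in category      D: term absent, other category
    """
    a = b = c = d = 0
    for tokens, label in zip(docs, labels):
        has_term = term in tokens
        in_cat = label == category
        if has_term and in_cat:
            a += 1
        elif has_term:
            b += 1
        elif in_cat:
            c += 1
        else:
            d += 1
    return a, b, c, d


def chi_square(a, b, c, d):
    """Standard chi-square score: N(AD - CB)^2 / ((A+C)(B+D)(A+B)(C+D))."""
    n = a + b + c + d
    denom = (a + c) * (b + d) * (a + b) * (c + d)
    if denom == 0:
        # A degenerate table (an empty row or column) carries no signal.
        return 0.0
    return n * (a * d - c * b) ** 2 / denom


def document_frequency(docs, term):
    """Number of documents that contain the term."""
    return sum(1 for tokens in docs if term in tokens)


# Hypothetical toy corpus: each document is a set of tokens.
docs = [
    {"goal", "match", "team"},
    {"team", "score", "goal"},
    {"market", "stock", "team"},
    {"stock", "price", "market"},
]
labels = ["sport", "sport", "finance", "finance"]

for term in ("goal", "team"):
    score = chi_square(*contingency(docs, labels, term, "sport"))
    print(term, document_frequency(docs, term), round(score, 3))
```

In this toy data "team" has the higher document frequency but the lower χ² score, because it also appears in a finance document. That is one reason the two criteria can rank features differently.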

Bibliographic details

Source: International Conference on Frontiers of Manufacturing Science and Measuring Technology, 2014, 5 pages
Authors: WANG Juan; ZHANG ZhiXun; WANG Yongdong
Original format: PDF
Chinese Library Classification: TH16-53
Keywords: Text Categorization; Feature Extraction; χ² statistics

Similar literature

1. Text Categorization Optimization By A Hybrid Approach Using Multiple Feature Selection And Feature Extraction Methods [J]. K. Rajeswari, Sneha Nakil, Neha Patil. International Journal of Engineering Research and Applications, 2014, No. 5
2. A Comprehensive Empirical Comparison of Modern Supervised Classification and Feature Selection Methods for Text Categorization [J]. Yindalon Aphinyanaphongs, Lawrence D. Fu, Zhiguo Li. Journal of the American Society for Information Science and Technology, 2014, No. 10
3. Trigonometric comparison measure: A feature selection method for text categorization [J]. Kim Kyoungok, Zzang See Young. Data & Knowledge Engineering, 2019, No. JANa
4. Comparison and Improvements of Feature Extraction Methods for Text Categorization [C]. WANG Juan, ZHANG ZhiXun, WANG Yongdong. International Conference on Frontiers of Manufacturing Science and Measuring Technology, 2014
5. A new feature selection method based on support vector machines for text categorization [D]. Xu, Yaquan. 2006
6. Improved Feature-Selection Method Considering the Imbalance Problem in Text Categorization [O]. Jieming Yang, Zhaoyang Qu, Zhiying Liu
7. Text Categorization Optimization By A Hybrid Approach Using Multiple Feature Selection And Feature Extraction Methods [O]. K. Rajeswari, Sneha Nakil. 2014
