An Improved Mutual Information-Based Feature Selection Algorithm for Text Classification

机译：一种改进的基于互信息的文本分类特征选择算法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Feature selection plays an important role in text classification, and contributes directly to the accuracy of the classification. In order to correct the defects, such as mutual information-Based feature selection method tends to select rare words and those words from small samples as features, and negative MI value. This paper proposes a new improved feature evaluation function for automatic text classification by taking word frequency, concentration rate between classes and dispersion within class into overall consideration. According to experimental results, the improved algorithm is well placed to remedy the defect that the original MI evaluation function is prone to select rare words, and can improve the performance of classification significantly.

机译：特征选择在文本分类中起着重要作用，并且直接有助于分类的准确性。为了纠正这些缺陷，诸如基于互信息的特征选择方法倾向于从稀疏样本中选择稀有词和那些词作为特征，并选择负的MI值。通过综合考虑词频，词类集中度和类内离散度，提出了一种新的改进的自动文本分类特征评估功能。根据实验结果，改进后的算法可以很好地弥补原来的MI评价函数易于选择稀有词的缺陷，并能显着提高分类的性能。

著录项

来源
《International Conference on Intelligent Human-Machine Systems and Cybernetics》|2013年|126-129|共4页
会议地点
作者
Xiaoyu Jiang; Shui Jin;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
feature selection; mutual information; text classification;

机译：特征选择互信息文本分类;

相似文献

外文文献
中文文献
专利

1. Feature selection algorithm for text classification based on improved mutual information [J] . CONG Shuai, ZHANG Ji-bin, XU Zhi-ming, 哈尔滨工业大学学报（英文版） . 2011,第003期

机译：基于改进互信息的文本分类特征选择算法
2. A Two-stage Text Feature Selection Algorithm for Improving Text Classification [J] . Ashokkumar P., Shankar Siva G., Srivastava Gautam, ACM transactions on Asian and low-resource language information processing . 2021,第3期

机译：改进文本分类的两级文本特征选择算法
3. IMPROVED TEXT FEATURE SELECTION ALGORITHMS IN CLASSIFICATION SEARCH OF ENVIRONMENTAL PROTECTION INFORMATION [J] . RONGJIE YANG, SHUAI MAN Journal of Environmental Protection and Ecology . 2019,第3期

机译：环保信息分类搜索中改进的文本特征选择算法
4. An Improved Mutual Information-Based Feature Selection Algorithm for Text Classification [C] . Xiaoyu Jiang, Shui Jin International Conference on Intelligent Human-Machine Systems and Cybernetics . 2013

机译：文本分类的改进的基于互信息的特征选择算法
5. Improving Feature Learning, Feature Selection, and Classification in Facial Expression Analysis [D] . Liu, Ping 2015

机译：改善面部表情分析中的特征学习，特征选择和分类
6. Parameter Selection in Mutual Information-Based Feature Selection in Automated Diagnosis of Multiple Epilepsies Using Scalp EEG [O] . Wesley T. Kerr, Ariana Anderson, Hongjing Xia, -1

机译：使用ScalP EEG自动诊断的基于相互信息的特征选择参数选择
7. Feature Selection Using Improved Mutual Information for Text Classification [O] . Jana Novovičová, Antonín Malík, Pavel Pudil 2004

机译：功能选择，使用改进的文本分类的相互信息

An Improved Mutual Information-Based Feature Selection Algorithm for Text Classification

摘要

著录项

相似文献

相关主题

期刊订阅