Feature selection for text classification using genetic algorithms

机译：使用遗传算法的文本分类特征选择

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In text classification, feature selection is essential to improve the classification effectiveness. This paper provides an empirical study of a feature selection method based on genetic algorithms for different text representation methods. This feature selection algorithm can accomplish two goals: in one hand is the search of a feature subset such that the performance of classifier is best; in other hands is find a feature subset with the smallest dimensionality which achieves higher accuracy in classification. To evaluate the performance of this approach, three from the best classifiers have been selected: Naive Bayes (NB), Nearest Neighbors (KNN) and Support Vector Machines (SVMs). Our objective is to determine whether the genetic algorithms based feature selection will improve the performances in text classification with smaller size using F-measure. Experimentations were carried out on two benchmark document collections 20Newsgroups, and Reuters-21578. And the results were very interesting.

机译：在文本分类中，特征选择对于提高分类效果至关重要。本文提供了基于遗传算法的特征选择方法用于不同文本表示方法的实证研究。这种特征选择算法可以实现两个目标：一方面是对特征子集进行搜索，以使分类器的性能达到最佳。另一方面，找到具有最小维数的特征子集，该子集可实现更高的分类精度。为了评估这种方法的性能，从最佳分类器中选择了三个：朴素贝叶斯（NB），最近邻（KNN）和支持向量机（SVM）。我们的目标是确定基于遗传算法的特征选择是否可以使用F-measure来以较小的尺寸提高文本分类的性能。对两个基准文档集20Newsgroups和Reuters-21578进行了实验。结果非常有趣。

著录项

来源
《International Conference on Modelling, Identification and Control》|2016年|806-810|共5页
会议地点
作者
Noria Bidi; Zakaria Elberrichi;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Classification algorithms; Support vector machines; Genetic algorithms; Niobium; Text categorization; Machine learning algorithms; Algorithm design and analysis;

机译：分类算法;支持向量机;遗传算法;铌;文本分类;机器学习算法;算法设计与分析;

相似文献

外文文献
中文文献
专利

1. The Feature Selection Method based on Genetic Algorithm for Efficient of Text Clustering and Text Classification [J] . Sung-Sam Hong, Wanhee Lee, Myung-Mook Han International Journal of Advances in Soft Computing and Its Applications . 2015,第1aSpecial期

机译：基于遗传算法的高效文本聚类和分类的特征选择方法
2. Genetic Algorithm based Feature Selection in High Dimensional Text Dataset Classification [J] . FERHAT OZGUR CATAK WSEAS Transactions on Information Science and Applications . 2015,第Null期

机译：高维文本数据集分类中基于遗传算法的特征选择
3. A Two-stage Text Feature Selection Algorithm for Improving Text Classification [J] . Ashokkumar P., Shankar Siva G., Srivastava Gautam, ACM transactions on Asian and low-resource language information processing . 2021,第3期

机译：改进文本分类的两级文本特征选择算法
4. Feature Selection For Text Classification Using Genetic Algorithms [C] . Noria Bidi, Zakaria Elberrichi International Conference on Modelling, Identification and Control . 2016

机译：使用遗传算法进行文本分类的功能选择
5. Genetic algorithms for feature selection and classification of complex chromatographic and spectroscopic data [D] . Mirjankar, Nikhil Suresh 2012

机译：遗传算法用于复杂色谱和光谱数据的特征选择和分类
6. Cost-Constrained feature selection in binary classification: adaptations for greedy forward selection and genetic algorithms [O] . Rudolf Jagdhuber, Michel Lang, Arnulf Stenzl, 2020

机译：二元分类中受成本约束的特征选择：贪婪前向选择和遗传算法的改编
7. An Optimized Feature Selection Technique in Diversified Natural Scene Text for Classification Using Genetic Algorithm [O] . Ghulam Jillani Ansari, Jamal Hussain Shah, Mylene C. Q. Farias, 2021

机译：不同遗传算法分类分类自然场景文本的优化特征选择技术
8. Selection of Relevant Features for Classification of Movements From Single Movement-Related Potentials Using a Genetic Algorithm. [R] . Yom-Tov, E., Inbar, G. F. 2001

机译：用遗传算法选择单个运动相关电位运动分类的相关特征。

Feature selection for text classification using genetic algorithms

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅