A Hybrid Text Classification model based on Rough Sets and Genetic Algorithms

机译：一种基于粗糙集和遗传算法的混合文本分类模型

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Automatic categorization of documents into predefined taxonomies is a crucial step in data mining and knowledge discovery. Standard machine learning techniques like support vector machines(SVM) and related large margin methods have been successfully applied for this task Unfortunately, the high dimensionality of input feature vectors impacts on the classification speed. The kernel parameters setting for SVM in a training process impacts on the classification accuracy. Feature selection is another factor that impacts classification accuracy. The objective of this work is to reduce the dimension of feature vectors, optimizing the parameters to improve the SVM classification accuracy and speed In order to improve classification speed we spent rough sets theory to reduce the feature vector space. We present a genetic algorithm approach for feature selection and parameters optimization to improve classification accuracy. Experimental results indicate our method is more effective than traditional SVM methods and other traditional methods.

机译：将文档自动分类为预定分类学，是数据挖掘和知识发现的重要步骤。标准机器学习技术如支持向量机（SVM）和相关的大型裕度方法已成功应用此项任务，输入特征向量的高维度对分类速度影响。 SVM在培训过程中的内核参数设置对分类准确性的影响。特征选择是影响分类准确性的另一个因素。这项工作的目的是减少特征向量的尺寸，优化参数以提高SVM分类精度和速度，以提高分类速度，我们花了粗糙集理论来减少特征矢量空间。我们提出了一种遗传算法方法，用于提高分类准确性的特征选择和参数优化方法。实验结果表明，我们的方法比传统的SVM方法和其他传统方法更有效。

著录项

来源
《International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing》|2008年||共4页
会议地点
作者
Xiaoyue Wang; Zhen Hua; Rujiang Bai;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP31-532;
关键词
Text; Classification; model;

机译：文字;分类;型号;

相似文献

外文文献
中文文献
专利

1. Rough Set Based Hybrid Algorithm For Text Classification [J] . Duoqian Miao, Qiguo Duan, Hongyun Zhang, Expert systems with applications . 2009,第5期

机译：基于粗糙集的混合文本分类算法
2. Hybrid System based on Rough Sets and Genetic Algorithms for Medical Data Classifications [J] . Hanaa Ismail Elshazly, Ahmad Taher Azar, Aboul Ella Hassanien, International journal of fuzzy system applications . 2013,第4期

机译：基于粗糙集和遗传算法的医学数据分类混合系统
3. A hybrid model based on rough sets theory and genetic algorithms for stock price forecasting [J] . Cheng CH, Chen TL, Wei LY Information Sciences: An International Journal . 2010,第9期

机译：基于粗糙集理论和遗传算法的股票价格混合模型
4. A Hybrid Text Classification model based on Rough Sets and Genetic Algorithms [C] . Xiaoyue Wang, Zhen Hua, Rujiang Bai International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing . 2008

机译：一种基于粗糙集和遗传算法的混合文本分类模型
5. MICA: A Hybrid Method for Corpus-Based Algorithmic Composition of Music Based on Genetic Algorithms, Zipf's Law, and Markov Models [D] . Nagelberg, Alan 2014

机译：MICA：基于遗传算法，齐普夫定律和马尔可夫模型的基于语料库的音乐算法合成的混合方法
6. Hybrid Model Based on Genetic Algorithms and SVM Applied to Variable Selection within Fruit Juice Classification [O] . C. Fernandez-Lozano, C. Canto, M. Gestal, 2013

机译：基于遗传算法和支持向量机的混合模型在果汁分类中的变量选择
7. A Data Preprocessing Algorithm for Classification Model Based On Rough Sets [O] . Xiang-wei Li, Yian-fang Qi 2012

机译：基于粗糙集的分类模型数据预处理算法
8. Rough Set Feature Selection Algorithms for Textual Case-Based Classification. [R] . Gupta, K. M., Aha, D. W., Moore, P. 2006

机译：基于文本案例分类的粗糙集特征选择算法。

A Hybrid Text Classification model based on Rough Sets and Genetic Algorithms

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅