Text Classification Using Ensemble Features Selection and Data Mining Techniques

机译：使用集成特征选择和数据挖掘技术进行文本分类

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Text categorization is a task of text mining/analytics which involves extracting useful information from unstructured resources followed by categorizing these documents. In this paper, we classify the TechTC dataset collected from various Web directories. We employed feature selection methods such as Gini index, chi-square, t-statistic, correlation which drastically reduced the model building time. Various neural network models such as probabilistic neural network, group method of data handling, multi layer perceptron yielded higher accuracies compared to other techniques applied in literature.

机译：文本分类是文本挖掘/分析的任务，涉及从非结构化资源中提取有用的信息，然后对这些文档进行分类。在本文中，我们对从各种Web目录收集的TechTC数据集进行分类。我们采用了特征选择方法，例如基尼系数，卡方，t统计量，相关性，从而大大缩短了模型构建时间。与文献中应用的其他技术相比，各种神经网络模型（例如概率神经网络，数据处理的分组方法，多层感知器）产生了更高的准确性。

著录项

来源
《International conference on swarm, evolutionary, and memetic computing》|2015年|176-186|共11页
会议地点
作者
B. Shravankumar; Vadlamani Ravi;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Text mining; Document classification; Feature selection; Classification models;

机译：文本挖掘;文件分类;功能选择;分类模型;

相似文献

外文文献
中文文献
专利

1. Feature Selection Techniques and Classification Accuracy of Supervised Machine Learning in Text Mining [J] . Loise Makara, Kennedy Ogada, Dennis Njagi Journal of Information Engineering and Applications . 2019,第3期

机译：文本挖掘中监督机器学习的特征选择技术与分类精度
2. A systematic review on techniques of feature selection and classification for text mining [J] . K. Sridharan, P. Sivakumar International journal of business information systems . 2018,第4期

机译：文本挖掘特征选择与分类技术的系统综述
3. Prediction of Skin Disease Using Ensemble Data Mining Techniques and Feature Selection Method-a Comparative Study [J] . Applied biochemistry and biotechnology, Part A. enzyme engineering and biotechnology . 2020,第2期

机译：使用集合数据挖掘技术预测皮肤病和特征选择方法 - 比较研究
4. Text Classification Using Ensemble Features Selection and Data Mining Techniques [C] . B. Shravankumar, Vadlamani Ravi International Conference on Swarm, Evolutionary, and Memetic Computing . 2015

机译：使用集合功能选择和数据挖掘技术进行文本分类
5. Sampling and text classification techniques for data mining. [D] . Chen, Bin. 2001

机译：用于数据挖掘的采样和文本分类技术。
6. Discriminative and informative features for biomolecular text mining with ensemble feature selection [O] . Sofie Van Landeghem, Thomas Abeel, Yvan Saeys, -1

机译：具有集成特征选择的生物分子文本挖掘的区分性和信息性特征
7. Discriminative and informative features for biomolecular text mining with ensemble feature selection [O] . Van Landeghem, Sofie, Abeel, Thomas, Saeys, Yvan, 2010

机译：具有集成特征选择的生物分子文本挖掘的区分性和信息性特征

Text Classification Using Ensemble Features Selection and Data Mining Techniques

摘要

著录项

相似文献

相关主题

期刊订阅