International Conference on Swarm, Evolutionary, and Memetic Computing

Text Classification Using Ensemble Features Selection and Data Mining Techniques


Abstract

Text categorization is a text mining/analytics task that involves extracting useful information from unstructured resources and then categorizing the documents. In this paper, we classify the TechTC dataset, which was collected from various Web directories. We employed feature selection methods such as the Gini index, chi-square, t-statistic, and correlation, which drastically reduced the model building time. Various neural network models, such as the probabilistic neural network, group method of data handling, and multilayer perceptron, yielded higher accuracies than other techniques applied in the literature.
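The abstract names the feature selectors but does not say how they are combined into an ensemble. The following is a minimal sketch under stated assumptions, not the authors' method: it scores each term with the chi-square statistic and with one common variant of the Gini index, then keeps the terms with the best mean rank across the two scorers. The mean-rank aggregation, the particular Gini variant, the function names, and the toy corpus (which is not TechTC) are all illustrative assumptions.

```python
from collections import Counter, defaultdict


def chi_square(n11, n10, n01, n00):
    """Chi-square statistic for a 2x2 term/class contingency table.

    n11: positive docs containing the term, n10: negative docs containing it,
    n01: positive docs without it,          n00: negative docs without it.
    """
    n = n11 + n10 + n01 + n00
    denom = (n11 + n10) * (n01 + n00) * (n11 + n01) * (n10 + n00)
    if denom == 0:
        return 0.0
    return n * (n11 * n00 - n10 * n01) ** 2 / denom


def score_terms(docs, labels, positive=1):
    """Return (chi_square_scores, gini_scores), each a dict keyed by term."""
    class_sizes = Counter(labels)
    n_pos = class_sizes[positive]
    n_neg = len(labels) - n_pos

    # Document frequency of each term within each class.
    counts = defaultdict(lambda: defaultdict(int))
    for tokens, label in zip(docs, labels):
        for term in set(tokens):
            counts[term][label] += 1

    chi, gini = {}, {}
    for term, per_class in counts.items():
        n11 = per_class.get(positive, 0)
        df = sum(per_class.values())  # docs containing the term
        n10 = df - n11
        chi[term] = chi_square(n11, n10, n_pos - n11, n_neg - n10)
        # Assumed Gini variant: sum over classes of P(t|c)^2 * P(c|t)^2.
        gini[term] = sum(
            (k / class_sizes[c]) ** 2 * (k / df) ** 2
            for c, k in per_class.items()
        )
    return chi, gini


def ranks(scores):
    """Map each term to its rank (0 = best); ties are broken alphabetically."""
    ordered = sorted(scores, key=lambda t: (-scores[t], t))
    return {term: i for i, term in enumerate(ordered)}


def ensemble_select(docs, labels, k):
    """Keep the k terms with the lowest mean rank across both scorers."""
    chi, gini = score_terms(docs, labels)
    r_chi, r_gini = ranks(chi), ranks(gini)
    mean_rank = {t: (r_chi[t] + r_gini[t]) / 2 for t in chi}
    return sorted(mean_rank, key=lambda t: (mean_rank[t], t))[:k]


# Hypothetical toy corpus: label 1 = machine learning pages, 0 = cooking pages.
docs = [
    ["neural", "network", "model"],
    ["neural", "network", "data"],
    ["neural", "model", "data"],
    ["recipe", "cooking", "data"],
    ["recipe", "cooking", "model"],
    ["recipe", "data", "model"],
]
labels = [1, 1, 1, 0, 0, 0]

selected = ensemble_select(docs, labels, k=4)
print(selected)
```

Terms that appear equally often in both classes score zero under chi-square and low under this Gini variant, so they fall to the bottom of the combined ranking. The reduced vocabulary would then be used to build the document vectors passed to a classifier, which is where the reduction in model building time described in the abstract would come from.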
