【24h】

CCM: A Text Classification Model by Clustering

机译:CCM:群集通过群集文本分类模型

获取原文

摘要

In this paper, a new Cluster based Classification Model (CCM) for suspicious email detection and other text classification tasks, is presented. Comparative experiments of the proposed model against traditional classification models and the boosting algorithm are also discussed. Experimental results show that the CCM outperforms traditional classification models as well as the boosting algorithm for the task of suspicious email detection on terrorism domain email dataset and topic categorization on the Reuters-21578 and 20 Newsgroups datasets. The overall finding is that applying a cluster based approach to text classification tasks simplifies the model and at the same time increases the accuracy.
机译:本文提出了一种用于可疑电子邮件检测和其他文本分类任务的新的基于集群的分类模型(CCM)。还讨论了对传统分类模型的提出模型和升压算法的比较实验。实验结果表明,CCM优于传统的分类模型以及恐怖主义域电子邮件数据集的可疑电子邮件检测任务的促进算法,并对Reuters-21578和20新闻组数据集进行了主题分类。整体发现是将基于集群的方法应用于文本分类任务简化了模型,同时增加了准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号