首页> 外文会议>Asia Information Retrieval Symposium(AIRS 2005); 20051013-15; Jeju Island(KR) >Practical Application of Associative Classifier for Document Classification
【24h】

Practical Application of Associative Classifier for Document Classification

机译:关联分类器在文档分类中的实际应用

获取原文
获取原文并翻译 | 示例

摘要

In practical text classification tasks, the ability to interpret the classification result is as important as the ability to classify exactly. The associative classifier has favorable characteristics, rapid training, good classification accuracy, and excellent interpretation. However, the associative classifier has some obstacles to overcome when it is applied in the area of text classification. First of all, the training process of the associative classifier produces a huge amount of classification rules, which makes the prediction for a new document ineffective. We resolve this by pruning the rules according to their contribution to correct classifications. In addition, since the target text collection generally has a high dimension, the training process might take a very long time. We propose mutual information between the word and class variables as a feature selection measure to reduce the space dimension. Experimental classification results using the 20-newsgroups dataset show many benefits of the associative classification in both training and predicting.
机译:在实际的文本分类任务中,解释分类结果的能力与准确分类的能力一样重要。关联分类器具有良好的特征,训练迅速,分类准确度高,解释性好。但是,将关联分类器应用于文本分类领域时,需要克服一些障碍。首先,关联分类器的训练过程会产生大量的分类规则,这使得对新文档的预测无效。我们通过根据规则对正确分类的贡献来修剪规则来解决此问题。另外,由于目标文本集合通常具有较高的维度,因此培训过程可能需要很长时间。我们提出单词和类变量之间的共同信息,作为减少空间尺寸的特征选择措施。使用20个新闻组数据集的实验分类结果显示了关联分类在训练和预测中的许多好处。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号