首页> 外文期刊>Knowledge and information systems >A high-performing comprehensive learning algorithm for text classification without pre-labeled training set
【24h】

A high-performing comprehensive learning algorithm for text classification without pre-labeled training set

机译:无需预先标记训练集的高性能文本分类综合学习算法

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, we investigate a comprehensive learning algorithm for text classification without pre-labeled training set based on incremental learning. In order to overcome the high cost in getting labeled training examples, this approach reforms fuzzy partition clustering to obtain a small quantity of labeled training data. Then the incremental learning of Bayesian classifier is applied. The model of the proposed classifier is composed of a Naive-Bayes-based incremental learning algorithm and a modified fuzzy partition clustering method. For improved efficiency, a feature reduction is designed based on the Quadratic Entropy in Mutual Information. We perform experiments to demonstrate the performance of the approach, and the results show that our approach is feasible and effective.
机译:在本文中,我们研究了一种基于增量学习的,无需预先标记的训练集的文本分类综合学习算法。为了克服获取带标签的训练样本的高成本,该方法对模糊分区聚类进行了改革,以获得少量的带标签的训练数据。然后应用贝叶斯分类器的增量学习。提出的分类器模型由基于朴素贝叶斯的增量学习算法和改进的模糊分区聚类方法组成。为了提高效率,基于互信息中的二次熵设计了特征缩减。我们进行实验以证明该方法的性能,结果表明我们的方法是可行和有效的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号