...
首页> 外文期刊>Journal of information and computational science >An EM Based Semi-Supervised Learning Algorithm for Question Classification
【24h】

An EM Based Semi-Supervised Learning Algorithm for Question Classification

机译:基于EM的问题分类半监督学习算法

获取原文
获取原文并翻译 | 示例
           

摘要

In many question classification problems based on statistic learning, unlabeled training examples are readily available but labeled ones are fairly expensive to obtain. This provides a strong motivation to improve the question classification accuracy by using large quantities of unlabeled questions. In this paper, a new semi-supervised learning algorithm is proposed for question classification. This algorithm combines Expectation-Maximization (EM) and modified Bayes classifier. First, the initial parameters of modified Bayes classifier are estimated from just the labeled questions. Then, the classifier is used to assign class label to each unlabeled questions and the model is revised iteratively to convergence. Experiments on the Chinese question system of tourism domain show that the method could effectively exploit unlabeled examples to improve the classification accuracy.
机译:在许多基于统计学习的问题分类问题中,未加标签的训练示例很容易获得,但加标签的训练示例却相当昂贵。这提供了使用大量未标记问题来提高问题分类准确性的强烈动机。本文提出了一种新的半监督学习算法用于问题分类。该算法结合了期望最大化(EM)和改进的贝叶斯分类器。首先,仅从标记的问题中估计出改进的贝叶斯分类器的初始参数。然后,使用分类器将类别标签分配给每个未标记的问题,然后对模型进行迭代修订以收敛。通过对旅游领域汉语问题系统的实验表明,该方法可以有效地利用未标注的实例来提高分类的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号