首页> 外文会议>International Conference on Research, Innovation and Vision for the Future >Improving the Accuracy of Question Classification with Machine Learning
【24h】

Improving the Accuracy of Question Classification with Machine Learning

机译:通过机器学习提高问题分类的准确性

获取原文

摘要

Question classification is an important phase in question answering systems. In this paper, we propose to apply i) hierarchical classifiers, ii) hierarchical classifiers in combination with semi-supervised learning and iii) hierarchy expansion for question classification for improving the precision. When the number of classes is large, the performance of classification algorithms may be affected. In order to improve the performance by reducing the number of classes for each classifier, we propose to use hierarchical classifiers according to the question taxonomy, in which each internal node is attached a classifier. We try to use semi-supervised learning to consume unlabeled questions with expectation to improve the performance of classifiers in the hierarchy. We explored different applications of learning methods in for each classifier of the hierarchy: a) supervised learning for all classifiers at all levels; b) semi-supervised learning for the first-level classifier and supervised learning for other classifiers; c) semi-supervised learning for all classifiers. The experiments show that the first method (a) has better results than those of flat classification; the second method (b) produces better results than those of the first method while the effort to increase the performance of fine classifiers in the last method (c) is not so successful. As another effort, we propose to automatically group question classes by clustering in order to expand a node which has a large number of classes in the question taxonomy. The experiment also shows that the overall precision is improved.
机译:问题分类是问答系统中的一个重要阶段。在本文中,我们提出申请I)分级分类,II)分层分类结合半监督学习和iii)层级扩展的问题分类以提高精度。当类的数目是大的,分类算法的性能可能受到影响。为了通过减少类的数量为每个分类器,以提高性能,我们提出根据问题分类法,其中每个内部节点连接的分类器使用分层分类器。我们尝试使用半监督学习消耗与预期未标记的问题,提高层次结构中的分类器的性能。我们探讨了学习的层次结构中的每个分类方法的不同的应用:1)监督学习各级各分类; B)半监督学习的第一级分类器,并监督学习用于其它分类; C)半监督学习的所有分类。实验结果表明,第一种方法(a)中具有比平坦分类的更好的结果;第二种方法(b)中产生比第一种方法的更好的结果,同时,以增加在最后方法(c)细分类器的性能的努力也不是那么成功。作为另一项努力,我们通过以扩大其在问题分类大量的类节点集群建议自动组班问题。实验还表明,总体精度提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号