首页> 外文会议>European Conference on Artificial Intelligence;Conference on Prestigious Applications of Intelligent Systems >Combination of Active Learning and Self-Paced Learning for Deep Answer Selection with Bayesian Neural Network
【24h】

Combination of Active Learning and Self-Paced Learning for Deep Answer Selection with Bayesian Neural Network

机译:与贝叶斯神经网络的深度答案选择的积极学习与自我定日学习的结合

获取原文

摘要

Answer Selection is an important subtask of Question Answering tasks. For this learning-to-rank problem, deep learning methods have outperformed traditional methods. To train a high-quality deep answer selection model, it often requires large amounts of labeled data, which is a costly and noise-prone process. Active learning and semi-supervised learning are usually applied in the modelling training procedure to achieve optimal accuracy with fewer labeled training samples. However, traditional active learning methods rely on good uncertainty estimates that are hard to obtain with standard neural networks. And the performance of semi-supervised learning methods are always affected adversely by the quality of the pseudo-labeled data. In this work, we propose a new framework integrating active learning and self-paced learning in training deep answer selection models. This framework proposes an uncertainty quantification method based on Bayesian neural network, which can guide active learning and self-paced learning in the same iterative process of model training. Experiments were conducted on two kinds of deep answer selection models with real-world datasets including YahooCQA and SemiEvalCQA. The results reveal that the proposed method can significantly reduce the labeled samples for model training.
机译:回答选择是问题应答任务的重要子任务。对于这种学习排名的问题,深度学习方法表现优于传统方法。要培训高质量的深度答案选择模型,它通常需要大量标记数据,这是一种昂贵和噪音的过程。积极学习和半监督学习通常在建模培训程序中应用,以实现具有较少标记训练样本的最佳准确性。然而,传统的主动学习方法依赖于良好的不确定性估计,这很难获得标准神经网络。半监督学习方法的性能始终受到伪标记数据的质量的不利影响。在这项工作中,我们提出了一种新的框架,整合了积极学习和自我节奏的学习,在训练深度答案选择模型中。该框架提出了一种基于贝叶斯神经网络的不确定性量化方法,可以在模型训练的相同迭代过程中引导主动学习和自学学习。在具有现实世界数据集的两种深度答案选择模型上进行了实验,包括Yahoocqa和SemivalcQA。结果表明,所提出的方法可以显着减少标记样品进行模型培训。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号