首页> 外文会议>IEEE Conference on Computer Vision and Pattern Recognition >Image Question Answering Using Convolutional Neural Network with Dynamic Parameter Prediction
【24h】

Image Question Answering Using Convolutional Neural Network with Dynamic Parameter Prediction

机译:基于动态参数预测的卷积神经网络图像问答

获取原文

摘要

We tackle image question answering (ImageQA) problem by learning a convolutional neural network (CNN) with a dynamic parameter layer whose weights are determined adaptively based on questions. For the adaptive parameter prediction, we employ a separate parameter prediction network, which consists of gated recurrent unit (GRU) taking a question as its input and a fully-connected layer generating a set of candidate weights as its output. However, it is challenging to construct a parameter prediction network for a large number of parameters in the fully-connected dynamic parameter layer of the CNN. We reduce the complexity of this problem by incorporating a hashing technique, where the candidate weights given by the parameter prediction network are selected using a predefined hash function to determine individual weights in the dynamic parameter layer. The proposed network-joint network with the CNN for ImageQA and the parameter prediction network-is trained end-to-end through back-propagation, where its weights are initialized using a pre-trained CNN and GRU. The proposed algorithm illustrates the state-of-the-art performance on all available public ImageQA benchmarks.
机译:我们通过学习具有动态参数层的卷积神经网络(CNN)来解决图像问答(ImageQA)问题,该动态参数层的权重是根据问题自适应确定的。对于自适应参数预测,我们使用一个单独的参数预测网络,该网络由以问题为输入的门控循环单元(GRU)和生成一组候选权重作为其输出的完全连接层组成。但是,在CNN的完全连接的动态参数层中构造用于大量参数的参数预测网络具有挑战性。我们通过结合哈希技术来降低此问题的复杂性,在哈希技术中,使用预定义的哈希函数选择参数预测网络给出的候选权重,以确定动态参数层中的各个权重。拟议的带有ImageQA的CNN和参数预测网络的网络联合网络是通过反向传播进行端到端训练的,其中,其权重是使用预先训练的CNN和GRU初始化的。提出的算法说明了在所有可用的公共ImageQA基准上的最新性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号