首页> 外文会议>IEEE International Conference on Image Processing >Multigap: Multi-pooled inception network with text augmentation for aesthetic prediction of photographs
【24h】

Multigap: Multi-pooled inception network with text augmentation for aesthetic prediction of photographs

机译:Multigap:具有文本增强功能的多池起始网络,用于照片的美学预测

获取原文

摘要

With the advent of deep learning, convolutional neural networks have solved many imaging problems to a large extent. However, it remains to be seen if the image “bottleneck” can be unplugged by harnessing complementary sources of data. In this paper, we present a new approach to image aesthetic evaluation that learns both visual and textual features simultaneously. Our network extracts visual features by appending global average pooling blocks on multiple inception modules (MultiGAP), while textual features from associated user comments are learned from a recurrent neural network. Experimental results show that the proposed method is capable of achieving state-of-the-art performance on the AVA / AVA-Comments datasets. We also demonstrate the capability of our approach in visualizing aesthetic activations.
机译:随着深度学习的到来,卷积神经网络已在很大程度上解决了许多成像问题。但是,是否可以通过利用互补的数据源来拔出图像“瓶颈”,还有待观察。在本文中,我们提出了一种新的图像美学评估方法,该方法可以同时学习视觉和文本特征。我们的网络通过在多个初始模块(MultiGAP)上附加全局平均池块来提取视觉特征,而关联用户评论的文本特征则从递归神经网络中学习。实验结果表明,该方法能够在AVA / AVA-Comments数据集上实现最先进的性能。我们还展示了我们的方法在可视化美学激活方面的能力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号