Multigap: Multi-pooled inception network with text augmentation for aesthetic prediction of photographs

Abstract

With the advent of deep learning, convolutional neural networks have solved many imaging problems to a large extent. However, it remains to be seen if the image “bottleneck” can be unplugged by harnessing complementary sources of data. In this paper, we present a new approach to image aesthetic evaluation that learns both visual and textual features simultaneously. Our network extracts visual features by appending global average pooling blocks on multiple inception modules (MultiGAP), while textual features from associated user comments are learned from a recurrent neural network. Experimental results show that the proposed method is capable of achieving state-of-the-art performance on the AVA / AVA-Comments datasets. We also demonstrate the capability of our approach in visualizing aesthetic activations.
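The pooling-and-fusion idea in the abstract can be illustrated with a minimal, framework-free sketch. It is not the authors' implementation: the function names (`global_average_pool`, `multigap_features`, `fuse`), the toy feature maps, and the placeholder text vector are all hypothetical, and fusing visual and textual features by plain concatenation is an assumption, since the abstract does not state how the two are combined.

```python
from statistics import fmean


def global_average_pool(feature_map):
    """Collapse a C x H x W feature map (nested lists) into a length-C vector
    by averaging every spatial position of each channel."""
    return [fmean(v for row in channel for v in row) for channel in feature_map]


def multigap_features(module_outputs):
    """Apply global average pooling to the output of each module and
    concatenate the pooled vectors into one visual descriptor."""
    pooled = []
    for feature_map in module_outputs:
        pooled.extend(global_average_pool(feature_map))
    return pooled


def fuse(visual, textual):
    """Join visual and textual features by concatenation (an assumption;
    the abstract does not specify the fusion step)."""
    return list(visual) + list(textual)


# Toy stand-ins for the outputs of two inception modules.
module_a = [
    [[1.0, 3.0], [5.0, 7.0]],  # channel 0, spatial mean 4.0
    [[0.0, 0.0], [2.0, 2.0]],  # channel 1, spatial mean 1.0
]
module_b = [
    [[2.0, 4.0, 6.0]],         # single channel, spatial mean 4.0
]

# Placeholder for the comment embedding a recurrent network would produce.
text_vec = [0.5, -0.5]

visual = multigap_features([module_a, module_b])
joint = fuse(visual, text_vec)
print(visual)  # [4.0, 1.0, 4.0]
print(joint)   # [4.0, 1.0, 4.0, 0.5, -0.5]
```

Because each module contributes one value per channel regardless of its spatial size, modules with differently sized feature maps can be pooled and concatenated without resizing. A real model would do this on tensors inside a deep learning framework.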

Bibliographic details

Source
IEEE International Conference on Image Processing | 2017 | pp. 1722-1726 | 5 pages
Authors
Yong-Lian Hii; John See; Magzhan Kairanbay; Lai-Kuan Wong;
Original format: PDF
Keywords
Visualization; Feature extraction; Recurrent neural networks; Image color analysis; Standards; Logic gates;

Similar literature

1. Multi-Pooled Inception Features for No-Reference Image Quality Assessment [J]. Domonkos Varga. Applied Sciences, 2020, no. 6.
2. MUFOLD-SS: New deep inception-inside-inception networks for protein secondary structure prediction [J]. Fang Chao, Shang Yi, Xu Dong. Proteins: Structure, Function, and Genetics, 2018, no. 5.
3. Photograph aesthetical evaluation and classification with deep convolutional neural networks [J]. Tan Yunlan, Tang Pengjie, Zhou Yimin. Neurocomputing, 2017, no. MARa8.
4. MULTIGAP: MULTI-POOLED INCEPTION NETWORK WITH TEXT AUGMENTATION FOR AESTHETIC PREDICTION OF PHOTOGRAPHS [C]. Yong-Lian Hii, John See, Magzhan Kairanbay. IEEE International Conference on Image Processing, 2017.
5. Analysing the effects of data augmentation and free parameters for text classification with recurrent convolutional neural networks [D]. Quijas, Jonathan K. 2017.
6. MUFOLD-SS: New Deep Inception-Inside-Inception Networks for Protein Secondary Structure Prediction [O]. Chao Fang, Yi Shang, Dong Xu.
7. SAINT: self-attention augmented inception-inside-inception network improves protein secondary structure prediction [O]. Mostofa Rafid Uddin, Sazan Mahbub, M Saifur Rahman. 2020.
