Photograph aesthetical evaluation and classification with deep convolutional neural networks

Tan Yunlan; Tang Pengjie; Zhou Yimin; Luo Wenlang; Kang Yongping; Li Guangyao

首页> 外文期刊>Neurocomputing >Photograph aesthetical evaluation and classification with deep convolutional neural networks

【24h】

Photograph aesthetical evaluation and classification with deep convolutional neural networks

机译：深度卷积神经网络对照片进行美学评估和分类

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In response to the growth of digital photography and its many related applications, researchers have been actively investigating methods for providing automated aesthetical evaluation and classification of photographs. For computational networks to recognize aesthetic qualities, the learning algorithms must be trained using sample sets of characteristics that have known aesthetic values. Traditional methods for developing this training have required manual extraction of aesthetic features for use in the practice datasets. With abundant appearance of convolutional neural networks (CNN), the networks have learned features automatically and have acted as important tools for evaluation and classification. At the time of our research, several existing convolutional neural networks for photograph aesthetical classification only used shallow depth networks, which limit the improvement of performance. In addition, most methods have extracted only one patch as a training sample, such as a down-sized crop from each image. However, a single patch might not represent the entire image accurately, which could cause ambiguity during training. What's more, for existing datasets, the numbers of high quality images of each category are mostly too small to train deep CNN networks. To solve these problems, we introduce a novel photograph aesthetic classifier with a deep and wide CNN for fine granularity aesthetical quality prediction. First, we download a large number of consumer photographic images from DPChallenge.com (a well-known online photography portal) to construct a dataset suitable for aesthetic quality assessment. Then, we zoom out the images into 256x256 by bilinear interpolation and crop 10 patches (Center+four Comers+Flipping). Once we have associated the set with the image's training labels, We feed the images with the bag of patches into the fine-tuned networks. Our proposed computational method is configured to classify the photographs into high and low aesthetic values. A training pattern specifying an output of (0, 1) indicates that the corresponding image belongs to the "low aesthetic quality" set. Likewise, a training pattern with an output of (1, 0) indicates that the corresponding image belongs to the "high aesthetic quality" set. Experimental results show that the accuracy of classification provided by our method is greater than 87.10%, which is noticeably better than the state-of-the-art methods. In addition, our experiments show that our results are fundamentally consistent with human visual perception and aesthetic judgments.

机译：为了响应数字摄影及其许多相关应用的增长，研究人员一直在积极研究用于提供照片的自动美学评估和分类的方法。为了使计算网络能够识别美学品质，必须使用具有已知美学价值的特征样本集来训练学习算法。开发此培训的传统方法要求手动提取美学特征以用于实践数据集。随着卷积神经网络（CNN）的出现，这些网络可以自动学习特征，并成为评估和分类的重要工具。在我们进行研究时，现有的几种用于照片美学分类的卷积神经网络仅使用浅深度网络，这限制了性能的提高。另外，大多数方法仅从一个图像中提取一个补丁作为训练样本，例如缩小尺寸的作物。但是，单个补丁可能无法准确地表示整个图像，这可能在训练过程中造成歧义。此外，对于现有的数据集，每个类别的高质量图像的数量大多太少而无法训练深层的CNN网络。为了解决这些问题，我们引入了一种新颖的照片美学分类器，该分类器具有深而宽的CNN，可用于精细粒度的美学质量预测。首先，我们从DPChallenge.com（一个著名的在线摄影门户网站）下载了大量的消费者摄影图像，以构建适合美学质量评估的数据集。然后，我们通过双线性插值法将图像缩小为256x256，并裁剪10个补丁（中心+四角+翻转）。将集合与图像的训练标签相关联后，我们将带有补丁包的图像馈入经过微调的网络中。我们提出的计算方法被配置为将照片分为高和低美学价值。指定输出为（0，1）的训练模式表示相应的图像属于“低美学质量”集。同样，输出为（1、0）的训练模式表示相应的图像属于“高美学质量”集。实验结果表明，我们的方法提供的分类准确率大于87.10％，明显优于最新方法。此外，我们的实验表明，我们的结果与人类的视觉感知和审美判断从根本上是一致的。

著录项

来源
《Neurocomputing》 |2017年第8期|165-175|共11页
作者
Tan Yunlan; Tang Pengjie; Zhou Yimin; Luo Wenlang; Kang Yongping; Li Guangyao;
展开▼
作者单位

Jinggangshan Univ, Sch Elect & Informat Engn, Jian, Jiangxi, Peoples R China|Natl Adm Surveying Mapping & Geoinformat, Key Lab Watershed Ecol & Geog Environm Monitoring, Jian, Jiangxi, Peoples R China|Tongji Univ, Coll Elect & Informat Engn, Shanghai, Peoples R China;

Natl Adm Surveying Mapping & Geoinformat, Key Lab Watershed Ecol & Geog Environm Monitoring, Jian, Jiangxi, Peoples R China|Tongji Univ, Coll Elect & Informat Engn, Shanghai, Peoples R China;

Tongji Univ, Coll Elect & Informat Engn, Shanghai, Peoples R China;

Jinggangshan Univ, Sch Elect & Informat Engn, Jian, Jiangxi, Peoples R China|Natl Adm Surveying Mapping & Geoinformat, Key Lab Watershed Ecol & Geog Environm Monitoring, Jian, Jiangxi, Peoples R China;

Jinggangshan Univ, Sch Elect & Informat Engn, Jian, Jiangxi, Peoples R China|Natl Adm Surveying Mapping & Geoinformat, Key Lab Watershed Ecol & Geog Environm Monitoring, Jian, Jiangxi, Peoples R China;

Tongji Univ, Coll Elect & Informat Engn, Shanghai, Peoples R China;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Image aesthetics; Quality assessment; High aesthetics; Low aesthetics; Image classification; Deep convolutional neural network; GoogLeNet; Feature representation;

机译：图像美学;质量评估;高美学;低美学;图像分类;深度卷积神经网络;GoogLeNet;特征表示;

相似文献

外文文献
中文文献
专利

1. Application of Deep Convolutional Neural Networks in Attention-Deficit/Hyperactivity Disorder Classification: Data Augmentation and Convolutional Neural Network Transfer Learning [J] . Zhu Li, Chang Weike Journal of Medical Imaging and Health Informatics . 2019,第8期

机译：深度卷积神经网络在注意力缺陷/多动障碍分类中的应用：数据增强与卷积神经网络转移学习
2. Fine-tuning deep convolutional neural networks for distinguishing illustrations from photographs [J] . Gando Gota, Yamada Taiga, Sato Haruhiko, Expert Systems with Application . 2016,第Deca期

机译：微调深度卷积神经网络以区分插图和照片
3. Hierarchical aesthetic quality assessment using deep convolutional neural networks [J] . Kao Yueying, Huang Kaiqi, Maybank Steve Signal Processing. Image Communication: A Publication of the the European Association for Signal Processing . 2016,第Null期

机译：使用深度卷积神经网络进行层次美学质量评估
4. Image Aesthetic Evaluation Using Parallel Deep Convolution Neural Network [C] . Lihua Guo, Fudi Li, Alan Wee-Chung Liew International Conference on Digital Image Computing: Techniques and Applications . 2016

机译：并行深度卷积神经网络的图像美学评价
5. Deep Neural Language Model for Text Classification Based on Convolutional and Recurrent Neural Networks [D] . Hassan, Abdalraouf. 2018

机译：基于卷积神经网络和递归神经网络的深度神经语言文本分类模型
6. Plant Disease Classification: A Comparative Evaluation of Convolutional Neural Networks and Deep Learning Optimizers [O] . Muhammad Hammad Saleem, Johan Potgieter, Khalid Mahmood Arif 2020

机译：植物病分类：卷积神经网络和深度学习优化器的比较评价
7. Evaluation of deep convolutional neural networks for automatic classification of common maternal fetal ultrasound planes [O] . Xavier P. Burgos-Artizzu, David Coronado-Gutiérrez, Brenda Valenzuela-Alcaraz, 2020

机译：普通母体胎儿超声波自动分类深度卷积神经网络的评价

Photograph aesthetical evaluation and classification with deep convolutional neural networks

摘要

著录项

相似文献

相关主题

期刊订阅