首页> 外文期刊>Neurocomputing >Photograph aesthetical evaluation and classification with deep convolutional neural networks
【24h】

Photograph aesthetical evaluation and classification with deep convolutional neural networks

机译:深度卷积神经网络对照片进行美学评估和分类

获取原文
获取原文并翻译 | 示例
           

摘要

In response to the growth of digital photography and its many related applications, researchers have been actively investigating methods for providing automated aesthetical evaluation and classification of photographs. For computational networks to recognize aesthetic qualities, the learning algorithms must be trained using sample sets of characteristics that have known aesthetic values. Traditional methods for developing this training have required manual extraction of aesthetic features for use in the practice datasets. With abundant appearance of convolutional neural networks (CNN), the networks have learned features automatically and have acted as important tools for evaluation and classification. At the time of our research, several existing convolutional neural networks for photograph aesthetical classification only used shallow depth networks, which limit the improvement of performance. In addition, most methods have extracted only one patch as a training sample, such as a down-sized crop from each image. However, a single patch might not represent the entire image accurately, which could cause ambiguity during training. What's more, for existing datasets, the numbers of high quality images of each category are mostly too small to train deep CNN networks. To solve these problems, we introduce a novel photograph aesthetic classifier with a deep and wide CNN for fine granularity aesthetical quality prediction. First, we download a large number of consumer photographic images from DPChallenge.com (a well-known online photography portal) to construct a dataset suitable for aesthetic quality assessment. Then, we zoom out the images into 256x256 by bilinear interpolation and crop 10 patches (Center+four Comers+Flipping). Once we have associated the set with the image's training labels, We feed the images with the bag of patches into the fine-tuned networks. Our proposed computational method is configured to classify the photographs into high and low aesthetic values. A training pattern specifying an output of (0, 1) indicates that the corresponding image belongs to the "low aesthetic quality" set. Likewise, a training pattern with an output of (1, 0) indicates that the corresponding image belongs to the "high aesthetic quality" set. Experimental results show that the accuracy of classification provided by our method is greater than 87.10%, which is noticeably better than the state-of-the-art methods. In addition, our experiments show that our results are fundamentally consistent with human visual perception and aesthetic judgments.
机译:为了响应数字摄影及其许多相关应用的增长,研究人员一直在积极研究用于提供照片的自动美学评估和分类的方法。为了使计算网络能够识别美学品质,必须使用具有已知美学价值的特征样本集来训练学习算法。开发此培训的传统方法要求手动提取美学特征以用于实践数据集。随着卷积神经网络(CNN)的出现,这些网络可以自动学习特征,并成为评估和分类的重要工具。在我们进行研究时,现有的几种用于照片美学分类的卷积神经网络仅使用浅深度网络,这限制了性能的提高。另外,大多数方法仅从一个图像中提取一个补丁作为训练样本,例如缩小尺寸的作物。但是,单个补丁可能无法准确地表示整个图像,这可能在训练过程中造成歧义。此外,对于现有的数据集,每个类别的高质量图像的数量大多太少而无法训练深层的CNN网络。为了解决这些问题,我们引入了一种新颖的照片美学分类器,该分类器具有深而宽的CNN,可用于精细粒度的美学质量预测。首先,我们从DPChallenge.com(一个著名的在线摄影门户网站)下载了大量的消费者摄影图像,以构建适合美学质量评估的数据集。然后,我们通过双线性插值法将图像缩小为256x256,并裁剪10个补丁(中心+四角+翻转)。将集合与图像的训练标签相关联后,我们将带有补丁包的图像馈入经过微调的网络中。我们提出的计算方法被配置为将照片分为高和低美学价值。指定输出为(0,1)的训练模式表示相应的图像属于“低美学质量”集。同样,输出为(1、0)的训练模式表示相应的图像属于“高美学质量”集。实验结果表明,我们的方法提供的分类准确率大于87.10%,明显优于最新方法。此外,我们的实验表明,我们的结果与人类的视觉感知和审美判断从根本上是一致的。

著录项

  • 来源
    《Neurocomputing》 |2017年第8期|165-175|共11页
  • 作者单位

    Jinggangshan Univ, Sch Elect & Informat Engn, Jian, Jiangxi, Peoples R China|Natl Adm Surveying Mapping & Geoinformat, Key Lab Watershed Ecol & Geog Environm Monitoring, Jian, Jiangxi, Peoples R China|Tongji Univ, Coll Elect & Informat Engn, Shanghai, Peoples R China;

    Natl Adm Surveying Mapping & Geoinformat, Key Lab Watershed Ecol & Geog Environm Monitoring, Jian, Jiangxi, Peoples R China|Tongji Univ, Coll Elect & Informat Engn, Shanghai, Peoples R China;

    Tongji Univ, Coll Elect & Informat Engn, Shanghai, Peoples R China;

    Jinggangshan Univ, Sch Elect & Informat Engn, Jian, Jiangxi, Peoples R China|Natl Adm Surveying Mapping & Geoinformat, Key Lab Watershed Ecol & Geog Environm Monitoring, Jian, Jiangxi, Peoples R China;

    Jinggangshan Univ, Sch Elect & Informat Engn, Jian, Jiangxi, Peoples R China|Natl Adm Surveying Mapping & Geoinformat, Key Lab Watershed Ecol & Geog Environm Monitoring, Jian, Jiangxi, Peoples R China;

    Tongji Univ, Coll Elect & Informat Engn, Shanghai, Peoples R China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Image aesthetics; Quality assessment; High aesthetics; Low aesthetics; Image classification; Deep convolutional neural network; GoogLeNet; Feature representation;

    机译:图像美学;质量评估;高美学;低美学;图像分类;深度卷积神经网络;GoogLeNet;特征表示;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号