首页> 外文学位 >Visual characteristics for computational prediction of aesthetics and evoked emotions.
【24h】

Visual characteristics for computational prediction of aesthetics and evoked emotions.

机译:用于美学和诱发情绪的计算预测的视觉特征。

获取原文
获取原文并翻译 | 示例

摘要

Human emotions and aesthetic feelings that are aroused by natural photographs have been actively studied during the past decades due to their potential applications to the development of intelligent computer systems and to broad areas of science and technology related to human emotion and aesthetics. In this dissertation, investigations of visual characteristics that evoke human emotion and aesthetic feelings are presented. First, shape features were studied in natural images in terms of how they influence emotions aroused in human beings. Shapes and their characteristics-such as roundness, angularity, simplicity, and complexity have been found to evoke emotions in human perceivers, with evidence from psychological studies of facial expressions, dancing poses, and even simple synthetic visual patterns. Capturing these characteristics algorithmically to incorporate in computational studies, however, has proven difficult. Moreover, little prior research has modeled the dimensionality of emotions aroused by roundness and angularity. In this study, a collection of shape features was developed, which encoded the visual characteristics of roundness, angularity, and complexity using edge, corner, and contour distributions. Evaluation of those features were performed on the International Affective Picture System (IAPS) dataset, where evidence was provided regarding the significance of roundness-angularity and simplicity-complexity on predicting emotional content in images. Second, an investigation into three visual characteristics, i.e., roundness, angularity, and simplicity, of complex scenes that evoke human emotion was performed. Built upon the high-dimensional shape features, novel computational methods were developed to map visual content to the scales of roundness, angularity, and simplicity as three new constructs. The scope of the previous psychological hypothesis was, therefore, expanded by examining these three visual characteristics in computer analysis of complex scenes. The results produced by the three new constructs were compared to the hundreds of visual qualities examined by previous studies. The three constructs were completely interpretable and could be used in other applications involving roundness, angularity, and simplicity of visual scenes. Meanwhile, a large collection of ecologically valid stimuli (i.e., photographs humans regularly encounter on the Web), containing more than 40K images crawled from web albums, was generated using crowdsourcing and was subjected to human subject emotion ratings. Critically, these three new visual constructs achieved classification accuracy comparable to the hundreds of shape, texture, composition, and facial feature characteristics previously examined. This reduces the number of features required for classification by about two orders of magnitude. In addition, our experimental results showed that the three constructs showed consistent capacity in classifying both dimensions of emotions. Finally, a novel deep learning algorithm was developed to automatically learn effective visual characteristics for image aesthetics assessment. The proposed RAPID (RAting PIctorial aesthetics using Deep learning) system, incorporates heterogeneous inputs generated from the image, which include a global view and a local view, and unifies the feature learning and classifier training using a double-column deep convolutional neural network. The experimental results showed that the RAPID system significantly outperformed the state of the art on the AVA dataset. The results of the three studies demonstrate (1) the capability of roundness, angularity, and complexity of complex scenes to evoke human emotions, and (2) the capability of global view and fine-grained details of complex scenes to evoke aesthetic feelings.
机译:由于自然照片引起的人类情感和审美感觉在智能计算机系统的发展以及与人类情感和审美有关的广泛科学和技术领域中的潜在应用,因此在过去的几十年中已经得到了积极的研究。本文对引起人的情感和审美感觉的视觉特征进行了研究。首先,在自然图像中研究形状特征如何影响人类引起的情绪。通过面部表情,舞蹈姿势甚至简单的合成视觉模式的心理学研究的证据,已经发现形状及其特征(例如圆度,棱角,简单性和复杂性)会引起人类感知者的情绪。然而,已证明很难通过算法捕获这些特征以将其纳入计算研究中。而且,很少有先前的研究对圆度和棱角引起的情感的维数建模。在这项研究中,开发了形状特征的集合,这些特征使用边缘,拐角和轮廓分布编码了圆度,棱角和复杂度的视觉特征。在国际情感图片系统(IAPS)数据集上对这些功能进行了评估,其中提供了有关圆​​角和简单复杂度对预测图像中情感内容的重要性的证据。其次,对引起人的情感的复杂场景的三个视觉特征,即圆度,棱角和简单性进行了研究。基于高维形状特征,开发了新颖的计算方法,以将视觉内容映射为三个新结构的圆度,棱角和简单程度。因此,通过在复杂场景的计算机分析中检查这三个视觉特征,扩大了先前的心理假设的范围。将这三种新构建体产生的结果与先前研究检查的数百种视觉质量进行了比较。这三种构造完全可以解释,可以用于涉及圆度,棱角和视觉场景简单性的其他应用程序。同时,使用众包生成了一大批具有生态学意义的刺激(即人类经常在网上遇到的照片),其中包含从网络相册中抓取的4万多幅图像,并且受到了人类受试者的情感评价。至关重要的是,这三个新的视觉构造实现了与先前检查的数百种形状,纹理,组成和面部特征相似的分类精度。这将分类所需的特征数量减少了大约两个数量级。此外,我们的实验结果表明,这三种结构在对情感的两个维度进行分类时均显示出一致的能力。最后,开发了一种新颖的深度学习算法,可自动学习有效的视觉特征以进行图像美学评估。拟议的RAPID(使用深度学习进行评分美学)系统整合了从图像生成的异构输入,包括全局视图和局部视图,并使用双列深度卷积神经网络统一了特征学习和分类器训练。实验结果表明,RAPID系统在AVA数据集上的性能明显优于现有技术。这三项研究的结果表明(1)复杂场景的圆度,棱角和复杂性唤起人类情感的能力,以及(2)复杂场景的全局视图和细粒度细节唤起审美感觉的能力。

著录项

  • 作者

    Lu, Xin.;

  • 作者单位

    The Pennsylvania State University.;

  • 授予单位 The Pennsylvania State University.;
  • 学科 Computer science.;Information technology.
  • 学位 Ph.D.
  • 年度 2016
  • 页码 142 p.
  • 总页数 142
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-17 11:50:24

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号