deepsing: Generating sentiment-aware visual stories using cross-modal music translation

Nikolaos Passalis; Stavros Doropoulos

首页> 外文期刊>Expert systems with applications >deepsing: Generating sentiment-aware visual stories using cross-modal music translation

【24h】

deepsing: Generating sentiment-aware visual stories using cross-modal music translation

机译：深度：使用跨模态音乐转换生成情绪感知视觉故事

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper we propose a deep learning method for performing attributed-based music-to-image translation. The proposed method is applied for synthesizing visual stories according to the sentiment expressed by songs. The generated images aim to induce the same feelings to the viewers, as the original song does, reinforcing the primary aim of music, i.e., communicating feelings. The process of music-to-image translation poses unique challenges, mainly due to the unstable mapping between the different modalities involved in this process. In this paper, we employ a trainable cross-modal translation method to overcome this limitation, leading to the first, to the best of our knowledge, deep learning method for generating sentiment-aware visual stories. The proposed method was evaluated both quantitatively and qualitatively using a collection of songs that belong to 10 different genres, demonstrating that it is indeed possible to generate visual content that can match the sentiment expressed in songs. A user study was also conducted further validating the ability of the proposed method to provide sentiment-enriched visualizations.

机译：在本文中，我们提出了一种用于执行基于属性的音乐到图像转换的深度学习方法。该提出的方法用于根据歌曲表达的情绪来合成视觉故事。由于原始歌曲所做的，所产生的图像旨在引起观众的同样的感受，加强音乐的主要目标，即传达感受。音乐到图像翻译的过程造成了独特的挑战，主要是由于该过程中涉及的不同模式之间的不稳定映射。在本文中，我们采用了培训的跨模型翻译方法来克服这一限制，从我们所知，深入学习方法的首先生成情绪感知视觉故事。使用属于10种不同类型的歌曲的集合来定量和定性评估所提出的方法，表明它确实可以生成可与歌曲中表达的情绪相匹配的视觉内容。还进行了用户学习，进一步验证了所提出的方法提供富集的可视化的能力。

著录项

来源
《Expert systems with applications》 |2021年第2期|114059.1-114059.9|共9页
作者
Nikolaos Passalis; Stavros Doropoulos;
展开▼
作者单位

Aristotle University of Thessaloniki Greece;

DataScouting Greece;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Music-to-image translation; Visual stories; Generative Adversarial Networks; Neural style transfer;

机译：音乐到图像翻译;视觉故事;生成的对抗网络;神经风格转移;

相似文献

外文文献
中文文献
专利

1. The texture of musical sounds: Cross-modal associations between visual textures and musical timbres and intervals [J] . Joshua Peterson, Thomas Langlois, Stephen Palmer Journal of vision . 2014,第10期

机译：音乐声音的纹理：视觉纹理与音乐音色和音程之间的跨模式关联
2. Cross-Modal Perception of Noise-in-Music: Audiences Generate Spiky Shapes in Response to Auditory Roughness in a Novel Electroacoustic Concert Setting [J] . Kongmeng Liew, PerMagnus Lindborg, Ruth Rodrigues, Frontiers in Psychology . 2018,第4期

机译：音乐噪声的跨模态感知：观众在新型电声音乐会环境中响应听觉粗糙度生成尖刻的形状
3. Cross-Modal Priming Effect of Rhythm on Visual Word Recognition and Its Relationships to Music Aptitude and Reading Achievement [J] . Tess S. Fotidzis, Heechun Moon, Jessica R. Steele, Brain Sciences . 2018,第12期

机译：节奏对视觉单词识别的跨模式启动效应及其与音乐能力和阅读成绩的关系
4. Audio-Visual Embedding for Cross-Modal Music Video Retrieval through Supervised Deep CCA [C] . Donghuo Zeng, Yi Yu, Keizo Oyama IEEE International Symposium on Multimedia . 2018

机译：通过受监督的深层CCA进行跨模态音乐视频检索的视听嵌入
5. JPEG to STL translation software for color/texture mapping in support of 3D printing of surfaces used in visual/tactile cross-modal cognitive neuroscience research. [D] . Dahasahasra, Rashmi Sanjay. 2014

机译：用于颜色/纹理映射的JPEG到STL转换软件，支持视觉/触觉跨模态认知神经科学研究中使用的表面3D打印。
6. Cross-Modal Perception of Noise-in-Music: Audiences Generate Spiky Shapes in Response to Auditory Roughness in a Novel Electroacoustic Concert Setting [O] . Kongmeng Liew, PerMagnus Lindborg, Ruth Rodrigues, -1

机译：音乐噪声的跨模态感知：观众在新型电声音乐会环境中响应听觉粗糙度生成尖刻的形状
7. deepsing: Generating sentiment-aware visual stories using cross-modal music translation [O] . Nikolaos Passalis, Stavros Doropoulos 2021

机译：深度：使用跨模态音乐转换生成情绪感知视觉故事

deepsing: Generating sentiment-aware visual stories using cross-modal music translation

摘要

著录项

相似文献

相关主题

期刊订阅