首页> 外文期刊>Expert systems with applications >deepsing: Generating sentiment-aware visual stories using cross-modal music translation
【24h】

deepsing: Generating sentiment-aware visual stories using cross-modal music translation

机译:深度:使用跨模态音乐转换生成情绪感知视觉故事

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper we propose a deep learning method for performing attributed-based music-to-image translation. The proposed method is applied for synthesizing visual stories according to the sentiment expressed by songs. The generated images aim to induce the same feelings to the viewers, as the original song does, reinforcing the primary aim of music, i.e., communicating feelings. The process of music-to-image translation poses unique challenges, mainly due to the unstable mapping between the different modalities involved in this process. In this paper, we employ a trainable cross-modal translation method to overcome this limitation, leading to the first, to the best of our knowledge, deep learning method for generating sentiment-aware visual stories. The proposed method was evaluated both quantitatively and qualitatively using a collection of songs that belong to 10 different genres, demonstrating that it is indeed possible to generate visual content that can match the sentiment expressed in songs. A user study was also conducted further validating the ability of the proposed method to provide sentiment-enriched visualizations.
机译:在本文中,我们提出了一种用于执行基于属性的音乐到图像转换的深度学习方法。该提出的方法用于根据歌曲表达的情绪来合成视觉故事。由于原始歌曲所做的,所产生的图像旨在引起观众的同样的感受,加强音乐的主要目标,即传达感受。音乐到图像翻译的过程造成了独特的挑战,主要是由于该过程中涉及的不同模式之间的不稳定映射。在本文中,我们采用了培训的跨模型翻译方法来克服这一限制,从我们所知,深入学习方法的首先生成情绪感知视觉故事。使用属于10种不同类型的歌曲的集合来定量和定性评估所提出的方法,表明它确实可以生成可与歌曲中表达的情绪相匹配的视觉内容。还进行了用户学习,进一步验证了所提出的方法提供富集的可视化的能力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号