
Adaptive music resizing with stretching, cropping and insertion: A generic content-aware music resizing framework


Abstract

Content-aware music adaptation under temporal constraints, i.e. music resizing, has started to draw attention from the multimedia community because it arises in many real-world scenarios, e.g. animation production and radio advertisement production. The goal of music resizing is to change the length of a music track to a user-preferred length using a series of basic operations, e.g. compression, prolonging, cropping and insertion. The only existing music resizing approach so far, called LyDAR, is based on lyrics analysis and uses only the compression operation to resize a music piece. As a result, LyDAR suffers from some limitations, e.g. it can neither prolong a music track nor compress music pieces with very small stretch rates. In this paper, we propose a content-aware music resizing framework named MUSIZ. In general, MUSIZ outperforms LyDAR in three aspects: (a) In addition to the compression operation, MUSIZ takes advantage of prolonging, cropping and insertion operations to handle resizing requests for both compression and prolonging. (b) Observing that quality degradation differs across segments, we propose the concept of stretch-resistance to measure the degree of quality degradation after a segment is stretched. The stretch-resistance is modeled based on both acoustical and lyrics features. (c) Cropping and insertion operations are applied before stretching. We develop contiguity-preservative cropping and insertion algorithms that remove and insert music segments while smoothing the abrupt change at the joint between the manipulated segments. Comprehensive user studies show that the music tracks resized by MUSIZ achieve better quality than those produced by existing approaches.
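The abstract does not give the stretch-resistance model or the way MUSIZ distributes a length change over segments. As a rough illustration of the idea only, the sketch below assumes each segment carries a positive, hypothetical resistance score and shares the total length change in proportion to duration divided by resistance, so that high-resistance segments are stretched less. The names `Segment` and `allocate_durations` and the weighting rule are assumptions for illustration, not the authors' method.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    duration: float    # original length in seconds
    resistance: float  # hypothetical stretch-resistance score, must be > 0


def allocate_durations(segments, target_length):
    """Spread the total length change over segments.

    Assumed rule (not from the paper): each segment absorbs a share of the
    change proportional to duration / resistance, so segments that degrade
    more when stretched are changed less.
    """
    total = sum(s.duration for s in segments)
    delta = target_length - total  # negative means compression
    weights = [s.duration / s.resistance for s in segments]
    weight_sum = sum(weights)
    new_durations = [
        s.duration + delta * w / weight_sum
        for s, w in zip(segments, weights)
    ]
    if any(d <= 0 for d in new_durations):
        # Stretching alone cannot reach the target; a real system would
        # need other operations such as cropping.
        raise ValueError("target length is too short for stretching alone")
    return new_durations


if __name__ == "__main__":
    track = [Segment(30.0, 1.0), Segment(60.0, 4.0), Segment(30.0, 2.0)]
    for seg, new in zip(track, allocate_durations(track, 100.0)):
        print(f"{seg.duration:6.1f}s -> {new:6.1f}s (rate {new / seg.duration:.3f})")
```

Under this assumed rule the new durations always sum to the target length. When a segment would shrink to zero or below, the sketch raises an error, which loosely mirrors the abstract's point that stretching alone cannot serve every resizing request.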

Bibliographic record

  • Source
    Multimedia Systems, 2013, Issue 4, pp. 359-380 (22 pages)
  • Author affiliations

    Department of Computer Science and Technology, Tsinghua University, Beijing 100084, People's Republic of China;

    School of Software, Tsinghua University, Beijing 100084, People's Republic of China;

    School of Software, Tsinghua University, Beijing 100084, People's Republic of China;

    Department of Computer Science and Technology, Tsinghua University, Beijing 100084, People's Republic of China;

    Department of Computer Science and Technology, Tsinghua University, Beijing 100084, People's Republic of China;

  • Indexed in: Science Citation Index (SCI, USA); Engineering Index (EI, USA)
  • Original format: PDF
  • Language of text: English
  • Keywords

    Music resizing; Music retargeting; Stretch-resistance;

  • Date indexed: 2022-08-18 02:06:19
