首页> 外文学位 >Acoustic models for the analysis and synthesis of the singing voice.
【24h】

Acoustic models for the analysis and synthesis of the singing voice.

机译:用于分析和合成歌声的声学模型。

获取原文
获取原文并翻译 | 示例

摘要

Throughout our history, the singing voice has been a fundamental tool for musical expression. While analysis and digital synthesis techniques have been developed for normal speech, few models and techniques have been focused on the singing voice. The central theme of this research is the development of models aimed at the characterization and synthesis of the singing voice. First, a spectral model is presented in which asymmetric generalized Gaussian functions are used to represent the formant structure of a singing voice in a flexible manner. Efficient methods for searching the parameter space are investigated and challenges associated with smooth parameter trajectories are discussed. Next a model for glottal characterization is introduced by first presenting an analysis of the relationship between measurable spectral qualities of the glottal waveform and perceptually relevant time-domain parameters. A mathematical derivation of this relationship is presented and is extended as a method for parameter estimation. These concepts are then used to outline a procedure for modifying glottal textures and qualities in the frequency domain.; By combining these models with the Analysis-by-Synthesis/ Overlap-Add sinusoidal model, the spectral and glottal models are shown to be capable of characterizing the singing voice according to traits such as level of training and registration. An application is presented in which these parameterizations are used to implement a system for singing voice enhancement. Subjective listening tests were conducted in which listeners showed an overall preference for outputs produced by the proposed enhancement system over both unmodified voices and voices enhanced with competitive methods.
机译:在我们的整个历史中,歌声一直是表达音乐的基本工具。尽管分析和数字合成技术已开发用于正常语音,但很少有模型和技术专注于歌声。这项研究的中心主题是旨在表征和合成歌声的模型的开发。首先,提出了一种频谱模型,其中使用非对称广义高斯函数以灵活的方式表示歌唱声音的共振峰结构。研究了搜索参数空间的有效方法,并讨论了与平滑参数轨迹相关的挑战。接下来,通过首先介绍声门波形的可测量频谱质量与感知上相关的时域参数之间的关系,来介绍声门表征模型。提出了这种关系的数学推导,并将其扩展为参数估计方法。这些概念随后被用于概述在频域中修改声门纹理和质量的过程。通过将这些模型与“综合分析” /“重叠叠加”正弦模型相结合,频谱和声门模型可以根据训练和注册等特征来表征歌声。提出了一种应用,其中使用这些参数化来实现用于歌唱语音增强的系统。进行了主观听觉测试,其中听众对拟议的增强系统产生的输出总体上偏爱未修改的声音和采用竞争方法增强的声音。

著录项

  • 作者

    Lee, Matthew E.;

  • 作者单位

    Georgia Institute of Technology.;

  • 授予单位 Georgia Institute of Technology.;
  • 学科 Engineering Electronics and Electrical.; Music.
  • 学位 Ph.D.
  • 年度 2005
  • 页码 127 p.
  • 总页数 127
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 无线电电子学、电信技术;音乐;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号