首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >High-resolution sinusoidal modeling of unvoiced speech
【24h】

High-resolution sinusoidal modeling of unvoiced speech

机译:高层言论的高分辨率正弦建模

获取原文

摘要

In this paper, a recently proposed high-resolution Sinusoidal Model, dubbed the extended adaptive Quasi-Harmonic Model (eaQHM), is applied on modeling unvoiced speech sounds. Unvoiced speech sounds are parts of speech that are highly non-stationary in the time-frequency plane. Standard sinusoidal models fail to model them accurately and efficiently, thus introducing artefacts, while the reconstructed signals do not attain the quality and naturalness of the originals. Motivated by recently proposed non-stationary transforms, such as the Fan-Chirp Transform (FChT), eaQHM is tested to confront these effects and it is shown that highly accurate, artefact-free representations of unvoiced sounds are possible using the non-stationary properties of the model. Experiments on databases of unvoiced sounds show that, on average, eaQHM improves the Signal to Reconstruction Error Ratio (SRER) obtained by the standard Sinusoidal Model (SM) by 93%. Moreover, modeling superiority is also supported via informal listening tests with two other models, namely the SM and the well-known STRAIGHT method.
机译:本文称,最近提出的高分辨率正弦模型被称为扩展的自适应准谐波模型(EAQHM),应用于建模清音语音声音。清晰的语音声音是时频平面中非常非静止的语音的一部分。标准正弦型号未能准确且有效地模拟它们,从而引入人工制品,而重建的信号不会达到原件的质量和自然。最近提出的非静止变换(如扇子啁啾变换(FCHT)),EAQHM被测试以面对这些效果,并且示出了使用非静止性能可以高度准确,无声的无声声音的无人物的无意义表示模型。关于清音声音数据库的实验表明,平均而言,EAQHM将通过标准正弦模型(SM)获得的重建误差比(SRER)提高了93%。此外,还通过与另外两种模型的非正式收听测试支持建模优势,即SM和众所周知的直线方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号