High-resolution sinusoidal modeling of unvoiced speech

机译：高层言论的高分辨率正弦建模

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, a recently proposed high-resolution Sinusoidal Model, dubbed the extended adaptive Quasi-Harmonic Model (eaQHM), is applied on modeling unvoiced speech sounds. Unvoiced speech sounds are parts of speech that are highly non-stationary in the time-frequency plane. Standard sinusoidal models fail to model them accurately and efficiently, thus introducing artefacts, while the reconstructed signals do not attain the quality and naturalness of the originals. Motivated by recently proposed non-stationary transforms, such as the Fan-Chirp Transform (FChT), eaQHM is tested to confront these effects and it is shown that highly accurate, artefact-free representations of unvoiced sounds are possible using the non-stationary properties of the model. Experiments on databases of unvoiced sounds show that, on average, eaQHM improves the Signal to Reconstruction Error Ratio (SRER) obtained by the standard Sinusoidal Model (SM) by 93%. Moreover, modeling superiority is also supported via informal listening tests with two other models, namely the SM and the well-known STRAIGHT method.

机译：本文称，最近提出的高分辨率正弦模型被称为扩展的自适应准谐波模型（EAQHM），应用于建模清音语音声音。清晰的语音声音是时频平面中非常非静止的语音的一部分。标准正弦型号未能准确且有效地模拟它们，从而引入人工制品，而重建的信号不会达到原件的质量和自然。最近提出的非静止变换（如扇子啁啾变换（FCHT）），EAQHM被测试以面对这些效果，并且示出了使用非静止性能可以高度准确，无声的无声声音的无人物的无意义表示模型。关于清音声音数据库的实验表明，平均而言，EAQHM将通过标准正弦模型（SM）获得的重建误差比（SRER）提高了93％。此外，还通过与另外两种模型的非正式收听测试支持建模优势，即SM和众所周知的直线方法。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing》|2016年||共5页
会议地点
作者
George P. Kafentzis; Yannis Stylianou;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类通信理论;
关键词
Sinusoidal Model; Speech Analysis; Unvoiced Speech; extended Adaptive Quasi-Harmonic Model;

机译：正弦模型;言语分析;清音演讲;扩展自适应准谐波模型;
入库时间 2022-08-21 07:30:18

相似文献

外文文献
中文文献
专利

1. Sinusoidal modeling and modification of unvoiced speech [J] . Macon M.W., Clements M.A. IEEE Transactions on Speech and Audio Proceeding . 1997,第6期

机译：语音的正弦建模和修改
2. Sinusoidal modeling and modification of unvoiced speech [J] . Macon M.W., Clements M.A. IEEE Transactions on Speech and Audio Proceessing . 1997,第6期

机译：语音的正弦建模和修改
3. Kalman-filtering speech enhancement method based on avoiced-unvoiced speech model [J] . Zenton Goh, Kah-Chye Tan, Tan B.T.G. IEEE Transactions on Speech and Audio Proceessing . 1999,第5期

机译：基于清音模型的卡尔曼滤波语音增强方法
4. High-resolution sinusoidal modeling of unvoiced speech [C] . George P. Kafentzis, Yannis Stylianou IEEE International Conference on Acoustics, Speech and Signal Processing . 2016

机译：清晰语音的高分辨率正弦建模
5. High-resolution sinusoidal analysis for resolving harmonic collisions in music audio signal processing. [D] . Ehmann, Andreas F. 2011

机译：高分辨率正弦分析，用于解决音乐音频信号处理中的谐波冲突。
6. Aeroacoustic production of low-frequency unvoiced speech sounds [O] . Michael H. Krane -1

机译：空气声产生低频清音
7. A fast method for high-resolution voiced/unvoiced detection and glottal closure/opening instant estimation of speech [O] . Koutrouvelis, A., Kafentzis, GP, Gaubitch, N.D., 2015

机译：高分辨率浊音/清音检测和声门闭合/打开即时估计的快速方法
8. Optimum Classification of Voiced Speech, Unvoiced Speech and Silence in the Presence of Noise and Interference. [R] . mcaulay,robert j. -1

机译：浊音的最佳分类，清音和沉默的噪音和干扰的情况下。

High-resolution sinusoidal modeling of unvoiced speech

摘要

著录项

相似文献

相关主题

期刊订阅