首页> 外文OA文献 >Importance of the dynamic range of an analysis window function for phase-only and magnitude-only reconstruction of speech
【2h】

Importance of the dynamic range of an analysis window function for phase-only and magnitude-only reconstruction of speech

机译:分析窗口功能的动态范围对于语音的仅相位和仅幅度重构的重要性

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The short-time Fourier transform (STFT) of a speech signal has two components: the short-time magnitude spectrum and the short-time phase spectrum. It is traditionally believed that the short-timemagnitude spectrum plays the dominant role for speech perception at small window durations (20-40ms). However, recent perceptual studies have shown that the short-time phase spectrum can contribute as much to speech intelligibility as the short-time magnitude spectrum. It was observed that the use of the rectangular (non-tapered) analysis window for the computation of the short-time phase spectrum is more advantageous than the use of the Hamming (tapered) analysis window. This paper investigates the effect that the dynamic range of an analysis window has on the intelligibility of speech for phaseonly and magnitude-only stimuli. For this purpose, the Chebyshev analysis window with adjustable equi-ripple side-lobes is employed. Two types of magnitude-only stimuli are investigated: random phase and zero phase. It is shown that the intelligibility of the magnitudeonly stimuli constructed with zero phase is independent of the dynamic range of the analysis window, while the random phase stimuli are intelligible only for analysis windows with high dynamic range. This study also shows that for low dynamic range analysis windows, the short-time phase spectrum at small window durations (20-40ms) contributes as much as to speech intelligibility as the short-time magnitude spectrum.
机译:语音信号的短时傅立叶变换(STFT)具有两个分量:短时幅度谱和短时相位谱。传统上认为,短时幅度频谱在小窗口持续时间(20-40ms)内对语音感知起着主导作用。但是,最近的感知研究表明,短时相位谱对语音清晰度的贡献与短时幅度谱一样大。已经观察到,使用矩形(非锥形)分析窗口来计算短时相位谱比使用汉明(锥形)分析窗口更有利。本文研究了分析窗口的动态范围对仅相位和仅幅度刺激的语音清晰度的影响。为此,使用具有可调节等波纹旁瓣的切比雪夫分析窗。研究了两种类型的仅幅度刺激:随机相位和零相位。结果表明,零相位构造的仅幅度刺激的可理解性与分析窗口的动态范围无关,而随机相位刺激仅对于具有高动态范围的分析窗口可理解。这项研究还表明,对于低动态范围分析窗口,在小窗口持续时间(20-40ms)内的短时相位谱对语音清晰度的贡献与短时幅度谱一样大。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号