首页> 外文会议>International Conference on Signal Processing and Co1213-15mmunication Systems >Preference for 20-40 ms window duration in speech analysis
【24h】

Preference for 20-40 ms window duration in speech analysis

机译:致辞分析中的20-40毫秒窗口持续时间

获取原文

摘要

In speech processing the short-time magnitude spectrum is believed to contain most of the information about speech intelligibility and it is normally computed using the short-time Fourier transform over 20–40 ms window duration. In this paper, we investigate the effect of the analysis window duration on speech intelligibility in a systematic way. For this purpose, both subjective and objective experiments are conducted. The subjective experiment is in a form of a consonant recognition task by human listeners, whereas the objective experiment is in a form of an automatic speech recognition (ASR) task. In our experiments various analysis window durations are investigated. For the subjective experiment we construct speech stimuli based purely on the short-time magnitude information. The results of the subjective experiment show that the analysis window duration of 15–35 ms is the optimum choice when speech is reconstructed from the short-time magnitude spectrum. Similar conclusions were made based on the results of the objective (ASR) experiment. The ASR results were found to have statistically significant correlation with the subjective intelligibility results.
机译:在语音处理中,据信包含短时幅度谱,其中包含关于语音智能性的大多数信息,并且通常使用超过20-40毫秒窗口持续时间的短时傅里叶变换来计算。在本文中,我们以系统的方式调查分析窗口持续时间对语音可懂度的影响。为此目的,进行主观和客观实验。主观实验是人类听众的辅音识别任务的形式,而客观实验是一种自动语音识别(ASR)任务的形式。在我们的实验中,调查了各种分析窗口持续时间。对于主观实验,我们将纯粹基于短时幅度信息构建语音刺激。主观实验的结果表明,分析窗口持续时间为15-35 ms是从短时幅度谱重建语音时的最佳选择。基于目标(ASR)实验的结果进行了类似的结论。发现ASR结果与主观可清晰度结果具有统计学上的相关性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号