Conference: Asia-Pacific Signal and Information Processing Association Annual Summit and Conference

Enhancement of speech intelligibility under noisy reverberant conditions based on modulation spectrum concept

Abstract

This study focuses on identifying effective features for controlling speech so as to increase its intelligibility under adverse conditions. Previous methods either reduce noise and reverberation while the speech is being presented, or enhance the speech beforehand by controlling its intensity and/or spectral properties. Among them, a method based on modulation transfer function theory, in which the environmental effects are inverted to anticipate the attenuation of the speech modulation spectrum, shows excellent potential because it derives intelligibility enhancement against environmental smearing systematically and explicitly. However, obtaining that inversion directly requires estimating the modulation transfer function, and this estimation appears complicated and not robust under realistic, variable conditions. This study takes a different approach: it analyzes how modulation spectra smeared by the environment relate to intelligibility, in order to extract effective features for modification. First, we conduct listening tests of intelligibility in noise with different types of enhanced speech. Next, we extract the acoustic and modulation frequency components of the noise-smeared modulation spectra that correlate highly with the intelligibility scores. Finally, we examine the intelligibility benefit of modifying these components by performing further listening tests. The results show that modifying these components increases intelligibility by up to 20%, which demonstrates that our concept is valid.
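To illustrate what "inverting the environmental effects" on the modulation spectrum can mean, the following minimal sketch evaluates the classical modulation transfer function for exponentially decaying reverberation plus stationary noise, and a capped inverse gain per modulation frequency. It is not the algorithm from this paper: the function names `mtf` and `inverse_gain`, the cap `max_gain`, and the example values of reverberation time and SNR are all assumptions chosen for illustration.

```python
import numpy as np

def mtf(mod_freq_hz, rt60_s, snr_db):
    """Classical MTF: m(F) = [1 + (2*pi*F*T/13.8)^2]^(-1/2) * [1 + 10^(-SNR/10)]^(-1)."""
    f = np.asarray(mod_freq_hz, dtype=float)
    # Attenuation of modulation depth caused by reverberation (T = RT60 in seconds).
    reverb = 1.0 / np.sqrt(1.0 + (2.0 * np.pi * f * rt60_s / 13.8) ** 2)
    # Attenuation caused by stationary additive noise at the given SNR (dB).
    noise = 1.0 / (1.0 + 10.0 ** (-snr_db / 10.0))
    return reverb * noise

def inverse_gain(mod_freq_hz, rt60_s, snr_db, max_gain=4.0):
    """Pre-compensation gain 1/m(F), capped at max_gain (the cap is an assumption)."""
    return np.minimum(1.0 / mtf(mod_freq_hz, rt60_s, snr_db), max_gain)

# Hypothetical example: RT60 = 1.0 s, SNR = 0 dB, a few modulation frequencies in Hz.
mod_freqs = np.array([0.5, 1.0, 2.0, 4.0, 8.0, 16.0])
m = mtf(mod_freqs, rt60_s=1.0, snr_db=0.0)
g = inverse_gain(mod_freqs, rt60_s=1.0, snr_db=0.0)

for f, mi, gi in zip(mod_freqs, m, g):
    print(f"F = {f:5.1f} Hz   m(F) = {mi:.3f}   gain = {gi:.2f}")
```

Both the reverberation time and the SNR must be known to evaluate `mtf`, which is the estimation problem the abstract describes as hard to solve under variable conditions.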
