Enhancement of speech intelligibility under noisy reverberant conditions based on modulation spectrum concept


Abstract

This study focuses on identifying effective features for controlling speech to increase its intelligibility under adverse conditions. Previous methods either reduce noise and reverberation during speech presentation or enhance speech before presentation by controlling its intensity and/or spectral properties. Among them, a method based on modulation transfer function theory, in which the environmental effects are inverted in anticipation of the attenuation of the speech modulation spectrum, shows excellent potential because it derives intelligibility enhancement against environmental smearing systematically and explicitly. However, obtaining that inversion directly requires estimating the modulation transfer function, and this estimation appears complicated and not robust under realistic, variable conditions. This study takes a different approach: analyzing how modulation spectra smeared by the environment relate to intelligibility, in order to extract effective features to modify. First, we conduct listening tests on the intelligibility in noise of different types of enhanced speech. Next, we extract the acoustic and modulation frequency components of the noise-smeared modulation spectra that correlate highly with intelligibility scores. Finally, we examine the intelligibility benefits of modifying these components through further listening tests. The results show that modifying these components increases intelligibility by up to 20%, which supports the validity of our concept.
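
As a rough illustration of the modulation transfer function idea mentioned in the abstract, the sketch below uses the classical Houtgast-Steeneken form of the MTF for exponentially decaying reverberation plus stationary noise, and the gain that would invert it. It is not the authors' implementation: the function names `mtf` and `inverse_gain`, the `max_gain` cap, and the example values are assumptions made here for illustration only.

```python
import math


def mtf(f_mod_hz: float, rt60_s: float, snr_db: float) -> float:
    """Modulation reduction factor at modulation frequency f_mod_hz.

    Classical form for exponential reverberant decay (reverberation time
    rt60_s) combined with stationary additive noise at snr_db.
    """
    reverb = 1.0 / math.sqrt(1.0 + (2.0 * math.pi * f_mod_hz * rt60_s / 13.8) ** 2)
    noise = 1.0 / (1.0 + 10.0 ** (-snr_db / 10.0))
    return reverb * noise


def inverse_gain(f_mod_hz: float, rt60_s: float, snr_db: float,
                 max_gain: float = 4.0) -> float:
    """Gain on the modulation component that would undo the MTF.

    max_gain is a hypothetical cap: an exact inverse can demand more
    modulation depth than a signal can physically carry.
    """
    return min(1.0 / mtf(f_mod_hz, rt60_s, snr_db), max_gain)


if __name__ == "__main__":
    # Example conditions (assumed values): RT60 = 1.0 s, SNR = 0 dB.
    for f in (1.0, 2.0, 4.0, 8.0, 16.0):
        print(f"{f:5.1f} Hz  m = {mtf(f, 1.0, 0.0):.3f}  "
              f"gain = {inverse_gain(f, 1.0, 0.0):.3f}")
```

The sketch also hints at why a direct inversion is delicate: both `rt60_s` and `snr_db` must be known or estimated for the listening environment before any gain can be computed.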


Bibliographic details

Source
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference | 2020 | pp. 753-758 | 6 pages
Authors
Thuan Van Ngo; Tuan Vu Ho; Masashi Unoki; Rieko Kubo; Masato Akagi
Original format: PDF
Keywords
Frequency modulation; Speech enhancement; Feature extraction; Indexes; Correlation; Reverberation; Signal to noise ratio;


Similar literature

1. Japanese speech intelligibility estimation and prediction using objective intelligibility indices under noisy and reverberant conditions [J]. Kobayashi Yosuke, Kondo Kazuhiro. Applied Acoustics. 2019, issue DECa
2. A deep learning based segregation algorithm to increase speech intelligibility for hearing-impaired listeners in reverberant-noisy conditions [J]. Yan Zhao, DeLiang Wang, Eric M. Johnson. The Journal of the Acoustical Society of America. 2018, issue 3aPta1
3. Modulation enhancement of speech by a pre-processing algorithm for improving intelligibility in reverberant environments [J]. Akiko Kusumoto, Takayuki Arai, Keisuke Kinoshita. Speech Communication. 2005, issue 2
4. Speech intelligibility enhancement in noisy reverberant conditions [C]. Junfeng Li, Risheng Xia, Qiang Fang. International Symposium on Chinese Spoken Language Processing. 2016
5. Analyzing the Contribution of Envelope Modulations to the Intelligibility of Reverberant Speech [D]. Muralimanohar, Ramesh Kumar. 2018
6. A deep learning based segregation algorithm to increase speech intelligibility for hearing-impaired listeners in reverberant-noisy conditions [O]. Yan Zhao, DeLiang Wang, Eric M. Johnson
7. Predicting speech intelligibility in adverse conditions: evaluation of the speech-based envelope power spectrum model [O]. Jørgensen Søren, Dau Torsten. 2011
