首页> 外文会议>Audio Engineering Society International Convention >Investigation of an Encoder-Decoder LSTM model on the enhancement of speech intelligibility in noise for hearing impaired listeners
【24h】

Investigation of an Encoder-Decoder LSTM model on the enhancement of speech intelligibility in noise for hearing impaired listeners

机译:对听力障碍听众噪声增强的编码器解码器LSTM模型的研究

获取原文

摘要

Hearing impaired (HI) listeners often struggle to follow conversations when exposed in a complex acoustic environment. This is partly due to the reduced ability in recovering the target speech Temporal Envelope (ENV) cues from Temporal Fine Structure (TFS). This study investigates the enhancement of speech intelligibility in HI listeners, by processing the ENV of speech signals corrupted by real-world environmental noise. An Encoder-Decoder Long Short Term Memory (LSTM) model is exploited after perceptually motivated processing stages to compensate for the important ENV characteristics of comprehensible speech for hearing impairment. The computational model is evaluated using the Short-Time Objective Intelligibility (STOI) measure for speech intelligibility. Finally, results indicate a 6% improvement in the mean STOI measure across different SNR values.
机译:听证障碍(嗨)听众常常在暴露在复杂的声学环境中努力遵循对话。这部分是由于从时间细结构(TFS)中恢复目标语音颞包络(ENV)提示的能力降低。本研究通过处理由现实世界环境噪声损坏的语音信号的env,调查了HI听众中的语音清晰度的增强。在感知的加工阶段之后利用编码器解码器长短短期存储器(LSTM)模型,以补偿听力损伤的综合言论的重要形态特征。使用短时间客观可懂度(STOI)测量来评估计算模型的语音可懂度。最后,结果表明不同SNR值的平均STOI测量的6%改善。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号