首页> 外文会议>Audio Engineering Society international convention >Investigation of an Encoder-Decoder LSTM model on the enhancement of speech intelligibility in noise for hearing impaired listeners
【24h】

Investigation of an Encoder-Decoder LSTM model on the enhancement of speech intelligibility in noise for hearing impaired listeners

机译:增强听觉障碍者噪声中语音清晰度的编解码器LSTM模型的研究

获取原文

摘要

Hearing impaired (HI) listeners often struggle to follow conversations when exposed in a complex acoustic environment. This is partly due to the reduced ability in recovering the target speech Temporal Envelope (ENV) cues from Temporal Fine Structure (TFS). This study investigates the enhancement of speech intelligibility in HI listeners, by processing the ENV of speech signals corrupted by real-world environmental noise. An Encoder-Decoder Long Short Term Memory (LSTM) model is exploited after perceptually motivated processing stages to compensate for the important ENV characteristics of comprehensible speech for hearing impairment. The computational model is evaluated using the Short-Time Objective Intelligibility (STOI) measure for speech intelligibility. Finally, results indicate a 6% improvement in the mean STOI measure across different SNR values.
机译:听力受损(HI)的听众在复杂的声学环境中暴露时,通常难以跟随对话。部分原因是由于从时间精细结构(TFS)恢复目标语音“时间包络”(ENV)提示的能力降低。这项研究通过处理现实世界环境噪声破坏的语音信号的ENV,研究了HI听众语音清晰度的提高。在感知驱动的处理阶段之后,开发了编码器-解码器长期短期记忆(LSTM)模型,以补偿可理解语音对听力障碍的重要ENV特性。使用用于语音清晰度的短时目标清晰度(STOI)评估评估计算模型。最后,结果表明,在不同的SNR值上,平均STOI测量值提高了6%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号