首页> 外文期刊>Hearing Research: An International Journal >Speech enhancement based on neural networks improves speech intelligibility in noise for cochlear implant users
【24h】

Speech enhancement based on neural networks improves speech intelligibility in noise for cochlear implant users

机译:基于神经网络的语音增强改善了耳蜗植入用户的噪声的语音可懂度

获取原文
获取原文并翻译 | 示例
           

摘要

Speech understanding in noisy environments is still one of the major challenges for cochlear implant (CI) users in everyday life. We evaluated a speech enhancement algorithm based on neural networks (NNSE) for improving speech intelligibility in noise for CI users. The algorithm decomposes the noisy speech signal into time-frequency units, extracts a set of auditory-inspired features and feeds them to the neural network to produce an estimation of which frequency channels contain more perceptually important information (higher signal-to-noise ratio, SNR). This estimate is used to attenuate noise dominated and retain speech-dominated CI channels for electrical stimulation, as in traditional n of-m CI coding strategies. The proposed algorithm was evaluated by measuring the speech-in-noise performance of 14 CI users using three types of background noise. Two NNSE algorithms were compared: a speaker-dependent algorithm, that was trained on the target speaker used for testing, and a speaker-independent algorithm, that was trained on different speakers. Significant improvements in the intelligibility of speech in stationary and fluctuating noises were found relative to the unprocessed condition for the speaker-dependent algorithm in all noise types and for the speaker independent algorithm in 2 out of 3 noise types. The NNSE algorithms used noise-specific neural networks that generalized to novel segments of the same noise type and worked over a range of SNRs. The proposed algorithm has the potential to improve the intelligibility of speech in noise for CI users while meeting the requirements of low computational complexity and processing delay for application in CI devices. (C) 2016 The Authors. Published by Elsevier B.V.
机译:在嘈杂环境中的言语理解仍然是日常生活中耳蜗植入物(CI)用户的主要挑战之一。我们评估了基于神经网络(NNSE)的语音增强算法,用于提高CI用户噪声的语音可懂度。该算法将嘈杂的语音信号分解成时频单元,提取一组听觉激发的特征,并将它们馈送到神经网络,以产生频率信道包含更多感知重要信息的估计(更高的信噪比, SNR)。这估计用于衰减噪声主导,并保留用于电气刺激的语音主导的CI通道,如传统的CI CI编码策略。通过使用三种类型的背景噪声测量14个CI用户的语音性能来评估所提出的算法。比较了两个NNSE算法:扬声器依赖算法,在用于测试的目标扬声器上培训,以及在不同扬声器上培训的扬声器无关的算法。相对于所有噪声类型中的扬声器相关算法和扬声器独立算法中的扬声器依赖算法的未加工条件,发现了静止和波动噪声的可理解性的显着改进。 NNSE算法使用了噪声特异性神经网络,其广泛地用于相同噪声类型的新颖段,并在一系列SNR上工作。所提出的算法有可能提高CI用户的噪声的可理解性,同时满足CI设备应用低计算复杂性和处理延迟的要求。 (c)2016年作者。 elsevier b.v出版。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号